The Form PREM14A Files Dataset is a complete archive of preliminary merger and acquisition proxy statements filed on EDGAR under Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A. Each record is a single PREM14A submission, identified by exactly one EDGAR accession number, and contains the preliminary proxy statement, the structured Exhibit 107 filing-fees table, any additional exhibits (merger agreements, fairness opinions, voting and support agreements, charter amendments), and a metadata.json manifest describing the filing and its constituent documents. Filers are domestic registrants subject to the proxy rules — typically the public target in a cash merger, a public issuer in a going-private or affiliated-party transaction, or any registrant whose shareholders must vote on a merger, sale of substantially all assets, recapitalization, or similar fundamental change. Coverage runs from 1994 to the present, packaged as monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip, with content delivered as TXT, JSON, HTML, and PDF.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset packages every PREM14A submission filed on EDGAR — the preliminary form of proxy statement that registrants file when soliciting shareholder votes on a merger, acquisition, sale of substantially all assets, or similar extraordinary corporate transaction. The substantive disclosure inside each filing is governed by Schedule 14A, with the M&A-specific items (Items 11–14, 18, and 21) and the Regulation M-A integration items (notably Item 1001, Item 1002, and Item 1015) carrying most of the weight, supplemented by the merger agreement, fairness opinion(s), and voting agreements filed as annexes or exhibits.
Records are grouped into monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip. Inside a monthly ZIP, every record occupies its own folder named with the 18-digit zero-padded accession number (hyphens removed); that folder is the atomic unit of the dataset and contains a metadata.json manifest plus all documents from the original EDGAR submission. The dataset's coverage window stretches from EDGAR's earliest years to the present, and content is delivered as TXT, JSON, HTML, and PDF. Image binaries (.jpg, .gif, .png) are deliberately excluded from the package even though they remain referenced as GRAPHIC entries in the manifest. Modern records consist almost exclusively of metadata.json plus one or more SGML-wrapped .htm documents.
A record is built from four nested content layers, from outermost packaging to innermost disclosure:
metadata.json manifest at the root of the folder.<DOCUMENT> envelopes wrapping each .htm (and any .txt/.pdf) attachment, mirroring the EDGAR submission structure.A typical record folder contains:
metadata.json (always present).<TYPE>PREM14A</TYPE> and <SEQUENCE>1</SEQUENCE> — the full preliminary proxy statement. Filer-specific naming is the norm: <token>_prem14a.htm, formprem14a.htm, d<id>dprem14a.htm, and similar variants.<TYPE>[EX-FILING FEES](https://www.ecfr.gov/current/title-17/chapter-II/part-229/subpart-229.600/section-229.601)</TYPE> — typically named <token>_ex107.htm, ex-107.htm, <token>_exfee.htm, or d<id>dexfilingfees.htm.EX-2.x), fairness opinions (often EX-99.x), voting/support agreements, charter amendments, target-company financial statements, consents of accountants and financial advisors (EX-23.x), and miscellaneous correspondence. Larger filings can also carry PDF and TXT attachments.Image graphics referenced in documentFormatFiles as GRAPHIC entries are intentionally excluded from the ZIP; the manifest entries remain, but no corresponding binary file is present on disk.
metadata.json schemametadata.json describes both the filing as a whole and each constituent document. Top-level fields:
formType — always "PREM14A" for this dataset.accessionNo — canonical hyphenated accession number (e.g., "0001683168-25-004792").linkToFilingDetails — URL to the primary preliminary proxy HTML on sec.gov.linkToTxt — URL to the full concatenated SGML submission .txt on sec.gov.linkToHtml — URL to the EDGAR -index.htm filing index page.linkToXbrl — empty string; PREM14A is not subject to XBRL tagging.dataFiles — empty array, for the same reason.description — "Form PREM14A - Preliminary proxy statements relating to merger or acquisition".filedAt — ISO 8601 acceptance timestamp with timezone offset.periodOfReport — YYYY-MM-DD period-of-report date.documentFormatFiles — array of per-document descriptors (see below).entities — array of filer entity descriptors (typically one).seriesAndClassesContractsInformation — investment-company series/classes data; empty for typical operating-company PREM14A filings.id — 32-hex internal record identifier.Each documentFormatFiles[] element carries a sequence (string; "1", "2", ...; a single space for the complete-submission text file), a size in bytes (string), a documentUrl pointing to the file on sec.gov, an optional description ("PREM14A", "FILING FEES TABLE", "GRAPHIC", "Complete submission text file", etc.), and a type mirroring the SGML <TYPE> ("PREM14A", "EX-FILING FEES", "GRAPHIC", blank for the bundled .txt). The first entry is always the preliminary proxy statement itself; the EX-FILING FEES entry is Exhibit 107; GRAPHIC entries reference excluded image files; the trailing blank-type entry points to the full SGML submission .txt on EDGAR.
Each entities[] element typically describes a single filer/registrant: companyName with a role suffix such as "(Filer)", cik, fileNo, irsNo, stateOfIncorporation, fiscalYearEnd in MMDD form, act ("34" for the Exchange Act), type (the filer-role form type, "PREM14A"), sic (SIC code with description), filmNo, and tickers. Where the registrant is a SPAC or has affiliated reporting tickers, tickers may carry multiple symbols.
Each .htm (and .txt/.pdf) attachment in the folder retains the EDGAR SGML envelope that wraps the actual HTML body, of the form:
1
<DOCUMENT>
2
<TYPE>PREM14A
3
<SEQUENCE>1
4
<FILENAME>collective_prem14a.htm
5
<DESCRIPTION>PRELIMINARY NOTICE OF SPECIAL MEETING
6
<TEXT>
7
<HTML>... full HTML content ...</HTML>
8
</TEXT>
9
</DOCUMENT>
The <TYPE> value mirrors documentFormatFiles[].type and <SEQUENCE> mirrors the declared sequence number, so the manifest and on-disk wrapper can be cross-walked deterministically. Parsers must strip the leading <DOCUMENT>...<TEXT> and trailing </TEXT></DOCUMENT> lines before passing the inner body to a DOM parser.
The primary <TYPE>PREM14A</TYPE> HTML document is consistently the largest file in the record — typically 0.8–2.4 MB of HTML for an operating-company merger — and carries the full Schedule 14A disclosure in a conventional order:
The EX-FILING FEES document is a small structured exhibit (typically 8–35 KB of HTML) containing the "Calculation of Filing Fee Tables" required since the SEC's 2021 amendments implementing Rule 0-11 fee-disclosure modernization. It tabulates:
In current practice the cover page of the proxy statement no longer carries the fee calculation in line; that calculation lives exclusively in Exhibit 107.
When a registrant chooses not to inline annexes into the primary document, additional .htm (or PDF, or TXT) documents in the record folder may carry: the merger agreement and any amendments, the financial advisor's written fairness opinion, voting/support agreements, equity and debt commitment letters, charter amendment text, target-company audited or unaudited financial statements, pro forma financial information, consents of accountants and financial advisors, and miscellaneous correspondence. Each is wrapped in its own <DOCUMENT> envelope with an exhibit-appropriate <TYPE> (e.g., EX-2.1, EX-23.1, EX-99.1).
The package preserves the substantive submission as filed on EDGAR: the preliminary proxy statement HTML, the filing-fees exhibit, any other exhibit documents, and any supplementary PDF or TXT attachments, all retaining their original SGML <DOCUMENT> wrapping. The metadata.json provides a complete manifest of every document the registrant submitted (graphics included, even though their binaries are not), plus filer-level metadata sufficient to identify the registrant, locate the live EDGAR copy, and place the filing in time.
.jpg, .gif, and .png graphics referenced in documentFormatFiles are stripped from the ZIP. Their entries remain in the manifest, but the binary files are not packaged. Consequently, embedded <img> tags in the primary HTML resolve to missing files locally..txt. The concatenated SGML submission text file is referenced in documentFormatFiles (and via linkToTxt) but is not re-shipped inside the folder; the per-document .htm files supersede it.Schedule 14A is the substantive backbone, but several rule changes have left visible imprints on PREM14A content over the dataset's coverage window:
EX-FILING FEES document and the cover page references it.The on-disk format of records reflects EDGAR's own format eras:
.txt documents containing the entire proxy statement and any annexes as plain text, with tables hand-formatted in monospace. Records from this era often consist of one large .txt plus a metadata.json..htm with embedded tables and basic typography; image graphics begin to appear and, in this dataset, are excluded from the ZIP.<DOCUMENT> blocks; PDF attachments appear for items such as scanned fairness opinions..htm document typed EX-FILING FEES alongside the primary proxy. The fee calculation no longer lives on the proxy cover page.<DOCUMENT> files or as inline annexes inside the primary proxy is a registrant/filer-counsel choice; both layouts coexist throughout the dataset and content extraction must accommodate both..htm retains its EDGAR SGML envelope. Cross-walking <TYPE> and <SEQUENCE> against documentFormatFiles[] lets a consumer match each on-disk file to its declared role deterministically.GRAPHIC entries in documentFormatFiles are informative for completeness checking but should not be mistaken for files present on disk.seriesAndClassesContractsInformation field is populated only for investment-company filers; for typical operating-company merger PREM14As it is empty, and that emptiness is itself a useful signal for filer-type segmentation.entities is typically a single-element array, transactions involving co-registrants (jointly filed proxies, certain SPAC structures) produce multiple entries, each with its own role suffix on companyName.The filer of a PREM14A is a registrant subject to the proxy rules under Section 14(a) of the Exchange Act and Regulation 14A — almost always a company with a class of equity registered under Section 12 — that is soliciting proxies, consents, or authorizations from its own security holders in connection with a proposed business combination. The filer is the issuer whose shareholders must vote, not necessarily the counterparty to the deal.
Typical filer profiles:
Outside parties heavily involved but not the named filer: outside counsel drafting Schedule 14A, proxy solicitors (Innisfree, MacKenzie, D.F. King, Georgeson, Okapi), financial advisors delivering fairness opinions, and transfer/distribution agents. They appear as "persons making the solicitation" but do not sign as the registrant.
A PREM14A is event-driven, triggered by the combination of:
The structural timing rule is Rule 14a-6(a): a preliminary copy of the proxy and form of proxy must be filed with the SEC at least 10 calendar days before definitive copies are first sent to security holders, unless the only matters to be voted on are routine items the rule enumerates (uncontested director elections, auditor ratification, Rule 14a-8 shareholder proposals, certain compensation plans). M&A, asset sales, dispositions of significant subsidiaries, share exchanges, and going-private transactions fall outside that exemption — which is the entire reason the PREM14A tag exists, separate from PRE 14A used for non-M&A preliminary proxies.
In practice the staff frequently selects M&A proxies for review. The registrant cannot proceed to definitive until comments are cleared, typically through one or more PRER14A amendments. The DEFM14A is mailed only after staff review concludes (or the 10-day window expires without review). There is no statutory deadline to file PREM14A after signing; timing is driven by the merger agreement's "as promptly as reasonably practicable" covenant and the deal's outside date.
Adjacent communications rule: Rule 14a-12 allows soliciting communications before a proxy statement is furnished, provided each written communication is filed under cover of Schedule 14A on the date of first use with required legends. These appear on EDGAR as DEFA14A and routinely precede the first PREM14A.
PREM14A sits in a tight cluster of M&A disclosure documents. The most useful comparisons are with (1) other stages of the same proxy, (2) parallel non-vote or going-private regimes, (3) alternative deal-structure forms, and (4) structured extractor datasets covering the same filings.
Same transaction, same disclosure scope, later stage. PREM14A is the pre-review draft, often the first public document containing the full deal narrative; DEFM14A is the post-comment, cleared version mailed to shareholders. PREM14A may include placeholders, blanks, and language that later changes. Use PREM14A to study first-disclosed terms and comment-letter-driven revisions; use DEFM14A for the version of record shareholders actually voted on.
Same regime and same deal as the originating PREM14A, but reflects changes made during SEC staff review before going definitive. PREM14A captures the starting point; PRER14A captures the redlined intermediate states (added risk factors, revised fairness language, updated projections). Required if the analytic question is how disclosure evolves through staff review.
Same procedural posture (preliminary, pre-definitive) but for non-M&A matters: director elections, charter amendments, equity plans, say-on-pay. Lacks fairness opinions, transaction backgrounds, and merger-consideration mechanics. PREM14A is the M&A-specific cut; PRE 14A is the residual category. Mixing the two skews any merger-activity count.
Supplementary solicitation material filed after the definitive proxy: investor decks, talking points, press releases, shareholder letters, proxy-advisor responses. Not standalone, no full transaction description. Important in contested or scrutinized deals as the campaign layer around the vote, but does not substitute for PREM14A's foundational disclosure.
Annual-meeting proxy covering director elections, auditor ratification, and executive compensation. Shares Regulation 14A heritage with PREM14A but the disclosure regime (Item 402 tables, audit-committee reports, ownership tables) is orthogonal to merger content. Combining DEF 14A and PREM14A without filtering produces a corpus dominated by governance boilerplate rather than transactions.
14C mirrors 14A when shareholder action is taken by written consent of a controlling holder rather than by solicited vote. Disclosure overlaps heavily (terms, background, fairness opinions) but there is no proxy card and no vote being solicited. Controlled-company deals and sponsor-majority transactions typically appear in 14C. A PREM14A-only sample systematically excludes these.
Required under Rule 13e-3 when an issuer or its affiliates take the company private. In going-private mergers, 13E-3 is filed alongside (often combined with) PREM14A and adds affiliate-specific disclosures: purposes, alternatives considered, fairness determinations by each filer, and full advisor reports. PREM14A frames the proxy solicitation; 13E-3 frames affiliate fairness. Management buyouts, sponsor take-privates, and squeeze-outs require both.
Registers securities issued in business combinations. In stock or mixed-consideration deals, the acquirer files S-4 and the target may file PREM14A; often the S-4 doubles as a joint proxy/prospectus. S-4 carries Securities Act content (issuer description, pro forma financials, tax treatment) absent from PREM14A. All-cash deals usually have PREM14A but no S-4. A PREM14A-only universe under-represents stock deals where the primary disclosure lives in the acquirer's S-4.
Filed by the target in response to a third-party tender offer: board recommendation, negotiation background, fairness opinion, conflicts. Thematic overlap with PREM14A (background and advisor sections) but applies to tender-offer structures rather than vote-driven mergers. One-step merger votes produce PREM14A without Schedule 14D-9; two-step deals produce 14D-9 first and PREM14A/DEFM14A for the back-end merger.
Bidder-side counterpart to 14D-9: offer terms, financing, conditions. Largely mutually exclusive with PREM14A on a per-deal basis — a transaction is either vote-driven (PREM14A) or tender-driven (Schedule TO + 14D-9). Two-step deals are the only routine overlap.
First-day, event-driven announcement of entry into the merger agreement, typically attaching the agreement as Exhibit 2.1 and a press release as Exhibit 99.1. Terse and contractual, not narrative. PREM14A arrives weeks later with the deliberative record (background, alternatives, advisor analyses, projections) that 8-K Item 1.01 omits. Use Form 8-K for deal-announcement timing and merger-agreement text; use PREM14A for the surrounding rationale and process.
This dataset contains the original EDGAR submissions — preliminary proxy, exhibits, attachments, metadata — in native HTML, TXT, PDF, and JSON metadata, excluding images. Layout, exhibit ordering, and full as-filed text are preserved.
A structured PREM14A extractor (where available) parses specific fields — parties, consideration, exchange ratios, vote thresholds, advisor names, fairness-opinion summaries, termination fees — into JSON or tables. Efficient for cross-deal statistics, but discards surrounding narrative, exhibit text, and anything outside the extractor's schema.
Choose the files dataset for full-text search, exhibit-level access (merger agreements, opinion letters, voting agreements), training-data construction, or fidelity to as-filed wording. Choose a structured extractor for deal-level quantitative analysis when its schema covers the needed fields.
PREM14A files capture, in original form, the earliest comprehensive public disclosure of a vote-driven merger: narrative-heavy, exhibit-rich, filed before SEC review and before definitive mailing. It is narrower than DEF 14A and PRE 14A (M&A-only), earlier and rougher than DEFM14A, distinct from PRER14A (initial vs. revised), vote-centric where PREM14C is consent-based, proxy-framed where Schedule 13E-3 is affiliate-fairness-framed, target-side where S-4 is acquirer-side registration, merger-structured where Schedule TO and 14D-9 are tender-structured, and deliberative where 8-K Item 1.01 is announcement-only. No adjacent form reproduces the full pre-clearance proxy submission as it appeared on EDGAR.
PREM14A is the earliest complete public record of a negotiated change-of-control deal: merger mechanics, background of negotiations, fairness opinions, financing, voting agreements, and regulatory conditions. The following users each read different sections for different outputs, and most prefer the as-filed preliminary version to the post-comment DEFM14A.
Underwrite long or short positions on announced deals. They pull consideration mix, exchange ratios and collars, breakup fees, no-shop and fiduciary-out language, support agreements, financing commitments, and the fairness opinion summary in "Reasons for the Merger." Output: a deal memo and position sizing. Waiting for the DEFM14A forfeits weeks of trading edge.
Price the spread. Focus narrowly on MAC carve-outs, regulatory conditions (HSR, foreign investment, sector clearances), termination dates and walk-away rights, reverse termination fees, specific performance, and the meeting timeline. Output: spread targets and break-risk leg sizing. The as-filed PREM14A is operative because staff comments rarely change contractual terms.
Build precedent libraries to inform their own bids and board decks. Harvest deal structure (one-step, two-step, reverse triangular), premium-to-unaffected, option and RSU treatment, retention pools, escrow terms, and the negotiation chronology in "Background of the Merger." Output: precedent tables and process playbooks for go-shops, topping bids, and standstill waivers.
Benchmark fairness opinions and feed comparable-transaction databases. Extract advisor identity, fee structure (announcement, success, opinion), prior-relationship disclosures, full methodology descriptions (selected companies, precedent transactions, DCF, LBO, premiums paid), management projections, and implied valuation ranges. Output: pitch comps and league-table tracking. PREM14A captures the original methodology before any cosmetic edits prompted by staff or plaintiff pressure.
Transactional counsel mine PREM14A for drafting precedent: background wording, fairness opinion descriptions, advisor and management conflicts, Item 402(t) golden parachute tables, appraisal rights notices, antitrust risk factors, and the forms of merger and voting agreements attached as exhibits. Litigation counsel diff PREM14A against DEFM14A to infer staff comments and supplemental disclosures added under settlement pressure. Output: drafting templates and disclosure-defense memos.
Use PREM14A as the trigger document for disclosure-adequacy claims. Scrutinize background chronology for omitted bidders, projections for missing line items, opinion methodology for unsupported assumptions, and banker and management conflicts. Output: demand letters, complaints, and mootness-fee negotiations. The PREM14A's filing date sets the litigation calendar.
Advisors build vote recommendations on the merger and the say-on-parachute vote, reading strategic rationale, valuation analyses, the opinion summary, post-closing governance, equity treatment, and Item 402(t) tables. Solicitors model turnout, map institutional and retail blocks, and design outreach. Output: recommendation reports and solicitation playbooks ready when the DEFM14A mails.
Use the PREM14A series back to 1994 for longitudinal studies of deal-protection devices, premium and control trends, opinion practices, parachute escalation, and disclosure depth across legal regimes. Output: panel datasets and published research. The preliminary version is preferred where the question concerns initial disclosure choices, not post-comment revisions.
Monitor how their own and peer firms' fairness opinions and conflict disclosures appear in public filings. Reconcile disclosed fees, prior engagements, and trading relationships against internal records and calibrate the firm's own conflicts language. PREM14A-versus-DEFM14A diffs flag drafting practices likely to draw staff attention.
Parse filings to populate structured M&A databases: registrant and target identifiers, deal value, consideration type, premiums to one-day and four-week unaffected, advisor names and fees, termination fees and triggers, expected close, and vote thresholds. Output: league tables, completion-probability models, premium analytics, and arbitrage spread feeds. PREM14A is the earliest filing carrying the full structured field set, cutting latency versus DEFM14A or press releases.
Financial reporters use PREM14A as the primary source for deal-mechanics coverage, drawing on background, opinion summary, and parachute tables. Corporate-accountability researchers track change-of-control payouts and workforce and pension treatment. Activist research desks scan the disclosed process, conflicts, and premium for grounds to launch a campaign or topping bid.
PREM14A is the operative document set for negotiated public-company M&A. Arbitrage and event-driven desks act on it in real time; bankers, lawyers, proxy advisors, and compliance use it for benchmarking and drafting; plaintiffs, journalists, academics, and accountability researchers use it for monitoring and longitudinal study. The PREM14A-versus-DEFM14A delta is itself a research signal.
PREM14A files support a small set of operational workflows where the value comes from specific sections of the preliminary proxy. The use cases below name the audience, the records or fields used, the workflow, and the output.
Risk-arb desks parse the "Conditions to the Merger," "Termination," and "Regulatory Approvals" sections of the primary <TYPE>PREM14A</TYPE> HTML to extract MAC carve-outs, HSR and foreign-investment conditions, outside dates, reverse termination fees, and specific-performance language. Combined with the announced consideration and exchange ratio from the summary term sheet, these feed a deal-by-deal spread model that prices break risk and timing risk against the live target quote. Output: a spread sheet with break-probability and expected-close-date inputs for position sizing.
Corporate development teams and M&A bankers extract a fixed field set from each record — parties, deal value, consideration mix, exchange ratio and collar, premium to unaffected, no-shop and fiduciary-out terms, matching rights, breakup fee and reverse breakup fee, go-shop window — by parsing the merger-agreement summary and the merger-agreement annex (inlined or filed as EX-2.x). Indexed by SIC, deal size, and year, the resulting table backs precedent decks, board materials, and league-table comps.
M&A advisors, expert witnesses, and academic researchers harvest the "Opinion of the Financial Advisor" section together with the fairness-opinion exhibit (commonly EX-99.x) to capture advisor identity, fee structure, prior-relationship disclosures, methodologies used (selected companies, precedent transactions, DCF, LBO, premiums paid), implied valuation ranges, and the management projections shared with the advisor. Output: a methodology corpus used for opinion benchmarking, expert-report support, and longitudinal studies of valuation practice.
Plaintiffs' deal-litigation firms ingest each new PREM14A on its filedAt timestamp and scan "Background of the Merger," the projections section, and the conflicts disclosures for omitted bidders, missing forecast line items, advisor fee tail, and undisclosed management discussions. The same workflow later diffs the PREM14A against the corresponding DEFM14A (joined by registrant CIK and periodOfReport) to identify supplemental disclosures added under settlement pressure. Output: demand letters, complaint drafts, and mootness-fee positions calendared off the preliminary filing date.
ML teams use the as-filed HTML bodies — after stripping the SGML <DOCUMENT> envelope — as labeled training data for clause classification (no-shop, fiduciary out, MAC, breakup-fee tiers), risk-factor tagging, and section segmentation. The consistent Schedule 14A and Regulation M-A item structure provides weak labels at scale, while the heterogeneity of filer-counsel drafting habits supplies the variation needed for robust models. Output: clause extractors, deal-term taggers, and summarization models that generalize across DEFM14A, PRER14A, and 13E-3.
Governance researchers and proxy advisors pull the Item 402(t) golden-parachute table and the related say-on-parachute proposal text from each PREM14A to capture per-executive cash, equity acceleration, perquisite, and tax-gross-up amounts together with single- versus double-trigger structure. Joined to the registrant cik and sic, the panel supports parachute-escalation studies, vote-recommendation models, and corporate-accountability reporting on change-of-control payouts.
Quantitative researchers parse the EX-FILING FEES Exhibit 107 document (Tables 1–3) across the 2022-present coverage to extract the proposed maximum aggregate transaction value, fee-calculation rule, Section 6(b) fee rate, and any Rule 0-11(a)(2) offsets. Because Exhibit 107 is structured and small, this yields a clean deal-value series at filing time — earlier than DEFM14A and independent of press-release headline numbers — usable for activity indices, fee-revenue models, and validation of third-party deal databases.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-prem14a-files.json
This endpoint returns dataset metadata and a list of all available container files. The metadata includes the dataset name, description, last updated timestamp, earliest sample date, total record count, total dataset size, covered form types (PREM14A), container format (ZIP), and content file types (TXT, JSON, HTML, PDF). It also includes the full dataset download URL and an array of containers with per-container key, size, records, updatedAt, and downloadUrl fields. Poll this endpoint to detect which containers were modified in the most recent refresh and download only those updated containers on a daily basis. This endpoint does not require an API key.
Example response:
1
{
2
"datasetId": "1f13365b-9ae0-694d-bc5c-7f43f28984b8",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-prem14a-files.zip",
4
"name": "Form PREM14A Files Dataset",
5
"description": "Form PREM14A filings contain preliminary proxy statements relating to mergers or acquisitions, filed pursuant to Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A.",
6
"updatedAt": "2026-05-05T02:49:10.523Z",
7
"earliestSampleDate": "1994-01-01",
8
"totalRecords": 6493,
9
"totalSize": 1228626276,
10
"formTypes": ["PREM14A"],
11
"containerFormat": "ZIP",
12
"fileTypes": ["TXT", "JSON", "HTML", "PDF"],
13
"containers": [
14
{
15
"downloadUrl": "https://api.sec-api.io/datasets/form-prem14a-files/2026/2026-03.zip",
16
"key": "2026/2026-03.zip",
17
"size": 13818783,
18
"records": 24,
19
"updatedAt": "2026-05-05T02:49:10.523Z"
20
}
21
]
22
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-prem14a-files.zip?token=YOUR_API_KEY
Downloads the full dataset as a single ZIP archive containing all monthly containers from the earliest sample date (1994-01-01) onward. This endpoint requires an API key passed via the token query parameter.
Download Single Container: https://api.sec-api.io/datasets/form-prem14a-files/2026/2026-03.zip?token=YOUR_API_KEY
Downloads one monthly container ZIP instead of the full dataset. Replace the year and month segments with the key of any container returned by the dataset index API. This endpoint requires an API key passed via the token query parameter.
The dataset covers PREM14A filings — preliminary proxy statements relating to a merger, acquisition, sale of substantially all assets, or similar extraordinary corporate transaction, filed under Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A. Subsequent amendments (PREM14A/A, PRER14A) and the definitive proxy (DEFM14A) are distinct EDGAR submissions and are not included.
One record is a single PREM14A submission on EDGAR, identified by exactly one accession number. The record is a folder containing a metadata.json manifest and all documents from the original submission — typically the primary preliminary proxy statement (<TYPE>PREM14A</TYPE>), the Exhibit 107 filing-fees document (<TYPE>EX-FILING FEES</TYPE>), and any additional exhibits such as merger agreements, fairness opinions, and voting agreements.
The filer is a registrant subject to the proxy rules under Section 14(a) — typically a domestic public company with equity registered under Section 12 — that is soliciting shareholder votes on a business combination. Common profiles include the public target in a cash merger, an issuer in a Rule 13e-3 going-private transaction, and any registrant whose shareholders must vote on a sale of substantially all assets, dissolution, or similar fundamental change. Foreign private issuers are exempt from Section 14(a) and do not file PREM14A.
Rule 14a-6(a) requires the preliminary copy of the proxy and form of proxy to be filed at least 10 calendar days before definitive copies are first sent to security holders. There is no statutory deadline tied to merger-agreement signing; the timing is driven by the merger agreement's "as promptly as reasonably practicable" covenant and the deal's outside date.
Coverage runs from 1994 to the present. Records are grouped into monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip. Inside a monthly ZIP, every record occupies its own folder named with the 18-digit zero-padded accession number. Content file types are TXT, JSON, HTML, and PDF.
PREM14A is the pre-review draft — often the first public document containing the full deal narrative — filed before SEC staff review and before mailing. DEFM14A is the post-comment, cleared version actually mailed to shareholders. PREM14A may include placeholders or language that later changes; the PREM14A-versus-DEFM14A delta is itself a useful research signal for staff comments and supplemental disclosures.
No. All .jpg, .gif, and .png graphics are stripped from the ZIP, although their entries remain in documentFormatFiles as GRAPHIC records. Content incorporated by reference from the registrant's 10-K, 10-Q, or 8-K filings is also not reproduced inside the record. The complete-submission .txt is referenced via linkToTxt but not re-shipped inside the folder.