Form PREM14A Files Dataset

The Form PREM14A Files Dataset is a complete archive of preliminary merger and acquisition proxy statements filed on EDGAR under Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A. Each record is a single PREM14A submission, identified by exactly one EDGAR accession number, and contains the preliminary proxy statement, the structured Exhibit 107 filing-fees table, any additional exhibits (merger agreements, fairness opinions, voting and support agreements, charter amendments), and a metadata.json manifest describing the filing and its constituent documents. Filers are domestic registrants subject to the proxy rules — typically the public target in a cash merger, a public issuer in a going-private or affiliated-party transaction, or any registrant whose shareholders must vote on a merger, sale of substantially all assets, recapitalization, or similar fundamental change. Coverage runs from 1994 to the present, packaged as monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip, with content delivered as TXT, JSON, HTML, and PDF.

Update Frequency
Daily
Updated at
2026-05-09
Earliest Sample Date
1994-01-01
Total Size
1.2 GB
Total Records
6,506
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF
Form Types
PREM14A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

380 files · 1.2 GB
Download All
2026-05.zip1.4 MB15 records
2026-04.zip2.4 MB21 records
2026-03.zip5.9 MB33 records
2026-02.zip1.7 MB12 records
2026-01.zip4.5 MB30 records
2025-12.zip5.1 MB38 records
2025-11.zip3.6 MB22 records
2025-10.zip4.8 MB31 records
2025-09.zip5.5 MB23 records
2025-08.zip4.5 MB27 records
2025-07.zip5.7 MB34 records
2025-06.zip2.5 MB18 records
2025-05.zip1.7 MB10 records
2025-04.zip3.8 MB18 records
2025-03.zip2.7 MB15 records
2025-02.zip2.9 MB19 records
2025-01.zip4.3 MB21 records
2024-12.zip3.1 MB20 records
2024-11.zip3.7 MB27 records
2024-10.zip3.2 MB22 records
2024-09.zip5.9 MB38 records
2024-08.zip5.1 MB24 records
2024-07.zip3.1 MB18 records
2024-06.zip3.5 MB20 records
2024-05.zip5.1 MB27 records
2024-04.zip2.8 MB20 records
2024-03.zip5.7 MB27 records
2024-02.zip3.6 MB20 records
2024-01.zip6.6 MB35 records
2023-12.zip1.7 MB12 records
2023-11.zip3.8 MB21 records
2023-10.zip3.3 MB17 records
2023-09.zip3.3 MB24 records
2023-08.zip3.9 MB19 records
2023-07.zip2.3 MB12 records
2023-06.zip6.6 MB33 records
2023-05.zip5.2 MB28 records
2023-04.zip4.0 MB25 records
2023-03.zip5.4 MB23 records
2023-02.zip4.2 MB15 records
2023-01.zip2.8 MB12 records
2022-12.zip3.1 MB20 records
2022-11.zip4.4 MB25 records
2022-10.zip3.6 MB15 records
2022-09.zip5.0 MB27 records
2022-08.zip6.5 MB32 records
2022-07.zip4.9 MB22 records
2022-06.zip4.2 MB22 records
2022-05.zip3.8 MB23 records
2022-04.zip5.2 MB30 records
2022-03.zip6.4 MB32 records
2022-02.zip6.6 MB28 records
2022-01.zip4.7 MB10 records
2021-12.zip3.9 MB13 records
2021-11.zip3.8 MB8 records
2021-10.zip5.3 MB14 records
2021-09.zip7.7 MB18 records
2021-08.zip9.2 MB20 records
2021-07.zip5.3 MB16 records
2021-06.zip7.3 MB16 records
2021-05.zip8.9 MB19 records
2021-04.zip3.9 MB11 records
2021-03.zip5.3 MB13 records
2021-02.zip7.1 MB15 records
2021-01.zip6.4 MB18 records
2020-12.zip4.7 MB12 records
2020-11.zip3.9 MB9 records
2020-10.zip3.8 MB10 records
2020-09.zip5.0 MB8 records
2020-08.zip8.0 MB18 records
2020-07.zip3.5 MB6 records
2020-06.zip657.3 KB2 records
2020-05.zip2.0 MB5 records
2020-04.zip1.1 MB4 records
2020-03.zip3.7 MB11 records
2020-02.zip2.6 MB8 records
2020-01.zip4.8 MB10 records
2019-12.zip8.1 MB22 records
2019-11.zip4.6 MB12 records
2019-10.zip4.4 MB9 records
2019-09.zip5.5 MB11 records
2019-08.zip4.7 MB14 records
2019-07.zip6.1 MB20 records
2019-06.zip1.9 MB7 records
2019-05.zip2.7 MB12 records
2019-04.zip2.0 MB8 records
2019-03.zip3.1 MB11 records
2019-02.zip3.2 MB9 records
2019-01.zip2.2 MB7 records
2018-12.zip3.0 MB10 records
2018-11.zip5.9 MB15 records
2018-10.zip3.1 MB10 records
2018-09.zip2.6 MB10 records
2018-08.zip6.0 MB15 records
2018-07.zip6.0 MB18 records
2018-06.zip3.3 MB13 records
2018-05.zip5.0 MB13 records
2018-04.zip3.9 MB13 records
2018-03.zip3.4 MB11 records
2018-02.zip5.2 MB16 records
2018-01.zip1.1 MB4 records
2017-12.zip3.7 MB12 records
2017-11.zip2.7 MB9 records
2017-10.zip4.2 MB13 records
2017-09.zip5.6 MB14 records
2017-08.zip3.4 MB10 records
2017-07.zip3.4 MB13 records
2017-06.zip4.8 MB15 records
2017-05.zip4.8 MB15 records
2017-04.zip3.2 MB9 records
2017-03.zip3.9 MB13 records
2017-02.zip2.2 MB7 records
2017-01.zip3.7 MB9 records
2016-12.zip5.2 MB16 records
2016-11.zip2.9 MB13 records
2016-10.zip2.2 MB9 records
2016-09.zip4.8 MB18 records
2016-08.zip2.1 MB9 records
2016-07.zip6.5 MB18 records
2016-06.zip3.5 MB17 records
2016-05.zip2.4 MB10 records
2016-04.zip3.5 MB13 records
2016-03.zip4.3 MB16 records
2016-02.zip1.8 MB6 records
2016-01.zip1.2 MB4 records
2015-12.zip2.5 MB9 records
2015-11.zip4.6 MB19 records
2015-10.zip3.1 MB11 records
2015-09.zip2.9 MB16 records
2015-08.zip3.8 MB15 records
2015-07.zip2.3 MB9 records
2015-06.zip2.0 MB7 records
2015-05.zip3.8 MB13 records
2015-04.zip2.8 MB8 records
2015-03.zip3.9 MB13 records
2015-02.zip600.6 KB3 records
2015-01.zip1.8 MB6 records
2014-12.zip4.4 MB11 records
2014-11.zip1.1 MB4 records
2014-10.zip4.3 MB15 records
2014-09.zip4.5 MB39 records
2014-08.zip1.1 MB7 records
2014-07.zip3.2 MB10 records
2014-06.zip2.0 MB8 records
2014-05.zip4.9 MB16 records
2014-04.zip2.3 MB10 records
2014-03.zip2.2 MB7 records
2014-02.zip1.5 MB7 records
2014-01.zip1.7 MB7 records
2013-12.zip1.3 MB6 records
2013-11.zip2.3 MB10 records
2013-10.zip2.9 MB14 records
2013-09.zip2.2 MB10 records
2013-08.zip2.0 MB9 records
2013-07.zip2.7 MB13 records
2013-06.zip2.3 MB10 records
2013-05.zip3.3 MB14 records
2013-04.zip1.3 MB6 records
2013-03.zip5.0 MB18 records
2013-02.zip2.3 MB11 records
2013-01.zip2.0 MB9 records
2012-12.zip1.7 MB8 records
2012-11.zip4.3 MB15 records
2012-10.zip1.7 MB6 records
2012-09.zip4.3 MB15 records
2012-08.zip2.9 MB13 records
2012-07.zip3.6 MB11 records
2012-06.zip4.5 MB17 records
2012-05.zip3.7 MB14 records
2012-04.zip3.5 MB14 records
2012-03.zip720.5 KB4 records
2012-02.zip1.3 MB6 records
2012-01.zip1.0 MB5 records
2011-12.zip2.1 MB11 records
2011-11.zip1.7 MB9 records
2011-10.zip5.0 MB17 records
2011-09.zip2.6 MB13 records
2011-08.zip2.4 MB15 records
2011-07.zip4.1 MB11 records
2011-06.zip5.9 MB22 records
2011-05.zip2.4 MB12 records
2011-04.zip4.0 MB24 records
2011-03.zip3.6 MB27 records
2011-02.zip2.9 MB10 records
2011-01.zip2.5 MB14 records
2010-12.zip3.9 MB17 records
2010-11.zip2.9 MB15 records
2010-10.zip4.9 MB28 records
2010-09.zip4.6 MB20 records
2010-08.zip2.7 MB14 records
2010-07.zip4.3 MB18 records
2010-06.zip2.3 MB11 records
2010-05.zip3.0 MB13 records
2010-04.zip7.3 MB21 records
2010-03.zip1.6 MB9 records
2010-02.zip2.0 MB10 records
2010-01.zip2.5 MB11 records
2009-12.zip3.9 MB17 records
2009-11.zip3.6 MB14 records
2009-10.zip3.9 MB15 records
2009-09.zip3.4 MB12 records
2009-08.zip3.9 MB12 records
2009-07.zip5.5 MB21 records
2009-06.zip3.9 MB15 records
2009-05.zip2.0 MB8 records
2009-04.zip1.5 MB7 records
2009-03.zip1.0 MB6 records
2009-02.zip492.4 KB3 records
2009-01.zip2.5 MB7 records
2008-12.zip2.3 MB9 records
2008-11.zip2.0 MB9 records
2008-10.zip4.0 MB14 records
2008-09.zip3.4 MB17 records
2008-08.zip3.8 MB19 records
2008-07.zip4.3 MB20 records
2008-06.zip3.6 MB17 records
2008-05.zip2.8 MB13 records
2008-04.zip10.2 MB26 records
2008-03.zip2.0 MB11 records
2008-02.zip5.2 MB23 records
2008-01.zip3.5 MB13 records
2007-12.zip3.8 MB19 records
2007-11.zip4.8 MB18 records
2007-10.zip5.4 MB27 records
2007-09.zip4.4 MB30 records
2007-08.zip8.8 MB47 records
2007-07.zip7.7 MB36 records
2007-06.zip9.4 MB42 records
2007-05.zip6.3 MB34 records
2007-04.zip4.9 MB26 records
2007-03.zip6.4 MB44 records
2007-02.zip5.5 MB31 records
2007-01.zip6.4 MB35 records
2006-12.zip4.7 MB31 records
2006-11.zip6.6 MB39 records
2006-10.zip7.1 MB37 records
2006-09.zip4.5 MB30 records
2006-08.zip7.5 MB43 records
2006-07.zip5.1 MB26 records
2006-06.zip4.1 MB22 records
2006-05.zip3.3 MB24 records
2006-04.zip3.9 MB35 records
2006-03.zip3.9 MB27 records
2006-02.zip3.6 MB26 records
2006-01.zip2.6 MB19 records
2005-12.zip5.6 MB23 records
2005-11.zip2.9 MB20 records
2005-10.zip3.4 MB21 records
2005-09.zip4.6 MB29 records
2005-08.zip3.8 MB41 records
2005-07.zip4.1 MB30 records
2005-06.zip3.2 MB25 records
2005-05.zip3.8 MB22 records
2005-04.zip1.8 MB15 records
2005-03.zip1.9 MB16 records
2005-02.zip3.5 MB20 records
2005-01.zip3.4 MB15 records
2004-12.zip2.3 MB20 records
2004-11.zip2.9 MB35 records
2004-10.zip4.0 MB31 records
2004-09.zip3.6 MB21 records
2004-08.zip3.5 MB20 records
2004-07.zip2.3 MB17 records
2004-06.zip2.6 MB18 records
2004-05.zip2.4 MB14 records
2004-04.zip3.1 MB22 records
2004-03.zip2.5 MB18 records
2004-02.zip3.0 MB20 records
2004-01.zip829.0 KB11 records
2003-12.zip3.5 MB20 records
2003-11.zip2.2 MB19 records
2003-10.zip2.6 MB23 records
2003-09.zip3.0 MB26 records
2003-08.zip3.4 MB21 records
2003-07.zip2.6 MB27 records
2003-06.zip3.1 MB20 records
2003-05.zip4.7 MB28 records
2003-04.zip1.6 MB17 records
2003-03.zip919.1 KB7 records
2003-02.zip1.5 MB19 records
2003-01.zip2.4 MB18 records
2002-12.zip3.6 MB29 records
2002-11.zip2.8 MB21 records
2002-10.zip3.3 MB39 records
2002-09.zip676.0 KB7 records
2002-08.zip2.8 MB30 records
2002-07.zip1.8 MB18 records
2002-06.zip1.9 MB20 records
2002-05.zip3.8 MB24 records
2002-04.zip2.7 MB22 records
2002-03.zip1.1 MB19 records
2002-02.zip3.5 MB15 records
2002-01.zip2.3 MB16 records
2001-12.zip2.0 MB17 records
2001-11.zip2.0 MB37 records
2001-10.zip2.4 MB17 records
2001-09.zip2.7 MB20 records
2001-08.zip2.9 MB26 records
2001-07.zip3.3 MB26 records
2001-06.zip5.2 MB31 records
2001-05.zip5.1 MB29 records
2001-04.zip2.8 MB29 records
2001-03.zip2.0 MB21 records
2001-02.zip1.9 MB24 records
2001-01.zip2.6 MB30 records
2000-12.zip3.0 MB29 records
2000-11.zip3.5 MB27 records
2000-10.zip2.1 MB28 records
2000-09.zip3.1 MB26 records
2000-08.zip2.8 MB21 records
2000-07.zip2.6 MB24 records
2000-06.zip3.7 MB36 records
2000-05.zip3.4 MB29 records
2000-04.zip2.7 MB20 records
2000-03.zip2.5 MB16 records
2000-02.zip2.2 MB20 records
2000-01.zip2.8 MB35 records
1999-12.zip3.3 MB15 records
1999-11.zip1.5 MB11 records
1999-10.zip1.9 MB14 records
1999-09.zip2.0 MB8 records
1999-08.zip5.9 MB30 records
1999-07.zip1.9 MB16 records
1999-06.zip1.9 MB29 records
1999-05.zip587.7 KB13 records
1999-04.zip2.8 MB21 records
1999-03.zip752.2 KB13 records
1999-02.zip1.4 MB20 records
1999-01.zip1.0 MB12 records
1998-12.zip2.3 MB25 records
1998-11.zip718.4 KB8 records
1998-10.zip899.9 KB12 records
1998-09.zip2.1 MB17 records
1998-08.zip1.2 MB32 records
1998-07.zip2.6 MB25 records
1998-06.zip1.5 MB24 records
1998-05.zip1.0 MB16 records
1998-04.zip1.6 MB15 records
1998-03.zip998.4 KB12 records
1998-02.zip811.6 KB6 records
1998-01.zip1.1 MB14 records
1997-12.zip811.1 KB11 records
1997-11.zip697.0 KB8 records
1997-10.zip2.1 MB14 records
1997-09.zip1.5 MB22 records
1997-08.zip1.5 MB25 records
1997-07.zip1.4 MB15 records
1997-06.zip591.0 KB6 records
1997-05.zip1.2 MB15 records
1997-04.zip1.4 MB21 records
1997-03.zip1.3 MB16 records
1997-02.zip842.5 KB7 records
1997-01.zip1.5 MB11 records
1996-12.zip441.1 KB5 records
1996-11.zip1.4 MB11 records
1996-10.zip1.8 MB15 records
1996-09.zip592.9 KB10 records
1996-08.zip9.8 MB47 records
1996-07.zip1.1 MB32 records
1996-06.zip436.3 KB9 records
1996-05.zip1.4 MB18 records
1996-04.zip22.8 KB1 records
1996-03.zip75.0 KB3 records
1996-02.zip501.7 KB8 records
1996-01.zip483.5 KB7 records
1995-12.zip390.7 KB3 records
1995-11.zip478.2 KB14 records
1995-10.zip327.0 KB6 records
1995-08.zip224.8 KB4 records
1995-07.zip199.3 KB2 records
1995-06.zip494.0 KB5 records
1995-05.zip289.1 KB5 records
1995-04.zip249.8 KB8 records
1995-03.zip207.6 KB4 records
1995-02.zip27.9 KB1 records
1994-11.zip124.2 KB1 records
1994-10.zip159.1 KB2 records
1994-04.zip70.9 KB1 records
1994-03.zip533.7 KB7 records
1994-01.zip82.4 KB1 records

What This Dataset Contains

The dataset packages every PREM14A submission filed on EDGAR — the preliminary form of proxy statement that registrants file when soliciting shareholder votes on a merger, acquisition, sale of substantially all assets, or similar extraordinary corporate transaction. The substantive disclosure inside each filing is governed by Schedule 14A, with the M&A-specific items (Items 11–14, 18, and 21) and the Regulation M-A integration items (notably Item 1001, Item 1002, and Item 1015) carrying most of the weight, supplemented by the merger agreement, fairness opinion(s), and voting agreements filed as annexes or exhibits.

Records are grouped into monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip. Inside a monthly ZIP, every record occupies its own folder named with the 18-digit zero-padded accession number (hyphens removed); that folder is the atomic unit of the dataset and contains a metadata.json manifest plus all documents from the original EDGAR submission. The dataset's coverage window stretches from EDGAR's earliest years to the present, and content is delivered as TXT, JSON, HTML, and PDF. Image binaries (.jpg, .gif, .png) are deliberately excluded from the package even though they remain referenced as GRAPHIC entries in the manifest. Modern records consist almost exclusively of metadata.json plus one or more SGML-wrapped .htm documents.

Content Structure of a Single Record

A record is built from four nested content layers, from outermost packaging to innermost disclosure:

  1. The monthly ZIP container and the per-accession folder.
  2. The metadata.json manifest at the root of the folder.
  3. The SGML <DOCUMENT> envelopes wrapping each .htm (and any .txt/.pdf) attachment, mirroring the EDGAR submission structure.
  4. The substantive Schedule 14A disclosure inside the primary preliminary proxy statement, the structured Exhibit 107 fee table, and any additional exhibits.

Folder layout

A typical record folder contains:

  • metadata.json (always present).
  • One primary HTML document declared <TYPE>PREM14A</TYPE> and <SEQUENCE>1</SEQUENCE> — the full preliminary proxy statement. Filer-specific naming is the norm: <token>_prem14a.htm, formprem14a.htm, d<id>dprem14a.htm, and similar variants.
  • One Exhibit 107 filing-fees document declared <TYPE>[EX-FILING FEES](https://www.ecfr.gov/current/title-17/chapter-II/part-229/subpart-229.600/section-229.601)</TYPE> — typically named <token>_ex107.htm, ex-107.htm, <token>_exfee.htm, or d<id>dexfilingfees.htm.
  • Any additional exhibit documents the registrant chose to file as separate attachments: merger agreements (EX-2.x), fairness opinions (often EX-99.x), voting/support agreements, charter amendments, target-company financial statements, consents of accountants and financial advisors (EX-23.x), and miscellaneous correspondence. Larger filings can also carry PDF and TXT attachments.

Image graphics referenced in documentFormatFiles as GRAPHIC entries are intentionally excluded from the ZIP; the manifest entries remain, but no corresponding binary file is present on disk.

metadata.json schema

metadata.json describes both the filing as a whole and each constituent document. Top-level fields:

  • formType — always "PREM14A" for this dataset.
  • accessionNo — canonical hyphenated accession number (e.g., "0001683168-25-004792").
  • linkToFilingDetails — URL to the primary preliminary proxy HTML on sec.gov.
  • linkToTxt — URL to the full concatenated SGML submission .txt on sec.gov.
  • linkToHtml — URL to the EDGAR -index.htm filing index page.
  • linkToXbrl — empty string; PREM14A is not subject to XBRL tagging.
  • dataFiles — empty array, for the same reason.
  • description — "Form PREM14A - Preliminary proxy statements relating to merger or acquisition".
  • filedAt — ISO 8601 acceptance timestamp with timezone offset.
  • periodOfReport — YYYY-MM-DD period-of-report date.
  • documentFormatFiles — array of per-document descriptors (see below).
  • entities — array of filer entity descriptors (typically one).
  • seriesAndClassesContractsInformation — investment-company series/classes data; empty for typical operating-company PREM14A filings.
  • id — 32-hex internal record identifier.

Each documentFormatFiles[] element carries a sequence (string; "1", "2", ...; a single space for the complete-submission text file), a size in bytes (string), a documentUrl pointing to the file on sec.gov, an optional description ("PREM14A", "FILING FEES TABLE", "GRAPHIC", "Complete submission text file", etc.), and a type mirroring the SGML <TYPE> ("PREM14A", "EX-FILING FEES", "GRAPHIC", blank for the bundled .txt). The first entry is always the preliminary proxy statement itself; the EX-FILING FEES entry is Exhibit 107; GRAPHIC entries reference excluded image files; the trailing blank-type entry points to the full SGML submission .txt on EDGAR.

Each entities[] element typically describes a single filer/registrant: companyName with a role suffix such as "(Filer)", cik, fileNo, irsNo, stateOfIncorporation, fiscalYearEnd in MMDD form, act ("34" for the Exchange Act), type (the filer-role form type, "PREM14A"), sic (SIC code with description), filmNo, and tickers. Where the registrant is a SPAC or has affiliated reporting tickers, tickers may carry multiple symbols.

SGML document wrapper

Each .htm (and .txt/.pdf) attachment in the folder retains the EDGAR SGML envelope that wraps the actual HTML body, of the form:

1 <DOCUMENT>
2 <TYPE>PREM14A
3 <SEQUENCE>1
4 <FILENAME>collective_prem14a.htm
5 <DESCRIPTION>PRELIMINARY NOTICE OF SPECIAL MEETING
6 <TEXT>
7 <HTML>... full HTML content ...</HTML>
8 </TEXT>
9 </DOCUMENT>

The <TYPE> value mirrors documentFormatFiles[].type and <SEQUENCE> mirrors the declared sequence number, so the manifest and on-disk wrapper can be cross-walked deterministically. Parsers must strip the leading <DOCUMENT>...<TEXT> and trailing </TEXT></DOCUMENT> lines before passing the inner body to a DOM parser.

Substantive structure of the preliminary proxy statement

The primary <TYPE>PREM14A</TYPE> HTML document is consistently the largest file in the record — typically 0.8–2.4 MB of HTML for an operating-company merger — and carries the full Schedule 14A disclosure in a conventional order:

  • Schedule 14A cover page. Box-marked "Preliminary Proxy Statement," registrant identity, and the payment-of-filing-fee section. In current practice the cover defers the fee calculation to Exhibit 107 rather than carrying it in line.
  • Letter to stockholders. Chair/CEO transmittal letter framing the proposed transaction, the recommended vote, and meeting logistics.
  • Notice of special meeting of stockholders. Meeting date, time, location (or virtual access), record date, agenda items, and the vote required for each proposal (typically a majority of outstanding shares for a statutory merger under Delaware law, or two-thirds where charter or state law require it).
  • Summary term sheet / questions and answers. Required by Item 1001 of Regulation M-A; condenses the proposed merger consideration, exchange ratio, conditions, and key risks.
  • Cautionary statement on forward-looking information.
  • Risk factors. Transaction-specific risks (closing conditions, regulatory clearance, exchange-ratio collar, financing risk, deal litigation, post-closing operating risk) plus, for stock-for-stock deals, risks of holding the buyer's securities.
  • The merger / the transaction. Background of the merger (chronological negotiation narrative), board's reasons and recommendation, opinion of the financial advisor (with summary of analyses — DCF, comparable companies, comparable transactions, premium-paid — and the underlying assumptions and limitations), interests of directors and executive officers (golden parachutes, change-in-control payments, equity-award treatment), regulatory approvals and antitrust filings, accounting and tax treatment, and appraisal/dissenters' rights with the statutory text reproduced as an annex.
  • The merger agreement. Summary of parties, structure, merger consideration mechanics, treatment of equity awards, representations and warranties, interim operating covenants, no-shop / fiduciary-out / matching-rights provisions, closing conditions, termination rights, and termination fees / expense reimbursements.
  • Voting and support agreements. Where principal stockholders have committed to vote in favor.
  • Financing of the merger. Where applicable — debt commitment letters, equity commitments, marketing periods.
  • Information about the parties. Business descriptions of acquirer and target.
  • Selected and unaudited prospective financial information. Management forecasts shared with the financial advisor.
  • Market price and dividend information.
  • Beneficial ownership tables. Item 6 / Section 13(d) disclosures.
  • The special meeting. Quorum, broker non-votes, abstentions, revocation of proxies, solicitation costs.
  • Other proposals on the ballot. "Golden parachute" advisory vote required by Section 14A and Rule 14a-21(c), adjournment proposal, charter amendments incident to the merger.
  • Where you can find more information / incorporation by reference. Cross-references to the registrant's most recent 10-K, 10-Q, and 8-K filings.
  • Annexes. Merger agreement, voting agreements, fairness opinion(s), Section 262 (or analogous statute) text, charter amendments. These annexes are most commonly inlined at the end of the primary HTML document; large filings split them out as separate exhibit files.

Exhibit 107 — the filing-fees table

The EX-FILING FEES document is a small structured exhibit (typically 8–35 KB of HTML) containing the "Calculation of Filing Fee Tables" required since the SEC's 2021 amendments implementing Rule 0-11 fee-disclosure modernization. It tabulates:

  • Table 1 — Transaction Valuation. Identifies the fee-bearing transaction(s), the fee calculation rule, the proposed maximum aggregate value of the transaction, the Section 6(b) fee rate applicable at filing, and the amount of filing fee.
  • Table 2 — Fee Offset Claims and Sources. Where the registrant claims offsets under Rule 0-11(a)(2), with cross-references to the prior filing whose fee is being offset.
  • Table 3 — Combined Prospectus Offerings (when relevant for multi-form transactions).
  • Net fee due, after total fees, total fees previously paid, and total fee offsets.

In current practice the cover page of the proxy statement no longer carries the fee calculation in line; that calculation lives exclusively in Exhibit 107.

Other exhibits and attachments

When a registrant chooses not to inline annexes into the primary document, additional .htm (or PDF, or TXT) documents in the record folder may carry: the merger agreement and any amendments, the financial advisor's written fairness opinion, voting/support agreements, equity and debt commitment letters, charter amendment text, target-company audited or unaudited financial statements, pro forma financial information, consents of accountants and financial advisors, and miscellaneous correspondence. Each is wrapped in its own <DOCUMENT> envelope with an exhibit-appropriate <TYPE> (e.g., EX-2.1, EX-23.1, EX-99.1).

What is included in the dataset record

The package preserves the substantive submission as filed on EDGAR: the preliminary proxy statement HTML, the filing-fees exhibit, any other exhibit documents, and any supplementary PDF or TXT attachments, all retaining their original SGML <DOCUMENT> wrapping. The metadata.json provides a complete manifest of every document the registrant submitted (graphics included, even though their binaries are not), plus filer-level metadata sufficient to identify the registrant, locate the live EDGAR copy, and place the filing in time.

What is excluded or held separately

  • Image files. All .jpg, .gif, and .png graphics referenced in documentFormatFiles are stripped from the ZIP. Their entries remain in the manifest, but the binary files are not packaged. Consequently, embedded <img> tags in the primary HTML resolve to missing files locally.
  • The complete-submission .txt. The concatenated SGML submission text file is referenced in documentFormatFiles (and via linkToTxt) but is not re-shipped inside the folder; the per-document .htm files supersede it.
  • SEC staff comment letters and registrant responses (UPLOAD / CORRESP). Filed under separate accession numbers and form types; not part of a PREM14A record.
  • Subsequent amendments and the definitive proxy. PREM14A/A and DEFM14A are distinct EDGAR submissions with their own accession numbers; they are not merged into the PREM14A record.
  • Content incorporated by reference. Portions of the registrant's most recent 10-K, 10-Q, or 8-K filings that the proxy incorporates by reference are not reproduced inside the record.

Evolution of required content

Schedule 14A is the substantive backbone, but several rule changes have left visible imprints on PREM14A content over the dataset's coverage window:

  • Plain English and Regulation M-A integration (1999–2000). Adoption of Items 1001, 1002, and 1015 standardized the summary term sheet, the background-of-the-merger narrative, and the treatment of fairness-opinion summaries.
  • Sarbanes-Oxley and post-SOX governance disclosures (2002 onward). SOX-driven disclosure of director independence and audit-committee composition flows through to merger proxies that include director-election or post-closing governance components.
  • Dodd-Frank Section 14A and Rule 14a-21(c) (2011). Required a separate non-binding shareholder advisory vote on "golden parachute" compensation tied to the transaction; PREM14A filings from 2011 forward carry a dedicated golden-parachute table and accompanying narrative.
  • Filing-fee modernization (2021–2022). SEC Release 33-10997 introduced the structured Calculation of Filing Fee Tables exhibit (Exhibit 107). Pre-2022 PREM14A records carried the fee calculation on the cover page of the proxy statement itself; from 2022 forward the fee calculation is in a separate EX-FILING FEES document and the cover page references it.
  • Universal proxy (Rule 14a-19, 2022). Primarily relevant to contested elections, but PREM14A filings tied to contested transactions reflect the universal-proxy mechanics introduced in 2022.
  • Pay-versus-performance (Item 402(v), 2023). Where a PREM14A includes pay-related proposals or incorporates compensation disclosures by reference, the pay-versus-performance framework affects the executive-compensation tables it carries.

Evolution of file format

The on-disk format of records reflects EDGAR's own format eras:

  • Early plain-text era (1990s). The earliest PREM14A submissions are single ASCII/SGML .txt documents containing the entire proxy statement and any annexes as plain text, with tables hand-formatted in monospace. Records from this era often consist of one large .txt plus a metadata.json.
  • HTML adoption (late 1990s–early 2000s). EDGAR began accepting HTML and registrants migrated proxy statements to .htm with embedded tables and basic typography; image graphics begin to appear and, in this dataset, are excluded from the ZIP.
  • Mature multi-document HTML era (2000s–2010s). Multi-document submissions become standard, with the proxy statement, merger agreement, and fairness opinion carried as separate <DOCUMENT> blocks; PDF attachments appear for items such as scanned fairness opinions.
  • Fee-table separation (2022–present). Following the effective date of fee-disclosure modernization, PREM14A records consistently carry a second .htm document typed EX-FILING FEES alongside the primary proxy. The fee calculation no longer lives on the proxy cover page.

Interpretation notes

  • Amendments are separate records. A PREM14A and any PREM14A/A share an underlying transaction but carry distinct accession numbers and therefore exist as distinct records; reconstructing the disclosure history of a deal requires joining them by registrant CIK, period-of-report, or transaction context.
  • Inlined annexes vs. separate exhibits. Whether the merger agreement, fairness opinion, and voting agreements appear as separate <DOCUMENT> files or as inline annexes inside the primary proxy is a registrant/filer-counsel choice; both layouts coexist throughout the dataset and content extraction must accommodate both.
  • Incorporation by reference. PREM14A frequently incorporates portions of the registrant's most recent 10-K, 10-Q, and 8-K filings by reference rather than reproducing them; the referenced content is not contained in the record and must be retrieved separately if needed.
  • SGML wrapping. Every .htm retains its EDGAR SGML envelope. Cross-walking <TYPE> and <SEQUENCE> against documentFormatFiles[] lets a consumer match each on-disk file to its declared role deterministically.
  • Image exclusion. Because graphics are stripped, GRAPHIC entries in documentFormatFiles are informative for completeness checking but should not be mistaken for files present on disk.
  • Issuer-specific variation. PREM14A drafting is heterogeneous — filer counsel from different firms produce materially different section orderings, heading conventions, and exhibit-splitting habits — so robust extraction should rely on Schedule 14A item semantics and Regulation M-A item numbers rather than fixed heading text.
  • Operating company vs. investment company filings. The seriesAndClassesContractsInformation field is populated only for investment-company filers; for typical operating-company merger PREM14As it is empty, and that emptiness is itself a useful signal for filer-type segmentation.
  • Multi-filer entities. Although entities is typically a single-element array, transactions involving co-registrants (jointly filed proxies, certain SPAC structures) produce multiple entries, each with its own role suffix on companyName.

Who Files or Publishes This Dataset, and When

Who files the record

The filer of a PREM14A is a registrant subject to the proxy rules under Section 14(a) of the Exchange Act and Regulation 14A — almost always a company with a class of equity registered under Section 12 — that is soliciting proxies, consents, or authorizations from its own security holders in connection with a proposed business combination. The filer is the issuer whose shareholders must vote, not necessarily the counterparty to the deal.

Typical filer profiles:

  • Target in a cash merger. Only the target's shareholders vote, so the public target files. The acquirer is often a private buyer, sponsor vehicle, or strategic whose own shareholders have no vote requirement.
  • Issuer in a going-private or affiliated-party transaction. A public issuer being taken private under Rule 13e-3 files PREM14A whenever a shareholder vote is required, generally combined with a Schedule 13E-3 transaction statement co-filed by the issuer and any affiliates engaged in the Rule 13e-3 transaction.
  • Issuer in a sale of substantially all assets, dissolution, recapitalization, or similar fundamental change structured to require a shareholder vote.
  • Either party in a stock-for-stock deal where no S-4 is used. Where acquirer stock is registered on Form S-4, the joint proxy statement/prospectus inside the S-4 substitutes for separate PREM14A filings on each side. PREM14A is therefore most often seen in cash deals or transactions where the acquirer is not registering securities.

Outside parties heavily involved but not the named filer: outside counsel drafting Schedule 14A, proxy solicitors (Innisfree, MacKenzie, D.F. King, Georgeson, Okapi), financial advisors delivering fairness opinions, and transfer/distribution agents. They appear as "persons making the solicitation" but do not sign as the registrant.

When the record is created

A PREM14A is event-driven, triggered by the combination of:

  • Execution of a definitive transaction agreement (merger agreement, asset purchase, share exchange, scheme implementation agreement) approved by the boards. Signing is separately reported on Form 8-K Item 1.01 within four business days, and the agreement is filed as an exhibit there and later as an annex to the proxy.
  • A required shareholder vote under (i) state corporate law — for Delaware, DGCL Section 251 (mergers), Section 271 (sale of substantially all assets), and Section 253 (short-form parent-subsidiary mergers, generally not requiring a vote) — and/or (ii) listing rules, principally NYSE Listed Company Manual Section 312.03 and Nasdaq Rule 5635, which require shareholder approval for issuances of 20% or more of outstanding shares, change-of-control issuances, and related-party issuances.
  • Rule 13e-3 conditions, where the transaction would let the issuer terminate registration or delist with the issuer or an affiliate as the engaging party.

The structural timing rule is Rule 14a-6(a): a preliminary copy of the proxy and form of proxy must be filed with the SEC at least 10 calendar days before definitive copies are first sent to security holders, unless the only matters to be voted on are routine items the rule enumerates (uncontested director elections, auditor ratification, Rule 14a-8 shareholder proposals, certain compensation plans). M&A, asset sales, dispositions of significant subsidiaries, share exchanges, and going-private transactions fall outside that exemption — which is the entire reason the PREM14A tag exists, separate from PRE 14A used for non-M&A preliminary proxies.

In practice the staff frequently selects M&A proxies for review. The registrant cannot proceed to definitive until comments are cleared, typically through one or more PRER14A amendments. The DEFM14A is mailed only after staff review concludes (or the 10-day window expires without review). There is no statutory deadline to file PREM14A after signing; timing is driven by the merger agreement's "as promptly as reasonably practicable" covenant and the deal's outside date.

Adjacent communications rule: Rule 14a-12 allows soliciting communications before a proxy statement is furnished, provided each written communication is filed under cover of Schedule 14A on the date of first use with required legends. These appear on EDGAR as DEFA14A and routinely precede the first PREM14A.

Important distinctions

  • PREM14A vs PRE 14A. PREM14A is reserved for preliminary proxies relating to mergers, acquisitions, and similar business combinations; PRE 14A covers other preliminary proxies that require Rule 14a-6(a) pre-filing but are not M&A. The dataset filters to PREM14A only.
  • PREM14A vs PRER14A vs DEFM14A. PREM14A is the initial preliminary filing. PRER14A captures revisions filed in response to staff comments or new developments (revised fairness opinion, updated projections, price change, competing bid). DEFM14A is the single definitive version mailed to shareholders. One transaction yields one PREM14A, possibly several PRER14As, and one DEFM14A.
  • PREM14A vs PREM14C / DEFM14C. PREM14C and DEFM14C are the information-statement analogues used when corporate action proceeds without a solicitation of proxies — typically because a controlling shareholder has already executed a written consent. No proxies are solicited, so Schedule 14A does not apply.
  • PREM14A vs Form S-4 joint proxy/prospectus. When deal consideration includes registered acquirer stock, the joint proxy statement/prospectus inside the S-4 generally substitutes for separate PREM14A filings on either side.
  • Schedule 13E-3 overlap. In going-private transactions the issuer, controlling shareholder, acquisition vehicle, and other engaged affiliates are co-filers on Schedule 13E-3. The proxy element is still a PREM14A filed by the issuer; the 13E-3 is a separate (though typically combined) cover under a different form tag and is not in this dataset.
  • Foreign private issuers. FPIs, as defined in Rule 3b-4, are exempt from Section 14(a) by Rule 3a12-3(b) and do not file PREM14A. They disclose mergers via Form 6-K furnishing of home-country proxy circulars or scheme documents, and via Form F-4 for registered share consideration. The PREM14A population is therefore effectively domestic issuers.
  • Dissident and contested solicitations. When a third party (activist, hostile bidder, competing acquirer) solicits against management, the dissident files its own preliminary proxy under Rule 14a-6, tagged PREN14A or PRRN14A, not PREM14A. This dataset captures only the registrant/management side.
  • Confidential preliminary filings. Rule 14a-6(e)(2) allows confidential preliminary filing only for non-contested, non-Rule 13e-3 transactions whose materials have not yet been released. Confidentially submitted PREM14As are not public on EDGAR until they go non-confidential, so confidentially filed but withdrawn deals may not appear.
  • Withdrawn or terminated deals. A PREM14A filed at signing remains on EDGAR even if the transaction terminates before any DEFM14A is mailed, preserving a record of the attempted solicitation.

How This Dataset Differs From Similar Datasets or Filings

PREM14A sits in a tight cluster of M&A disclosure documents. The most useful comparisons are with (1) other stages of the same proxy, (2) parallel non-vote or going-private regimes, (3) alternative deal-structure forms, and (4) structured extractor datasets covering the same filings.

DEFM14A — Definitive Merger Proxy

Same transaction, same disclosure scope, later stage. PREM14A is the pre-review draft, often the first public document containing the full deal narrative; DEFM14A is the post-comment, cleared version mailed to shareholders. PREM14A may include placeholders, blanks, and language that later changes. Use PREM14A to study first-disclosed terms and comment-letter-driven revisions; use DEFM14A for the version of record shareholders actually voted on.

PRER14A — Revised Preliminary Proxy

Same regime and same deal as the originating PREM14A, but reflects changes made during SEC staff review before going definitive. PREM14A captures the starting point; PRER14A captures the redlined intermediate states (added risk factors, revised fairness language, updated projections). Required if the analytic question is how disclosure evolves through staff review.

PRE 14A — Generic Preliminary Proxy

Same procedural posture (preliminary, pre-definitive) but for non-M&A matters: director elections, charter amendments, equity plans, say-on-pay. Lacks fairness opinions, transaction backgrounds, and merger-consideration mechanics. PREM14A is the M&A-specific cut; PRE 14A is the residual category. Mixing the two skews any merger-activity count.

DEFA14A — Additional Definitive Material

Supplementary solicitation material filed after the definitive proxy: investor decks, talking points, press releases, shareholder letters, proxy-advisor responses. Not standalone, no full transaction description. Important in contested or scrutinized deals as the campaign layer around the vote, but does not substitute for PREM14A's foundational disclosure.

DEF 14A — Annual Proxy

Annual-meeting proxy covering director elections, auditor ratification, and executive compensation. Shares Regulation 14A heritage with PREM14A but the disclosure regime (Item 402 tables, audit-committee reports, ownership tables) is orthogonal to merger content. Combining DEF 14A and PREM14A without filtering produces a corpus dominated by governance boilerplate rather than transactions.

PREM14C / DEFM14C — Merger Information Statements

14C mirrors 14A when shareholder action is taken by written consent of a controlling holder rather than by solicited vote. Disclosure overlaps heavily (terms, background, fairness opinions) but there is no proxy card and no vote being solicited. Controlled-company deals and sponsor-majority transactions typically appear in 14C. A PREM14A-only sample systematically excludes these.

Schedule 13E-3 — Going-Private Transactions

Required under Rule 13e-3 when an issuer or its affiliates take the company private. In going-private mergers, 13E-3 is filed alongside (often combined with) PREM14A and adds affiliate-specific disclosures: purposes, alternatives considered, fairness determinations by each filer, and full advisor reports. PREM14A frames the proxy solicitation; 13E-3 frames affiliate fairness. Management buyouts, sponsor take-privates, and squeeze-outs require both.

Form S-4 — Stock-Deal Registration Statement

Registers securities issued in business combinations. In stock or mixed-consideration deals, the acquirer files S-4 and the target may file PREM14A; often the S-4 doubles as a joint proxy/prospectus. S-4 carries Securities Act content (issuer description, pro forma financials, tax treatment) absent from PREM14A. All-cash deals usually have PREM14A but no S-4. A PREM14A-only universe under-represents stock deals where the primary disclosure lives in the acquirer's S-4.

Schedule 14D-9 — Target Tender-Offer Recommendation

Filed by the target in response to a third-party tender offer: board recommendation, negotiation background, fairness opinion, conflicts. Thematic overlap with PREM14A (background and advisor sections) but applies to tender-offer structures rather than vote-driven mergers. One-step merger votes produce PREM14A without Schedule 14D-9; two-step deals produce 14D-9 first and PREM14A/DEFM14A for the back-end merger.

Schedule TO — Tender Offer Statement

Bidder-side counterpart to 14D-9: offer terms, financing, conditions. Largely mutually exclusive with PREM14A on a per-deal basis — a transaction is either vote-driven (PREM14A) or tender-driven (Schedule TO + 14D-9). Two-step deals are the only routine overlap.

Form 8-K Item 1.01 — Merger Agreement Announcement

First-day, event-driven announcement of entry into the merger agreement, typically attaching the agreement as Exhibit 2.1 and a press release as Exhibit 99.1. Terse and contractual, not narrative. PREM14A arrives weeks later with the deliberative record (background, alternatives, advisor analyses, projections) that 8-K Item 1.01 omits. Use Form 8-K for deal-announcement timing and merger-agreement text; use PREM14A for the surrounding rationale and process.

As-filed files vs. structured PREM14A extractors

This dataset contains the original EDGAR submissions — preliminary proxy, exhibits, attachments, metadata — in native HTML, TXT, PDF, and JSON metadata, excluding images. Layout, exhibit ordering, and full as-filed text are preserved.

A structured PREM14A extractor (where available) parses specific fields — parties, consideration, exchange ratios, vote thresholds, advisor names, fairness-opinion summaries, termination fees — into JSON or tables. Efficient for cross-deal statistics, but discards surrounding narrative, exhibit text, and anything outside the extractor's schema.

Choose the files dataset for full-text search, exhibit-level access (merger agreements, opinion letters, voting agreements), training-data construction, or fidelity to as-filed wording. Choose a structured extractor for deal-level quantitative analysis when its schema covers the needed fields.

Boundary summary

PREM14A files capture, in original form, the earliest comprehensive public disclosure of a vote-driven merger: narrative-heavy, exhibit-rich, filed before SEC review and before definitive mailing. It is narrower than DEF 14A and PRE 14A (M&A-only), earlier and rougher than DEFM14A, distinct from PRER14A (initial vs. revised), vote-centric where PREM14C is consent-based, proxy-framed where Schedule 13E-3 is affiliate-fairness-framed, target-side where S-4 is acquirer-side registration, merger-structured where Schedule TO and 14D-9 are tender-structured, and deliberative where 8-K Item 1.01 is announcement-only. No adjacent form reproduces the full pre-clearance proxy submission as it appeared on EDGAR.

Who Uses This Dataset

PREM14A is the earliest complete public record of a negotiated change-of-control deal: merger mechanics, background of negotiations, fairness opinions, financing, voting agreements, and regulatory conditions. The following users each read different sections for different outputs, and most prefer the as-filed preliminary version to the post-comment DEFM14A.

Event-driven hedge fund analysts

Underwrite long or short positions on announced deals. They pull consideration mix, exchange ratios and collars, breakup fees, no-shop and fiduciary-out language, support agreements, financing commitments, and the fairness opinion summary in "Reasons for the Merger." Output: a deal memo and position sizing. Waiting for the DEFM14A forfeits weeks of trading edge.

Risk arbitrage traders

Price the spread. Focus narrowly on MAC carve-outs, regulatory conditions (HSR, foreign investment, sector clearances), termination dates and walk-away rights, reverse termination fees, specific performance, and the meeting timeline. Output: spread targets and break-risk leg sizing. The as-filed PREM14A is operative because staff comments rarely change contractual terms.

Corporate development and strategy teams

Build precedent libraries to inform their own bids and board decks. Harvest deal structure (one-step, two-step, reverse triangular), premium-to-unaffected, option and RSU treatment, retention pools, escrow terms, and the negotiation chronology in "Background of the Merger." Output: precedent tables and process playbooks for go-shops, topping bids, and standstill waivers.

M&A investment bankers and advisors

Benchmark fairness opinions and feed comparable-transaction databases. Extract advisor identity, fee structure (announcement, success, opinion), prior-relationship disclosures, full methodology descriptions (selected companies, precedent transactions, DCF, LBO, premiums paid), management projections, and implied valuation ranges. Output: pitch comps and league-table tracking. PREM14A captures the original methodology before any cosmetic edits prompted by staff or plaintiff pressure.

Securities and M&A lawyers

Transactional counsel mine PREM14A for drafting precedent: background wording, fairness opinion descriptions, advisor and management conflicts, Item 402(t) golden parachute tables, appraisal rights notices, antitrust risk factors, and the forms of merger and voting agreements attached as exhibits. Litigation counsel diff PREM14A against DEFM14A to infer staff comments and supplemental disclosures added under settlement pressure. Output: drafting templates and disclosure-defense memos.

Plaintiffs' deal litigators

Use PREM14A as the trigger document for disclosure-adequacy claims. Scrutinize background chronology for omitted bidders, projections for missing line items, opinion methodology for unsupported assumptions, and banker and management conflicts. Output: demand letters, complaints, and mootness-fee negotiations. The PREM14A's filing date sets the litigation calendar.

Proxy advisors and solicitors

Advisors build vote recommendations on the merger and the say-on-parachute vote, reading strategic rationale, valuation analyses, the opinion summary, post-closing governance, equity treatment, and Item 402(t) tables. Solicitors model turnout, map institutional and retail blocks, and design outreach. Output: recommendation reports and solicitation playbooks ready when the DEFM14A mails.

Governance researchers and academics

Use the PREM14A series back to 1994 for longitudinal studies of deal-protection devices, premium and control trends, opinion practices, parachute escalation, and disclosure depth across legal regimes. Output: panel datasets and published research. The preliminary version is preferred where the question concerns initial disclosure choices, not post-comment revisions.

Sell-side compliance and conflicts teams

Monitor how their own and peer firms' fairness opinions and conflict disclosures appear in public filings. Reconcile disclosed fees, prior engagements, and trading relationships against internal records and calibrate the firm's own conflicts language. PREM14A-versus-DEFM14A diffs flag drafting practices likely to draw staff attention.

Data vendors and quantitative researchers

Parse filings to populate structured M&A databases: registrant and target identifiers, deal value, consideration type, premiums to one-day and four-week unaffected, advisor names and fees, termination fees and triggers, expected close, and vote thresholds. Output: league tables, completion-probability models, premium analytics, and arbitrage spread feeds. PREM14A is the earliest filing carrying the full structured field set, cutting latency versus DEFM14A or press releases.

Journalists, ESG and activist researchers

Financial reporters use PREM14A as the primary source for deal-mechanics coverage, drawing on background, opinion summary, and parachute tables. Corporate-accountability researchers track change-of-control payouts and workforce and pension treatment. Activist research desks scan the disclosed process, conflicts, and premium for grounds to launch a campaign or topping bid.

PREM14A is the operative document set for negotiated public-company M&A. Arbitrage and event-driven desks act on it in real time; bankers, lawyers, proxy advisors, and compliance use it for benchmarking and drafting; plaintiffs, journalists, academics, and accountability researchers use it for monitoring and longitudinal study. The PREM14A-versus-DEFM14A delta is itself a research signal.

Specific Use Cases

PREM14A files support a small set of operational workflows where the value comes from specific sections of the preliminary proxy. The use cases below name the audience, the records or fields used, the workflow, and the output.

Merger-arbitrage spread monitor

Risk-arb desks parse the "Conditions to the Merger," "Termination," and "Regulatory Approvals" sections of the primary <TYPE>PREM14A</TYPE> HTML to extract MAC carve-outs, HSR and foreign-investment conditions, outside dates, reverse termination fees, and specific-performance language. Combined with the announced consideration and exchange ratio from the summary term sheet, these feed a deal-by-deal spread model that prices break risk and timing risk against the live target quote. Output: a spread sheet with break-probability and expected-close-date inputs for position sizing.

Precedent-transaction database for premium and deal-protection analysis

Corporate development teams and M&A bankers extract a fixed field set from each record — parties, deal value, consideration mix, exchange ratio and collar, premium to unaffected, no-shop and fiduciary-out terms, matching rights, breakup fee and reverse breakup fee, go-shop window — by parsing the merger-agreement summary and the merger-agreement annex (inlined or filed as EX-2.x). Indexed by SIC, deal size, and year, the resulting table backs precedent decks, board materials, and league-table comps.

Fairness-opinion methodology mining

M&A advisors, expert witnesses, and academic researchers harvest the "Opinion of the Financial Advisor" section together with the fairness-opinion exhibit (commonly EX-99.x) to capture advisor identity, fee structure, prior-relationship disclosures, methodologies used (selected companies, precedent transactions, DCF, LBO, premiums paid), implied valuation ranges, and the management projections shared with the advisor. Output: a methodology corpus used for opinion benchmarking, expert-report support, and longitudinal studies of valuation practice.

Disclosure-litigation triggers and supplemental-disclosure tracking

Plaintiffs' deal-litigation firms ingest each new PREM14A on its filedAt timestamp and scan "Background of the Merger," the projections section, and the conflicts disclosures for omitted bidders, missing forecast line items, advisor fee tail, and undisclosed management discussions. The same workflow later diffs the PREM14A against the corresponding DEFM14A (joined by registrant CIK and periodOfReport) to identify supplemental disclosures added under settlement pressure. Output: demand letters, complaint drafts, and mootness-fee positions calendared off the preliminary filing date.

Training corpus for M&A-domain NLP

ML teams use the as-filed HTML bodies — after stripping the SGML <DOCUMENT> envelope — as labeled training data for clause classification (no-shop, fiduciary out, MAC, breakup-fee tiers), risk-factor tagging, and section segmentation. The consistent Schedule 14A and Regulation M-A item structure provides weak labels at scale, while the heterogeneity of filer-counsel drafting habits supplies the variation needed for robust models. Output: clause extractors, deal-term taggers, and summarization models that generalize across DEFM14A, PRER14A, and 13E-3.

Golden-parachute and change-of-control compensation tracking

Governance researchers and proxy advisors pull the Item 402(t) golden-parachute table and the related say-on-parachute proposal text from each PREM14A to capture per-executive cash, equity acceleration, perquisite, and tax-gross-up amounts together with single- versus double-trigger structure. Joined to the registrant cik and sic, the panel supports parachute-escalation studies, vote-recommendation models, and corporate-accountability reporting on change-of-control payouts.

Filing-fee and transaction-valuation panel

Quantitative researchers parse the EX-FILING FEES Exhibit 107 document (Tables 1–3) across the 2022-present coverage to extract the proposed maximum aggregate transaction value, fee-calculation rule, Section 6(b) fee rate, and any Rule 0-11(a)(2) offsets. Because Exhibit 107 is structured and small, this yields a clean deal-value series at filing time — earlier than DEFM14A and independent of press-release headline numbers — usable for activity indices, fee-revenue models, and validation of third-party deal databases.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-prem14a-files.json

This endpoint returns dataset metadata and a list of all available container files. The metadata includes the dataset name, description, last updated timestamp, earliest sample date, total record count, total dataset size, covered form types (PREM14A), container format (ZIP), and content file types (TXT, JSON, HTML, PDF). It also includes the full dataset download URL and an array of containers with per-container key, size, records, updatedAt, and downloadUrl fields. Poll this endpoint to detect which containers were modified in the most recent refresh and download only those updated containers on a daily basis. This endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-694d-bc5c-7f43f28984b8",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-prem14a-files.zip",
4 "name": "Form PREM14A Files Dataset",
5 "description": "Form PREM14A filings contain preliminary proxy statements relating to mergers or acquisitions, filed pursuant to Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A.",
6 "updatedAt": "2026-05-05T02:49:10.523Z",
7 "earliestSampleDate": "1994-01-01",
8 "totalRecords": 6493,
9 "totalSize": 1228626276,
10 "formTypes": ["PREM14A"],
11 "containerFormat": "ZIP",
12 "fileTypes": ["TXT", "JSON", "HTML", "PDF"],
13 "containers": [
14 {
15 "downloadUrl": "https://api.sec-api.io/datasets/form-prem14a-files/2026/2026-03.zip",
16 "key": "2026/2026-03.zip",
17 "size": 13818783,
18 "records": 24,
19 "updatedAt": "2026-05-05T02:49:10.523Z"
20 }
21 ]
22 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-prem14a-files.zip?token=YOUR_API_KEY

Downloads the full dataset as a single ZIP archive containing all monthly containers from the earliest sample date (1994-01-01) onward. This endpoint requires an API key passed via the token query parameter.

Download Single Container: https://api.sec-api.io/datasets/form-prem14a-files/2026/2026-03.zip?token=YOUR_API_KEY

Downloads one monthly container ZIP instead of the full dataset. Replace the year and month segments with the key of any container returned by the dataset index API. This endpoint requires an API key passed via the token query parameter.

Frequently Asked Questions

What form does the Form PREM14A Files Dataset cover?

The dataset covers PREM14A filings — preliminary proxy statements relating to a merger, acquisition, sale of substantially all assets, or similar extraordinary corporate transaction, filed under Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A. Subsequent amendments (PREM14A/A, PRER14A) and the definitive proxy (DEFM14A) are distinct EDGAR submissions and are not included.

What does one record in this dataset represent?

One record is a single PREM14A submission on EDGAR, identified by exactly one accession number. The record is a folder containing a metadata.json manifest and all documents from the original submission — typically the primary preliminary proxy statement (<TYPE>PREM14A</TYPE>), the Exhibit 107 filing-fees document (<TYPE>EX-FILING FEES</TYPE>), and any additional exhibits such as merger agreements, fairness opinions, and voting agreements.

Who is required to file a PREM14A?

The filer is a registrant subject to the proxy rules under Section 14(a) — typically a domestic public company with equity registered under Section 12 — that is soliciting shareholder votes on a business combination. Common profiles include the public target in a cash merger, an issuer in a Rule 13e-3 going-private transaction, and any registrant whose shareholders must vote on a sale of substantially all assets, dissolution, or similar fundamental change. Foreign private issuers are exempt from Section 14(a) and do not file PREM14A.

When is a PREM14A required to be filed?

Rule 14a-6(a) requires the preliminary copy of the proxy and form of proxy to be filed at least 10 calendar days before definitive copies are first sent to security holders. There is no statutory deadline tied to merger-agreement signing; the timing is driven by the merger agreement's "as promptly as reasonably practicable" covenant and the deal's outside date.

What time period does the dataset cover, and how is it packaged?

Coverage runs from 1994 to the present. Records are grouped into monthly ZIP containers keyed by <YYYY>/<YYYY>-MM.zip. Inside a monthly ZIP, every record occupies its own folder named with the 18-digit zero-padded accession number. Content file types are TXT, JSON, HTML, and PDF.

How does this dataset differ from the DEFM14A definitive merger proxy?

PREM14A is the pre-review draft — often the first public document containing the full deal narrative — filed before SEC staff review and before mailing. DEFM14A is the post-comment, cleared version actually mailed to shareholders. PREM14A may include placeholders or language that later changes; the PREM14A-versus-DEFM14A delta is itself a useful research signal for staff comments and supplemental disclosures.

Are image files and incorporated content included?

No. All .jpg, .gif, and .png graphics are stripped from the ZIP, although their entries remain in documentFormatFiles as GRAPHIC records. Content incorporated by reference from the registrant's 10-K, 10-Q, or 8-K filings is also not reproduced inside the record. The complete-submission .txt is referenced via linkToTxt but not re-shipped inside the folder.