Form PRE 14A Files Dataset

The Form PRE 14A Files Dataset is a record-by-record archive of preliminary proxy statements filed with the SEC under Regulation 14A of the Securities Exchange Act of 1934. One record corresponds to a single EDGAR PRE 14A submission — identified by its 18-digit accession number and materialized on disk as one folder containing a metadata.json filing-level descriptor and the original EDGAR document set with image files removed. The form is filed by registrants soliciting proxies on non-routine matters such as mergers, going-private transactions, charter amendments, equity-plan adoptions, and contested director elections, at least ten calendar days before the corresponding DEF 14A is sent to security holders. Coverage spans the full EDGAR era from January 1, 1994 to the present, and the dataset is distributed in ZIP containers carrying TXT, JSON, HTML, and PDF file types.

Update Frequency
Daily
Updated at
2026-05-09
Earliest Sample Date
1994-01-01
Total Size
3.1 GB
Total Records
50,493
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF
Form Types
PRE 14A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

389 files · 3.1 GB
Download All
2026-05.zip4.6 MB68 records
2026-04.zip25.7 MB475 records
2026-03.zip58.0 MB233 records
2026-02.zip6.9 MB73 records
2026-01.zip4.2 MB63 records
2025-12.zip7.1 MB83 records
2025-11.zip5.7 MB88 records
2025-10.zip7.5 MB118 records
2025-09.zip8.3 MB103 records
2025-08.zip9.0 MB113 records
2025-07.zip7.8 MB104 records
2025-06.zip5.7 MB85 records
2025-05.zip8.3 MB108 records
2025-04.zip33.6 MB331 records
2025-03.zip96.5 MB274 records
2025-02.zip34.5 MB96 records
2025-01.zip5.9 MB75 records
2024-12.zip5.5 MB79 records
2024-11.zip6.8 MB103 records
2024-10.zip6.7 MB107 records
2024-09.zip6.7 MB91 records
2024-08.zip6.6 MB101 records
2024-07.zip7.3 MB99 records
2024-06.zip6.9 MB94 records
2024-05.zip8.0 MB94 records
2024-04.zip63.4 MB365 records
2024-03.zip55.3 MB308 records
2024-02.zip12.4 MB102 records
2024-01.zip7.4 MB87 records
2023-12.zip8.1 MB97 records
2023-11.zip9.4 MB121 records
2023-10.zip7.5 MB114 records
2023-09.zip6.6 MB102 records
2023-08.zip7.3 MB121 records
2023-07.zip9.3 MB119 records
2023-06.zip10.9 MB134 records
2023-05.zip9.8 MB137 records
2023-04.zip38.0 MB384 records
2023-03.zip38.2 MB339 records
2023-02.zip8.3 MB108 records
2023-01.zip7.8 MB102 records
2022-12.zip6.1 MB108 records
2022-11.zip10.9 MB172 records
2022-10.zip8.4 MB127 records
2022-09.zip5.5 MB92 records
2022-08.zip6.4 MB103 records
2022-07.zip4.8 MB77 records
2022-06.zip4.8 MB74 records
2022-05.zip5.5 MB79 records
2022-04.zip24.3 MB269 records
2022-03.zip28.9 MB209 records
2022-02.zip4.0 MB43 records
2022-01.zip1.9 MB43 records
2021-12.zip2.9 MB47 records
2021-11.zip2.9 MB43 records
2021-10.zip4.5 MB60 records
2021-09.zip3.9 MB53 records
2021-08.zip3.5 MB65 records
2021-07.zip3.1 MB52 records
2021-06.zip9.2 MB78 records
2021-05.zip5.6 MB74 records
2021-04.zip22.2 MB229 records
2021-03.zip25.5 MB230 records
2021-02.zip5.6 MB66 records
2021-01.zip3.1 MB60 records
2020-12.zip4.4 MB62 records
2020-11.zip3.6 MB45 records
2020-10.zip7.1 MB101 records
2020-09.zip3.7 MB71 records
2020-08.zip3.7 MB72 records
2020-07.zip5.1 MB86 records
2020-06.zip5.8 MB84 records
2020-05.zip5.6 MB92 records
2020-04.zip23.1 MB230 records
2020-03.zip26.7 MB217 records
2020-02.zip18.2 MB97 records
2020-01.zip2.9 MB55 records
2019-12.zip3.7 MB61 records
2019-11.zip2.9 MB55 records
2019-10.zip4.6 MB79 records
2019-09.zip3.6 MB61 records
2019-08.zip2.5 MB42 records
2019-07.zip3.6 MB61 records
2019-06.zip3.6 MB58 records
2019-05.zip4.2 MB67 records
2019-04.zip17.5 MB204 records
2019-03.zip21.7 MB193 records
2019-02.zip6.2 MB89 records
2019-01.zip1.7 MB33 records
2018-12.zip4.5 MB48 records
2018-11.zip2.1 MB43 records
2018-10.zip3.1 MB54 records
2018-09.zip3.5 MB64 records
2018-08.zip12.0 MB129 records
2018-07.zip3.8 MB52 records
2018-06.zip4.3 MB69 records
2018-05.zip4.7 MB71 records
2018-04.zip20.5 MB201 records
2018-03.zip25.8 MB269 records
2018-02.zip5.0 MB74 records
2018-01.zip2.6 MB54 records
2017-12.zip3.5 MB59 records
2017-11.zip3.4 MB50 records
2017-10.zip4.4 MB77 records
2017-09.zip3.5 MB69 records
2017-08.zip3.4 MB58 records
2017-07.zip4.2 MB86 records
2017-06.zip4.1 MB70 records
2017-05.zip5.7 MB89 records
2017-04.zip17.6 MB187 records
2017-03.zip29.8 MB278 records
2017-02.zip9.5 MB85 records
2017-01.zip2.1 MB46 records
2016-12.zip4.3 MB78 records
2016-11.zip2.3 MB48 records
2016-10.zip6.4 MB81 records
2016-09.zip5.3 MB89 records
2016-08.zip4.7 MB79 records
2016-07.zip4.3 MB63 records
2016-06.zip5.2 MB69 records
2016-05.zip4.9 MB80 records
2016-04.zip24.5 MB247 records
2016-03.zip30.0 MB284 records
2016-02.zip6.6 MB99 records
2016-01.zip2.5 MB55 records
2015-12.zip4.5 MB84 records
2015-11.zip2.3 MB44 records
2015-10.zip6.2 MB75 records
2015-09.zip4.4 MB82 records
2015-08.zip4.0 MB77 records
2015-07.zip4.9 MB125 records
2015-06.zip3.9 MB78 records
2015-05.zip5.2 MB78 records
2015-04.zip22.9 MB194 records
2015-03.zip27.1 MB259 records
2015-02.zip8.0 MB115 records
2015-01.zip3.2 MB64 records
2014-12.zip6.2 MB82 records
2014-11.zip3.5 MB63 records
2014-10.zip4.3 MB76 records
2014-09.zip4.7 MB71 records
2014-08.zip3.9 MB74 records
2014-07.zip3.8 MB70 records
2014-06.zip5.5 MB68 records
2014-05.zip6.5 MB96 records
2014-04.zip15.9 MB210 records
2014-03.zip20.8 MB245 records
2014-02.zip7.8 MB114 records
2014-01.zip8.8 MB68 records
2013-12.zip5.3 MB94 records
2013-11.zip3.8 MB62 records
2013-10.zip4.2 MB79 records
2013-09.zip4.2 MB71 records
2013-08.zip3.9 MB71 records
2013-07.zip3.5 MB66 records
2013-06.zip3.7 MB66 records
2013-05.zip5.4 MB78 records
2013-04.zip16.0 MB224 records
2013-03.zip24.6 MB271 records
2013-02.zip5.7 MB85 records
2013-01.zip4.4 MB85 records
2012-12.zip4.0 MB71 records
2012-11.zip3.3 MB60 records
2012-10.zip7.3 MB76 records
2012-09.zip4.4 MB85 records
2012-08.zip4.0 MB61 records
2012-07.zip3.2 MB61 records
2012-06.zip5.2 MB77 records
2012-05.zip4.1 MB69 records
2012-04.zip13.8 MB199 records
2012-03.zip23.5 MB269 records
2012-02.zip7.3 MB124 records
2012-01.zip4.6 MB50 records
2011-12.zip3.1 MB58 records
2011-11.zip2.5 MB50 records
2011-10.zip8.3 MB157 records
2011-09.zip4.8 MB99 records
2011-08.zip4.7 MB92 records
2011-07.zip3.4 MB70 records
2011-06.zip8.9 MB159 records
2011-05.zip4.4 MB79 records
2011-04.zip14.1 MB205 records
2011-03.zip24.8 MB287 records
2011-02.zip8.0 MB104 records
2011-01.zip2.9 MB63 records
2010-12.zip4.5 MB83 records
2010-11.zip3.6 MB69 records
2010-10.zip12.9 MB165 records
2010-09.zip5.0 MB112 records
2010-08.zip5.6 MB120 records
2010-07.zip4.7 MB73 records
2010-06.zip5.5 MB110 records
2010-05.zip9.8 MB139 records
2010-04.zip17.1 MB230 records
2010-03.zip29.8 MB380 records
2010-02.zip10.2 MB151 records
2010-01.zip4.5 MB88 records
2009-12.zip7.0 MB98 records
2009-11.zip4.8 MB107 records
2009-10.zip5.5 MB120 records
2009-09.zip12.4 MB189 records
2009-08.zip6.1 MB99 records
2009-07.zip13.8 MB208 records
2009-06.zip8.4 MB137 records
2009-05.zip8.5 MB128 records
2009-04.zip20.0 MB333 records
2009-03.zip32.6 MB484 records
2009-02.zip7.7 MB117 records
2009-01.zip5.8 MB123 records
2008-12.zip5.4 MB119 records
2008-11.zip6.3 MB189 records
2008-10.zip8.5 MB139 records
2008-09.zip12.4 MB174 records
2008-08.zip4.8 MB86 records
2008-07.zip7.5 MB134 records
2008-06.zip6.3 MB118 records
2008-05.zip6.9 MB130 records
2008-04.zip19.3 MB338 records
2008-03.zip18.7 MB304 records
2008-02.zip13.2 MB176 records
2008-01.zip4.5 MB89 records
2007-12.zip7.9 MB110 records
2007-11.zip6.6 MB101 records
2007-10.zip5.8 MB106 records
2007-09.zip4.4 MB89 records
2007-08.zip18.6 MB246 records
2007-07.zip5.0 MB99 records
2007-06.zip7.5 MB129 records
2007-05.zip6.3 MB118 records
2007-04.zip14.4 MB236 records
2007-03.zip20.2 MB307 records
2007-02.zip7.1 MB126 records
2007-01.zip4.7 MB123 records
2006-12.zip5.2 MB149 records
2006-11.zip3.9 MB100 records
2006-10.zip4.9 MB117 records
2006-09.zip5.5 MB106 records
2006-08.zip7.4 MB123 records
2006-07.zip4.5 MB95 records
2006-06.zip5.1 MB114 records
2006-05.zip11.6 MB160 records
2006-04.zip13.5 MB264 records
2006-03.zip17.6 MB319 records
2006-02.zip6.4 MB187 records
2006-01.zip4.1 MB86 records
2005-12.zip5.5 MB114 records
2005-11.zip5.3 MB85 records
2005-10.zip6.4 MB114 records
2005-09.zip6.4 MB143 records
2005-08.zip7.7 MB153 records
2005-07.zip6.4 MB138 records
2005-06.zip6.7 MB126 records
2005-05.zip9.6 MB243 records
2005-04.zip12.0 MB265 records
2005-03.zip16.9 MB316 records
2005-02.zip7.4 MB146 records
2005-01.zip3.9 MB98 records
2004-12.zip3.8 MB94 records
2004-11.zip5.0 MB110 records
2004-10.zip5.1 MB110 records
2004-09.zip5.3 MB120 records
2004-08.zip3.8 MB100 records
2004-07.zip4.7 MB103 records
2004-06.zip5.6 MB199 records
2004-05.zip6.0 MB128 records
2004-04.zip14.5 MB301 records
2004-03.zip13.4 MB297 records
2004-02.zip6.0 MB153 records
2004-01.zip3.9 MB92 records
2003-12.zip7.0 MB132 records
2003-11.zip3.0 MB90 records
2003-10.zip6.0 MB156 records
2003-09.zip3.8 MB97 records
2003-08.zip5.0 MB111 records
2003-07.zip4.6 MB118 records
2003-06.zip5.2 MB126 records
2003-05.zip7.0 MB154 records
2003-04.zip11.0 MB255 records
2003-03.zip12.5 MB287 records
2003-02.zip5.7 MB131 records
2003-01.zip4.4 MB118 records
2002-12.zip3.3 MB84 records
2002-11.zip4.1 MB114 records
2002-10.zip3.9 MB120 records
2002-09.zip4.0 MB112 records
2002-08.zip4.6 MB119 records
2002-07.zip5.4 MB147 records
2002-06.zip5.3 MB143 records
2002-05.zip6.2 MB185 records
2002-04.zip12.1 MB314 records
2002-03.zip11.2 MB281 records
2002-02.zip4.4 MB118 records
2002-01.zip1.8 MB59 records
2001-12.zip1.8 MB56 records
2001-11.zip2.4 MB76 records
2001-10.zip2.7 MB95 records
2001-09.zip2.9 MB89 records
2001-08.zip4.5 MB125 records
2001-07.zip4.3 MB120 records
2001-06.zip3.9 MB115 records
2001-05.zip5.7 MB167 records
2001-04.zip12.0 MB314 records
2001-03.zip9.8 MB277 records
2001-02.zip4.3 MB126 records
2001-01.zip6.1 MB100 records
2000-12.zip1.9 MB64 records
2000-11.zip3.3 MB95 records
2000-10.zip3.4 MB106 records
2000-09.zip6.2 MB111 records
2000-08.zip4.1 MB124 records
2000-07.zip4.3 MB130 records
2000-06.zip4.9 MB146 records
2000-05.zip7.3 MB208 records
2000-04.zip12.1 MB340 records
2000-03.zip12.4 MB406 records
2000-02.zip5.8 MB168 records
2000-01.zip2.4 MB69 records
1999-12.zip3.6 MB108 records
1999-11.zip2.2 MB70 records
1999-10.zip4.5 MB117 records
1999-09.zip3.8 MB137 records
1999-08.zip3.0 MB107 records
1999-07.zip4.2 MB118 records
1999-06.zip2.9 MB93 records
1999-05.zip5.1 MB180 records
1999-04.zip11.3 MB359 records
1999-03.zip12.5 MB418 records
1999-02.zip5.5 MB166 records
1999-01.zip2.9 MB98 records
1998-12.zip4.1 MB112 records
1998-11.zip3.1 MB92 records
1998-10.zip4.2 MB128 records
1998-09.zip3.8 MB102 records
1998-08.zip2.8 MB99 records
1998-07.zip4.3 MB143 records
1998-06.zip3.7 MB116 records
1998-05.zip6.5 MB202 records
1998-04.zip12.7 MB399 records
1998-03.zip12.8 MB462 records
1998-02.zip6.4 MB228 records
1998-01.zip2.3 MB88 records
1997-12.zip3.0 MB94 records
1997-11.zip3.2 MB96 records
1997-10.zip4.6 MB154 records
1997-09.zip4.0 MB129 records
1997-08.zip4.1 MB130 records
1997-07.zip2.9 MB98 records
1997-06.zip2.9 MB112 records
1997-05.zip4.1 MB139 records
1997-04.zip10.7 MB366 records
1997-03.zip12.8 MB446 records
1997-02.zip7.6 MB244 records
1997-01.zip2.6 MB117 records
1996-12.zip2.6 MB99 records
1996-11.zip2.7 MB88 records
1996-10.zip3.6 MB173 records
1996-09.zip4.0 MB138 records
1996-08.zip3.8 MB155 records
1996-07.zip4.2 MB148 records
1996-06.zip4.7 MB141 records
1996-05.zip5.0 MB173 records
1996-04.zip5.5 MB180 records
1996-03.zip8.1 MB286 records
1996-02.zip4.4 MB170 records
1996-01.zip2.3 MB71 records
1995-12.zip2.0 MB79 records
1995-11.zip1.7 MB48 records
1995-10.zip1.3 MB45 records
1995-09.zip2.1 MB63 records
1995-08.zip1.2 MB45 records
1995-07.zip1.0 MB42 records
1995-06.zip3.0 MB85 records
1995-05.zip1.8 MB50 records
1995-04.zip2.4 MB85 records
1995-03.zip4.8 MB172 records
1995-02.zip2.6 MB85 records
1995-01.zip453.1 KB14 records
1994-12.zip573.2 KB16 records
1994-11.zip450.4 KB14 records
1994-10.zip401.5 KB13 records
1994-09.zip673.7 KB24 records
1994-08.zip2.0 MB48 records
1994-07.zip597.2 KB16 records
1994-06.zip384.7 KB10 records
1994-05.zip6.2 MB110 records
1994-04.zip3.0 MB70 records
1994-03.zip4.1 MB139 records
1994-02.zip2.6 MB88 records
1994-01.zip419.9 KB19 records

What This Dataset Contains

The dataset packages every PRE 14A submission accepted by EDGAR into a per-accession folder. Form PRE 14A is the preliminary proxy statement filed under Regulation 14A. Rule 14a-6(a) requires the registrant to lodge a preliminary copy of the proxy materials with the Commission at least ten calendar days before the definitive materials are released to security holders, giving the staff the option to review and comment before the solicitation reaches investors. PRE 14A is reserved for non-routine matters — mergers and acquisitions, going-private transactions, charter amendments, equity-plan adoptions, contested director elections, reverse stock splits, and similar items — that fall outside the safe harbor in Rule 14a-6(a) for proxies that solely concern routine annual-meeting business. Routine annual-meeting proxies are normally filed directly as DEF 14A.

The substantive content of a PRE 14A is governed by Schedule 14A (Rule 14a-101), which specifies the disclosures item by item. The filing is by construction a draft: the cover sheet and document text bear "PRELIMINARY COPY" and "PRELIMINARY PROXY STATEMENT — SUBJECT TO COMPLETION" markings, and the corresponding box on the Schedule 14A cover page is checked.

The dataset covers the entire population of bare-form PRE 14A filings on EDGAR from January 1994 forward. Amended preliminary proxies filed under the separate PRE 14A/A form code are not included. The on-disk artifacts are TXT (legacy ASCII proxies and the complete-submission text bundle for every filing), HTML (the dominant primary-document format throughout the modern era), JSON (the per-record metadata.json descriptor), and PDF (used occasionally by filers for ancillary exhibits). Image files referenced in the original submission are excluded.

Content Structure of a Single Record

What one record represents

One record in the form-pre-14a-files dataset is a single EDGAR PRE 14A submission, identified by its 18-digit accession number and materialized on disk as one folder named after that accession (dashes stripped). Every artifact that belongs to the submission — the filing-level metadata descriptor and the full set of documents from the original EDGAR package, with image files removed — sits inside that folder. The unit is the filing, not the proposal, the registrant, or the meeting: a single accession folder may carry one operating-company proxy statement or a joint preliminary proxy filed by dozens of fund trusts under one accession.

On-disk layout of a single record

Every accession folder in the dataset is composed of two layers:

  1. A metadata.json descriptor with a fixed name, encoding the filing-level header EDGAR records on the submission and on the filing-index page.
  2. The documents from the original EDGAR submission, in their native EDGAR filenames and formats. This always includes the primary preliminary proxy statement and may include separately filed exhibits.

Image files (the GRAPHIC documents enumerated on the EDGAR index — typically signature scans, photographs, charts, and logo art) are removed from the on-disk container, although they remain enumerated inside metadata.json so the original submission inventory is still recoverable. XBRL side files referenced in metadata.json are likewise often not materialized as separate files because, in modern PRE 14A filings, the XBRL is embedded inline into the primary document.

In practice, most records reduce to two files on disk — metadata.json plus one primary .htm — but the layout is structured to accommodate any number of additional document files when the filer attaches separate exhibits.

The metadata.json descriptor

metadata.json is a single JSON object that mirrors the EDGAR filing-level header plus the document inventory and entity roster from the EDGAR index page. Its principal keys are:

  • formType — the literal string PRE 14A.
  • accessionNo — the dashed 18-digit accession number; the same number with dashes stripped is the folder name.
  • description — the EDGAR short description, typically "Form PRE 14A - Other preliminary proxy statements".
  • filedAt — ISO-8601 timestamp of acceptance, with timezone offset (Eastern time on EDGAR).
  • periodOfReportYYYY-MM-DD, the meeting date or period of report stamped on the filing.
  • linkToFilingDetails — URL of the primary HTML proxy document on www.sec.gov/Archives/edgar/data/....
  • linkToTxt — URL of the EDGAR complete-submission text file (the legacy .txt bundle that wraps every document in the submission).
  • linkToHtml — URL of the EDGAR filing-index HTML page (...-index.htm).
  • linkToXbrl — URL of a standalone XBRL document if one exists; for PRE 14A this is typically empty because structured tagging is embedded inline.
  • id — opaque 32-character hex identifier used internally as a stable record key.
  • documentFormatFiles — array enumerating every file in the original EDGAR submission as listed on the index page. Each entry carries sequence ("1", "2", …; the trailing complete-submission txt entry uses a single space), description (free text such as "PRE 14A", "GRAPHIC", "EXHIBIT 99.1", "Complete submission text file"), type (the EDGAR document type code), documentUrl (canonical EDGAR URL), and size (byte count as a string). Sequence 1 is by convention the primary PRE 14A document; later sequences hold exhibits, graphics, XBRL artifacts, and the complete-submission txt.
  • dataFiles — array of structured data attachments associated with the filing. For operating-company filers subject to inline XBRL tagging this typically lists the schema (EX-101.SCH, *.xsd) and the extracted inline XBRL instance (*_htm.xml) plus related linkbases. For investment-company filers it is typically empty.
  • entities — array of one or more filer entity objects. Each carries cik, companyName (often suffixed with the EDGAR role string "(Filer)"), type (the form type for that entity), fileNo (the Exchange Act file number — 001- / 000- ranges for operating companies, 811- for registered investment companies), filmNo, act (Exchange Act, normally "34"), irsNo, stateOfIncorporation, fiscalYearEnd as a 4-character MMDD string, sic (SIC code with description, present for operating companies), and tickers (array of listed tickers, present for issuers with publicly traded equity). For joint fund-family proxies the array may hold many sibling entities, one per trust CIK, sharing the same accession number.
  • seriesAndClassesContractsInformation — array used by Investment Company Act registrants. Each element is a series block with the EDGAR series identifier (S000XXXXXX), the fund name, and a classesContracts list whose items carry the class identifier (C000XXXXXX), the class/share-class name (e.g., "Class A", "Institutional Class"), and an optional ticker. For operating-company filings the array is empty; for fund-family proxies it may enumerate dozens of series and hundreds of classes.

The primary preliminary proxy statement document

The primary file is one .htm document carrying the full Schedule 14A proxy statement marked as preliminary. Its filename is filer-chosen and not standardized — patterns include <ticker>-<yyyymmdd>.htm, <filer-prefix>_pre14a.htm, <filer-prefix>-pre14a_<slug>.htm, generic names such as formpre14a.htm or prelimproxy.htm, and filing-agent codes such as d<digits>dpre14a.htm (Donnelley) or tm<digits>d<n>_pre14a.htm (Toppan Merrill). The disk filename is mirrored verbatim from the basename of documentFormatFiles[0].documentUrl.

Two structural styles appear in modern PRE 14A primary documents:

  • SGML-wrapped HTML (legacy EDGAR document wrapper). The file opens with the EDGAR-internal <DOCUMENT> envelope:

    1 <DOCUMENT>
    2 <TYPE>PRE 14A
    3 <SEQUENCE>1
    4 <FILENAME>tm2519035d1_pre14a.htm
    5 <DESCRIPTION>PRE 14A
    6 <TEXT>
    7 <HTML>... full HTML body of the preliminary proxy statement ...</HTML>
    8 </TEXT>
    9 </DOCUMENT>

    The SGML tags carry the type, sequence number, filename, and description that EDGAR uses to identify the document inside the submission package; the actual proxy statement is the HTML inside <TEXT>...</TEXT>.

  • Inline XBRL XHTML (iXBRL). The file opens with an XML prologue and an <html> root carrying the inline-XBRL namespace declarations (ix, xbrli, xbrldi, dei, us-gaap, ecd, srt, …). The first <body> contains an <ix:header> block with hidden <ix:nonNumeric> facts (cover-page tags such as dei:DocumentType = "PRE 14A", dei:EntityCentralIndexKey, dei:AmendmentFlag) and a sequence of <xbrli:context> definitions used for the pay-versus-performance disclosures. The visible cover page and proxy statement follow in a normal <div> tree, with selected facts (registrant name, ticker, pay-versus-performance amounts) tagged via inline XBRL elements. iXBRL primaries are noticeably larger than their SGML-wrapped counterparts because of the embedded contexts and tagged facts.

Regardless of structural style, the visible body carries the same Schedule 14A content elements, in roughly the order below.

Schedule 14A cover page (Rule 14a-101 cover)

The first visible page is the Schedule 14A cover sheet. It carries the heading "UNITED STATES SECURITIES AND EXCHANGE COMMISSION" and the legend "SCHEDULE 14A INFORMATION / Proxy Statement Pursuant to Section 14(a) of the Securities Exchange Act of 1934". Below that sits the filing-class checkbox group:

The cover then identifies the registrant (and any other person filing the proxy statement, such as a separate soliciting party in a contested election) and resolves the Payment of Filing Fee section — historically a fee table with three options ("No fee required" / "Fee computed on table below" / "Fee paid previously with preliminary materials"), and since the 2022 amendments to Rule 14a-6(i) a simplified two-option block referencing the new fee-table exhibit (Exhibit 107) when a fee is required.

Notice of meeting and letter to shareholders

Following the cover, the document typically contains a chairman's or president's letter and a formal Notice of Annual or Special Meeting of Stockholders. The notice block states the date, time, location (or virtual-meeting URL/instructions), the record date for voting, and a bulleted list of proposals to be voted on.

Proxy statement body — Schedule 14A item-by-item disclosures

The proxy statement proper follows. Its sections map to the items of Schedule 14A. The set actually present in any given filing depends on the matters being voted on, but the typical order is:

  • Item 1 — Date, time, and place of meeting (also delivered in the notice block).
  • Item 2 — Revocability of proxy.
  • Item 3Dissenters' right of appraisal, where applicable (especially in M&A and going-private contexts).
  • Item 4 — Persons making the solicitation: identification of the registrant or contesting party, methods of solicitation, and estimated solicitation costs.
  • Item 5 — Interests of certain persons in matters to be acted upon (insider and 5%-plus-holder interests in the proposals).
  • Item 6 — Voting securities and principal holders thereof: record-date share count, beneficial ownership tables for directors, named executive officers, and 5% holders.
  • Item 7 — Directors and executive officers (when directors are being elected): biographical information, board composition, committee structure, governance disclosures, director independence, leadership structure, board diversity, and risk-oversight discussion.
  • Item 8 — Compensation of directors and executive officers: Compensation Discussion and Analysis, Summary Compensation Table, Grants of Plan-Based Awards, Outstanding Equity Awards at Fiscal Year-End, Option Exercises and Stock Vested, Pension Benefits, Nonqualified Deferred Compensation, Potential Payments Upon Termination or Change in Control, Pay Ratio disclosure, Pay-versus-Performance table, and director compensation tables.
  • Item 9 — Independent public accountants (auditor ratification): identity of the auditor, audit and non-audit fees, audit-committee pre-approval policies, audit-committee report.
  • Item 10 — Compensation plans (when an equity-incentive plan is being approved or amended): plan summary, share reserve, dilution analysis, burn-rate, summary of material plan terms, U.S. federal income-tax consequences, the plan filed as an exhibit, and the new-plan-benefits table.
  • Item 11 — Authorization or issuance of securities other than for exchange.
  • Item 12 — Modification or exchange of securities.
  • Item 13 — Financial and other information for transactions to which Item 14 applies, including incorporation by reference of audited financial statements when the form requires Form 10-K-equivalent disclosure.
  • Item 14 — Mergers, consolidations, acquisitions, and similar matters: a comprehensive M&A section covering background of the transaction, the board's reasons for approval, fairness opinions and the financial advisor's analyses, management projections, the merger agreement summary, conditions to closing, no-shop and termination-fee provisions, treatment of equity awards, voting and support agreements, regulatory approvals, appraisal rights, U.S. federal income-tax consequences, and accounting treatment.
  • Item 15 — Acquisition or disposition of property.
  • Item 16 — Restatement of accounts.
  • Item 17 — Action with respect to reports.
  • Item 18 — Matters not required to be submitted.
  • Item 19 — Amendment of charter, bylaws, or other documents.
  • Item 20 — Other proposed action (the residual category for items not covered above).
  • Item 21 — Voting procedures.
  • Item 22 — Information required in investment-company proxy statements (special set used by Investment Company Act registrants for fund-specific matters such as approval of advisory contracts, sub-advisory arrangements, fund mergers, distribution-plan amendments, and changes to fundamental investment policies).
  • Item 23 — Delivery of documents to security holders sharing an address (the householding disclosure).
  • Item 24 — Shareholder approval of executive compensation (the Say-on-Pay vote and the Say-on-Frequency vote required by Section 14A of the Exchange Act, added by Dodd-Frank).

Form of proxy / voting instruction card

The closing section contains the form of proxy card or voting-instruction card, presented as a sample ballot enumerating each proposal with the corresponding For/Against/Abstain or For All/Withhold All/For All Except options, the registrant's recommendation on each proposal, signature blocks, and instructions for returning the proxy or voting electronically.

Annexes

Filings that include transactional or governance items routinely append annexes after the proxy-statement body, embedded inside the same primary .htm rather than supplied as separate exhibit files. Common annexes include the merger agreement, the certificate-of-incorporation amendment, the equity-incentive plan document, the fairness opinion, fiscal-year audited financial statements (when incorporated rather than referenced), and reconciliations of any non-GAAP measures used in the proxy statement.

Exhibits and additional documents

When a filer chooses to submit exhibits as separate files rather than annexing them to the primary document, those exhibit files appear as additional sequenced entries in documentFormatFiles and as additional files on disk inside the accession folder. Common standalone exhibits include the form of proxy card filed separately, an EX-99 letter to shareholders or investor presentation, and — since the 2022 fee-disclosure amendments — Exhibit 107 (EX-FILING FEES) carrying the Rule 0-11 fee calculation when a filing fee is required.

The dataset includes every non-image document from the original submission. This embraces HTML documents, plain-text exhibits, and PDFs where filers used PDF for ancillary materials. It excludes GRAPHIC files (.jpg, .gif, .png) regardless of whether the graphics are signature pages, charts, photographs, or logo art; the graphics remain enumerated in documentFormatFiles but are not present on disk. Standalone XBRL artifacts referenced in dataFiles are also typically embedded inline rather than materialized as separate files.

Investment-company variant

Investment Company Act registrants — open-end funds, closed-end funds, and BDCs — file PRE 14A under the same form code but use a different content profile. The cover page identifies the trust by its 1940 Act file number (the 811- series), and the Schedule 14A body emphasizes Item 22 disclosures: approval or amendment of advisory or sub-advisory contracts, election of trustees, fund reorganizations and tax-free mergers, changes to fundamental investment policies, and Rule 12b-1 distribution-plan adoption or amendment. Joint proxies are common, with one accession number covering many fund trusts; in metadata.json this surfaces as a multi-entry entities array and a richly populated seriesAndClassesContractsInformation array enumerating the affected series (S000XXXXXX) and classes/contracts (C000XXXXXX). Operating-company-style executive-compensation tables and pay-versus-performance disclosures are absent from fund proxies.

Amendments

Amended preliminary proxies are filed as PRE 14A/A (a separate form type) and are not included in this dataset. Corrections or revisions to a preliminary proxy issued before the definitive version are therefore captured only in the form-specific PRE 14A/A dataset; PRE 14A records here are first-filed preliminary proxies under the bare PRE 14A form code.

Evolution of required content

The structure of Schedule 14A has been progressively expanded since EDGAR began accepting PRE 14A filings in 1994:

  • 1992 executive-compensation rules (predating EDGAR coverage but in force across the dataset's full history) introduced the modern Summary Compensation Table, the Stock Performance Graph, and the Compensation Committee Report, all of which appear in PRE 14A filings from the start of the dataset.
  • 2003 SOX-era amendments added audit-committee disclosures and the principal-accountant fees table (audit, audit-related, tax, and all-other categories) under Item 9.
  • 2006 SEC executive-compensation reform replaced the legacy compensation tables with the Compensation Discussion and Analysis narrative and the redesigned table set (Summary Compensation, Grants of Plan-Based Awards, Outstanding Equity Awards, etc.) and tightened related-party transaction disclosure under Item 404.
  • 2010 Dodd-Frank Act added Say-on-Pay and Say-on-Frequency votes (Item 24), pay-ratio disclosure (effective for fiscal years beginning on or after January 1, 2017), and hedging-policy disclosure under Item 407(i).
  • 2022 Pay-versus-Performance final rule under Section 14(a)(i) of the Exchange Act added the Pay-versus-Performance table — compensation actually paid, total shareholder return against a peer group, net income, and a company-selected measure — and required inline XBRL tagging of those facts.
  • 2022 Rule 14a-6(i) amendments restructured the Payment of Filing Fee section on the cover page and moved the fee calculation into a dedicated structured exhibit, Exhibit 107 (EX-FILING FEES).
  • 2022 universal-proxy rules under Rule 14a-19 require, in contested director elections, that the proxy card list all duly nominated candidates regardless of which party is soliciting; PRE 14A filings in contested matters since then carry a universal proxy card and Rule 14a-19 disclosures.
  • Periodic refinements have added board-diversity disclosure, cybersecurity oversight disclosure, clawback-policy descriptions tied to Rule 10D-1, and human-capital narrative cross-references.

Evolution of data format

PRE 14A submissions span the full EDGAR era from January 1994 onward and have moved through three presentation regimes:

  • 1994–2001 ASCII / SGML-wrapped plain text. Earliest PRE 14A filings were ASCII-only, with the entire proxy statement rendered as wrapped plain text inside the EDGAR <DOCUMENT> / <TEXT> envelope. Tabular content (Summary Compensation Table, beneficial ownership) was hand-aligned with whitespace.
  • 2001–early 2010s SGML-wrapped HTML. As EDGAR accepted HTML, filings transitioned to HTML bodies inside the same <DOCUMENT> wrapper, with native HTML tables, bold headings, and limited inline images (which the dataset omits). The SGML wrapper continued to carry <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> metadata for each document.
  • Inline XBRL era. Cover-page tagging requirements progressively obligated PRE 14A primary documents to be filed as Inline XBRL XHTML, with dei cover-page facts tagged in an <ix:header>. The 2022 Pay-versus-Performance rule extended inline tagging to a substantive disclosure block within the proxy statement, embedding numerous <xbrli:context> definitions and <ix:nonFraction> facts directly in the primary document. Since this rollout, operating-company PRE 14A primary documents are predominantly iXBRL XHTML, while investment-company filings — which are not subject to PvP — generally remain SGML-wrapped HTML.

Interpretation notes

  • The accession folder is the unit of identity. When a fund family files a joint preliminary proxy, all participating trusts share one accession number and one folder; the entities array in metadata.json is the only place where the individual trust CIKs are surfaced.
  • documentFormatFiles is faithful to the EDGAR index and therefore enumerates more files than are present on disk: graphics are listed but excluded, and structured XBRL side files may be listed even when their content is embedded inline rather than supplied separately. To reconcile what is on disk against what was submitted to EDGAR, compare the basenames of documentUrl entries with the file inventory in the folder.
  • linkToTxt always points to the EDGAR complete-submission txt, the authoritative SGML bundle of every document in the original filing including the omitted graphics. For downstream uses that require byte-for-byte fidelity to the EDGAR submission, that URL is the canonical reference.
  • For iXBRL filings, the primary .htm is simultaneously the human-readable proxy statement and the XBRL instance document. Extracting structured cover-page or pay-versus-performance facts requires parsing <ix:header> and <ix:nonFraction> / <ix:nonNumeric> elements rather than relying solely on visible text.
  • Schedule 14A items frequently incorporate by reference the corresponding sections of the registrant's Form 10-K (audited financials, risk factors, executive compensation tables) — particularly under Item 13 in transactional contexts. The PRE 14A record carries only the proxy itself; the incorporated content remains in the referenced 10-K and is not duplicated in this dataset.
  • "Preliminary" markings and bracketed placeholders ("PRELIMINARY COPY", "SUBJECT TO COMPLETION", and bracketed record dates, proxy-card vote totals, fairness-opinion sign-off dates) are characteristic of PRE 14A text and should be expected; they are replaced or finalized in the corresponding DEF 14A filing that follows.
  • Filing-agent fingerprints (Toppan Merrill tm…, EdgarAgents ea…, Donnelley d…/dp…, Broadridge, RDG Filings) are visible in primary-document filenames and embedded HTML comments but are not surfaced as a structured field; downstream classification of agent or template requires document-level inspection.

Who Files or Publishes This Dataset, and When

Who files the record

Form PRE 14A is filed by registrants soliciting proxies from holders of securities registered under Section 12 of the Securities Exchange Act of 1934. The filer is always the soliciting party on whose proxy card the vote is being requested.

The reporting population includes:

  • Domestic operating companies with equity registered under Section 12(b) (exchange-listed) or Section 12(g) (over-threshold). These dominate the dataset and account for most merger, going-private, and contested-election preliminaries.
  • Registered investment companies (open-end and closed-end funds, BDCs, certain UITs) soliciting fund-shareholder votes on mergers, advisory-contract changes, fundamental policy changes, or board restructurings. Fund proxies are subject to Regulation 14A as supplemented by Section 20 of the Investment Company Act of 1940.
  • Other Section 14(a) entities, including bank holding companies, REITs, Section 12-registered limited partnerships, and SPACs seeking approval of a business combination, charter extension, or trust-account action.
  • Registrants in contested situations soliciting on their own card. Dissidents running a competing slate file under PREC14A, PREN14A, or PRRN14A rather than PRE 14A.

Foreign private issuers are excluded. FPIs are exempt from Regulation 14A under Rule 3a12-3(b) and instead furnish proxy and information materials on Form 6-K. They do not appear in this dataset.

When the record is created or required

PRE 14A is event-driven, not periodic. The trigger is a decision to solicit proxies on at least one matter that is not within the routine list in Rule 14a-6(a). Routine items that bypass preliminary filing include uncontested director elections, ratification of accountants, Rule 14a-8 shareholder proposals, ordinary compensation- and benefit-plan votes, and Rule 14a-21 say-on-pay/say-on-frequency votes. Annual meetings limited to those items go directly to DEF 14A and produce no PRE 14A.

A preliminary filing is required when the solicitation includes any non-routine matter, typically:

  • mergers, consolidations, asset sales, and other business combinations;
  • going-private transactions subject to Rule 13e-3;
  • contested director elections (registrant-side filings under Rule 14a-12 procedures);
  • authorization, modification, or exchange of securities tied to a transaction (including recapitalizations, reclassifications, and reverse splits);
  • substantive charter or bylaw amendments outside Rule 14a-6(a) (anti-takeover provisions, classified boards, supermajority voting);
  • roll-up transactions under Item 901 of Regulation S-K;
  • dissolution, liquidation, or wind-up proposals;
  • fund mergers, fundamental policy changes, and other non-routine investment-company items.

Timing is anchored to the definitive distribution. Under Rule 14a-6, the preliminary proxy must be on file with the SEC at least ten calendar days before the definitive DEF 14A is first sent or given to security holders. The clock runs from the EDGAR filing date and gives Corporation Finance staff (or Investment Management staff, for funds) the chance to issue comments. Revisions in response to comments are filed as PRER14A amendments before the registrant clears to DEF 14A.

The governing framework is Section 14(a) and Regulation 14A (Rules 14a-1 through 14a-21), with line-item content prescribed by Schedule 14A. The PRE 14A document is, in substance, a Schedule 14A marked "PRELIMINARY COPY," subject to the antifraud standard of Rule 14a-9 and the pre-statement-communication rules of Rule 14a-12.

Important distinctions

  • PRE 14A vs. DEF 14A — PRE 14A is the preliminary draft for staff review; DEF 14A is the definitive version delivered to holders. A given meeting normally produces both, separated by the ten-calendar-day window or longer if comments are issued.
  • PRE 14A vs. PRER14A — PRER14A is a revised preliminary proxy filed to address staff comments or other changes. Subsequent amendments use PRER14A, not a new PRE 14A.
  • PRE 14A vs. PREC14A / PREN14A / PRRN14A — The "C," "N," and "RN" forms are contested-solicitation variants. PREC14A is a preliminary contested filing (registrant or contestant); PREN14A and PRRN14A are non-management preliminary and revised non-management preliminary filings used by dissidents. Dissident slates do not file PRE 14A.
  • PRE 14A vs. PRE 14C — PRE 14A covers proxy solicitations. Where no proxies are solicited because the action will be approved by written consent of a controlling holder, the registrant files preliminary information-statement materials on PRE 14C instead.
  • PRE 14A vs. embedded S-4 proxy/prospectus — When a merger requires registration of acquirer securities, the proxy statement is typically combined into a Form S-4 joint proxy/prospectus, and no stand-alone PRE 14A is filed. PRE 14A is used for cash mergers, all-existing-security transactions, and similar deals that do not require Securities Act registration.
  • Filer vs. subject — The filer is the registrant whose holders are being solicited. Counterparties, target companies, director nominees, and beneficial owners listed within the document are disclosed in the proxy but are not themselves filers of the PRE 14A record.

How This Dataset Differs From Similar Datasets or Filings

PRE 14A sits inside a dense cluster of Section 14(a) and 14(c) filings. Boundaries with neighboring forms typically resolve along four axes: filing stage (preliminary, definitive, supplemental, revised), trigger (routine vs non-routine, vote vs written consent, contested vs uncontested), filer side (issuer vs dissident vs voter), and scope (standalone proxy vs combined registration).

DEF 14A — Definitive Proxy Statement

The direct downstream counterpart. Same Schedule 14A scope, same registrant; PRE 14A is filed first for non-routine matters and held at least ten calendar days for staff review before the DEF 14A is mailed and used to solicit votes. PRE 14A may carry placeholder dates or to-be-finalized figures; DEF 14A is the operative, distributed version. Routine annual-meeting proxies skip the preliminary stage under Rule 14a-6(a), so PRE 14A is structurally biased toward non-routine matters while DEF 14A covers the full proxy population.

DEFA14A — Additional Definitive Proxy Soliciting Materials

Supplemental campaign materials (presentations, press releases, shareholder letters, Q&A) filed after the DEF 14A, not a draft proxy. PRE 14A is the full preliminary statement filed before the DEF 14A; DEFA14A accumulates after it. Sequential and complementary, never substitutes.

PRER14A and DEFR14A — Revised Proxy Statements

PRER14A is a refiled preliminary proxy with material changes, typically responding to staff comments; DEFR14A is the revised definitive. PRE 14A captures only the initial preliminary submission. Reconstructing the staff-comment redline requires pairing PRE 14A with PRER14A, both of which sit under separate form codes.

PRE 14C and DEF 14C — Information Statements

Schedule 14C governs corporate actions taken without soliciting proxies, usually because written consents from a majority of voting power are already in hand. Disclosure content overlaps heavily with Schedule 14A, but no proxy card is included and the filer is informing rather than asking. PRE 14A covers matters going to an actual vote; PRE 14C covers the parallel consent-driven universe. The two should not be conflated: the legal posture and the room for shareholder influence differ fundamentally.

PREC14A, PREN14A, PRRN14A, DFAN14A — Contested and Non-Management Variants

Adversarial-side analogs: PREC14A (preliminary contested proxy), PREN14A (preliminary non-management proxy), PRRN14A (revised non-management preliminary), DFAN14A (additional definitive non-management materials, the dissident DEFA14A). PRE 14A can be filed by an issuer in a contested matter, but the C and N series specifically flag dissident or non-management activity under distinct form codes. A full proxy contest record requires PRE 14A on the issuer side combined with the C/N filings on the insurgent side.

PREM14A and DEFM14A — Merger Proxy Statements

Reserved form codes for mergers, acquisitions, and similar extraordinary transactions. Mergers are inherently non-routine and are the most common driver of preliminary-stage filings, but they are routed to PREM14A/DEFM14A rather than PRE 14A/DEF 14A. PRE 14A therefore carries non-routine, non-merger matters: charter amendments, equity plan adoptions over Rule 14a-6 thresholds, going-private elements not labeled as mergers, and contested elections without a transaction component. M&A solicitation studies should anchor on PREM14A/DEFM14A and use PRE 14A only as a secondary source.

Form 8-K Item 5.07 — Submission of Matters to a Vote of Security Holders

Post-meeting vote tally filed within four business days of the meeting. Structurally tabular (votes for, against, abstain, broker non-votes per proposal) versus PRE 14A's narrative and exhibit-heavy form. Timing is opposite: PRE 14A frames the ask before the meeting, Item 5.07 records the outcome after. Neither substitutes for the other; they bookend the solicitation cycle.

Form N-PX — Annual Proxy Voting Record

Voter-side disclosure filed by registered investment companies and, post-2022, certain institutional managers under Section 14A(b), reporting how they voted at portfolio companies. PRE 14A is issuer-side and pre-meeting; N-PX is voter-side and post-meeting. The datasets join on issuer and meeting date but capture opposite ends of the lifecycle.

Form S-4 — Registration Statement for Business Combinations

Securities Act registration used when securities are issued in a business combination, exchange offer, or recapitalization. Where shareholder approval is also required, S-4 is commonly filed as a joint proxy statement/prospectus, folding Schedule 14A disclosures into a registration document subject to Securities Act review standards. For stock-for-stock mergers the S-4/joint proxy is typically the operative filing and no separate PREM14A or PRE 14A is filed. PRE 14A therefore underrepresents merger solicitations bundled with securities issuance; a complete merger corpus needs both.

Form 10-K Part III incorporation by reference

10-K Part III (executive compensation, ownership, director independence, related-party transactions, accountant fees) is routinely satisfied by incorporating the definitive proxy by reference if filed within 120 days of fiscal year-end. The incorporation points to DEF 14A, not PRE 14A. PRE 14A is not the operative source for 10-K-linked governance and compensation disclosures; its value in this context is limited to studying how Part III content evolved between the preliminary and definitive versions.

Boundary summary

PRE 14A is distinctive as the SEC-review-stage version of non-routine, vote-soliciting proxy materials filed under Schedule 14A by the issuer. It is narrower than the full proxy universe (excludes routine DEF 14As that skip the preliminary stage), distinct from Schedule 14C (vote vs consent), distinct from PREM14A (non-merger non-routine matters only), distinct from the contested C/N series (issuer-side rather than dissident), and distinct from S-4 (proxy disclosure only, no securities registration). Its unique value is the pre-clearance window: language and structure as first submitted to the staff, before the redline that produces the definitive proxy.

Who Uses This Dataset

Preliminary proxies surface non-routine ballot items (mergers, charter changes, contested elections, equity-plan requests) before the definitive DEF 14A is mailed. Users anchor on specific sections: Schedule 14A item lists, Background-of-the-Merger, fairness opinions, CD&A, Say-on-Pay, exhibits, meeting and record dates, and the eventual PRE-to-DEF redline.

M&A bankers and merger-arbitrage desks

Event-driven traders price spreads off the Background-of-the-Merger narrative, the financial advisor's fairness opinion and projection tables, deal-protection terms (no-shop, matching rights, termination fees), and the disclosed meeting timeline. The preliminary version lets desks size positions before the record date is fixed and track PRE 14A/A amendments, which often signal SEC comments or litigation pressure on disclosure.

Activist funds and dissident solicitors

Activists screen PRE 14A filings for contested slates, charter items affecting shareholder rights (advance-notice bylaws, supermajority thresholds, classified boards), and Rule 14a-8 proposals that cleared no-action review. They work from the proposed slate, incumbent biographies, and the matters-to-be-voted-on schedule to time counter-solicitations or escalation letters before definitive distribution.

Proxy advisors and governance research teams

Voting-recommendation analysts use the preliminary stage to begin drafting reports on complex ballots: equity-plan share requests, M&A votes, compensation amendments, and governance proposals. They focus on equity-plan dilution and burn-rate tables, CD&A and Say-on-Pay narrative, related-party transactions, and director-independence representations, getting analyses ready before the DEF 14A opens the formal voting window.

Securities and M&A counsel

Disclosure lawyers mine the dataset as a precedent library when drafting their own preliminary proxies, benchmarking treatment of deal background, alternatives considered, advisor conflicts, appraisal rights, and golden-parachute disclosures. Litigation counsel reviews the same sections to assess strike-suit and disclosure-only settlement exposure on merger proxies. Proxy solicitation firms use the filing flow to map upcoming contested or complex votes.

Corporate secretary and IR teams at filers

In-house teams benchmark structure, exhibit usage, and disclosure ordering against peer PRE 14As before filing their own. They study peer PRE 14A/A amendments to anticipate likely SEC staff comments on sensitive items: pay realignments, golden parachutes, dual-class structures, and shareholder proposal responses.

Compensation consultants

Reward-design teams compare CD&A drafts, peer-group selection rationale, performance-metric design, pay-versus-performance tables, clawback descriptions, and equity-plan share-request justifications across preliminary filings. PRE-to-DEF deltas reveal where staff comments or board feedback reshaped pay disclosure language.

Academic governance and forensic accounting researchers

Researchers run PRE-versus-DEF redline studies to identify disclosures modified after staff comments, plaintiff complaints, or institutional engagement. The same filings build event histories for studies linking disclosure quality, deal-protection design, or contest features to vote outcomes and post-deal performance.

Quant and event-driven signal developers

Quant teams treat the dataset as a structured event feed. Filing date, meeting date, record date, matter-type classifications, and deal terms feed completion-probability and vote-outcome models. Textual features from Background-of-the-Merger, fairness-opinion language, and projection tables become inputs to systematic merger-arb and governance strategies.

Compliance and proxy operations at brokers and advisers

Proxy operations teams at broker-dealers, custodians, and advisers use PRE 14A flow to stage upcoming non-routine ballots for pass-through voting, fund-board review, and ERISA fiduciary review. Issuer identifiers, meeting dates, and proposal-type tagging drive ballot setup and vote-recommendation pipelines.

Credit and special-situations analysts

Credit desks read deal-financing terms, pro forma capital structure, change-of-control covenants, and indemnification arrangements in M&A and recapitalization PRE 14As. Special-situations desks watch for asset-sale, going-private, and charter-change proposals that may trigger bond puts or covenant tests.

Data journalists and regulatory researchers

Governance and deal reporters use preliminary filings to surface contested elections, controversial pay, and complex M&A before the definitive proxy mails. Coverage back to 1994 supports long-horizon trend work on deal-protection terms, executive pay, and shareholder-proposal outcomes.

LLM and RAG developers in governance and M&A NLP

Teams building extraction and retrieval systems use the standardized Schedule 14A structure and paired PRE/DEF documents for training and evaluation: proposal classification, fairness-opinion extraction, board-slate parsing, equity-plan term extraction, and redline detection between draft and final disclosures.

Specific Use Cases

PRE 14A records carry the first-filed, pre-clearance version of non-routine proxy materials. The use cases below tie to specific Schedule 14A items, exhibits, and the PRE-to-DEF lifecycle.

Early M&A signaling and merger-arb position sizing

Event desks parse Item 14 sections — Background of the Merger, board reasons for approval, financial advisor fairness opinions and projection tables, no-shop and termination-fee provisions, treatment of equity awards, and the disclosed meeting timeline — out of the primary .htm to seed completion-probability and spread models. The periodOfReport meeting date and filedAt acceptance timestamp anchor the timeline; downstream PRER14A/PRE 14A/A activity flags SEC-comment or litigation pressure on the same disclosure block. The preliminary stage gives desks a 10+ day window to position before record dates are finalized.

PRE-to-DEF redline analytics on governance and pay

Pair each PRE 14A primary document with the matching DEF 14A on the same accession lineage and diff Items 7, 8, 10, and 14 to surface staff-comment or plaintiff-driven edits — reworded fairness-opinion caveats, restated equity-plan dilution figures, softened CD&A language, added appraisal-rights detail, or new related-party disclosures. Output is a redline corpus used by governance researchers, disclosure counsel, and forensic accounting studies linking specific edits to vote outcomes or post-deal performance.

Contested-vote and universal-proxy intelligence

Filter the corpus for filings carrying Rule 14a-19 universal-proxy cards, Item 4 solicitation-cost disclosures by a non-management party, Item 5 interests of certain persons, and bylaw or charter items affecting advance-notice or board-classification provisions. Cross-join with the contested C/N series (PREC14A, PREN14A, DFAN14A) on issuer CIK and meeting date to assemble full proxy-contest records, feeding activist trackers and counter-solicitation timing tools.

Compensation and equity-plan benchmarking

Pull Item 8 CD&A, Summary Compensation Table, Pay-versus-Performance table (extracted via inline XBRL <ix:nonFraction> facts in the iXBRL primaries), peer-group selection rationale, clawback descriptions, and Item 10 equity-plan share-reserve, burn-rate, dilution, and new-plan-benefits tables. Compensation consultants use these to benchmark share-request justification, performance-metric design, and CEO-pay quantum across SIC- and ticker-matched peers drawn from entities[].sic and entities[].tickers.

Drafting precedent library for disclosure counsel

Securities and M&A counsel query the dataset as a template bank: golden-parachute tables, appraisal-rights notices, advisor-conflict disclosures, no-shop and matching-rights language, charter-amendment marked copies, and Exhibit 107 fee-table layouts. Filtering by entities[].sic, transaction type (Item 14 vs Item 19 vs Item 22), and filing-agent fingerprint in the primary filename (tm…, d…, ea…) produces precedent sets matched to deal type and drafting house.

Fund-proxy tracking via series-and-class metadata

For Investment Company Act registrants (file numbers in the 811- range), use the seriesAndClassesContractsInformation array to enumerate every affected series (S000XXXXXX) and class (C000XXXXXX) under a single joint-proxy accession. This drives ballot setup at custodians and advisers, monitoring of Item 22 advisory-contract approvals, sub-advisory changes, fund reorganizations, and 12b-1 plan amendments at the share-class level.

ML training and RAG corpus for Schedule 14A extraction

The standardized Schedule 14A item ordering and the paired PRE/DEF lifecycle make the dataset a labeled-by-structure training source for proposal classification (Items 7, 8, 10, 14, 19, 22, 24), fairness-opinion-section extraction, board-slate parsing, equity-plan term extraction, and redline detection. Inline XBRL cover-page facts in the <ix:header> provide gold labels for dei fields (entity, ticker, document type), and the per-record metadata.json supplies stable join keys (accessionNo, id, CIK) for retrieval indexes.

Dataset Access

The Form PRE 14A Files Dataset is accessible through a JSON index endpoint, a full archive download, and per-container downloads. The dataset covers filings from January 1, 1994 to the present, is delivered in ZIP container format, and includes TXT, JSON, HTML, and PDF file types.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-pre-14a-files.json

Returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total record count, total size, form types covered, container format, and file types), the full dataset download URL, and the list of all individual container files with per-container size, record count, updated timestamp, and download URL. This endpoint does not require an API key. Use it to monitor which containers were updated in the most recent refresh and to decide which containers to download incrementally.

Example
1 {
2 "datasetId": "1f13365b-9ae0-68fd-a540-d7488b5449a6",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-pre-14a-files.zip",
4 "name": "Form PRE 14A Files Dataset",
5 "updatedAt": "2026-05-05T02:49:07.617Z",
6 "earliestSampleDate": "1994-01-01",
7 "totalRecords": 50453,
8 "totalSize": 3088655779,
9 "formTypes": ["PRE 14A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-pre-14a-files/2026/2026-03.zip",
15 "key": "2026/2026-03.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-05-05T02:49:07.617Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-pre-14a-files.zip?token=YOUR_API_KEY

Downloads the complete dataset as a single ZIP archive containing all monthly containers. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-pre-14a-files/2026/2026-03.zip?token=YOUR_API_KEY

Downloads one individual monthly container ZIP, which is useful for incremental updates or when only a specific time period is needed. This endpoint requires an API key.

Frequently Asked Questions

What form does this dataset cover?

The dataset covers Form PRE 14A, the preliminary proxy statement filed under Regulation 14A of the Securities Exchange Act of 1934. The substantive content of each filing is governed by Schedule 14A (Rule 14a-101), and every filing is marked "PRELIMINARY COPY" with the corresponding box on the Schedule 14A cover page checked.

What does one record in this dataset represent?

One record represents a single EDGAR PRE 14A submission, identified by its 18-digit accession number and materialized on disk as one folder named after that accession (dashes stripped). Each folder contains a metadata.json filing-level descriptor and the original EDGAR documents (the primary preliminary proxy statement plus any separately filed exhibits), with image GRAPHIC files removed.

Who is required to file PRE 14A?

PRE 14A is filed by registrants soliciting proxies from holders of securities registered under Section 12 of the Exchange Act when the solicitation includes at least one matter that is not within the routine list in Rule 14a-6(a). The reporting population spans domestic operating companies, registered investment companies (open-end funds, closed-end funds, BDCs), bank holding companies, REITs, Section 12-registered limited partnerships, and SPACs. Foreign private issuers are excluded under Rule 3a12-3(b).

When must a PRE 14A be filed relative to the definitive proxy?

Under Rule 14a-6, the preliminary proxy must be on file with the SEC at least ten calendar days before the definitive DEF 14A is first sent or given to security holders. The window gives Corporation Finance staff (or Investment Management staff, for funds) the chance to issue comments before the solicitation reaches investors; revisions filed in response to staff comments use the PRER14A form code.

How does this dataset differ from the DEF 14A dataset?

PRE 14A is the preliminary, pre-clearance draft submitted for staff review on non-routine matters; DEF 14A is the definitive version mailed to security holders. Routine annual-meeting proxies skip the preliminary stage under Rule 14a-6(a) and are filed directly as DEF 14A, so the PRE 14A dataset is structurally biased toward non-routine items — mergers, going-private deals, charter amendments, equity-plan adoptions, and contested elections — while DEF 14A covers the full proxy population. PRE 14A also captures placeholder dates and to-be-finalized figures that are resolved in the corresponding DEF 14A.

Are PRE 14A/A amendments included in this dataset?

No. Amended preliminary proxies are filed under the separate PRE 14A/A form code and are captured in the form-specific PRE 14A/A dataset. This dataset contains only first-filed preliminary proxies under the bare PRE 14A form code.

What file format is the dataset distributed in?

The dataset is delivered in ZIP container format. Inside each container, records carry TXT (legacy ASCII proxies and the EDGAR complete-submission text bundle), HTML (the primary proxy-statement document, either SGML-wrapped HTML or Inline XBRL XHTML in the modern era), JSON (the per-record metadata.json descriptor), and PDF (used occasionally by filers for ancillary exhibits). Image files referenced in the original submission are excluded from the on-disk container.