The Form DEFC14A Files dataset is a complete EDGAR archive of definitive contested proxy statements filed on Schedule 14A under Section 14(a) of the Securities Exchange Act of 1934 and Regulation 14A. Each record represents one accepted DEFC14A submission — the formal, definitive solicitation document used in a contested shareholder vote — and is materialized as a folder containing a structured metadata.json descriptor plus the as-filed proxy statement and exhibits. Filings come from two opposing populations on the same contests: the registrant (management side) and dissident soliciting persons such as activist hedge funds, shareholder groups, founders, and ad hoc committees. The dataset begins on January 1, 1994, the start of mandatory EDGAR submission for most domestic registrants, and continues through every monthly refresh, distributed as monthly ZIP containers with TXT, JSON, PDF, and HTML payloads.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
A single record in the form-defc14a-files dataset is one complete Form DEFC14A submission accepted by EDGAR — a definitive proxy statement on Schedule 14A filed in connection with a contested solicitation of security-holder votes. Each record is materialized as a folder named with the EDGAR accession number with the dashes removed (an 18-digit numeric string such as 000199937125011925 for accession 0001999371-25-011925). That folder bundles together the entire EDGAR submission package for the filing: a metadata.json descriptor at the root, plus every document the registrant attached to the submission except embedded image files. JPG, GIF, and PNG graphics filed under EDGAR's GRAPHIC type are enumerated in the metadata but are not materialized to disk. The folder is the atomic unit of the dataset; one folder is one filing is one DEFC14A.
DEFC14A is the EDGAR form-type code for a definitive proxy statement on Schedule 14A when the solicitation is contested — that is, when the soliciting party is opposed by another person also soliciting proxies on the same matter. In practice the form-type captures two adjacent populations:
Common contested matters include opposing director slates, opposed mergers or asset sales, "vote no" campaigns, contested bylaw or charter amendments, and shareholder-proposed governance reforms that have escalated into a public solicitation contest. The disclosure framework follows Schedule 14A under Rule 14a-101 — the same enumerated Items as an ordinary DEF 14A — but the presentation is heavily inflected by the adversarial posture: detailed background-of-the-solicitation narratives, point-by-point rebuttals, "reasons to vote" sections, and the participants-in-the-solicitation disclosures required by Items 4 and 5 of Schedule 14A and Instruction 3 of Item 4(b). The dataset spans the full EDGAR record from January 1994 to the present and is distributed in monthly ZIP containers; payload file types are TXT, JSON, PDF, and HTML.
Records are distributed inside monthly ZIP containers organized by calendar year. Inside a container, the top-level directory mirrors the year-month, and each filing lives in its own subfolder keyed by the dash-stripped accession number. A filing folder always contains:
metadata.json, written in a stable schema; and<DOCUMENT> envelope — typically one HTML proxy statement plus any separately filed exhibits, with GRAPHIC images omitted.Filing-document filenames are preserved as supplied by the registrant and commonly follow patterns like <short-issuer-tag>-defc14a_<mmddyy>.htm, <issuer>-px<NN>.htm, or, for dissident-filed contests, <group>-letter.htm, <group>-pxcard.htm, and similar. The file-types present in the dataset are TXT, JSON, PDF, and HTML: JSON is contributed by the metadata layer, HTML is the dominant proxy-statement format for filings since roughly 2002, TXT covers older ASCII filings and any plain-text exhibits, and PDF appears for exhibit attachments such as scanned letters, fairness opinions, or investor presentations.
metadata.json — packaging metadataThe metadata file mirrors EDGAR's submission header and index in a structured JSON schema with stable keys. Empty arrays and empty strings are emitted explicitly rather than omitted, so consumers can rely on the presence of every documented key without defensive existence checks.
Top-level keys:
formType — fixed at "DEFC14A" for this dataset.accessionNo — canonical accession number with dashes (e.g., 0001999371-25-011925); pairs with the dash-stripped folder name as a stable join key.filedAt — acceptance timestamp in ISO 8601 with timezone offset.description — EDGAR's human-readable form description, typically "Form DEFC14A - Definitive proxy statement, contested solicitations".linkToFilingDetails — URL of the primary filing document on www.sec.gov.linkToTxt — URL of the complete EDGAR submission .txt SGML wrapper that concatenates every document.linkToHtml — URL of EDGAR's filing index page for the accession.linkToXbrl — URL of an XBRL instance document if one was filed; an empty string for the typical DEFC14A, since proxy statements are not within the XBRL tagging mandate.documentFormatFiles[] — ordered list of every document attached to the submission, one element per document.dataFiles[] — XBRL data files; typically empty for proxy filings.entities[] — one entry per filer/subject entity associated with the submission.seriesAndClassesContractsInformation[] — populated only for investment-company proxies; typically empty for operating-company contests.id — a 32-character hexadecimal internal record identifier.documentFormatFiles[] entriesEach element describes a single document in the submission and carries:
sequence — the EDGAR submission sequence number as a string ("1", "2", …); the trailing entry that points to the complete .txt wrapper has a blank sequence and type.documentUrl — full sec.gov URL to the file.description — EDGAR-supplied label such as "DEFINITIVE PROXY STATEMENT", "GRAPHIC", "EXHIBIT", or "Complete submission text file".type — EDGAR document-type code, e.g. DEFC14A, GRAPHIC, EX-99, EX-99.1.size — byte size of the document, as a string.The array is ordered by submission sequence and always terminates with the synthetic complete-submission .txt entry.
entities[] entriesEach entry describes one party associated with the filing and carries cik, companyName, type (form type), tickers[], stateOfIncorporation, fiscalYearEnd (as MMDD), and irsNo. Entries representing the subject registrant additionally carry act (e.g., "34" for the Exchange Act), fileNo (e.g., "811-23358" for a registered investment company), and filmNo (the EDGAR film number assigned at acceptance), since those identifiers attach to the registered issuer rather than to the soliciting party.
The role of each entity is encoded into the companyName field by an EDGAR-style suffix:
(Filed by) — the soliciting filer.(Subject) — the company whose securities are the subject of the contest.For management-filed contested proxies both entries point at the same legal entity (the issuer is filing against a dissident opposing it). For dissident-filed contested proxies the filer is the dissident person or group and the subject is the issuer; the two entries carry different cik values. Downstream classification of a record as management-side versus dissident-side is therefore a comparison of the (Filed by) cik to the (Subject) cik.
The on-disk documents are not bare HTML or PDF. Each is an EDGAR SGML <DOCUMENT> envelope whose header lines mirror the metadata:
1
<DOCUMENT>
2
<TYPE>DEFC14A
3
<SEQUENCE>1
4
<FILENAME>angel-defc14a_092625.htm
5
<DESCRIPTION>DEFINITIVE PROXY STATEMENT
6
<TEXT>
7
<HTML>
8
... actual HTML body of the proxy statement ...
9
</HTML>
10
</TEXT>
11
</DOCUMENT>
The unclosed header tags (<TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>) are SGML conventions; the rendered content lives inside the inner <TEXT> block. For modern filings the <TEXT> block holds a full <HTML>...</HTML> document; for pre-2002 filings it holds plain ASCII; for PDF exhibits it holds a uuencoded payload introduced by <PDF> markers. Strict HTML parsers will fail on the SGML prefix; consumers must either skip past the SGML header to the inner <HTML> block or use a lenient parser.
The HTML payload of a DEFC14A is a Schedule 14A document and follows the Rule 14a-101 Item ordering, although headings as printed often use plain-English captions rather than literal Item numbers. The typical content layers, in approximate order, are:
A record contains everything needed to read and analyze the contested proxy materials as filed:
metadata.json descriptor with header fields, document index, and entity blocks.EX-99 press releases, supplemental letters, investor presentations, "fight letters," bylaw exhibits, and PDFs of related documents.Several categories of submission content are deliberately not materialized to disk inside a record:
GRAPHIC document type — JPG, GIF, and PNG referenced from the HTML proxy statement for logos, photos of nominees, charts, and cover-page graphics). Their URLs are preserved in metadata.json → documentFormatFiles[], and a consumer can resolve them on demand from www.sec.gov. The HTML, when rendered offline, will show broken image references unless those images are fetched separately..txt SGML wrapper. It is enumerated in documentFormatFiles[] (with a blank sequence and type) and reachable via linkToTxt, but is not extracted to disk because it duplicates content already present in the individual document files.DEFA14A), revised cards (DEFR14A), preliminary contested filings (PREC14A), additional soliciting material, and Schedule 14N nominations live under different EDGAR form types and accession numbers and are therefore separate records under different datasets — they are not in form-defc14a-files.dataFiles[] is typically empty and linkToXbrl is an empty string.The DEFC14A form type has existed throughout the EDGAR era (1994 onward), but the substantive content of the proxy statements inside has evolved as Schedule 14A and adjacent rules have changed:
DEFA14A/PRRN14A) that surround a contest. The DEFC14A itself becomes the consolidating definitive document referenced by those earlier communications.The DEFC14A dataset reflects the historical evolution of EDGAR document formats:
<DOCUMENT> envelope. Tables were rendered with monospaced character art; exhibits were separate ASCII documents. Image-bearing materials were largely impossible to convey, and consequently early contested proxies rarely include graphical content. These records appear on disk as .txt files inside the SGML wrapper.GRAPHIC documents; this dataset preserves the HTML but not the graphics.The SGML <DOCUMENT> envelope itself is the constant: every document in every record across the entire 1994-onward span is wrapped in the same EDGAR SGML structure, with header lines for <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> followed by a <TEXT> block.
metadata.json → accessionNo retains the dashes. Either form is a stable join key with EDGAR.entities[] array distinguishes the soliciting party from the issuer via the (Filed by) / (Subject) suffixes on companyName. For management-filed contests both entries point at the same legal entity; for dissident-filed contests they differ. Comparing the two cik values is the canonical way to classify a filing as management-side or dissident-side..htm files as raw HTML will fail in strict parsers because of the leading <DOCUMENT> / <TYPE> / <TEXT> lines. Consumers should either skip past the SGML header to the inner <HTML> block or use a lenient HTML parser that tolerates the prefix.DEFA14A (additional definitive material), DEFR14A (revised definitive proxy), and continued solicitation filings. Each is a separate accession number in EDGAR; only DEFC14A accessions appear in this dataset, so reconstructing the full timeline of a contest may require joining against other form-type datasets.metadata.json rather than omitted, so downstream consumers can rely on the presence of every documented key without defensive existence checks.Form DEFC14A is filed by soliciting persons under Regulation 14A. In any contested matter, the population splits into two opposing sides, both of which file on this submission type:
The dataset therefore mixes issuer-originated and dissident-originated definitive proxy materials. Readers must distinguish the subject company (always the registrant being solicited) from the soliciting person (the actual filer, identified by the filer-side CIK in the EDGAR header).
The disclosure obligation arises under Section 14(a) of the Exchange Act of 1934, implemented by Regulation 14A (Rules 14a-1 through 14a-21). Substantive content is governed by Schedule 14A. Key rules shaping the DEFC14A population:
The trigger is a proxy contest or counter-solicitation. The "C" submission type applies when:
Once any of these conditions exists, both sides' definitive proxy statements adopt the "C" designation. If a contest emerges after a non-contested DEF 14A is already on file, filers re-file on the contested submission types (PREC14A, then DEFC14A).
DEFC14A has no fixed annual deadline; it is event-driven by a contested meeting. The standard flow:
The population concentrates in spring proxy season around annual meetings of calendar-year issuers, with secondary clusters at special meetings called for M&A votes or shareholder-requisitioned actions.
Schedule 14A Item 4 and Item 5(b), together with Schedule 14A Instruction 3 to Item 4, require each side to disclose every "participant." Issuer-side participants typically include the registrant, each director and director-nominee, and named executive officers actively soliciting. Dissident-side participants typically include each nominee on the dissident slate, each member of any nominating shareholder group, the investment adviser and general partner of any nominating fund, and any person financing or directing the solicitation.
For each participant, the DEFC14A must disclose name, business address, principal occupation, beneficial ownership, two-year transaction history in the registrant's securities, arrangements regarding nominations or future employment, and substantial interest in the matters voted on. Dissidents commonly cross-reference their Schedule 13D, since accumulating above 5% before a contest triggers Section 13(d) reporting in parallel.
Section 14(a) and the original proxy rules date to 1934, with the modern participant-disclosure regime substantially fixed by the 1956 and 1992 proxy rule revisions. Pre-EDGAR contested proxies exist only on paper and are outside this dataset. EDGAR became mandatory for most domestic registrants in phases between 1993 and 1996; the dataset begins January 1, 1994. DEFC14A filings dated after August 31, 2022, in director-election contests at non-investment-company registrants reflect the Rule 14a-19 universal-card mechanics.
Form DEFC14A sits inside the Regulation 14A proxy-solicitation family and is most usefully understood by mapping its boundaries against neighboring proxy forms and the ownership and event filings that surround a contested vote. The closest comparisons fall into three groups: other 14A variants (definitive, preliminary, revised, merger, and additional-materials), ownership and tender-offer schedules (13D and 14D-9), and adjacent event or voting-record disclosures (8-K and N-PX).
DEF 14A (uncontested definitive proxy). The direct non-contested analogue. Same Section 14(a) of the Exchange Act framework, same definitive status, same disclosure architecture. The distinction is the contest: DEFC14A is filed when management or a dissident is soliciting in opposition to another solicitation on the same matter. DEF 14A is filed by essentially every public registrant once a year for routine annual meetings; DEFC14A is a small, episodic population concentrated around activist campaigns, contested slates, and contested transactions. DEF 14A is the baseline corpus for governance and compensation research; DEFC14A is the targeted corpus for proxy-fight analysis.
PREC14A (preliminary contested proxy). The preliminary version of the same document. Content overlap is high; the eventual DEFC14A is typically a refined PREC14A after staff review. PREC14A is not the operative solicitation document and may contain provisional language or blanks. PREC14A is the leading indicator that a contest is forming; DEFC14A is the locked-in solicitation actually mailed to shareholders. Pairing the two reveals what changed during review.
PRRN14A and PRER14A (revised preliminaries). Intermediate preliminary amendments. PRRN14A is filed by a non-management (typically dissident) solicitor; PRER14A is filed by the registrant. Both share subject matter with DEFC14A but remain preliminary iterations rather than the definitive document. Useful for tracing how arguments evolved; not substitutes for the final solicitation.
DEFR14A and DEFM14A. Both share definitive status and the 14(a) framework but differ in trigger. DEFR14A amends a prior definitive (DEF 14A or DEFC14A) and is amendment-driven rather than contest-driven. DEFM14A is the definitive proxy for mergers and extraordinary transactions, transaction-driven rather than contest-driven. A contested merger can produce DEFM14A from the registrant and DEFC14A from an opposing dissident in parallel.
DEFA14A and DFAN14A (additional soliciting materials). Supplemental communications — letters, presentations, press releases, advertisements — filed after the definitive proxy is on file. DEFA14A is filed by any soliciting party; DFAN14A is filed specifically by non-management solicitors and is the densest source of dissident campaign rhetoric. They are short, frequent supplements rather than full proxy statements. DEFC14A provides the formal disclosure backbone of a contest; DEFA14A and DFAN14A carry the running communications around it. Reconstructing a contest typically requires all three.
Schedule 13D and 13D/A. Beneficial-ownership disclosure under Section 13(d) for non-passive holders above 5%. Dissident solicitors are commonly 13D filers, and Item 4 often signals nomination or opposition intent before any 14A appears. The regimes differ: 13D is an ownership disclosure focused on stake, financing, and intent, filed near-real-time; DEFC14A is a Section 14(a) solicitation document focused on the matters to be voted and the disclosures required to collect proxies. 13D answers "who and how much"; DEFC14A answers "what is being asked of shareholders and why." Complements, not substitutes.
Schedule 14D-9. Target-company recommendation statement responding to a third-party tender offer, governed by the tender-offer rules under Section 14(d). Both involve contested control situations, but 14D-9 responds to a purchase of shares, not a solicitation of votes. A tender offer can produce 14D-9 without any proxy contest, and a board-level proxy fight produces DEFC14A without any tender offer. The two converge only when an acquirer runs a tender offer and a parallel proxy solicitation.
8-K (current reports around contests). Carries contest-adjacent disclosures via Item 5.02 (director changes), Item 5.07 (voting results), and Item 8.01 (settlements, standstills). 8-K is short, item-coded, and event-driven across many topics; DEFC14A is a long-form solicitation document specific to a contested vote. 8-K typically captures the bookends of a contest — nominations, settlement, final results — while DEFC14A captures the substantive case made to shareholders in between.
N-PX (fund proxy voting records). Annual record of votes cast by registered investment companies and certain institutional managers. Overlap is purely downstream: N-PX records how funds voted on matters DEFC14A describes. N-PX is a tabular voting-record dataset filed by voters across the full universe of votes (mostly uncontested); DEFC14A is the upstream solicitation filed by the parties asking for votes. Useful when linked, never substitutable.
DEFC14A is uniquely defined by the intersection of three attributes: definitive (not preliminary or supplemental), contested (not routine), and a full proxy statement (not an ownership schedule, tender-offer response, event report, or voting record). Preliminary contested filings capture the same campaigns earlier; additional soliciting materials capture the surrounding communications; 13D and 14D-9 capture ownership and tender-offer dimensions; 8-K and N-PX bracket the contest with events and downstream votes. None substitute for DEFC14A when the question is the definitive, formally disclosed case made to shareholders in a proxy contest, but all are valuable complements for reconstructing a contest timeline.
Definitive contested proxy statements sit at the intersection of corporate governance, securities litigation, and event-driven investing. Each filing documents a live proxy fight, so a small set of professional roles draws on it, and each role focuses on different sections.
Used to study the historical record of contested solicitations. Key sections: the dissident's "reasons for the solicitation," nominee biographies, critiques of incumbent strategy, proposed bylaw or charter changes, and fight letters. Output: thesis taxonomies (operational underperformance, governance failure, board refreshment), rhetoric benchmarks, and internal campaign playbooks.
Treat each DEFC14A as a tradable catalyst. Key fields: record date, meeting date, vote standard (plurality vs. majority), and the specific proposals at issue. For "vote no" campaigns and competing-bid situations, they extract deal terms, board-process disclosures, and fairness considerations. Supports position sizing, scenario probabilities, and base-rate libraries on dissident win/settle/withdraw outcomes.
Use the corpus to design campaign strategy and pitch retainers. Focus on participant identification, proxy-card structure (universal vs. traditional), disclosed solicitation costs, Item 5 and Schedule 14A Instruction 3 participant tables, and record-date mechanics. Output: cost-per-holder benchmarks, mailing cadence, and intermediary selection.
Draft response statements, defensive proxies, and motions to compel disclosure. Focus on participant disclosures, beneficial-ownership schedules, group-formation evidence, prior contacts with management, and Section 13(d) and 14(a) representations. Mine exhibits for confidentiality and standstill provisions, cooperation-agreement templates, and disclosure schedules.
Draft compliant DEFC14A filings and Schedule 14A items. Focus on nominee questionnaires, arrangements among participants, nominee independence and qualifications, and Item 5(b) litigation disclosures. The corpus supports universal-proxy-card drafting under Rule 14a-19, including notice content and "vote how" instructions.
Build vote recommendations on contested ballots and longitudinal activism datasets. Focus on dissident qualifications and conflicts, issuer board and committee structure, prior engagement history, and any settlement or standstill record. Output: case memos, policy notes, and historical voting datasets sold to institutional clients.
Use peer DEFC14As proactively. Focus on issuer-response sections, the governance "asks" (declassification, special-meeting rights, proxy access), and disclosed engagement histories. Output: vulnerability assessments, board briefings, advance-notice bylaw refreshes, and pre-positioned legal, banking, and PR retainers.
Study how prior contests framed strategic alternatives, valuation arguments, and process critiques. Focus on financial analyses summarized in the proxy, references to advisor opinions, "background of the solicitation" timelines, and unsolicited-approach descriptions. Inform pitch books for both defense and dissident-side mandates.
At broker-dealers, investment advisers, and custodians: monitor solicitations that touch client positions, validate Section 13 and 16 compliance by participants, and police information barriers. Watch participant tables, ownership schedules, and amendment trails for late disclosures suggesting undisclosed coordination or group formation.
Use the 1994-to-present span to build empirical samples of contested elections. Focus on contest outcomes, nominee characteristics, target financials referenced in filings, dissident-thesis language, and contest counts by industry and year. Supports event studies, textual analysis, and structural models of contest initiation, settlement, and outcomes.
At financial data vendors and quant research groups: build activism feeds, contested-ballot calendars, and participant-graph products. Teams building retrieval-augmented systems ingest full filing text to power question answering on activism precedent, nominee histories, and settlement terms. Focus on the full proxy, exhibits (fight letters, presentations, cooperation agreements), and metadata linking each filing to issuer, filer, accession, and date.
Track active campaigns, profile repeat activists, and analyze full proxy seasons. Focus on dissident narrative letters, issuer rebuttals, disclosed shareholder communications, and references to behind-the-scenes negotiations. Supports both breaking coverage and longer-form analyses of activism trends and settlement rates.
The dataset is small in volume but dense per record. Investors and arbitrageurs trade contests as catalysts; counsel and solicitors run them; proxy advisors and stewardship teams recommend votes; corporate secretaries and IR teams prepare defenses; compliance and surveillance teams monitor them; and researchers, data engineers, and journalists study them in aggregate. Full-document coverage, including exhibits, is what makes the dataset usable across all of these workflows.
The DEFC14A corpus supports a small set of operational workflows where the unit of analysis is a contested solicitation and where the substantive content sits inside specific, predictable sections of Schedule 14A.
Pair each DEFC14A with the matching subject-side filing by joining on the (Subject) cik in metadata.json → entities[] and the meeting date parsed from the notice of meeting. Compare (Filed by) and (Subject) CIKs to label each record as management-side or dissident-side. The result is a deduplicated table of contests with both sides linked, used for base-rate libraries (win/settle/withdraw), event studies around record and meeting dates, and longitudinal activism datasets sold to institutional clients.
Extract the "Reasons for the Solicitation" and "Background of the Solicitation" sections from dissident-filed DEFC14As and classify the thesis (operating underperformance, capital allocation, governance failure, strategic review, board refreshment). The chronological background narrative supplies the engagement timeline; the reasons section supplies the substantive critique; appended fight letters and investor presentations in EX-99 exhibits supply the supporting charts. Output feeds activist-thesis taxonomies, rhetoric benchmarks, and pitch-book precedent libraries for special-situations desks and dissident-side advisers.
Pull the "Information about Director Nominees" section across dissident filings to assemble a structured table of activist nominees: age, principal occupation, prior directorships, committee experience, and stated independence. Cross-reference against the issuer's incumbent slate listed in the same filing under Rule 14a-19 universal-card disclosure. Used by proxy advisors building case memos, by issuer-side counsel vetting opposing nominees, and by researchers studying nominee characteristics and contest outcomes.
Parse the "Participants in the Solicitation" tables required by Item 4(b) and Instruction 3 of Schedule 14A to capture each participant's beneficial ownership and the two-year transaction history in the issuer's securities. Combined with the Item 6 / Item 403 5-percent and insider-ownership table, this supports group-formation analysis, Section 13(d) and Section 16 compliance review at broker-dealers and custodians, and litigation work product for issuer-side counsel arguing late or incomplete group disclosure.
Use post-August-2022 DEFC14As as drafting precedent for the universal-card mechanics: combined director-election proposal listing both slates, Rule 14a-19 notice language, "vote how" instructions, and the non-responsibility disclaimer applied to the opposing slate's biographies. Dissident-side counsel mine recent filings for compliant card layouts, instruction wording, and notice deadlines; issuer-side counsel use the same records to identify drafting weaknesses to challenge.
Filter the dataset by filedAt and the (Subject) ticker to surface new definitive contested solicitations as they are accepted. Extract record date, meeting date, vote standard, and proposal list from the proxy-statement notice and proposals sections. Feeds event-driven and merger-arbitrage workflows, including position sizing around vote-no campaigns, competing-bid scenarios, and contested mergers where a DEFM14A from the registrant is paired with a DEFC14A from an opposing dissident.
Across management-filed DEFC14As, extract the issuer's rebuttal of the dissident thesis, the defense of the incumbent slate, disclosed engagement history, and any standstill or cooperation-agreement language reproduced in exhibits. Corporate secretaries, IR vulnerability teams, and defense counsel use the corpus to benchmark response language, advance-notice bylaw posture, and engagement disclosures against peer contests when preparing pre-positioned defense materials.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-defc14a-files.json
This endpoint returns dataset metadata and the list of available container files. Metadata includes the dataset name, description, last updated timestamp, earliest sample date (1994-01-01), total record and size counters, covered form types (DEFC14A), container format (ZIP), and content file types (TXT, JSON, PDF, HTML). The response also includes the download URL for the full dataset and, for each container, its key, size, record count, last updated timestamp, and direct download URL.
This endpoint does not require an API key. It is the recommended way to monitor which containers were updated in the most recent refresh run, so you can decide on a daily basis which containers to re-download.
Example response:
1
{
2
"datasetId": "1f13365b-9ae0-6967-897e-298eb76d8ece",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-defc14a-files.zip",
4
"name": "Form DEFC14A Files Dataset",
5
"updatedAt": "2026-05-07T02:50:14.430Z",
6
"earliestSampleDate": "1994-01-01",
7
"totalRecords": 2525,
8
"totalSize": 245606478,
9
"formTypes": ["DEFC14A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "PDF", "HTML"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-defc14a-files/2026/2026-05.zip",
15
"key": "2026/2026-05.zip",
16
"size": 1843271,
17
"records": 6,
18
"updatedAt": "2026-05-07T02:50:14.430Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-defc14a-files.zip?token=YOUR_API_KEY
Downloads the full dataset as a single ZIP archive containing every monthly container from January 1994 to the latest refresh. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-defc14a-files/2026/2026-05.zip?token=YOUR_API_KEY
Downloads one individual monthly container instead of the full archive, which is useful for incremental updates after the initial bulk download. This endpoint requires an API key.
The dataset covers Form DEFC14A — the EDGAR submission type for a definitive proxy statement on Schedule 14A filed in connection with a contested solicitation under Section 14(a) of the Exchange Act and Regulation 14A. Preliminary contested filings (PREC14A, PRRN14A, PRER14A), additional soliciting materials (DEFA14A, DFAN14A), and revised definitive proxies (DEFR14A) live under different EDGAR form types and are not included.
One record is one accepted DEFC14A submission, materialized as a folder named with the EDGAR accession number with dashes removed (an 18-digit numeric string). The folder contains a metadata.json descriptor plus every document attached to the submission — typically the HTML proxy statement and any EX-99 exhibits — each wrapped in the EDGAR SGML <DOCUMENT> envelope. GRAPHIC image files (JPG, GIF, PNG) are enumerated in the metadata but are not extracted to disk.
Any "soliciting person" under Regulation 14A involved in a contested vote files on this submission type. In a contest, both sides file: the registrant (issuer-side) and any dissident — activist hedge funds, individual significant shareholders, shareholder groups acting in concert, former officers or directors, founding families, private equity sponsors running vote-no campaigns, or ad hoc shareholder committees. A single proxy fight typically generates two distinct DEFC14A records — one from each side — sharing the same subject CIK but carrying different filer CIKs.
DEFC14A is event-driven, not calendar-driven. After a mandatory PREC14A preliminary filing at least 10 calendar days before mailing, and a Division of Corporation Finance review window, the definitive proxy must be filed with the Commission no later than the date it is first sent or given to security holders. Filers typically aim to mail definitives 20 to 40 days before the meeting, with the population concentrating in the spring proxy season around annual meetings of calendar-year issuers.
The dataset begins January 1, 1994, when EDGAR became mandatory for most domestic registrants in phases between 1993 and 1996, and continues through the latest monthly refresh. Pre-EDGAR contested proxies exist only on paper and are outside this dataset. Filings dated after August 31, 2022, in director-election contests at non-investment-company registrants reflect the Rule 14a-19 universal proxy-card mechanics.
The payload file types are TXT, JSON, PDF, and HTML. JSON comes from the metadata.json descriptor; HTML is the dominant proxy-statement format for filings from roughly 2002 onward; TXT covers older ASCII filings (1994 through early 2002) and any plain-text exhibits; PDF appears for exhibit attachments such as scanned letters, fairness opinions, and investor presentations. Every document — including HTML and PDF — is wrapped in an EDGAR SGML <DOCUMENT> envelope, so strict HTML parsers must skip the SGML header before parsing the inner <HTML> block.
Both are definitive proxy statements on Schedule 14A under the same Section 14(a) framework, but DEF 14A covers uncontested solicitations while DEFC14A covers contested ones. DEF 14A is filed by essentially every public registrant once a year for routine annual meetings; DEFC14A is a small, episodic population concentrated around activist campaigns, contested slates, and contested transactions. The presence of a counter-solicitation or a Schedule 14A Item 5(b) participant disclosure obligation is what pushes a filing from DEF 14A to DEFC14A.
Records are distributed in monthly ZIP containers organized by calendar year (for example, 2026/2026-05.zip). The dataset index JSON at https://api.sec-api.io/datasets/form-defc14a-files.json lists every container with its key, size, record count, and last updated timestamp, and is the recommended way to detect which monthly containers were touched in the most recent refresh so that incremental downloads can be limited to changed containers.