Form 425 Files Dataset

The Form 425 Files dataset is a complete archive of EDGAR submissions filed on form type 425 — written communications made pursuant to Rule 425 under the Securities Act of 1933 in connection with proposed or pending business combinations (mergers, acquisitions, tender offers, exchange offers, and similar transactions). One record is a single EDGAR accession — a press release, investor presentation, analyst-call transcript, employee memo, shareholder letter, merger-agreement exhibit set, or other written communication about a deal — packaged as a directory of extracted filer-submitted documents plus a metadata.json descriptor. Records are filed by participants in the transaction, including the acquirer or issuer registering securities on Form S-4 or F-4, the target company, SPACs and de-SPAC targets, exchange-offer bidders, and affiliated parties such as controlling shareholders or joint bidders. The dataset begins at January 2000, tracking the start of meaningful filing activity under the SEC's 1999 M&A communications regime (Rel. No. 33-7760, effective January 24, 2000), and extends to the present. Records are grouped into monthly ZIP containers and distributed in the file types TXT, JSON, HTML, and PDF; image attachments are enumerated in metadata but stripped from the extracted tree.

Update Frequency
Daily
Updated at
2026-05-19
Earliest Sample Date
2000-01-01
Total Size
2.2 GB
Total Records
134,452
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF
Form Types
425

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

317 files · 2.2 GB
Download All
2026-05.zip42.0 MB404 records
2026-04.zip9.7 MB856 records
2026-03.zip8.5 MB587 records
2026-02.zip8.5 MB537 records
2026-01.zip8.5 MB457 records
2025-12.zip10.8 MB611 records
2025-11.zip10.0 MB585 records
2025-10.zip14.1 MB736 records
2025-09.zip9.3 MB690 records
2025-08.zip12.9 MB665 records
2025-07.zip10.7 MB618 records
2025-06.zip7.4 MB470 records
2025-05.zip5.7 MB393 records
2025-04.zip7.1 MB427 records
2025-03.zip7.4 MB373 records
2025-02.zip9.9 MB362 records
2025-01.zip7.5 MB364 records
2024-12.zip5.7 MB343 records
2024-11.zip5.5 MB325 records
2024-10.zip5.1 MB289 records
2024-09.zip6.3 MB384 records
2024-08.zip7.2 MB381 records
2024-07.zip7.8 MB429 records
2024-06.zip6.5 MB408 records
2024-05.zip10.1 MB447 records
2024-04.zip6.0 MB403 records
2024-03.zip7.2 MB450 records
2024-02.zip10.6 MB617 records
2024-01.zip9.6 MB544 records
2023-12.zip8.0 MB506 records
2023-11.zip9.8 MB494 records
2023-10.zip11.5 MB686 records
2023-09.zip9.7 MB567 records
2023-08.zip14.9 MB824 records
2023-07.zip9.7 MB634 records
2023-06.zip11.5 MB663 records
2023-05.zip13.3 MB719 records
2023-04.zip9.4 MB497 records
2023-03.zip10.0 MB587 records
2023-02.zip9.5 MB577 records
2023-01.zip18.8 MB546 records
2022-12.zip11.8 MB563 records
2022-11.zip9.1 MB477 records
2022-10.zip24.2 MB608 records
2022-09.zip14.5 MB614 records
2022-08.zip17.5 MB530 records
2022-07.zip9.9 MB481 records
2022-06.zip12.5 MB511 records
2022-05.zip13.0 MB685 records
2022-04.zip11.2 MB487 records
2022-03.zip10.8 MB627 records
2022-02.zip9.8 MB583 records
2022-01.zip8.9 MB573 records
2021-12.zip18.2 MB904 records
2021-11.zip16.3 MB845 records
2021-10.zip13.7 MB746 records
2021-09.zip14.4 MB793 records
2021-08.zip14.7 MB979 records
2021-07.zip24.6 MB1,209 records
2021-06.zip19.8 MB1,201 records
2021-05.zip20.5 MB1,098 records
2021-04.zip16.5 MB939 records
2021-03.zip19.8 MB1,098 records
2021-02.zip18.2 MB920 records
2021-01.zip10.7 MB617 records
2020-12.zip13.9 MB752 records
2020-11.zip12.0 MB658 records
2020-10.zip11.4 MB577 records
2020-09.zip7.7 MB496 records
2020-08.zip9.0 MB452 records
2020-07.zip5.8 MB317 records
2020-06.zip5.4 MB288 records
2020-05.zip4.3 MB312 records
2020-04.zip2.8 MB193 records
2020-03.zip4.6 MB330 records
2020-02.zip7.2 MB409 records
2020-01.zip7.5 MB449 records
2019-12.zip10.1 MB437 records
2019-11.zip6.5 MB373 records
2019-10.zip6.9 MB421 records
2019-09.zip5.8 MB337 records
2019-08.zip9.9 MB425 records
2019-07.zip6.6 MB383 records
2019-06.zip8.4 MB425 records
2019-05.zip7.0 MB420 records
2019-04.zip4.7 MB330 records
2019-03.zip5.8 MB417 records
2019-02.zip5.2 MB396 records
2019-01.zip8.8 MB559 records
2018-12.zip4.0 MB279 records
2018-11.zip10.6 MB604 records
2018-10.zip8.6 MB512 records
2018-09.zip8.1 MB507 records
2018-08.zip9.4 MB477 records
2018-07.zip6.4 MB479 records
2018-06.zip9.0 MB615 records
2018-05.zip8.3 MB543 records
2018-04.zip7.4 MB449 records
2018-03.zip9.8 MB599 records
2018-02.zip7.9 MB448 records
2018-01.zip5.3 MB398 records
2017-12.zip7.9 MB454 records
2017-11.zip7.3 MB415 records
2017-10.zip7.3 MB457 records
2017-09.zip3.7 MB281 records
2017-08.zip12.6 MB491 records
2017-07.zip6.5 MB368 records
2017-06.zip6.3 MB349 records
2017-05.zip8.5 MB522 records
2017-04.zip5.3 MB383 records
2017-03.zip4.8 MB340 records
2017-02.zip5.1 MB356 records
2017-01.zip7.4 MB473 records
2016-12.zip6.0 MB405 records
2016-11.zip9.3 MB556 records
2016-10.zip7.5 MB482 records
2016-09.zip5.4 MB411 records
2016-08.zip11.4 MB383 records
2016-07.zip6.5 MB399 records
2016-06.zip6.7 MB438 records
2016-05.zip6.6 MB487 records
2016-04.zip9.7 MB528 records
2016-03.zip6.6 MB493 records
2016-02.zip6.8 MB508 records
2016-01.zip8.6 MB552 records
2015-12.zip7.9 MB508 records
2015-11.zip13.4 MB821 records
2015-10.zip9.3 MB600 records
2015-09.zip10.5 MB643 records
2015-08.zip7.9 MB559 records
2015-07.zip9.7 MB641 records
2015-06.zip8.2 MB639 records
2015-05.zip11.5 MB629 records
2015-04.zip6.3 MB452 records
2015-03.zip7.7 MB566 records
2015-02.zip6.8 MB466 records
2015-01.zip6.5 MB511 records
2014-12.zip7.7 MB516 records
2014-11.zip11.5 MB698 records
2014-10.zip17.1 MB856 records
2014-09.zip12.2 MB585 records
2014-08.zip10.3 MB591 records
2014-07.zip11.1 MB669 records
2014-06.zip9.1 MB692 records
2014-05.zip5.9 MB444 records
2014-04.zip6.2 MB433 records
2014-03.zip4.8 MB315 records
2014-02.zip7.4 MB476 records
2014-01.zip4.4 MB332 records
2013-12.zip3.6 MB241 records
2013-11.zip4.8 MB339 records
2013-10.zip6.0 MB378 records
2013-09.zip4.9 MB336 records
2013-08.zip3.6 MB268 records
2013-07.zip7.1 MB390 records
2013-06.zip4.3 MB347 records
2013-05.zip5.5 MB451 records
2013-04.zip5.0 MB362 records
2013-03.zip6.7 MB390 records
2013-02.zip8.1 MB546 records
2013-01.zip3.5 MB273 records
2012-12.zip4.3 MB258 records
2012-11.zip9.4 MB268 records
2012-10.zip3.7 MB305 records
2012-09.zip3.4 MB266 records
2012-08.zip5.4 MB334 records
2012-07.zip4.3 MB219 records
2012-06.zip2.7 MB220 records
2012-05.zip3.7 MB256 records
2012-04.zip4.1 MB292 records
2012-03.zip4.2 MB276 records
2012-02.zip2.8 MB268 records
2012-01.zip2.2 MB188 records
2011-12.zip4.3 MB347 records
2011-11.zip5.0 MB375 records
2011-10.zip3.6 MB268 records
2011-09.zip4.4 MB329 records
2011-08.zip4.4 MB378 records
2011-07.zip7.2 MB389 records
2011-06.zip3.8 MB316 records
2011-05.zip7.2 MB456 records
2011-04.zip5.5 MB410 records
2011-03.zip3.8 MB341 records
2011-02.zip4.9 MB385 records
2011-01.zip4.4 MB337 records
2010-12.zip3.2 MB265 records
2010-11.zip3.3 MB263 records
2010-10.zip2.4 MB183 records
2010-09.zip4.4 MB369 records
2010-08.zip3.8 MB355 records
2010-07.zip3.8 MB274 records
2010-06.zip4.5 MB332 records
2010-05.zip5.0 MB343 records
2010-04.zip5.8 MB311 records
2010-03.zip15.0 MB351 records
2010-02.zip3.9 MB333 records
2010-01.zip3.3 MB260 records
2009-12.zip4.1 MB384 records
2009-11.zip6.2 MB516 records
2009-10.zip30.1 MB393 records
2009-09.zip5.0 MB377 records
2009-08.zip5.9 MB354 records
2009-07.zip5.7 MB472 records
2009-06.zip7.6 MB396 records
2009-05.zip6.5 MB375 records
2009-04.zip5.1 MB317 records
2009-03.zip3.4 MB291 records
2009-02.zip2.6 MB211 records
2009-01.zip3.0 MB290 records
2008-12.zip1.9 MB182 records
2008-11.zip3.5 MB299 records
2008-10.zip4.0 MB329 records
2008-09.zip4.2 MB333 records
2008-08.zip4.0 MB278 records
2008-07.zip6.0 MB407 records
2008-06.zip5.0 MB407 records
2008-05.zip5.2 MB317 records
2008-04.zip18.9 MB264 records
2008-03.zip2.3 MB177 records
2008-02.zip6.8 MB365 records
2008-01.zip3.7 MB294 records
2007-12.zip3.9 MB328 records
2007-11.zip4.2 MB407 records
2007-10.zip4.1 MB421 records
2007-09.zip4.4 MB396 records
2007-08.zip5.6 MB418 records
2007-07.zip6.2 MB482 records
2007-06.zip5.3 MB455 records
2007-05.zip8.1 MB574 records
2007-04.zip4.4 MB390 records
2007-03.zip5.0 MB518 records
2007-02.zip8.2 MB554 records
2007-01.zip5.9 MB419 records
2006-12.zip4.1 MB385 records
2006-11.zip4.3 MB384 records
2006-10.zip5.7 MB375 records
2006-09.zip5.7 MB413 records
2006-08.zip5.0 MB389 records
2006-07.zip5.1 MB411 records
2006-06.zip5.7 MB480 records
2006-05.zip6.0 MB446 records
2006-04.zip4.3 MB307 records
2006-03.zip4.5 MB346 records
2006-02.zip3.8 MB297 records
2006-01.zip8.7 MB485 records
2005-12.zip6.5 MB453 records
2005-11.zip4.6 MB395 records
2005-10.zip5.7 MB462 records
2005-09.zip4.7 MB496 records
2005-08.zip4.5 MB365 records
2005-07.zip18.7 MB530 records
2005-06.zip4.3 MB522 records
2005-05.zip5.8 MB572 records
2005-04.zip4.7 MB411 records
2005-03.zip5.9 MB523 records
2005-02.zip5.2 MB520 records
2005-01.zip4.1 MB425 records
2004-12.zip3.4 MB344 records
2004-11.zip3.5 MB364 records
2004-10.zip4.3 MB373 records
2004-09.zip2.5 MB255 records
2004-08.zip2.9 MB297 records
2004-07.zip3.0 MB259 records
2004-06.zip2.5 MB271 records
2004-05.zip3.8 MB240 records
2004-04.zip4.6 MB258 records
2004-03.zip2.9 MB325 records
2004-02.zip3.2 MB290 records
2004-01.zip4.1 MB257 records
2003-12.zip2.9 MB208 records
2003-11.zip3.3 MB237 records
2003-10.zip5.0 MB345 records
2003-09.zip2.0 MB234 records
2003-08.zip7.0 MB231 records
2003-07.zip3.0 MB322 records
2003-06.zip2.4 MB270 records
2003-05.zip2.1 MB197 records
2003-04.zip1.7 MB163 records
2003-03.zip1.9 MB230 records
2003-02.zip8.8 MB179 records
2003-01.zip2.8 MB135 records
2002-12.zip1.3 MB156 records
2002-11.zip23.5 MB203 records
2002-10.zip3.3 MB191 records
2002-09.zip1.1 MB129 records
2002-08.zip1.7 MB194 records
2002-07.zip4.7 MB252 records
2002-06.zip1.5 MB240 records
2002-05.zip2.2 MB272 records
2002-04.zip2.3 MB331 records
2002-03.zip2.6 MB336 records
2002-02.zip3.6 MB352 records
2002-01.zip2.2 MB314 records
2001-12.zip4.7 MB353 records
2001-11.zip7.2 MB331 records
2001-10.zip2.6 MB325 records
2001-09.zip2.2 MB188 records
2001-08.zip2.7 MB311 records
2001-07.zip2.9 MB340 records
2001-06.zip4.2 MB294 records
2001-05.zip3.7 MB283 records
2001-04.zip1.9 MB263 records
2001-03.zip1.6 MB204 records
2001-02.zip1.5 MB179 records
2001-01.zip1.2 MB192 records
2000-12.zip1.3 MB174 records
2000-11.zip2.1 MB286 records
2000-10.zip4.4 MB319 records
2000-09.zip10.0 MB241 records
2000-08.zip2.6 MB362 records
2000-07.zip3.4 MB306 records
2000-06.zip1.7 MB258 records
2000-05.zip2.2 MB229 records
2000-04.zip1.4 MB210 records
2000-03.zip1.8 MB319 records
2000-02.zip1.7 MB196 records
2000-01.zip654.5 KB30 records

What This Dataset Contains

The Form 425 Files dataset collects every EDGAR submission carrying form type 425 since January 2000. Each record corresponds to one accession number filed under Rule 425 in connection with a business combination. Physically, each record is a directory named for the 18-digit numeric accession number (for example 000119312525132373) containing a metadata.json descriptor plus the filer-submitted documents extracted from that accession. Accession directories are grouped into monthly ZIP containers. The file types preserved on disk are TXT, JSON, HTML, and PDF; image attachments (logos, slide-deck graphics, charts) that EDGAR received as GRAPHIC documents are enumerated in metadata.json but deliberately excluded from the extracted tree.

Form 425 is a cover designation EDGAR applies to written communications that constitute a prospectus or an offer to sell securities in connection with mergers, acquisitions, tender offers, exchange offers, or other business combinations. It is not a free-standing disclosure document in the sense of a Form 10-K or S-1; it is a filing wrapper around whatever written material the filer is releasing to the market — a press release, an investor presentation, a shareholder letter, a conference-call transcript, a social-media post, an employee memo, an advertisement, a website FAQ, or the text of an analyst-day deck. Rule 425 ensures that any such written communication used while a business-combination registration statement (typically an S-4 or F-4) is on file enters the public record carrying the prescribed legends and cross-references to that registration statement.

The record boundary is always the accession — never the transaction, never the filer, never an individual exhibit. A single multi-month merger campaign therefore produces many records (often dozens), one per 425 communication released as the deal progresses. Reconstructing a deal timeline requires grouping accessions by the filer/subject CIK pair and the registration-statement file number that each record carries. A substantial share of submissions are dual-designated 8-K/425, because the same written communication must be furnished under Form 8-K Item 7.01 (Regulation FD) or Item 8.01 (Other Events) while also satisfying Rule 425. In those cases the primary document is an 8-K body and the 425 content lives in the exhibits, but the EDGAR form type is still 425 for dataset-membership purposes.

Content Structure of a Single Record

Content layers of a single record

Each accession directory contains three logical layers:

  1. A structured JSON descriptor (metadata.json) summarizing the EDGAR header, enumerating the filer and subject entities, and listing every physical document in the submission with its SGML type, sequence, original filename, filer-supplied description, direct URL, and byte size.
  2. One or more extracted document files — predominantly .htm/.html, occasionally .txt, rarely .pdf — each wrapped in the SGML <DOCUMENT> envelope that EDGAR uses inside the full-submission text file.
  3. An implicit layer of image attachments (logos, slide-deck graphics, charts) that EDGAR received as GRAPHIC documents. These assets are enumerated in metadata.json but deliberately excluded from the extracted tree; only textual and document-format content is packaged.

The full-submission SGML text file (<accession>.txt) is enumerated as the trailing entry of documentFormatFiles but is not materialized on disk, because its content is equivalent to the concatenation of the per-document HTMLs together with their SGML headers.

metadata.json structure

metadata.json is a single JSON object per record. Its scalar top-level keys carry the EDGAR header values and pointers back to the original filing on sec.gov:

  • formType — always "425" for this dataset.
  • accessionNo — the dash-formatted accession number (for example "0001193125-25-132373").
  • filedAt — ISO 8601 timestamp of acceptance by EDGAR, including the Eastern Time offset.
  • description — the EDGAR form description, typically "Form 425 - Prospectuses and communications, business combinations".
  • linkToFilingDetails — URL to the primary document on sec.gov.
  • linkToTxt — URL to the full SGML .txt submission.
  • linkToHtml — URL to the EDGAR filing-index page.
  • id — internal deduplication identifier.

Two array-valued fields carry the analytically important content:

documentFormatFiles is the authoritative inventory of every physical document in the submission. Each element is an object with sequence (ordinal position as a string; the trailer entry for the complete-submission text file uses a blank " "), type (the EDGAR document type: "425", "EX-99.1", "EX-2.1", "EX-10.1", "EX-99.2", "GRAPHIC", or an empty string on the trailer), description (free text supplied by the filer), documentUrl (direct URL on sec.gov), and size (byte size as a string). Readers reconciling the manifest against the extracted tree should filter out entries where type == "GRAPHIC" and the blank-sequence trailer entry; what remains is the exact set of files that ship on disk alongside metadata.json.

entities is an array of filer/subject-company blocks. Typical 425 filings carry two entries — one for the acquirer/offeror and one for the target — disambiguated by a suffix appended to companyName: "(Filed by)" for the submitting party and "(Subject)" for the target. These suffixes are the authoritative role markers for the two sides of the transaction and must be stripped when joining to other EDGAR datasets on CIK. Each entity block exposes cik, companyName, fiscalYearEnd, stateOfIncorporation, act (the Securities/Exchange Act number, e.g. "34"), fileNo, irsNo, type (echoing the form type), sic (SIC code plus label), filmNo, and tickers (array of ticker symbols). fileNo is typically populated only on the (Subject) entity, where it references the active registration statement (for example 333-155412); this is the join key to the underlying S-4 or F-4 on the target's filing history. SPAC filings and other self-referential 425s may carry a single entity in which the filer and the subject are the same CIK.

Tail fields are always present but frequently empty on 425 records: seriesAndClassesContractsInformation: [], linkToXbrl: "", and dataFiles: [].

The primary 425 document

Every accession carries exactly one document of type: "425", and it is the sequence: "1" entry in documentFormatFiles. Each extracted HTML is preserved inside its SGML <DOCUMENT> envelope, so the file begins not with <!DOCTYPE html> or <HTML> but with the EDGAR header block:

1 <DOCUMENT>
2 <TYPE>425
3 <SEQUENCE>1
4 <FILENAME>d44940d425.htm
5 <DESCRIPTION>425
6 <TEXT>
7 <HTML>...</HTML>
8 </TEXT>
9 </DOCUMENT>

The header is machine-readable before any HTML parsing and exposes four redundant-but-useful values: <TYPE> (EDGAR document type), <SEQUENCE> (ordinal within the submission), <FILENAME> (original EDGAR filename), and <DESCRIPTION> (filer free text). The <TEXT>...</TEXT> block wraps the HTML body.

Inside the HTML body, the primary 425 document is typically a short cover-and-legend page with five characteristic components in order:

  1. A top-of-page identification block naming the filer, its CIK, the Rule 425 citation, the subject company, and the registration-statement file number — for example Filed by [Acquirer] / CIK No. [NNNNNNNNNN] / Pursuant to Rule 425 under the Securities Act of 1933 / Subject Company: [Target] / Registration Statement File Number: 333-NNNNNN.
  2. A banner describing the nature of the communication (press release, material fact, investor update, etc.).
  3. The "Important Notices and Additional Information" paragraph directing readers to the S-4 or F-4 prospectus on EDGAR and identifying participants in the solicitation.
  4. The "Forward-Looking Statements" safe-harbor paragraph.
  5. An exhibit index table cross-referencing each EX-99.x attachment with a short descriptor and (usually) an intra-filing anchor link.

When the submission is a dual 8-K/425, the primary document is an 8-K body rather than a Rule 425 cover page. It opens with an 8-K cover sheet and item heading (most commonly Item 7.01 Regulation FD Disclosure or Item 8.01 Other Events), carries a short narrative of the triggering event, ends with a signatures block under the reporting company's name, and includes its own exhibit index pointing into the merger-related attachments. The Rule 425 legends in that case migrate into the exhibits themselves rather than sitting on the cover.

Exhibit patterns

The payload shape below the primary document varies with the nature of the written communication:

  • Single-document filings. Many small or SPAC-era submissions ship only the primary 425 HTML with no separate exhibits; the communication is written into the body of the primary document itself. Filer and subject are frequently the same CIK in these cases.
  • Cover plus one EX-99.1. The canonical two-document shape: a short 425 cover carrying the legends plus a single EX-99.1 exhibit carrying the substantive material — typically a press release, a shareholder letter, a translated foreign material-fact disclosure, or a transcript. The body of EX-99.1 is usually a standalone document with its own headline, narrative, tables of key dates or transaction terms, references to prior filings, and a repeat of the forward-looking-statements legend.
  • 8-K/425 combos with full merger-document sets. The richest shape. The primary document is an 8-K body and the exhibit set often runs to five or more HTML documents: EX-2.1 for the merger or business-combination agreement (often tens of thousands of words of operative contract text with schedules and exhibits of its own), EX-10.1 for voting and support agreements or sponsor lock-ups, EX-99.1 for the joint press release, EX-99.2 for the investor presentation, and additional EX-99.x exhibits for transcripts, FAQs, and employee communications. Investor-presentation exhibits drive most of the image-stripping activity, because every slide background and chart image is enumerated as a separate GRAPHIC entry.
  • Cross-border deals. For transactions between a non-U.S. acquirer and a U.S. target (or vice versa), the entities array lists differing filer and subject CIKs, and the URL pointers inside metadata.json are not internally consistent: linkToTxt and linkToHtml reference the filer's CIK path on sec.gov, while linkToFilingDetails references the subject's CIK path. This asymmetry matters when joining 425 records to other CIK-keyed EDGAR datasets.
  • Image-heavy communications with thin on-disk content. When the substantive communication is an infographic mailer, an image-laden shareholder piece, or a slide deck, the extracted tree may consist of nothing but metadata.json and a thin HTML shell even though documentFormatFiles enumerates many GRAPHIC entries. The enumerated but absent images signal that the full visual rendering is not reproducible from the dataset alone.

SGML and HTML envelope quirks

Every extracted document retains the EDGAR SGML <DOCUMENT> framing even though the dataset is distributed as per-document files rather than as one concatenated .txt. Naive HTML parsers will therefore encounter four or five non-HTML header lines before the first <HTML> or <html> tag and must tolerate or strip them. The envelope is nevertheless a reliable secondary source of document metadata — <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> — that can be recovered from the file body even if metadata.json is lost.

HTML content inside <TEXT> is inconsistent across filers and eras. Some documents are clean semantic HTML produced by filing-agent software; others are word-processor exports littered with presentational <font> tags, fixed-width inline styles, and per-page page-break markers; older filings can contain blocks of pre-formatted text inside <PRE> tags rather than flowed HTML. Tables are common — exhibit indexes, key-terms summaries, cutover-date schedules, lists of prior filings — and in the modern era are almost always rendered as HTML <table> elements rather than as ASCII tables.

References to image assets inside the HTML (<img src="..."> tags pointing at the GRAPHIC filenames listed in metadata) resolve to broken links when the document is rendered locally, because those images are excluded from the extracted payload.

Image-stripping behavior

Per the dataset's packaging policy, the file types preserved on disk are TXT, JSON, HTML, and PDF; image files are excluded. Concretely, every document whose EDGAR type is GRAPHIC — JPG, GIF, and PNG logos, slide backgrounds, and chart images — is dropped from the extracted tree but remains fully enumerated in documentFormatFiles with its filename, size, and URL. Two consequences follow:

  • The on-disk file count for an accession is typically smaller than the count implied by documentFormatFiles; reconciliation requires filtering out type == "GRAPHIC" entries.
  • Any rendered view of an extracted HTML document will show placeholder broken-image boxes where the filer embedded logos or slide artwork. Reconstructing the original visual appearance requires fetching documentUrl values directly from sec.gov.

The complete-submission text file (for example 0001193125-25-132373.txt) is likewise enumerated in documentFormatFiles — typically as the trailing entry with a blank sequence of " " and an empty type — but is not materialized on disk. Its content equals the concatenation of the per-document HTMLs with their SGML headers.

Evolution of Form 425 content and format since January 2000

The Form 425 designation itself dates to the SEC's November 1999 adoption of the Rule 165/Rule 425 communications regime, which took effect in early 2000 and permitted parties to business combinations to freely disseminate written communications about a pending deal provided those communications were filed on EDGAR with the requisite legends. The dataset's January 2000 start date tracks the beginning of meaningful filing activity under that regime.

Several material shifts shape how record contents have evolved:

  • Adoption curve (2000–2003). Early-regime filings are sparse and narrow in document-type diversity: most are short press-release exhibits, and the primary 425 cover is often little more than the mandated identification block and legends. Investor presentations and transcripts appear regularly only later in the decade.
  • Text-era formatting (2000–2002). The earliest 425 filings were transmitted as plain ASCII inside the EDGAR SGML envelope. Document bodies use <PRE>-wrapped fixed-width text, tables are drawn with pipes and dashes, and embedded graphics are absent. The <DOCUMENT> envelope structure, however, is identical to the modern one.
  • HTML adoption (2002–2005). As EDGAR progressively permitted and then encouraged HTML submissions, 425 bodies shifted from ASCII blocks to flowed HTML with genuine <table> elements, inline styles, and the first GRAPHIC exhibits (logos and charts). By the late 2000s HTML is the overwhelming norm.
  • Rise of the 8-K/425 combination. Following the 2004 expansion of Form 8-K triggering events, a growing share of 425 communications arrive as 8-Ks with a 425 dual-designation. This shifts the typical record from a thin 425 cover with one EX-99.1 toward a richer 8-K-style record with Item headers, multi-exhibit contract sets (EX-2.x, EX-10.x), and multi-part press-release-plus-presentation payloads.
  • Communications-type expansion. The range of what counts as a Rule 425 written communication has broadened over time to explicitly cover investor-day decks, road-show transcripts, employee Q&A documents, website postings, and social-media content. Record payloads have correspondingly broadened to include slide-deck HTMLs (accompanied by large numbers of stripped GRAPHIC images), transcript exhibits, and occasional screenshot-style HTML pages.
  • SPAC-era surge (2020–2022). The de-SPAC wave produced a surge of 425 filings with a distinctive profile: a SPAC sponsor filing 425 communications about a target, frequently with extensive merger-agreement, PIPE-subscription, and voting-support-agreement exhibits, and often with single-entity metadata because some targets were not yet SEC-registered reporting entities.
  • Legend and participant-disclosure refinements. Periodic SEC guidance has refined the expected language of the "Important Notices" and "Participants in the Solicitation" legend paragraphs; modern 425 covers therefore carry more specific boilerplate — including named directors and officers and pointers to the S-4/F-4 section describing their interests — than their early-2000s counterparts.

Form 425 filings do not carry XBRL structured data; linkToXbrl and dataFiles appear in metadata.json for schema uniformity with other EDGAR datasets but are empty on this form type.

Interpretation and extraction notes

  • The accession is the record, not the transaction. A single merger produces many 425 records over its life. Reconstructing a deal timeline requires grouping on the (Filed by)/(Subject) CIK pair and the registration-statement fileNo.
  • Role markers are embedded in companyName. The (Filed by) and (Subject) suffixes are the authoritative disambiguators for the two entities. Strip them when joining to other EDGAR datasets on CIK.
  • Registration-statement linkage sits on the subject. fileNo under the (Subject) entity is the join key to the underlying S-4 or F-4 on the target's filing history; the corresponding field on the (Filed by) entity is typically blank.
  • Cross-border URL asymmetry. In cross-border deals, linkToTxt and linkToHtml reference the filer's CIK path while linkToFilingDetails references the subject's CIK path. Consumers that canonicalize URLs should expect this inconsistency.
  • Strip or tolerate the SGML envelope. Each HTML is wrapped in a <DOCUMENT> header block; HTML renderers and parsers must skip or accept these pre-<HTML> lines. The envelope doubles as a fallback source of document-type and filename metadata.
  • Images are unavailable locally. <img> references resolve to broken links, and any analysis depending on visual content must fall back to documentUrl on sec.gov.
  • Amendments are out of scope. Submissions bearing form type 425/A are not part of this dataset; only raw 425 form-type accessions are included.
  • 8-K/425 combos are single accessions. When a written communication is filed as a combined 8-K/425, there is exactly one accession and it is included here; when the filer chooses to submit the 8-K and the 425 separately, the 425 side is the accession that appears in this dataset.
  • Canonical on-disk reconciliation. The authoritative inventory of what ships in an accession folder is documentFormatFiles minus (a) entries with type == "GRAPHIC" and (b) the blank-sequence trailer entry for the complete-submission text file. The resulting filtered list should match the non-metadata.json files present in the directory.

Who Files or Publishes This Dataset, and When

Filer population

The filer of a Form 425 record is a participant in a specific business combination (or an affiliate or representative) whose written communication qualifies as a prospectus, offer, or proxy solicitation under Securities Act Rule 425. The defining trait of the filer population is transactional: each filer is tied to a specific business combination and is speaking publicly about it in writing. Ordinary-course capital raises use Form FWP (Rule 433), not Form 425.

  • Acquirer / issuer of the securities to be registered. In a stock or mixed-consideration merger or registered exchange offer, the acquirer registering new securities on Form S-4 or F-4 is typically the primary Form 425 filer for its own written communications about the deal.
  • Target / subject company. The target files Form 425 for written materials it disseminates to its security holders. When the transaction involves a registration statement, those communications are dual-characterized as proxy soliciting material under Exchange Act Rule 14a-12 (and, in a registered exchange offer, as Rule 14d-9 recommendation material), and the Form 425 filing satisfies the Rule 14a-12 and Rule 14d-9 filing obligations in a single channel.
  • SPACs and de-SPAC targets. Both the SPAC shell (nominal acquirer/registrant) and the private operating target file Form 425 during the announcement-to-closing window. De-SPAC transactions dominate modern Form 425 volume because the marketing period runs for months and both parties publish continuously.
  • Exchange-offer bidders. In a registered exchange offer, the bidder files Form 425 for written offering communications used before the S-4/F-4 is effective; it separately complies with Regulation 14D/14E on the tender-offer side.
  • Affiliates, controlling stockholders, and joint bidders. Parent companies, financial sponsors, rollover shareholders, and co-bidders that author or co-sign communications about the deal file Form 425 in their own names.
  • Agents and representatives. Financial advisors, proxy solicitors, and information agents do not file in their own names; communications they distribute on behalf of a party are filed by the principal.

Governing regulatory framework

Form 425 implements Securities Act Rule 425 (17 CFR 230.425), adopted in the SEC's 1999 M&A communications release (Rel. No. 33-7760, effective January 24, 2000). It interlocks with:

  • Securities Act Rule 165 — safe harbor permitting written offers about a registered business combination from the first public announcement forward, conditioned on filing each written communication under Rule 425 on the date of first use.
  • Securities Act Rule 166 — safe harbor for pre-public-announcement communications; these are non-public and not filed.
  • Rule 14a-12 — permits written proxy solicitation before a proxy statement is delivered, conditioned on filing each soliciting communication on the date of first use. A Form 425 filing is deemed to satisfy Rule 14a-12 when the transaction involves a registration statement.
  • Exchange Act Rules 14d-2(b) and 14d-9(a) — govern pre-commencement written communications by bidders and target responses in tender/exchange offers. A Form 425 filing is deemed to satisfy these filing requirements in a registered exchange offer.
  • Forms S-4 and F-4. Form 425 accompanies (and is incorporated by reference into) the S-4 or F-4 covering the securities to be issued. Each Form 425 communication is a statutory "prospectus" under Section 2(a)(10) and carries Section 12(a)(2) liability.
  • Rule 425(b) legend. Every Form 425 filing must carry the prescribed legend pointing investors to the registration statement and proxy/prospectus on EDGAR, with participants in the solicitation identification where Rule 14a-12 applies.

Statutory authority runs through Securities Act Sections 5, 10, and 12 and Exchange Act Sections 14(a) and 14(d)-(e). There is no pre-2000 history for the form; before the M&A release, equivalent communications were either withheld until the registration statement was effective or filed as Schedule 14A additional soliciting material.

What triggers a Form 425 filing

The trigger is the first public use of a written communication that is a prospectus, offer to sell securities, or proxy solicitation in connection with a business combination within Rule 165(f). "Written communication" is read broadly and includes:

  • press releases and joint announcements
  • investor presentations, decks, and road-show materials
  • scripts and transcripts of analyst calls, investor conferences, earnings-call segments addressing the deal, and media interviews
  • letters, memos, and emails to employees, customers, or shareholders
  • Q&A documents, talking points, and FAQs
  • website and microsite content
  • social-media posts, videos, and podcasts
  • third-party content a party adopts or entangles itself with

Each such communication triggers a same-day Form 425 filing on the date of first use. There is no next-business-day grace period. Form 425 is a "filed" document, not "furnished": it is incorporated by reference into the S-4/F-4 and carries Section 11 and Section 12(a)(2) liability, plus Section 14 liability where it also functions as proxy soliciting material.

Triggers by timeline:

  • Pre-announcement (Rule 166). Not filed; non-public.
  • Announcement day. The initial deal press release, joint announcement, term sheet, and investor deck are filed on Form 425 the day they are first used.
  • Pre-effective registered-offer period (Rule 165). Every written communication used between announcement and effectiveness of the S-4/F-4 is filed on Form 425 on the date of first use.
  • Proxy soliciting communications (Rule 14a-12). Written soliciting material issued before the definitive proxy is mailed is filed on Form 425 when the transaction is a registered combination.
  • Exchange-offer pre-commencement communications (Rules 14d-2(b), 14d-9(a)). Bidder and target written communications before tender-offer commencement flow through Form 425 when the exchange offer is registered.
  • Ongoing deal updates. Each new release — regulatory or antitrust clearance, revised terms, synergy updates, vote reminders — triggers its own same-day Form 425. A single transaction typically generates many records.
  • Termination communications. A release announcing abandonment or termination is itself a communication about the combination and is filed on Form 425. The Rule 165 safe harbor and the filing obligation end when the transaction is no longer proposed; prior Form 425 filings are not withdrawn.

Timing and filing mechanics

  • Deadline. Same day of first use. "First use" means first public dissemination, so many Form 425 submissions appear on EDGAR within hours of a wire release.
  • Event-driven, not periodic. There is no schedule; the form exists only because a communication was issued.
  • Filed, not furnished. Rule 425 has no furnishing option comparable to Form 8-K Items 2.02/7.01.
  • No amendment regime. There is no 425/A. Errors are addressed by issuing and filing a corrective communication as a new Form 425.

Important distinctions and edge cases

  • Filer vs. subject company. The cover identifies both. The filer and the "subject company" (the company whose holders are being solicited or whose securities are the subject of the offer) are often different entities, and both sides of a deal routinely file Form 425 referencing the same transaction.
  • Rule 425 vs. Rule 433 (Form FWP). Form FWP is the free writing prospectus channel for ordinary registered offerings; Form 425 is the parallel channel for business combinations. Deal communications go on 425, not FWP.
  • Rule 425 vs. Schedule 14A additional soliciting material. An all-cash merger requiring a shareholder vote but no securities registration uses Schedule 14A additional soliciting material under Rule 14a-12, not Form 425. Form 425 requires a registration statement or registered exchange offer.
  • Rule 425 vs. Schedule TO-C. A cash tender offer with no registration uses Schedule TO-C under Rule 14d-2(b). Form 425 applies only when the exchange offer is registered.
  • SPAC / de-SPAC. Volume is high because both the SPAC and the target communicate continuously during the marketing window; nearly every deck revision, interview, and update is filed on Form 425.
  • Foreign private issuers. FPIs use Form 425 in cross-border deals relying on Form F-4 or a registered exchange offer. Rule 801/802 exempt transactions follow a separate regime.
  • Third-party content. Analyst reports and unaffiliated media are not Form 425 filings unless a transaction participant adopts or entangles itself with the content.
  • Legend defects. Omitting the Rule 425(b) / Rule 14a-12 legend does not make the filing unfiled but creates enforcement exposure; the filing obligation itself does not depend on legend compliance.
  • Cessation. Filers typically continue filing Form 425 for soliciting material after S-4/F-4 effectiveness through the vote and closing, though the Rule 165 pre-effective safe harbor drops out at effectiveness. When the deal is abandoned, the duty to file new Form 425 communications ends.

How This Dataset Differs From Similar Datasets or Filings

Form 425 is defined by the form of a communication, not by the substance of a transaction. Rule 425 sweeps in any written communication a party disseminates about a registered business combination — press releases, slide decks, transcripts, employee FAQs, social media scripts — and requires it be filed as a prospectus. Because the same document frequently satisfies multiple regimes at once, Form 425 filings overlap heavily with registration statements, proxy material, tender offer schedules, 8-Ks, and free writing prospectuses. Choosing between Form 425 and its neighbors usually comes down to three questions: is the consideration stock or cash, is this a standalone communication or a full transactional document, and which statutory safe harbor is the filer trying to preserve.

Form S-4 and Form F-4 registration statements

S-4 (domestic) and F-4 (foreign private issuer) are the registration statements used when securities are issued as consideration in a business combination. An S-4 is typically a joint proxy/prospectus containing business descriptions, pro forma financials, fairness opinions, the merger agreement, and risk factors. Form 425 sits on top of that registration: Rule 425 treats pre-filing written communications about the deal as a prospectus that must be filed to preserve the Rule 165/166 safe harbor leading up to (and alongside) the S-4.

Reach for S-4/F-4 when you need definitive transaction terms and audited financial disclosure. Reach for Form 425 when you want the running stream of shorter communications the parties issue across the deal timeline.

Schedule 14A variants: DEF 14A, DEFA14A, PREC14A, DEFM14A

Schedule 14A governs proxy solicitations under Section 14(a). Three variants are the closest proxy-side analogs to Form 425:

  • DEFA14A ("additional soliciting material") is the Exchange Act counterpart to Rule 425 communications. Most merger press releases and investor decks are dual-filed: once as 425 (to satisfy Securities Act Rule 165/166) and once as DEFA14A (to satisfy Exchange Act Rule 14a-12). Neither corpus alone captures every communication around a stock deal.
  • PREC14A is preliminary contested proxy material, used in proxy fights. It overlaps with 425 only when the contest concerns a business combination involving registered securities.
  • DEFM14A is the definitive merger proxy — the long-form document containing deal background, board deliberations, fairness opinions, voting agreements, and the merger agreement as an annex. It tells you what the parties put to a vote; 425 tells you how they talked about the deal in the weeks and months before.

Reach for DEFM14A and the S-4 for the definitive deal narrative and terms. Reach for 425 (paired with DEF 14A) for the communications stream.

Schedule TO variants (TO-C, TO-I, TO-T) and Schedule 14D-9

Schedule TO covers tender offers. TO-C is pre-commencement written communications — the tender-offer analog to Rule 425. TO-I is an issuer tender offer; TO-T is a third-party tender offer; 14D-9 is the target's recommendation statement.

The split between 425 and the TO family is transaction structure:

  • Cash tender, no securities issued: pre-commencement communications live in TO-C, not 425.
  • One-step stock-for-stock merger: pre-filing communications live in 425.
  • Exchange offer or two-step deal with stock consideration: communications are often dual-filed as 425 and TO-C.

An analyst studying a hostile or two-step deal typically needs 425, TO-C, and Schedule 14D-9 together.

Form 8-K, Items 1.01, 7.01, and 8.01 (the most common confusion)

A single merger press release is commonly filed three ways at once: as an 8-K (Item 1.01 for entry into the merger agreement, Item 7.01 for Reg FD disclosure of the announcement, or Item 8.01 for other material events), as a Form 425, and often as a DEFA14A. The 8-K is an Exchange Act current report by a reporting company; the 425 is a Securities Act prospectus filing preserving the Rule 165/166 safe harbor for written offers.

The sharpest confusions:

  • 8-K Items 2.02/7.01 vs. 425: both can carry the announcement press release. 7.01 is Reg FD disclosure by an Exchange Act registrant; 425 is a written offer filing by any party to the combination. A non-reporting acquirer has no 8-K obligation but still files 425.
  • 8-K Item 1.01 vs. 425: 1.01 attaches the executed merger agreement as a contract disclosure; 425 attaches communications about the deal. The same press release may appear as an exhibit on both.
  • Joint proxy/prospectus vs. 425: the joint proxy/prospectus is the S-4/DEFM14A document itself, not a 425. 425 is the communication that references or precedes it.

Practical implication: an 8-K-only pull misses communications from non-reporting acquirers and supplemental materials (employee memos, investor decks, transcripts) that a filer elects to file only as 425. A 425-only pull misses the formal Item 1.01 contract disclosures and any non-425 exhibits to the 8-K. The two datasets are complementary, not substitutable.

Form FWP (free writing prospectus)

FWP under Securities Act Rule 433 is the procedural cousin to 425: both are Securities Act communication filings sitting outside a statutory prospectus. The split is subject matter. FWP covers registered non-M&A offerings (debt, follow-on equity, structured notes, shelf takedowns); 425 covers business-combination communications. The two corpora do not meaningfully overlap in content.

Schedule 13D and Schedule 13E-3

Neither is a communications filing. Schedule 13D is a beneficial-ownership filing triggered at 5% with non-passive intent, useful for tracking stake accumulation before any 425 traffic appears. Schedule 13E-3 is the going-private transaction statement required when an affiliate takes a public company private, typically filed alongside a Schedule TO or DEFM14A. Both are useful adjacencies when assembling a full deal file, but they answer ownership and transaction-status questions — not communication questions — and are not substitutes for 425.

Boundary summary

Form 425 is the running written-communications layer around stock-for-stock mergers and exchange offers. It is narrower than 8-K (425 only covers written deal communications), narrower than S-4/DEFM14A (which are the comprehensive transaction documents), and narrower than Schedule TO (which covers tender offers regardless of consideration). It is broader only in volume: a single deal can generate dozens of 425 filings across months.

Reach for this dataset when you want:

  • the full sequence of press releases, decks, transcripts, and employee communications around a stock-consideration deal,
  • communications from non-reporting acquirers that never surface in 8-K,
  • a message-tracking or event-study corpus across the pre-announcement, pendency, and closing windows.

Pair it with S-4/F-4 and DEFM14A for definitive terms, 8-K Items 1.01/7.01/8.01 and DEFA14A for overlapping formal disclosure, Schedule TO and 14D-9 for tender/exchange offers, 13D for stake accumulation, and 13E-3 for going-private mechanics.

Who Uses This Dataset

Because Rule 425 captures every written communication that constitutes a prospectus or offer in a business combination, the Form 425 Files dataset is a near-complete archive of how deal parties describe and defend transactions from announcement to close. Different audiences mine different elements: cover-page registrant/subject-company CIKs, forward-looking and solicitation legends, press-release and deck exhibits, dual 425 / DEFA14A designations, and cross-links to the related S-4, F-4, or definitive proxy.

Merger-arbitrage desks and event-driven analysts

Arb traders price the spread and estimate break risk from announcement to close. They pull exchange ratios, collar mechanics, cash/stock mix, termination fees, and antitrust, financing, or regulatory conditions out of the press-release and Q&A exhibits, then track each incremental 425 (road-show scripts, town-hall decks, regulatory update letters) for tone shifts and revised timelines. Filing timestamps anchor event-study windows; spikes in 425 cadence near a vote flag re-pricing risk.

M&A investment banking and fairness-opinion teams

Bankers use the corpus as a precedent library. Juniors assemble pitches from comparable-deal 425 sets to show premium framing, synergy narratives, and strategic rationale. Fairness-opinion teams pull press-release exhibits and investor decks to reconstruct disclosed terms, accretion/dilution framing, and pro forma metrics that peer advisers signed off on. Analyst-day and joint investor materials filed on 425 benchmark disclosure depth for live mandates.

M&A and securities counsel

Deal counsel treat the dataset as a drafting precedent bank. They compare forward-looking-statement legends, participants-in-the-solicitation disclosures, no-offer/no-solicitation language, and additional-information legends across peer transactions to satisfy Rule 425, Rule 14a-12, and Regulation M-A. They also study how others handled employee letters, customer communications, and social-media posts filed as 425 exhibits, and whether peers cross-filed 425 as DEFA14A or as an S-4 communication.

Corporate development and integration teams

In-house strategy and integration groups study how analogous deals communicated rationale and synergies. They harvest cost-synergy phasing, revenue-synergy caveats, integration-leadership announcements, cultural-fit messaging, and customer-retention commitments to shape board materials and their own future 425 playbook.

Compliance officers and proxy solicitors

Compliance teams confirm every client communication in a live deal is filed, legended, and tied to the registration statement or proxy. Proxy solicitors track 425 cadence and content from both sides of a contested or friendly deal to calibrate outreach, vote recommendations, and meeting messaging, focusing on subject-company identification, the exhibit list, and each investor-facing piece.

Financial journalists and deal-database editors

Reporters and deal-database editorial teams reconstruct deal timelines from 425 press-release exhibits, joint statements, and slide decks, capturing the first public articulation of terms and every amendment. Near-real-time 425 feeds surface unexpected filings that signal proxy fights, counter-bids, or structural changes.

Academic researchers in corporate finance and law

Researchers build multi-decade panels of merger communications. Structured metadata plus full text supports studies of premium disclosure, forward-looking-statement boilerplate evolution, communication intensity versus completion probability, linguistic features of completed versus terminated deals, and the role of employee and social-media communications as 425 exhibits. Legal scholars track how participants-in-the-solicitation disclosure has shifted since Regulation M-A modernization.

Data vendors and NLP / LLM teams

Machine-learning teams ingest the corpus to train entity extraction (acquirer/target CIK and name from the cover page), communication-type classification (press release, deck, employee memo, call transcript), quantitative deal-term extraction (exchange ratio, cash component, collars), and per-transaction clustering. Retrieval-augmented systems for M&A workflows use 425 text as grounded source, and CIK links to S-4 / F-4 and DEFA14A unify communications into transaction-level graphs.

Regulatory-affairs staff and SEC comment-letter counsel

Regulatory teams benchmark client legend language, safe-harbor statements, non-GAAP reconciliations, projections, and bring-down risk factors against large 425 samples, and trace how prior staff comments reshaped communication practice.

Deal litigators: 14a-9, fiduciary-duty, and appraisal

Plaintiffs' and defense counsel use 425 filings as primary evidence in disclosure litigation. Every communication is reviewed to reconstruct what shareholders knew, when, and from whom in Section 14(a)(9) and fiduciary-duty claims. Appraisal support teams trace publicly disclosed management projections, synergy representations, and process narratives against board-book and data-room evidence. The complete chronological record across paired acquirer and target filings exposes contradictions and shows whether supplemental 425 disclosures cured or compounded alleged omissions.

Specific Use Cases

Each use case below ties a specific workflow to concrete record fields, exhibit types, or filing combinations in the Form 425 corpus.

Pricing the arb spread on a live stock-for-stock deal

Primary user: merger-arbitrage analyst on an event-driven desk. Pull every 425 accession for the deal by grouping on the (Filed by) / (Subject) CIK pair and the fileNo on the subject entity (the S-4 or F-4 number). From the EX-99.1 press-release exhibit on the announcement 8-K/425, extract exchange ratio, collar mechanics, cash component, termination fees, and regulatory conditions; from each follow-on 425 (road-show transcripts, regulatory-update letters, town-hall decks) extract incremental timeline language. Use filedAt timestamps to anchor event-study windows and to flag cadence spikes near the shareholder vote that signal repricing risk.

Building a forward-looking-statement legend precedent bank

Primary user: M&A securities counsel drafting deal communications. Query the primary type: "425" cover document across accessions in the same SIC code and deal-size band, then pull the "Important Notices and Additional Information" and "Forward-Looking Statements" paragraphs plus the "Participants in the Solicitation" block. Compare legend wording, named directors/officers, and S-4 cross-references against current staff guidance to calibrate client drafts for Rule 425, Rule 14a-12, and Regulation M-A compliance. The dual 8-K/425 submissions expose where peers placed legends on the cover versus migrated them into exhibits.

Extracting merger-agreement and voting-support exhibits at scale

Primary user: corporate-development analyst building a precedent library. Filter documentFormatFiles for type == "EX-2.1" (merger or business-combination agreement) and type == "EX-10.1" (voting/support agreements, sponsor lock-ups) inside 8-K/425 combo accessions. The operative contract text — with schedules, termination triggers, and no-shop carve-outs — supports clause-level comparison of fiduciary outs, reverse-termination fees, and lock-up terms across peer transactions, feeding both pitch decks and board-memo templates without requiring a separate S-4 pull.

Monitoring de-SPAC communication cadence and PIPE disclosure

Primary user: SPAC-focused research analyst or compliance monitor. Isolate SPAC-era accessions (2020-2022 surge) by filtering on single-entity metadata blocks where filer CIK equals subject CIK, or on SIC codes associated with blank-check companies. Track the sequence of 425 filings per target to map PIPE-subscription disclosures, projections updates, and sponsor-forfeiture amendments in the EX-99.2 investor-presentation exhibits. Cadence drops or projection revisions often precede redemption spikes and deal renegotiations.

Reconstructing the disclosure record for 14a-9 and fiduciary-duty litigation

Primary user: plaintiffs' or defense counsel in merger-disclosure litigation. Assemble the full chronological 425 stream for both sides of the transaction by joining on the shared fileNo, then diff successive EX-99.1 press releases, employee FAQs, and analyst-day transcript exhibits against the DEFM14A narrative to identify omissions, corrections, or supplemental disclosures. The paired acquirer/target filings expose contradictions between the two sides' characterizations of synergies, process, and projections — the evidentiary backbone of Section 14(a)(9) claims and appraisal support.

Training deal-intelligence NLP models on labelled communications

Primary user: NLP engineer at a financial-data vendor. Use the type field in documentFormatFiles as weak supervision for communication-class labels: EX-99.1 for press releases, EX-99.2 for investor decks, EX-2.1 for operative contracts, other EX-99.x for transcripts, FAQs, and employee memos. Entity extraction is grounded by the (Filed by) / (Subject) suffixes in companyName plus the CIK and ticker arrays; transaction-level clustering joins 425 records to the underlying S-4/F-4 via the subject's fileNo. The SGML <DOCUMENT> envelope provides fallback labels when metadata.json is unavailable.

Surfacing counter-bids and proxy-contest escalations in near real time

Primary user: financial journalist or deal-database editor. Stream new 425 accessions by filedAt and flag any filing where a new acquirer CIK appears in the entities array against an existing target, or where a target suddenly files its own 425 under a different fileNo. Short primary 425 covers with a single EX-99.1 press-release exhibit bearing unexpected rationale language frequently mark topping bids, revised exchange ratios, or the opening salvo of a contested proxy solicitation before wire-service pickup.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-425-files.json

This endpoint returns metadata describing the Form 425 Files dataset, including its name, last update timestamp, earliest sample date (2000-01-01), covered form types (425), container format (ZIP), included file types (TXT, JSON, HTML, PDF), the full dataset download URL, and the list of monthly container files. Each container entry includes its key (e.g., 2025/2025-05.zip), size, record count, last updated timestamp, and per-container download URL. The index can be polled to monitor which containers changed in the most recent refresh run, enabling incremental downloads of only the containers that have been updated. No API key is required to read the index.

Example
1 {
2 "datasetId": "1f13365b-9ae0-68e8-b024-63ccb2249299",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-425-files.zip",
4 "name": "Form 425 Files Dataset",
5 "updatedAt": "2026-04-23T02:56:30.711Z",
6 "earliestSampleDate": "2000-01-01",
7 "totalRecords": 133787,
8 "totalSize": 2176724564,
9 "formTypes": ["425"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-425-files/2025/2025-05.zip",
15 "key": "2025/2025-05.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-23T02:56:30.711Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-425-files.zip?token=YOUR_API_KEY

Downloads the complete Form 425 Files dataset as a single ZIP archive covering all filings from 2000 to present. This endpoint requires a valid API key passed via the token query parameter.

Download Single Container: https://api.sec-api.io/datasets/form-425-files/2025/2025-05.zip?token=YOUR_API_KEY

Downloads one monthly container ZIP file instead of the full archive. Container paths follow the <YEAR>/<YEAR>-<MONTH>.zip pattern shown under each container's key in the index response. Each container decompresses into per-accession folders, where every folder contains a metadata.json file alongside the filing's primary document and exhibits. This endpoint requires a valid API key passed via the token query parameter.

Frequently Asked Questions

What form does this dataset cover?

The dataset covers EDGAR submissions with form type 425 — written communications filed pursuant to Securities Act Rule 425 in connection with proposed or pending business combinations (mergers, acquisitions, exchange offers, and similar transactions). Amendments bearing form type 425/A are out of scope; only raw 425 form-type accessions are included.

What does one record in the Form 425 Files dataset represent?

One record is a single EDGAR accession — one submission filed under form type 425. Physically, each record is a directory named for the 18-digit numeric accession number containing a metadata.json descriptor plus the filer-submitted documents extracted from that accession (predominantly .htm/.html, occasionally .txt, rarely .pdf). The record boundary is always the accession, never the transaction, so a single merger typically produces many records over its lifecycle.

Who is required to file Form 425?

Form 425 is filed by participants in a business combination whose written communications qualify as a prospectus, offer, or proxy solicitation under Rule 425. Filers include the acquirer or issuer registering securities on Form S-4 or F-4, the target company, SPACs and de-SPAC targets, registered exchange-offer bidders, and affiliated parties such as parent companies, controlling shareholders, and joint bidders. Agents such as financial advisors and proxy solicitors do not file in their own names.

When must a Form 425 filing be made?

Each Form 425 filing is triggered on the date of first public use of a written communication about the business combination. The deadline is same-day — there is no next-business-day grace period. Form 425 is event-driven rather than periodic, and a single deal can generate many filings across the announcement-to-closing window.

How does Form 425 differ from Form 8-K and from DEFA14A?

Form 425 is a Securities Act prospectus filing preserving the Rule 165/166 safe harbor for written offers; Form 8-K is an Exchange Act current report by a reporting company; DEFA14A is additional soliciting material under Exchange Act Rule 14a-12. A single merger press release is often filed all three ways at once. A 425-only pull misses formal 8-K Item 1.01 contract disclosures, while an 8-K-only pull misses communications from non-reporting acquirers and supplemental materials filed only as 425.

What file format is the dataset distributed in, and how are images handled?

The dataset is distributed as monthly ZIP containers organized in a <YEAR>/<YEAR>-<MONTH>.zip path pattern. Preserved file types are TXT, JSON, HTML, and PDF. Image attachments (JPG, GIF, PNG) that EDGAR received as GRAPHIC documents are enumerated in each record's metadata.json under documentFormatFiles but are deliberately excluded from the extracted tree; reconstructing the original visual appearance requires fetching the documentUrl values directly from sec.gov.

What time period does the dataset cover?

The dataset begins January 2000, aligned with the effective date of the SEC's 1999 M&A communications release (Rel. No. 33-7760, effective January 24, 2000) that created the Rule 165/Rule 425 regime. Coverage extends to the present and is refreshed with new monthly containers; the dataset index JSON can be polled to detect which containers have changed since the previous refresh.