Form SC14D9C Files Dataset

The Form SC14D9C Files Dataset is a complete EDGAR archive of subject-company written communications relating to third-party tender offers, filed under Rule 14d-9(a) under Section 14(d)(4) of the Securities Exchange Act of 1934. One record corresponds to a single SC14D9C accession on EDGAR: the Schedule 14D-9 cover, any attached EX-99.x exhibits, and a structured metadata.json summary of the submission. The filer is the target of the tender offer (the subject company), not the bidder, and each separate written communication (press release, employee FAQ, customer letter, investor deck, transcript) is filed as its own SC14D9C submission. The dataset begins in January 2000, the month the SEC's Regulation M-A rule package took effect, and is delivered as one ZIP container per calendar month under a YYYY/YYYY-MM.zip path.

Update Frequency: Daily
Updated at: 2026-05-09
Earliest Sample Date: 2000-01-01
Total Size: 34.1 MB
Total Records: 4,736
Container Format: ZIP
Content Types: TXT, JSON, HTML, PDF
Form Types: SC14D9C

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

314 files · 34.1 MB
2026-05.zip · 94.9 KB · 14 records
2026-04.zip · 137.1 KB · 33 records
2026-03.zip · 130.2 KB · 27 records
2026-02.zip · 43.9 KB · 9 records
2026-01.zip · 54.5 KB · 10 records
2025-12.zip · 64.0 KB · 12 records
2025-11.zip · 81.8 KB · 15 records
2025-10.zip · 63.7 KB · 12 records
2025-09.zip · 170.6 KB · 34 records
2025-08.zip · 107.9 KB · 22 records
2025-07.zip · 29.1 KB · 7 records
2025-06.zip · 210.7 KB · 40 records
2025-05.zip · 103.6 KB · 18 records
2025-04.zip · 52.7 KB · 11 records
2025-03.zip · 88.0 KB · 18 records
2025-02.zip · 152.4 KB · 27 records
2025-01.zip · 213.8 KB · 25 records
2024-12.zip · 20.2 KB · 4 records
2024-11.zip · 56.7 KB · 12 records
2024-10.zip · 81.8 KB · 14 records
2024-09.zip · 35.9 KB · 6 records
2024-08.zip · 100.2 KB · 21 records
2024-07.zip · 41.1 KB · 8 records
2024-06.zip · 40.2 KB · 7 records
2024-05.zip · 107.8 KB · 24 records
2024-04.zip · 109.5 KB · 19 records
2024-03.zip · 102.7 KB · 14 records
2024-02.zip · 240.7 KB · 44 records
2024-01.zip · 129.2 KB · 15 records
2023-12.zip · 176.0 KB · 29 records
2023-11.zip · 43.2 KB · 9 records
2023-10.zip · 272.5 KB · 31 records
2023-09.zip · 110.5 KB · 25 records
2023-08.zip · 258.8 KB · 49 records
2023-07.zip · 10.3 KB · 2 records
2023-06.zip · 26.8 KB · 5 records
2023-05.zip · 316.4 KB · 39 records
2023-04.zip · 51.5 KB · 10 records
2023-03.zip · 28.9 KB · 4 records
2023-02.zip · 6.4 KB · 2 records
2023-01.zip · 242.4 KB · 43 records
2022-12.zip · 17.2 KB · 4 records
2022-11.zip · 238.1 KB · 43 records
2022-10.zip · 139.3 KB · 32 records
2022-09.zip · 78.9 KB · 10 records
2022-08.zip · 20.6 KB · 6 records
2022-07.zip · 194.1 KB · 15 records
2022-06.zip · 115.4 KB · 25 records
2022-05.zip · 81.0 KB · 14 records
2022-04.zip · 176.2 KB · 21 records
2022-03.zip · 32.3 KB · 8 records
2022-02.zip · 74.6 KB · 15 records
2022-01.zip · 113.6 KB · 22 records
2021-12.zip · 51.0 KB · 12 records
2021-11.zip · 199.8 KB · 31 records
2021-10.zip · 94.6 KB · 16 records
2021-09.zip · 34.8 KB · 6 records
2021-08.zip · 56.8 KB · 12 records
2021-07.zip · 38.5 KB · 6 records
2021-06.zip · 220.0 KB · 33 records
2021-05.zip · 36.8 KB · 4 records
2021-04.zip · 31.5 KB · 6 records
2021-03.zip · 253.4 KB · 42 records
2021-02.zip · 125.9 KB · 27 records
2021-01.zip · 38.6 KB · 11 records
2020-12.zip · 104.9 KB · 20 records
2020-11.zip · 163.4 KB · 33 records
2020-10.zip · 56.2 KB · 12 records
2020-09.zip · 26.9 KB · 6 records
2020-08.zip · 174.6 KB · 33 records
2020-07.zip · 20.0 KB · 5 records
2020-06.zip · 20.2 KB · 4 records
2020-05.zip · 98.0 KB · 17 records
2020-03.zip · 107.3 KB · 19 records
2020-02.zip · 150.4 KB · 26 records
2020-01.zip · 69.7 KB · 13 records
2019-12.zip · 263.7 KB · 39 records
2019-11.zip · 171.4 KB · 35 records
2019-10.zip · 4.7 KB · 1 record
2019-09.zip · 153.9 KB · 28 records
2019-08.zip · 22.8 KB · 4 records
2019-07.zip · 53.8 KB · 10 records
2019-06.zip · 60.7 KB · 15 records
2019-05.zip · 71.6 KB · 11 records
2019-04.zip · 62.2 KB · 10 records
2019-03.zip · 11.1 KB · 2 records
2019-02.zip · 34.0 KB · 9 records
2019-01.zip · 35.5 KB · 10 records
2018-12.zip · 49.6 KB · 9 records
2018-11.zip · 100.5 KB · 20 records
2018-10.zip · 26.2 KB · 5 records
2018-09.zip · 196.6 KB · 19 records
2018-08.zip · 85.1 KB · 15 records
2018-07.zip · 23.2 KB · 6 records
2018-06.zip · 12.8 KB · 3 records
2018-05.zip · 42.4 KB · 9 records
2018-04.zip · 58.9 KB · 12 records
2018-03.zip · 26.1 KB · 7 records
2018-02.zip · 71.4 KB · 12 records
2018-01.zip · 231.2 KB · 49 records
2017-12.zip · 50.7 KB · 10 records
2017-11.zip · 66.3 KB · 16 records
2017-10.zip · 71.3 KB · 14 records
2017-09.zip · 43.6 KB · 9 records
2017-08.zip · 96.2 KB · 23 records
2017-07.zip · 205.3 KB · 43 records
2017-06.zip · 137.4 KB · 30 records
2017-05.zip · 115.9 KB · 23 records
2017-04.zip · 27.9 KB · 6 records
2017-03.zip · 142.6 KB · 27 records
2017-02.zip · 116.1 KB · 32 records
2017-01.zip · 104.1 KB · 17 records
2016-12.zip · 764.3 KB · 20 records
2016-11.zip · 13.3 KB · 4 records
2016-10.zip · 162.4 KB · 25 records
2016-09.zip · 170.3 KB · 32 records
2016-08.zip · 117.5 KB · 23 records
2016-07.zip · 269.3 KB · 38 records
2016-06.zip · 172.5 KB · 30 records
2016-05.zip · 175.5 KB · 29 records
2016-04.zip · 79.7 KB · 21 records
2016-03.zip · 188.2 KB · 30 records
2016-02.zip · 233.6 KB · 46 records
2016-01.zip · 32.3 KB · 6 records
2015-12.zip · 15.1 KB · 2 records
2015-11.zip · 129.5 KB · 18 records
2015-10.zip · 59.2 KB · 7 records
2015-09.zip · 86.8 KB · 20 records
2015-08.zip · 221.6 KB · 26 records
2015-07.zip · 61.5 KB · 13 records
2015-06.zip · 82.5 KB · 9 records
2015-05.zip · 223.6 KB · 21 records
2015-04.zip · 52.8 KB · 12 records
2015-03.zip · 55.0 KB · 7 records
2015-02.zip · 113.6 KB · 15 records
2015-01.zip · 202.4 KB · 27 records
2014-12.zip · 274.3 KB · 29 records
2014-11.zip · 179.9 KB · 30 records
2014-10.zip · 39.5 KB · 8 records
2014-09.zip · 79.7 KB · 12 records
2014-08.zip · 27.3 KB · 6 records
2014-07.zip · 61.2 KB · 12 records
2014-06.zip · 79.1 KB · 12 records
2014-05.zip · 20.9 KB · 3 records
2014-04.zip · 16.2 KB · 4 records
2014-03.zip · 14.9 KB · 2 records
2014-02.zip · 36.1 KB · 5 records
2014-01.zip · 85.6 KB · 10 records
2013-12.zip · 122.2 KB · 18 records
2013-11.zip · 136.4 KB · 17 records
2013-10.zip · 27.0 KB · 4 records
2013-09.zip · 104.7 KB · 9 records
2013-08.zip · 90.7 KB · 19 records
2013-07.zip · 55.9 KB · 7 records
2013-06.zip · 34.6 KB · 4 records
2013-05.zip · 79.9 KB · 16 records
2013-04.zip · 183.4 KB · 24 records
2013-03.zip · 90.0 KB · 14 records
2013-02.zip · 35.4 KB · 5 records
2013-01.zip · 86.4 KB · 7 records
2012-12.zip · 43.5 KB · 8 records
2012-11.zip · 83.8 KB · 10 records
2012-10.zip · 89.8 KB · 12 records
2012-09.zip · 34.6 KB · 7 records
2012-08.zip · 53.8 KB · 6 records
2012-07.zip · 174.2 KB · 30 records
2012-06.zip · 84.7 KB · 19 records
2012-05.zip · 129.0 KB · 17 records
2012-04.zip · 27.3 KB · 4 records
2012-03.zip · 152.1 KB · 19 records
2012-02.zip · 120.5 KB · 14 records
2012-01.zip · 100.2 KB · 11 records
2011-12.zip · 100.5 KB · 13 records
2011-11.zip · 73.6 KB · 17 records
2011-10.zip · 66.9 KB · 10 records
2011-08.zip · 38.2 KB · 10 records
2011-07.zip · 309.5 KB · 40 records
2011-06.zip · 77.8 KB · 10 records
2011-05.zip · 198.1 KB · 18 records
2011-04.zip · 252.5 KB · 39 records
2011-03.zip · 110.0 KB · 19 records
2011-02.zip · 107.3 KB · 15 records
2011-01.zip · 88.5 KB · 12 records
2010-12.zip · 262.8 KB · 15 records
2010-11.zip · 158.6 KB · 10 records
2010-10.zip · 148.3 KB · 21 records
2010-09.zip · 129.7 KB · 31 records
2010-08.zip · 124.5 KB · 17 records
2010-07.zip · 64.5 KB · 9 records
2010-06.zip · 80.3 KB · 17 records
2010-05.zip · 140.0 KB · 20 records
2010-04.zip · 34.1 KB · 6 records
2010-03.zip · 115.6 KB · 18 records
2010-02.zip · 110.3 KB · 17 records
2010-01.zip · 41.9 KB · 5 records
2009-12.zip · 111.2 KB · 8 records
2009-11.zip · 39.8 KB · 11 records
2009-10.zip · 49.7 KB · 7 records
2009-09.zip · 356.8 KB · 27 records
2009-08.zip · 144.9 KB · 12 records
2009-07.zip · 56.6 KB · 7 records
2009-06.zip · 126.3 KB · 14 records
2009-05.zip · 59.3 KB · 9 records
2009-04.zip · 85.7 KB · 18 records
2009-03.zip · 24.1 KB · 3 records
2009-02.zip · 25.4 KB · 7 records
2009-01.zip · 125.7 KB · 10 records
2008-12.zip · 165.3 KB · 18 records
2008-11.zip · 22.9 KB · 4 records
2008-10.zip · 356.2 KB · 31 records
2008-09.zip · 424.1 KB · 32 records
2008-08.zip · 192.3 KB · 19 records
2008-07.zip · 165.9 KB · 21 records
2008-06.zip · 97.5 KB · 16 records
2008-05.zip · 115.0 KB · 14 records
2008-04.zip · 87.8 KB · 15 records
2008-03.zip · 81.3 KB · 16 records
2008-02.zip · 200.7 KB · 30 records
2008-01.zip · 109.0 KB · 15 records
2007-12.zip · 266.2 KB · 19 records
2007-11.zip · 108.4 KB · 14 records
2007-10.zip · 236.7 KB · 23 records
2007-09.zip · 35.1 KB · 8 records
2007-08.zip · 143.9 KB · 21 records
2007-07.zip · 880.6 KB · 20 records
2007-06.zip · 469.8 KB · 26 records
2007-05.zip · 199.3 KB · 10 records
2007-04.zip · 183.7 KB · 24 records
2007-03.zip · 291.9 KB · 34 records
2007-02.zip · 143.4 KB · 13 records
2007-01.zip · 160.9 KB · 13 records
2006-12.zip · 87.6 KB · 11 records
2006-11.zip · 185.5 KB · 17 records
2006-10.zip · 58.0 KB · 7 records
2006-09.zip · 261.8 KB · 17 records
2006-08.zip · 586.1 KB · 16 records
2006-07.zip · 60.4 KB · 6 records
2006-06.zip · 31.1 KB · 5 records
2006-05.zip · 82.7 KB · 7 records
2006-04.zip · 183.1 KB · 27 records
2006-03.zip · 92.7 KB · 8 records
2006-02.zip · 362.2 KB · 30 records
2006-01.zip · 225.7 KB · 12 records
2005-12.zip · 84.5 KB · 16 records
2005-11.zip · 76.7 KB · 17 records
2005-10.zip · 172.3 KB · 9 records
2005-09.zip · 516.6 KB · 27 records
2005-08.zip · 11.5 KB · 2 records
2005-07.zip · 24.0 KB · 6 records
2005-06.zip · 78.7 KB · 3 records
2005-05.zip · 7.9 KB · 2 records
2005-03.zip · 116.7 KB · 8 records
2005-02.zip · 38.0 KB · 6 records
2005-01.zip · 37.0 KB · 7 records
2004-12.zip · 34.4 KB · 15 records
2004-11.zip · 39.2 KB · 10 records
2004-10.zip · 87.6 KB · 11 records
2004-09.zip · 23.1 KB · 2 records
2004-08.zip · 57.1 KB · 12 records
2004-07.zip · 12.4 KB · 3 records
2004-06.zip · 443.8 KB · 35 records
2004-05.zip · 41.7 KB · 9 records
2004-04.zip · 25.9 KB · 8 records
2004-03.zip · 42.8 KB · 5 records
2004-02.zip · 5.8 KB · 1 record
2004-01.zip · 24.5 KB · 7 records
2003-12.zip · 140.3 KB · 26 records
2003-11.zip · 78.2 KB · 12 records
2003-10.zip · 83.0 KB · 9 records
2003-09.zip · 49.7 KB · 11 records
2003-08.zip · 5.1 KB · 1 record
2003-07.zip · 134.9 KB · 23 records
2003-06.zip · 55.3 KB · 13 records
2003-05.zip · 218.7 KB · 29 records
2003-04.zip · 62.6 KB · 11 records
2003-03.zip · 5.6 KB · 2 records
2003-02.zip · 39.8 KB · 6 records
2003-01.zip · 49.1 KB · 8 records
2002-12.zip · 26.7 KB · 5 records
2002-11.zip · 114.1 KB · 11 records
2002-10.zip · 54.3 KB · 6 records
2002-09.zip · 5.5 KB · 1 record
2002-08.zip · 12.8 KB · 2 records
2002-07.zip · 107.9 KB · 5 records
2002-06.zip · 56.2 KB · 8 records
2002-05.zip · 71.5 KB · 13 records
2002-04.zip · 9.7 KB · 2 records
2002-03.zip · 85.0 KB · 11 records
2002-02.zip · 20.9 KB · 6 records
2002-01.zip · 28.2 KB · 6 records
2001-12.zip · 30.7 KB · 4 records
2001-11.zip · 24.1 KB · 7 records
2001-10.zip · 106.4 KB · 12 records
2001-09.zip · 50.6 KB · 8 records
2001-08.zip · 132.9 KB · 15 records
2001-07.zip · 118.7 KB · 13 records
2001-06.zip · 52.1 KB · 14 records
2001-05.zip · 74.7 KB · 19 records
2001-04.zip · 194.6 KB · 27 records
2001-03.zip · 29.0 KB · 5 records
2001-02.zip · 39.8 KB · 9 records
2001-01.zip · 120.2 KB · 11 records
2000-12.zip · 111.9 KB · 8 records
2000-11.zip · 38.4 KB · 12 records
2000-10.zip · 111.8 KB · 26 records
2000-09.zip · 29.6 KB · 9 records
2000-08.zip · 63.4 KB · 16 records
2000-07.zip · 19.2 KB · 5 records
2000-06.zip · 4.2 KB · 1 record
2000-05.zip · 57.4 KB · 14 records
2000-04.zip · 40.3 KB · 12 records
2000-03.zip · 53.3 KB · 10 records
2000-02.zip · 20.7 KB · 5 records
2000-01.zip · 4.2 KB · 1 record

What This Dataset Contains

The dataset packages every Form SC14D9C submission accepted by EDGAR since January 2000. Form SC14D9C is the Schedule 14D-9 cover used by the subject company to file written communications about a pending or anticipated third-party tender offer in advance of its formal solicitation/recommendation statement on Schedule 14D-9. The "C" suffix denotes a communication, not a recommendation: the cover identifies the subject company, the bidder, and the underlying tender offer, while the substantive content rides in attached EX-99.x exhibits or, in some submissions, in a contemporaneous Form 8-K incorporated by reference.

For each accession number, the dataset includes the per-filing metadata.json and every document in the original EDGAR submission except image files. Submission documents are retained inside EDGAR's SGML envelope exactly as accepted. The dataset is distributed as ZIP containers; record-level documents are predominantly HTML, with TXT, PDF, and JSON file types also present across the historical span.

Content Structure of a Single Record

What one record represents

One record in the Form SC14D9C Files Dataset corresponds to a single SC14D9C submission on EDGAR — that is, one accession number — packaged as a folder whose name is the 18-digit accession number with the hyphens removed. Inside that folder sits exactly one metadata.json describing the submission, plus the original EDGAR submission documents: the primary SC14D9C cover and any exhibits attached to it. The dataset is delivered as one ZIP per calendar month under a YYYY/YYYY-MM.zip path, and each monthly ZIP decompresses into a YYYY-MM/ directory whose immediate children are these per-accession folders. The atomic record unit is therefore the per-accession folder — not the monthly ZIP, and not any individual document inside the folder.
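
The folder layout described above can be walked with the standard library alone. The following is a minimal sketch, assuming the YYYY-MM/<accession>/<file> layout holds for every member; the helper names (container_path, group_by_accession) and the sample ZIP contents are illustrative, not part of the dataset.

```python
import io
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def container_path(year: int, month: int) -> str:
    """Relative path of a monthly container, e.g. '2025/2025-06.zip'."""
    return f"{year:04d}/{year:04d}-{month:02d}.zip"


def group_by_accession(zf: zipfile.ZipFile) -> dict:
    """Map each 18-digit accession folder name to the file names inside it."""
    records = defaultdict(list)
    for name in zf.namelist():
        parts = PurePosixPath(name).parts
        # Expected layout: YYYY-MM/<accession>/<file>; anything else is skipped.
        if len(parts) == 3 and len(parts[1]) == 18 and parts[1].isdigit():
            records[parts[1]].append(parts[2])
    return dict(records)


# Tiny in-memory ZIP with made-up contents, standing in for a real container.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("2025-06/000119312525167252/metadata.json", "{}")
    zf.writestr("2025-06/000119312525167252/d51736dsc14d9c.htm", "<DOCUMENT>")

with zipfile.ZipFile(buf) as zf:
    records = group_by_accession(zf)
```

Grouping by the second path component keeps the per-accession folder as the unit of work, which matches the record definition above.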

What the underlying filing is

Form SC14D9C is a written communication by the subject company relating to a third-party tender offer, filed under cover of Schedule 14D-9 pursuant to Rule 14d-9(a) under Section 14(d)(4) of the Securities Exchange Act of 1934. The "C" suffix denotes that the submission is a communication — typically released before the subject company has filed its formal solicitation/recommendation statement on Schedule 14D-9 — rather than the formal recommendation statement itself. The filer of an SC14D9C is the target/subject company (or someone acting on its behalf), not the bidder; the bidder's parallel pre-commencement vehicle is Schedule TO-C. Each separate written communication piece must be filed as its own SC14D9C submission, which is why the same target frequently appears in close succession when management releases a press release, an employee Q&A, a customer letter, and other tender-offer-related communications in parallel.

Mechanically, an SC14D9C is a thin Schedule 14D-9 cover that brackets one or more attached communication exhibits. The cover identifies the subject company, the bidder, and the underlying tender offer; the substantive content — the actual communication being disseminated to security holders — is delivered in the attached exhibits, normally tagged as EX-99.1, EX-99.2, and so on. In some submissions the cover itself merely incorporates a contemporaneous Form 8-K (or specific items of one) by reference, in which case the exhibit slate may be empty and the substantive content lives in the cross-referenced filing.

Content layers within one record

A single record is composed of three layers:

  1. The accession folder: the outer container, identified by the un-hyphenated EDGAR accession number, holding all files belonging to one SC14D9C submission.
  2. metadata.json: a structured per-filing record summarizing the EDGAR submission header, the filing index, the named parties, and the file manifest.
  3. The submission documents themselves: the primary SC14D9C cover plus any EX-99.x exhibits, each typically delivered as an HTML file wrapped in EDGAR's SGML envelope.

Image files (logos, photographs, scanned signatures, embedded chart graphics) that may accompany the original EDGAR submission are excluded from the dataset copy. EDGAR's complete-submission single-file SGML/TXT representation is referenced via URL inside metadata.json but is not redistributed as a local file in the record folder.

metadata.json structure

metadata.json is the per-filing structured record. Its top-level fields are:

  • formType — always the literal string "SC14D9C".
  • accessionNo — EDGAR accession in hyphenated form, e.g. "0001193125-25-167252".
  • description — the static descriptor "Form SC14D9C - Written communication relating to third party tender offer".
  • filedAt — ISO-8601 timestamp with timezone offset, capturing the EDGAR acceptance time.
  • linkToFilingDetails — URL to the primary SC14D9C document on EDGAR (the document the index page treats as the principal filing).
  • linkToTxt — URL to the complete EDGAR submission as a single SGML/TXT file with every document concatenated together.
  • linkToHtml — URL to the EDGAR filing index page.
  • linkToXbrl — empty string for SC14D9C.
  • id — a 32-character hexadecimal internal record identifier.
  • documentFormatFiles — array of objects, one per file in the EDGAR submission, plus a final catch-all entry pointing at the complete-submission TXT.
  • entities — array describing the parties named in the EDGAR header (filer and subject; for SC14D9C these are normally the same company recorded twice with different role suffixes).
  • seriesAndClassesContractsInformation — array, empty for SC14D9C (the form does not carry investment-company series/class data).
  • dataFiles — array, empty for SC14D9C.
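
As an illustration of how these top-level fields might be consumed, the sketch below checks that the listed keys are present, derives the record folder name from accessionNo, and parses filedAt. The function name summarize_metadata and every value in the sample (timestamp, id, empty link strings) are made up for the example.

```python
import json
from datetime import datetime

# Top-level keys listed above.
EXPECTED_KEYS = {
    "formType", "accessionNo", "description", "filedAt",
    "linkToFilingDetails", "linkToTxt", "linkToHtml", "linkToXbrl",
    "id", "documentFormatFiles", "entities",
    "seriesAndClassesContractsInformation", "dataFiles",
}


def summarize_metadata(meta: dict) -> dict:
    """Validate the top-level shape and derive a few convenience values."""
    missing = EXPECTED_KEYS - set(meta)
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    if meta["formType"] != "SC14D9C":
        raise ValueError(f"unexpected formType: {meta['formType']!r}")
    return {
        # The folder name is the accession number with the hyphens removed.
        "folder": meta["accessionNo"].replace("-", ""),
        # ISO-8601 with offset parses into a timezone-aware datetime.
        "filed_at": datetime.fromisoformat(meta["filedAt"]),
    }


# Hypothetical metadata.json content; values are placeholders.
sample = json.loads("""
{
  "formType": "SC14D9C",
  "accessionNo": "0001193125-25-167252",
  "description": "Form SC14D9C - Written communication relating to third party tender offer",
  "filedAt": "2025-06-12T08:31:05-04:00",
  "linkToFilingDetails": "", "linkToTxt": "", "linkToHtml": "", "linkToXbrl": "",
  "id": "0123456789abcdef0123456789abcdef",
  "documentFormatFiles": [], "entities": [],
  "seriesAndClassesContractsInformation": [], "dataFiles": []
}
""")
info = summarize_metadata(sample)
```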

documentFormatFiles[]

Each element describes one attached file. Keys include sequence (numeric strings "1", "2", "3" for documents, with a single literal space " " reserved for the catch-all complete-submission row), size (file size in bytes as a string), documentUrl (direct EDGAR URL), description (free-text label supplied by the filer, such as "SC14D9C", "EX-99.1", or "Complete submission text file"), and type (the EDGAR document-type label, mirroring the <TYPE> line inside the SGML wrapper). The complete-submission TXT row is conventionally the last entry in the array. Common shapes are: one primary with no exhibits (the cover incorporates another filing by reference), one primary plus one or two EX-99.x exhibits, and occasionally larger exhibit slates when management releases multiple coordinated communication pieces simultaneously.
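
One way to work with this manifest is to separate the real documents from the catch-all row by testing for a blank sequence. The sketch below assumes the keys behave as described above; the function names and the sample sizes are illustrative only.

```python
from typing import Optional


def split_manifest(files: list) -> tuple:
    """Return (documents sorted by sequence, complete-submission row or None)."""
    documents = []
    complete: Optional[dict] = None
    for entry in files:
        # The catch-all row carries a single space instead of a number.
        if entry.get("sequence", "").strip() == "":
            complete = entry
        else:
            documents.append(entry)
    documents.sort(key=lambda e: int(e["sequence"]))
    return documents, complete


def count_exhibits(documents: list) -> int:
    """Number of EX-99.x entries; zero suggests a cover-only submission."""
    return sum(1 for d in documents if d.get("type", "").upper().startswith("EX-99"))


# Hypothetical manifest, deliberately out of order.
sample_files = [
    {"sequence": "2", "size": "20480", "description": "EX-99.1", "type": "EX-99.1"},
    {"sequence": "1", "size": "10240", "description": "SC14D9C", "type": "SC14D9C"},
    {"sequence": " ", "size": "40960", "description": "Complete submission text file"},
]
docs, complete = split_manifest(sample_files)
```

Sorting on the integer value rather than the string avoids "10" sorting before "2" in larger exhibit slates.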

entities[]

For SC14D9C the same legal entity ordinarily appears twice — once as (Filed by) and once as (Subject) — because the subject company files on its own behalf. Per-entity keys include cik, companyName (with the role suffix appended), tickers (array of trading symbols), irsNo (IRS employer identification number, populated as "000000000" when withheld), stateOfIncorporation (two-letter code), fiscalYearEnd (MMDD), sic (SIC code with a human-readable description appended), and type (mirrors formType). Three keys appear only on the (Subject) entity: act (the Exchange Act number, normally "34"), fileNo (the SEC file number, e.g. "005-60499" for tender-offer files in the 005- series), and filmNo (the SEC film/microfiche number assigned at acceptance).
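
Since the two entries usually describe one company, a consumer may want to collapse them. The following sketch assumes the role suffix appears at the end of companyName exactly as "(Filed by)" or "(Subject)"; the company name and CIK in the sample are invented, and merge_entities is a hypothetical helper.

```python
import re

# Assumed suffix format, based on the description above.
ROLE_SUFFIX = re.compile(r"\s*\((Filed by|Subject)\)\s*$")


def merge_entities(entities: list) -> dict:
    """Collapse role-suffixed entries into one record per CIK."""
    merged = {}
    for ent in entities:
        name = ent.get("companyName", "")
        match = ROLE_SUFFIX.search(name)
        record = merged.setdefault(
            ent["cik"],
            {"name": ROLE_SUFFIX.sub("", name), "roles": [], "fileNo": None},
        )
        if match:
            record["roles"].append(match.group(1))
        # fileNo is only expected on the (Subject) projection.
        if ent.get("fileNo"):
            record["fileNo"] = ent["fileNo"]
    return merged


sample_entities = [
    {"cik": "1234567", "companyName": "Example Target Inc. (Filed by)"},
    {"cik": "1234567", "companyName": "Example Target Inc. (Subject)",
     "act": "34", "fileNo": "005-60499"},
]
merged = merge_entities(sample_entities)
```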

Submission documents and the SGML envelope

Each .htm file in the accession folder is the original EDGAR document, retained inside EDGAR's SGML wrapper exactly as accepted. The file opens with a small SGML header block — <DOCUMENT>, <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, <TEXT> — encloses an <HTML> body containing the rendered communication, and closes with </TEXT></DOCUMENT>. A representative header block looks like:

<DOCUMENT>
<TYPE>SC14D9C
<SEQUENCE>1
<FILENAME>d51736dsc14d9c.htm
<DESCRIPTION>SC14D9C
<TEXT>
<HTML>...</HTML>
</TEXT>
</DOCUMENT>

The four SGML metadata lines duplicate, line for line, the corresponding entry in documentFormatFiles[], providing a redundant in-document anchor for the file's role.
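
A small parser can split the envelope into its header lines and body. This is a sketch under the assumption that each header tag sits at the start of its own line and that <TEXT> appears once before the body; parse_sgml_document is an illustrative name and the body in the sample is a stand-in.

```python
import re

HEADER_TAGS = ("TYPE", "SEQUENCE", "FILENAME", "DESCRIPTION")


def parse_sgml_document(raw: str) -> dict:
    """Split one SGML-wrapped document into header fields and body text."""
    head, marker, rest = raw.partition("<TEXT>")
    if not marker:
        raise ValueError("no <TEXT> marker found")
    header = {}
    for tag in HEADER_TAGS:
        # Header tags are unclosed: the value runs to the end of the line.
        match = re.search(rf"^<{tag}>(.*)$", head, flags=re.MULTILINE)
        if match:
            header[tag.lower()] = match.group(1).strip()
    # Drop the closing </TEXT></DOCUMENT> trailer if present.
    body = rest.rsplit("</TEXT>", 1)[0].strip()
    return {"header": header, "body": body}


sample_doc = """<DOCUMENT>
<TYPE>SC14D9C
<SEQUENCE>1
<FILENAME>d51736dsc14d9c.htm
<DESCRIPTION>SC14D9C
<TEXT>
<HTML><BODY>Sample cover text</BODY></HTML>
</TEXT>
</DOCUMENT>
"""
parsed = parse_sgml_document(sample_doc)
```

The resulting header dictionary can be compared field by field with the matching documentFormatFiles[] entry as a consistency check.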

Document roles inside one record

  • Primary SC14D9C cover (sequence 1). This is the Schedule 14D-9 cover page. It identifies the subject company by name and address; lists the names and addresses of the persons filing the statement; gives the title and CUSIP of the subject class of equity securities; names the contact person authorized to receive notices on behalf of the persons filing; and identifies outside counsel where applicable. The cover also carries a check box indicating that the filing relates solely to preliminary communications made before the commencement of a tender offer; SC14D9C submissions check this box, marking the submission as a preliminary communication. Beneath the cover, some filings embed substantive narrative directly: a description of the merger agreement, a roadmap of attached exhibits, the mandatory "ADDITIONAL INFORMATION AND WHERE TO FIND IT" legend directing security holders to the forthcoming Schedule 14D-9, and a "CAUTIONARY NOTE REGARDING FORWARD-LOOKING STATEMENTS" disclaimer. Other filings use the cover purely as a procedural wrapper, with a short paragraph that incorporates a contemporaneous Form 8-K (or named items of it) by reference, leaving substantive content to flow through the incorporated filing. Signatures, where present, appear on the cover and identify the officer signing on behalf of the subject company.

  • EX-99.x exhibits (sequence 2 and onward). These exhibits hold the actual communications: press releases announcing the deal or its terms, employee Q&A documents, internal "all-colleagues" letters from the CEO, customer or partner letters, transcripts of conference calls or town halls, social-media post collections, investor-presentation slide decks, and similar pieces. Each exhibit document typically repeats two legends at the foot of the document: the "Important Additional Information and Where to Find It" notice and the forward-looking-statements cautionary language. The first is the legend required for pre-commencement written communications under Rule 14d-9(a); the second is customary practice rather than a requirement of that rule. Both blocks are reliable textual markers when classifying or extracting content programmatically.

  • Complete-submission TXT row. A virtual entry in documentFormatFiles[] (with sequence set to a single space character) points at EDGAR's concatenated SGML/TXT representation of the whole submission. This is a reference URL only; the dataset does not store the file locally.
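
The two legend blocks mentioned in the roles above can be located with simple pattern matching. The sketch below is a heuristic only: the patterns are assumptions about common heading wording, the tag stripping is deliberately crude, and a body that mentions forward-looking statements before the legend would cut the text short.

```python
import re

# Assumed heading variants; real filings may word these differently.
LEGENDS = {
    "where_to_find_it": re.compile(
        r"(?:important\s+)?additional\s+information\s+and\s+where\s+to\s+find\s+it",
        re.IGNORECASE,
    ),
    "forward_looking": re.compile(
        r"(?:cautionary\s+\w+\s+regarding\s+)?forward[-\s]looking\s+statements",
        re.IGNORECASE,
    ),
}


def strip_tags(html: str) -> str:
    """Crude tag removal plus whitespace collapsing; not a full HTML parser."""
    return " ".join(re.sub(r"<[^>]+>", " ", html).split())


def find_legends(text: str) -> dict:
    """Offset of the first match for each legend, or None when absent."""
    hits = {}
    for label, pattern in LEGENDS.items():
        match = pattern.search(text)
        hits[label] = match.start() if match else None
    return hits


def body_before_legends(text: str) -> str:
    """Text preceding the earliest legend; the whole text if none is found."""
    offsets = [o for o in find_legends(text).values() if o is not None]
    return text[: min(offsets)].rstrip() if offsets else text


sample_html = (
    "<p>Press release body.</p>"
    "<p><b>Important Additional Information and Where to Find It</b></p>"
    "<p>Holders should read the Schedule 14D-9 when available.</p>"
    "<p><b>Cautionary Note Regarding Forward-Looking Statements</b></p>"
    "<p>This release contains estimates.</p>"
)
text = strip_tags(sample_html)
hits = find_legends(text)
```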

File-naming conventions

Document filenames within an accession folder follow the conventions of the financial-printer filing agent that prepared the submission. A large share of SC14D9C filings are prepared by Donnelley Financial Solutions (filer-agent CIK 0001193125), which produces filenames of the form d<digits>d<slug>.htm, where d<digits>d is the printer's internal job identifier (e.g. d51736d, d84611d, d49582d, d897100d) and <slug> encodes the document role: sc14d9c for the primary cover, ex991, ex992, ex993, etc. for the numbered exhibits. Other filer agents follow their own, typically self-evident slug schemes (tm25...d1_sc14d9c.htm, ny<digits>_sc14d9c.htm, etc.). The slug is a useful but non-authoritative hint about a file's role; the authoritative role assignment is the <TYPE> line of the SGML envelope and the type field in documentFormatFiles[].
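
For the d<digits>d<slug>.htm pattern, a regular expression is enough to pull out a role hint. This is a sketch that covers only that one naming scheme and returns None for everything else; role_hint is a hypothetical helper, and a slug such as ex9910 is ambiguous in principle, so the result should stay a hint and be checked against the type field.

```python
import re
from typing import Optional

# d<digits>d<slug>.htm, as described above for one filing agent.
JOB_SLUG = re.compile(r"^d(?P<job>\d+)d(?P<slug>[a-z0-9]+)\.htm$", re.IGNORECASE)


def role_hint(filename: str) -> Optional[str]:
    """Guess the document role from the filename, or None when unrecognized."""
    match = JOB_SLUG.match(filename)
    if not match:
        return None
    slug = match.group("slug").lower()
    if slug == "sc14d9c":
        return "SC14D9C"
    exhibit = re.fullmatch(r"ex99(\d+)", slug)
    if exhibit:
        return f"EX-99.{exhibit.group(1)}"
    return None


hints = {
    name: role_hint(name)
    for name in ("d51736dsc14d9c.htm", "d84611dex991.htm", "ny20012345_sc14d9c.htm")
}
```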

File types in the dataset

Submission documents in modern records are overwhelmingly HTML. The file-types found across the dataset are TXT, JSON, HTML, and PDF, reflecting the historical span: HTML and PDF for rendered documents, TXT for legacy ASCII bodies that survived in older submissions, and JSON for the per-record metadata.json. Image files (.jpg, .gif, .png) referenced by the rendered HTML are excluded from the dataset copy by design, so an exhibit that originally carried embedded images will display as text plus broken-image references when rendered locally.
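
Because image files are absent, it can help to list the image references a document still carries before rendering it. The sketch below uses the standard-library HTML parser; the class and function names and the sample markup are illustrative.

```python
from html.parser import HTMLParser


class ImageRefCollector(HTMLParser):
    """Collect the src attribute of every <img> tag encountered."""

    def __init__(self) -> None:
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before this call.
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_references(html: str) -> list:
    """Image paths referenced by the HTML; these files are not in the record."""
    collector = ImageRefCollector()
    collector.feed(html)
    return collector.sources


sample_html = '<HTML><BODY><IMG SRC="g51736logo.jpg" ALT="logo"><P>Text</P></BODY></HTML>'
refs = image_references(sample_html)
```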

What is included in a record

A record contains:

  • the metadata.json summary of the EDGAR submission header, filing index, party identifiers, and file manifest;
  • the primary SC14D9C cover document, in its SGML-wrapped HTML form;
  • every exhibit attached to the submission (EX-99.1, EX-99.2, and any further EX-99.x), each in its SGML-wrapped HTML form;
  • any non-image ancillary text or PDF documents that were part of the original submission.

What is not included

A record does not contain:

  • image files referenced by the HTML documents (these are stripped per dataset policy);
  • a local copy of EDGAR's complete-submission single-file SGML/TXT representation (referenced by URL only);
  • the actual cross-referenced documents when an SC14D9C cover incorporates a separate filing (a contemporaneous Form 8-K, for example) by reference — those records live in their own form-type datasets;
  • the formal Schedule 14D-9 solicitation/recommendation statement that follows commencement of the tender offer — that is a separate form (SC 14D9 / SC 14D9/A) with its own dataset boundary.

Evolution of required content over time

The SC14D9C form has existed in essentially its current shape since the SEC's M&A Release (Release No. 33-7760) restructured Regulation 14D and Regulation 14E and adopted Rule 165 / Regulation M-A in October 1999. The rule package took effect in January 2000, so the dataset's start date coincides with the beginning of a deliberate liberalization of pre-commencement communications. That rule package decoupled written communications about a third-party tender offer from the formal commencement of the offer: rather than treating early communications as illegal solicitations, the SEC permitted broad pre-commencement communication subject to the requirement that each piece be filed on the date first used. SC14D9C is the cover used by the subject company for that filing; its parallel for bidders is SC TO-C. The recurring structural elements of an SC14D9C (the Schedule 14D-9 cover identifying the subject company, the bidder, and the subject security; the explicit legend directing investors to the forthcoming formal Schedule 14D-9; and the customary forward-looking-statements cautionary language) have been stable since 2000.

Refinements introduced over the dataset's span have been incremental rather than structural. Rule 162 / 165 amendments and successor SEC interpretive guidance tightened the contents of the "Important Information and Where to Find It" legend (for example, prompting filers to add SEC EDGAR links and tender-offer document references in machine-friendly form), and the staff's M&A guidance through the 2000s and 2010s sharpened the content of forward-looking statements disclaimers in tender-offer communications. These refinements adjust the content of the boilerplate legends rather than the surrounding form structure.

Evolution of file format over time

Filings since January 2000 have always been packaged inside EDGAR's SGML submission envelope. Within that envelope, the document body has shifted across three regimes that may all be visible in the dataset:

  • Early period (2000–early 2000s). Many SC14D9C submissions consisted of plain ASCII text bodies inside the SGML wrapper, sometimes accompanied by a separately filed paper-equivalent or PDF of a press release. Older .txt document bodies, where they exist, are retained as-is.
  • HTML era (mid-2000s onward). EDGAR encouraged and ultimately standardized HTML for document bodies, and SC14D9C filings adopted HTML as the dominant format. The <HTML>...</HTML> content sits inside the same SGML envelope, preserving the same <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> header lines.
  • Modern HTML with PDF attachments. Recent submissions are essentially all HTML, with PDFs occasionally appearing for visual-heavy attachments (an investor deck reproduced as an exhibit, for example). Filing agents — Donnelley Financial Solutions in particular — apply consistent filename slug conventions (d<jobid>d<role>.htm) that make the document role identifiable from the filename alone.

Interpretation notes

  • Same entity twice in entities[]. Because SC14D9C is filed by the subject company itself, the two entries with (Filed by) and (Subject) suffixes normally describe the same legal entity at the same CIK. Treat them as one company with two metadata projections; the (Subject) entry carries the SEC file number, film number, and act keys, while the (Filed by) entry typically does not.
  • Multiple SC14D9C filings per target per day. Because each separate communication piece must be filed as a separate submission, the same subject company commonly appears two, three, or more times on the same day. Each filing should be treated as an independent record, even when the cover narrative and the legends are nearly identical.
  • Cover-only filings via incorporation by reference. A non-trivial share of SC14D9C submissions consist of a cover plus zero exhibits, with the cover incorporating a contemporaneous Form 8-K (or specific items thereof) by reference. To recover the substantive communication in those cases, the cross-referenced 8-K must be retrieved from its own form-type dataset.
  • Legends as text markers. The "ADDITIONAL INFORMATION AND WHERE TO FIND IT" / "Important Additional Information and Where to Find It" notice is required for SC14D9C communications, and the "CAUTIONARY NOTE REGARDING FORWARD-LOOKING STATEMENTS" block is customary; both appear in nearly every record, usually verbatim or nearly so. They are reliable anchors for programmatic identification and segmentation but should not be treated as substantive content.
  • SGML header redundancy. The <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> lines inside each document echo the corresponding entry in metadata.json's documentFormatFiles[]. Either source can be used to assign roles; using the SGML header is robust to JSON parsing issues, while using documentFormatFiles[] is robust to malformed document bodies.
  • fileNo namespace. The SEC file number for a third-party tender offer normally lives in the 005- series; seeing this prefix on the (Subject) entity is a quick sanity check that the filing was correctly classified as a tender-offer communication rather than miscoded.
  • Industry concentration. SC14D9C usage is empirically heavy in pharma/biotech (SIC 2834, 2836) and other M&A-active sectors. This is not a structural feature of the form but a useful prior when validating extraction results.
  • Amendments. SC14D9C does not have a separate amendment form type within this dataset; further communications by the same target on the same offer are filed as additional SC14D9C submissions (each with its own accession number and record folder), not as /A amendments. The progression of communications must therefore be reconstructed by chaining records on subject CIK plus the underlying tender-offer file number.
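
The chaining described in the last note can be sketched as a grouping on (subject CIK, file number) followed by a sort on filedAt. The code assumes the "(Subject)" suffix identifies the subject entity as described earlier; the function names, accession numbers, CIKs, and the second file number in the sample are made up.

```python
from collections import defaultdict
from datetime import datetime


def subject_key(meta: dict):
    """(cik, fileNo) of the subject entity, or None if no subject is found."""
    for ent in meta.get("entities", []):
        if ent.get("companyName", "").endswith("(Subject)"):
            return ent.get("cik"), ent.get("fileNo")
    return None


def chain_records(metas: list) -> dict:
    """Group records per subject and tender-offer file, oldest first."""
    chains = defaultdict(list)
    for meta in metas:
        key = subject_key(meta)
        if key is not None:
            chains[key].append(meta)
    for filings in chains.values():
        # Compare parsed timestamps so differing UTC offsets sort correctly.
        filings.sort(key=lambda m: datetime.fromisoformat(m["filedAt"]))
    return dict(chains)


def make_meta(accession: str, filed_at: str, cik: str, file_no: str) -> dict:
    """Build a minimal hypothetical metadata record for the example."""
    return {
        "accessionNo": accession,
        "filedAt": filed_at,
        "entities": [
            {"cik": cik, "companyName": "Example Co (Filed by)"},
            {"cik": cik, "companyName": "Example Co (Subject)", "fileNo": file_no},
        ],
    }


metas = [
    make_meta("0000000000-25-000002", "2025-06-12T16:05:00-04:00", "1234567", "005-60499"),
    make_meta("0000000000-25-000001", "2025-06-12T08:31:00-04:00", "1234567", "005-60499"),
    make_meta("0000000000-25-000003", "2025-06-13T09:00:00-04:00", "7654321", "005-00001"),
]
chains = chain_records(metas)
```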

Who Files or Publishes This Dataset, and When

Who files

The filer of Form SC14D9C is the subject company in a third-party tender offer — the issuer whose equity securities a separate bidder is seeking to acquire. Each record on EDGAR is one accession number containing the SC14D9C cover plus the underlying written communication (press release, employee or customer letter, investor deck, transcript, FAQ, social or website post) and any exhibits.

The filer is not the bidder. Bidder-side pre-commencement written communications are filed under SC TO-C under Rule 14d-2(b). Officers, directors, advisers, and proxy solicitors are not the legal filer; the subject company is. A non-issuer person who independently solicits or recommends action on the offer (a controlling holder, employee group, competing bidder, or adviser publishing a recommendation) files in its own capacity under Rule 14d-9 and may use the SC14D9C cover for its own pre-commencement written communications.

What triggers a filing

The trigger is communication-driven, not periodic. A filing arises each time the subject company disseminates, publishes, sends, or gives to security holders any written communication relating to a third-party tender offer for its securities, before its formal Schedule 14D-9 has been filed. Each separate written communication generally produces its own SC14D9C; integrated communications packages (press release plus employee letter plus FAQ plus investor deck) commonly produce multiple same-day filings.

Typical triggering materials:

  • Press releases announcing or reacting to a tender offer or related merger agreement
  • Letters to employees, customers, suppliers, or business partners
  • Investor presentations, scripts, prepared remarks, and call transcripts
  • Q&A documents, fact sheets, and talking points
  • Email and intranet messages, website postings, and social media content

Purely oral, unrecorded communications do not themselves require an SC14D9C, but written materials prepared for them (slides, scripts, transcripts) do.

Regulatory basis

Form SC14D9C is filed under Rule 14d-9(a)(2) under Section 14(d)(4) of the Exchange Act, as part of the Regulation M-A communications regime adopted in 1999 (Release No. 33-7760, effective January 2000). The rule lets the subject company communicate publicly about a tender offer in advance of its formal recommendation, provided each written communication is filed on or before the date of first use, carries the required Rule 14d-9 legend pointing holders to the forthcoming Schedule 14D-9, and complies with the federal anti-fraud rules (Section 10(b)/Section 14(e), Rule 10b-5, Rule 14e-3, Rule 14e-8). SC14D9C is the EDGAR cover that effectuates that filing requirement; it is not a content safe harbor.

When it must be filed

SC14D9C is a per-communication filing due no later than the date of first use — the same calendar day the communication is first published, sent, or disseminated. In practice:

  • Morning press releases are filed the same business day.
  • Internal employee communications go out and get filed the same day.
  • Deal-announcement materials are often filed concurrently with the underlying 8-K and merger agreement.

The SC14D9C window opens as soon as the subject company begins making written communications about an actual or anticipated third-party tender offer, and closes when the subject company files its formal Schedule 14D-9 (which itself is due within ten business days of the offer's commencement and contains the subject company's Item 1-9 disclosures and its accept/reject/neutral/unable-to-take-a-position recommendation). After Schedule 14D-9 is filed, further subject-company communications about the offer move onto Schedule 14D-9 amendments rather than new SC14D9C filings.

Important distinctions

  • SC14D9C vs. SC TO-C. SC14D9C is the target's cover; SC TO-C is the bidder's. The same transaction commonly produces coordinated filings on both tracks.
  • SC14D9C vs. Schedule 14D-9. SC14D9C carries informal pre-commencement written communications. The "no-C" Schedule 14D-9 is the formal solicitation/recommendation statement with the substantive Item 1-9 disclosures. SC14D9C does not substitute for it.
  • SC14D9C vs. DEFA14A. A one-step merger requiring a shareholder vote falls under the proxy rules; target communications use Rule 14a-12 / DEFA14A, not Rule 14d-9. Two-step deals (tender offer plus back-end merger) can produce SC14D9C during the offer and proxy filings later, though many post-2013 Delaware deals use DGCL Section 251(h) to avoid a back-end vote.
  • Issuer tender offers are excluded. An issuer bidding for its own securities is governed by Section 13(e) / Rule 13e-4 and Schedule TO (issuer tender offer), not by SC14D9C.
  • Amendments. SC14D9C amendments exist on EDGAR but are uncommon; corrections are typically handled by filing a new SC14D9C for a new communication or, post-commencement, by amending Schedule 14D-9.
  • History. SC14D9C as a distinct EDGAR submission type dates from the Regulation M-A rules effective January 2000, which is why the dataset's earliest records appear at that date. Section 14(d) itself originates in the Williams Act of 1968, but the dedicated pre-commencement communications cover is a Regulation M-A creation.

How This Dataset Differs From Similar Datasets or Filings

SC14D9C sits in a tight cluster of tender offer and M&A communication filings. Distinguishing it requires three axes: the speaker (subject company, bidder, or issuer), the timing (pre-commencement communication versus formal statement), and the transaction regime (third-party tender, issuer self-tender, going-private, foreign issuer, or proxy/registered merger).

Schedule 14D-9 — formal solicitation/recommendation statement

The document SC14D9C precedes. Schedule 14D-9 is the subject company's mandated response to a third-party tender offer under Rule 14d-9, filed within ten business days of commencement. It contains the board's recommendation, supporting reasoning, fairness opinions, conflicts, prior contacts, and full item-by-item disclosure.

SC14D9C is the same filer's earlier, informal channel: press releases, employee FAQs, customer letters, or talking points wrapped in a legend pointing holders to the forthcoming 14D-9. 14D-9 is the legally operative recommendation; SC14D9C is the running public commentary around it.

SC TO-C — bidder-side pre-commencement communications

The bidder mirror of SC14D9C. Same timing, same wrapper format, same cautionary legend, but filed by the offering person under Rule 14d-2(b) (third-party offers) or Rule 13e-4(c) (issuer offers) rather than by the target under Rule 14d-9(a). Reconstructing a deal's communications record almost always requires both; SC14D9C alone gives only the target's voice.

SC TO-T — third-party tender offer statement

The bidder's formal commencement filing, counterpart to Schedule 14D-9. Compared to SC14D9C: later in the timeline, fully structured (offer terms, financing, sources of funds, plans, exhibits), and authored by the opposite party. SC14D9C may discuss the offer that an SC TO-T formalizes, but never contains the offer terms themselves.

SC TO-I — issuer self-tender

Filed under Rule 13e-4 when a company tenders for its own securities (buybacks, Dutch auctions, self-tenders). There is no separate subject company and no Rule 14d-9 obligation, so SC14D9C is structurally inapplicable. The two datasets cover different transaction types and do not overlap.

SC 13E-3 — going-private transactions

Triggered when an affiliate transaction will deregister or delist the company, often layered onto a tender offer or merger. Overlaps with SC14D9C only when a going-private deal takes the form of an affiliate-led third-party tender offer; in those cases the same transaction may generate SC14D9C, 14D-9, and SC 13E-3 in parallel. 13E-3 compels heightened fairness disclosure (Rule 13e-3 factors, financial advisor reports); SC14D9C is a generic communications wrapper. Narrow and deep versus broad and shallow.

SC14D9F — cross-border solicitation/recommendation

The foreign-private-issuer counterpart to Schedule 14D-9, used when a tender offer qualifies for the SEC's cross-border exemptions and is conducted primarily under home-country rules, often via incorporation by reference of foreign documents. SC14D9C, by contrast, operates under the standard U.S. Rule 14d-9 framework. The two serve different deal structures and are rarely substitutes.

DEFA14A — additional proxy soliciting materials

The proxy-context analogue to SC14D9C: communications wrappers (press releases, investor decks, employee letters) filed during a corporate transaction with cautionary legends. The decisive difference is deal mechanism — DEFA14A is used when the transaction proceeds through a shareholder vote (one-step merger), while SC14D9C is used when it proceeds through a tender offer. A two-step merger can produce both: SC14D9C and 14D-9 on the tender front end, DEFA14A on any back-end vote.

Form 425 — communications in registered business combinations

Covers written communications under Rule 425 in business combinations involving registered securities, typically stock-for-stock mergers. Same wrapper format as SC14D9C, but triggered by Securities Act offer concerns rather than Exchange Act Section 14(d). Cash tender offers rarely produce 425s; stock-for-stock mergers rarely produce SC14D9Cs. Mixed-consideration tender offers can generate both, with the same press release filed under different cover pages. See Form 425 for the rule text.

Boundary summary

SC14D9C captures one precise intersection: written communications, by the subject company, about a third-party tender offer, in the pre-commencement or pre-formal-statement window. No other filing occupies that slot. SC TO-C covers the bidder side of the same window; Schedule 14D-9 covers the same filer at the formal stage; DEFA14A and Form 425 cover analogous communications under proxy and registered-merger regimes; SC TO-I and SC 13E-3 cover entirely different transaction structures; SC14D9F substitutes for 14D-9 in cross-border deals.

For most research, SC14D9C is a complement rather than a substitute. A complete tender offer record typically pairs SC14D9C with 14D-9 (target side) and SC TO-C with SC TO-T (bidder side), adding 13E-3 for affiliate deals or swapping in SC14D9F for cross-border ones. Used alone, SC14D9C captures only the target's informal voice in the earliest phase of the offer.

Who Uses This Dataset

SC14D9C filings capture how a target company speaks publicly to security holders in the window between a tender-offer announcement and its formal Schedule 14D-9 recommendation. The professionals below use that record in distinct ways.

M&A Lawyers and Corporate Counsel

Deal lawyers and in-house counsel treat the dataset as a working library of Rule 14d-9 communications. They mine prior filings to draft press releases, employee FAQs, customer letters, and investor talking points that meet the filing and legending requirements: how the offer was identified, how the mandatory pointer to the eventual 14D-9 was worded, how forward-looking disclaimers were framed, and how communications were sequenced before a stop-look-and-listen response or a formal recommendation.

Merger Arbitrage and Event-Driven Analysts

Risk-arb desks and event-driven funds read SC14D9C filings as the earliest signal of how a target board is leaning. Tone, cadence, and references to financial advisers, go-shop language, or litigation feed deal-spread, completion-probability, and timing models, and inform position sizing and hedge construction on tender-offer trades.

M&A Bankers

Bankers advising targets and bidders study how similarly situated companies framed prior tender offers to anticipate counterparty reactions and pricing-adequacy signaling. The text and exhibits feed pitch materials, fairness-opinion comparables, and live-deal commentary.

Healthcare and Biotech Equity Analysts

Because tender offers dominate pharma, biotech, and medical-device M&A, the corpus skews toward life-sciences targets, including small and mid-cap clinical-stage names with thin sell-side coverage. Analysts use it for premium benchmarking, comparable-deal analysis, and reading how managements characterize pipeline value, milestone payments, and contingent value rights.

Proxy Solicitors and Information Agents

Solicitors advising target boards study holder-communication structure, call-center scripts, broker and custodian outreach, and the timing of communications relative to offer expiration. The output is bid-response playbooks and outreach materials that avoid confusion with the eventual 14D-9.

Investor Relations Teams

IR teams at potential targets and companies in takeover-prone sectors use the corpus to benchmark peer responses: how inbound calls were handled in the pre-recommendation window, what employee and customer communications were filed alongside investor materials, and how Reg FD was managed. It supports standby playbooks and templates.

Strategic Communications Advisers

Crisis-PR and strategic-comms firms study tone, framing, and messaging cadence across successive SC14D9C filings in a single contest: how the bidder was characterized, how CEO letters and town-hall remarks were structured, and how language evolved as the contest developed.

Compliance Officers

Compliance teams at registrants, broker-dealers, and law firms track Rule 14d-9 hygiene across the market: legend presence, prompt filing of written communications, and the boundary between ordinary-course and tender-offer communications. Use cases include compliance reviews, internal training, and post-deal audits.

Academic Researchers in Corporate Finance and Law

Academics use the corpus as a primary source on takeover defense, premia, and negotiation dynamics. With filings from 2000 forward, it supports panel studies linking target rhetoric to topping bids, deal completion, and stock-price reactions, and tracks how disclosure practice responds to Delaware case law and Commission guidance.

ML and NLP Teams

Quant funds, fintechs, and data vendors use the corpus to train deal-event detection, board-posture classification (resist, neutral, recommend), bidder/target entity extraction, and retrieval systems that answer live-deal questions. Every filing is anchored to a specific tender-offer event, and consistent EDGAR accession structure makes it easy to join with other M&A forms.

In summary: lawyers and bankers draft and benchmark, arb and sector analysts read signals and price risk, solicitors and IR shape messaging, compliance monitors hygiene, and academics and ML teams treat the filings as a clean, event-anchored text corpus on the most price-sensitive days of a tender offer.

Specific Use Cases

Concrete workflows the SC14D9C corpus supports, tied to its actual contents — subject-company written communications about third-party tender offers, with Rule 14d-9 cover, EX-99.x exhibits, and per-filing metadata.json.

Building a board-posture signal for merger-arbitrage models

Pull every SC14D9C tied to a specific subject CIK and tender-offer file number (the 005- series value in entities[].fileNo), order them by filedAt, and run sentiment and stance classification over the EX-99.x exhibit bodies (press releases, CEO letters, employee Q&As). The output is a per-deal time series of board posture — resist, neutral, supportive — feeding deal-spread and completion-probability models before the formal Schedule 14D-9 lands.
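The grouping and ordering step might look like the sketch below. It assumes each metadata.json has already been loaded into a dict with a top-level filedAt timestamp and an entities list; only filedAt and entities[].fileNo are named in this document, so the cik and accessionNo keys and the helper names are illustrative assumptions. Stance classification itself is left out.

```python
from collections import defaultdict
from datetime import datetime


def _parsed(timestamp):
    # Accept a trailing "Z" on Python versions older than 3.11.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def deal_timelines(records):
    """Group records by (cik, fileNo) and order each group by filedAt.

    The `cik` key name is an assumption; check it against a real metadata.json.
    """
    timelines = defaultdict(list)
    for rec in records:
        # A set avoids double-counting a record that lists the same
        # entity twice (for example as subject company and as filer).
        keys = {
            (e.get("cik"), e.get("fileNo"))
            for e in rec.get("entities", [])
            if str(e.get("fileNo", "")).startswith("005-")
        }
        for key in keys:
            timelines[key].append(rec)
    for recs in timelines.values():
        recs.sort(key=lambda r: _parsed(r["filedAt"]))
    return dict(timelines)
```

Each value is a chronologically ordered list of records for one contest, ready to be passed to whatever stance model is in use.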

Drafting Rule 14d-9 communication legends from prior filings

For a live deal, query the corpus by SIC and recent filedAt to surface comparable subject-company communications, then extract the "Important Additional Information and Where to Find It" legend and the "Cautionary Note Regarding Forward-Looking Statements" block from each EX-99.x exhibit. The output is a vetted legend bank, segmented by deal type and law firm, used by deal counsel as drafting starting points.
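A rough sketch of the legend-extraction step follows. It assumes the exhibit has already been reduced to plain text and that the two headings appear verbatim; real filings vary the wording, so the heading list and the function name are illustrative assumptions rather than a fixed schema.

```python
import re

# Heading strings as quoted above; filers often vary them slightly.
LEGEND_HEADINGS = (
    "Important Additional Information and Where to Find It",
    "Cautionary Note Regarding Forward-Looking Statements",
)


def extract_legend(text, heading, all_headings=LEGEND_HEADINGS):
    """Return the text under `heading`, up to the next known heading or the end."""
    start = re.search(re.escape(heading), text, flags=re.IGNORECASE)
    if start is None:
        return None
    end = len(text)
    for other in all_headings:
        if other.lower() == heading.lower():
            continue
        nxt = re.search(re.escape(other), text[start.end():], flags=re.IGNORECASE)
        if nxt is not None:
            end = min(end, start.end() + nxt.start())
    # Collapse whitespace so legends from different exhibits compare cleanly.
    return " ".join(text[start.end():end].split())
```

Returning None for a missing heading keeps "no legend found" distinct from an empty legend body.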

Reconstructing the full pre-commencement communications record for one tender offer

Join SC14D9C records on subject CIK and tender-offer fileNo with the bidder-side SC TO-C dataset on the same fileNo, and pull any Form 8-K cross-references named in cover narratives that incorporate by reference. The output is a single chronological log of every filed written communication for a specific contest, used in litigation discovery prep, post-mortems, and academic event studies.
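The join can be sketched as a merge of two record lists on fileNo. This assumes the SC TO-C dataset's metadata has the same shape as this one (a filedAt timestamp and entities[].fileNo), which is not confirmed here; the accessionNo key and the function name are likewise hypothetical.

```python
from datetime import datetime


def _parsed(timestamp):
    # Accept a trailing "Z" on Python versions older than 3.11.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def communications_log(target_records, bidder_records, file_no):
    """Merge target and bidder records sharing `file_no` into one ordered log."""

    def matches(rec):
        return any(e.get("fileNo") == file_no for e in rec.get("entities", []))

    rows = [("SC14D9C", r) for r in target_records if matches(r)]
    rows += [("SC TO-C", r) for r in bidder_records if matches(r)]
    rows.sort(key=lambda row: _parsed(row[1]["filedAt"]))
    # Each row: (form type, filing timestamp, accession number if present).
    return [(form, r["filedAt"], r.get("accessionNo")) for form, r in rows]
```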

Benchmarking target messaging in pharma/biotech tender offers

Filter entities[] to SIC 2834 and 2836 and group records by subject CIK, then extract recurring themes across exhibits — pipeline value framing, milestone payments, CVR mechanics, employee retention language. The output is a comparables library used by healthcare bankers for pitch materials, by IR teams for standby playbooks, and by sector analysts for premium-adequacy benchmarking.
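The filter-and-group step could be sketched as follows, assuming entities[] items expose the SIC code under a sic key and the filer identifier under a cik key. Both key names are assumptions; the document only states that entities[] carries subject CIK, ticker, and SIC.

```python
from collections import defaultdict

PHARMA_BIOTECH_SIC = {"2834", "2836"}


def group_by_subject(records, sic_codes=PHARMA_BIOTECH_SIC):
    """Group records whose entities match `sic_codes`, keyed by subject CIK."""
    groups = defaultdict(list)
    for rec in records:
        for entity in rec.get("entities", []):
            # SIC may be stored as an int or a string; compare the 4-digit code.
            sic = str(entity.get("sic", "")).strip()[:4]
            if sic in sic_codes:
                groups[entity.get("cik")].append(rec)
                break  # count each record once, even with repeated entities
    return dict(groups)
```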

Compliance monitoring of Rule 14d-9 filing hygiene

Iterate every record, parse each EX-99.x for the two mandatory legends, and flag submissions where a legend is missing or truncated, or where filedAt falls after the communication's release date as inferred from the exhibit body. The output is a compliance dashboard scoring filers, financial printers, and outside counsel on legend adherence and same-day-filing discipline, used in internal audits and CLE training materials.
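A first-pass legend check might be as simple as the sketch below. The marker phrases are heuristics chosen for illustration, not a regulatory definition of a compliant legend, and the function assumes the exhibit text has already been stripped of HTML markup.

```python
# Loose marker phrases (assumed); real legends vary in wording and punctuation.
LEGEND_MARKERS = {
    "additional_information": "additional information and where to find it",
    "forward_looking": "forward-looking statements",
}


def missing_legends(exhibit_text):
    """Return the names of legend markers not found in the exhibit text."""
    # Normalize whitespace and case so line breaks inside a heading do not matter.
    normalized = " ".join(exhibit_text.split()).lower()
    return [name for name, marker in LEGEND_MARKERS.items() if marker not in normalized]
```

An empty list means both markers were found; anything else is a candidate for manual review rather than a confirmed deficiency.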

Training deal-event extraction and entity-linking models

Use metadata.json entities[] (subject CIK, ticker, SIC) and the fileNo as labeled anchors, paired with the SGML-wrapped HTML bodies, to train models that extract bidder names, offer prices, expiration dates, and board recommendations from unstructured pre-commencement text. The dataset's stable accession-folder layout and consistent SGML headers (<TYPE>, <SEQUENCE>, <DESCRIPTION>) make it a clean supervised-learning corpus for live-deal NLP pipelines.
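To pair labels with document bodies, the SGML envelope has to be split first. The sketch below assumes the usual EDGAR layout in which each document sits between <DOCUMENT> and </DOCUMENT>, header tags occupy one line each, and the body is wrapped in <TEXT>; the function name is hypothetical.

```python
import re

HEADER_TAGS = ("TYPE", "SEQUENCE", "DESCRIPTION")


def parse_documents(sgml):
    """Split an SGML-wrapped submission into per-document dicts."""
    docs = []
    for block in re.findall(r"<DOCUMENT>(.*?)</DOCUMENT>", sgml, flags=re.DOTALL):
        header, _, body = block.partition("<TEXT>")
        doc = {}
        for tag in HEADER_TAGS:
            # Header values run from the tag to the end of the line.
            match = re.search(rf"<{tag}>([^\n<]*)", header)
            if match:
                doc[tag.lower()] = match.group(1).strip()
        # Drop the closing </TEXT> marker if it is present.
        doc["text"] = body.rsplit("</TEXT>", 1)[0].strip()
        docs.append(doc)
    return docs
```

Filtering the result on a type that starts with "EX-99" isolates the exhibit bodies that carry the actual communications.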

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-sc14d9c-files.json

This endpoint returns dataset metadata including the name, description, last updated timestamp, earliest sample date, total record and size counters, covered form types, container format, and file types. It also lists every container file in the dataset along with each container's size, record count, updated timestamp, and direct download URL. Use this endpoint to monitor which containers were modified in the most recent refresh run and decide which files to download incrementally on a day-by-day basis. No API key is required to query this endpoint.

Example response:

{
  "datasetId": "1f13365b-9ae0-6978-a481-cbb6d7b09b5f",
  "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-sc14d9c-files.zip",
  "name": "Form SC14D9C Files Dataset",
  "updatedAt": "2026-05-06T02:51:07.241Z",
  "earliestSampleDate": "2000-01-01",
  "totalRecords": 4731,
  "totalSize": 34014686,
  "formTypes": ["SC14D9C"],
  "containerFormat": "ZIP",
  "fileTypes": ["TXT", "JSON", "HTML", "PDF"],
  "containers": [
    {
      "downloadUrl": "https://api.sec-api.io/datasets/form-sc14d9c-files/2026/2026-05.zip",
      "key": "2026/2026-05.zip",
      "size": 13818783,
      "records": 154,
      "updatedAt": "2026-05-06T02:51:07.241Z"
    }
  ]
}
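
For incremental syncs, one possible approach is to compare each container's updatedAt against the time of the previous sync. The sketch below uses only the index URL and the containers[].updatedAt field described in this section; the function names and the idea of storing a last-sync timestamp are assumptions.

```python
import json
import urllib.request
from datetime import datetime

INDEX_URL = "https://api.sec-api.io/datasets/form-sc14d9c-files.json"


def _parsed(timestamp):
    # Accept a trailing "Z" on Python versions older than 3.11.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def fetch_index(url=INDEX_URL):
    """Download and decode the dataset index (no API key needed per the docs)."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def containers_updated_since(index, last_sync):
    """Return containers whose updatedAt is later than `last_sync` (ISO 8601)."""
    cutoff = _parsed(last_sync)
    return [c for c in index.get("containers", []) if _parsed(c["updatedAt"]) > cutoff]
```

A typical call would be containers_updated_since(fetch_index(), stored_timestamp), followed by a download of each returned container.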

Download Entire Dataset: https://api.sec-api.io/datasets/form-sc14d9c-files.zip?token=YOUR_API_KEY

Use this URL to download the complete dataset as a single ZIP archive containing all SC14D9C filings from January 2000 to the latest refresh. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-sc14d9c-files/2026/2026-05.zip?token=YOUR_API_KEY

Use a per-container URL to download an individual monthly archive instead of the full dataset, which is useful for incremental syncs against the containers reported as updated by the index API. This endpoint requires an API key.
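
A per-container download can be sketched with the standard library alone. The URL pattern and the token query parameter come from the examples above; the helper names, the destination path handling, and the absence of retry logic are simplifications for illustration.

```python
import shutil
import urllib.request
from urllib.parse import quote, urlencode

BASE_URL = "https://api.sec-api.io/datasets/form-sc14d9c-files"


def month_key(year, month):
    """Build the container key for a calendar month, e.g. 2026/2026-05.zip."""
    return f"{year:04d}/{year:04d}-{month:02d}.zip"


def container_url(key, token):
    """Build the download URL for one container key."""
    return f"{BASE_URL}/{quote(key)}?{urlencode({'token': token})}"


def download_container(key, token, dest_path):
    """Stream one monthly ZIP to disk without loading it fully into memory."""
    with urllib.request.urlopen(container_url(key, token)) as resp:
        with open(dest_path, "wb") as out:
            shutil.copyfileobj(resp, out)
```

Keeping the key in the same form the index reports it makes it straightforward to feed index results directly into the download step.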

Frequently Asked Questions

What form does this dataset cover?

The dataset covers Form SC14D9C — written communications by the subject company relating to a third-party tender offer, filed under cover of Schedule 14D-9 pursuant to Rule 14d-9(a) under Section 14(d)(4) of the Securities Exchange Act of 1934. The "C" suffix marks the submission as a communication (typically pre-commencement), distinct from the formal Schedule 14D-9 solicitation/recommendation statement.

What does one record in this dataset represent?

One record corresponds to a single SC14D9C accession on EDGAR, packaged as a folder named after the 18-digit accession number with hyphens removed. Each folder contains a per-filing metadata.json plus the original EDGAR submission documents — the Schedule 14D-9 cover and any EX-99.x exhibits — wrapped in EDGAR's SGML envelope. Image files are excluded by design.
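
Joining against other EDGAR sources usually needs the hyphenated accession number. Assuming the folder name is exactly the 18 digits described above, the standard 10-2-6 grouping can be restored as in this sketch; the function name and the sample value in the comment are made up for illustration.

```python
import re


def hyphenate_accession(folder_name):
    """Convert an 18-digit folder name to the hyphenated EDGAR form.

    Example (made-up value): 000119312526123456 -> 0001193125-26-123456
    """
    if not re.fullmatch(r"\d{18}", folder_name):
        raise ValueError(f"expected 18 digits, got {folder_name!r}")
    return f"{folder_name[:10]}-{folder_name[10:12]}-{folder_name[12:]}"
```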

Who is required to file Form SC14D9C?

The subject company in a third-party tender offer files SC14D9C — that is, the issuer whose equity is being sought, not the bidder. The filer population includes U.S. domestic Exchange Act reporting companies, foreign private issuers with U.S.-registered equity subject to Section 14(d), and registered issuers such as closed-end funds, REITs, and listed limited partnerships. Bidder-side pre-commencement communications go on Schedule TO-C, not SC14D9C.

When must an SC14D9C be filed?

SC14D9C is a per-communication filing due no later than the date of first use — the same calendar day the written communication is published, sent, or disseminated to security holders. The window opens with the first written communication about an actual or anticipated third-party tender offer and closes when the subject company files its formal Schedule 14D-9, after which further communications move onto 14D-9 amendments.

What time period does the dataset cover?

The dataset includes all Form SC14D9C filings submitted to EDGAR from January 1, 2000 to present. The start date aligns with the SEC's Regulation M-A rule package (Release No. 33-7760), which created the dedicated pre-commencement communications cover used by SC14D9C.

How does this dataset differ from the Schedule 14D-9 dataset?

Schedule 14D-9 is the subject company's formal, item-by-item solicitation/recommendation statement, filed within ten business days of a tender offer's commencement and containing the board's recommendation, fairness opinions, prior contacts, and conflicts disclosure. SC14D9C is the same filer's earlier, informal channel — press releases, employee FAQs, customer letters, investor decks — wrapped in a Rule 14d-9 cover with a legend pointing holders to the forthcoming 14D-9. SC14D9C is running pre-commencement commentary; 14D-9 is the legally operative recommendation.

What file format is the dataset distributed in?

The dataset is distributed as ZIP containers, one per calendar month under a YYYY/YYYY-MM.zip path. Inside each monthly archive, per-accession folders hold the metadata.json plus the SGML-wrapped submission documents. File types found across the corpus are HTML (the dominant document body format), TXT (legacy ASCII bodies in older submissions), PDF (occasional visual-heavy attachments such as investor decks), and JSON (the per-record metadata.json).
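
Reading the per-record metadata out of a monthly archive might look like the sketch below. It assumes each metadata.json sits directly inside its accession folder, so the parent directory name is the accession number; the exact path depth inside the ZIP is an assumption to verify against a real container.

```python
import io
import json
import zipfile


def iter_metadata(zip_source):
    """Yield (accession_folder, metadata_dict) for every record in a container.

    `zip_source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if not name.endswith("/metadata.json"):
                continue
            # The folder holding metadata.json is assumed to be the accession number.
            accession = name.split("/")[-2]
            with archive.open(name) as fh:
                yield accession, json.load(fh)
```

Because the function is a generator, a large container can be scanned one record at a time without extracting it to disk.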