Form 8-K Files Dataset

The Form 8-K Files Dataset is a full-fidelity archive of every Form 8-K and Form 8-K/A current-report submission accepted by the SEC's EDGAR system, where each record is one complete EDGAR submission identified by an 18-digit accession number and packaged as an accession-number folder containing the structured submission metadata, the primary 8-K document, and every non-image exhibit and ancillary attachment filed with it. Form 8-K is the event-driven "current report" that domestic Exchange Act reporting issuers use to disclose enumerated material events — from material agreements and acquisitions to officer departures, auditor changes, earnings releases, Regulation FD disclosures, and material cybersecurity incidents — typically within four business days of the triggering event. Coverage begins on October 1, 1993, the start of mandatory EDGAR phase-in, and continues to the present, preserving every filed exhibit family (EX-2, EX-3, EX-4, EX-10, EX-16, EX-17, EX-23, EX-99, and the EX-101/EX-104 XBRL components) across the SGML/ASCII, HTML, and inline-XBRL eras. The dataset is distributed as ZIP containers and contains TXT, JSON, HTML/XHTML, PDF, XFD, and FRM file types.

Update Frequency
Daily
Updated at
2026-05-19
Earliest Sample Date
1993-10-01
Total Size
62.5 GB
Total Records
4,585,772
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF, XFD, FRM
Form Types
8-K, 8-K/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

390 files · 62.5 GB
Download All
2026-05.zip168.5 MB16,048 records
2026-04.zip193.6 MB17,185 records
2026-03.zip191.4 MB12,145 records
2026-02.zip230.2 MB13,490 records
2026-01.zip188.1 MB10,792 records
2025-12.zip177.5 MB10,138 records
2025-11.zip202.9 MB13,788 records
2025-10.zip245.2 MB12,730 records
2025-09.zip202.7 MB10,160 records
2025-08.zip207.9 MB14,102 records
2025-07.zip320.4 MB12,626 records
2025-06.zip170.4 MB11,096 records
2025-05.zip235.3 MB16,528 records
2025-04.zip190.8 MB11,353 records
2025-03.zip189.5 MB11,982 records
2025-02.zip246.9 MB13,676 records
2025-01.zip197.7 MB11,120 records
2024-12.zip164.7 MB9,888 records
2024-11.zip223.3 MB13,731 records
2024-10.zip217.6 MB12,417 records
2024-09.zip164.4 MB9,214 records
2024-08.zip211.6 MB14,184 records
2024-07.zip234.3 MB11,649 records
2024-06.zip156.4 MB10,537 records
2024-05.zip239.2 MB17,835 records
2024-04.zip250.0 MB12,370 records
2024-03.zip177.6 MB12,232 records
2024-02.zip200.4 MB14,114 records
2024-01.zip188.1 MB11,744 records
2023-12.zip166.5 MB9,970 records
2023-11.zip248.7 MB15,239 records
2023-10.zip207.6 MB12,072 records
2023-09.zip166.1 MB9,517 records
2023-08.zip250.8 MB15,987 records
2023-07.zip208.5 MB10,879 records
2023-06.zip201.3 MB11,960 records
2023-05.zip318.5 MB19,030 records
2023-04.zip211.8 MB11,414 records
2023-03.zip192.7 MB14,039 records
2023-02.zip290.4 MB14,272 records
2023-01.zip190.1 MB11,235 records
2022-12.zip158.9 MB10,546 records
2022-11.zip286.8 MB15,817 records
2022-10.zip207.2 MB11,511 records
2022-09.zip137.1 MB9,449 records
2022-08.zip254.7 MB15,781 records
2022-07.zip188.7 MB10,346 records
2022-06.zip183.4 MB10,957 records
2022-05.zip269.0 MB17,207 records
2022-04.zip209.2 MB11,806 records
2022-03.zip199.7 MB13,747 records
2022-02.zip245.0 MB13,961 records
2022-01.zip170.7 MB11,373 records
2021-12.zip192.6 MB11,890 records
2021-11.zip336.8 MB18,792 records
2021-10.zip229.1 MB12,855 records
2021-09.zip159.0 MB10,527 records
2021-08.zip255.3 MB16,240 records
2021-07.zip229.1 MB13,268 records
2021-06.zip196.9 MB13,015 records
2021-05.zip289.2 MB17,270 records
2021-04.zip211.9 MB13,465 records
2021-03.zip258.5 MB16,648 records
2021-02.zip278.8 MB16,670 records
2021-01.zip176.9 MB12,431 records
2020-12.zip190.9 MB11,819 records
2020-11.zip211.3 MB13,924 records
2020-10.zip233.2 MB13,487 records
2020-09.zip152.5 MB9,948 records
2020-08.zip228.7 MB14,405 records
2020-07.zip235.3 MB12,610 records
2020-06.zip186.9 MB12,547 records
2020-05.zip258.8 MB17,511 records
2020-04.zip202.3 MB14,816 records
2020-03.zip182.0 MB14,021 records
2020-02.zip251.4 MB13,579 records
2020-01.zip164.2 MB11,013 records
2019-12.zip200.7 MB9,758 records
2019-11.zip256.5 MB13,840 records
2019-10.zip213.2 MB12,325 records
2019-09.zip133.8 MB9,250 records
2019-08.zip226.1 MB13,325 records
2019-07.zip186.5 MB11,790 records
2019-06.zip157.7 MB10,279 records
2019-05.zip229.2 MB16,954 records
2019-04.zip175.6 MB11,900 records
2019-03.zip177.3 MB11,892 records
2019-02.zip246.2 MB13,287 records
2019-01.zip150.7 MB11,018 records
2018-12.zip259.6 MB9,241 records
2018-11.zip231.9 MB14,034 records
2018-10.zip243.0 MB13,401 records
2018-09.zip120.1 MB8,752 records
2018-08.zip253.3 MB14,135 records
2018-07.zip217.0 MB11,520 records
2018-06.zip159.7 MB10,748 records
2018-05.zip267.5 MB17,447 records
2018-04.zip189.0 MB12,005 records
2018-03.zip183.1 MB12,539 records
2018-02.zip242.2 MB14,046 records
2018-01.zip171.9 MB11,719 records
2017-12.zip183.8 MB10,553 records
2017-11.zip284.6 MB15,310 records
2017-10.zip223.8 MB12,927 records
2017-09.zip201.6 MB9,655 records
2017-08.zip306.4 MB15,112 records
2017-07.zip198.3 MB11,188 records
2017-06.zip162.0 MB11,412 records
2017-05.zip240.5 MB18,367 records
2017-04.zip184.3 MB12,242 records
2017-03.zip203.3 MB13,765 records
2017-02.zip225.8 MB13,949 records
2017-01.zip189.3 MB11,814 records
2016-12.zip215.6 MB10,829 records
2016-11.zip270.2 MB14,979 records
2016-10.zip203.9 MB12,286 records
2016-09.zip155.9 MB10,489 records
2016-08.zip231.7 MB14,922 records
2016-07.zip201.4 MB12,403 records
2016-06.zip153.8 MB11,462 records
2016-05.zip479.4 MB18,085 records
2016-04.zip192.4 MB12,995 records
2016-03.zip172.1 MB13,316 records
2016-02.zip280.9 MB14,905 records
2016-01.zip136.3 MB10,960 records
2015-12.zip153.8 MB11,186 records
2015-11.zip225.9 MB15,283 records
2015-10.zip213.4 MB13,859 records
2015-09.zip144.9 MB10,159 records
2015-08.zip202.1 MB14,235 records
2015-07.zip225.3 MB14,711 records
2015-06.zip183.9 MB12,812 records
2015-05.zip292.3 MB17,676 records
2015-04.zip214.9 MB14,690 records
2015-03.zip203.0 MB14,817 records
2015-02.zip261.5 MB15,205 records
2015-01.zip172.6 MB12,449 records
2014-12.zip154.2 MB11,558 records
2014-11.zip234.7 MB14,989 records
2014-10.zip323.6 MB15,197 records
2014-09.zip153.7 MB11,104 records
2014-08.zip249.5 MB14,475 records
2014-07.zip210.4 MB14,892 records
2014-06.zip190.9 MB12,114 records
2014-05.zip280.5 MB18,095 records
2014-04.zip344.3 MB14,733 records
2014-03.zip205.3 MB13,453 records
2014-02.zip267.6 MB15,062 records
2014-01.zip149.5 MB12,560 records
2013-12.zip173.1 MB11,363 records
2013-11.zip239.5 MB15,135 records
2013-10.zip190.7 MB14,749 records
2013-09.zip129.7 MB10,341 records
2013-08.zip220.7 MB14,511 records
2013-07.zip214.8 MB14,079 records
2013-06.zip161.3 MB11,352 records
2013-05.zip242.7 MB18,866 records
2013-04.zip177.4 MB14,671 records
2013-03.zip192.6 MB13,507 records
2013-02.zip204.0 MB14,924 records
2013-01.zip158.6 MB13,103 records
2012-12.zip175.6 MB12,092 records
2012-11.zip215.5 MB15,419 records
2012-10.zip208.2 MB14,671 records
2012-09.zip145.3 MB9,901 records
2012-08.zip257.7 MB15,456 records
2012-07.zip175.1 MB13,586 records
2012-06.zip153.4 MB11,087 records
2012-05.zip276.2 MB18,737 records
2012-04.zip170.3 MB14,216 records
2012-03.zip217.5 MB14,286 records
2012-02.zip208.8 MB15,716 records
2012-01.zip187.9 MB12,548 records
2011-12.zip151.3 MB11,370 records
2011-11.zip279.9 MB15,873 records
2011-10.zip193.2 MB13,173 records
2011-09.zip133.6 MB10,793 records
2011-08.zip200.7 MB16,104 records
2011-07.zip159.7 MB13,033 records
2011-06.zip190.0 MB13,221 records
2011-05.zip285.2 MB19,439 records
2011-04.zip188.8 MB15,060 records
2011-03.zip198.1 MB15,615 records
2011-02.zip195.7 MB16,261 records
2011-01.zip144.3 MB13,201 records
2010-12.zip152.1 MB12,761 records
2010-11.zip243.3 MB17,368 records
2010-10.zip198.2 MB14,367 records
2010-09.zip152.1 MB11,691 records
2010-08.zip200.0 MB15,588 records
2010-07.zip177.1 MB14,036 records
2010-06.zip161.7 MB12,543 records
2010-05.zip207.2 MB17,433 records
2010-04.zip214.5 MB15,741 records
2010-03.zip173.5 MB15,796 records
2010-02.zip213.1 MB15,489 records
2010-01.zip135.4 MB12,540 records
2009-12.zip192.6 MB13,202 records
2009-11.zip183.5 MB15,967 records
2009-10.zip184.9 MB15,745 records
2009-09.zip157.2 MB12,435 records
2009-08.zip181.2 MB15,369 records
2009-07.zip210.9 MB15,229 records
2009-06.zip159.9 MB12,701 records
2009-05.zip184.3 MB17,171 records
2009-04.zip160.0 MB15,175 records
2009-03.zip140.1 MB14,347 records
2009-02.zip158.1 MB15,960 records
2009-01.zip140.6 MB14,284 records
2008-12.zip149.5 MB15,243 records
2008-11.zip184.8 MB16,414 records
2008-10.zip177.7 MB17,127 records
2008-09.zip119.2 MB12,597 records
2008-08.zip182.1 MB16,359 records
2008-07.zip190.7 MB17,931 records
2008-06.zip148.3 MB13,778 records
2008-05.zip216.6 MB20,040 records
2008-04.zip203.5 MB18,508 records
2008-03.zip148.7 MB15,994 records
2008-02.zip231.9 MB19,057 records
2008-01.zip174.0 MB17,115 records
2007-12.zip181.4 MB16,100 records
2007-11.zip233.1 MB20,654 records
2007-10.zip229.6 MB20,033 records
2007-09.zip202.9 MB13,374 records
2007-08.zip252.5 MB20,824 records
2007-07.zip249.1 MB18,583 records
2007-06.zip260.1 MB16,276 records
2007-05.zip323.7 MB23,227 records
2007-04.zip285.4 MB20,149 records
2007-03.zip248.6 MB19,855 records
2007-02.zip258.8 MB20,612 records
2007-01.zip260.1 MB19,029 records
2006-12.zip214.6 MB17,427 records
2006-11.zip302.7 MB21,631 records
2006-10.zip298.3 MB21,156 records
2006-09.zip234.8 MB15,109 records
2006-08.zip274.0 MB21,453 records
2006-07.zip254.8 MB18,638 records
2006-06.zip223.0 MB17,214 records
2006-05.zip269.9 MB23,007 records
2006-04.zip265.6 MB19,896 records
2006-03.zip252.2 MB21,297 records
2006-02.zip240.7 MB20,799 records
2006-01.zip250.7 MB20,281 records
2005-12.zip267.0 MB20,880 records
2005-11.zip292.5 MB23,006 records
2005-10.zip274.5 MB22,266 records
2005-09.zip241.3 MB18,340 records
2005-08.zip239.0 MB22,374 records
2005-07.zip258.7 MB20,631 records
2005-06.zip198.2 MB18,112 records
2005-05.zip217.4 MB22,262 records
2005-04.zip218.6 MB21,839 records
2005-03.zip198.0 MB20,799 records
2005-02.zip219.3 MB20,423 records
2005-01.zip200.6 MB19,308 records
2004-12.zip197.1 MB18,710 records
2004-11.zip212.7 MB20,217 records
2004-10.zip209.6 MB21,166 records
2004-09.zip167.3 MB15,234 records
2004-08.zip171.0 MB15,736 records
2004-07.zip168.1 MB16,763 records
2004-06.zip129.0 MB12,561 records
2004-05.zip135.4 MB13,902 records
2004-04.zip160.2 MB16,618 records
2004-03.zip138.2 MB13,997 records
2004-02.zip133.4 MB14,251 records
2004-01.zip145.6 MB13,842 records
2003-12.zip116.0 MB11,212 records
2003-11.zip124.6 MB12,819 records
2003-10.zip169.8 MB16,869 records
2003-09.zip113.6 MB10,060 records
2003-08.zip113.1 MB12,365 records
2003-07.zip148.7 MB15,867 records
2003-06.zip105.6 MB10,012 records
2003-05.zip114.3 MB13,019 records
2003-04.zip129.2 MB14,755 records
2003-03.zip83.3 MB9,384 records
2003-02.zip84.5 MB8,527 records
2003-01.zip99.8 MB9,468 records
2002-12.zip81.9 MB7,933 records
2002-11.zip83.5 MB9,188 records
2002-10.zip96.1 MB9,320 records
2002-09.zip78.2 MB7,506 records
2002-08.zip84.5 MB12,238 records
2002-07.zip84.6 MB8,588 records
2002-06.zip74.6 MB7,792 records
2002-05.zip78.7 MB8,489 records
2002-04.zip82.3 MB8,149 records
2002-03.zip65.3 MB7,094 records
2002-02.zip65.0 MB6,661 records
2002-01.zip78.0 MB7,666 records
2001-12.zip62.2 MB6,528 records
2001-11.zip65.7 MB6,744 records
2001-10.zip67.2 MB7,798 records
2001-09.zip47.9 MB5,216 records
2001-08.zip67.2 MB6,962 records
2001-07.zip65.8 MB6,880 records
2001-06.zip57.3 MB6,110 records
2001-05.zip59.1 MB7,010 records
2001-04.zip56.5 MB6,731 records
2001-03.zip55.0 MB6,895 records
2001-02.zip55.8 MB6,460 records
2001-01.zip63.2 MB7,564 records
2000-12.zip53.6 MB6,123 records
2000-11.zip62.3 MB6,356 records
2000-10.zip62.3 MB6,540 records
2000-09.zip51.3 MB5,758 records
2000-08.zip49.7 MB5,757 records
2000-07.zip57.0 MB5,848 records
2000-06.zip48.7 MB5,876 records
2000-05.zip47.7 MB5,889 records
2000-04.zip48.5 MB5,513 records
2000-03.zip56.9 MB6,424 records
2000-02.zip52.0 MB6,685 records
2000-01.zip51.0 MB6,217 records
1999-12.zip62.5 MB6,419 records
1999-11.zip49.8 MB5,493 records
1999-10.zip56.0 MB5,942 records
1999-09.zip47.2 MB5,328 records
1999-08.zip51.9 MB5,427 records
1999-07.zip59.8 MB6,052 records
1999-06.zip58.1 MB5,946 records
1999-05.zip48.1 MB5,012 records
1999-04.zip55.7 MB5,751 records
1999-03.zip55.6 MB6,049 records
1999-02.zip52.6 MB5,590 records
1999-01.zip52.7 MB5,676 records
1998-12.zip64.6 MB6,471 records
1998-11.zip49.8 MB5,257 records
1998-10.zip63.2 MB6,361 records
1998-09.zip46.5 MB5,041 records
1998-08.zip49.8 MB5,209 records
1998-07.zip71.9 MB6,409 records
1998-06.zip60.2 MB6,069 records
1998-05.zip49.9 MB5,305 records
1998-04.zip63.5 MB6,064 records
1998-03.zip61.2 MB6,339 records
1998-02.zip51.6 MB5,677 records
1998-01.zip60.5 MB6,137 records
1997-12.zip66.3 MB6,168 records
1997-11.zip48.8 MB4,838 records
1997-10.zip70.7 MB6,223 records
1997-09.zip51.3 MB5,006 records
1997-08.zip42.2 MB4,719 records
1997-07.zip58.1 MB5,586 records
1997-06.zip45.7 MB4,692 records
1997-05.zip33.8 MB4,099 records
1997-04.zip44.8 MB4,814 records
1997-03.zip41.9 MB4,724 records
1997-02.zip39.5 MB4,623 records
1997-01.zip48.4 MB5,038 records
1996-12.zip44.1 MB4,418 records
1996-11.zip40.6 MB4,180 records
1996-10.zip44.8 MB4,706 records
1996-09.zip36.3 MB3,865 records
1996-08.zip32.5 MB3,575 records
1996-07.zip44.0 MB4,416 records
1996-06.zip39.1 MB3,851 records
1996-05.zip30.3 MB3,224 records
1996-04.zip19.0 MB1,847 records
1996-03.zip19.5 MB1,839 records
1996-02.zip15.8 MB1,751 records
1996-01.zip18.4 MB1,830 records
1995-12.zip15.4 MB1,602 records
1995-11.zip15.0 MB1,531 records
1995-10.zip18.6 MB1,830 records
1995-09.zip12.5 MB1,244 records
1995-08.zip13.8 MB1,359 records
1995-07.zip12.6 MB1,224 records
1995-06.zip14.5 MB1,381 records
1995-05.zip14.4 MB1,328 records
1995-04.zip11.5 MB1,146 records
1995-03.zip8.8 MB1,029 records
1995-02.zip8.0 MB916 records
1995-01.zip7.3 MB802 records
1994-12.zip8.2 MB741 records
1994-11.zip4.9 MB578 records
1994-10.zip8.0 MB825 records
1994-09.zip6.2 MB601 records
1994-08.zip6.6 MB630 records
1994-07.zip5.7 MB700 records
1994-06.zip4.6 MB506 records
1994-05.zip6.5 MB559 records
1994-04.zip4.4 MB587 records
1994-03.zip6.7 MB684 records
1994-02.zip6.8 MB660 records
1994-01.zip6.7 MB782 records
1993-10.zip5.0 KB1 records

What This Dataset Contains

A single record in the Form 8-K Files Dataset is one complete EDGAR submission of either a Form 8-K (current report) or a Form 8-K/A (amendment to a previously filed current report), uniquely identified by its 18-digit accession number. Each record is materialized on disk as one accession-number folder that bundles the structured metadata describing the filing, the primary 8-K document, and every supporting exhibit and ancillary attachment submitted in the same EDGAR package, with the sole exception of binary image attachments. The unit of observation is therefore the filing, not the issuer, not the individual reportable event, and not the individual exhibit: a single 8-K that reports several Items under Form 8-K's Item taxonomy and attaches multiple exhibits is still one record.

Form 8-K is the "current report" prescribed by Sections 13 and 15(d) of the Securities Exchange Act of 1934. It is the vehicle through which a domestic registrant discloses, on a near-real-time basis, the occurrence of one or more enumerated material events that fall outside the regular cadence of quarterly (10-Q) and annual (10-K) reporting. The form is event-triggered rather than calendar-triggered, must generally be filed with EDGAR within four business days of the triggering event, and is structured as a list of "Items," each Item corresponding to one specific category of disclosable event — entry into a material definitive agreement (Item 1.01), material cybersecurity incidents (Item 1.05), completion of acquisition or disposition (Item 2.01), results of operations and financial condition (Item 2.02), creation of a direct financial obligation (Item 2.03), changes in registrant's certifying accountant (Item 4.01), changes in control (Item 5.01), departures and elections of directors and officers and compensatory arrangements (Item 5.02), amendments to articles or bylaws (Item 5.03), submission of matters to a vote of security holders (Item 5.07), Regulation FD disclosure (Item 7.01), other events (Item 8.01), and financial statements and exhibits (Item 9.01).

Form 8-K/A is an amendment to a previously filed 8-K, most commonly used to supply Item 9.01(a) financial statements of a business acquired under Item 2.01 within the 71-calendar-day extension window allowed by the Item 9.01 instructions, or to correct, restate, or supplement an earlier disclosure. An 8-K/A retains the same accession-number lifecycle as a fresh 8-K — it carries its own accession number and is therefore its own record — and references the underlying original by date and Item. The dataset's earliest records begin October 1, 1993 with the start of mandatory EDGAR phase-in; pre-EDGAR paper 8-Ks are not part of the electronic corpus.

The body of Form 8-K itself is short. It identifies the registrant on the cover page, lists the Items being reported, narrates each Item in plain English, and incorporates exhibits by reference under Item 9.01. The substantive content typically resides in the exhibits: press releases (EX-99.1), agreements (EX-10.x), auditor letters (EX-16.x), severance and separation arrangements, pro forma financial statements (EX-99.2), engagement letters, indentures, supplemental indentures, and analyst-call slide decks.

Content Structure of a Single Record

Each record is a directory containing two layers of content:

  1. A structured metadata layer in a single metadata.json file generated from the parsed EDGAR submission header. This is the addressable, machine-readable spine of the record — it carries the accession number, the form type, the filer identifiers, the filing and period timestamps, the Items reported, and the manifest of every document that was part of the original submission.
  2. An unstructured document layer consisting of the raw EDGAR documents themselves: the primary 8-K filing document and the full set of exhibits and ancillary attachments. These are the files an investor would actually read on the EDGAR filing-detail page.

The two layers are tightly cross-referenced. Every entry in the documentFormatFiles[] and dataFiles[] arrays of metadata.json corresponds (by sequence, type, description, and filename) to a sibling file in the same folder, except for excluded image attachments. SGML headers embedded inside individual documents, where present, mirror the manifest entry exactly.

The metadata.json spine

Every accession folder contains exactly one metadata.json. Top-level fields include:

  • formType: "8-K" or "8-K/A".
  • accessionNo: dashed accession number, e.g. "0000950170-25-087681".
  • id: a 32-character hex identifier serving as the record id within the dataset.
  • linkToFilingDetails: URL of the primary 8-K document on sec.gov.
  • linkToTxt: URL of the complete .txt SGML submission on EDGAR.
  • linkToHtml: URL of the EDGAR filing-index page.
  • linkToXbrl: URL of the XBRL instance, when present (frequently empty for 8-Ks because the form's tagging mandate covers only the cover page rather than financial statements).
  • description: a human-readable summary that typically enumerates the Items reported, e.g. "Form 8-K - Current report - Item 1.01 Item 2.03 Item 9.01".
  • filedAt: ISO-8601 acceptance datetime with timezone offset.
  • periodOfReport: YYYY-MM-DD date of the earliest event reported (the "as of" date on the cover page).
  • items: an array of fully spelled-out Item labels triggered by the filing, e.g. "Item 5.02: Departure of Directors or Certain Officers; Election of Directors; Appointment of Certain Officers; Compensatory Arrangements of Certain Officers". This is the canonical structured handle on which 8-K Items the filing invokes.
  • documentFormatFiles: an array of attachment manifest entries (primary doc, exhibits, complete-submission text, graphics), each carrying sequence, size in bytes (as a string), documentUrl, description, and type (e.g. "8-K", "EX-10.1", "EX-99.1", "EX-16.1", "GRAPHIC").
  • dataFiles: an array of XBRL-related ancillary files in the same shape as documentFormatFiles, holding the schema (EX-101.SCH), label (EX-101.LAB), presentation (EX-101.PRE), calculation (EX-101.CAL), definition (EX-101.DEF), and any extracted instance documents.
  • entities: an array of one or more parties on the filing (filer, subject company, filed-by entity). Each entity carries companyName with a role suffix in parentheses (e.g. "Koppers Holdings Inc. (Filer)"), cik, tickers, type, act ("34" for the Exchange Act), fileNo, filmNo, irsNo, fiscalYearEnd as MMDD, stateOfIncorporation, and sic (the SIC code with industry label).
  • seriesAndClassesContractsInformation: an array populated for fund filers; ordinarily empty for operating-company 8-Ks.

The primary 8-K document

The canonical landing document of the filing is the primary 8-K. It is the file the EDGAR filing-detail page links to first and the file whose type in documentFormatFiles[] is "8-K" (or "8-K/A"). Its internal anatomy mirrors the form itself:

  • A cover page identifying the registrant by exact legal name, jurisdiction of incorporation, IRS Employer Identification Number, SEC file number, principal executive office address, telephone number, and former name(s) if changed since the last report. The cover page also carries the four "written communications" check-boxes (Rule 425, Rule 14a-12, Rule 14d-2(b), Rule 13e-4(c)) for pre-commencement communications around mergers, proxy solicitations, tender offers, and issuer tender offers; the post-Dodd-Frank "Emerging Growth Company" designation along with the Section 107(b) accounting-standards-transition election; and the Section 12(b) title-of-each-class / trading-symbol / name-of-each-exchange table mandated by the August 2017 amendments to General Instruction B.
  • A list of Items being reported, given by Item number and Item title in the order specified by the form.
  • Per-Item narrative blocks, each opening with the Item heading and proceeding in plain English to describe the event, its terms, and its anticipated effect on the registrant. Item 9.01 is structurally distinct: it serves as the index of attached exhibits and typically consists of a tabular list mapping exhibit number to exhibit description, with little or no narrative.
  • A signature block containing the registrant's exact legal name, the name and title of the signing officer (most commonly a CFO, General Counsel, Corporate Secretary, or Principal Accounting Officer), and the date of signature.

In modern filings the primary document is delivered as an Inline XBRL (iXBRL) XHTML file rather than a plain HTML file. The <html> root declares the xmlns:ix="http://www.xbrl.org/2013/inlineXBRL", xmlns:dei="http://xbrl.sec.gov/dei/...", and xmlns:xbrli="http://www.xbrl.org/2003/instance" namespaces, and an <ix:header> block at the top of the body carries the iXBRL contexts, units, and references. Cover-page facts — registrant name, CIK, document type, period of report, trading symbol, exchange, emerging-growth-company status, and the four pre-commencement check-box states — are tagged inline via <ix:nonNumeric> and <ix:nonFraction> elements wrapping the human-readable cover-page text. iXBRL tagging on Form 8-K is confined to the cover page; the per-Item narrative body and the exhibits are not XBRL-tagged.

Exhibits

Below the primary document sit zero or more exhibit files, each a separate document in the accession folder. The exhibit-type vocabulary used in 8-K filings is governed by Item 601 of Regulation S-K and recurs in metadata.json -> documentFormatFiles[].type. The most common exhibit families on 8-K filings are:

  • EX-1.x: underwriting agreements.
  • EX-2.x: plans of acquisition, reorganization, arrangement, liquidation, or succession (typically filed under Item 1.01 / Item 2.01).
  • EX-3.x: amended articles of incorporation or bylaws (typically filed under Item 5.03).
  • EX-4.x: instruments defining the rights of security holders, including indentures and supplemental indentures (typically filed under Item 1.01 / Item 2.03).
  • EX-10.x: material contracts — credit agreements, employment agreements, separation agreements, equity-award agreements, lease agreements (commonly filed under Item 1.01, Item 2.03, or Item 5.02).
  • EX-16.x: letters from former independent accountants concurring or disagreeing with disclosures made under Item 4.01.
  • EX-17.x: letters of resignation from directors (filed under Item 5.02).
  • EX-23.x: consents of experts and counsel.
  • EX-99.x: additional exhibits, most commonly press releases (EX-99.1) and supplemental or pro forma financial information (EX-99.2 / EX-99.3) filed or furnished under Item 2.02, Item 7.01, or Item 9.01.
  • EX-104: cover-page interactive data file (the pointer to the iXBRL-tagged cover page on modern filings).
  • EX-101.SCH / .LAB / .PRE / .CAL / .DEF: XBRL taxonomy components, listed in dataFiles[] rather than documentFormatFiles[].

Each exhibit is delivered as either a bare HTML/XHTML document or, for some attachments, an HTML body wrapped in the original EDGAR SGML <DOCUMENT> envelope. The SGML wrapper preserves the four-line submission header (<TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>) followed by a <TEXT> body containing the actual HTML; those four header values mirror the corresponding entry in metadata.json -> documentFormatFiles[]. PDF and XFD attachments occur occasionally for older or specialized exhibits (auditor letters, rendered presentations, paper-form FDS submissions); FRM files appear as legacy form-data attachments. The dominant on-disk file types in modern filings are HTM/HTML for documents and JSON for the manifest.

Filename conventions

Filenames vary by filer and filing agent but follow a small number of recurring patterns:

  • <ticker>-<YYYYMMDD>.htm for the primary 8-K, where the date is the period of report.
  • <ticker>-ex<N>_<M>.htm, ex<N>-<M>.htm, <filer>_ex<N>-<M>.htm, or exhibit<N><M>.htm for numbered exhibits.
  • metadata.json for the per-folder structured manifest, always present.

The accession-number folder name itself is the 18-digit packed accession number (no dashes), which uniquely identifies the filing and matches the dashed form in metadata.json -> accessionNo.

Included content

Each record includes:

  • The full structured submission-header metadata, as metadata.json.
  • The primary 8-K document, in its filed form (typically iXBRL XHTML for modern filings, plain HTML for older ones, plain text wrapped in SGML for the earliest filings).
  • Every exhibit listed under Item 9.01 and every other document filed in the same EDGAR submission, including legal exhibits (EX-2, EX-3, EX-4, EX-10, EX-16, EX-17, EX-23) and additional exhibits (EX-99.x).
  • The XBRL taxonomy components and extracted XBRL instance data when present, listed via dataFiles[].
  • The cover-page interactive data file (EX-104) when present.

The file types found in the dataset are TXT, JSON, HTML/XHTML, PDF, XFD, and FRM, with HTML/XHTML and JSON dominating modern records.

Excluded or separately structured content

Image attachments (JPG, PNG, GIF) referenced by the primary document and exhibits are intentionally excluded from the dataset, even though metadata.json -> documentFormatFiles[] continues to list them with type: "GRAPHIC". Where an exhibit's text or layout depends on inline images (logos, signature scans, charts, photographs in press releases), the HTML retains the <img> references but the binary image targets are not present in the folder. Hyperlinked external content — websites, third-party press-release portals, transcripts hosted off-EDGAR — is also not retrieved, since it was never part of the EDGAR submission. The dataset confines itself to what was filed.

Changes in required content and structure over time

Form 8-K's content requirements have changed materially across the dataset's three-decade span, and the structure of records reflects those changes:

  • Pre-2004 regime. Until August 23, 2004, Form 8-K had a much smaller and more loosely defined Item set (Items 1 through 9, covering changes in control, acquisitions/dispositions, bankruptcy, change in auditors, resignations of directors, and Regulation FD disclosure). Filing was generally due within five business days, or fifteen calendar days for certain Items. Records from this era have shorter cover pages, no pre-commencement communications check-boxes, no Section 12(b) trading-symbol table, and Item numbering that does not match the modern decimal scheme.
  • 2004 expansion. The SEC's August 2004 final rule "Additional Form 8-K Disclosure Requirements and Acceleration of Filing Date" reorganized the form into the modern numbered Item structure (Items 1.01 through 9.01 grouped under Sections 1 through 9), added several new Items (notably Item 1.01 entry into a material definitive agreement, Item 2.03 creation of a direct financial obligation, Item 2.04 triggering events that accelerate or increase a direct financial obligation, and the expanded Item 5.02), and accelerated the filing deadline to four business days. Records on or after September 2004 carry the modern Item taxonomy seen in metadata.json -> items[].
  • Pre-commencement communications check-boxes. Following the 1999/2000 amendments to Rules 425, 14a-12, 14d-2, and 13e-4, cover pages began carrying the four check-boxes for written communications made in connection with mergers, proxy solicitations, tender offers, and issuer tender offers. These boxes are now a standard part of the iXBRL cover-page tagging.
  • Dodd-Frank Item 5.07. Beginning in 2011, registrants disclose under Item 5.07 stockholder votes including non-binding say-on-pay and say-on-frequency results.
  • JOBS Act emerging-growth-company designation. From 2012 onward, the cover page carries the EGC check-box and the related accounting-standards-transition election.
  • August 2017 cover-page amendments. The SEC's 2017 final rule required the title-of-each-class / trading-symbol / name-of-each-exchange table on the cover page of Form 8-K. Records filed on or after November 5, 2017 carry this table.
  • Cybersecurity Item 1.05. The SEC's July 2023 final rule on cybersecurity disclosure added Item 1.05 "Material Cybersecurity Incidents," generally effective December 2023. Records reporting a material cybersecurity incident on or after that date list Item 1.05 in items[].
  • Item 9.01 financial-statement amendments. The 71-day extension window for 8-K/A filings reporting financial statements of an acquired business under Item 9.01(a) is a recurring driver of 8-K/A volume; these amendments are flagged by formType = "8-K/A".

Changes in data format over time

The substantive Item structure changed materially over the dataset's span, and so did the file-format envelope of the underlying EDGAR submission:

  • 1993-2001: ASCII / plain-text era. The earliest 8-K filings (October 1993 onward) were submitted as ASCII text wrapped in EDGAR SGML envelopes. The complete-submission .txt carries <DOCUMENT> blocks for the primary 8-K and each exhibit, each with the four-line <TYPE> / <SEQUENCE> / <FILENAME> / <DESCRIPTION> header followed by a <TEXT> body of plain ASCII. There was no HTML layout, no inline tagging, and no XBRL.
  • 2001-2009: HTML adoption. EDGAR began accepting HTML primary documents and HTML exhibits, and registrants progressively migrated from ASCII to HTML. SGML <DOCUMENT> wrappers continued to carry the per-document headers, but the <TEXT> body increasingly held HTML.
  • 2009-2018: financial-statement XBRL elsewhere, not on 8-K. XBRL adoption during this period was driven by 10-K/10-Q financial-statement tagging; Form 8-K itself was not a primary XBRL-tagged form, and linkToXbrl is therefore frequently empty for 8-K records of this era.
  • 2019-present: inline XBRL on the cover page. The SEC's June 2018 final rule "Inline XBRL Filing of Tagged Data" required registrants to tag the cover-page data of Form 8-K (registrant name, CIK, document type, period, trading-symbol table, EGC status, pre-commencement check-boxes) using inline XBRL embedded in the primary HTML document. Phase-in began in 2019 for large accelerated filers and concluded in 2021 for smaller reporting companies. The EX-104 cover-page interactive data exhibit and the dataFiles[] array on 8-Ks were introduced as part of this regime. Modern primary 8-K documents are therefore XHTML files with <ix:header> blocks and <ix:nonNumeric> / <ix:nonFraction> elements on the cover page; the body of the filing below the cover page remains conventional HTML.

The dataset preserves all of these formats as filed: a 1995 record will be a single SGML/ASCII text submission, a 2010 record will be an HTML primary document with HTML exhibits, and a 2025 record will be an iXBRL XHTML primary document with HTML exhibits. Consumer code must therefore be tolerant of the range of encodings and of the optional SGML envelope around individual documents.

Interpretation and extraction notes

  • Amendments. Records with formType = "8-K/A" are amendments and are linked back to the original 8-K by registrant CIK and approximate filing date; the dataset does not store an explicit pointer from amendment to original, but the periodOfReport value and the registrant's prior filings on EDGAR establish the linkage.
  • Items as the canonical event index. The items[] array in metadata.json is the cleanest structured handle on what the filing is reporting. The Item numbers and labels are normalized to the modern decimal taxonomy. The free-text description field contains the same Item numbers in shorter form.
  • Exhibits as substantive payload. For most analytical purposes the substantive content of an 8-K lives in its exhibits, not in its primary document. Press releases (EX-99.1) carry earnings announcements, deal announcements, and management transitions; agreements (EX-10.x) carry the full contract terms; auditor letters (EX-16.x) carry the former auditor's response under Item 4.01.
  • Incorporation by reference. Item bodies frequently incorporate the attached exhibits by reference rather than restating their content, so reading the primary document alone systematically understates the disclosure. The full record must be read to recover the substantive event description.
  • Furnished vs. filed. Items 2.02 (results of operations) and 7.01 (Regulation FD disclosure), and any exhibits attached solely under those Items, are "furnished" rather than "filed" for Section 18 purposes. The dataset does not distinguish furnished from filed Items beyond the Item number itself, but the distinction is implicit in which Item is reported.
  • iXBRL extraction. The cover-page facts on modern primary documents can be extracted by parsing <ix:nonNumeric> and <ix:nonFraction> elements; the same facts are also available structurally in metadata.json -> entities[] and on the cover-page EX-104 interactive data file.
  • SGML-wrapper tolerance. Some exhibits arrive bare (an <html>...</html> document) and some arrive wrapped in an SGML <DOCUMENT> envelope. Parsers must detect and strip the envelope when present, otherwise downstream HTML parsers will choke on the leading <DOCUMENT> / <TYPE> / <SEQUENCE> / <FILENAME> / <DESCRIPTION> / <TEXT> lines.
  • Multiple entities on a single 8-K. Ordinary 8-Ks have a single filer entity, but certain transactional 8-Ks carry multiple entities in entities[] (filer plus subject company plus filed-by). Each entity object carries its own role-suffixed companyName, cik, and identifiers.
  • Excluded graphics. Because image binaries are excluded, any visual or photographic content in press-release exhibits (charts, product photos, signature scans, logos) is not present in the record; the surrounding HTML and <img> references remain intact for context.
  • Period of report vs. filing date. periodOfReport is the date of the earliest event being disclosed, not the filing date. filedAt is the EDGAR acceptance datetime. The four-business-day rule means these typically sit within a week of each other, but Item 9.01 amendments on an 8-K/A may extend the gap by up to 71 calendar days.

Who Files or Publishes This Dataset, and When

Each record in this dataset is a current report submitted to the SEC by a U.S. domestic Exchange Act reporting issuer. The legal filer is the registrant itself, acting through its officers and counsel. One accession equals one Form 8-K (or amending 8-K/A) covering one or more enumerated triggering events at that registrant, packaged on EDGAR with its cover-page items, narrative body, and exhibits.

The filing population consists of issuers subject to Section 13(a) or Section 15(d) of the Securities Exchange Act of 1934, including:

  • Domestic operating companies with a class of securities registered under Section 12(b) (typically exchange-listed) or Section 12(g) (over-the-counter above thresholds).
  • Domestic issuers reporting under Section 15(d) because of an effective Securities Act registration statement, even with no Section 12 class.
  • Subsidiary issuers of registered debt that remain Exchange Act reporters in their own right.
  • Asset-backed issuers, which file under the tailored Item 6.0X series.
  • Successor and post-merger surviving registrants that have assumed an Exchange Act reporting obligation.

Form 8-K does not branch by issuer size: smaller reporting companies, emerging growth companies, accelerated filers, and large accelerated filers all use the same form.

When the record is created or required

Form 8-K is event-driven, not periodic. A filing exists only when a specific item in the form is triggered by an actual corporate event. The items group those events into numbered sections:

  • Section 1: Business and operations (material agreements, bankruptcy, mine safety, cybersecurity incidents).
  • Section 2: Financial information (acquisitions or dispositions, results of operations, direct financial obligations, exit costs, material impairments).
  • Section 3: Securities and trading markets (delisting, unregistered equity sales, modifications of holder rights).
  • Section 4: Accountants and financial statements (auditor changes, non-reliance on prior financials).
  • Section 5: Corporate governance and management (control changes, director and officer changes, charter or bylaw amendments, fiscal-year change, code-of-ethics waivers, shareholder vote results).
  • Section 6: Asset-backed securities (servicer or trustee changes, distribution failures, credit-enhancement changes, ABS updating disclosure).
  • Section 7: Regulation FD disclosures.
  • Section 8: Other events (voluntary disclosures the registrant deems important).
  • Section 9: Financial statements and exhibits.

A single 8-K may invoke multiple items when one underlying event has several disclosure consequences (for example, a CEO departure that also involves a separation agreement and a successor appointment).

The default deadline is four business days after the triggering event, with "occurrence" defined item by item: execution date for an Item 1.01 agreement, notification date for an Item 5.02 officer departure, issuance date for an Item 2.02 earnings release. Specialized timing applies to several items:

  • Item 5.07 (vote results) is generally due within four business days of the meeting, amended when contested-election results become final.
  • Item 1.05 (material cybersecurity incidents, added 2023) runs from the materiality determination, not from discovery, with a national-security delay available on Attorney General determination.
  • Item 7.01 Regulation FD disclosures must be simultaneous with intentional selective disclosure, or made promptly (no later than 24 hours or the next trading day) after a non-intentional one.

Some items are "furnished" rather than "filed" for purposes of Section 18, most notably Item 2.02 and Item 7.01. Furnished material avoids Section 18 liability and is not auto-incorporated into Securities Act registration statements unless the registrant elects. Both filed and furnished submissions appear in this dataset under form types 8-K and 8-K/A.

Form 8-K/A is used to amend an earlier 8-K. The most common pattern is the Item 9.01 financial-statements and pro-forma amendment after a completed business acquisition under Item 2.01, which is permitted up to 71 calendar days after the initial report. Other amendments correct or supplement prior disclosures and have no fixed deadline beyond the general duty to amend promptly.

Important distinctions

  • The registrant is always the legal filer. Officers, directors, acquirers, departing executives, auditors, and counterparties may appear in the disclosure but do not themselves file the 8-K. Their own reporting goes on Forms 3, 4, and 5 (insider transactions) or Schedules TO, 13D, and 13G (third-party acquisitions and beneficial ownership).
  • Foreign private issuers do not file Form 8-K; they furnish Form 6-K, which is excluded from this dataset. Canadian MJDS issuers on Form 40-F likewise use Form 6-K for interim disclosures.
  • Registered investment companies generally use Forms N-CSR, N-PX, N-CEN, and related ICA forms rather than 8-K. Closed-end funds and BDCs operating as Exchange Act reporters do file 8-K and are included.
  • Voluntary Item 8.01 filings are discretionary, but the four-business-day convention is generally observed and antifraud rules apply once the issuer chooses to disclose.
  • Form 8-K does not satisfy periodic reporting on Form 10-K or 10-Q. The current and periodic regimes run independently; the same disclosure may appear in both.
  • A late 8-K does not automatically cost an issuer Form S-3 eligibility: General Instruction I.A.3(b) of Form S-3 preserves eligibility for specified late items, though the underlying Section 13(a) obligation still applies.

How This Dataset Differs From Similar Datasets or Filings

The Form 8-K Files Dataset overlaps with three families: periodic disclosure forms (10-K, 10-Q), foreign-issuer current reports (6-K), and narrower products that extract items, exhibits, or press releases from the same 8-K corpus. Adjacent insider and ownership filings (Forms 3/4/5, Schedules 13D/G) occasionally cover related events but differ in filer and structure.

Form 10-K and 10-Q (Periodic Reports). Periodic, calendar-driven, and broad. The 10-K carries audited annual financials, MD&A, risk factors, and governance; the Form 10-Q carries unaudited quarterly statements. The 8-K is event-driven and narrow, due within four business days of a triggering event. Earnings often appear first as an 8-K Item 2.02 press release, then in the 10-Q's structured statements. Use 8-K for the date and characterization of a discrete event (e.g., Item 5.02 for an officer departure); use 10-K/10-Q for audited or XBRL-tagged financial detail.

Form 6-K (Foreign Private Issuers). The foreign-issuer analogue to 8-K, used to furnish material information already disclosed abroad. The 8-K Files Dataset excludes 6-K: filer populations differ (domestic registrants vs. FPIs) and triggers differ (6-K keys off home-country publication; 8-K keys off an enumerated item list). Cross-jurisdiction event studies require both; US-only studies need only 8-K.

10-K/A and 10-Q/A (Amendments). Parallel to the 8-K/A filings included here, but typically used for restatements or material corrections. 8-K/A is most often used to supply information unavailable at the original four-day deadline, especially the financial statements required by Item 9.01 after a completed acquisition.

Schedule 13D / 13G (Beneficial Ownership). Filed by acquirers crossing the 5 percent threshold. Overlap with 8-K is limited to change-in-control events triggering Item 5.01, and the viewpoint differs: 13D/G is the holder's perspective, 8-K Item 5.01 is the issuer's. For position accumulation, use 13D/G; for issuer characterization of control changes, use 8-K.

Forms 3, 4, and 5 (Insider Transactions). Structured, per-transaction reports by insiders. They sometimes correlate with 8-K Item 5.02 officer changes but answer different questions: Forms 3/4/5 give tabular trading data by individuals; 8-K gives narrative issuer disclosure of the underlying corporate event.

Item-level structured 8-K datasets. Downstream products that parse 8-Ks into one row per item (1.01, 2.02, 5.02, 7.01, 8.01, etc.) with normalized fields and event tags. The Files Dataset is the upstream source: complete original EDGAR submissions per accession, including the primary document, all exhibits, and the metadata file. Use structured extracts for event studies on a fixed schema; use the Files Dataset when full text, exact exhibits, or uncovered items are needed.

Item-specific extract datasets (1.01, 2.02, 5.02, 7.01, etc.). Single-item slices for material agreements, earnings releases, officer changes, or Reg FD disclosures. The Files Dataset is a superset spanning every item, every amendment, and every exhibit from October 1993 onward, which is required for cross-item analyses, co-occurring item flags, and items without dedicated extracts.

Exhibits-only and press release datasets. Products that isolate Exhibit 99.1 press releases or Exhibit 10 contracts attached to 8-Ks. These are slices of the same submissions but typically drop the link between an exhibit and the specific item under which it was furnished or filed. The Files Dataset preserves that relationship; exhibits-only products are lighter when only bulk press release text is needed.

Boundary summary

The Form 8-K Files Dataset is distinguished by four properties that no neighbor replicates together: (1) full-fidelity preservation of the original EDGAR submission per accession, including primary document, non-image exhibits, and metadata; (2) coverage of all items and amendments from October 1993 to present; (3) event-driven cadence on a four-business-day window, unlike any periodic form; and (4) scope limited to domestic registrants, separating it from 6-K. Use it when the question concerns timely material events from US issuers and the analysis needs the full filing. Use 10-K for audited financials, 10-Q for quarterly statement detail, 6-K for foreign issuers, Forms 3/4/5 for insider trades, and Schedules 13D/G for ownership accumulation. It complements rather than replaces item-level and exhibit-level extracts, which trade fidelity for schema convenience.

Who Uses This Dataset

The Form 8-K corpus is the primary record of material corporate events between periodic reports. Different roles read different items and exhibits, but all use the same underlying filings.

Event-driven analysts and special-situations desks

Used as the canonical source of inter-quarter disclosures for trading. Focus on Items 1.01, 2.01, 2.02, Item 2.05, Item 4.02, 5.02, and 8.01, with Exhibit 99.1 press releases and underlying agreements (2.x, 10.x). Filing timestamps and item tags drive event chronologies, post-event drift analysis, restatement watch lists, and merger-arbitrage sizing.

Quantitative researchers

Ingest the full 1993-onward corpus to engineer event features. They rely on accession numbers, precise filing timestamps, item-number tagging, and machine-readable exhibit text to build factor signals, drift models, restatement risk scores, and tone-shift features from earnings exhibits.

Forensic accountants and restatement researchers

Concentrate on Item 4.01, Item 4.02, and Exhibit 16 auditor letters. They reconstruct auditor turnover, disagreements, and restatement triggers, linking 8-Ks to subsequent 10-K/A and 10-Q/A filings to support case files, expert reports, and audit-quality studies.

Securities lawyers and disclosure counsel

Benchmark drafting across peer 8-Ks: Item 1.01 agreement summaries, Item 5.02 separation terms, Item 8.01 voluntary disclosures, and forward-looking-statement blocks. The corpus feeds precedent libraries, trigger analyses, and assessments of timeliness and completeness.

M&A and transactional diligence teams

Mine Items 1.01 and 2.01 plus the underlying merger, asset-purchase, credit, and indenture agreements filed as Exhibits 2.x, 10.x, and 4.x. Used to extract conditions precedent, MAC clauses, termination and break fees, and financing contingencies for deal-term comparables and negotiation positions.

Credit analysts

Watch Items Item 1.03, 2.03, 2.04, Item 3.03, and Item 8.01 covenant waivers and amendments for events that change credit risk between covenant reporting dates. Cross-reference textual disclosure with the indenture or credit agreement exhibit for covenant monitoring, default-watch lists, and recovery analysis.

Governance and executive compensation analysts

Parse Items 5.02, 5.03, 5.07, and Item 5.08 to extract appointment and separation terms, equity grants, bylaw amendments, and vote tallies. Output supports proxy-season notes, severance benchmarking, and board-composition tracking.

Compliance and surveillance teams

Route filings by item code: Item 5.02 updates insider lists, Items 1.03 and 2.04 affect counterparty exposure, Item 4.02 may trigger restricted-list status, and Item 7.01 informs Regulation FD reviews. The dataset drives automated alerts and conflict checks.

Data engineering teams

At vendors and large buy- and sell-side firms, engineers use the metadata file, accession numbers, exhibit indexes, and consistent multi-decade file mix to populate event-history tables and feed internal event APIs and search interfaces.

LLM and RAG developers

Build retrieval and extraction systems on a long-history, item-tagged corpus. Full filing bodies and varied exhibit formats (TXT, HTML, PDF, JSON) support fine-tuning, evaluation, and citation grounding for summarization, classification, and QA over current reports.

Investor relations and disclosure benchmarking

In-house IR and outside disclosure advisers compare peer practice on Item 2.02 earnings exhibits, Item 7.01 Regulation FD presentations, Item 5.02 transition language, and forward-looking-statement updates to inform drafting checklists and investor-day planning.

Academic and policy researchers

Use near-complete 1993-onward coverage and exhibit text for event studies, executive-turnover work, disclosure-timeliness research, and analyses of structural changes after Form 8-K rule amendments.

Activist and shareholder-engagement researchers

Track governance triggers: poison pills (Items 3.03, 8.01), board changes (5.02), bylaw amendments (5.03), shareholder settlements (1.01), and contested vote outcomes (5.07), tying disclosure text to the filed agreement or amended bylaws to plan and evaluate campaigns.

Specific Use Cases

The Form 8-K Files Dataset supports a small set of recurring, concrete workflows built on accession-level metadata, item tags, and full exhibit text.

  • Item 4.01 auditor-change monitoring. Filter records where metadata.json -> items[] contains Item 4.01, then pull the EX-16.x letter from the former independent accountant in the same accession folder. Used to maintain auditor-turnover watch lists, flag disagreements that may precede a restatement, and link the 8-K to the registrant's subsequent 10-K/A or 10-Q/A.

  • Item 2.02 earnings-release event studies. Stream all filings tagged with Item 2.02 across the 1993-onward corpus, extract the EX-99.1 press release text, and align filedAt timestamps with intraday returns. Drives post-earnings drift research, tone-shift signals, and surprise-vs-guidance benchmarks where the 10-Q's structured statements arrive weeks later.

  • Item 1.01 / EX-10.x material-agreement comparables. Pull every accession reporting Item 1.01 along with its EX-2.x, EX-4.x, and EX-10.x attachments to build a corpus of credit agreements, merger agreements, and indentures. Used by M&A and credit teams to extract MAC clauses, termination fees, covenant packages, and financing conditions for deal-term comparables.

  • Item 5.02 executive-transition tracking. Combine the Item 5.02 narrative in the primary 8-K with attached separation, employment, and equity-award exhibits (EX-10.x, EX-17.x). Feeds severance benchmarking, board-composition timelines, insider-list updates for compliance surveillance, and trigger flags for forensic reviews of officer departures clustered with restatements.

  • Item 1.05 cybersecurity-incident corpus. Filter post-December-2023 records carrying Item 1.05 to assemble the full population of material cybersecurity incident disclosures. Supports incident-rate benchmarking, peer-language comparison on materiality determinations, and timeliness studies against the four-business-day deadline.

  • 8-K/A acquisition financial-statement linkage. Identify records with formType = "8-K/A" and Item 9.01, then match back to the original Item 2.01 8-K by registrant CIK and periodOfReport. Used to measure the lag inside the 71-day window, recover the EX-99.x audited target financials and pro formas, and complete acquisition-history datasets for diligence and post-deal performance studies.

  • RAG and extraction grounding over current reports. Index the per-accession folder — metadata.json plus primary iXBRL document plus exhibits — as the retrieval unit, with items[] as a structured filter. Powers LLM workflows for current-report summarization, item classification, and citation-grounded QA where answers must point back to a specific exhibit within a specific accession.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-8k-files.json This endpoint returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total records, total size, covered form types, container format, and file types), the full dataset download URL, and the complete list of monthly container files. Each container entry includes its key, size in bytes, record count, updated timestamp, and direct download URL. Poll this endpoint to detect which containers were refreshed in the latest run and selectively download only those archives on a daily basis. No API key is required to call this endpoint.

Example response:

Example
1 {
2 "datasetId": "1f1333bd-dbdd-6a50-bdf2-2106570945e7",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-8k-files.zip",
4 "name": "Form 8-K Files Dataset",
5 "updatedAt": "2026-04-29T03:00:29.691Z",
6 "earliestSampleDate": "1993-10-01",
7 "totalRecords": 4566293,
8 "totalSize": 62254196451,
9 "formTypes": ["8-K", "8-K/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF", "XFD", "FRM"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-8k-files/2026/2026-04.zip",
15 "key": "2026/2026-04.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-29T03:00:29.691Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-8k-files.zip?token=YOUR_API_KEY Downloads the complete Form 8-K Files dataset as a single ZIP archive covering all filings from October 1993 to the present. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-8k-files/2026/2026-04.zip?token=YOUR_API_KEY Downloads a single monthly container archive instead of the full dataset, which is useful for incremental updates or targeted backfills. This endpoint requires an API key.

Frequently Asked Questions

What forms does this dataset cover?

The dataset covers Form 8-K (current report) and Form 8-K/A (amendment to a current report) submissions filed on EDGAR by domestic Exchange Act reporting issuers. It does not include Form 6-K, which is the foreign-private-issuer analogue, nor any periodic forms such as Form 10-K or Form 10-Q.

What does one record in this dataset represent?

One record is one complete EDGAR submission, identified by an 18-digit accession number and materialized as one accession-number folder. The folder bundles a single metadata.json file, the primary 8-K document, and every non-image exhibit and ancillary attachment that was part of the same EDGAR package. A single 8-K that reports several Items and attaches multiple exhibits is still one record.

Who is required to file Form 8-K?

U.S. domestic issuers subject to Section 13(a) or Section 15(d) of the Securities Exchange Act of 1934 — including domestic operating companies registered under Section 12(b) or 12(g), Section 15(d) reporters, subsidiary debt issuers, asset-backed issuers, and successor registrants. Foreign private issuers furnish Form 6-K instead and are not part of this dataset; registered investment companies generally use the N-series forms, although closed-end funds and BDCs that are Exchange Act reporters do file 8-K.

When must Form 8-K be filed?

The default deadline is four business days after the triggering event. Item-specific timing applies to several Items: Item 5.07 (vote results) runs from the meeting, Item 1.05 (material cybersecurity incidents) runs from the materiality determination, and Item 7.01 (Regulation FD) requires simultaneous or prompt disclosure. Form 8-K/A amendments supplying Item 9.01 financial statements after an acquisition are permitted up to 71 calendar days after the original report.

What time period does the dataset cover?

The earliest records begin on October 1, 1993, the start of mandatory EDGAR phase-in, and coverage continues to the present. Pre-EDGAR paper 8-Ks are not included. The dataset preserves filings as filed across the SGML/ASCII era (1993-2001), the HTML era (2001-2018), and the inline-XBRL cover-page era (2019-present).

What file format is the dataset distributed in, and what file types do records contain?

The dataset is distributed as ZIP container archives, organized into monthly containers that can be downloaded individually or as a single full-dataset archive. Inside the records, the file types found are TXT, JSON, HTML/XHTML, PDF, XFD, and FRM, with HTML/XHTML and JSON dominating modern records. Every accession folder contains exactly one metadata.json file as its structured spine.

How does this dataset differ from item-level structured 8-K datasets?

Item-level structured datasets parse 8-Ks into one row per Item with normalized fields and event tags. The Form 8-K Files Dataset is the upstream source — complete original EDGAR submissions per accession, including the primary document, all non-image exhibits, and the metadata file. Use structured extracts for event studies on a fixed schema; use the Files Dataset when full text, exact exhibits, cross-item analyses, or coverage of Items without dedicated extracts is required.