SEC Form 4 Files Dataset

The Form 4 Files Dataset contains the complete filing packages for every Form 4 and Form 4/A submitted to SEC EDGAR from January 1996 to the present. Each record represents a single EDGAR submission — one insider's Statement of Changes in Beneficial Ownership of Securities — and includes the primary XML ownership document, a filing-level metadata JSON file, a pre-rendered HTML view, and any exhibit attachments such as powers of attorney. Form 4 is filed by officers, directors, and ten-percent beneficial owners of issuers with a Section 12-registered equity class, within two business days of each reportable transaction in the issuer's equity securities. The dataset preserves each filing in its original structured form, supporting both machine extraction from XML and visual inspection through the rendered HTML.

Update Frequency
Daily
Updated at
2026-05-09
Earliest Sample Date
1996-01-01
Total Size
28.4 GB
Total Records
9,608,927
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF, XML
Form Types
4, 4/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

364 files · 28.4 GB
Download All
2026-05.zip32.9 MB14,903 records
2026-04.zip49.8 MB23,278 records
2026-02.zip136.0 MB43,909 records
2026-01.zip89.0 MB28,770 records
2025-12.zip84.1 MB27,233 records
2025-11.zip70.4 MB22,357 records
2025-10.zip61.0 MB19,550 records
2025-09.zip64.6 MB20,662 records
2025-08.zip75.5 MB23,524 records
2025-07.zip70.8 MB21,942 records
2025-06.zip103.0 MB33,383 records
2025-05.zip122.0 MB38,931 records
2025-04.zip84.0 MB25,592 records
2025-03.zip140.6 MB44,852 records
2025-02.zip139.0 MB44,750 records
2025-01.zip96.4 MB30,840 records
2024-12.zip92.9 MB27,283 records
2024-11.zip80.1 MB25,852 records
2024-10.zip61.2 MB20,005 records
2024-09.zip73.9 MB19,883 records
2024-08.zip80.4 MB24,734 records
2024-07.zip70.7 MB23,148 records
2024-06.zip102.4 MB33,596 records
2024-05.zip129.5 MB41,684 records
2024-04.zip86.7 MB26,877 records
2024-03.zip150.1 MB46,861 records
2024-02.zip157.6 MB48,332 records
2024-01.zip101.7 MB33,180 records
2023-12.zip89.1 MB27,339 records
2023-11.zip76.3 MB25,051 records
2023-10.zip61.5 MB19,899 records
2023-09.zip66.7 MB19,818 records
2023-08.zip84.7 MB27,342 records
2023-07.zip70.5 MB22,294 records
2023-06.zip115.8 MB37,192 records
2023-05.zip139.3 MB44,078 records
2023-04.zip80.4 MB25,998 records
2023-03.zip152.0 MB49,450 records
2023-02.zip141.7 MB47,309 records
2023-01.zip111.1 MB35,302 records
2022-12.zip78.6 MB26,147 records
2022-11.zip78.8 MB26,474 records
2022-10.zip62.5 MB20,816 records
2022-09.zip62.8 MB20,717 records
2022-08.zip85.5 MB28,753 records
2022-07.zip71.1 MB21,512 records
2022-06.zip112.7 MB36,842 records
2022-05.zip133.3 MB42,354 records
2022-04.zip81.0 MB27,294 records
2022-03.zip151.0 MB50,406 records
2022-02.zip141.1 MB47,382 records
2022-01.zip104.8 MB35,148 records
2021-12.zip90.8 MB30,000 records
2021-11.zip93.9 MB31,089 records
2021-10.zip68.1 MB22,606 records
2021-09.zip70.6 MB23,147 records
2021-08.zip90.7 MB29,369 records
2021-07.zip77.5 MB25,810 records
2021-06.zip114.2 MB38,140 records
2021-05.zip125.0 MB42,049 records
2021-04.zip89.4 MB29,903 records
2021-03.zip158.1 MB52,616 records
2021-02.zip147.1 MB49,157 records
2021-01.zip101.7 MB33,967 records
2020-12.zip91.3 MB30,298 records
2020-11.zip81.7 MB27,326 records
2020-10.zip64.5 MB21,630 records
2020-09.zip62.3 MB20,737 records
2020-08.zip78.6 MB26,546 records
2020-07.zip71.5 MB24,133 records
2020-06.zip100.1 MB33,879 records
2020-05.zip112.3 MB38,430 records
2020-04.zip77.3 MB26,234 records
2020-03.zip143.2 MB48,566 records
2020-02.zip144.9 MB48,902 records
2020-01.zip98.1 MB33,270 records
2019-12.zip76.6 MB25,738 records
2019-11.zip75.4 MB25,438 records
2019-10.zip61.0 MB20,628 records
2019-09.zip59.1 MB19,910 records
2019-08.zip75.3 MB25,502 records
2019-07.zip73.7 MB24,884 records
2019-06.zip89.4 MB30,250 records
2019-05.zip122.6 MB42,163 records
2019-04.zip83.6 MB28,223 records
2019-03.zip131.6 MB43,825 records
2019-02.zip142.6 MB48,179 records
2019-01.zip102.1 MB34,702 records
2018-12.zip79.9 MB26,782 records
2018-11.zip80.0 MB26,864 records
2018-10.zip71.7 MB24,107 records
2018-09.zip61.1 MB20,514 records
2018-08.zip81.9 MB27,754 records
2018-07.zip70.8 MB24,101 records
2018-06.zip93.9 MB31,946 records
2018-05.zip127.1 MB43,629 records
2018-04.zip78.7 MB26,725 records
2018-03.zip130.0 MB43,765 records
2018-02.zip141.4 MB47,787 records
2018-01.zip106.5 MB36,266 records
2017-12.zip78.7 MB26,691 records
2017-11.zip81.0 MB27,377 records
2017-10.zip66.8 MB22,690 records
2017-09.zip60.5 MB20,527 records
2017-08.zip79.1 MB26,885 records
2017-07.zip66.2 MB22,640 records
2017-06.zip94.4 MB32,370 records
2017-05.zip128.7 MB44,241 records
2017-04.zip78.1 MB26,645 records
2017-03.zip135.2 MB45,579 records
2017-02.zip145.9 MB49,347 records
2017-01.zip105.9 MB36,087 records
2016-12.zip82.8 MB28,041 records
2016-11.zip88.5 MB29,891 records
2016-10.zip62.7 MB21,378 records
2016-09.zip65.3 MB21,952 records
2016-08.zip86.6 MB29,338 records
2016-07.zip69.8 MB23,777 records
2016-06.zip91.0 MB31,133 records
2016-05.zip126.0 MB43,440 records
2016-04.zip81.0 MB27,698 records
2016-03.zip126.5 MB42,965 records
2016-02.zip147.4 MB50,089 records
2016-01.zip102.4 MB35,145 records
2015-12.zip89.1 MB30,211 records
2015-11.zip84.8 MB28,828 records
2015-10.zip68.9 MB23,410 records
2015-09.zip63.4 MB21,649 records
2015-08.zip87.4 MB29,872 records
2015-07.zip80.6 MB27,410 records
2015-06.zip100.7 MB34,491 records
2015-05.zip126.7 MB43,569 records
2015-04.zip88.8 MB30,386 records
2015-03.zip138.9 MB47,138 records
2015-02.zip152.0 MB51,403 records
2015-01.zip109.3 MB37,339 records
2014-12.zip100.1 MB33,872 records
2014-11.zip88.9 MB30,106 records
2014-10.zip73.4 MB24,908 records
2014-09.zip68.3 MB23,234 records
2014-08.zip84.7 MB28,831 records
2014-07.zip79.7 MB27,243 records
2014-06.zip94.7 MB32,420 records
2014-05.zip129.4 MB44,652 records
2014-04.zip88.5 MB30,349 records
2014-03.zip128.3 MB43,535 records
2014-02.zip149.0 MB50,681 records
2014-01.zip114.4 MB39,222 records
2013-12.zip89.8 MB30,546 records
2013-11.zip92.7 MB31,427 records
2013-10.zip77.3 MB26,221 records
2013-09.zip69.3 MB23,516 records
2013-08.zip89.3 MB30,388 records
2013-07.zip85.8 MB29,266 records
2013-06.zip86.9 MB29,731 records
2013-05.zip142.9 MB49,149 records
2013-04.zip86.5 MB29,711 records
2013-03.zip127.3 MB43,322 records
2013-02.zip138.1 MB46,943 records
2013-01.zip112.3 MB38,678 records
2012-12.zip103.7 MB35,366 records
2012-11.zip92.8 MB31,637 records
2012-10.zip72.1 MB24,624 records
2012-09.zip66.4 MB22,683 records
2012-08.zip90.7 MB30,977 records
2012-07.zip78.6 MB26,953 records
2012-06.zip85.9 MB29,816 records
2012-05.zip130.7 MB45,265 records
2012-04.zip86.2 MB29,591 records
2012-03.zip123.9 MB42,341 records
2012-02.zip147.4 MB50,059 records
2012-01.zip113.2 MB38,931 records
2011-12.zip86.3 MB29,578 records
2011-11.zip84.6 MB29,031 records
2011-10.zip68.0 MB23,361 records
2011-09.zip63.6 MB21,967 records
2011-08.zip89.2 MB31,012 records
2011-07.zip77.5 MB26,569 records
2011-06.zip89.0 MB30,800 records
2011-05.zip139.0 MB48,154 records
2011-04.zip90.7 MB31,198 records
2011-03.zip131.3 MB44,955 records
2011-02.zip137.7 MB46,791 records
2011-01.zip116.2 MB40,012 records
2010-12.zip103.9 MB35,470 records
2010-11.zip97.8 MB33,441 records
2010-10.zip75.0 MB25,720 records
2010-09.zip69.9 MB24,107 records
2010-08.zip86.6 MB29,877 records
2010-07.zip79.8 MB27,490 records
2010-06.zip91.4 MB31,758 records
2010-05.zip127.0 MB44,053 records
2010-04.zip99.4 MB34,205 records
2010-03.zip135.6 MB46,508 records
2010-02.zip127.8 MB43,730 records
2010-01.zip107.9 MB37,310 records
2009-12.zip95.2 MB32,764 records
2009-11.zip88.0 MB30,267 records
2009-10.zip73.8 MB25,445 records
2009-09.zip70.5 MB24,312 records
2009-08.zip86.3 MB29,909 records
2009-07.zip82.3 MB28,549 records
2009-06.zip87.4 MB30,573 records
2009-05.zip118.0 MB41,330 records
2009-04.zip85.2 MB29,639 records
2009-03.zip121.5 MB42,390 records
2009-02.zip126.2 MB43,602 records
2009-01.zip109.8 MB38,190 records
2008-12.zip99.6 MB34,510 records
2008-11.zip97.6 MB33,929 records
2008-10.zip88.4 MB30,722 records
2008-09.zip79.0 MB27,354 records
2008-08.zip99.4 MB34,307 records
2008-07.zip100.1 MB34,903 records
2008-06.zip104.9 MB36,400 records
2008-05.zip153.7 MB53,406 records
2008-04.zip112.3 MB38,732 records
2008-03.zip129.7 MB44,876 records
2008-02.zip149.7 MB51,488 records
2008-01.zip134.7 MB46,745 records
2007-12.zip111.9 MB38,540 records
2007-11.zip120.3 MB41,392 records
2007-10.zip105.2 MB35,980 records
2007-09.zip79.3 MB27,264 records
2007-08.zip126.6 MB43,586 records
2007-07.zip106.5 MB36,839 records
2007-06.zip109.4 MB37,957 records
2007-05.zip166.2 MB57,718 records
2007-04.zip111.3 MB38,597 records
2007-03.zip129.2 MB44,644 records
2007-02.zip152.9 MB52,499 records
2007-01.zip143.0 MB49,361 records
2006-12.zip112.2 MB38,643 records
2006-11.zip116.0 MB39,982 records
2006-10.zip98.4 MB34,117 records
2006-09.zip78.8 MB27,375 records
2006-08.zip110.3 MB38,505 records
2006-07.zip95.5 MB33,740 records
2006-06.zip107.9 MB37,962 records
2006-05.zip158.7 MB55,352 records
2006-04.zip111.8 MB38,895 records
2006-03.zip138.5 MB47,886 records
2006-02.zip145.2 MB50,007 records
2006-01.zip144.9 MB50,236 records
2005-12.zip121.2 MB42,032 records
2005-11.zip113.6 MB39,438 records
2005-10.zip93.5 MB32,613 records
2005-09.zip89.8 MB31,261 records
2005-08.zip119.5 MB41,478 records
2005-07.zip107.3 MB37,521 records
2005-06.zip112.2 MB39,262 records
2005-05.zip148.5 MB52,056 records
2005-04.zip111.4 MB38,873 records
2005-03.zip131.5 MB45,594 records
2005-02.zip141.3 MB48,739 records
2005-01.zip136.2 MB47,377 records
2004-12.zip127.5 MB44,256 records
2004-11.zip125.3 MB43,417 records
2004-10.zip94.6 MB32,842 records
2004-09.zip85.3 MB29,743 records
2004-08.zip104.2 MB36,382 records
2004-07.zip102.7 MB35,778 records
2004-06.zip111.7 MB39,030 records
2004-05.zip138.4 MB48,667 records
2004-04.zip126.3 MB44,060 records
2004-03.zip131.0 MB45,586 records
2004-02.zip152.2 MB52,799 records
2004-01.zip143.7 MB50,281 records
2003-12.zip125.7 MB43,943 records
2003-11.zip112.9 MB39,417 records
2003-10.zip108.9 MB38,020 records
2003-09.zip94.5 MB33,292 records
2003-08.zip110.1 MB39,058 records
2003-07.zip104.4 MB38,109 records
2003-06.zip62.3 MB22,661 records
2003-05.zip59.0 MB20,795 records
2003-04.zip42.5 MB8,958 records
2003-03.zip35.6 MB8,085 records
2003-02.zip43.8 MB9,006 records
2003-01.zip41.9 MB8,934 records
2002-12.zip33.8 MB6,896 records
2002-11.zip25.3 MB5,563 records
2002-10.zip25.3 MB5,179 records
2002-09.zip14.8 MB3,714 records
2002-08.zip7.1 MB2,021 records
2002-07.zip3.7 MB1,008 records
2002-06.zip4.9 MB1,367 records
2002-05.zip4.1 MB1,166 records
2002-04.zip4.1 MB1,194 records
2002-03.zip3.9 MB1,137 records
2002-02.zip3.9 MB1,123 records
2002-01.zip3.6 MB1,051 records
2001-12.zip3.1 MB922 records
2001-11.zip2.8 MB807 records
2001-10.zip2.7 MB788 records
2001-09.zip3.2 MB930 records
2001-08.zip3.2 MB942 records
2001-07.zip3.1 MB909 records
2001-06.zip4.1 MB1,222 records
2001-05.zip3.2 MB928 records
2001-04.zip3.5 MB1,028 records
2001-03.zip3.1 MB932 records
2001-02.zip3.9 MB1,157 records
2001-01.zip4.2 MB1,267 records
2000-12.zip3.1 MB940 records
2000-11.zip3.1 MB967 records
2000-10.zip2.9 MB903 records
2000-09.zip3.1 MB975 records
2000-08.zip2.9 MB933 records
2000-07.zip2.8 MB877 records
2000-06.zip3.4 MB1,091 records
2000-05.zip3.2 MB1,065 records
2000-04.zip3.2 MB1,042 records
2000-03.zip3.6 MB1,159 records
2000-02.zip3.5 MB1,118 records
2000-01.zip3.3 MB1,094 records
1999-12.zip3.2 MB921 records
1999-11.zip2.5 MB831 records
1999-10.zip2.5 MB822 records
1999-09.zip2.5 MB820 records
1999-08.zip2.4 MB758 records
1999-07.zip2.2 MB721 records
1999-06.zip2.8 MB929 records
1999-05.zip2.9 MB974 records
1999-04.zip3.0 MB983 records
1999-03.zip2.9 MB951 records
1999-02.zip2.8 MB913 records
1999-01.zip3.0 MB996 records
1998-12.zip2.6 MB857 records
1998-11.zip2.5 MB867 records
1998-10.zip2.1 MB714 records
1998-09.zip2.8 MB961 records
1998-08.zip2.6 MB874 records
1998-07.zip2.2 MB709 records
1998-06.zip2.5 MB837 records
1998-05.zip2.2 MB760 records
1998-04.zip2.2 MB755 records
1998-03.zip2.1 MB715 records
1998-02.zip2.3 MB784 records
1998-01.zip2.7 MB923 records
1997-12.zip1.9 MB634 records
1997-11.zip1.8 MB649 records
1997-10.zip1.7 MB599 records
1997-09.zip1.6 MB564 records
1997-08.zip1.7 MB600 records
1997-07.zip1.7 MB620 records
1997-06.zip1.8 MB642 records
1997-05.zip1.4 MB490 records
1997-04.zip1.3 MB461 records
1997-03.zip1.3 MB470 records
1997-02.zip1.5 MB527 records
1997-01.zip1.4 MB514 records
1996-12.zip1.1 MB398 records
1996-11.zip1.0 MB360 records
1996-10.zip745.7 KB280 records
1996-09.zip763.7 KB305 records
1996-08.zip846.3 KB336 records
1996-07.zip506.8 KB191 records
1996-06.zip623.1 KB232 records
1996-05.zip315.3 KB114 records
1996-04.zip112.2 KB37 records
1996-03.zip132.3 KB52 records
1996-02.zip84.3 KB26 records
1996-01.zip54.8 KB17 records

What This Dataset Contains

The Form 4 Files Dataset is built from Form 4 and Form 4/A filings as submitted to EDGAR. Form 4 is the Statement of Changes in Beneficial Ownership of Securities, required under Section 16(a) of the Securities Exchange Act of 1934. It discloses transactions in an issuer's equity securities by corporate insiders: officers, directors, and beneficial owners of more than ten percent of a registered equity class. Form 4/A is an amendment that fully restates the original Form 4 disclosure.

The dataset spans all Form 4 and Form 4/A filings from January 1996 to the present. Filings before mid-2003 represent voluntary electronic submissions and do not capture all Form 4 filers from that period; from mid-2003 onward — when the SEC mandated electronic filing — coverage is essentially complete. The dataset is distributed as monthly ZIP containers, each containing one folder per filing. Each folder is named by the zero-padded, dash-stripped accession number (e.g., accession 0001185185-25-001587 becomes folder 000118518525001587).

Content Structure of a Single Record

A single record in the Form 4 Files Dataset corresponds to one EDGAR submission of Form 4 or Form 4/A, identified by its accession number. Each record folder contains a metadata.json file with structured filing-index data, the primary XML ownership document, a pre-rendered HTML view of that document in an xslF345X05/ subdirectory, and occasionally one or more exhibit files (most commonly a power-of-attorney under Exhibit 24). Image files referenced in the original EDGAR submission are excluded. The record unit is the complete textual content of a single Form 4 or Form 4/A filing as submitted to EDGAR, minus graphic attachments.

Files within a single record

Each record folder contains three to five files:

Always present:

  • metadata.json — filing-level metadata drawn from the EDGAR filing index: accession number, form type, filing timestamp, period of report, entity identifiers, document inventory, and SEC.gov links.
  • A primary XML file (variable filename, e.g., form4.xml, ownership.xml, primary_doc.xml, primarydocument.xml, rdgdoc.xml, edgardoc.xml, marketforms-*.xml, wk-form4_*.xml, tm*_4seq1.xml, fp*_4.xml, or other filing-agent-specific names) — the machine-readable ownership document conforming to the SEC's ownershipDocument XML schema.
  • xslF345X05/<same-filename>.xml — a pre-rendered HTML file produced by applying the SEC's F345X05 XSL stylesheet to the XML. Despite retaining a .xml extension, this file contains HTML markup (with a <!DOCTYPE html> declaration) and replicates the tabular layout of the official printed Form 4.

Occasionally present (roughly 7% of filings):

  • One or more Exhibit 24 (EX-24) files in .htm, .html, or .txt format, containing a limited power of attorney authorizing a named individual to execute and file Section 16 forms on the reporting person's behalf. These files are wrapped in the SEC's SGML <DOCUMENT> envelope carrying <TYPE>, <SEQUENCE>, <FILENAME>, and optionally <DESCRIPTION> headers before the <TEXT> block with the exhibit content.

Excluded:

  • GRAPHIC-type files (.jpg, .gif, .png) referenced in the EDGAR submission — typically signature images, letterheads, or notary seals embedded in power-of-attorney exhibits — appear in metadata.json document listings but are not included in the dataset.

Metadata file: metadata.json

The metadata.json file provides the filing-level index record. Key fields:

Top-level scalars. formType"4" or "4/A". accessionNo — the SEC accession number in dash-delimited form. id — a 32-character hexadecimal identifier. filedAt — ISO 8601 timestamp with timezone offset recording when the filing was accepted by EDGAR. periodOfReport — the transaction date or report period in YYYY-MM-DD format. description — human-readable filing description; amendments typically include "[Amend]". linkToFilingDetails — URL to the XSL-rendered view on SEC.gov. linkToTxt — URL to the full-text SGML submission file. linkToHtml — URL to the EDGAR filing index page. linkToXbrl — always an empty string, since Form 4 is not filed in XBRL.

Entities array. The entities array typically contains two objects: the issuer and the reporting owner, distinguished by the "(Issuer)" or "(Reporting)" suffix on companyName. Element ordering within the array is not guaranteed. The issuer entity carries cik, sic (SIC code with industry description), irsNo, stateOfIncorporation, fiscalYearEnd (in MMDD format), and a tickers array listing trading symbols (absent or empty for issuers without listed equity). The reporting-owner entity carries cik, act (typically "34" for the Exchange Act), fileNo, type (form type), and filmNo.

Document inventory. The documentFormatFiles array lists all documents in the submission. Each entry includes sequence, size (in bytes; may be a blank string for the XSL-rendered view), documentUrl, type (e.g., "4", "EX-24", "GRAPHIC"), and description. GRAPHIC entries appear in this inventory even though the image files are excluded from the dataset. The complete submission text file appears as an entry with blank sequence and type. The dataFiles and seriesAndClassesContractsInformation arrays are always empty for Form 4 filings.

Primary XML document: the ownership document

The core of each Form 4 Files Dataset record is the XML ownership document, conforming to the SEC's ownershipDocument schema (version X0508 in current filings). It contains all substantive Form 4 disclosure, organized into the following sections in document order:

Schema version and document header. schemaVersion identifies the XML schema version. documentType is 4 or 4/A. periodOfReport states the transaction date. notSubjectToSection16 is a boolean flag (0 or 1) indicating whether the reporting person claims exemption from Section 16. These elements appear at the top of the document, before the issuer block.

Issuer block. The issuer element contains issuerCik (zero-padded CIK), issuerName, and issuerTradingSymbol. The trading symbol may be absent, empty, or contain placeholder text for issuers without a listed ticker.

Reporting owner block. The reportingOwner element nests three sub-blocks. reportingOwnerId contains rptOwnerCik and rptOwnerName. reportingOwnerAddress provides street lines, city, state, zip code, and an optional rptOwnerStateDescription (used for foreign addresses). reportingOwnerRelationship contains four boolean flags — isDirector, isOfficer, isTenPercentOwner, isOther — plus officerTitle when the person is an officer (e.g., "Chairman, CEO, and Secretary") and otherText when isOther is set. Multiple reportingOwner blocks appear in joint filings, though these are uncommon on Form 4.

Rule 10b5-1 flag. The aff10b5One element (0 or 1) indicates whether reported transactions were effected pursuant to a Rule 10b5-1(c) trading plan. This element appears between the reporting-owner block and the transaction tables. It was introduced in 2023 and is absent in older filings.

Non-derivative table. The nonDerivativeTable contains zero or more nonDerivativeTransaction and nonDerivativeHolding entries. Each transaction includes: securityTitle (e.g., "Common Stock"), transactionDate, optional deemedExecutionDate, transactionCoding (with transactionFormType, transactionCode, and equitySwapInvolved), optional transactionTimeliness, transactionAmounts (with transactionShares, transactionPricePerShare, and transactionAcquiredDisposedCodeA for acquired, D for disposed), postTransactionAmounts (with sharesOwnedFollowingTransaction), and ownershipNature (with directOrIndirectOwnershipD for direct, I for indirect — and optionally natureOfOwnership describing the indirect arrangement). Standard transaction codes include P (purchase), S (sale), A (grant/award), M (exercise/conversion of derivative), F (tax-withholding disposition), G (gift), J (other), C (conversion), among others. Holding entries report positions without an associated transaction and lack transaction-specific fields. Individual value elements may carry both a <value> child and one or more <footnoteId> references.

Derivative table. The derivativeTable contains zero or more derivativeTransaction and derivativeHolding entries. These carry the same transactional fields as non-derivative entries plus derivative-specific elements: conversionOrExercisePrice, exerciseDate, expirationDate, and an underlyingSecurity block (with underlyingSecurityTitle and underlyingSecurityShares). Derivative fields frequently contain only a <footnoteId> reference instead of a literal <value>, particularly for exercise dates, expiration dates, and conversion prices that depend on vesting schedules or plan terms.

Footnotes. The footnotes element contains footnote elements keyed by id attributes (F1, F2, F3, etc.). These provide narrative explanations referenced throughout the transaction and holding entries — describing vesting conditions, indirect ownership arrangements, plan-based transaction details, price computation methods, tax-withholding mechanics, and other qualifications. Footnotes are a critical interpretive layer: many derivative-table fields contain only a footnote reference rather than a literal value, making footnote resolution essential for meaningful data extraction.

Owner signature. The ownerSignature element contains signatureName (typically /s/ FIRSTNAME LASTNAME, or with "by" attribution when signed under power of attorney) and signatureDate. Multiple signature blocks appear when multiple reporting owners are listed.

Remarks. An optional remarks element may contain free-text commentary; it is rarely populated.

XSL-rendered HTML view

The xslF345X05/ subdirectory holds a copy of the primary document that has been transformed through the SEC's F345X05 XSL stylesheet into a complete HTML page. The file retains the original .xml filename but contains HTML content. It presents the Form 4 data in the standard SEC tabular layout: a header block with issuer and reporting-person identification, Table I (Non-Derivative Securities Acquired, Disposed of, or Beneficially Owned), Table II (Derivative Securities Acquired, Disposed of, or Beneficially Owned), footnotes, and the signature block. The rendered view is useful for visual inspection but contains no data beyond what the raw XML provides.

Exhibit files

Approximately 7% of Form 4 filings include Exhibit 24 (EX-24) attachments — limited powers of attorney authorizing a named individual to execute and file Section 16 forms on the reporting person's behalf. The exhibit content is typically a short legal instrument naming the grantor, the attorney-in-fact, the scope of authority (limited to Section 16 filings), the grantor's signature, and in many cases a notarial acknowledgment with notary signature and commission expiration date. Powers of attorney may cover a single filing or authorize ongoing filings for a stated period.

Each exhibit file is wrapped in the SEC's SGML <DOCUMENT> envelope with <TYPE>, <SEQUENCE>, <FILENAME>, and <TEXT> tags. The exhibit body within <TEXT> is usually HTML or plain text.

Other exhibit types are rare on Form 4. The dataset excludes GRAPHIC-type attachments (signature images, letterheads, notary seals) that occasionally accompany power-of-attorney exhibits.

Historical content and format changes

Pre-XML era (1996 to mid-2003). Form 4 was originally filed as flat text or HTML without structured XML. Filings from 1996 through approximately June 2003 are plain-text or HTML submissions presenting ownership-change data in unstructured or semi-structured tabular formats. These filings lack the ownershipDocument XML schema and require text or HTML parsing for field-level extraction.

Mandatory XML schema (June 30, 2003 onward). The SEC's mandate for electronic filing of Section 16 forms introduced the ownershipDocument XML schema. From this date forward, Form 4 filings are structured XML, enabling direct machine extraction of transaction data, security titles, share counts, prices, ownership types, and relationship flags. The schema has been revised incrementally through versions X0306, X0407, and X0508, each adding or refining elements.

Rule 10b5-1 disclosure (2023). Amendments to Section 16 rules effective in 2023, implementing the SEC's December 2022 rulemaking on insider trading arrangements, added the aff10b5One element. Filings before this rule change do not contain the element.

XSL stylesheet versions. The stylesheet identifier in the filing path has changed across schema versions (xslF345X02, xslF345X03, xslF345X05, etc.), with xslF345X05 current. Each version corresponds to layout refinements in the rendered HTML.

Filing-agent naming conventions. The primary XML filename varies by filing agent and era. Common patterns include form4.xml, ownership.xml, primary_doc.xml, primarydocument.xml, rdgdoc.xml, edgardoc.xml, marketforms-*.xml, wk-form4_*.xml (Workiva), tm*_4seq1.xml (Toppan Merrill), and fp*_4.xml (FilingPoint). This variation is cosmetic and does not affect XML content or schema conformance.

Interpretation and extraction notes

Amendments are full restatements. A Form 4/A replaces the entire original disclosure, not just the changed fields. The dataset stores each amendment as a separate record with its own accession number. The amendment XML does not contain a pointer to the original filing's accession number; linking amendments to originals requires matching on issuer CIK, reporting-owner CIK, and period of report.

Footnote-dependent fields. Many fields in the derivative table — particularly conversionOrExercisePrice, exerciseDate, and expirationDate — contain only a <footnoteId> element without a <value>. Non-derivative fields such as transactionShares, transactionPricePerShare, and sharesOwnedFollowingTransaction may also carry supplementary footnote references alongside their literal values. Extracting complete data from derivative transactions requires resolving footnote references to their narrative text.

Multiple transactions per filing. A single Form 4 may report multiple transactions across both tables. Each transaction is a separate XML element within its table. Transactions within one filing may span different dates, different security titles, and different transaction codes.

Direct versus indirect ownership. The directOrIndirectOwnership field distinguishes direct holdings (D) from indirect holdings (I). Indirect holdings include shares held through a spouse, trust, partnership, or other entity, with natureOfOwnership describing the arrangement. A single filing may report both direct and indirect positions in the same security as separate transaction or holding lines.

Ticker availability. The tickers array in issuer metadata and issuerTradingSymbol in the XML are populated when the issuer has listed equity. For issuers without a listed class — certain investment funds, private issuers with registered debt, or delisting-stage issuers — these fields may be absent, empty, or contain placeholder text such as "N/A" or "None". The tickers array may be omitted entirely from the issuer entity in metadata.json rather than appearing as an empty array.

File sizes. Primary XML files are typically 2 to 10 KB. Metadata JSON files run 1 to 3 KB. Power-of-attorney exhibits range from 3 to 8 KB. The XSL-rendered HTML views are similar in size to the primary XML. Records are lightweight; aggregate dataset size reflects the volume of filings rather than per-record bulk.

Who Files or Publishes This Dataset, and When

Who files

The reporting person — not the issuer — files Form 4 on EDGAR. The obligation arises under Section 16(a) of the Securities Exchange Act of 1934 and applies to three classes of corporate insiders of issuers with a Section 12-registered equity class:

Officers. As defined in Rule 16a-1(f): the president, principal financial officer, principal accounting officer, any vice president heading a principal business unit or function, and any other officer performing a policy-making function. This is narrower than general corporate usage; not every person with an officer title is a Section 16 reporting person.

Directors. Every board member, including directors of a corporate general partner of a limited partnership issuer or of a managing member entity.

Ten-percent beneficial owners. Any person — natural or legal — beneficially owning more than ten percent of any Section 12-registered equity class. Beneficial ownership for threshold purposes is measured under Section 13(d). Groups acting together may trigger the threshold on aggregate holdings.

In practice, filings are often prepared by the issuer's counsel or a filing agent under power of attorney, but the legal obligation remains the reporting person's. Each filing is indexed on EDGAR under the reporting person's CIK, with the issuer identified by its own CIK, name, and ticker within the document.

Issuer scope

Form 4 applies only to securities of issuers with a Section 12-registered equity class — primarily companies listed on NYSE, Nasdaq, or other national securities exchanges, plus companies registered under Section 12(g). The issuer is identified in the filing but is never the filer.

Excluded from the Form 4 regime:

  • Foreign private issuers are exempt from Section 16 under Rule 3a12-3; their insiders do not file Form 4.
  • Section 15(d)-only reporters with no Section 12-registered equity class are outside Section 16.
  • Issuers of debt securities only do not generate Form 4 filings.

What triggers a filing

Form 4 is event-driven, not periodic. A filing is triggered by any change in the reporting person's beneficial ownership of the issuer's equity securities, including:

  • Open-market purchases and sales
  • Trades under Rule 10b5-1 plans
  • Option, warrant, or conversion exercises
  • Equity compensation grants, awards, and vestings (stock options, RSUs, restricted stock, performance shares) that change the ownership position
  • Gifts and inheritances that transfer beneficial ownership
  • Dispositions to the issuer (e.g., shares withheld for tax on vesting)
  • Acquisitions or dispositions of derivative securities (options, SARs, convertibles)
  • Changes resulting from mergers, tender offers, or similar corporate events

Transactions exempt from short-swing profit liability under Section 16(b) — such as Rule 16b-3 employee benefit plan acquisitions — still require Form 4 reporting. The Sarbanes-Oxley Act of 2002 moved most formerly deferrable transactions from Form 5 to Form 4; only a narrow residual category (certain small acquisitions, certain exempt transactions) remains eligible for deferred annual reporting on Form 5.

Deadline

Form 4 must be filed before the end of the second business day after the transaction date (trade date, not settlement date). This accelerated deadline took effect under the Sarbanes-Oxley Act, replacing the former tenth-of-the-following-month rule.

Multiple same-day transactions in one issuer's securities are typically reported on a single filing. Transactions on different dates or in different issuers require separate filings.

Form 4 is part of the Section 16 form series:

  • Form 3 — initial ownership statement, due within ten days of becoming an officer, director, or ten-percent owner.
  • Form 4 — statement of changes, filed per transaction as described above.
  • Form 5 — annual catch-up for deferred transactions, due within 45 days after the issuer's fiscal year-end. Rare since Sarbanes-Oxley narrowed deferral eligibility.

This dataset covers only Form 4 and Form 4/A (amendment) filings.

Amendments

Form 4/A corrects a prior Form 4 — fixing transaction dates, share amounts, prices, or ownership figures. Each amendment contains the complete corrected disclosure, not just changed fields. Both originals and amendments appear in the dataset as separate records.

Exclusions from the filing population

  • Foreign private issuers and their insiders — exempt under Rule 3a12-3.
  • 13F filers (institutional investment managers) — separate regime under Section 13(f), reporting quarterly portfolio holdings.
  • Schedule 13D/13G filers — separate regime under Section 13(d)/(g) for five-percent owners. A person can be both a 13D/13G filer and a Form 4 filer, but the obligations are independent.
  • The issuer itself — identified in the filing but never the filer.
  • Insiders of Section 15(d)-only reporters without a Section 12-registered equity class.

How This Dataset Differs From Similar Datasets or Filings

Form 4 sits within the SEC's ownership-disclosure regime alongside several overlapping filing types. The closest comparisons are Forms 3 and 5 (the other Section 16 forms), Schedules 13D/13G (large beneficial ownership), Form 144 (notice of proposed sale), and structured transaction extracts derived from Form 4 XML itself.

Form 3: Initial Statement of Beneficial Ownership

Form 3 shares Form 4's filer population (officers, directors, ten-percent holders), XML schema, and document structure. The sole difference is trigger: Form 3 is filed once, within ten days of becoming a Section 16 insider, and reports only existing holdings at that moment — no transactions. Form 4 is filed within two business days of each subsequent reportable transaction. Form 3 establishes the baseline; Form 4 records every change after it. Form 3 filings are sparse (one per insider-issuer relationship), while a single insider may generate dozens of Form 4 filings over time.

Form 5: Annual Statement of Changes in Beneficial Ownership

Form 5 uses the same XML schema and filer population as Forms 3 and 4. It captures transactions eligible for deferred reporting — small acquisitions, gifts, inheritance transfers, certain benefit-plan transactions — filed within 45 days of the issuer's fiscal year-end.

Since the SEC shortened the Form 4 deadline to two business days in 2002 and narrowed deferral eligibility, Form 5 volume has dropped substantially. Form 4 now captures nearly all insider transactions in near-real time. Relying solely on the Form 4 dataset misses only a small residual of deferred items that appear exclusively on Form 5.

Schedules 13D and 13G: Beneficial Ownership Above Five Percent

Schedules 13D/13G overlap with Form 4 only for the narrow subset of filers who are both Section 16 insiders and holders above five percent. The two regimes differ on every major dimension:

  • Filer population. Form 4 covers officers, directors, and ten-percent holders. Schedules 13D/13G cover any person or group crossing five percent, including activist investors, passive funds, and institutional holders with no Section 16 status. Most Form 4 filers hold well under five percent; many 13D/13G filers never file Form 4.
  • Content. Form 4 discloses individual transactions with dates, share counts, prices, and transaction codes. Schedule 13D is a narrative disclosure covering acquisition purpose, funding sources, and plans regarding the issuer. Schedule 13G is a shorter tabular variant for passive/institutional holders. Neither reports transaction-level detail.
  • Cadence. Form 4 is event-driven with a two-business-day deadline. Schedule 13D requires filing within ten days of crossing five percent, with amendments "promptly" after material changes. Schedule 13G follows annual amendment deadlines with interim amendments only at certain thresholds.

Form 4 delivers high-frequency transactional granularity; Schedules 13D/13G deliver ownership-level snapshots with qualitative context about intent and control.

Form 144: Notice of Proposed Sale of Securities

Form 144 is filed by affiliates and insiders intending to sell restricted or control securities under Rule 144. Many planned insider sales trigger both a Form 144 (filed at or before the sale) and a Form 4 (filed within two business days of execution). The key distinction is temporal: Form 144 is a forward-looking notice of intent; Form 4 is a backward-looking report of a completed transaction. Form 144 also covers restricted-security sales by affiliates who may lack Section 16 status and thus never file Form 4. Form 4 is the more reliable record of actual transactions; Form 144 signals planned but not necessarily executed dispositions.

Structured transaction extracts from Form 4 XML

Some data providers parse Form 4 XML into flat tabular records (insider name, ticker, transaction date, code, shares, price). The Form 4 Files Dataset differs by preserving the complete filing package: raw XML, metadata JSON, rendered HTML, and exhibits such as powers of attorney. This retains footnotes, derivative-table structures, multi-transaction filing context, amendment chains, and full reporting-owner metadata. Tabular extracts are more convenient for quantitative screening; the full-file dataset is necessary when footnote text, exhibit content, or original document structure matters.

Who Uses This Dataset

The Form 4 Files Dataset serves professionals who track, analyze, or act on insider activity across the full spectrum of Section 16 reporting persons.

Quantitative researchers and systematic trading teams

Quant teams use Form 4 data as a signal source. They extract transaction dates, transaction codes (open-market purchases, dispositions, option exercises), share volumes, and prices to build insider-sentiment indicators at the issuer, sector, or market level. The reporting person's role — CEO, CFO, director, or large holder — drives signal weighting, since C-suite purchases are typically treated as more informative than routine option exercises. The full 1996-to-present history and structured XML support clean backtesting with minimal manual cleanup.

Fundamental equity analysts

Equity analysts at asset managers and sell-side firms monitor Form 4 filings for companies they cover. A cluster of insider purchases by multiple officers reinforces a bullish thesis; large CEO or CFO dispositions near earnings dates prompt scrutiny. Key fields: transaction type, dollar size relative to the insider's existing holdings, and post-transaction ownership totals. These feed into investment notes, earnings previews, and recommendation changes.

Event-driven and activist-monitoring analysts

Event-driven analysts use the dataset to detect early accumulation or disposition by ten-percent holders. A series of open-market purchases disclosed on Form 4 may precede a 13D filing or activist campaign. They track the reporting person's identity, cumulative ownership changes over rolling windows, and the issuer's share count to estimate evolving stakes. Historical depth supports pattern recognition across prior contested situations.

Securities lawyers and compliance counsel

Disclosure counsel review Section 16 filing patterns for clients and counterparties. They verify two-business-day filing deadlines, check whether amendments corrected material errors, and assess consistency with insider trading policies and Rule 10b5-1 plans. In enforcement defense or internal investigations, lawyers reconstruct full trading timelines by pulling every Form 4 for a reporting person or issuer. Transaction codes, footnotes, and amendment history are the critical fields.

Corporate governance analysts

Governance teams at institutional asset managers and proxy advisory firms examine insider transaction patterns to assess board and management alignment with shareholders. They focus on the relationship field (officer, director, or large holder), transaction direction and size, and post-transaction holdings to gauge ongoing equity exposure. The analysis feeds proxy voting recommendations, engagement agendas, and governance scoring models.

Compliance and market-surveillance teams

Broker-dealer compliance departments and exchange surveillance units cross-reference Form 4 filings against unusual trading activity. When a suspicious price move occurs, they check transaction dates, codes, and reporting-person roles for any insider trades within the relevant window. The dataset also supports routine monitoring for late filings and Section 16(a) noncompliance patterns.

Financial data engineers

Data engineering teams at financial data vendors and institutional research platforms ingest the full dataset to build structured insider-transaction databases. They parse XML submissions to extract transaction tables, reporting-person metadata, issuer identifiers (CIK, ticker), and footnote text, then normalize and deduplicate across originals and amendments. The resulting cleaned data powers screening tools, alerting systems, and API endpoints consumed by analysts and compliance teams.

Forensic accountants and fraud investigators

Forensic teams reconstruct insider trading timelines during fraud investigations, SEC enforcement matters, and shareholder litigation. They look for abnormal selling ahead of negative earnings surprises, restatements, or regulatory actions. Transaction price, date, and volume fields enable precise calculation of insider profits or avoided losses. Amendments (Form 4/A) are especially relevant, since corrections to previously reported transactions can indicate disclosure problems.

M&A and due diligence teams

Diligence teams review Form 4 filings for a target company's insiders to identify unusual transaction activity before deal announcement. They also verify timeliness and accuracy of Section 16 reporting — late or amended filings can signal weak internal controls. Post-transaction holdings help estimate management equity stakes relevant to rollover and retention negotiations.

Investor relations teams

IR professionals at public companies monitor their own insiders' filings for accuracy and timeliness, and review peer-company insider activity to anticipate market interpretation. Before earnings calls or investor meetings, they track recent insider sales likely to draw analyst questions, focusing on transaction type, timing, size, and any footnote indicating a 10b5-1 plan.

Academic researchers

Finance and economics researchers use nearly three decades of insider transaction records to study information asymmetry, market efficiency, executive compensation, and governance. Structured fields — dates, prices, volumes, transaction codes, and reporting-person roles — support panel datasets, event studies, and natural-experiment designs around Section 16 regulatory changes.

LLM and RAG developers

Teams building language-model applications for financial use cases use the dataset as a structured-text corpus for training, fine-tuning, and retrieval pipelines. Filings combine tabular transaction data with free-text footnotes, supporting entity extraction, transaction classification, and question-answering over insider-activity records. The mix of XML, HTML, and plain-text formats across millions of filings provides both structured grounding fields and unstructured text for model development.

Specific Use Cases

Detecting insider buying clusters as conviction signals

Equity analysts and quant teams screen for multiple officers or directors purchasing shares of the same issuer within a short window. The reportingOwnerRelationship flags (isDirector, isOfficer, officerTitle) identify each buyer's role, while transactionCode isolates open-market purchases (P) from grants and option exercises. Aggregating transactionShares and transactionPricePerShare across filings for a single issuer CIK over rolling periods produces a cluster score that feeds buy-side conviction models or systematic trading signals.

Building insider-sentiment backtests from structured XML history

Quantitative researchers construct multi-decade panel datasets by parsing the nonDerivativeTransaction and derivativeTransaction elements from XML filings back to the June 2003 mandatory-XML cutoff. Transaction codes, share volumes, prices, and postTransactionAmounts supply the core signal features, while the reportingOwnerRelationship fields enable weighting by insider seniority. The periodOfReport date anchors each observation, and the aff10b5One flag (available from 2023 onward) allows researchers to control for planned-trade effects in recent data.

Monitoring Section 16 filing compliance and amendment patterns

Securities lawyers and compliance teams verify whether filings meet the two-business-day deadline by comparing transactionDate in the XML against filedAt in metadata.json. Amendment records (Form 4/A) are identified by formType and linked to originals by matching on issuer CIK, reporting-owner CIK, and periodOfReport. Patterns of late filings or frequent amendments for a particular reporting person or issuer flag potential Section 16(a) noncompliance or weak internal controls during due diligence reviews.

Tracking early accumulation by large holders before activist campaigns

Event-driven analysts monitor ten-percent owners by filtering on the isTenPercentOwner flag and tracking cumulative acquired shares (transactionAcquiredDisposedCode = A) across sequential filings for the same reporting-owner CIK and issuer. Rising sharesOwnedFollowingTransaction totals over rolling windows can signal stake-building ahead of a Schedule 13D filing or proxy contest. Historical filing depth supports pattern matching against prior contested situations for the same issuer or holder.

Resolving derivative-transaction terms through footnote extraction

Data engineers building structured insider-transaction databases parse the derivativeTable to capture option exercises, conversions, and award vesting. Fields such as conversionOrExercisePrice, exerciseDate, and expirationDate frequently contain only a footnoteId reference rather than a literal value. Resolving these references against the footnotes element recovers vesting schedules, plan names, and price computation formulas needed to produce complete derivative-transaction records for downstream analytics and screening tools.

Reconstructing insider trading timelines for enforcement and litigation

Forensic accountants and investigators pull every Form 4 and Form 4/A for a given reporting-owner CIK to build a chronological record of all transactions across issuers. Transaction dates, prices, share volumes, and transaction codes enable precise calculation of profits or avoided losses relative to material events such as earnings misses or restatements. Amendment history is critical: corrections disclosed in Form 4/A filings can reveal originally misreported transactions relevant to enforcement proceedings or shareholder litigation.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-4-files.json

This endpoint returns metadata about the Form 4 Files Dataset, including its name, description, last updated timestamp, earliest sample date, total records, total size, form types covered (4 and 4/A), container format (ZIP), and content file types (TXT, JSON, HTML, PDF, XML). It also returns the download URL for the entire dataset and a list of all individual container files with per-container metadata such as size, record count, last updated timestamp, and download URL. This endpoint does not require an API key.

Use this API to monitor which containers have been updated in the most recent daily refresh, so you can selectively download only the containers that changed rather than re-downloading the full dataset each time.

Example
1 {
2 "datasetId": "1f1333bd-dbdb-6340-ba36-580af17fba9d",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-4-files.zip",
4 "name": "Form 4 Files Dataset",
5 "updatedAt": "2026-04-17T02:54:16.820Z",
6 "earliestSampleDate": "1996-01-01",
7 "totalRecords": 9579934,
8 "totalSize": 28354410534,
9 "formTypes": ["4", "4/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF", "XML"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-4-files/2025/2025-10.zip",
15 "key": "2025/2025-10.zip",
16 "size": 135291847,
17 "records": 29543,
18 "updatedAt": "2026-04-17T02:54:16.820Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-4-files.zip?token=YOUR_API_KEY

Downloads the full dataset as a single ZIP archive containing all containers. This endpoint requires an API key passed as the token query parameter.

Download Single Container: https://api.sec-api.io/datasets/form-4-files/2025/2025-10.zip?token=YOUR_API_KEY

Downloads one individual monthly container instead of the full dataset. Use the container paths returned by the dataset index JSON API to construct the download URL for any specific month. This endpoint requires an API key passed as the token query parameter.

Frequently Asked Questions

What form does this dataset cover?

The Form 4 Files Dataset covers SEC Form 4 (Statement of Changes in Beneficial Ownership of Securities) and Form 4/A (amendments). These are the filings required under Section 16(a) of the Securities Exchange Act of 1934 to report insider transactions.

What does one record in this dataset represent?

One record corresponds to a single EDGAR submission of Form 4 or Form 4/A, identified by accession number. Each record folder contains a metadata.json file, the primary XML ownership document, a pre-rendered HTML view, and occasionally Exhibit 24 (power-of-attorney) attachments.

Who is required to file Form 4?

Three classes of corporate insiders must file: officers (as defined in Rule 16a-1(f)), directors, and beneficial owners of more than ten percent of any Section 12-registered equity class. Foreign private issuers and their insiders are exempt.

How quickly must Form 4 be filed after a transaction?

Form 4 must be filed before the end of the second business day after the transaction date. This accelerated deadline was established by the Sarbanes-Oxley Act of 2002.

What time period does the dataset cover?

The dataset spans January 1996 to the present. Filings before mid-2003 represent voluntary electronic submissions; from June 30, 2003 onward — when the SEC mandated electronic filing of Section 16 forms — coverage is essentially complete.

What file format is the dataset distributed in?

The dataset is distributed as monthly ZIP containers. Each container holds one folder per filing, containing XML, JSON, and HTML files. The primary ownership document is structured XML conforming to the SEC's ownershipDocument schema (for filings from mid-2003 onward); earlier filings are plain text or HTML.

How does this dataset differ from tabular insider-transaction extracts?

Tabular extracts flatten Form 4 XML into row-per-transaction records (insider, ticker, date, code, shares, price). The Form 4 Files Dataset preserves the complete filing package — raw XML, metadata JSON, rendered HTML, and exhibits — retaining footnotes, derivative-table structures, multi-transaction context, amendment chains, and power-of-attorney documents that tabular extracts discard.