Form SC 13G Files Dataset

The Form SC 13G Files Dataset contains the complete EDGAR filing packages for every Schedule 13G and Schedule 13G/A beneficial ownership disclosure submitted to the SEC from January 1994 to the present. Each record represents a single filing in which a reporting person — an institutional investor, passive investor, or exempt investor — declares beneficial ownership of more than 5% of a class of registered equity securities without a purpose to influence or change control of the issuer. The dataset covers all four EDGAR form-type variants: SC 13G, SC 13G/A, SCHEDULE 13G, and SCHEDULE 13G/A. Records are packaged as ZIP containers organized by month, and each filing folder includes the structured XML document (for post-2024 filings), an XHTML rendering, a JSON metadata sidecar, and any exhibits such as joint filing agreements or powers of attorney.

Update Frequency
Daily
Updated at
2026-05-19
Earliest Sample Date
1994-01-01
Total Size
4.2 GB
Total Records
837,987
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF, XML
Form Types
SC 13G, SC 13G/A, SCHEDULE 13G, SCHEDULE 13G/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

389 files · 4.2 GB
Download All
2026-05.zip66.8 MB12,864 records
2026-04.zip33.8 MB9,745 records
2026-03.zip24.9 MB5,963 records
2026-02.zip73.3 MB10,024 records
2026-01.zip16.7 MB4,323 records
2025-12.zip8.6 MB1,235 records
2025-11.zip58.3 MB8,345 records
2025-10.zip20.7 MB4,706 records
2025-09.zip6.3 MB1,128 records
2025-08.zip47.7 MB8,275 records
2025-07.zip21.0 MB5,543 records
2025-06.zip4.5 MB1,055 records
2025-05.zip50.8 MB8,434 records
2025-04.zip23.7 MB7,365 records
2025-03.zip8.0 MB1,418 records
2025-02.zip80.9 MB12,687 records
2025-01.zip16.3 MB3,069 records
2024-12.zip4.6 MB866 records
2024-11.zip47.4 MB8,828 records
2024-10.zip14.8 MB2,734 records
2024-09.zip2.9 MB557 records
2024-08.zip4.4 MB657 records
2024-07.zip3.9 MB792 records
2024-06.zip3.4 MB534 records
2024-05.zip3.6 MB572 records
2024-04.zip4.4 MB642 records
2024-03.zip5.9 MB612 records
2024-02.zip89.6 MB15,041 records
2024-01.zip26.4 MB4,868 records
2023-12.zip3.0 MB537 records
2023-11.zip2.6 MB499 records
2023-10.zip2.7 MB542 records
2023-09.zip3.4 MB485 records
2023-08.zip3.2 MB498 records
2023-07.zip3.6 MB602 records
2023-06.zip4.2 MB501 records
2023-05.zip4.1 MB511 records
2023-04.zip5.5 MB600 records
2023-03.zip3.4 MB636 records
2023-02.zip104.2 MB17,164 records
2023-01.zip26.7 MB4,294 records
2022-12.zip3.0 MB555 records
2022-11.zip2.9 MB526 records
2022-10.zip3.2 MB468 records
2022-09.zip2.7 MB487 records
2022-08.zip4.7 MB646 records
2022-07.zip6.1 MB661 records
2022-06.zip3.3 MB581 records
2022-05.zip4.4 MB635 records
2022-04.zip3.8 MB684 records
2022-03.zip5.1 MB879 records
2022-02.zip111.9 MB19,406 records
2022-01.zip23.6 MB4,147 records
2021-12.zip5.2 MB861 records
2021-11.zip4.2 MB746 records
2021-10.zip4.3 MB697 records
2021-09.zip5.8 MB594 records
2021-08.zip4.3 MB727 records
2021-07.zip5.3 MB934 records
2021-06.zip5.6 MB556 records
2021-05.zip5.3 MB678 records
2021-04.zip5.5 MB780 records
2021-03.zip5.8 MB924 records
2021-02.zip95.2 MB16,941 records
2021-01.zip21.9 MB3,629 records
2020-12.zip4.2 MB656 records
2020-11.zip3.7 MB589 records
2020-10.zip4.9 MB677 records
2020-09.zip3.0 MB498 records
2020-08.zip3.1 MB496 records
2020-07.zip3.5 MB655 records
2020-06.zip5.0 MB593 records
2020-05.zip3.3 MB567 records
2020-04.zip3.3 MB639 records
2020-03.zip3.3 MB619 records
2020-02.zip88.5 MB16,250 records
2020-01.zip12.3 MB2,131 records
2019-12.zip2.7 MB501 records
2019-11.zip3.3 MB514 records
2019-10.zip2.3 MB414 records
2019-09.zip2.4 MB447 records
2019-08.zip3.3 MB451 records
2019-07.zip3.3 MB479 records
2019-06.zip2.8 MB490 records
2019-05.zip2.7 MB509 records
2019-04.zip3.0 MB524 records
2019-03.zip3.2 MB576 records
2019-02.zip83.6 MB15,575 records
2019-01.zip17.6 MB3,172 records
2018-12.zip2.7 MB454 records
2018-11.zip2.7 MB489 records
2018-10.zip2.8 MB497 records
2018-09.zip2.1 MB398 records
2018-08.zip2.7 MB508 records
2018-07.zip3.3 MB612 records
2018-06.zip3.0 MB488 records
2018-05.zip2.7 MB502 records
2018-04.zip3.0 MB538 records
2018-03.zip2.7 MB505 records
2018-02.zip82.0 MB14,565 records
2018-01.zip19.9 MB3,936 records
2017-12.zip2.7 MB494 records
2017-11.zip2.9 MB541 records
2017-10.zip2.4 MB456 records
2017-09.zip2.3 MB460 records
2017-08.zip2.8 MB532 records
2017-07.zip3.0 MB564 records
2017-06.zip2.9 MB520 records
2017-05.zip2.9 MB528 records
2017-04.zip3.2 MB576 records
2017-03.zip3.3 MB623 records
2017-02.zip76.2 MB14,179 records
2017-01.zip26.7 MB4,671 records
2016-12.zip3.2 MB604 records
2016-11.zip2.7 MB501 records
2016-10.zip2.7 MB505 records
2016-09.zip2.9 MB539 records
2016-08.zip3.2 MB502 records
2016-07.zip3.0 MB574 records
2016-06.zip2.5 MB470 records
2016-05.zip2.5 MB492 records
2016-04.zip3.2 MB602 records
2016-03.zip3.6 MB649 records
2016-02.zip88.7 MB15,925 records
2016-01.zip24.9 MB4,078 records
2015-12.zip2.9 MB540 records
2015-11.zip2.9 MB521 records
2015-10.zip2.9 MB531 records
2015-09.zip4.4 MB1,046 records
2015-08.zip3.1 MB591 records
2015-07.zip3.8 MB702 records
2015-06.zip3.2 MB590 records
2015-05.zip3.9 MB571 records
2015-04.zip3.9 MB595 records
2015-03.zip5.0 MB659 records
2015-02.zip89.2 MB16,005 records
2015-01.zip20.3 MB3,878 records
2014-12.zip3.3 MB622 records
2014-11.zip3.3 MB557 records
2014-10.zip4.3 MB606 records
2014-09.zip3.0 MB490 records
2014-08.zip13.1 MB506 records
2014-07.zip3.3 MB608 records
2014-06.zip3.2 MB577 records
2014-05.zip3.2 MB548 records
2014-04.zip3.8 MB610 records
2014-03.zip3.4 MB651 records
2014-02.zip80.4 MB14,725 records
2014-01.zip22.1 MB4,360 records
2013-12.zip3.0 MB513 records
2013-11.zip3.2 MB570 records
2013-10.zip2.6 MB467 records
2013-09.zip2.5 MB468 records
2013-08.zip2.9 MB537 records
2013-07.zip2.8 MB515 records
2013-06.zip2.5 MB479 records
2013-05.zip3.5 MB526 records
2013-04.zip3.3 MB620 records
2013-03.zip3.7 MB729 records
2013-02.zip83.8 MB15,538 records
2013-01.zip15.6 MB2,979 records
2012-12.zip2.6 MB486 records
2012-11.zip2.6 MB494 records
2012-10.zip2.6 MB504 records
2012-09.zip2.0 MB404 records
2012-08.zip2.8 MB507 records
2012-07.zip2.4 MB486 records
2012-06.zip2.7 MB531 records
2012-05.zip3.1 MB556 records
2012-04.zip4.0 MB662 records
2012-03.zip3.1 MB600 records
2012-02.zip85.5 MB15,568 records
2012-01.zip11.6 MB2,399 records
2011-12.zip2.3 MB446 records
2011-11.zip2.6 MB461 records
2011-10.zip2.4 MB455 records
2011-09.zip2.9 MB544 records
2011-08.zip3.0 MB534 records
2011-07.zip2.8 MB580 records
2011-06.zip3.1 MB592 records
2011-05.zip3.7 MB552 records
2011-04.zip3.3 MB615 records
2011-03.zip3.6 MB679 records
2011-02.zip83.0 MB15,494 records
2011-01.zip10.7 MB2,226 records
2010-12.zip3.4 MB528 records
2010-11.zip3.1 MB502 records
2010-10.zip2.5 MB504 records
2010-09.zip2.4 MB458 records
2010-08.zip2.4 MB456 records
2010-07.zip3.0 MB565 records
2010-06.zip3.4 MB659 records
2010-05.zip3.0 MB581 records
2010-04.zip3.6 MB639 records
2010-03.zip4.7 MB1,058 records
2010-02.zip73.2 MB13,645 records
2010-01.zip24.2 MB4,562 records
2009-12.zip2.6 MB527 records
2009-11.zip3.5 MB573 records
2009-10.zip3.9 MB575 records
2009-09.zip2.7 MB531 records
2009-08.zip2.5 MB502 records
2009-07.zip3.3 MB605 records
2009-06.zip3.9 MB738 records
2009-05.zip3.9 MB586 records
2009-04.zip3.2 MB621 records
2009-03.zip4.3 MB875 records
2009-02.zip99.5 MB18,424 records
2009-01.zip15.5 MB3,024 records
2008-12.zip3.8 MB877 records
2008-11.zip4.3 MB856 records
2008-10.zip4.0 MB796 records
2008-09.zip3.5 MB727 records
2008-08.zip3.9 MB820 records
2008-07.zip4.3 MB819 records
2008-06.zip4.1 MB794 records
2008-05.zip4.0 MB799 records
2008-04.zip3.8 MB771 records
2008-03.zip4.8 MB958 records
2008-02.zip95.0 MB19,269 records
2008-01.zip14.5 MB2,944 records
2007-12.zip4.7 MB933 records
2007-11.zip4.6 MB911 records
2007-10.zip4.4 MB784 records
2007-09.zip3.6 MB752 records
2007-08.zip5.3 MB1,078 records
2007-07.zip4.1 MB825 records
2007-06.zip4.5 MB831 records
2007-05.zip3.8 MB774 records
2007-04.zip3.5 MB754 records
2007-03.zip4.9 MB1,025 records
2007-02.zip80.9 MB17,062 records
2007-01.zip15.1 MB3,384 records
2006-12.zip3.9 MB858 records
2006-11.zip3.7 MB736 records
2006-10.zip3.1 MB673 records
2006-09.zip2.8 MB595 records
2006-08.zip2.9 MB636 records
2006-07.zip3.1 MB639 records
2006-06.zip3.0 MB673 records
2006-05.zip3.6 MB764 records
2006-04.zip3.2 MB693 records
2006-03.zip4.4 MB914 records
2006-02.zip75.2 MB16,086 records
2006-01.zip11.5 MB2,678 records
2005-12.zip2.7 MB604 records
2005-11.zip2.6 MB570 records
2005-10.zip2.9 MB656 records
2005-09.zip2.7 MB596 records
2005-08.zip2.9 MB650 records
2005-07.zip2.4 MB530 records
2005-06.zip2.7 MB574 records
2005-05.zip3.2 MB663 records
2005-04.zip3.4 MB766 records
2005-03.zip3.6 MB827 records
2005-02.zip67.3 MB14,874 records
2005-01.zip8.9 MB2,118 records
2004-12.zip2.7 MB584 records
2004-11.zip2.5 MB550 records
2004-10.zip2.0 MB463 records
2004-09.zip2.2 MB505 records
2004-08.zip2.3 MB529 records
2004-07.zip2.7 MB558 records
2004-06.zip2.4 MB538 records
2004-05.zip2.4 MB522 records
2004-04.zip3.6 MB655 records
2004-03.zip3.5 MB752 records
2004-02.zip64.1 MB14,116 records
2004-01.zip7.3 MB1,869 records
2003-12.zip2.2 MB520 records
2003-11.zip2.5 MB572 records
2003-10.zip2.3 MB491 records
2003-09.zip2.4 MB491 records
2003-08.zip2.4 MB512 records
2003-07.zip2.6 MB539 records
2003-06.zip2.2 MB517 records
2003-05.zip2.2 MB513 records
2003-04.zip2.3 MB529 records
2003-03.zip4.0 MB870 records
2003-02.zip58.7 MB13,622 records
2003-01.zip8.6 MB2,080 records
2002-12.zip1.8 MB444 records
2002-11.zip2.1 MB540 records
2002-10.zip1.9 MB464 records
2002-09.zip1.8 MB455 records
2002-08.zip2.1 MB507 records
2002-07.zip2.1 MB537 records
2002-06.zip2.1 MB555 records
2002-05.zip2.3 MB565 records
2002-04.zip2.6 MB550 records
2002-03.zip3.0 MB747 records
2002-02.zip56.3 MB14,248 records
2002-01.zip7.6 MB2,007 records
2001-12.zip2.0 MB500 records
2001-11.zip2.0 MB518 records
2001-10.zip1.9 MB465 records
2001-09.zip1.6 MB390 records
2001-08.zip2.1 MB514 records
2001-07.zip2.1 MB531 records
2001-06.zip3.4 MB548 records
2001-05.zip2.6 MB669 records
2001-04.zip2.3 MB628 records
2001-03.zip3.3 MB832 records
2001-02.zip60.2 MB15,336 records
2001-01.zip9.3 MB2,436 records
2000-12.zip2.0 MB531 records
2000-11.zip2.9 MB766 records
2000-10.zip1.7 MB451 records
2000-09.zip2.0 MB510 records
2000-08.zip2.1 MB552 records
2000-07.zip1.9 MB477 records
2000-06.zip2.3 MB587 records
2000-05.zip2.2 MB589 records
2000-04.zip2.7 MB729 records
2000-03.zip3.9 MB996 records
2000-02.zip57.7 MB16,054 records
2000-01.zip8.3 MB2,148 records
1999-12.zip2.3 MB609 records
1999-11.zip2.0 MB510 records
1999-10.zip2.1 MB530 records
1999-09.zip2.0 MB515 records
1999-08.zip2.1 MB577 records
1999-07.zip2.0 MB535 records
1999-06.zip1.9 MB539 records
1999-05.zip2.1 MB550 records
1999-04.zip2.6 MB714 records
1999-03.zip3.5 MB879 records
1999-02.zip59.2 MB16,467 records
1999-01.zip7.6 MB1,984 records
1998-12.zip2.3 MB611 records
1998-11.zip2.2 MB581 records
1998-10.zip2.6 MB685 records
1998-09.zip3.1 MB768 records
1998-08.zip2.2 MB593 records
1998-07.zip5.0 MB1,392 records
1998-06.zip2.1 MB582 records
1998-05.zip2.3 MB636 records
1998-04.zip2.9 MB763 records
1998-03.zip4.6 MB1,269 records
1998-02.zip56.3 MB15,820 records
1998-01.zip9.4 MB2,437 records
1997-12.zip1.4 MB387 records
1997-11.zip1.3 MB369 records
1997-10.zip1.3 MB344 records
1997-09.zip1.3 MB359 records
1997-08.zip1.4 MB407 records
1997-07.zip1.7 MB473 records
1997-06.zip1.5 MB410 records
1997-05.zip1.8 MB526 records
1997-04.zip2.6 MB725 records
1997-03.zip3.1 MB896 records
1997-02.zip50.4 MB14,026 records
1997-01.zip7.8 MB2,148 records
1996-12.zip1.3 MB325 records
1996-11.zip1.1 MB302 records
1996-10.zip894.9 KB234 records
1996-09.zip987.5 KB258 records
1996-08.zip1.2 MB340 records
1996-07.zip1.1 MB274 records
1996-06.zip1.1 MB298 records
1996-05.zip1.2 MB316 records
1996-04.zip851.6 KB225 records
1996-03.zip1.8 MB444 records
1996-02.zip26.5 MB7,234 records
1996-01.zip3.3 MB904 records
1995-12.zip608.3 KB164 records
1995-11.zip540.9 KB150 records
1995-10.zip604.5 KB183 records
1995-09.zip527.5 KB145 records
1995-08.zip553.8 KB143 records
1995-07.zip517.6 KB135 records
1995-06.zip715.3 KB191 records
1995-05.zip622.7 KB165 records
1995-04.zip985.2 KB279 records
1995-03.zip1.0 MB298 records
1995-02.zip19.4 MB5,271 records
1995-01.zip1.6 MB472 records
1994-12.zip420.1 KB110 records
1994-11.zip311.7 KB87 records
1994-10.zip372.3 KB97 records
1994-09.zip429.9 KB108 records
1994-08.zip472.8 KB129 records
1994-07.zip566.8 KB153 records
1994-06.zip445.7 KB121 records
1994-05.zip376.4 KB105 records
1994-04.zip372.0 KB101 records
1994-03.zip612.9 KB160 records
1994-02.zip12.3 MB3,356 records
1994-01.zip652.1 KB170 records

What This Dataset Contains

Schedule 13G is a shortened alternative to Schedule 13D, available to beneficial owners of more than 5% of a voting equity class who do not hold the securities with a purpose or effect of changing or influencing control of the issuer. Three categories of filers qualify: institutional investors under Rule 13d-1(b) (registered investment advisers, broker-dealers, banks, insurance companies, and similar regulated entities); passive investors under Rule 13d-1(c), who acquired the securities in the ordinary course of business without a control purpose; and exempt investors under Rule 13d-1(d), who acquired the securities before the issuer's registration or in certain non-public transactions. The filing obligation triggers when beneficial ownership crosses the 5% threshold and continues through amendment filings (SC 13G/A or SCHEDULE 13G/A) whenever material changes in ownership position, percentage, or filer eligibility occur.

The dataset spans from January 1994 to the present and is distributed as ZIP containers, each containing individual filing folders. Each folder is named by its zero-padded, dash-stripped accession number (e.g., 000001961725000501 for accession 0000019617-25-000501). The files within each folder include structured XML, browser-rendered XHTML, JSON metadata, and any submitted exhibits, in TXT, JSON, HTML, PDF, and XML formats.

Content Structure of a Single Record

A single record in the Form SC 13G Files Dataset is one complete Schedule 13G or Schedule 13G/A filing packaged as a self-contained folder. The folder contains the structured XML filing, a browser-rendered XHTML view, a JSON metadata sidecar, and any exhibits submitted with the schedule. Each record represents a single beneficial ownership disclosure: one reporting person (or a group of co-reporting persons) declaring beneficial ownership of more than 5% of a class of registered equity securities under the abbreviated Schedule 13G regime.

Folder Contents

Each record folder contains three baseline files present in every filing, plus zero or more exhibit files:

primary_doc.xml — The structured Schedule 13G filing in SEC EDGAR XML schema format (http://www.sec.gov/edgar/schedule13g namespace). This is the machine-readable core containing all substantive beneficial ownership disclosures.

xslSCHEDULE_13G_X01/primary_doc.xml — An XHTML rendering of the same XML data, produced by the SEC's XSL stylesheet transformation. This styled, browser-readable document is the version linked from the EDGAR filing details page. It contains no information beyond what appears in the primary XML; it is a presentation layer only.

metadata.json — A JSON sidecar file containing EDGAR index-level metadata for the filing, generated from the EDGAR filing index rather than from the submission itself.

Exhibit files (zero or more) — Additional documents submitted alongside the schedule, typically in .htm, .txt, or .pdf format. Common exhibit types are EX-99 / EX-99.1 (joint filing agreements, Item 7 subsidiary lists, supplementary materials) and EX-24 (powers of attorney).

metadata.json: Filing-Level Index Data

The metadata sidecar captures EDGAR index fields that contextualize the filing. Key fields include:

  • formType: The specific form variant (SC 13G, SC 13G/A, SCHEDULE 13G, or SCHEDULE 13G/A).
  • accessionNo: The SEC accession number in standard dashed format.
  • filedAt: ISO 8601 timestamp recording when EDGAR accepted the filing.
  • id: A 32-character hexadecimal identifier.
  • description: A human-readable description string (e.g., "Form SCHEDULE 13G/A - Statement of Beneficial Ownership by Certain Investors: [Amend]").
  • linkToFilingDetails, linkToTxt, linkToHtml: URLs pointing to the EDGAR rendered filing page, the complete submission text file, and the filing index page, respectively.
  • linkToXbrl: Always empty; Schedule 13G filings are not subject to XBRL tagging.
  • entities: An array typically containing two entries: one for the reporting person (marked (Filed by) in the companyName field) and one for the issuer (marked (Subject)). The ordering varies across filings — either entity may appear first. Each entity entry includes CIK, IRS number, SIC code, fiscal year end, state of incorporation, and any associated ticker symbols. The Subject entity additionally carries act, fileNo, and filmNo fields.
  • documentFormatFiles: An array listing every document in the EDGAR submission package, with sequence number, file size, document URL, MIME type, and description. The first entry is typically the XSL-rendered view (with blank size), the second is the raw XML, subsequent entries are exhibits, and the final entry is the complete EDGAR submission text file (with blank sequence and type).
  • dataFiles and seriesAndClassesContractsInformation: Always empty arrays for this filing type.

primary_doc.xml: Structured Beneficial Ownership Disclosure

The XML document follows the SEC EDGAR Schedule 13G XML schema and is organized into a strict hierarchy under the root edgarSubmission element.

Submission Header (headerData)

The opening block identifies the submission type (e.g., SCHEDULE 13G or SCHEDULE 13G/A), the filer's CIK via filerCredentials (with the CCC confirmation code redacted as XXXXXXXX), and a liveTestFlag set to LIVE in production filings.

Cover Page Header (coverPageHeader)

This block contains the core identifying fields for the filing:

  • amendmentNo: Present only on amendments; the sequential amendment number (e.g., 1, 2).
  • securitiesClassTitle: A textual description of the equity class (e.g., "Common Stock, $0.0001 Par Value Per Share" or "Class A Common Stock, par value $0.0001 per share").
  • eventDateRequiresFilingThisStatement: The date of the triggering event — a threshold crossing, calendar quarter-end, or material change in ownership — in MM/DD/YYYY format.
  • issuerInfo: A nested block containing the issuer's CIK (issuerCik), legal name (issuerName), CUSIP number (issuerCusip), and principal executive office address (street, city, state/country, ZIP code).
  • designateRulesPursuantThisScheduleFiled: Contains one or more designateRulePursuantThisScheduleFiled elements specifying the SEC rule under which the filer claims eligibility for Schedule 13G — typically Rule 13d-1(b), Rule 13d-1(c), or Rule 13d-1(d).

Reporting Person Details (coverPageHeaderReportingPersonDetails)

One block appears per reporting person. Joint filings produce multiple blocks. Each block contains:

  • reportingPersonName: The legal name of the beneficial owner.
  • memberGroup: A group identifier (e.g., b) present when multiple persons file jointly; absent in single-filer schedules.
  • citizenshipOrOrganization: State or country of incorporation or citizenship.
  • reportingPersonBeneficiallyOwnedNumberOfShares: Four numeric sub-elements — soleVotingPower, sharedVotingPower, soleDispositivePower, and sharedDispositivePower — reporting the number of shares under each type of authority.
  • reportingPersonBeneficiallyOwnedAggregateNumberOfShares: Total shares beneficially owned.
  • aggregateAmountExcludesCertainSharesFlag: Y or N, indicating whether the aggregate amount excludes certain shares pursuant to Rule 13d-4.
  • classPercent: The percentage of the outstanding class.
  • typeOfReportingPerson: One or more two-letter classification codes: HC (holding company), BD (broker-dealer), IA (investment adviser), IN (insurance company), BK (bank), IC (investment company), EP (employee benefit plan), CO (corporation), PN (partnership), FI (financial institution), OO (other), among others.
  • comments: An optional free-text field for supplementary remarks. Some filers use this to explain warrant-based ownership, beneficial ownership limitations, or other qualifications that do not fit the structured fields.

Items 1 Through 10 (items)

The XML encodes structured responses to the ten numbered items prescribed by the Schedule 13G form. Each item contains a notApplicableFlag (Y/N) and, when applicable, the substantive content:

  • Item 1 (item1): Issuer name and principal executive office address (restated from the cover page).
  • Item 2 (item2): Reporting person name (filingPersonName), principal business office or residence address, and citizenship/state of organization. Large institutional filers sometimes embed extended narrative disclaimers in the filingPersonName element (e.g., explanations of "Reporting Business Units" and disaggregated beneficial ownership per SEC Release No. 34-39538).
  • Item 3 (item3): The type of person filing, expressed via typeOfPersonFiling or otherTypeOfPersonFiling (the element name varies by filer). Uses the same two-letter codes as the cover page.
  • Item 4 (item4): Beneficial ownership detail: amountBeneficiallyOwned, classPercent, and the four-field voting/dispositive power breakdown (solePowerOrDirectToVote, sharedPowerOrDirectToVote, solePowerOrDirectToDispose, sharedPowerOrDirectToDispose). These figures parallel those in the cover page reporting person block.
  • Item 5 (item5): Whether ownership is five percent or less (classOwnership5PercentOrLess). A Y value signals an exit filing reporting that the position has dropped below the reporting threshold.
  • Item 6 (item6): Whether beneficial ownership is held on behalf of another person (ownershipMoreThan5PercentOnBehalfOfAnotherPerson), with identification of that person if applicable.
  • Item 7 (item7): Identification and classification of subsidiaries, controlled entities, or affiliates through which the reporting person exercises beneficial ownership (subsidiaryIdentificationAndClassification). For large institutional holding companies, this may contain an inline list of entities or a cross-reference such as "See Exhibit 99" directing to a separate exhibit with a detailed subsidiary list.
  • Item 8 (item8): Identification of members of a jointly-filing group (identificationAndClassificationOfGroupMembers), if applicable.
  • Item 9 (item9): Notice of group dissolution (groupDissolutionNotice), if the filing reports that a previously disclosed group has ceased to exist.
  • Item 10 (item10): Certification text (certifications) establishing the reporting person's eligibility to file on Schedule 13G, typically a standard institutional-investor or passive-investor certification declaring the securities were acquired in the ordinary course of business without a control purpose.

Exhibit Information (exhibitInfo)

An optional text element listing the exhibits attached to the filing (e.g., "Exhibit 24: Power of Attorney Exhibit 99: Item 7"). This element appears in the XML when exhibits are included but is absent from filings with no exhibits.

Signature Block (signatureInformation)

Each filing concludes with one or more signature blocks. Each contains the reportingPersonName, and nested signatureDetails with the typed signature, title (or capacity), and date of execution.

Exhibit Files

Exhibit files are wrapped in SGML document headers when stored in the EDGAR submission format. Each begins with <DOCUMENT> tags containing <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, and <TEXT> elements before the exhibit content itself. The most common exhibit types are:

EX-99 and EX-99.1 (Supplementary Materials) — These serve several distinct purposes. Joint filing agreements are the most common: short agreements between co-reporting persons (e.g., a parent holding company and its investment advisory subsidiaries) confirming consent to file a single Schedule 13G jointly under Rule 13d-1(k). Item 7 subsidiary lists are another frequent use, particularly for large asset managers; these enumerate each subsidiary through which the reporting person exercises beneficial ownership, sometimes annotating which subsidiaries independently own 5% or more of the class. Other EX-99 exhibits may include cover letters or supplementary narrative disclosures. Formats include .htm, .txt, and occasionally .pdf.

EX-24 (Powers of Attorney) — These authorize named individuals to execute and file beneficial ownership reports on behalf of the reporting entity. A typical power of attorney identifies the grantor entity, lists the attorneys-in-fact by name, defines the scope of authority (usually limited to ownership reporting filings under Sections 13(d) and 13(g) of the Securities Exchange Act), and includes a dated signature block.

Included and Excluded Content

Each record folder includes the complete filing as submitted to EDGAR: the structured XML, the rendered XHTML view, the metadata sidecar, and all exhibits. The dataset does not include the raw EDGAR complete submission text file (the monolithic .txt file that concatenates all documents with SGML wrappers), though a URL to it is provided in metadata.json. Schedule 13G filings are not subject to XBRL tagging requirements, so no XBRL instance documents, inline XBRL overlays, or structured financial data files are present.

Format Heterogeneity Across the Dataset

The dataset spans from 1994 to the present, and the internal file structure of records varies substantially by era:

Plain-text era (1994 through mid-2000s): Early filings were submitted as unstructured plain-text documents within EDGAR's SGML submission framework. The primary document is a single .txt file with fixed-format headers and free-text item responses. Parsing requires interpreting unstructured prose with irregular spacing and filer-specific formatting.

HTML era (late 1990s through 2024): Many filers transitioned to HTML-formatted submissions, with the primary document as an .htm file. Internal structure depended entirely on the filer's formatting choices — tabular ownership breakdowns, styled certifications, and varied heading conventions.

Structured XML era (late 2024 onward): The SEC's 2023 modernization rules mandated structured XML submission using the Schedule 13G XML schema, with a September 30, 2024 compliance date. From this point forward, the primary document is primary_doc.xml conforming to the http://www.sec.gov/edgar/schedule13g namespace, and the SEC's SCHEDULE_13G_X01 XSL stylesheet generates the human-readable XHTML rendering in the xslSCHEDULE_13G_X01/ subfolder. The XML structure described in the sections above applies to these modern filings.

Practical implication: A record from 1996 may contain only a plain-text document and a metadata sidecar. A record from 2010 may contain an HTML document with tabular formatting. A record from 2025 contains the structured XML, the XHTML rendering, and any exhibits. Users working across the full historical range must account for this structural heterogeneity.

Interpretation Notes

Amendments restate the full schedule. Records with form type ending in /A are amendments. The amendmentNo element records the sequential amendment number. Each amendment restates the complete schedule; it is not a differential patch. The most recent amendment for a given reporting person–issuer pair supersedes all prior filings.

Joint filings produce multiple reporting-person blocks. When co-reporting persons (e.g., a holding company and its subsidiaries) file a single Schedule 13G, the XML contains multiple coverPageHeaderReportingPersonDetails blocks, each with its own ownership figures, classification codes, and optional comments. A joint filing agreement exhibit (EX-99) is typically attached. The memberGroup element links the reporting persons to their declared group.

Beneficial ownership aggregation follows Rule 13d-3. Ownership figures include shares over which the reporting person has or shares voting power or investment power. For institutional holding companies, the aggregate figure typically represents combined holdings across all controlled subsidiaries, while Item 7 and its associated exhibit break down holdings by subsidiary.

CUSIP provides security-level identification. The issuerCusip field identifies the specific class of equity, which is more precise than the issuer CIK for issuers with multiple equity classes outstanding.

Type-of-reporting-person codes follow a fixed vocabulary. A single reporting person may carry multiple codes (e.g., HC and IA for a holding company that is also a registered investment adviser). These codes are defined in the form instructions and are essential for classifying filers by regulatory category.

Five-percent-or-less amendments serve as exit filings. When a filer's position drops to 5% or below, the amendment reports the reduced position in Item 5 (classOwnership5PercentOrLess = Y) and classPercent, satisfying the obligation to disclose that the position no longer exceeds the reporting threshold.

Entity ordering in metadata.json is not fixed. The entities array may list the Subject (issuer) entity first or the Filed by (reporting person) entity first. Consumers should match on the (Filed by) or (Subject) suffix in the companyName field rather than relying on array position.

Exhibit structure varies by filer. Large institutional filers (e.g., BlackRock, Vanguard, JPMorgan) tend to produce highly standardized filings with consistent exhibit structures — named files like PowerOfAttorney.txt and Item_7.txt. Smaller filers may submit minimal or idiosyncratically formatted exhibits. The SGML document wrapper within exhibit files provides type, sequence, and description metadata useful for programmatic classification.

Who Files or Publishes This Dataset, and When

The filer of a Schedule 13G is the reporting person — the outside investor or entity that beneficially owns more than 5% of an equity class registered under Section 12 of the Exchange Act. The filer is not the issuer; the issuer appears only as the subject company whose shares are owned.

A reporting person may be a natural person, corporation, LLC, partnership, trust, investment adviser, bank, broker-dealer, insurance company, registered investment company, employee benefit plan, or any other entity. Joint filings are common: a parent holding company and its advisory subsidiaries, or multiple members of a group, often file a single Schedule 13G listing each reporting person.

Filer Eligibility Categories

Qualified Institutional Investors (Rule 13d-1(b)). The largest filer category. Includes banks, registered broker-dealers, insurance companies, registered investment companies, registered investment advisers, ERISA plans, savings associations, church plans, comparable foreign institutions, and parent/control persons of any of the foregoing. They must have acquired and hold the securities in the ordinary course of business, without a purpose to influence or change control of the issuer. Most filings in this dataset come from large asset managers (BlackRock, Vanguard, State Street, JPMorgan, etc.) filing under this category across hundreds of issuers.

Passive Investors (Rule 13d-1(c)). Any person — institutional or not — who beneficially owns more than 5% but not more than 20% of the class and certifies no purpose or effect of changing or influencing control. Exceeding 20% forces a switch to Schedule 13D.

Exempt Investors (Rule 13d-1(d)). A narrow category for persons whose beneficial ownership arose without an "acquisition" triggering Section 13(d) — mainly pre-registration holders and similar situations.

When Filings Are Triggered

The core trigger is crossing the 5% beneficial ownership threshold in a covered equity class (common stock, preferred stock, or other equity registered under Section 12).

Initial filing deadlines (post-February 5, 2024 rules)

Filer categoryDeadline
Qualified institutional investor45 days after end of calendar quarter in which 5% is first exceeded
Passive investor5 business days after crossing 5%
Exempt investor45 days after end of calendar year in which 5% is first exceeded

Amendment triggers (post-February 5, 2024 rules)

  • Qualified institutional investors: Amended 13G/A due within 45 days after quarter-end for any material change. If ownership exceeds 10%, an amendment is due within 5 business days after month-end, and thereafter within 5 business days after the end of any month in which ownership changes by more than 5 percentage points.
  • Passive investors: Amended 13G/A due within 2 business days of any material change, including crossing a whole-number percentage point.
  • Exempt investors: Amended 13G/A due within 45 days after year-end for any material change.

Pre-2024 deadlines

Before the February 5, 2024 amendments, qualified institutional investors filed initial 13Gs within 45 days after calendar year-end (not quarter-end) and amended annually unless crossing 10%. Passive investors had a 10-day initial deadline. The dataset spans from 1994, so a large share of filings follow the prior timing regime.

Important Distinctions

Schedule 13D vs. 13G. Any beneficial owner above 5% who does not qualify for Schedule 13G — or who acquires with a control purpose — must file Schedule 13D, which requires more detailed disclosure and has a tighter initial deadline (5 business days). Activist investors and potential acquirers use 13D. This dataset does not contain Schedule 13D filings.

Form conversions. A 13G filer who loses eligibility (e.g., forms control intent, or a passive investor exceeds 20%) must convert to 13D within the applicable deadline. Conversely, a 13D filer who becomes eligible may convert to 13G. Conversions appear in the record as amendments.

Group filings. When two or more persons agree to act together regarding securities and their aggregate holdings exceed 5%, the group has a filing obligation. A group may use Schedule 13G only if every member independently qualifies.

The filer is the investor, not the issuer. In EDGAR metadata, the CIK associated with the submission belongs to the reporting person (the investor), not the subject company. This is the reverse of most SEC filing datasets.

Foreign filers. Non-U.S. persons and entities are subject to the same obligations if they own more than 5% of a Section 12 class. Foreign institutions may qualify as qualified institutional investors if comparable to an enumerated domestic type. There is no separate foreign-filer form.

Fund complexes. Large asset managers frequently file jointly for parent companies, advisory subsidiaries, and individual funds, reflecting shares held across many client accounts and portfolios.

How This Dataset Differs From Similar Datasets or Filings

Schedule 13D (SC 13D / SC 13D/A)

Schedule 13D is the closest relative. Both forms trigger at 5% beneficial ownership of a registered equity class under Exchange Act Section 13(d), identify the same issuer-and-class fields, and report ownership amounts and percentages.

The dividing line is investor intent. Schedule 13D applies when the holder acquires or holds shares with a purpose of influencing or changing control. Schedule 13G is the abbreviated alternative reserved for passive investors — institutional investors, registered investment companies, broker-dealers, banks, insurance companies, and certain exempt holders who certify no activist intent. Because of this, 13D filings are substantially longer: they require narrative disclosure of funding sources, transaction purposes, plans for corporate changes, and agreements concerning the issuer's securities. Schedule 13G filings omit most of this and are largely tabular.

The populations overlap dynamically. A holder who shifts from passive to activist must reclassify from 13G to 13D, and vice versa. Filing cadence also differs: 13G amendments generally cluster around quarterly or annual deadlines, while 13D amendments are event-driven, due promptly after material changes. A complete view of large beneficial ownership for any issuer requires both filing streams.

Form 13F (13F-HR / 13F-HR/A)

Form 13F requires institutional managers with over $100 million in Section 13(f) securities to report all qualifying holdings quarterly. A large fund complex often appears in both datasets for the same issuer, creating apparent overlap.

The key differences are trigger, scope, and granularity. 13F is portfolio-wide: it lists every qualifying position regardless of ownership percentage, often thousands per filing. Schedule 13G covers exactly one issuer-security relationship per filing, triggered only when the position crosses 5% of the class. 13F is filed strictly quarterly by institutional managers meeting the AUM threshold. 13G is filed by a broader set of qualifying beneficial owners (not only managers but also banks, broker-dealers, and exempt persons) and is threshold-triggered rather than calendar-driven. 13F data is highly tabular with CUSIP-level structure; 13G filings are semi-structured documents with less standardized formatting.

Forms 3, 4, and 5 (Section 16 Insider Reporting)

Forms 3/4/5 report ownership and transactions by officers, directors, and 10%-or-greater beneficial owners under Exchange Act Section 16. The 10% owner category creates direct overlap with Schedule 13G filers who cross that threshold.

The regimes serve different purposes. Section 16 is a transaction-disclosure and short-swing-profit regime: Form 4 reports individual trades within two business days, with grant-level detail on options and derivatives. Schedule 13G reports aggregate ownership positions, not transactions. Below 10%, the populations diverge entirely — Forms 3/4/5 cover officers and directors regardless of ownership size, while 13G covers any qualifying passive investor above 5% with no insider relationship required. Form 4 filings are XBRL-tagged and transaction-level; 13G filings are semi-structured and position-level.

Full-Filing Files vs. Extracted-Field Datasets

This dataset provides complete EDGAR filing packages — primary documents, exhibits, cover-page certifications, joint-filing agreements, and metadata. Structured or extracted-field datasets derived from Schedule 13G parse these into tabular fields (reporting person, issuer CIK, shares owned, percentage of class, amendment status). The full-filing dataset preserves content that extractions typically discard — footnotes qualifying ownership calculations, co-filer identity in group filings, powers of attorney, and the exact filing text. It requires downstream parsing but supports text-level and exhibit-level analysis that tabular extracts cannot.

Boundary Summary

The Form SC 13G Files Dataset isolates passive large-block equity ownership: positions crossing 5% of an issuer's equity class, held by investors certifying no intent to influence control. Schedule 13D covers the same threshold for activist holders with far richer narrative disclosure. Form 13F captures full institutional portfolios without a concentration filter. Forms 3/4/5 track insider transactions at a granular level for a different statutory population. This dataset is distinct in combining the 5% concentration threshold, passive-investor scope, abbreviated disclosure format, and full-filing document preservation.

Who Uses This Dataset

Schedule 13G filings document passive beneficial ownership above the 5% threshold. The structured fields — reporting person identity, aggregate shares, voting and dispositive power, percentage of class, and passive-intent certification — support a range of professional workflows.

Institutional Sales and Capital-Markets Advisory Desks

Sales teams at broker-dealers use holder identity, CUSIP, share counts, and percentage of class to build ownership maps across their coverage universe. Amendments (SC 13G/A) reveal when an institutional holder has increased, decreased, or exited a position, feeding client targeting, roadshow planning, and block-trade origination.

Activist Monitoring and Corporate Defense Advisors

Proxy solicitors and issuer defense teams watch for holders who drop off the 13G dataset or fail to amend — a signal of possible transition to 13D (control intent). The sole-versus-shared voting power and dispositive power fields show how much influence a single entity actually wields, which drives proxy-contest preparation and shareholder engagement strategy.

Equity Research Analysts

Sector analysts use percentage of class, reporting person identity, and amendment history to assess shareholder concentration risk, predict voting dynamics before annual meetings, and gauge institutional conviction. A large passive holder reducing its stake below 5% signals potential liquidity risk and supply-demand imbalance.

Compliance Officers at Asset Managers

Compliance teams cross-reference their own firm's filings against aggregate shares, percentage of class, and event dates to verify accuracy and timeliness. They review peer treatment of the Rule 13d-1(b)/13d-1(c)/13d-1(d) designation and passive-intent certification language to ensure consistent conventions. Amendment history confirms updates were filed within regulatory deadlines.

Securities Lawyers and Disclosure Counsel

Attorneys advising reporting persons examine the rule-designation field to confirm eligibility classification. Item 6 (ownership on behalf of another person) and Item 7 (subsidiary-identification exhibit) matter most for holding-company filers aggregating ownership across subsidiaries. Amendment sequences show how peers handle corrections and annual updates.

Quantitative Researchers and Systematic Strategy Teams

Quant teams extract percentage of class, aggregate shares, and voting/dispositive power fields from structured XML filings to build institutional-ownership concentration factors, passive-holder turnover signals, and ownership-change event indicators. The dataset's coverage from 1994 to present provides longitudinal depth for backtesting; amendment sequences yield holder-issuer ownership time series.

Corporate Governance Researchers

Stewardship teams and proxy advisory analysts segment the shareholder base using the type-of-reporting-person field, passive-intent certification, and voting-power breakdown. Key questions include how ownership concentration among passive holders has evolved across sectors and how frequently holders transition between 13G and 13D status.

Financial Data Engineers

Engineering teams at data vendors and fintech platforms parse the raw filings (XML, HTML, TXT, PDF) into structured ownership databases. Metadata JSON files supply accession numbers, CIK codes, tickers, SIC codes, and entity relationships needed to link 13G records into broader ownership graphs. Amendment chains keyed by accession number support current-state tables with full history.

Investor Relations Teams

IR professionals track which investors have crossed the 5% threshold and how positions shift over time. Aggregate shares, percentage of class, and the sole-versus-shared voting power distinction inform earnings-call preparation, shareholder outreach, and proxy-season planning.

M&A and Due Diligence Teams

Deal teams review 13G filings to identify significant block holders in a target or acquirer. Ownership percentages, holder identities, and the passive-intent certification help assess likely shareholder reception to a proposed transaction. Amendment history reveals recent accumulation or reduction trends relevant to deal timing.

Specific Use Cases

The Form SC 13G Files Dataset supports workflows that depend on knowing which institutional and passive investors hold 5%-or-greater stakes in public companies, how those positions change, and who controls the voting and dispositive power behind them.

Mapping institutional ownership concentration by issuer

Equity research and capital-markets teams extract the reporting person identity, CUSIP, aggregate shares, and percentage of class from each filing to build a per-issuer ownership map of all disclosed 5%-plus passive holders. Combining initial filings with amendment sequences produces a time series of ownership concentration. This feeds shareholder-base segmentation, liquidity-risk assessment, and proxy-season vote forecasting.

Detecting 13G-to-13D reclassification signals

Corporate defense advisors and proxy solicitors monitor for holders who stop filing 13G amendments or file exit amendments (Item 5 classOwnership5PercentOrLess = Y) without a corresponding reduction visible in 13F data. A holder disappearing from the 13G stream while maintaining a large position may be transitioning to Schedule 13D, signaling a shift from passive to activist intent. The rule-designation field (Rule 13d-1(b), 13d-1(c), or 13d-1(d)) and the passive-intent certification in Item 10 provide the baseline against which reclassification is measured.

Building subsidiary-level ownership graphs for large asset managers

Holding-company filers aggregate beneficial ownership across controlled subsidiaries, with Item 7 and its associated EX-99 exhibits listing each subsidiary through which the parent exercises voting or dispositive power. Parsing these exhibits alongside the sole/shared voting power and sole/shared dispositive power fields in the cover page produces an entity-level ownership graph showing which subsidiaries independently hold 5% or more of a given equity class. Financial data engineers use this to populate entity-relationship databases linking parent managers to their sub-advisers and fund vehicles.

Verifying filing timeliness and disclosure consistency for compliance

Compliance teams at asset managers cross-reference their own firm's 13G filings against the dataset to confirm that amendments were filed within regulatory deadlines (annual or quarterly, depending on filer category). They compare the rule-designation field, aggregate share counts, percentage of class, and certification language against peer filings for the same issuer to ensure consistent conventions. The amendment number and event-date fields establish the audit trail.

Constructing passive-holder turnover factors for quantitative strategies

Quantitative researchers extract percentage of class, aggregate shares, and the four-field voting/dispositive power breakdown from the structured XML filings to build ownership-change event indicators. Amendment sequences keyed by reporting person and issuer CUSIP yield holder-issuer time series spanning 1994 to present, supporting backtests of signals such as passive-holder exit momentum, concentration shifts, and institutional turnover. The type-of-reporting-person codes (IA, BK, IC, BD, etc.) allow segmentation by filer category.

Identifying block holders in M&A due diligence

Deal teams query the dataset by issuer CIK or CUSIP to surface all reporting persons holding 5% or more of a target or acquirer. The passive-intent certification and type-of-reporting-person codes distinguish index-fund holders from potentially influential block owners. Recent amendment history reveals accumulation or reduction trends, and the sole-versus-shared voting power split indicates how much influence each holder independently wields over a shareholder vote on the proposed transaction.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-sc-13g-files.json

This endpoint returns metadata about the Form SC 13G Files Dataset, including the dataset name, description, last updated timestamp, earliest sample date, total records and total size, form types covered (SC 13G, SC 13G/A, SCHEDULE 13G, SCHEDULE 13G/A), the container format (ZIP), and content file types (TXT, JSON, HTML, PDF, XML). It also returns the download URL for the entire dataset and a list of all individual container files with per-container metadata such as size, record count, updated timestamp, and download URL. This endpoint does not require an API key.

Use this API to monitor which containers have been updated in the most recent refresh run, allowing you to selectively download only the containers that changed on a given day.

Example
1 {
2 "datasetId": "1f1333bd-dbdd-6a51-9d2b-a6f0a44b21c2",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-sc-13g-files.zip",
4 "name": "Form SC 13G Files Dataset",
5 "updatedAt": "2026-04-17T02:59:44.197Z",
6 "earliestSampleDate": "1994-01-01",
7 "totalRecords": 816360,
8 "totalSize": 4128198204,
9 "formTypes": ["SC 13G", "SC 13G/A", "SCHEDULE 13G", "SCHEDULE 13G/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF", "XML"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-sc-13g-files/2025/2025-06.zip",
15 "key": "2025/2025-06.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-17T02:59:44.197Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-sc-13g-files.zip?token=YOUR_API_KEY

Downloads the complete dataset as a single ZIP archive containing all containers. This endpoint requires an API key passed as the token query parameter.

Download Single Container: https://api.sec-api.io/datasets/form-sc-13g-files/2025/2025-06.zip?token=YOUR_API_KEY

Downloads one individual monthly container file instead of the full dataset. Replace the year and month path segments to target a specific period. This endpoint requires an API key passed as the token query parameter.

Frequently Asked Questions

What forms does the Form SC 13G Files Dataset cover?

The dataset covers four EDGAR form-type variants: SC 13G, SC 13G/A, SCHEDULE 13G, and SCHEDULE 13G/A. These represent initial Schedule 13G filings and their amendments.

What does one record in this dataset represent?

One record is a single Schedule 13G or Schedule 13G/A filing, packaged as a self-contained folder containing the structured XML filing (for post-2024 submissions), an XHTML rendering, a JSON metadata sidecar, and any exhibits such as joint filing agreements or powers of attorney.

Who is required to file Schedule 13G?

Schedule 13G must be filed by any person or entity that beneficially owns more than 5% of a class of equity securities registered under Exchange Act Section 12 and qualifies as a qualified institutional investor (Rule 13d-1(b)), a passive investor (Rule 13d-1(c)), or an exempt investor (Rule 13d-1(d)). Holders who acquire with a control purpose must file Schedule 13D instead.

How often are new records added to the dataset?

New filings are added as they are submitted to EDGAR. Initial filings are triggered by crossing the 5% ownership threshold. Amendments follow varying schedules depending on filer category — quarterly for qualified institutional investors, within 2 business days of material changes for passive investors, and annually for exempt investors under the post-February 2024 rules.

What time period does the dataset cover?

The dataset includes Schedule 13G filings submitted to the SEC via EDGAR from January 1994 to the present. Filing format varies by era: plain text (1994 through mid-2000s), HTML (late 1990s through 2024), and structured XML (late 2024 onward).

How does this dataset differ from Schedule 13D filings?

Schedule 13D covers beneficial owners above 5% who acquire or hold shares with a purpose of influencing or changing control of the issuer — activist investors and potential acquirers. Schedule 13G is the abbreviated alternative for passive holders. 13D filings include detailed narrative disclosure of funding sources, transaction purposes, and plans for corporate changes, while 13G filings are largely tabular. This dataset contains only Schedule 13G filings.

What file format is the dataset distributed in?

The dataset is distributed as ZIP containers organized by month. Each container holds individual filing folders. Files within each folder may include TXT, JSON, HTML, PDF, and XML formats depending on the filing era.