The Form SC 13G Files Dataset contains the complete EDGAR filing packages for every Schedule 13G and Schedule 13G/A beneficial ownership disclosure submitted to the SEC from January 1994 to the present. Each record represents a single filing in which a reporting person — an institutional investor, passive investor, or exempt investor — declares beneficial ownership of more than 5% of a class of registered equity securities without a purpose to influence or change control of the issuer. The dataset covers all four EDGAR form-type variants: SC 13G, SC 13G/A, SCHEDULE 13G, and SCHEDULE 13G/A. Records are packaged as ZIP containers organized by month, and each filing folder includes the structured XML document (for post-2024 filings), an XHTML rendering, a JSON metadata sidecar, and any exhibits such as joint filing agreements or powers of attorney.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
Schedule 13G is a shortened alternative to Schedule 13D, available to beneficial owners of more than 5% of a voting equity class who do not hold the securities with a purpose or effect of changing or influencing control of the issuer. Three categories of filers qualify: institutional investors under Rule 13d-1(b) (registered investment advisers, broker-dealers, banks, insurance companies, and similar regulated entities); passive investors under Rule 13d-1(c), who acquired the securities in the ordinary course of business without a control purpose; and exempt investors under Rule 13d-1(d), who acquired the securities before the issuer's registration or in certain non-public transactions. The filing obligation triggers when beneficial ownership crosses the 5% threshold and continues through amendment filings (SC 13G/A or SCHEDULE 13G/A) whenever material changes in ownership position, percentage, or filer eligibility occur.
The dataset spans from January 1994 to the present and is distributed as ZIP containers, each containing individual filing folders. Each folder is named by its zero-padded, dash-stripped accession number (e.g., 000001961725000501 for accession 0000019617-25-000501). The files within each folder include structured XML, browser-rendered XHTML, JSON metadata, and any submitted exhibits, in TXT, JSON, HTML, PDF, and XML formats.
A single record in the Form SC 13G Files Dataset is one complete Schedule 13G or Schedule 13G/A filing packaged as a self-contained folder. The folder contains the structured XML filing, a browser-rendered XHTML view, a JSON metadata sidecar, and any exhibits submitted with the schedule. Each record represents a single beneficial ownership disclosure: one reporting person (or a group of co-reporting persons) declaring beneficial ownership of more than 5% of a class of registered equity securities under the abbreviated Schedule 13G regime.
Each record folder contains three baseline files present in every filing, plus zero or more exhibit files:
primary_doc.xml — The structured Schedule 13G filing in SEC EDGAR XML schema format (http://www.sec.gov/edgar/schedule13g namespace). This is the machine-readable core containing all substantive beneficial ownership disclosures.
xslSCHEDULE_13G_X01/primary_doc.xml — An XHTML rendering of the same XML data, produced by the SEC's XSL stylesheet transformation. This styled, browser-readable document is the version linked from the EDGAR filing details page. It contains no information beyond what appears in the primary XML; it is a presentation layer only.
metadata.json — A JSON sidecar file containing EDGAR index-level metadata for the filing, generated from the EDGAR filing index rather than from the submission itself.
Exhibit files (zero or more) — Additional documents submitted alongside the schedule, typically in .htm, .txt, or .pdf format. Common exhibit types are EX-99 / EX-99.1 (joint filing agreements, Item 7 subsidiary lists, supplementary materials) and EX-24 (powers of attorney).
The metadata sidecar captures EDGAR index fields that contextualize the filing. Key fields include:
SC 13G, SC 13G/A, SCHEDULE 13G, or SCHEDULE 13G/A).(Filed by) in the companyName field) and one for the issuer (marked (Subject)). The ordering varies across filings — either entity may appear first. Each entity entry includes CIK, IRS number, SIC code, fiscal year end, state of incorporation, and any associated ticker symbols. The Subject entity additionally carries act, fileNo, and filmNo fields.The XML document follows the SEC EDGAR Schedule 13G XML schema and is organized into a strict hierarchy under the root edgarSubmission element.
headerData)The opening block identifies the submission type (e.g., SCHEDULE 13G or SCHEDULE 13G/A), the filer's CIK via filerCredentials (with the CCC confirmation code redacted as XXXXXXXX), and a liveTestFlag set to LIVE in production filings.
coverPageHeader)This block contains the core identifying fields for the filing:
1, 2).issuerCik), legal name (issuerName), CUSIP number (issuerCusip), and principal executive office address (street, city, state/country, ZIP code).designateRulePursuantThisScheduleFiled elements specifying the SEC rule under which the filer claims eligibility for Schedule 13G — typically Rule 13d-1(b), Rule 13d-1(c), or Rule 13d-1(d).coverPageHeaderReportingPersonDetails)One block appears per reporting person. Joint filings produce multiple blocks. Each block contains:
b) present when multiple persons file jointly; absent in single-filer schedules.soleVotingPower, sharedVotingPower, soleDispositivePower, and sharedDispositivePower — reporting the number of shares under each type of authority.Y or N, indicating whether the aggregate amount excludes certain shares pursuant to Rule 13d-4.HC (holding company), BD (broker-dealer), IA (investment adviser), IN (insurance company), BK (bank), IC (investment company), EP (employee benefit plan), CO (corporation), PN (partnership), FI (financial institution), OO (other), among others.items)The XML encodes structured responses to the ten numbered items prescribed by the Schedule 13G form. Each item contains a notApplicableFlag (Y/N) and, when applicable, the substantive content:
item1): Issuer name and principal executive office address (restated from the cover page).item2): Reporting person name (filingPersonName), principal business office or residence address, and citizenship/state of organization. Large institutional filers sometimes embed extended narrative disclaimers in the filingPersonName element (e.g., explanations of "Reporting Business Units" and disaggregated beneficial ownership per SEC Release No. 34-39538).item3): The type of person filing, expressed via typeOfPersonFiling or otherTypeOfPersonFiling (the element name varies by filer). Uses the same two-letter codes as the cover page.item4): Beneficial ownership detail: amountBeneficiallyOwned, classPercent, and the four-field voting/dispositive power breakdown (solePowerOrDirectToVote, sharedPowerOrDirectToVote, solePowerOrDirectToDispose, sharedPowerOrDirectToDispose). These figures parallel those in the cover page reporting person block.item5): Whether ownership is five percent or less (classOwnership5PercentOrLess). A Y value signals an exit filing reporting that the position has dropped below the reporting threshold.item6): Whether beneficial ownership is held on behalf of another person (ownershipMoreThan5PercentOnBehalfOfAnotherPerson), with identification of that person if applicable.item7): Identification and classification of subsidiaries, controlled entities, or affiliates through which the reporting person exercises beneficial ownership (subsidiaryIdentificationAndClassification). For large institutional holding companies, this may contain an inline list of entities or a cross-reference such as "See Exhibit 99" directing to a separate exhibit with a detailed subsidiary list.item8): Identification of members of a jointly-filing group (identificationAndClassificationOfGroupMembers), if applicable.item9): Notice of group dissolution (groupDissolutionNotice), if the filing reports that a previously disclosed group has ceased to exist.item10): Certification text (certifications) establishing the reporting person's eligibility to file on Schedule 13G, typically a standard institutional-investor or passive-investor certification declaring the securities were acquired in the ordinary course of business without a control purpose.exhibitInfo)An optional text element listing the exhibits attached to the filing (e.g., "Exhibit 24: Power of Attorney Exhibit 99: Item 7"). This element appears in the XML when exhibits are included but is absent from filings with no exhibits.
signatureInformation)Each filing concludes with one or more signature blocks. Each contains the reportingPersonName, and nested signatureDetails with the typed signature, title (or capacity), and date of execution.
Exhibit files are wrapped in SGML document headers when stored in the EDGAR submission format. Each begins with <DOCUMENT> tags containing <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, and <TEXT> elements before the exhibit content itself. The most common exhibit types are:
EX-99 and EX-99.1 (Supplementary Materials) — These serve several distinct purposes. Joint filing agreements are the most common: short agreements between co-reporting persons (e.g., a parent holding company and its investment advisory subsidiaries) confirming consent to file a single Schedule 13G jointly under Rule 13d-1(k). Item 7 subsidiary lists are another frequent use, particularly for large asset managers; these enumerate each subsidiary through which the reporting person exercises beneficial ownership, sometimes annotating which subsidiaries independently own 5% or more of the class. Other EX-99 exhibits may include cover letters or supplementary narrative disclosures. Formats include .htm, .txt, and occasionally .pdf.
EX-24 (Powers of Attorney) — These authorize named individuals to execute and file beneficial ownership reports on behalf of the reporting entity. A typical power of attorney identifies the grantor entity, lists the attorneys-in-fact by name, defines the scope of authority (usually limited to ownership reporting filings under Sections 13(d) and 13(g) of the Securities Exchange Act), and includes a dated signature block.
Each record folder includes the complete filing as submitted to EDGAR: the structured XML, the rendered XHTML view, the metadata sidecar, and all exhibits. The dataset does not include the raw EDGAR complete submission text file (the monolithic .txt file that concatenates all documents with SGML wrappers), though a URL to it is provided in metadata.json. Schedule 13G filings are not subject to XBRL tagging requirements, so no XBRL instance documents, inline XBRL overlays, or structured financial data files are present.
The dataset spans from 1994 to the present, and the internal file structure of records varies substantially by era:
Plain-text era (1994 through mid-2000s): Early filings were submitted as unstructured plain-text documents within EDGAR's SGML submission framework. The primary document is a single .txt file with fixed-format headers and free-text item responses. Parsing requires interpreting unstructured prose with irregular spacing and filer-specific formatting.
HTML era (late 1990s through 2024): Many filers transitioned to HTML-formatted submissions, with the primary document as an .htm file. Internal structure depended entirely on the filer's formatting choices — tabular ownership breakdowns, styled certifications, and varied heading conventions.
Structured XML era (late 2024 onward): The SEC's 2023 modernization rules mandated structured XML submission using the Schedule 13G XML schema, with a September 30, 2024 compliance date. From this point forward, the primary document is primary_doc.xml conforming to the http://www.sec.gov/edgar/schedule13g namespace, and the SEC's SCHEDULE_13G_X01 XSL stylesheet generates the human-readable XHTML rendering in the xslSCHEDULE_13G_X01/ subfolder. The XML structure described in the sections above applies to these modern filings.
Practical implication: A record from 1996 may contain only a plain-text document and a metadata sidecar. A record from 2010 may contain an HTML document with tabular formatting. A record from 2025 contains the structured XML, the XHTML rendering, and any exhibits. Users working across the full historical range must account for this structural heterogeneity.
Amendments restate the full schedule. Records with form type ending in /A are amendments. The amendmentNo element records the sequential amendment number. Each amendment restates the complete schedule; it is not a differential patch. The most recent amendment for a given reporting person–issuer pair supersedes all prior filings.
Joint filings produce multiple reporting-person blocks. When co-reporting persons (e.g., a holding company and its subsidiaries) file a single Schedule 13G, the XML contains multiple coverPageHeaderReportingPersonDetails blocks, each with its own ownership figures, classification codes, and optional comments. A joint filing agreement exhibit (EX-99) is typically attached. The memberGroup element links the reporting persons to their declared group.
Beneficial ownership aggregation follows Rule 13d-3. Ownership figures include shares over which the reporting person has or shares voting power or investment power. For institutional holding companies, the aggregate figure typically represents combined holdings across all controlled subsidiaries, while Item 7 and its associated exhibit break down holdings by subsidiary.
CUSIP provides security-level identification. The issuerCusip field identifies the specific class of equity, which is more precise than the issuer CIK for issuers with multiple equity classes outstanding.
Type-of-reporting-person codes follow a fixed vocabulary. A single reporting person may carry multiple codes (e.g., HC and IA for a holding company that is also a registered investment adviser). These codes are defined in the form instructions and are essential for classifying filers by regulatory category.
Five-percent-or-less amendments serve as exit filings. When a filer's position drops to 5% or below, the amendment reports the reduced position in Item 5 (classOwnership5PercentOrLess = Y) and classPercent, satisfying the obligation to disclose that the position no longer exceeds the reporting threshold.
Entity ordering in metadata.json is not fixed. The entities array may list the Subject (issuer) entity first or the Filed by (reporting person) entity first. Consumers should match on the (Filed by) or (Subject) suffix in the companyName field rather than relying on array position.
Exhibit structure varies by filer. Large institutional filers (e.g., BlackRock, Vanguard, JPMorgan) tend to produce highly standardized filings with consistent exhibit structures — named files like PowerOfAttorney.txt and Item_7.txt. Smaller filers may submit minimal or idiosyncratically formatted exhibits. The SGML document wrapper within exhibit files provides type, sequence, and description metadata useful for programmatic classification.
The filer of a Schedule 13G is the reporting person — the outside investor or entity that beneficially owns more than 5% of an equity class registered under Section 12 of the Exchange Act. The filer is not the issuer; the issuer appears only as the subject company whose shares are owned.
A reporting person may be a natural person, corporation, LLC, partnership, trust, investment adviser, bank, broker-dealer, insurance company, registered investment company, employee benefit plan, or any other entity. Joint filings are common: a parent holding company and its advisory subsidiaries, or multiple members of a group, often file a single Schedule 13G listing each reporting person.
Qualified Institutional Investors (Rule 13d-1(b)). The largest filer category. Includes banks, registered broker-dealers, insurance companies, registered investment companies, registered investment advisers, ERISA plans, savings associations, church plans, comparable foreign institutions, and parent/control persons of any of the foregoing. They must have acquired and hold the securities in the ordinary course of business, without a purpose to influence or change control of the issuer. Most filings in this dataset come from large asset managers (BlackRock, Vanguard, State Street, JPMorgan, etc.) filing under this category across hundreds of issuers.
Passive Investors (Rule 13d-1(c)). Any person — institutional or not — who beneficially owns more than 5% but not more than 20% of the class and certifies no purpose or effect of changing or influencing control. Exceeding 20% forces a switch to Schedule 13D.
Exempt Investors (Rule 13d-1(d)). A narrow category for persons whose beneficial ownership arose without an "acquisition" triggering Section 13(d) — mainly pre-registration holders and similar situations.
The core trigger is crossing the 5% beneficial ownership threshold in a covered equity class (common stock, preferred stock, or other equity registered under Section 12).
| Filer category | Deadline |
|---|---|
| Qualified institutional investor | 45 days after end of calendar quarter in which 5% is first exceeded |
| Passive investor | 5 business days after crossing 5% |
| Exempt investor | 45 days after end of calendar year in which 5% is first exceeded |
Before the February 5, 2024 amendments, qualified institutional investors filed initial 13Gs within 45 days after calendar year-end (not quarter-end) and amended annually unless crossing 10%. Passive investors had a 10-day initial deadline. The dataset spans from 1994, so a large share of filings follow the prior timing regime.
Schedule 13D vs. 13G. Any beneficial owner above 5% who does not qualify for Schedule 13G — or who acquires with a control purpose — must file Schedule 13D, which requires more detailed disclosure and has a tighter initial deadline (5 business days). Activist investors and potential acquirers use 13D. This dataset does not contain Schedule 13D filings.
Form conversions. A 13G filer who loses eligibility (e.g., forms control intent, or a passive investor exceeds 20%) must convert to 13D within the applicable deadline. Conversely, a 13D filer who becomes eligible may convert to 13G. Conversions appear in the record as amendments.
Group filings. When two or more persons agree to act together regarding securities and their aggregate holdings exceed 5%, the group has a filing obligation. A group may use Schedule 13G only if every member independently qualifies.
The filer is the investor, not the issuer. In EDGAR metadata, the CIK associated with the submission belongs to the reporting person (the investor), not the subject company. This is the reverse of most SEC filing datasets.
Foreign filers. Non-U.S. persons and entities are subject to the same obligations if they own more than 5% of a Section 12 class. Foreign institutions may qualify as qualified institutional investors if comparable to an enumerated domestic type. There is no separate foreign-filer form.
Fund complexes. Large asset managers frequently file jointly for parent companies, advisory subsidiaries, and individual funds, reflecting shares held across many client accounts and portfolios.
Schedule 13D is the closest relative. Both forms trigger at 5% beneficial ownership of a registered equity class under Exchange Act Section 13(d), identify the same issuer-and-class fields, and report ownership amounts and percentages.
The dividing line is investor intent. Schedule 13D applies when the holder acquires or holds shares with a purpose of influencing or changing control. Schedule 13G is the abbreviated alternative reserved for passive investors — institutional investors, registered investment companies, broker-dealers, banks, insurance companies, and certain exempt holders who certify no activist intent. Because of this, 13D filings are substantially longer: they require narrative disclosure of funding sources, transaction purposes, plans for corporate changes, and agreements concerning the issuer's securities. Schedule 13G filings omit most of this and are largely tabular.
The populations overlap dynamically. A holder who shifts from passive to activist must reclassify from 13G to 13D, and vice versa. Filing cadence also differs: 13G amendments generally cluster around quarterly or annual deadlines, while 13D amendments are event-driven, due promptly after material changes. A complete view of large beneficial ownership for any issuer requires both filing streams.
Form 13F requires institutional managers with over $100 million in Section 13(f) securities to report all qualifying holdings quarterly. A large fund complex often appears in both datasets for the same issuer, creating apparent overlap.
The key differences are trigger, scope, and granularity. 13F is portfolio-wide: it lists every qualifying position regardless of ownership percentage, often thousands per filing. Schedule 13G covers exactly one issuer-security relationship per filing, triggered only when the position crosses 5% of the class. 13F is filed strictly quarterly by institutional managers meeting the AUM threshold. 13G is filed by a broader set of qualifying beneficial owners (not only managers but also banks, broker-dealers, and exempt persons) and is threshold-triggered rather than calendar-driven. 13F data is highly tabular with CUSIP-level structure; 13G filings are semi-structured documents with less standardized formatting.
Forms 3/4/5 report ownership and transactions by officers, directors, and 10%-or-greater beneficial owners under Exchange Act Section 16. The 10% owner category creates direct overlap with Schedule 13G filers who cross that threshold.
The regimes serve different purposes. Section 16 is a transaction-disclosure and short-swing-profit regime: Form 4 reports individual trades within two business days, with grant-level detail on options and derivatives. Schedule 13G reports aggregate ownership positions, not transactions. Below 10%, the populations diverge entirely — Forms 3/4/5 cover officers and directors regardless of ownership size, while 13G covers any qualifying passive investor above 5% with no insider relationship required. Form 4 filings are XBRL-tagged and transaction-level; 13G filings are semi-structured and position-level.
This dataset provides complete EDGAR filing packages — primary documents, exhibits, cover-page certifications, joint-filing agreements, and metadata. Structured or extracted-field datasets derived from Schedule 13G parse these into tabular fields (reporting person, issuer CIK, shares owned, percentage of class, amendment status). The full-filing dataset preserves content that extractions typically discard — footnotes qualifying ownership calculations, co-filer identity in group filings, powers of attorney, and the exact filing text. It requires downstream parsing but supports text-level and exhibit-level analysis that tabular extracts cannot.
The Form SC 13G Files Dataset isolates passive large-block equity ownership: positions crossing 5% of an issuer's equity class, held by investors certifying no intent to influence control. Schedule 13D covers the same threshold for activist holders with far richer narrative disclosure. Form 13F captures full institutional portfolios without a concentration filter. Forms 3/4/5 track insider transactions at a granular level for a different statutory population. This dataset is distinct in combining the 5% concentration threshold, passive-investor scope, abbreviated disclosure format, and full-filing document preservation.
Schedule 13G filings document passive beneficial ownership above the 5% threshold. The structured fields — reporting person identity, aggregate shares, voting and dispositive power, percentage of class, and passive-intent certification — support a range of professional workflows.
Sales teams at broker-dealers use holder identity, CUSIP, share counts, and percentage of class to build ownership maps across their coverage universe. Amendments (SC 13G/A) reveal when an institutional holder has increased, decreased, or exited a position, feeding client targeting, roadshow planning, and block-trade origination.
Proxy solicitors and issuer defense teams watch for holders who drop off the 13G dataset or fail to amend — a signal of possible transition to 13D (control intent). The sole-versus-shared voting power and dispositive power fields show how much influence a single entity actually wields, which drives proxy-contest preparation and shareholder engagement strategy.
Sector analysts use percentage of class, reporting person identity, and amendment history to assess shareholder concentration risk, predict voting dynamics before annual meetings, and gauge institutional conviction. A large passive holder reducing its stake below 5% signals potential liquidity risk and supply-demand imbalance.
Compliance teams cross-reference their own firm's filings against aggregate shares, percentage of class, and event dates to verify accuracy and timeliness. They review peer treatment of the Rule 13d-1(b)/13d-1(c)/13d-1(d) designation and passive-intent certification language to ensure consistent conventions. Amendment history confirms updates were filed within regulatory deadlines.
Attorneys advising reporting persons examine the rule-designation field to confirm eligibility classification. Item 6 (ownership on behalf of another person) and Item 7 (subsidiary-identification exhibit) matter most for holding-company filers aggregating ownership across subsidiaries. Amendment sequences show how peers handle corrections and annual updates.
Quant teams extract percentage of class, aggregate shares, and voting/dispositive power fields from structured XML filings to build institutional-ownership concentration factors, passive-holder turnover signals, and ownership-change event indicators. The dataset's coverage from 1994 to present provides longitudinal depth for backtesting; amendment sequences yield holder-issuer ownership time series.
Stewardship teams and proxy advisory analysts segment the shareholder base using the type-of-reporting-person field, passive-intent certification, and voting-power breakdown. Key questions include how ownership concentration among passive holders has evolved across sectors and how frequently holders transition between 13G and 13D status.
Engineering teams at data vendors and fintech platforms parse the raw filings (XML, HTML, TXT, PDF) into structured ownership databases. Metadata JSON files supply accession numbers, CIK codes, tickers, SIC codes, and entity relationships needed to link 13G records into broader ownership graphs. Amendment chains keyed by accession number support current-state tables with full history.
IR professionals track which investors have crossed the 5% threshold and how positions shift over time. Aggregate shares, percentage of class, and the sole-versus-shared voting power distinction inform earnings-call preparation, shareholder outreach, and proxy-season planning.
Deal teams review 13G filings to identify significant block holders in a target or acquirer. Ownership percentages, holder identities, and the passive-intent certification help assess likely shareholder reception to a proposed transaction. Amendment history reveals recent accumulation or reduction trends relevant to deal timing.
The Form SC 13G Files Dataset supports workflows that depend on knowing which institutional and passive investors hold 5%-or-greater stakes in public companies, how those positions change, and who controls the voting and dispositive power behind them.
Equity research and capital-markets teams extract the reporting person identity, CUSIP, aggregate shares, and percentage of class from each filing to build a per-issuer ownership map of all disclosed 5%-plus passive holders. Combining initial filings with amendment sequences produces a time series of ownership concentration. This feeds shareholder-base segmentation, liquidity-risk assessment, and proxy-season vote forecasting.
Corporate defense advisors and proxy solicitors monitor for holders who stop filing 13G amendments or file exit amendments (Item 5 classOwnership5PercentOrLess = Y) without a corresponding reduction visible in 13F data. A holder disappearing from the 13G stream while maintaining a large position may be transitioning to Schedule 13D, signaling a shift from passive to activist intent. The rule-designation field (Rule 13d-1(b), 13d-1(c), or 13d-1(d)) and the passive-intent certification in Item 10 provide the baseline against which reclassification is measured.
Holding-company filers aggregate beneficial ownership across controlled subsidiaries, with Item 7 and its associated EX-99 exhibits listing each subsidiary through which the parent exercises voting or dispositive power. Parsing these exhibits alongside the sole/shared voting power and sole/shared dispositive power fields in the cover page produces an entity-level ownership graph showing which subsidiaries independently hold 5% or more of a given equity class. Financial data engineers use this to populate entity-relationship databases linking parent managers to their sub-advisers and fund vehicles.
Compliance teams at asset managers cross-reference their own firm's 13G filings against the dataset to confirm that amendments were filed within regulatory deadlines (annual or quarterly, depending on filer category). They compare the rule-designation field, aggregate share counts, percentage of class, and certification language against peer filings for the same issuer to ensure consistent conventions. The amendment number and event-date fields establish the audit trail.
Quantitative researchers extract percentage of class, aggregate shares, and the four-field voting/dispositive power breakdown from the structured XML filings to build ownership-change event indicators. Amendment sequences keyed by reporting person and issuer CUSIP yield holder-issuer time series spanning 1994 to present, supporting backtests of signals such as passive-holder exit momentum, concentration shifts, and institutional turnover. The type-of-reporting-person codes (IA, BK, IC, BD, etc.) allow segmentation by filer category.
Deal teams query the dataset by issuer CIK or CUSIP to surface all reporting persons holding 5% or more of a target or acquirer. The passive-intent certification and type-of-reporting-person codes distinguish index-fund holders from potentially influential block owners. Recent amendment history reveals accumulation or reduction trends, and the sole-versus-shared voting power split indicates how much influence each holder independently wields over a shareholder vote on the proposed transaction.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-sc-13g-files.json
This endpoint returns metadata about the Form SC 13G Files Dataset, including the dataset name, description, last updated timestamp, earliest sample date, total records and total size, form types covered (SC 13G, SC 13G/A, SCHEDULE 13G, SCHEDULE 13G/A), the container format (ZIP), and content file types (TXT, JSON, HTML, PDF, XML). It also returns the download URL for the entire dataset and a list of all individual container files with per-container metadata such as size, record count, updated timestamp, and download URL. This endpoint does not require an API key.
Use this API to monitor which containers have been updated in the most recent refresh run, allowing you to selectively download only the containers that changed on a given day.
1
{
2
"datasetId": "1f1333bd-dbdd-6a51-9d2b-a6f0a44b21c2",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-sc-13g-files.zip",
4
"name": "Form SC 13G Files Dataset",
5
"updatedAt": "2026-04-17T02:59:44.197Z",
6
"earliestSampleDate": "1994-01-01",
7
"totalRecords": 816360,
8
"totalSize": 4128198204,
9
"formTypes": ["SC 13G", "SC 13G/A", "SCHEDULE 13G", "SCHEDULE 13G/A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "HTML", "PDF", "XML"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-sc-13g-files/2025/2025-06.zip",
15
"key": "2025/2025-06.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-04-17T02:59:44.197Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-sc-13g-files.zip?token=YOUR_API_KEY
Downloads the complete dataset as a single ZIP archive containing all containers. This endpoint requires an API key passed as the token query parameter.
Download Single Container: https://api.sec-api.io/datasets/form-sc-13g-files/2025/2025-06.zip?token=YOUR_API_KEY
Downloads one individual monthly container file instead of the full dataset. Replace the year and month path segments to target a specific period. This endpoint requires an API key passed as the token query parameter.
The dataset covers four EDGAR form-type variants: SC 13G, SC 13G/A, SCHEDULE 13G, and SCHEDULE 13G/A. These represent initial Schedule 13G filings and their amendments.
One record is a single Schedule 13G or Schedule 13G/A filing, packaged as a self-contained folder containing the structured XML filing (for post-2024 submissions), an XHTML rendering, a JSON metadata sidecar, and any exhibits such as joint filing agreements or powers of attorney.
Schedule 13G must be filed by any person or entity that beneficially owns more than 5% of a class of equity securities registered under Exchange Act Section 12 and qualifies as a qualified institutional investor (Rule 13d-1(b)), a passive investor (Rule 13d-1(c)), or an exempt investor (Rule 13d-1(d)). Holders who acquire with a control purpose must file Schedule 13D instead.
New filings are added as they are submitted to EDGAR. Initial filings are triggered by crossing the 5% ownership threshold. Amendments follow varying schedules depending on filer category — quarterly for qualified institutional investors, within 2 business days of material changes for passive investors, and annually for exempt investors under the post-February 2024 rules.
The dataset includes Schedule 13G filings submitted to the SEC via EDGAR from January 1994 to the present. Filing format varies by era: plain text (1994 through mid-2000s), HTML (late 1990s through 2024), and structured XML (late 2024 onward).
Schedule 13D covers beneficial owners above 5% who acquire or hold shares with a purpose of influencing or changing control of the issuer — activist investors and potential acquirers. Schedule 13G is the abbreviated alternative for passive holders. 13D filings include detailed narrative disclosure of funding sources, transaction purposes, and plans for corporate changes, while 13G filings are largely tabular. This dataset contains only Schedule 13G filings.
The dataset is distributed as ZIP containers organized by month. Each container holds individual filing folders. Files within each folder may include TXT, JSON, HTML, PDF, and XML formats depending on the filing era.