The Form SD Files Dataset is a complete EDGAR archive of every Form SD and Form SD/A specialized disclosure report filed under the Securities Exchange Act of 1934 since the form went into effect. Each record represents one EDGAR accession — a single Form SD or Form SD/A submission filed by one issuer on one date — bundled with a metadata.json manifest and the textual filing documents from the original submission. Form SD itself is the cover form for two distinct Dodd-Frank disclosure regimes administered by the SEC: Rule 13p-1 conflict-minerals disclosures (Section 1502 of Dodd-Frank) and Rule 13q-1 resource-extraction payment disclosures (Section 1504). The dataset's earliest sample date is April 1, 2014, corresponding to the first Form SD filings made for calendar-year 2013 conflict-minerals reporting, and it covers every issuer subject to either rule from that point forward. Containers are distributed as monthly ZIP archives holding HTML, JSON, TXT, and PDF payloads.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset captures every Form SD and Form SD/A filing submitted to EDGAR from April 2014 to the present. Form SD is a specialized disclosure report filed pursuant to Rule 13p-1 and Rule 13q-1 under the Securities Exchange Act of 1934, both promulgated under the Dodd-Frank Wall Street Reform and Consumer Protection Act. Rule 13p-1, mandated by Section 1502 of Dodd-Frank, requires issuers whose products contain tin, tantalum, tungsten, or gold ("3TG" conflict minerals) necessary to functionality or production to disclose whether those minerals originated in the Democratic Republic of the Congo or an adjoining country, and to attach a Conflict Minerals Report when triggered. Rule 13q-1, mandated by Section 1504 of Dodd-Frank, requires resource extraction issuers engaged in the commercial development of oil, natural gas, or minerals to disclose payments made to the U.S. federal government and to foreign governments. Form SD/A filings are amendment counterparts used to correct, restate, or supplement a previously submitted Form SD.
For each accession number the dataset includes a metadata file and all documents in the original EDGAR submission except image files. Each filing typically contains the issuer's conflict-minerals disclosure, an attached Conflict Minerals Report describing due diligence measures and the source and chain of custody of applicable minerals, or — for resource extraction issuers — payment information organized by project, type, and government recipient. Records are organized as accession-numbered folders inside year/month ZIP containers and are distributed in HTML, JSON, TXT, and PDF formats; in practice modern Form SD filings consist almost exclusively of HTM/HTML documents plus the JSON manifest, with PDF and TXT payloads appearing rarely in older or unusually formatted submissions.
One record in the Form SD Files dataset is a single EDGAR accession — that is, one Form SD or Form SD/A submission filed by a single issuer on a specific date — packaged as an accession-numbered folder inside a year/month ZIP container. The folder is named with the 18-digit, dash-stripped accession number (for example 000143774925038455) and bundles the textual filing documents that comprise the submission together with a metadata.json describing the accession and the filer. The unit is therefore the filing as a whole, not a single exhibit, a single payment row, or a single conflict-mineral disclosure paragraph; an issuer that files an annual Form SD and later submits an amending SD/A contributes two independent records.
Form SD is the cover form that introduces these disclosures; the substantive content lives partly on the cover form and partly in attached exhibits. The form is structured around three operative items:
Item 3 lists the exhibits accompanying the form. The form closes with a signature block executed under Section 13 of the Exchange Act by an authorized officer.
A single accession folder contains three concentric layers of content: an outer manifest layer (metadata.json), an EDGAR SGML document-wrapper layer surrounding each filed document, and the inner HTML and inline-XBRL bodies that carry the actual disclosure text and structured data.
metadata.json manifestEvery accession folder carries one metadata.json, a single JSON object that catalogs the submission and the filer. Its top-level fields are:
formType — "SD" or "SD/A".accessionNo — the dashed EDGAR accession (for example "0001437749-25-038455"); the folder name is the same identifier with the dashes removed.description — human-readable form name, typically "Form SD - Specialized disclosure report".filedAt — ISO-8601 timestamp with timezone offset capturing the EDGAR acceptance moment.linkToFilingDetails, linkToTxt, linkToHtml, linkToXbrl — EDGAR URLs to the primary document, the consolidated SGML submission text file, the filing index page, and (when applicable) the inline-XBRL viewer. linkToXbrl is generally empty for Form SD because the inline-XBRL payload, when present, is embedded inside Exhibit 2.01 rather than published as a standalone XBRL document.id — a 32-character hex hash that uniquely identifies the record internally.seriesAndClassesContractsInformation — array, generally empty for Form SD because the form does not pertain to investment-company series and classes.documentFormatFiles — an ordered array, one element per document in the original EDGAR submission, including the form body, every exhibit, every graphic, and the trailing complete-submission text file. Each element carries sequence (the SGML sequence number, or " " for the wrapping .txt), size (byte size as a string), documentUrl (EDGAR URL — for inline-XBRL exhibits the URL points at the ix?doc= viewer), description (for example "FORM SD", "EX-2.01", "EXHIBIT 99.1", "GRAPHIC", "Complete submission text file"), and type (the <TYPE> token from the SGML header — "SD", "EX-2.01", "EX-99", "GRAPHIC", or " ").dataFiles — a parallel array for XBRL artifacts. For Form SD this is populated only when an Exhibit 2.01 payments exhibit is filed, and typically contains the inline-XBRL instance (type: "EX-2.01.INS"), the taxonomy schema (type: "EX-2.01.SCH"), and the extracted XBRL instance (type: "XML").entities — an array of filer entity objects, almost always of length one. Each entity carries companyName (usually suffixed with (Filer)), cik, irsNo, fileNo, filmNo, act (typically "34"), type (the form type as recorded in the EDGAR header), stateOfIncorporation (two-letter state or country code), fiscalYearEnd as MMDD, sic (the four-digit SIC code with its industry description), and tickers (an array of trading symbols associated with the issuer).Each document in the folder is preserved with its EDGAR SGML envelope intact — the same wrapper that appears inside the consolidated complete-submission .txt on EDGAR. The envelope opens with <DOCUMENT> and contains <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, and <TEXT> lines before the inner HTML body. The <TYPE> token is the authoritative role indicator:
SD — the Form SD body itself.EX-1.02 — the Conflict Minerals Report.EX-2.01 — the resource-extraction payments exhibit (inline XBRL).EX-99 (often EX-99.1) — supplemental conflict-minerals attachments such as supplier lists, smelter rosters, or supporting graphics.GRAPHIC — image documents catalogued in the manifest but not redistributed.The <SEQUENCE> value matches the sequence field in metadata.json, so the manifest and the on-disk wrappers can be cross-referenced one-to-one.
Inside the SGML wrapper, each document is a self-contained HTML file. The Form SD body is a narrative HTML document with a cover page identifying the registrant, the reporting period, and the address of the principal executive offices, followed by the checked items (1.01, 1.02, 2.01 as applicable), the disclosure text, an "Item 3 - Exhibits" listing, and a signature block. Conflict-minerals exhibits are typically narrative HTML documents that read as legal-style reports. The Exhibit 2.01 resource-extraction payments exhibit is an inline-XBRL document — an XHTML root that opens with an XML declaration, declares the rxp (resource-extraction payments) namespace, and uses <ix:header>, <ix:nonNumeric>, and <ix:nonFraction> tags to mark up project names, government recipients, country, payment categories (taxes, royalties, fees, production entitlements, bonuses, dividends, infrastructure improvement payments), payment amounts, currencies, and resource types.
<TYPE>SD)The SD body is always present, always sequence 1, and always identifies the registrant, the reporting period, and the items being addressed. Its substantive content is conditional on which rule the filer is reporting under:
When required, Exhibit 1.02 is the Conflict Minerals Report itself: a narrative legal/operational report describing the design and execution of due diligence on the source and chain of custody of conflict minerals, conformed to a nationally or internationally recognized framework — in practice the OECD Due Diligence Guidance for Responsible Supply Chains of Minerals from Conflict-Affected and High-Risk Areas. Typical sections include:
For periods up to and including the 2013 reporting year, the report had to be accompanied by an Independent Private Sector Audit (IPSA) of the due diligence framework when the issuer described products as "DRC conflict free." In practice the audit obligation effectively fell away after the 2014 District of Columbia Circuit decisions on the rule, and IPSA reports are now uncommon.
Exhibit 2.01, where present, is an inline-XBRL document containing the structured payment disclosure required by Rule 13q-1. The payments are organized at the project level and disaggregated by category, government recipient (with country), and resource. Each tagged data point indicates:
The XBRL markup enables extraction of every payment row as structured data without parsing the rendered narrative tables.
EX-99 files (commonly EX-99.1) appear when issuers attach supplemental materials such as supplier or smelter lists, mineral-flow diagrams, due-diligence framework citations, or related graphics. These are HTML wrappers that may reference image files (logos, signature scans, flow charts) which themselves are not included in the dataset.
Every record's manifest also lists a trailing complete-submission .txt file (sequence " ", type " "), which on EDGAR is the concatenated SGML container holding all submission documents in one stream. The manifest records its size and URL but the file itself is not redistributed because its content is the union of the wrapped HTM documents that are already included.
Each record bundles metadata.json and the textual documents from the original EDGAR submission — at minimum the Form SD body, plus any HTM-format exhibits (EX-1.02, EX-2.01, EX-99.x). The SGML headers around each document are retained, so document role, sequence, and description can be read directly from the file. The HTML payloads preserve original formatting, tables, footnotes, and inline-XBRL markup (in the case of Exhibit 2.01).
GRAPHIC documents (such as .jpg logos, signature scans, and embedded chart images) referenced by the SGML wrapper are catalogued in documentFormatFiles but are not redistributed inside the ZIP. Their original EDGAR URLs remain available via documentUrl..xsd taxonomy schema and the extracted _htm.xml or .xml XBRL instance documents that EDGAR publishes alongside Exhibit 2.01 are catalogued under dataFiles but are not redistributed; the inline-XBRL payload remains accessible because it is embedded inside the EX-2.01 HTM file that is included..txt. Referenced in the manifest but not redistributed for the reason given above.The Form SD record structure has shifted materially in response to litigation and rulemaking since the form went into effect in 2014:
rxp).The Item structure of the form itself (1.01 / 1.02 / 2.01 / 3) has remained stable across these regulatory shifts; what has changed is which items carry substantive content and which exhibits accompany the body.
Form SD entered effect in 2014, well after EDGAR's transition away from ASCII-only filing, so every record in the dataset is filed in HTML wrapped in the EDGAR SGML envelope. There is no ASCII or paper era to account for. The principal format evolution within the dataset concerns the resource-extraction exhibit:
rxp) taxonomy. From this point forward, records that include EX-2.01 also carry dataFiles entries describing the inline-XBRL instance and schema, and the EX-2.01 HTM file itself contains the structured <ix:*> markup that allows machine extraction of every payment row.formType of "SD/A" indicates a Form SD amendment. SD/A records are full filings in their own right (with their own accession number, manifest, and document set) rather than diffs against the prior filing; reconstructing the amendment trail requires linking SD and SD/A records by issuer, period, and filedAt chronology.<TYPE> token (and the matching type field in documentFormatFiles) rather than filename heuristics, because issuers' filename conventions vary widely.GRAPHIC files that are not present on disk, narrative references inside HTML exhibits (for example image-based supplier diagrams or signature blocks) will resolve to broken image links if rendered locally. The textual content of the exhibit remains intact.entities array reflects the EDGAR header at the moment of filing, including SIC code, state of incorporation, fiscal year end, and tickers. Industry concentration in the dataset skews toward extractive SIC codes (10xx metal mining, 13xx oil and gas extraction, 1040 gold and silver ores, 1090 miscellaneous metal ores) for resource-extraction filers and toward manufacturing SIC codes (3xxx) for conflict-minerals filers.Form SD is an issuer-level filing. The filer is always an Exchange Act reporting issuer subject to Section 13(a) or Section 15(d) of the Securities Exchange Act of 1934. It is not filed by individuals, beneficial owners, institutional investment managers, broker-dealers, transfer agents, or registered investment companies in their capacity as such. Suppliers, smelters, refiners, mine operators, and government recipients named inside the disclosure are subjects of the report, not filers.
The form carries two structurally distinct disclosure regimes that share one cover form. A given issuer may be in scope for one, the other, or in rare cases both.
Rule 13p-1 applies to any reporting issuer (domestic registrant, foreign private issuer, or smaller reporting company) for which "conflict minerals" are necessary to the functionality or production of a product the issuer manufactures or contracts to be manufactured. "Conflict minerals" means the 3TG minerals: cassiterite, columbite-tantalite, wolframite, and gold, and their derivatives (tin, tantalum, tungsten, gold).
The obligation is product-driven, not industry-driven. There is no SIC or sector limitation. Consumer electronics, industrial machinery, automotive, aerospace, medical device, and branded jewelry issuers are commonly in scope. Pure distributors, resellers, or service providers that do not contract or influence manufacturing are generally outside the rule, even when the goods they handle contain 3TG.
Rule 13q-1 applies only to "resource extraction issuers": reporting issuers engaged in the commercial development of oil, natural gas, or minerals. "Commercial development" covers exploration, extraction, processing, export, and the acquisition of a license for any such activity. The population is concentrated among integrated oil and gas majors, independent E&P companies, and mining companies, including foreign private issuers in those industries. Refiners, marketers, transporters, retailers, and oilfield service contractors that do not themselves hold development rights are typically outside the rule.
Form SD is annual. There is no event-driven trigger; the obligation arises from the issuer's status combined with the passage of a reporting period.
The first Rule 13p-1 cycle covered calendar year 2013, with filings appearing on EDGAR in spring 2014. This explains the dataset's earliest sample date of April 1, 2014. Form SD has been an electronic EDGAR filing since inception; there is no pre-EDGAR analog.
If the issuer's reasonable country of origin inquiry indicates that 3TG originated in the Democratic Republic of the Congo or an adjoining "Covered Country," or origin is undeterminable, the issuer must attach a separate Conflict Minerals Report (CMR) as an exhibit to Form SD, describing due diligence on source and chain of custody under a recognized framework (in practice, the OECD Due Diligence Guidance). The CMR is part of the same Form SD submission, not a separate filing. Following 2017 SEC staff guidance, many issuers now file only the cover disclosure without the full step-two CMR.
Form SD/A is filed to correct, supplement, or restate a previously submitted Form SD. Typical reasons include factual corrections to country-of-origin or due diligence descriptions, addition of an omitted CMR exhibit, restatement of project-level payment data, or response to staff comments. SD/A has no independent triggering event; it relates back to the original filing's reporting period.
Form SD is a narrow Dodd-Frank specialty disclosure covering Rule 13p-1 (conflict minerals) and Rule 13q-1 (resource extraction payments). It is annual but not a periodic report, rule-specific rather than principles-based, and produces structured payment data that exists nowhere else in EDGAR. The most useful comparisons are the annual reports it sits beside, the current reports it is sometimes confused with, the XBRL payment exhibit it carries, and the non-SEC transparency regimes that cover overlapping payment data.
The 10-K (domestic) and 20-F (foreign private issuer) are comprehensive annual reports covering business, risk factors, MD&A, and audited financials. Form SD is filed on a separate annual cycle (due May 31 for the prior calendar year) and is not an exhibit to either form. Critically, conflict-minerals and resource-extraction-payment disclosures are carved out of the annual report entirely — even FPIs subject to Rule 13p-1 or 13q-1 must file Form SD alongside their 20-F. A 10-K or 20-F will not contain country-of-origin, smelter-level, or project-level payment data at the granularity Form SD requires. Amendments use SD/A, independent of 10-K/A or 20-F/A.
8-K (domestic) and 6-K (FPI) are event-driven disclosures triggered by enumerated material events on short deadlines. Form SD is the opposite: a fixed-cadence annual compliance report tied to specific Dodd-Frank rules, not to any corporate event. There is essentially no content overlap; conflict-minerals and government-payment disclosures will not appear in 8-K or 6-K datasets, and an issuer's SD obligation is unaffected by any 8-K it does or does not file.
Within an SD filing, the cover form contains issuer identification, the relevant rule citation, and a high-level disclosure statement. The Conflict Minerals Report (CMR), required as an attached exhibit when due diligence is triggered under Rule 13p-1, contains the substantive material: reasonable country-of-origin inquiry, OECD-aligned due diligence framework, facility and smelter lists, products containing conflict minerals, and independent private sector audit results. Users analyzing supply-chain due diligence need the CMR exhibit; the cover form alone is metadata. The dataset packages both at the accession level so the right granularity is preserved.
Rule 13q-1 filers attach payment data as inline XBRL in an EX-2.01 exhibit to Form SD, tagged by project, payment type (taxes, royalties, fees, production entitlements, bonuses, dividends, infrastructure improvements), government recipient, country, and currency. This is structurally similar to the SEC's Financial Statement Data Sets (FSDS) — both are tag-driven structured extracts — but the content domains do not overlap. FSDS contains GAAP line items from 10-K, 10-Q, and 20-F filings; the EX-2.01 extract contains payment-level disclosures that never appear at this granularity in the financial statements. Conflict-minerals-only SD filings do not produce this XBRL extract.
For resource extraction issuers, Canada's ESTMA, the EU Accounting and Transparency Directives, and the UK's Reports on Payments to Governments Regulations are the closest analogues to Rule 13q-1, requiring issuer-level disclosure at comparable project-and-recipient granularity. Some SEC filers cross-reference an ESTMA or Home Country Report to satisfy Rule 13q-1's alternative reporting provision, so the underlying payment events overlap substantially for dual-listed issuers. EITI is a country-level voluntary reconciliation framework rather than an issuer disclosure and is a weaker analogue. The Form SD dataset captures only the SEC-filed instance; parallel filings live in separate national registries. Conflict-minerals disclosures under Rule 13p-1 have no direct foreign mandatory analogue at the same granularity, although the EU Conflict Minerals Regulation and OECD Due Diligence Guidance shape similar narratives.
Form SD is the only SEC filing dedicated to conflict-minerals due diligence and resource-extraction payments. It is annual but separate from the 10-K and 20-F, structurally unrelated to event-driven 8-K and 6-K reports, and produces structured XBRL payment data (under 13q-1) that exists nowhere else in EDGAR. Its closest substitute for extractive payments is the ESTMA/EU/UK regimes outside SEC EDGAR; for conflict-minerals due diligence there is no SEC substitute at all, making this dataset the authoritative source for both disclosures from April 2014 to present.
The dual mandate of Form SD — conflict minerals under Rule 13p-1 and resource extraction payments under Rule 13q-1 — gives the dataset an unusually broad professional user base, with each role drawing on different parts of the package: the SD cover body, the attached Conflict Minerals Report (CMR), the EX-2.01 payments exhibit (typically inline-XBRL), EX-99 narrative exhibits, and the per-accession metadata.json.
In-house program leads, trade compliance officers, and securities counsel use the dataset to draft and revise their own SD and CMR filings. They read peer SD bodies for "necessary to functionality or production" determinations, contract-manufacturer carve-outs, and product-scope language, and study CMRs for OECD-aligned due-diligence framework descriptions, smelter inventories, and risk-mitigation plans. SD/A amendment histories in metadata.json show what peers revised and when, informing internal redline cycles and board-level compliance memos.
Procurement, supplier-risk, and responsible-sourcing teams at downstream manufacturers mine CMR exhibits for named smelters and refiners, country-of-origin lists, and supplier survey methodologies disclosed by peers. They benchmark escalation, remediation, and off-boarding language to refine internal policy. Outputs include supplier scorecards, due-diligence playbooks, and Tier-N mapping projects.
Analysts at investment managers, ratings providers, and ESG data vendors construct firm-level conflict-minerals exposure scores and track year-over-year changes in due-diligence maturity. They parse CMR HTML for RCOI results, smelter identification, and OECD alignment language, and pull SD-body determinations and metadata.json filer attributes to align disclosures with sector and fiscal period. Output feeds ESG scoring models, thematic fund screens, and engagement priority lists.
Audit firms engaged for Independent Private-Sector Audits study the population of issuers commissioning IPSAs, the scope statements used, and the design-and-performance criteria referenced. They focus on attached audit reports inside the CMR exhibit and on qualifications or scope limitations, and they use metadata to identify recurring engagements across filing years. The dataset supports audit-methodology calibration and peer benchmarking of opinion language.
Sell-side and buy-side analysts covering oil, gas, and mining issuers parse the EX-2.01 inline-XBRL payments exhibit for project-level payments to governments, disaggregated by payment type (taxes, royalties, fees, production entitlements, bonuses, dividends, infrastructure improvements) and recipient. This drives effective government-take estimates, fiscal-regime sensitivity models, and project-NPV adjustments.
Researchers at extractive-industry watchdogs and corporate-accountability organizations use both halves of the dataset. On conflict minerals, they rank issuers on smelter naming, country-of-origin disclosure, and due-diligence credibility. On resource extraction, they reconcile EX-2.01 payment tables against host-country budget data to flag potential revenue leakage. Outputs include public scorecards, regulatory comment letters, and shareholder engagement campaigns.
Negative-screening analysts at asset owners and fund managers use the dataset to identify issuers with unresolved conflict-minerals exposure for exclusion-list maintenance. Proxy advisors assess management responses to related shareholder proposals, focusing on CMR remediation timelines, supplier-engagement results, and named high-risk sourcing locations to inform vote recommendations.
Analysts at host-government revenue authorities, international transparency initiatives, and multilateral institutions use EX-2.01 to triangulate corporate-side disclosures with government-side receipts. They focus on structured payment fields (project, government recipient, currency, category, totals) for reconciliation reports and country-level transparency assessments.
Specialists tracking the tin, tantalum, tungsten, and gold midstream extract every named smelter or refiner and reported operating country across CMR exhibits to build cross-issuer sourcing maps. Output supports industry validation programs, midstream concentration analysis, and high-risk facility flagging.
Researchers in law, accounting, economics, and supply-chain management treat the corpus from April 2014 onward as a longitudinal panel. CMR text supports studies of boilerplate convergence, the impact of the 2014 D.C. Circuit ruling, and changes in due-diligence specificity. EX-2.01 supports empirical work on host-government revenue and fiscal regimes. Metadata enables panel construction by CIK, period, and accession; SD/A amendments enable revision-behavior studies.
Engineering teams building ESG and compliance retrieval systems ingest the full package (HTML, TXT, PDF, JSON) to train and evaluate extractors for smelter names, country-of-origin lists, payment tables, and due-diligence narratives. Heterogeneous exhibit formats stress-test parsers handling inline-XBRL, HTML, and PDF in one pipeline; metadata.json provides clean filer-level grounding for training data and evaluation sets.
The dataset supports concrete workflows across both Form SD disclosure regimes, drawing on the SD body, the Conflict Minerals Report (Exhibit 1.02), the inline-XBRL payments exhibit (Exhibit 2.01), and the per-accession metadata.json.
Parse Exhibit 1.02 narratives to extract every named smelter or refiner together with reported country of operation and Responsible Minerals Assurance Process (RMAP) conformance status. Aggregating across CIKs and reporting years yields a midstream facility roster, supports concentration analysis, and flags facilities that newly appear or drop off issuer lists year over year.
Pull <ix:nonFraction> and <ix:nonNumeric> tags from EX-2.01 documents (rxp taxonomy) to materialize a structured table of payments by project, recipient government, country, payment category (taxes, royalties, fees, production entitlements, bonuses, dividends, infrastructure improvements), and currency. The resulting table feeds effective government-take models, fiscal-regime sensitivity analyses, and reconciliation against ESTMA and EU/UK Reports on Payments to Governments for dual-listed issuers.
Filter metadata.json by SIC code and fiscal year end to assemble a peer set, then compare Item 1.01 country-of-origin determinations and Exhibit 1.02 sections on OECD five-step due diligence, contract-manufacturer scope, and supplier escalation. Output supports redline cycles for in-house counsel preparing the issuer's own Form SD and structuring board-level compliance memos.
Link SD and SD/A records by issuer CIK, reporting period, and filedAt timestamp to reconstruct each amendment trail. Diffing the original Form SD body and Conflict Minerals Report against the SD/A surfaces what issuers correct (smelter list revisions, RCOI restatements, scope clarifications), supporting restatement-behavior studies and disclosure-quality scoring.
Combine entities metadata (CIK, SIC, state of incorporation, tickers, fiscal year end) with parsed Item 1.01 determinations and CMR maturity indicators (named smelters, audit references, remediation plans) to build a longitudinal firm-level panel from 2014 forward. The panel feeds ESG scoring models, exclusion-list maintenance for negative screens, and academic studies of boilerplate convergence following the 2014 D.C. Circuit decision.
For transparency-focused analysts, sum EX-2.01 payment facts by recipient government and resource type, then compare against host-government budget receipts and EITI country reports. Discrepancies between corporate-side disclosures and government-side receipts drive country transparency assessments, regulatory comment letters, and shareholder engagement on revenue governance.
Use the bundled HTML, inline-XBRL, and JSON manifest as a multi-format training corpus for compliance and ESG retrieval systems. The mix of narrative CMR prose, structured rxp-tagged payment tables, and EX-99 supplementary attachments stress-tests parsers for smelter-name extraction, country-of-origin list parsing, and payment-row reconciliation, with metadata.json providing CIK- and period-level ground truth for evaluation sets.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-sd-files.json
This endpoint returns dataset metadata including the name, description, last updated timestamp, earliest sample date (2014-04-01), total record count, total size, covered form types (SD, SD/A), container format (ZIP), and the file types included in each container (HTML, JSON, TXT, PDF). It also returns the full dataset download URL and the list of all individual container files, each with its size, record count, last updated timestamp, and download URL. The endpoint does not require an API key.
Poll this endpoint daily to detect which containers were modified in the most recent refresh run, then decide which containers to re-download based on their updatedAt timestamps.
1
{
2
"datasetId": "1f13365b-9ae0-6923-8ed4-6e2115d6cf27",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-sd-files.zip",
4
"name": "Form SD Files Dataset",
5
"updatedAt": "2026-05-07T02:51:40.258Z",
6
"earliestSampleDate": "2014-04-01",
7
"totalRecords": 24611,
8
"totalSize": 192016410,
9
"formTypes": ["SD", "SD/A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["HTML", "JSON", "TXT", "PDF"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-sd-files/2026/2026-03.zip",
15
"key": "2026/2026-03.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-05-07T02:51:40.258Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-sd-files.zip?token=YOUR_API_KEY
Downloads the complete Form SD Files dataset as a single ZIP archive containing every monthly container from April 2014 to present. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-sd-files/2026/2026-03.zip?token=YOUR_API_KEY
Downloads one monthly container ZIP file instead of the full archive, which is useful for incremental updates or when only a specific time period is needed. This endpoint requires an API key.
The dataset covers Form SD and Form SD/A filings submitted to EDGAR. Form SD is the specialized disclosure report filed under Rule 13p-1 (conflict minerals) and Rule 13q-1 (resource extraction payments) of the Securities Exchange Act of 1934, and Form SD/A is the amendment counterpart used to correct, restate, or supplement a previously submitted Form SD.
One record is a single EDGAR accession — one Form SD or Form SD/A submission filed by a single issuer on a specific date — packaged as an accession-numbered folder containing a metadata.json manifest and the textual filing documents from the original submission. An issuer that files an annual Form SD and later submits an amending SD/A contributes two independent records.
Two issuer populations file. Under Rule 13p-1, any Exchange Act reporting issuer for which conflict minerals (tin, tantalum, tungsten, gold) are necessary to the functionality or production of a product they manufacture or contract to manufacture must file. Under Rule 13q-1, "resource extraction issuers" engaged in the commercial development of oil, natural gas, or minerals must file. Foreign private issuers are not exempt from either rule.
Rule 13p-1 conflict-minerals filings are due by May 31 of the year following the calendar-year reporting period, regardless of the issuer's fiscal year-end. Rule 13q-1 resource-extraction filings are due no later than 270 days after the issuer's fiscal year-end. There is no event-driven trigger; the obligation arises annually from the issuer's status.
The dataset's earliest sample date is April 1, 2014, corresponding to the first Form SD filings made for calendar-year 2013 conflict-minerals disclosures, and it covers every Form SD and Form SD/A filing through the present. Form SD has been an electronic EDGAR filing since inception, so there is no pre-EDGAR analog.
Records are distributed inside monthly ZIP containers and contain HTML, JSON, TXT, and PDF payloads — at minimum the Form SD body, plus any HTM-format exhibits (EX-1.02, EX-2.01, EX-99.x) and a metadata.json. Image files (GRAPHIC documents such as logos, signature scans, and chart images), the standalone .xsd taxonomy schema, the extracted _htm.xml XBRL instance, and the consolidated complete-submission .txt are catalogued in the manifest but not redistributed inside the ZIP.
Conflict-minerals and resource-extraction-payment disclosures are carved out of the periodic annual report entirely. A 10-K or 20-F will not contain country-of-origin, smelter-level, or project-level payment data at the granularity Form SD requires. Even foreign private issuers subject to Rule 13p-1 or 13q-1 must file Form SD alongside their 20-F, on a separate annual cycle, and amendments use SD/A independently of 10-K/A or 20-F/A.