The Form 10-12G Files Dataset is a full-text collection of EDGAR submissions of Form 10-12G and Form 10-12G/A — the general-purpose registration statement (and its amendments) used to register a class of equity securities under Section 12(g) of the Securities Exchange Act of 1934 when the issuer is not listing on a national securities exchange. Each record represents one EDGAR submission: a single registration statement or a single amendment, materialized as a dedicated filing folder containing a metadata.json header plus one or more SGML-wrapped HTM sub-documents (the primary form and its exhibits). The filer is always the issuer of the class being registered, ranging from threshold-crossing domestic operating companies to Business Development Companies, spin-offs, bank and savings-and-loan holding companies, and voluntary registrants entering the reporting system without a concurrent offering. The dataset begins in May 1994 with the phased EDGAR mandate and extends to the present, covering both initial 10-12G registrations and the 10-12G/A amendment tail that accompanies the 60-day automatic effectiveness window.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset is built from two EDGAR submission types — 10-12G (initial class registration under Section 12(g)) and 10-12G/A (amendments to a prior 10-12G) — covering the complete population of such submissions filed to EDGAR from May 1994 through the present. Form 10-12G is the general-purpose Exchange Act registration statement used when an issuer must register a class of securities under Section 12(g) because it has crossed the Section 12(g) thresholds on total assets and holders of record, or when an issuer elects to enter Exchange Act reporting voluntarily without listing on a national exchange. Registration on Form 10-12G becomes effective automatically sixty days after filing unless withdrawn or amended, so amendment records (10-12G/A) are a first-class population in the dataset — each amendment has its own accession number, its own folder, and its own metadata file.
The dataset is distributed as monthly ZIP containers. Inside each container, every filing is stored as a folder named by the 18-digit dashless SEC accession number, holding a metadata.json header plus the filing's SGML-wrapped HTM sub-documents (the primary registration statement and attached exhibits). The content file types that may appear across the historical span are TXT, JSON, HTML, XFD, and PDF, reflecting the evolution of EDGAR-accepted submission formats from the mid-1990s forward. Form 10-12G is not an XBRL-tagged form; no XBRL or XML structured-financial-data files accompany these records.
One record in the Form 10-12G Files Dataset is a single EDGAR submission of Form 10-12G or Form 10-12G/A: one complete registration statement, or one amendment to a registration statement, filed under Securities Exchange Act of 1934 Section 12(g). A record is materialized as a dedicated filing folder inside a monthly ZIP container. The folder is named after the 18-digit, dashless SEC accession number of the submission (for example, 000199937125007032 corresponds to accession 0001999371-25-007032) and groups together every document that composes that one accession. Each folder contains exactly one metadata.json file plus one or more SGML-wrapped HTM sub-documents — the primary registration statement and, when attached, its exhibits. Amendments (10-12G/A) are stored as independent records: each amendment receives its own accession number, its own folder, and its own metadata file, rather than being merged into the original filing it amends.
Form 10-12G is the general-purpose registration statement used to register a class of securities under Section 12(g) of the Exchange Act, typically because the issuer has crossed the Section 12(g) thresholds on total assets and holders of record and must become a reporting company even without an exchange listing. Registration on Form 10-12G becomes effective automatically sixty days after filing unless withdrawn or amended. The content requirements are defined by the Form 10 instructions and by Regulation S-K, which prescribe a numbered sequence of Items covering business description, risk factors, financial information, properties, security ownership, directors and officers, executive compensation, related-party transactions, legal proceedings, market for the registrant's securities, recent sales of unregistered securities, description of the securities being registered, indemnification provisions, financial statements and supplementary data, changes and disagreements with accountants, and a list of exhibits. Form 10-12G/A follows the same content architecture but re-states, supplements, or corrects portions of a previously filed 10-12G; in practice amendments range from single-line corrections to wholesale replacements of the prospectus-like body.
At the container level, a record is a directory whose contents fall into two layers: a machine-readable header (metadata.json) and the filer's submitted narrative and exhibit documents (all .htm). The typical physical layout is:
metadata.json — the EDGAR filing header plus a manifest enumerating every sub-document in the submission.TYPE=10-12G or TYPE=10-12G/A in its SGML wrapper.EX-... type in its SGML wrapper and corresponding one-to-one with an EX-... entry in the manifest.Inside each HTM file, the EDGAR SGML wrapper precedes the HTML payload. The wrapper opens with <DOCUMENT>, declares <TYPE> (the EDGAR document type, e.g. 10-12G, EX-3.1, EX-10.13), <SEQUENCE> (the document's 1-based ordinal within the submission), <FILENAME> (the original filer-supplied filename), and <DESCRIPTION> (a free-form label), then opens <TEXT> around the <HTML> body and closes with matching tags. These SGML header fields mirror the sequence, type, filename, and description entries inside metadata.json, so the two layers cross-reference cleanly. HTM filenames are filer-chosen and are not standardized (d925208d1012ga.htm, risecompaniescorp-form10a.htm, nuveenchurchillbdcv-form10.htm, ex3-1.htm, exhibit101-form10a.htm, cnlrcred-1012g_060225.htm); the stable way to identify a given document's role is through the EDGAR TYPE value in the SGML wrapper or the type field in the manifest.
The primary HTM — the largest file in most records, often exceeding a megabyte for fund and BDC filings — carries the full registration statement text. The narrative opens with a cover page that identifies the registrant's exact legal name, state of incorporation or organization, IRS Employer Identification Number, principal executive office address and telephone number, the title of each class of securities to be registered, the name of each exchange on which each class is to be registered (typically "None" for 12(g)-only registrations), and the title of any classes registered under Section 12(b). Following the cover, the body is organized as a numbered sequence of Items prescribed by the Form 10 instructions and the governing Regulation S-K Items, typically:
The narrative closes with a signature block executed by a duly authorized officer, dated as of the filing date. Because Form 10-12G resembles a Securities Act S-1 in content and depth, the prose is prospectus-like: dense, heavily footnoted, and interleaved with embedded tables for compensation, ownership, capitalization, selected financial data, and the financial statements themselves.
Exhibits are listed under Item 15 and filed as independent sub-documents inside the record. The exhibit-number taxonomy follows Item 601 of Regulation S-K; the families that recur in 10-12G filings are:
EX-3.1, EX-3.2, EX-3.3, etc. carry the Certificate or Articles of Incorporation (or, for LPs and LLCs, the Limited Partnership Agreement or Operating Agreement), any amendments and restatements thereof, and the Bylaws. These documents fix the charter terms of the registered class and are present even in minimal filings.EX-4.1, EX-4.2, and so on: distribution-reinvestment plans, share-repurchase plans, indentures, warrant agreements, and specimen stock certificates — any text that defines the contractual rights of the class being registered.Each exhibit appears as a standalone HTM with <TYPE>EX-... in its SGML wrapper and a matching documentFormatFiles[] entry. Exhibits already on file with the SEC are frequently incorporated by reference rather than re-filed, which is why many substantive initial 10-12G filings ship with only EX-3 and a subset of EX-10 documents, and why most 10-12G/A records contain only the amended narrative with no attached exhibits at all.
metadata.json headermetadata.json sits at the root of every filing folder and carries the EDGAR filing-level header plus the manifest of every sub-document. The fields have the following meaning:
formType — 10-12G for an initial registration, 10-12G/A for an amendment.accessionNo — the dashed accession number in canonical EDGAR form (e.g. 0001641172-25-017030).description — the human-readable form description, Form 10-12G - Registration of securities [Section 12(g)] or the [Amend] variant.filedAt — ISO 8601 timestamp with timezone offset capturing the EDGAR acceptance time (e.g. 2025-06-30T06:06:55-04:00).linkToFilingDetails, linkToTxt, linkToHtml — URLs to the primary HTM, the full SGML submission text bundle, and the EDGAR filing index page on www.sec.gov, respectively.linkToXbrl — an empty string for this dataset; Form 10-12G is not an XBRL-tagged form.documentFormatFiles[] — an ordered manifest with one entry per sub-document in the original submission. Each entry carries sequence (1-based, with the terminal "complete submission text file" entry using a single space), type (the EDGAR document type such as 10-12G, EX-3.1, EX-10.13, GRAPHIC), description (free-form filer label such as "ARTICLES OF AMENDMENT AND RESTATEMENT OF ..."), documentUrl (absolute archive URL), and size (byte size as a string). Entries for GRAPHIC artifacts and the complete submission .txt remain in the manifest even though those files are not shipped inside the ZIP.entities[] — one entry per filer or subject party, normally a single Filer. Each entry carries cik, companyName (suffixed with the role, e.g. "... INC (Filer)"), type (echoes the form type), act ("34" for the Securities Exchange Act of 1934), fileNo (SEC-assigned file number, e.g. 000-55963), filmNo, irsNo ("000000000" when omitted), fiscalYearEnd in MMDD form, stateOfIncorporation as a two-letter code, sic (SIC code and label, e.g. "6282 Investment Advice", "6770 Blank Checks"), and tickers[] when the filer has listed symbols.seriesAndClassesContractsInformation[] — an empty array in this dataset; fund series/class structure is not encoded for 10-12G filings.dataFiles[] — an empty array; no XBRL or XML data files accompany 10-12G filings.id — an opaque 32-character hex identifier stable per filing.Each record physically contains metadata.json plus every HTML/HTM sub-document that was part of the original EDGAR submission: the primary form and each attached exhibit in its own file. Together, these reconstitute the full textual content of the 10-12G or 10-12G/A as filed — cover page, Item-by-Item narrative, embedded tables (compensation, ownership, capitalization, selected financial data, financial statements and notes), signature block, and every exhibit body.
The dataset is deliberately trimmed of non-textual and fully redundant artifacts. Records exclude:
TYPE=GRAPHIC in EDGAR remain referenced in documentFormatFiles[] but are not packed into the ZIP..txt submission file (the manifest entry whose type is a single space) is likewise referenced but omitted — the individual sub-document HTMs already contain all textual content.The dataset spans May 1994 to the present, a window during which the content requirements of Form 10 and the underlying Regulation S-K Items have evolved materially. Risk Factors disclosure (Item 1A in current form) was added as an explicit Item only after the SEC's 2005 Securities Offering Reform, which required plain-English risk-factor disclosure in annual reports and registration statements; earlier 10-12G filings typically embed risk-type disclosures inside the Business item rather than breaking them out. Executive compensation disclosure was substantially overhauled in 2006 (Compensation Discussion and Analysis, revised summary compensation table categories, expanded director compensation) and again incrementally with pay-ratio and pay-versus-performance rules, so early-era 10-12G records carry a much simpler compensation section than modern ones. Related-party transaction disclosures (Item 404), director independence disclosures (Item 407), and the cyber-incident and human-capital disclosures added in later Regulation S-K amendments appear only in 10-12G filings post-dating the respective rule effective dates. Exhibit requirements have likewise drifted: the EX-21 subsidiary list and EX-23 auditor consent have been stable, but the scope of EX-10 material-contract filings has been refined by later amendments to Item 601, and hyperlinking of exhibits became mandatory in 2017. Registrants subject to emerging-growth-company, smaller-reporting-company, or investment-company scaling provisions file content that is proportionately reduced relative to the full Form 10 template.
The file-types found in this dataset — TXT, JSON, HTML, XFD, and PDF — reflect the evolution of EDGAR's accepted submission formats across the 1994-to-present window. The earliest 10-12G filings, from May 1994 through the late 1990s, were submitted as ASCII plain-text documents: narrative and tables alike rendered as fixed-width text, often with page-break markers and no embedded formatting. HTML submissions became the predominant format after the SEC's 1999 modernization of EDGAR document-format standards, and by the mid-2000s essentially all 10-12G filings ship as HTM inside an EDGAR SGML wrapper. XFD and PDF appear only as tail-distribution artifacts for a small subset of filings, largely exhibits rendered in those formats by particular filers. Modern HTML bodies are frequently produced through Wdesk / Workiva (identifiable by <!-- Document created using Wdesk --> and <!-- Copyright ... Workiva --> comments inside the HTML) or by legacy EDGAR converters (inline Times New Roman styling and deprecated HTML attributes). Form 10-12G is not an XBRL-tagged form, and both linkToXbrl and dataFiles[] are empty for every record; structured financial data is not part of this collection.
Several nuances matter when reading or extracting from a record:
EX-... files in the folder is not a complete picture of the registrant's exhibit universe.<TYPE> header and from documentFormatFiles[*].type in metadata.json, not from filename patterns — naming is idiosyncratic across filers and filing agents.entities[] sub-fields. irsNo is populated with "000000000" rather than omitted when the filer supplies no IRS Employer ID.<TYPE> tag and the corresponding manifest entry are the authoritative classification of what the sub-document actually is.The filer on a Form 10-12G is always the issuer itself, registering one of its own equity security classes under Section 12(g). Typical filers:
Investment companies registered under the Investment Company Act of 1940 do not use Form 10-12G; they register on Form N-8A and related 1940 Act forms. Asset-backed issuers use Form SF-1/SF-3 and Form 10-D.
Section 12(g)(1) requires an issuer engaged in or affecting interstate commerce to register a class of equity securities (other than an exempted security) within 120 days after the last day of the first fiscal year on which it has:
Section 12(g) as enacted by the Securities Acts Amendments of 1964 set the thresholds at $1 million in assets and 500 record holders. The SEC raised the asset threshold in stages through Rule 12g-1 ($3 million, $5 million, then $10 million). Subsequent statutory changes:
Mandatory filers have 120 days after the triggering fiscal year-end to file the 10-12G. Voluntary filers may file at any time.
Form 10 is the general Exchange Act registration form; content is governed by its General Instructions, Regulation S-K (non-financial items including business, risk factors, MD&A, executive compensation, beneficial ownership, related-party transactions, and description of securities), and Regulation S-X (audited financial statements prepared to PCAOB standards). The same form is used for two EDGAR submission types:
A Section 12(g) registration on Form 10 is not declared effective by SEC staff. Under Section 12(g)(1), the registration statement becomes effective automatically 60 days after filing, unless the SEC accelerates effectiveness at the issuer's request or the issuer withdraws the filing. During the 60-day window, the Division of Corporation Finance may issue comments; the issuer typically responds by filing 10-12G/A amendments. Amendments do not, by themselves, re-toll the 60-day statutory clock — the issuer must either consent to delay effectiveness or withdraw and refile to reset it.
On effectiveness, the issuer becomes a Section 13(a) reporting company, subject to Forms 10-K, 10-Q, and 8-K, Section 14 proxy rules, and Section 16 insider reporting (for officers, directors, and 10% beneficial owners). Form 10-12G is a one-time entry point, not a recurring filing.
10-12G/A records are amendments to a prior Form 10-12G. Common triggers:
Amendments may be filed before or after statutory effectiveness.
Form 10 registration under Section 12(g) has existed since the 1964 Exchange Act amendments, but EDGAR submissions under the 10-12G type begin in May 1994 with the phased EDGAR mandate. Earlier Section 12(g) Form 10 filings exist only in paper and are not part of this dataset.
Form 10-12G is an Exchange Act class registration under Section 12(g), triggered by size and holder thresholds rather than by an offering or exchange listing. Because it overlaps in name, purpose, and disclosure content with several adjacent regimes, the useful comparisons fall into four groups: sister Exchange Act registrations (10-12B, 8-A), Securities Act offering forms (S-1, 1-A), life-cycle bookends and follow-ons (10-K/10-Q, Form 15), and specialized populations (N-2, 20-F).
The 10-12G dataset is narrow and specific: long-form Section 12(g) Exchange Act class registrations by domestic issuers, filed outside a concurrent Securities Act offering. It is the correct source for studying non-IPO entry into the reporting system (spin-offs onto OTC markets, holder-count crossings, voluntary registrants), first-disclosure baselines for companies that never filed an S-1, BDC Exchange Act registration events, and amendment trails via 10-12G/A. It is the wrong source for exchange-listing registrations (10-12B), IPO short-form listings (8-A), offerings (S-1, 1-A), ongoing reporting (10-K, 10-Q, 8-K), exits (Form 15), foreign issuers (20-F, 40-F), or investment-company offerings (N-2).
Form 10-12G is an issuer's first full public disclosure when registering a class of securities under Section 12(g) without a public offering. The dataset serves a narrow but varied user base: legal, regulatory, analytical, transactional, academic, and engineering teams, each working a different slice of the record.
Used as a precedent library when drafting or amending a 10-12G. Attorneys pull filings from issuers of comparable size, industry, and capital structure to benchmark Item 1 business descriptions, Item 1A risk factors, Item 10 securities descriptions (preferred terms, transfer restrictions, anti-takeover provisions), and EX-3 and EX-10 exhibit scopes. 10-12G/A amendments are mined to anticipate staff comments. Outputs: drafted registration statements, comment-response memos, and effectiveness checklists.
Staff attorneys and accountants use historical filings as a reference set when assessing Regulation S-K and S-X compliance. They compare how peer registrants disclosed related-party transactions, going-concern language, revenue recognition, Item 4 ownership tables, and exhibit indexes. Supports reviewer training, comment-letter consistency, and detection of recurring deficiencies.
Fundamental analysts treat the 10-12G as the initiation document for companies that become reporting without a traditional S-1. Item 1 feeds segment and revenue models; Item 1A populates downside frameworks; Item 6 informs incentive-alignment analysis; Item 4 drives free-float and holder estimates; EX-10 material contracts and EX-3 governance documents support initiation reports and valuation memos. BDC and closed-end fund analysts additionally use Item 6 and EX-10 advisory, administration, and credit-facility agreements to benchmark fee structures across the BDC universe.
Used to track portfolio companies taking the direct registration path to public status. Item 4 ownership tables reveal post-registration cap tables, sponsor holdings, and preferred-to-common conversion mechanics; EX-3 charters and EX-10 stockholder agreements document surviving investor rights. Supports competitive monitoring of sponsor-backed issuers entering the reporting population.
Bankers advising on reverse mergers, direct listings, and Reg A+ follow-ons study how prior registrants sequenced Section 12(g) registration with other transactions, how Item 10 accommodated multi-class structures, and what EX-10 exhibits (merger, share-exchange, lock-up agreements) accompanied registration. Outputs: structuring memos, pitch materials, and effectiveness-timing checklists.
Build baseline governance profiles for newly reporting issuers ahead of their first proxy cycle. EX-3 exhibits are coded for classified boards, supermajority votes, exclusive forum clauses, and advance-notice bylaws; Item 6 flags early pay-design signals; Item 4 reveals controlling-shareholder and dual-class dynamics. Feeds governance scoring models and client alerts on new entrants to the reporting universe.
Use the full 1994-to-present span as a study sample for work on registration effectiveness, capital-formation pathways, JOBS Act effects on Section 12(g) thresholds, and the private-to-public transition. Cover-page and metadata.json fields build panels of registration dates and effectiveness windows; Item 1 and Item 1A text support industry classification and risk-factor evolution studies; Item 6 supports compensation research. Outputs: working papers, journal articles, and replication archives.
Use the corpus as training and retrieval material for models that parse business descriptions, extract risk factors, classify material contracts, and segment exhibits. metadata.json enables deterministic indexing by CIK, filing date, form type (10-12G vs. 10-12G/A), and document type; TXT, HTML, and PDF variants provide raw content. Workflows: fine-tuning financial LLMs on registration prose, RAG over pre-public disclosure, and entity linking between registrants and their later 10-K, 10-Q, and 8-K filings.
Evaluate recently reporting targets or benchmark clients against a peer set. Item 1 informs market positioning; Item 1A and Item 3 surface operational, regulatory, and litigation risks; EX-10 contracts expose customer concentration, supplier dependencies, and financing covenants. Outputs: target-screening memos, competitive landscape decks, and pre-LOI red-flag reports.
Build registration-drafting assistants, exhibit trackers, disclosure-control platforms, and Section 16 readiness tools. Historical exhibit indexes and item-level structures train template libraries and recommendation engines; metadata drives filing-event detection features.
When a borrower or prospect registers under Section 12(g), analysts pull financial statements, Item 1A liquidity and going-concern language, and EX-10 credit agreements, indentures, security agreements, and guarantees. Outputs: updated credit memos covering capital structure, reporting obligations, and expected disclosure cadence.
The following workflows are distinctive to Form 10-12G: they exploit that a 10-12G is an issuer's first full Exchange Act disclosure filed without an accompanying offering document, and that the dataset's span from May 1994 to the present includes a large BDC sub-population and a long amendment tail.
Build a panel of first-disclosure records for issuers that entered Exchange Act reporting through spin-off, holder-count crossover, or voluntary registration rather than an S-1. Join metadata.json fields (entities[].cik, filedAt, fileNo, sic, stateOfIncorporation) across all 10-12G records, then extract Item 1 (Business), Item 1A (Risk Factors), and Item 13 (Financial Statements) from the primary HTM to create a "t=0" reference set that can be chained against later 10-K, 10-Q, and 8-K filings by CIK. Supports event studies of post-registration performance and risk-factor drift.
Filter the corpus to Business Development Companies (the large BDC sub-population that must file 10-12G under 1940 Act Section 54(a)(2)), then extract each record's EX-10 investment advisory agreements, sub-advisory agreements, and administration agreements. Parse the fee schedules, hurdle rates, catch-up provisions, and expense-reimbursement caps into a structured comparables table. Output: a BDC fee-terms benchmark used by fund analysts, LP due-diligence teams, and advisers pricing new BDC launches.
Group amendments to each parent 10-12G by CIK and accession chronology, then diff the amended primary-form HTM against the prior version to isolate what changed at each amendment stage. Because Form 10-12G goes effective automatically at 60 days unless amended, amendment patterns are a proxy for SEC staff feedback. Output: a corpus of "comment-response signals" for securities counsel training registration-drafting assistants and building effectiveness-timing models.
Extract EX-3.1, EX-3.2, and EX-3.3 (Certificate or Articles of Incorporation, LP agreements, bylaws) from every 10-12G record and tag each for classified boards, supermajority thresholds, exclusive forum clauses, dual-class voting, advance-notice bylaws, and blank-check preferred authorizations. Combined with Item 4 ownership tables, this produces a governance-profile dataset for every new entrant to the reporting universe, ready for ISS/Glass Lewis-style scoring ahead of the issuer's first proxy cycle.
Index every sub-document by the SGML <TYPE> wrapper value (10-12G, EX-3.1, EX-10.13, etc.) rather than by filer-chosen filenames, then build a retrieval layer keyed on form type, SIC, state of incorporation, and exhibit type. Securities counsel query "EX-10 placement-agent agreements from REIT 10-12Gs filed after 2020" or "Item 11 securities descriptions for multi-class LP interests" and retrieve precedent passages. Feeds drafting assistants, clause banks, and comment-response memo templates.
Use the type-tagged HTM files as labeled training data for NLP models that segment long registration statements into Items 1 through 15 and classify exhibits into the EX-3, EX-4, EX-10, EX-21, EX-23, and EX-99 families. The SGML <TYPE> header provides exhibit labels without manual annotation; the numbered Item headings in the primary form provide within-document section labels. Output: Item-aware parsers and exhibit classifiers deployable across 10-12G, 10-12B, S-1, and 10-K corpora.
Join each 10-12G record by CIK to the issuer's eventual Form 15 (termination of Exchange Act reporting) to measure reporting tenure, reasons for exit, and lifecycle survival. Item 1 and Item 1A text from the 10-12G characterize the issuer at entry; the Form 15 check-box notice characterizes the exit state. Output: survival-analysis panels for academic work on Section 12(g) threshold changes (including JOBS Act effects) and for RegTech products that forecast deregistration likelihood.
The Form 10-12G Files Dataset covers filings from May 1994 through the present and is distributed as monthly ZIP containers enumerated in a single dataset index. The dataset can be accessed in three ways: via the dataset index JSON API, as a full dataset archive, or by downloading individual container files.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-1012g-files.json
This endpoint returns the dataset metadata and the complete list of available container files. The metadata includes the dataset name, description, last updated timestamp, earliest sample date, total record count and total size, form types covered (10-12G and 10-12G/A), the container format (ZIP), and the content file types (TXT, JSON, HTML, XFD, PDF). Each container entry lists its download URL, relative key, size, record count, and last updated timestamp. Poll this endpoint to monitor which containers have been refreshed in the latest run and to decide which ones to download incrementally.
This endpoint does not require an API key.
Example response:
1
{
2
"datasetId": "1f13365b-9ae0-6937-8ddb-2b6a7f42d7bc",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-1012g-files.zip",
4
"name": "Form 10-12G Files Dataset",
5
"updatedAt": "2026-04-15T06:13:33.839Z",
6
"earliestSampleDate": "1994-05-01",
7
"totalRecords": 47906,
8
"totalSize": 1524182797,
9
"formTypes": ["10-12G", "10-12G/A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "HTML", "XFD", "PDF"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-1012g-files/2026/2026-04.zip",
15
"key": "2026/2026-04.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-04-15T06:13:33.839Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-1012g-files.zip?token=YOUR_API_KEY
Downloads the complete dataset as a single ZIP archive containing all monthly containers from May 1994 to the present. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-1012g-files/2026/2026-04.zip?token=YOUR_API_KEY
Downloads a single monthly container ZIP instead of the full dataset. Use the downloadUrl values returned by the dataset index JSON API to select specific months. This endpoint requires an API key.
The dataset covers EDGAR submissions of Form 10-12G — the general-purpose registration statement used to register a class of equity securities under Section 12(g) of the Securities Exchange Act of 1934 — together with its amendments, submission type 10-12G/A. Both submission types are filed under the same Form 10 template and governed by Regulation S-K and Regulation S-X.
One record represents a single EDGAR submission: either one initial Form 10-12G registration statement or one 10-12G/A amendment. Each record is a dedicated filing folder named by the 18-digit dashless SEC accession number, containing exactly one metadata.json header plus one or more SGML-wrapped HTM sub-documents (the primary form and attached exhibits).
The filer is always the issuer of the class of equity securities being registered. Mandatory filers are domestic issuers (including bank and savings-and-loan holding companies administered by the SEC) that have crossed the Section 12(g) thresholds — currently more than $10 million in total assets and either 2,000 record holders or 500 non-accredited record holders for non-bank issuers — but are not listed on a national exchange. Issuers may also file voluntarily to enter Exchange Act reporting without being required to, for example to qualify for OTC Markets tiers, support employee stock plans, or prepare for a future exchange uplisting.
A Form 10-12G is not declared effective by SEC staff; under Section 12(g)(1) it becomes effective automatically 60 days after filing unless the SEC accelerates effectiveness or the issuer withdraws the filing. During the 60-day window, the Division of Corporation Finance may issue comments, which issuers typically answer by filing 10-12G/A amendments.
Form S-1 registers an offer and sale of specific securities under the Securities Act of 1933 and requires SEC declaration of effectiveness; Form 10-12G registers a class of securities under the Exchange Act, involves no offering mechanics, and goes effective automatically after 60 days. The two forms overlap heavily on business, risk-factor, MD&A, compensation, and audited-financial content, but S-1 adds pricing, underwriting, use of proceeds, and dilution disclosure, while 10-12G omits all offering mechanics and is the correct source for non-IPO entry into the reporting system.
All three are Exchange Act registration forms, but they map to different pathways. Form 10-12B uses the same Form 10 content as 10-12G but registers under Section 12(b) in connection with listing on a national securities exchange. Form 8-A is a short-form registration available to issuers that already have a current Securities Act registration (such as an S-1 IPO) to incorporate by reference from, whereas Form 10-12G is the full standalone disclosure package used when no such Securities Act filing exists to lean on.
The dataset is distributed as monthly ZIP containers. Inside each container, every filing is stored as a folder named by the accession number and contains a metadata.json header plus SGML-wrapped HTM sub-documents. Content file types that may appear across the historical span are TXT, JSON, HTML, XFD, and PDF; Form 10-12G is not an XBRL-tagged form, so no XBRL or XML structured-data files are included.
The dataset covers Form 10-12G and 10-12G/A submissions on EDGAR starting in May 1994 — when the phased EDGAR mandate began accepting such filings electronically — through the present. Earlier Section 12(g) Form 10 filings exist only in paper and are not part of this dataset.