Form 10-12G Files Dataset

The Form 10-12G Files Dataset is a full-text collection of EDGAR submissions of Form 10-12G and Form 10-12G/A — the general-purpose registration statement (and its amendments) used to register a class of equity securities under Section 12(g) of the Securities Exchange Act of 1934 when the issuer is not listing on a national securities exchange. Each record represents one EDGAR submission: a single registration statement or a single amendment, materialized as a dedicated filing folder containing a metadata.json header plus one or more SGML-wrapped HTM sub-documents (the primary form and its exhibits). The filer is always the issuer of the class being registered, ranging from threshold-crossing domestic operating companies to Business Development Companies, spin-offs, bank and savings-and-loan holding companies, and voluntary registrants entering the reporting system without a concurrent offering. The dataset begins in May 1994 with the phased EDGAR mandate and extends to the present, covering both initial 10-12G registrations and the 10-12G/A amendment tail that accompanies the 60-day automatic effectiveness window.

Update Frequency
Daily
Updated at
2026-05-09
Earliest Sample Date
1994-05-01
Total Size
1.5 GB
Total Records
47,978
Container Format
ZIP
Content Types
TXT, JSON, HTML, XFD, PDF
Form Types
10-12G, 10-12G/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

374 files · 1.5 GB
Download All
2026-05.zip33.0 KB5 records
2026-04.zip3.5 MB101 records
2026-03.zip5.1 MB149 records
2026-02.zip2.8 MB59 records
2026-01.zip3.0 MB48 records
2025-12.zip5.7 MB112 records
2025-11.zip3.2 MB95 records
2025-10.zip4.5 MB107 records
2025-09.zip4.7 MB88 records
2025-08.zip5.1 MB100 records
2025-07.zip5.0 MB120 records
2025-06.zip3.2 MB45 records
2025-05.zip3.4 MB106 records
2025-04.zip5.5 MB105 records
2025-03.zip3.0 MB95 records
2025-02.zip2.6 MB79 records
2025-01.zip3.3 MB65 records
2024-12.zip3.1 MB84 records
2024-11.zip4.9 MB63 records
2024-10.zip5.2 MB106 records
2024-09.zip3.7 MB108 records
2024-08.zip3.7 MB81 records
2024-07.zip4.4 MB123 records
2024-06.zip1.2 MB20 records
2024-05.zip2.6 MB55 records
2024-04.zip2.8 MB43 records
2024-03.zip2.1 MB35 records
2024-02.zip11.8 MB96 records
2024-01.zip2.6 MB113 records
2023-12.zip6.2 MB97 records
2023-11.zip3.7 MB47 records
2023-10.zip4.4 MB130 records
2023-09.zip3.6 MB92 records
2023-08.zip4.5 MB147 records
2023-07.zip6.5 MB117 records
2023-06.zip6.1 MB137 records
2023-05.zip4.4 MB78 records
2023-04.zip5.5 MB163 records
2023-03.zip2.8 MB89 records
2023-02.zip2.7 MB86 records
2023-01.zip2.7 MB85 records
2022-12.zip2.9 MB79 records
2022-11.zip2.6 MB94 records
2022-10.zip2.4 MB77 records
2022-09.zip6.0 MB141 records
2022-08.zip6.7 MB153 records
2022-07.zip5.0 MB132 records
2022-06.zip7.3 MB164 records
2022-05.zip6.7 MB134 records
2022-04.zip10.7 MB199 records
2022-03.zip4.9 MB100 records
2022-02.zip8.0 MB155 records
2022-01.zip5.5 MB137 records
2021-12.zip8.3 MB217 records
2021-11.zip11.7 MB199 records
2021-10.zip16.5 MB187 records
2021-09.zip10.4 MB209 records
2021-08.zip6.2 MB184 records
2021-07.zip5.9 MB208 records
2021-06.zip6.3 MB193 records
2021-05.zip9.2 MB210 records
2021-04.zip4.2 MB148 records
2021-03.zip4.9 MB157 records
2021-02.zip2.3 MB86 records
2021-01.zip4.4 MB148 records
2020-12.zip5.6 MB154 records
2020-11.zip7.8 MB220 records
2020-10.zip4.8 MB140 records
2020-09.zip2.6 MB87 records
2020-08.zip3.4 MB107 records
2020-07.zip2.0 MB58 records
2020-06.zip6.5 MB175 records
2020-05.zip540.6 KB21 records
2020-04.zip1.5 MB62 records
2020-03.zip4.4 MB49 records
2020-02.zip1.9 MB78 records
2020-01.zip5.2 MB123 records
2019-12.zip5.4 MB112 records
2019-11.zip12.9 MB91 records
2019-10.zip2.4 MB92 records
2019-09.zip2.1 MB78 records
2019-08.zip3.4 MB170 records
2019-07.zip2.4 MB74 records
2019-06.zip8.3 MB99 records
2019-05.zip3.3 MB122 records
2019-04.zip2.9 MB110 records
2019-03.zip1.5 MB78 records
2019-02.zip1.8 MB77 records
2019-01.zip1.8 MB93 records
2018-12.zip1.8 MB56 records
2018-11.zip1.7 MB53 records
2018-10.zip2.0 MB73 records
2018-09.zip1.5 MB68 records
2018-08.zip3.5 MB187 records
2018-07.zip1.6 MB89 records
2018-06.zip3.6 MB123 records
2018-05.zip1.9 MB133 records
2018-04.zip2.0 MB141 records
2018-03.zip1.7 MB68 records
2018-02.zip2.4 MB118 records
2018-01.zip3.1 MB124 records
2017-12.zip1.5 MB76 records
2017-11.zip3.6 MB140 records
2017-10.zip3.2 MB80 records
2017-09.zip4.6 MB153 records
2017-08.zip4.9 MB142 records
2017-07.zip2.6 MB92 records
2017-06.zip4.3 MB138 records
2017-05.zip4.1 MB117 records
2017-04.zip2.3 MB87 records
2017-03.zip1.6 MB56 records
2017-02.zip3.0 MB96 records
2017-01.zip2.1 MB97 records
2016-12.zip3.0 MB74 records
2016-11.zip2.4 MB75 records
2016-10.zip2.8 MB48 records
2016-09.zip3.4 MB122 records
2016-08.zip1.8 MB92 records
2016-07.zip2.7 MB96 records
2016-06.zip2.3 MB99 records
2016-05.zip2.2 MB111 records
2016-04.zip3.4 MB111 records
2016-03.zip4.2 MB123 records
2016-02.zip1.5 MB46 records
2016-01.zip3.4 MB112 records
2015-12.zip2.1 MB77 records
2015-11.zip749.9 KB21 records
2015-10.zip2.3 MB87 records
2015-09.zip3.0 MB164 records
2015-08.zip2.9 MB169 records
2015-07.zip88.4 MB81 records
2015-06.zip1.9 MB104 records
2015-05.zip2.0 MB87 records
2015-04.zip71.4 MB156 records
2015-03.zip2.8 MB161 records
2015-02.zip3.1 MB127 records
2015-01.zip1.8 MB61 records
2014-12.zip4.0 MB246 records
2014-11.zip3.2 MB161 records
2014-10.zip4.3 MB206 records
2014-09.zip8.3 MB118 records
2014-08.zip2.6 MB95 records
2014-07.zip9.5 MB216 records
2014-06.zip3.6 MB248 records
2014-05.zip11.7 MB89 records
2014-04.zip3.2 MB98 records
2014-03.zip1.2 MB52 records
2014-02.zip3.4 MB120 records
2014-01.zip4.0 MB95 records
2013-12.zip4.1 MB181 records
2013-11.zip3.9 MB143 records
2013-10.zip3.2 MB106 records
2013-09.zip3.5 MB194 records
2013-08.zip4.7 MB193 records
2013-07.zip4.3 MB228 records
2013-06.zip4.2 MB203 records
2013-05.zip3.7 MB134 records
2013-04.zip3.7 MB193 records
2013-03.zip2.0 MB107 records
2013-02.zip2.3 MB133 records
2013-01.zip2.8 MB124 records
2012-12.zip2.2 MB93 records
2012-11.zip2.5 MB99 records
2012-10.zip3.9 MB172 records
2012-09.zip2.2 MB73 records
2012-08.zip5.7 MB167 records
2012-07.zip2.4 MB119 records
2012-06.zip5.3 MB262 records
2012-05.zip3.2 MB171 records
2012-04.zip4.9 MB237 records
2012-03.zip2.9 MB138 records
2012-02.zip3.4 MB119 records
2012-01.zip6.7 MB194 records
2011-12.zip5.8 MB259 records
2011-11.zip4.5 MB157 records
2011-10.zip18.0 MB122 records
2011-09.zip2.8 MB108 records
2011-08.zip5.4 MB201 records
2011-07.zip5.4 MB298 records
2011-06.zip4.1 MB192 records
2011-05.zip5.4 MB333 records
2011-04.zip7.5 MB298 records
2011-03.zip7.9 MB327 records
2011-02.zip8.4 MB364 records
2011-01.zip9.5 MB344 records
2010-12.zip8.0 MB302 records
2010-11.zip7.8 MB377 records
2010-10.zip8.9 MB327 records
2010-09.zip5.9 MB267 records
2010-08.zip8.9 MB460 records
2010-07.zip6.0 MB287 records
2010-06.zip6.8 MB209 records
2010-05.zip6.5 MB193 records
2010-04.zip7.9 MB258 records
2010-03.zip3.7 MB135 records
2010-02.zip3.4 MB172 records
2010-01.zip6.9 MB145 records
2009-12.zip6.1 MB139 records
2009-11.zip7.3 MB217 records
2009-10.zip4.5 MB218 records
2009-09.zip3.5 MB135 records
2009-08.zip7.8 MB211 records
2009-07.zip6.8 MB249 records
2009-06.zip4.5 MB153 records
2009-05.zip4.9 MB130 records
2009-04.zip9.3 MB382 records
2009-03.zip2.7 MB59 records
2009-02.zip7.9 MB278 records
2009-01.zip4.1 MB206 records
2008-12.zip5.8 MB186 records
2008-11.zip4.8 MB233 records
2008-10.zip25.9 MB338 records
2008-09.zip7.4 MB270 records
2008-08.zip10.0 MB388 records
2008-07.zip11.1 MB443 records
2008-06.zip6.8 MB252 records
2008-05.zip6.2 MB396 records
2008-04.zip9.5 MB283 records
2008-03.zip6.7 MB322 records
2008-02.zip7.8 MB345 records
2008-01.zip3.1 MB120 records
2007-12.zip3.0 MB67 records
2007-11.zip1.7 MB87 records
2007-10.zip2.2 MB41 records
2007-09.zip1.0 MB44 records
2007-08.zip1.8 MB42 records
2007-07.zip3.1 MB53 records
2007-06.zip3.0 MB73 records
2007-05.zip2.7 MB58 records
2007-04.zip9.2 MB341 records
2007-03.zip1.0 MB33 records
2007-02.zip1.6 MB36 records
2007-01.zip2.8 MB88 records
2006-12.zip1.2 MB44 records
2006-11.zip1.4 MB56 records
2006-10.zip2.2 MB83 records
2006-09.zip790.8 KB27 records
2006-08.zip2.8 MB57 records
2006-07.zip3.2 MB79 records
2006-06.zip4.4 MB113 records
2006-05.zip7.8 MB252 records
2006-04.zip5.8 MB200 records
2006-03.zip3.3 MB130 records
2006-02.zip4.5 MB111 records
2006-01.zip1.7 MB75 records
2005-12.zip4.6 MB172 records
2005-11.zip1.7 MB63 records
2005-10.zip1.8 MB49 records
2005-09.zip4.2 MB103 records
2005-08.zip1.9 MB46 records
2005-07.zip4.1 MB122 records
2005-06.zip6.8 MB223 records
2005-05.zip3.5 MB217 records
2005-04.zip3.0 MB124 records
2005-03.zip1.1 MB38 records
2005-02.zip1.1 MB20 records
2005-01.zip1.6 MB64 records
2004-12.zip1.8 MB96 records
2004-11.zip534.2 KB10 records
2004-10.zip552.4 KB17 records
2004-09.zip778.1 KB25 records
2004-08.zip1.5 MB54 records
2004-07.zip1.4 MB40 records
2004-06.zip2.8 MB99 records
2004-05.zip2.6 MB53 records
2004-04.zip5.5 MB196 records
2004-03.zip1.8 MB92 records
2004-02.zip1.2 MB46 records
2004-01.zip876.3 KB16 records
2003-12.zip1.7 MB65 records
2003-11.zip1.7 MB57 records
2003-10.zip1.1 MB30 records
2003-09.zip1.4 MB45 records
2003-08.zip2.1 MB67 records
2003-07.zip1.1 MB15 records
2003-06.zip1.8 MB68 records
2003-05.zip2.3 MB89 records
2003-04.zip4.4 MB170 records
2003-03.zip1.1 MB47 records
2003-02.zip560.4 KB16 records
2003-01.zip1.2 MB39 records
2002-12.zip1.3 MB38 records
2002-11.zip2.1 MB65 records
2002-10.zip731.6 KB29 records
2002-09.zip999.5 KB38 records
2002-08.zip924.2 KB34 records
2002-07.zip1.7 MB76 records
2002-06.zip1.5 MB48 records
2002-05.zip2.9 MB120 records
2002-04.zip3.9 MB201 records
2002-03.zip737.0 KB29 records
2002-02.zip1.1 MB54 records
2002-01.zip1.2 MB37 records
2001-12.zip698.0 KB20 records
2001-11.zip1.9 MB93 records
2001-10.zip2.0 MB109 records
2001-09.zip707.4 KB22 records
2001-08.zip2.8 MB129 records
2001-07.zip2.1 MB123 records
2001-06.zip1.8 MB72 records
2001-05.zip1.2 MB47 records
2001-04.zip4.4 MB232 records
2001-03.zip1.2 MB51 records
2001-02.zip1.3 MB86 records
2001-01.zip1.3 MB37 records
2000-12.zip1.6 MB105 records
2000-11.zip1.7 MB85 records
2000-10.zip2.5 MB154 records
2000-09.zip1.2 MB57 records
2000-08.zip3.3 MB137 records
2000-07.zip3.1 MB142 records
2000-06.zip3.8 MB207 records
2000-05.zip6.2 MB340 records
2000-04.zip4.6 MB369 records
2000-03.zip3.6 MB212 records
2000-02.zip4.2 MB305 records
2000-01.zip3.1 MB241 records
1999-12.zip6.1 MB499 records
1999-11.zip3.5 MB275 records
1999-10.zip4.0 MB271 records
1999-09.zip2.9 MB197 records
1999-08.zip3.3 MB252 records
1999-07.zip3.7 MB250 records
1999-06.zip3.6 MB276 records
1999-05.zip3.0 MB255 records
1999-04.zip4.2 MB215 records
1999-03.zip2.1 MB170 records
1999-02.zip1.8 MB155 records
1999-01.zip1.3 MB86 records
1998-12.zip2.0 MB118 records
1998-11.zip2.2 MB143 records
1998-10.zip2.2 MB81 records
1998-09.zip2.3 MB141 records
1998-08.zip2.7 MB193 records
1998-07.zip3.0 MB153 records
1998-06.zip4.0 MB199 records
1998-05.zip1.1 MB38 records
1998-04.zip3.6 MB235 records
1998-03.zip1.4 MB72 records
1998-02.zip2.1 MB116 records
1998-01.zip2.2 MB83 records
1997-12.zip1.1 MB86 records
1997-11.zip1.6 MB65 records
1997-10.zip1.5 MB95 records
1997-09.zip2.4 MB99 records
1997-08.zip2.4 MB112 records
1997-07.zip3.0 MB103 records
1997-06.zip2.7 MB110 records
1997-05.zip2.3 MB109 records
1997-04.zip3.3 MB176 records
1997-03.zip2.2 MB97 records
1997-02.zip2.4 MB113 records
1997-01.zip1.2 MB65 records
1996-12.zip2.0 MB129 records
1996-11.zip3.7 MB140 records
1996-10.zip2.0 MB119 records
1996-09.zip2.3 MB125 records
1996-08.zip1.7 MB79 records
1996-07.zip1.9 MB83 records
1996-06.zip868.4 KB53 records
1996-05.zip1.5 MB71 records
1996-04.zip10.3 KB2 records
1996-03.zip190.7 KB16 records
1996-01.zip103.1 KB3 records
1995-12.zip100.6 KB3 records
1995-10.zip283.0 KB12 records
1995-08.zip130.8 KB4 records
1995-06.zip126.4 KB4 records
1995-05.zip119.9 KB3 records
1995-03.zip781.7 KB2 records
1995-02.zip159.9 KB1 records
1994-12.zip286.4 KB6 records
1994-06.zip75.8 KB4 records
1994-05.zip71.7 KB8 records

What This Dataset Contains

The dataset is built from two EDGAR submission types — 10-12G (initial class registration under Section 12(g)) and 10-12G/A (amendments to a prior 10-12G) — covering the complete population of such submissions filed to EDGAR from May 1994 through the present. Form 10-12G is the general-purpose Exchange Act registration statement used when an issuer must register a class of securities under Section 12(g) because it has crossed the Section 12(g) thresholds on total assets and holders of record, or when an issuer elects to enter Exchange Act reporting voluntarily without listing on a national exchange. Registration on Form 10-12G becomes effective automatically sixty days after filing unless withdrawn or amended, so amendment records (10-12G/A) are a first-class population in the dataset — each amendment has its own accession number, its own folder, and its own metadata file.

The dataset is distributed as monthly ZIP containers. Inside each container, every filing is stored as a folder named by the 18-digit dashless SEC accession number, holding a metadata.json header plus the filing's SGML-wrapped HTM sub-documents (the primary registration statement and attached exhibits). The content file types that may appear across the historical span are TXT, JSON, HTML, XFD, and PDF, reflecting the evolution of EDGAR-accepted submission formats from the mid-1990s forward. Form 10-12G is not an XBRL-tagged form; no XBRL or XML structured-financial-data files accompany these records.

Content Structure of a Single Record

What one record represents

One record in the Form 10-12G Files Dataset is a single EDGAR submission of Form 10-12G or Form 10-12G/A: one complete registration statement, or one amendment to a registration statement, filed under Securities Exchange Act of 1934 Section 12(g). A record is materialized as a dedicated filing folder inside a monthly ZIP container. The folder is named after the 18-digit, dashless SEC accession number of the submission (for example, 000199937125007032 corresponds to accession 0001999371-25-007032) and groups together every document that composes that one accession. Each folder contains exactly one metadata.json file plus one or more SGML-wrapped HTM sub-documents — the primary registration statement and, when attached, its exhibits. Amendments (10-12G/A) are stored as independent records: each amendment receives its own accession number, its own folder, and its own metadata file, rather than being merged into the original filing it amends.

What the underlying filing is

Form 10-12G is the general-purpose registration statement used to register a class of securities under Section 12(g) of the Exchange Act, typically because the issuer has crossed the Section 12(g) thresholds on total assets and holders of record and must become a reporting company even without an exchange listing. Registration on Form 10-12G becomes effective automatically sixty days after filing unless withdrawn or amended. The content requirements are defined by the Form 10 instructions and by Regulation S-K, which prescribe a numbered sequence of Items covering business description, risk factors, financial information, properties, security ownership, directors and officers, executive compensation, related-party transactions, legal proceedings, market for the registrant's securities, recent sales of unregistered securities, description of the securities being registered, indemnification provisions, financial statements and supplementary data, changes and disagreements with accountants, and a list of exhibits. Form 10-12G/A follows the same content architecture but re-states, supplements, or corrects portions of a previously filed 10-12G; in practice amendments range from single-line corrections to wholesale replacements of the prospectus-like body.

Content structure of a single record

At the container level, a record is a directory whose contents fall into two layers: a machine-readable header (metadata.json) and the filer's submitted narrative and exhibit documents (all .htm). The typical physical layout is:

  • metadata.json — the EDGAR filing header plus a manifest enumerating every sub-document in the submission.
  • One HTM for the primary form — the full Form 10-12G or 10-12G/A narrative, carrying TYPE=10-12G or TYPE=10-12G/A in its SGML wrapper.
  • Zero or more exhibit HTMs — each carrying an EX-... type in its SGML wrapper and corresponding one-to-one with an EX-... entry in the manifest.

Inside each HTM file, the EDGAR SGML wrapper precedes the HTML payload. The wrapper opens with <DOCUMENT>, declares <TYPE> (the EDGAR document type, e.g. 10-12G, EX-3.1, EX-10.13), <SEQUENCE> (the document's 1-based ordinal within the submission), <FILENAME> (the original filer-supplied filename), and <DESCRIPTION> (a free-form label), then opens <TEXT> around the <HTML> body and closes with matching tags. These SGML header fields mirror the sequence, type, filename, and description entries inside metadata.json, so the two layers cross-reference cleanly. HTM filenames are filer-chosen and are not standardized (d925208d1012ga.htm, risecompaniescorp-form10a.htm, nuveenchurchillbdcv-form10.htm, ex3-1.htm, exhibit101-form10a.htm, cnlrcred-1012g_060225.htm); the stable way to identify a given document's role is through the EDGAR TYPE value in the SGML wrapper or the type field in the manifest.

The primary Form 10-12G narrative, Item by Item

The primary HTM — the largest file in most records, often exceeding a megabyte for fund and BDC filings — carries the full registration statement text. The narrative opens with a cover page that identifies the registrant's exact legal name, state of incorporation or organization, IRS Employer Identification Number, principal executive office address and telephone number, the title of each class of securities to be registered, the name of each exchange on which each class is to be registered (typically "None" for 12(g)-only registrations), and the title of any classes registered under Section 12(b). Following the cover, the body is organized as a numbered sequence of Items prescribed by the Form 10 instructions and the governing Regulation S-K Items, typically:

  • Item 1. Business. General development, operating segments, products or services, markets and distribution, competition, intellectual property, regulation, and human capital. For investment vehicles, this Item discloses the fund's investment objective, strategies, principal investments, and leverage profile.
  • Item 1A. Risk Factors. Material risks specific to the registrant, its industry, and the securities being registered, set out as discrete enumerated risks.
  • Item 2. Financial Information. Selected financial data (historically), supplementary quarterly data where applicable, and Management's Discussion and Analysis of financial condition and results of operations. Often abbreviated for funds and newly organized issuers with limited operating history.
  • Item 3. Properties. Principal physical properties, leases, or, for real-estate vehicles, portfolio property holdings.
  • Item 4. Security Ownership of Certain Beneficial Owners and Management. Tabular disclosure of 5% holders, officers, directors, and the director/officer group as a whole, with amount and percent of class.
  • Item 5. Directors and Executive Officers. Name, age, position, and background of each director and executive officer, including family relationships and involvement in legal proceedings.
  • Item 6. Executive Compensation. Summary compensation table, narrative compensation discussion (where required), director compensation, and related plans.
  • Item 7. Certain Relationships and Related Transactions, and Director Independence. Related-party transactions over applicable dollar thresholds and board independence determinations.
  • Item 8. Legal Proceedings. Material pending litigation, regulatory proceedings, and similar matters.
  • Item 9. Market Price of and Dividends on the Registrant's Common Equity and Related Stockholder Matters. Whether a public trading market exists, approximate number of holders, dividend history, and equity compensation plan information.
  • Item 10. Recent Sales of Unregistered Securities. Private placements and other unregistered issuances during the relevant period, including the exemption claimed.
  • Item 11. Description of Registrant's Securities to be Registered. Rights, preferences, and limitations of each class being registered (common stock, preferred stock, debt, units, LP interests), including voting, dividends, liquidation, redemption, conversion, and anti-takeover provisions.
  • Item 12. Indemnification of Directors and Officers. Statutory and charter-based indemnification provisions and any related insurance.
  • Item 13. Financial Statements and Supplementary Data. Audited financial statements (balance sheet, income or operations statement, cash-flow statement, statement of changes in equity, notes, and auditor's report) plus any supplementary schedules.
  • Item 14. Changes in and Disagreements with Accountants on Accounting and Financial Disclosure. Any reportable disagreements or auditor changes.
  • Item 15. Financial Statements and Exhibits. A list of financial statements furnished and a numbered index of every exhibit filed with, or incorporated by reference into, the registration statement.

The narrative closes with a signature block executed by a duly authorized officer, dated as of the filing date. Because Form 10-12G resembles a Securities Act S-1 in content and depth, the prose is prospectus-like: dense, heavily footnoted, and interleaved with embedded tables for compensation, ownership, capitalization, selected financial data, and the financial statements themselves.

The exhibit layer

Exhibits are listed under Item 15 and filed as independent sub-documents inside the record. The exhibit-number taxonomy follows Item 601 of Regulation S-K; the families that recur in 10-12G filings are:

  • EX-3 — Articles / Certificate / Bylaws. EX-3.1, EX-3.2, EX-3.3, etc. carry the Certificate or Articles of Incorporation (or, for LPs and LLCs, the Limited Partnership Agreement or Operating Agreement), any amendments and restatements thereof, and the Bylaws. These documents fix the charter terms of the registered class and are present even in minimal filings.
  • EX-4 — Instruments Defining Rights of Security Holders. EX-4.1, EX-4.2, and so on: distribution-reinvestment plans, share-repurchase plans, indentures, warrant agreements, and specimen stock certificates — any text that defines the contractual rights of the class being registered.
  • EX-10 — Material Contracts. The largest and most variable family. For operating companies this includes employment agreements, credit agreements, supply agreements, and lease agreements. For funds and BDCs it encompasses investment advisory and sub-advisory agreements, administrative services agreements, placement-agent agreements, broker-dealer selling agreements, indemnification agreements, escrow agreements, servicing and master-servicing agreements, custody agreements, and stock-purchase agreements. Large fund 10-12Gs commonly ship with more than a dozen separate EX-10 sub-documents.
  • EX-21 — Subsidiaries of the Registrant. A short list of subsidiary legal names and their jurisdictions of incorporation.
  • EX-23 — Consents of Experts and Counsel. Auditor consents where financial statements were audited.
  • EX-99 — Additional Exhibits. Miscellaneous items outside the other families.

Each exhibit appears as a standalone HTM with <TYPE>EX-... in its SGML wrapper and a matching documentFormatFiles[] entry. Exhibits already on file with the SEC are frequently incorporated by reference rather than re-filed, which is why many substantive initial 10-12G filings ship with only EX-3 and a subset of EX-10 documents, and why most 10-12G/A records contain only the amended narrative with no attached exhibits at all.

The metadata.json header

metadata.json sits at the root of every filing folder and carries the EDGAR filing-level header plus the manifest of every sub-document. The fields have the following meaning:

  • formType10-12G for an initial registration, 10-12G/A for an amendment.
  • accessionNo — the dashed accession number in canonical EDGAR form (e.g. 0001641172-25-017030).
  • description — the human-readable form description, Form 10-12G - Registration of securities [Section 12(g)] or the [Amend] variant.
  • filedAt — ISO 8601 timestamp with timezone offset capturing the EDGAR acceptance time (e.g. 2025-06-30T06:06:55-04:00).
  • linkToFilingDetails, linkToTxt, linkToHtml — URLs to the primary HTM, the full SGML submission text bundle, and the EDGAR filing index page on www.sec.gov, respectively.
  • linkToXbrl — an empty string for this dataset; Form 10-12G is not an XBRL-tagged form.
  • documentFormatFiles[] — an ordered manifest with one entry per sub-document in the original submission. Each entry carries sequence (1-based, with the terminal "complete submission text file" entry using a single space), type (the EDGAR document type such as 10-12G, EX-3.1, EX-10.13, GRAPHIC), description (free-form filer label such as "ARTICLES OF AMENDMENT AND RESTATEMENT OF ..."), documentUrl (absolute archive URL), and size (byte size as a string). Entries for GRAPHIC artifacts and the complete submission .txt remain in the manifest even though those files are not shipped inside the ZIP.
  • entities[] — one entry per filer or subject party, normally a single Filer. Each entry carries cik, companyName (suffixed with the role, e.g. "... INC (Filer)"), type (echoes the form type), act ("34" for the Securities Exchange Act of 1934), fileNo (SEC-assigned file number, e.g. 000-55963), filmNo, irsNo ("000000000" when omitted), fiscalYearEnd in MMDD form, stateOfIncorporation as a two-letter code, sic (SIC code and label, e.g. "6282 Investment Advice", "6770 Blank Checks"), and tickers[] when the filer has listed symbols.
  • seriesAndClassesContractsInformation[] — an empty array in this dataset; fund series/class structure is not encoded for 10-12G filings.
  • dataFiles[] — an empty array; no XBRL or XML data files accompany 10-12G filings.
  • id — an opaque 32-character hex identifier stable per filing.

What the dataset record includes

Each record physically contains metadata.json plus every HTML/HTM sub-document that was part of the original EDGAR submission: the primary form and each attached exhibit in its own file. Together, these reconstitute the full textual content of the 10-12G or 10-12G/A as filed — cover page, Item-by-Item narrative, embedded tables (compensation, ownership, capitalization, selected financial data, financial statements and notes), signature block, and every exhibit body.

What the dataset record excludes

The dataset is deliberately trimmed of non-textual and fully redundant artifacts. Records exclude:

  • Graphics. JPG, GIF, and other image artifacts declared with TYPE=GRAPHIC in EDGAR remain referenced in documentFormatFiles[] but are not packed into the ZIP.
  • The complete submission text bundle. EDGAR's concatenated .txt submission file (the manifest entry whose type is a single space) is likewise referenced but omitted — the individual sub-document HTMs already contain all textual content.
  • PDF and XFD artifacts. These formats occasionally appear in the manifest for very old or alternative-format filings and are not unpacked; HTML is the canonical body format for this dataset.
  • Cross-filing context. Because each amendment has its own accession number and folder, the dataset does not pre-join a 10-12G/A to its parent 10-12G. Amendments typically rely on incorporation by reference for exhibits filed with the original registration statement, so the amendment folder may be small on its own while the substantive record is spread across multiple accessions.

Changes in required content over time

The dataset spans May 1994 to the present, a window during which the content requirements of Form 10 and the underlying Regulation S-K Items have evolved materially. Risk Factors disclosure (Item 1A in current form) was added as an explicit Item only after the SEC's 2005 Securities Offering Reform, which required plain-English risk-factor disclosure in annual reports and registration statements; earlier 10-12G filings typically embed risk-type disclosures inside the Business item rather than breaking them out. Executive compensation disclosure was substantially overhauled in 2006 (Compensation Discussion and Analysis, revised summary compensation table categories, expanded director compensation) and again incrementally with pay-ratio and pay-versus-performance rules, so early-era 10-12G records carry a much simpler compensation section than modern ones. Related-party transaction disclosures (Item 404), director independence disclosures (Item 407), and the cyber-incident and human-capital disclosures added in later Regulation S-K amendments appear only in 10-12G filings post-dating the respective rule effective dates. Exhibit requirements have likewise drifted: the EX-21 subsidiary list and EX-23 auditor consent have been stable, but the scope of EX-10 material-contract filings has been refined by later amendments to Item 601, and hyperlinking of exhibits became mandatory in 2017. Registrants subject to emerging-growth-company, smaller-reporting-company, or investment-company scaling provisions file content that is proportionately reduced relative to the full Form 10 template.

Changes in data format over time

The file-types found in this dataset — TXT, JSON, HTML, XFD, and PDF — reflect the evolution of EDGAR's accepted submission formats across the 1994-to-present window. The earliest 10-12G filings, from May 1994 through the late 1990s, were submitted as ASCII plain-text documents: narrative and tables alike rendered as fixed-width text, often with page-break markers and no embedded formatting. HTML submissions became the predominant format after the SEC's 1999 modernization of EDGAR document-format standards, and by the mid-2000s essentially all 10-12G filings ship as HTM inside an EDGAR SGML wrapper. XFD and PDF appear only as tail-distribution artifacts for a small subset of filings, largely exhibits rendered in those formats by particular filers. Modern HTML bodies are frequently produced through Wdesk / Workiva (identifiable by <!-- Document created using Wdesk --> and <!-- Copyright ... Workiva --> comments inside the HTML) or by legacy EDGAR converters (inline Times New Roman styling and deprecated HTML attributes). Form 10-12G is not an XBRL-tagged form, and both linkToXbrl and dataFiles[] are empty for every record; structured financial data is not part of this collection.

Interpretation notes

Several nuances matter when reading or extracting from a record:

  • Amendments are first-class, standalone records. A 10-12G/A is not automatically bound to its parent 10-12G in the dataset; reconstructing the full effective registration statement for a given issuer generally requires joining the original filing to subsequent amendments by CIK and accession-number chronology and reading each amendment's narrative to determine which portions of the original it supersedes.
  • Incorporation by reference is pervasive. Item 15 exhibit lists frequently point to exhibits filed with other registration statements, earlier 10-12Gs, or periodic reports; the physical presence of EX-... files in the folder is not a complete picture of the registrant's exhibit universe.
  • Filer-chosen filenames are unreliable identifiers. Exhibit identity should be read from the SGML <TYPE> header and from documentFormatFiles[*].type in metadata.json, not from filename patterns — naming is idiosyncratic across filers and filing agents.
  • Size heterogeneity is substantial. Records range from single-HTM amendments of a few kilobytes (common for minor 10-12G/A corrections) to fund and BDC initial filings with twenty or more exhibits and primary-narrative HTMs exceeding a megabyte. Any processing pipeline should expect this spread.
  • Embedded tables dominate several Items. Security ownership, executive compensation, capitalization, selected financial data, and the financial statements themselves are delivered as HTML tables within the primary form; their extraction requires table-aware parsing rather than plain text extraction.
  • Entity-block SIC, state, and ticker fields are optional. Some filers omit SIC code, state of incorporation, or ticker symbols at the EDGAR header level even when those facts are discoverable inside the narrative; downstream consumers should not assume completeness of the entities[] sub-fields. irsNo is populated with "000000000" rather than omitted when the filer supplies no IRS Employer ID.
  • The SGML wrapper is the authoritative document-type signal. Where an HTML body's visible header and the EDGAR type disagree (unusual but possible when filers mislabel a description), the <TYPE> tag and the corresponding manifest entry are the authoritative classification of what the sub-document actually is.
  • Three recurring record shapes. Records tend to fall into one of three shapes: small operating-company initial filings (primary form plus two or three EX-3 charter exhibits, with any graphics only referenced in metadata); large fund or BDC initial filings (primary form plus a long exhibit tail spanning EX-3, EX-4, EX-10, EX-21); and amendment-only packages (a single amended primary-form HTM with prior exhibits incorporated by reference).

Who Files or Publishes This Dataset, and When

Filer population

The filer on a Form 10-12G is always the issuer itself, registering one of its own equity security classes under Section 12(g). Typical filers:

  • Mandatory filers (threshold-crossing). Domestic operating companies whose total assets and record-holder counts have crossed Section 12(g)(1) / Rule 12g-1 limits but whose shares are not listed on a national exchange. This includes OTC-quoted companies, companies with widely dispersed employee equity, and companies that acquired a large holder base through stock plans, secondary trading, prior Regulation A or Regulation Crowdfunding offerings, or corporate conversions.
  • Spin-offs and carve-outs without listing. Holding companies or subsidiaries that, through reorganization or separation, end up with their own equity class requiring independent Section 12(g) registration.
  • Bank holding companies and savings and loan holding companies whose Exchange Act registration is administered by the SEC. (Insured depository institutions for which Section 12(i) shifts registration to the FDIC, OCC, or Federal Reserve do not appear in this dataset.)
  • Former exemption users. Issuers exiting Regulation A, Regulation Crowdfunding, or the Rule 12g3-2(b) foreign private issuer exemption by moving to full Exchange Act registration.
  • Voluntary filers. Issuers that elect Exchange Act reporting without being required to, typically to qualify for OTC Markets tiers requiring current reporting, to support employee stock plans, to prepare for a later Nasdaq or NYSE uplisting, or to establish an acquisition currency, without conducting a concurrent public offering.

Investment companies registered under the Investment Company Act of 1940 do not use Form 10-12G; they register on Form N-8A and related 1940 Act forms. Asset-backed issuers use Form SF-1/SF-3 and Form 10-D.

The Section 12(g) trigger

Section 12(g)(1) requires an issuer engaged in or affecting interstate commerce to register a class of equity securities (other than an exempted security) within 120 days after the last day of the first fiscal year on which it has:

  • total assets exceeding the dollar threshold set by the SEC under Rule 12g-1, and
  • a class of equity security held of record by the number of persons specified in Section 12(g)(1).

Section 12(g) as enacted by the Securities Acts Amendments of 1964 set the thresholds at $1 million in assets and 500 record holders. The SEC raised the asset threshold in stages through Rule 12g-1 ($3 million, $5 million, then $10 million). Subsequent statutory changes:

  • JOBS Act Section 501 (2012) amended Section 12(g)(1) so that, for non-bank issuers, registration is required when the issuer has more than $10 million in total assets and a class of equity security held of record by either 2,000 persons or 500 persons who are not accredited investors.
  • JOBS Act Section 502 excluded from the record-holder count securities held by persons who received them in exempt transactions under an employee compensation plan (now reflected in Rule 12g5-1(a)(7)).
  • Dodd-Frank Section 601 (2010) raised the Section 12(g)(4) deregistration threshold for bank holding companies to 1,200 holders of record.
  • EGRRCPA Section 509 (2018) extended the 2,000-holder registration threshold and 1,200-holder deregistration threshold to savings and loan holding companies, aligning them with banks and bank holding companies.
  • Rule 12g-6 conditionally excludes Regulation Crowdfunding securities from the record-holder count.

Mandatory filers have 120 days after the triggering fiscal year-end to file the 10-12G. Voluntary filers may file at any time.

Form 10 content and submission types

Form 10 is the general Exchange Act registration form; content is governed by its General Instructions, Regulation S-K (non-financial items including business, risk factors, MD&A, executive compensation, beneficial ownership, related-party transactions, and description of securities), and Regulation S-X (audited financial statements prepared to PCAOB standards). The same form is used for two EDGAR submission types:

  • 10-12G when registration is under Section 12(g) (no national exchange listing).
  • Form 10-12B when registration is under Section 12(b) in connection with listing on a national exchange (commonly Nasdaq, NYSE, or NYSE American; typical in spin-offs, carve-outs, and direct listings).

Effectiveness: the 60-day automatic mechanic

A Section 12(g) registration on Form 10 is not declared effective by SEC staff. Under Section 12(g)(1), the registration statement becomes effective automatically 60 days after filing, unless the SEC accelerates effectiveness at the issuer's request or the issuer withdraws the filing. During the 60-day window, the Division of Corporation Finance may issue comments; the issuer typically responds by filing 10-12G/A amendments. Amendments do not, by themselves, re-toll the 60-day statutory clock — the issuer must either consent to delay effectiveness or withdraw and refile to reset it.

On effectiveness, the issuer becomes a Section 13(a) reporting company, subject to Forms 10-K, 10-Q, and 8-K, Section 14 proxy rules, and Section 16 insider reporting (for officers, directors, and 10% beneficial owners). Form 10-12G is a one-time entry point, not a recurring filing.

Form 10-12G/A amendments

10-12G/A records are amendments to a prior Form 10-12G. Common triggers:

  • Responses to SEC staff comments during the 60-day review.
  • Financial statement updates when information becomes stale under Rule 3-12 of Regulation S-X before effectiveness.
  • Revisions to executive compensation, beneficial ownership, or risk factor disclosures.
  • Refiling of corrected or updated exhibits (articles, bylaws, material contracts).
  • Substantive business or capital-structure changes before effectiveness.

Amendments may be filed before or after statutory effectiveness.

Coverage note

Form 10 registration under Section 12(g) has existed since the 1964 Exchange Act amendments, but EDGAR submissions under the 10-12G type begin in May 1994 with the phased EDGAR mandate. Earlier Section 12(g) Form 10 filings exist only in paper and are not part of this dataset.

How This Dataset Differs From Similar Datasets or Filings

Form 10-12G is an Exchange Act class registration under Section 12(g), triggered by size and holder thresholds rather than by an offering or exchange listing. Because it overlaps in name, purpose, and disclosure content with several adjacent regimes, the useful comparisons fall into four groups: sister Exchange Act registrations (10-12B, 8-A), Securities Act offering forms (S-1, 1-A), life-cycle bookends and follow-ons (10-K/10-Q, Form 15), and specialized populations (N-2, 20-F).

Form 10-12B — Section 12(b) registration

  • Who files: domestic issuers registering a class in connection with listing on a national securities exchange; common for spin-offs that will list on NYSE or Nasdaq.
  • Trigger difference: 12(b) exchange listing vs. 12(g) size/holder crossover.
  • Content: essentially identical item set (Reg S-K, Reg S-X, same exhibit schedule). The filing body of a 10-12B and a 10-12G are interchangeable in substance.
  • Right dataset for: exchange-listing registrations and listed-spin-off studies — 10-12B. Non-listed Section 12(g) registrants (OTC spin-offs, holder-count crossings, voluntary reporters) — 10-12G.

Form 8-A — short-form Exchange Act registration

  • Who files: issuers with a current Securities Act registration already on file (typically an S-1 or S-11 IPO, or a Reg A issuer transitioning up), under either 12(b) or 12(g).
  • Trigger difference: 8-A incorporates by reference from a contemporaneous 33 Act filing; 10-12G is used when no such filing exists to lean on.
  • Content: 8-A is a few pages of securities description plus incorporation; 10-12G is a full standalone disclosure package with audited financials.
  • Right dataset for: IPO listing mechanics and the registration step of IPOs — 8-A. Full narrative, risk, and financial content at first Exchange Act entry — 10-12G. "Form 10" in older literature maps to 10-12G (and 10-12B); there is no separate Form 10 dataset on EDGAR.

Form S-1 — Securities Act offering registration

  • Who files: domestic issuers registering an offer and sale of specific securities under the 1933 Act.
  • Trigger difference: S-1 registers an offering and requires SEC declaration of effectiveness; 10-12G registers a class and goes effective automatically 60 days after filing with no offering.
  • Content overlap: both include business, risk factors, MD&A, audited financials, compensation, and exhibits. Divergence: S-1 adds pricing, underwriting, use of proceeds, and dilution; 10-12G omits all offering mechanics and, in spin-off cases, adds reasons-for-registration and distribution-mechanics disclosure.
  • Downstream: S-1 triggers Section 15(d) reporting; 10-12G triggers full Section 12(g) reporting, which also carries proxy (Section 14), insider reporting (Section 16), and Schedule 13D/G obligations.
  • Right dataset for: capital-raising, pricing, and IPO studies — S-1. Entry into Exchange Act reporting without an offering — 10-12G.

Form 10-K / Form 10-Q — post-effectiveness periodic reports

  • Who files: domestic reporting companies after any Exchange Act registration (including 10-12G) becomes effective.
  • Trigger difference: 10-12G is a one-time registration (plus 10-12G/A amendments); 10-K is annual, 10-Q quarterly.
  • Content overlap: 10-K covers similar ground to 10-12G's business and financial sections but refreshes them annually; 10-Q is narrower (interim financials, MD&A).
  • Right dataset for: "day one" disclosure baseline, reasons for registering, predecessor history, and spin-off distribution detail — 10-12G. Ongoing operating and financial trajectory — 10-K/10-Q. The datasets complement rather than substitute.

Form 15 — deregistration / termination of reporting

  • Who files: reporting issuers that have fallen below 12(g) holder thresholds or otherwise qualify to terminate or suspend reporting.
  • Trigger difference: 10-12G starts Exchange Act reporting; Form 15 ends it.
  • Content: Form 15 is a one-page check-box notice; 10-12G is a full disclosure package.
  • Right dataset for: reporting-tenure and exit studies — pair 10-12G (entry) with Form 15 (exit) as natural bookends.

Form N-2 — closed-end funds and BDCs

  • Who files: registered closed-end investment companies and Business Development Companies (BDCs), for 1940 Act registration and Securities Act offerings.
  • Trigger difference: N-2 is investment-company registration and offering disclosure; 10-12G is class registration under the Exchange Act.
  • BDC dual-filing note: BDCs routinely appear in both datasets. Section 54(a)(2) of the 1940 Act requires BDCs to have a class registered under the Exchange Act, so a BDC typically files Form 10-12G for its common stock class and Form N-2 for share offerings and 1940 Act-specific disclosure (investment policy, fee table, senior securities table, asset coverage). Traditional registered closed-end funds file only N-2 and do not appear in 10-12G.
  • Right dataset for: BDC Exchange Act entry and initial class disclosure — 10-12G. BDC or closed-end fund offering terms and investment-company disclosure — N-2.

Form 1-A — Regulation A offering circular

  • Who files: smaller issuers conducting exempt offerings under Regulation A / A+ (Tier 1 or Tier 2).
  • Trigger difference: 1-A is a Securities Act exempt-offering document; 10-12G is Exchange Act class registration. Tier 2 issuers report on 1-K/1-SA/1-U unless they separately register under the Exchange Act.
  • Content: 1-A disclosure is scaled relative to S-1 and 10-12G; audit requirements are lighter (though Tier 2 requires audited financials).
  • Right dataset for: small-issuer capital formation under Reg A — 1-A. The subset of those issuers that later graduate into full Exchange Act reporting — follow with 10-12G or 8-A.

Form 20-F — foreign private issuers

  • Who files: foreign private issuers; Canadian MJDS filers use 40-F instead.
  • Trigger difference: 20-F is dual-purpose — it serves as both the initial Section 12(b)/12(g) registration and the annual report. 10-12G is registration-only; annual updating is done on a separate 10-K.
  • Content: 20-F permits IFRS without U.S. GAAP reconciliation; 10-12G financials are U.S. GAAP.
  • Right dataset for: cross-border Exchange Act entry studies of non-U.S. issuers — 20-F. Domestic Section 12(g) registrants — 10-12G.

Boundary summary

The 10-12G dataset is narrow and specific: long-form Section 12(g) Exchange Act class registrations by domestic issuers, filed outside a concurrent Securities Act offering. It is the correct source for studying non-IPO entry into the reporting system (spin-offs onto OTC markets, holder-count crossings, voluntary registrants), first-disclosure baselines for companies that never filed an S-1, BDC Exchange Act registration events, and amendment trails via 10-12G/A. It is the wrong source for exchange-listing registrations (10-12B), IPO short-form listings (8-A), offerings (S-1, 1-A), ongoing reporting (10-K, 10-Q, 8-K), exits (Form 15), foreign issuers (20-F, 40-F), or investment-company offerings (N-2).

Who Uses This Dataset

Form 10-12G is an issuer's first full public disclosure when registering a class of securities under Section 12(g) without a public offering. The dataset serves a narrow but varied user base: legal, regulatory, analytical, transactional, academic, and engineering teams, each working a different slice of the record.

Used as a precedent library when drafting or amending a 10-12G. Attorneys pull filings from issuers of comparable size, industry, and capital structure to benchmark Item 1 business descriptions, Item 1A risk factors, Item 10 securities descriptions (preferred terms, transfer restrictions, anti-takeover provisions), and EX-3 and EX-10 exhibit scopes. 10-12G/A amendments are mined to anticipate staff comments. Outputs: drafted registration statements, comment-response memos, and effectiveness checklists.

SEC Division of Corporation Finance reviewers

Staff attorneys and accountants use historical filings as a reference set when assessing Regulation S-K and S-X compliance. They compare how peer registrants disclosed related-party transactions, going-concern language, revenue recognition, Item 4 ownership tables, and exhibit indexes. Supports reviewer training, comment-letter consistency, and detection of recurring deficiencies.

Equity research analysts covering newly reporting issuers

Fundamental analysts treat the 10-12G as the initiation document for companies that become reporting without a traditional S-1. Item 1 feeds segment and revenue models; Item 1A populates downside frameworks; Item 6 informs incentive-alignment analysis; Item 4 drives free-float and holder estimates; EX-10 material contracts and EX-3 governance documents support initiation reports and valuation memos. BDC and closed-end fund analysts additionally use Item 6 and EX-10 advisory, administration, and credit-facility agreements to benchmark fee structures across the BDC universe.

Private equity and venture capital portfolio monitoring

Used to track portfolio companies taking the direct registration path to public status. Item 4 ownership tables reveal post-registration cap tables, sponsor holdings, and preferred-to-common conversion mechanics; EX-3 charters and EX-10 stockholder agreements document surviving investor rights. Supports competitive monitoring of sponsor-backed issuers entering the reporting population.

Capital markets bankers on alternative go-public pathways

Bankers advising on reverse mergers, direct listings, and Reg A+ follow-ons study how prior registrants sequenced Section 12(g) registration with other transactions, how Item 10 accommodated multi-class structures, and what EX-10 exhibits (merger, share-exchange, lock-up agreements) accompanied registration. Outputs: structuring memos, pitch materials, and effectiveness-timing checklists.

Corporate governance analysts and proxy advisers

Build baseline governance profiles for newly reporting issuers ahead of their first proxy cycle. EX-3 exhibits are coded for classified boards, supermajority votes, exclusive forum clauses, and advance-notice bylaws; Item 6 flags early pay-design signals; Item 4 reveals controlling-shareholder and dual-class dynamics. Feeds governance scoring models and client alerts on new entrants to the reporting universe.

Academic researchers in finance, law, and accounting

Use the full 1994-to-present span as a study sample for work on registration effectiveness, capital-formation pathways, JOBS Act effects on Section 12(g) thresholds, and the private-to-public transition. Cover-page and metadata.json fields build panels of registration dates and effectiveness windows; Item 1 and Item 1A text support industry classification and risk-factor evolution studies; Item 6 supports compensation research. Outputs: working papers, journal articles, and replication archives.

Data engineering and AI/NLP teams

Use the corpus as training and retrieval material for models that parse business descriptions, extract risk factors, classify material contracts, and segment exhibits. metadata.json enables deterministic indexing by CIK, filing date, form type (10-12G vs. 10-12G/A), and document type; TXT, HTML, and PDF variants provide raw content. Workflows: fine-tuning financial LLMs on registration prose, RAG over pre-public disclosure, and entity linking between registrants and their later 10-K, 10-Q, and 8-K filings.

M&A diligence and strategic consulting

Evaluate recently reporting targets or benchmark clients against a peer set. Item 1 informs market positioning; Item 1A and Item 3 surface operational, regulatory, and litigation risks; EX-10 contracts expose customer concentration, supplier dependencies, and financing covenants. Outputs: target-screening memos, competitive landscape decks, and pre-LOI red-flag reports.

RegTech and compliance-tech product teams

Build registration-drafting assistants, exhibit trackers, disclosure-control platforms, and Section 16 readiness tools. Historical exhibit indexes and item-level structures train template libraries and recommendation engines; metadata drives filing-event detection features.

Credit analysts and private-debt teams

When a borrower or prospect registers under Section 12(g), analysts pull financial statements, Item 1A liquidity and going-concern language, and EX-10 credit agreements, indentures, security agreements, and guarantees. Outputs: updated credit memos covering capital structure, reporting obligations, and expected disclosure cadence.

Specific Use Cases

The following workflows are distinctive to Form 10-12G: they exploit that a 10-12G is an issuer's first full Exchange Act disclosure filed without an accompanying offering document, and that the dataset's span from May 1994 to the present includes a large BDC sub-population and a long amendment tail.

1. Assembling a day-one disclosure baseline for non-IPO reporters

Build a panel of first-disclosure records for issuers that entered Exchange Act reporting through spin-off, holder-count crossover, or voluntary registration rather than an S-1. Join metadata.json fields (entities[].cik, filedAt, fileNo, sic, stateOfIncorporation) across all 10-12G records, then extract Item 1 (Business), Item 1A (Risk Factors), and Item 13 (Financial Statements) from the primary HTM to create a "t=0" reference set that can be chained against later 10-K, 10-Q, and 8-K filings by CIK. Supports event studies of post-registration performance and risk-factor drift.

2. Benchmarking BDC advisory-fee and incentive-fee terms

Filter the corpus to Business Development Companies (the large BDC sub-population that must file 10-12G under 1940 Act Section 54(a)(2)), then extract each record's EX-10 investment advisory agreements, sub-advisory agreements, and administration agreements. Parse the fee schedules, hurdle rates, catch-up provisions, and expense-reimbursement caps into a structured comparables table. Output: a BDC fee-terms benchmark used by fund analysts, LP due-diligence teams, and advisers pricing new BDC launches.

3. Mining 10-12G/A amendment chains to anticipate staff comments

Group amendments to each parent 10-12G by CIK and accession chronology, then diff the amended primary-form HTM against the prior version to isolate what changed at each amendment stage. Because Form 10-12G goes effective automatically at 60 days unless amended, amendment patterns are a proxy for SEC staff feedback. Output: a corpus of "comment-response signals" for securities counsel training registration-drafting assistants and building effectiveness-timing models.

4. Coding charter-level governance provisions at reporting entry

Extract EX-3.1, EX-3.2, and EX-3.3 (Certificate or Articles of Incorporation, LP agreements, bylaws) from every 10-12G record and tag each for classified boards, supermajority thresholds, exclusive forum clauses, dual-class voting, advance-notice bylaws, and blank-check preferred authorizations. Combined with Item 4 ownership tables, this produces a governance-profile dataset for every new entrant to the reporting universe, ready for ISS/Glass Lewis-style scoring ahead of the issuer's first proxy cycle.

5. Building a registration-drafting precedent library with exhibit-type retrieval

Index every sub-document by the SGML <TYPE> wrapper value (10-12G, EX-3.1, EX-10.13, etc.) rather than by filer-chosen filenames, then build a retrieval layer keyed on form type, SIC, state of incorporation, and exhibit type. Securities counsel query "EX-10 placement-agent agreements from REIT 10-12Gs filed after 2020" or "Item 11 securities descriptions for multi-class LP interests" and retrieve precedent passages. Feeds drafting assistants, clause banks, and comment-response memo templates.

6. Training exhibit-classification and Item-segmentation models

Use the type-tagged HTM files as labeled training data for NLP models that segment long registration statements into Items 1 through 15 and classify exhibits into the EX-3, EX-4, EX-10, EX-21, EX-23, and EX-99 families. The SGML <TYPE> header provides exhibit labels without manual annotation; the numbered Item headings in the primary form provide within-document section labels. Output: Item-aware parsers and exhibit classifiers deployable across 10-12G, 10-12B, S-1, and 10-K corpora.

7. Pairing 10-12G entry records with Form 15 exits for reporting-tenure studies

Join each 10-12G record by CIK to the issuer's eventual Form 15 (termination of Exchange Act reporting) to measure reporting tenure, reasons for exit, and lifecycle survival. Item 1 and Item 1A text from the 10-12G characterize the issuer at entry; the Form 15 check-box notice characterizes the exit state. Output: survival-analysis panels for academic work on Section 12(g) threshold changes (including JOBS Act effects) and for RegTech products that forecast deregistration likelihood.

Dataset Access

The Form 10-12G Files Dataset covers filings from May 1994 through the present and is distributed as monthly ZIP containers enumerated in a single dataset index. The dataset can be accessed in three ways: via the dataset index JSON API, as a full dataset archive, or by downloading individual container files.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-1012g-files.json

This endpoint returns the dataset metadata and the complete list of available container files. The metadata includes the dataset name, description, last updated timestamp, earliest sample date, total record count and total size, form types covered (10-12G and 10-12G/A), the container format (ZIP), and the content file types (TXT, JSON, HTML, XFD, PDF). Each container entry lists its download URL, relative key, size, record count, and last updated timestamp. Poll this endpoint to monitor which containers have been refreshed in the latest run and to decide which ones to download incrementally.

This endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-6937-8ddb-2b6a7f42d7bc",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-1012g-files.zip",
4 "name": "Form 10-12G Files Dataset",
5 "updatedAt": "2026-04-15T06:13:33.839Z",
6 "earliestSampleDate": "1994-05-01",
7 "totalRecords": 47906,
8 "totalSize": 1524182797,
9 "formTypes": ["10-12G", "10-12G/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "XFD", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-1012g-files/2026/2026-04.zip",
15 "key": "2026/2026-04.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-15T06:13:33.839Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-1012g-files.zip?token=YOUR_API_KEY

Downloads the complete dataset as a single ZIP archive containing all monthly containers from May 1994 to the present. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-1012g-files/2026/2026-04.zip?token=YOUR_API_KEY

Downloads a single monthly container ZIP instead of the full dataset. Use the downloadUrl values returned by the dataset index JSON API to select specific months. This endpoint requires an API key.

Frequently Asked Questions

What form does this dataset cover?

The dataset covers EDGAR submissions of Form 10-12G — the general-purpose registration statement used to register a class of equity securities under Section 12(g) of the Securities Exchange Act of 1934 — together with its amendments, submission type 10-12G/A. Both submission types are filed under the same Form 10 template and governed by Regulation S-K and Regulation S-X.

What does one record in this dataset represent?

One record represents a single EDGAR submission: either one initial Form 10-12G registration statement or one 10-12G/A amendment. Each record is a dedicated filing folder named by the 18-digit dashless SEC accession number, containing exactly one metadata.json header plus one or more SGML-wrapped HTM sub-documents (the primary form and attached exhibits).

Who is required to file Form 10-12G?

The filer is always the issuer of the class of equity securities being registered. Mandatory filers are domestic issuers (including bank and savings-and-loan holding companies administered by the SEC) that have crossed the Section 12(g) thresholds — currently more than $10 million in total assets and either 2,000 record holders or 500 non-accredited record holders for non-bank issuers — but are not listed on a national exchange. Issuers may also file voluntarily to enter Exchange Act reporting without being required to, for example to qualify for OTC Markets tiers, support employee stock plans, or prepare for a future exchange uplisting.

When does a Form 10-12G become effective?

A Form 10-12G is not declared effective by SEC staff; under Section 12(g)(1) it becomes effective automatically 60 days after filing unless the SEC accelerates effectiveness or the issuer withdraws the filing. During the 60-day window, the Division of Corporation Finance may issue comments, which issuers typically answer by filing 10-12G/A amendments.

How does this dataset differ from a Form S-1 dataset?

Form S-1 registers an offer and sale of specific securities under the Securities Act of 1933 and requires SEC declaration of effectiveness; Form 10-12G registers a class of securities under the Exchange Act, involves no offering mechanics, and goes effective automatically after 60 days. The two forms overlap heavily on business, risk-factor, MD&A, compensation, and audited-financial content, but S-1 adds pricing, underwriting, use of proceeds, and dilution disclosure, while 10-12G omits all offering mechanics and is the correct source for non-IPO entry into the reporting system.

How does Form 10-12G differ from Form 10-12B and Form 8-A?

All three are Exchange Act registration forms, but they map to different pathways. Form 10-12B uses the same Form 10 content as 10-12G but registers under Section 12(b) in connection with listing on a national securities exchange. Form 8-A is a short-form registration available to issuers that already have a current Securities Act registration (such as an S-1 IPO) to incorporate by reference from, whereas Form 10-12G is the full standalone disclosure package used when no such Securities Act filing exists to lean on.

What file format is the dataset distributed in?

The dataset is distributed as monthly ZIP containers. Inside each container, every filing is stored as a folder named by the accession number and contains a metadata.json header plus SGML-wrapped HTM sub-documents. Content file types that may appear across the historical span are TXT, JSON, HTML, XFD, and PDF; Form 10-12G is not an XBRL-tagged form, so no XBRL or XML structured-data files are included.

What time period does the dataset cover?

The dataset covers Form 10-12G and 10-12G/A submissions on EDGAR starting in May 1994 — when the phased EDGAR mandate began accepting such filings electronically — through the present. Earlier Section 12(g) Form 10 filings exist only in paper and are not part of this dataset.