Form 1-A Files Dataset

The Form 1-A Files Dataset is a complete, monthly-refreshed corpus of EDGAR offering-statement submissions filed under Regulation A — both initial Form 1-A filings and Form 1-A/A pre-qualification and post-qualification amendments. Each record represents one EDGAR submission identified by its 18-digit accession number and bundles the dataset header, the structured Form 1-A cover-page XML, the XSL-rendered cover view, the narrative offering circular, and every text-bearing exhibit attached to the submission. Records are filed by the issuer itself — early- and growth-stage operating companies, real-estate sponsors, special-purpose vehicles, non-listed REITs, and eligible Canadian issuers — seeking SEC qualification before any sales of securities under Reg A. Electronic Form 1-A coverage on EDGAR begins in January 2002, with filing volume heavily concentrated after the Reg A+ rewrite (Release 33-9741) took effect on June 19, 2015. The dataset is distributed as monthly ZIP containers covering form types 1-A and 1-A/A, with file payloads in XML, HTML, JSON, TXT, and PDF.

Update Frequency
Daily
Updated at
2026-05-19
Earliest Sample Date
2002-01-01
Total Size
2.6 GB
Total Records
57,065
Container Format
ZIP
Content Types
XML, HTML, JSON, TXT, PDF
Form Types
1-A, 1-A/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

293 files · 2.6 GB
Download All
2026-05.zip4.7 MB228 records
2026-04.zip6.5 MB347 records
2026-03.zip5.2 MB281 records
2026-02.zip4.0 MB129 records
2026-01.zip5.7 MB212 records
2025-12.zip67.4 MB288 records
2025-11.zip2.9 MB154 records
2025-10.zip4.6 MB125 records
2025-09.zip10.9 MB494 records
2025-08.zip7.2 MB295 records
2025-07.zip7.9 MB382 records
2025-06.zip8.3 MB416 records
2025-05.zip7.5 MB315 records
2025-04.zip8.1 MB361 records
2025-03.zip11.6 MB368 records
2025-02.zip84.1 MB384 records
2025-01.zip6.8 MB302 records
2024-12.zip9.1 MB390 records
2024-11.zip8.0 MB370 records
2024-10.zip13.2 MB357 records
2024-09.zip6.3 MB313 records
2024-08.zip12.0 MB410 records
2024-07.zip30.1 MB296 records
2024-06.zip25.8 MB355 records
2024-05.zip6.3 MB296 records
2024-04.zip5.4 MB251 records
2024-03.zip13.6 MB312 records
2024-02.zip7.9 MB321 records
2024-01.zip6.7 MB214 records
2023-12.zip7.7 MB350 records
2023-11.zip29.3 MB363 records
2023-10.zip9.3 MB349 records
2023-09.zip6.4 MB265 records
2023-08.zip8.9 MB426 records
2023-07.zip8.1 MB326 records
2023-06.zip12.1 MB572 records
2023-05.zip9.8 MB515 records
2023-04.zip11.6 MB523 records
2023-03.zip14.5 MB663 records
2023-02.zip12.3 MB573 records
2023-01.zip17.1 MB700 records
2022-12.zip17.5 MB716 records
2022-11.zip32.7 MB816 records
2022-10.zip13.6 MB648 records
2022-09.zip13.2 MB548 records
2022-08.zip20.8 MB550 records
2022-07.zip9.5 MB555 records
2022-06.zip20.5 MB527 records
2022-05.zip9.7 MB515 records
2022-04.zip7.6 MB411 records
2022-03.zip12.3 MB651 records
2022-02.zip11.3 MB507 records
2022-01.zip18.7 MB558 records
2021-12.zip36.2 MB805 records
2021-11.zip15.3 MB783 records
2021-10.zip15.7 MB591 records
2021-09.zip38.2 MB814 records
2021-08.zip37.2 MB645 records
2021-07.zip28.2 MB700 records
2021-06.zip20.1 MB737 records
2021-05.zip26.6 MB630 records
2021-04.zip27.3 MB551 records
2021-03.zip104.3 MB961 records
2021-02.zip31.2 MB721 records
2021-01.zip99.8 MB784 records
2020-12.zip72.2 MB754 records
2020-11.zip45.4 MB587 records
2020-10.zip23.2 MB769 records
2020-09.zip33.0 MB653 records
2020-08.zip130.5 MB664 records
2020-07.zip15.5 MB725 records
2020-06.zip72.1 MB765 records
2020-05.zip19.6 MB644 records
2020-04.zip6.8 MB380 records
2020-03.zip11.7 MB612 records
2020-02.zip10.9 MB480 records
2020-01.zip11.2 MB386 records
2019-12.zip14.2 MB444 records
2019-11.zip11.8 MB508 records
2019-10.zip15.1 MB658 records
2019-09.zip18.8 MB498 records
2019-08.zip20.0 MB531 records
2019-07.zip20.4 MB718 records
2019-06.zip22.1 MB451 records
2019-05.zip9.0 MB427 records
2019-04.zip10.3 MB581 records
2019-03.zip9.0 MB505 records
2019-02.zip4.7 MB294 records
2019-01.zip3.4 MB186 records
2018-12.zip17.4 MB331 records
2018-11.zip5.3 MB283 records
2018-10.zip15.1 MB352 records
2018-09.zip5.7 MB335 records
2018-08.zip18.8 MB528 records
2018-07.zip9.4 MB459 records
2018-06.zip7.0 MB438 records
2018-05.zip6.3 MB339 records
2018-04.zip2.5 MB133 records
2018-03.zip10.6 MB372 records
2018-02.zip12.0 MB370 records
2018-01.zip27.4 MB363 records
2017-12.zip9.8 MB531 records
2017-11.zip11.8 MB341 records
2017-10.zip8.0 MB335 records
2017-09.zip9.9 MB553 records
2017-08.zip7.5 MB390 records
2017-07.zip5.2 MB303 records
2017-06.zip10.2 MB284 records
2017-05.zip23.2 MB209 records
2017-04.zip4.5 MB159 records
2017-03.zip5.6 MB254 records
2017-02.zip28.8 MB346 records
2017-01.zip39.1 MB423 records
2016-12.zip16.2 MB440 records
2016-11.zip76.4 MB411 records
2016-10.zip99.9 MB319 records
2016-09.zip70.0 MB374 records
2016-08.zip40.8 MB423 records
2016-07.zip49.8 MB306 records
2016-06.zip37.3 MB282 records
2016-05.zip5.4 MB282 records
2016-04.zip6.0 MB223 records
2016-03.zip16.1 MB380 records
2016-02.zip5.4 MB187 records
2016-01.zip10.1 MB302 records
2015-12.zip5.7 MB284 records
2015-11.zip3.4 MB149 records
2015-10.zip17.9 MB187 records
2015-09.zip6.1 MB237 records
2015-08.zip7.1 MB210 records
2015-07.zip903.0 KB56 records
2015-06.zip488.2 KB47 records
2015-05.zip22 B0 records
2015-04.zip22 B0 records
2015-03.zip22 B0 records
2015-02.zip22 B0 records
2015-01.zip22 B0 records
2014-12.zip22 B0 records
2014-11.zip22 B0 records
2014-10.zip22 B0 records
2014-09.zip22 B0 records
2014-08.zip22 B0 records
2014-07.zip22 B0 records
2014-06.zip22 B0 records
2014-05.zip22 B0 records
2014-04.zip22 B0 records
2014-03.zip22 B0 records
2014-02.zip22 B0 records
2014-01.zip22 B0 records
2013-12.zip22 B0 records
2013-11.zip22 B0 records
2013-10.zip22 B0 records
2013-09.zip22 B0 records
2013-08.zip22 B0 records
2013-07.zip22 B0 records
2013-06.zip22 B0 records
2013-05.zip22 B0 records
2013-04.zip22 B0 records
2013-03.zip22 B0 records
2013-02.zip22 B0 records
2013-01.zip22 B0 records
2012-12.zip22 B0 records
2012-11.zip22 B0 records
2012-10.zip22 B0 records
2012-09.zip22 B0 records
2012-08.zip22 B0 records
2012-07.zip22 B0 records
2012-06.zip22 B0 records
2012-05.zip22 B0 records
2012-04.zip22 B0 records
2012-03.zip22 B0 records
2012-02.zip22 B0 records
2012-01.zip22 B0 records
2011-12.zip22 B0 records
2011-11.zip22 B0 records
2011-10.zip22 B0 records
2011-09.zip22 B0 records
2011-08.zip22 B0 records
2011-07.zip22 B0 records
2011-06.zip22 B0 records
2011-05.zip22 B0 records
2011-04.zip22 B0 records
2011-03.zip22 B0 records
2011-02.zip22 B0 records
2011-01.zip22 B0 records
2010-12.zip22 B0 records
2010-11.zip22 B0 records
2010-10.zip22 B0 records
2010-09.zip22 B0 records
2010-08.zip22 B0 records
2010-07.zip22 B0 records
2010-06.zip22 B0 records
2010-05.zip22 B0 records
2010-04.zip22 B0 records
2010-03.zip22 B0 records
2010-02.zip22 B0 records
2010-01.zip22 B0 records
2009-12.zip22 B0 records
2009-11.zip22 B0 records
2009-10.zip22 B0 records
2009-09.zip22 B0 records
2009-08.zip22 B0 records
2009-07.zip22 B0 records
2009-06.zip22 B0 records
2009-05.zip22 B0 records
2009-04.zip22 B0 records
2009-03.zip22 B0 records
2009-02.zip22 B0 records
2009-01.zip22 B0 records
2008-12.zip22 B0 records
2008-11.zip22 B0 records
2008-10.zip22 B0 records
2008-09.zip22 B0 records
2008-08.zip22 B0 records
2008-07.zip22 B0 records
2008-06.zip22 B0 records
2008-05.zip22 B0 records
2008-04.zip22 B0 records
2008-03.zip22 B0 records
2008-02.zip22 B0 records
2008-01.zip22 B0 records
2007-12.zip22 B0 records
2007-11.zip22 B0 records
2007-10.zip22 B0 records
2007-09.zip22 B0 records
2007-08.zip22 B0 records
2007-07.zip22 B0 records
2007-06.zip22 B0 records
2007-05.zip22 B0 records
2007-04.zip22 B0 records
2007-03.zip22 B0 records
2007-02.zip22 B0 records
2007-01.zip22 B0 records
2006-12.zip22 B0 records
2006-11.zip22 B0 records
2006-10.zip22 B0 records
2006-09.zip22 B0 records
2006-08.zip22 B0 records
2006-07.zip22 B0 records
2006-06.zip22 B0 records
2006-05.zip22 B0 records
2006-04.zip22 B0 records
2006-03.zip22 B0 records
2006-02.zip22 B0 records
2006-01.zip22 B0 records
2005-12.zip22 B0 records
2005-11.zip22 B0 records
2005-10.zip22 B0 records
2005-09.zip22 B0 records
2005-08.zip22 B0 records
2005-07.zip22 B0 records
2005-06.zip22 B0 records
2005-05.zip22 B0 records
2005-04.zip22 B0 records
2005-03.zip22 B0 records
2005-02.zip22 B0 records
2005-01.zip22 B0 records
2004-12.zip22 B0 records
2004-11.zip22 B0 records
2004-10.zip22 B0 records
2004-09.zip22 B0 records
2004-08.zip22 B0 records
2004-07.zip22 B0 records
2004-06.zip22 B0 records
2004-05.zip22 B0 records
2004-04.zip22 B0 records
2004-03.zip22 B0 records
2004-02.zip22 B0 records
2004-01.zip22 B0 records
2003-12.zip22 B0 records
2003-11.zip22 B0 records
2003-10.zip22 B0 records
2003-09.zip22 B0 records
2003-08.zip22 B0 records
2003-07.zip22 B0 records
2003-06.zip22 B0 records
2003-05.zip22 B0 records
2003-04.zip22 B0 records
2003-03.zip22 B0 records
2003-02.zip22 B0 records
2003-01.zip22 B0 records
2002-12.zip22 B0 records
2002-11.zip22 B0 records
2002-10.zip22 B0 records
2002-09.zip22 B0 records
2002-08.zip22 B0 records
2002-07.zip22 B0 records
2002-06.zip22 B0 records
2002-05.zip22 B0 records
2002-04.zip22 B0 records
2002-03.zip22 B0 records
2002-02.zip22 B0 records
2002-01.zip22 B0 records

What This Dataset Contains

The dataset assembles, on a per-submission basis, every Form 1-A and Form 1-A/A filing made on EDGAR. Form 1-A is the offering statement prescribed by Regulation A under Section 3(b) of the Securities Act of 1933. It functions as a quasi-prospectus: it must qualify with the SEC before sales are permitted, and it carries substantive disclosures about the issuer, the securities, the terms of the offering, the use of proceeds, the risks, the management, and supporting financials. Modern Reg A is two-tiered: Tier 1 covers offerings up to $20 million in any rolling 12-month period and remains subject to state blue-sky review with no audit requirement; Tier 2 covers offerings up to $75 million, requires audited GAAP financial statements, preempts state review, and triggers ongoing reporting on Forms 1-K, 1-SA, and 1-U. A Form 1-A/A amends a previously filed offering statement under the same 024-##### offering file number (distinct from the 333-##### series used for full registration).

Internally an EDGAR Form 1-A submission is a hybrid of (a) a structured cover-page XML produced through the EDGAR Online Forms "Reg A Filer" workflow and (b) a set of attached HTML documents containing the actual narrative and legal content. The XML carries the machine-readable issuer/offering data; the HTML attachments carry the prose, tables, signatures, and legal text. The dataset preserves both layers as separate files and ships records inside monthly ZIP containers keyed YYYY-MM/, packaged as form-1a-files/YYYY/YYYY-MM.zip. Coverage runs from January 2002 to present and spans the pre-2015 legacy Reg A regime, the 2015 Reg A+ rewrite, and the 2021 amendment that raised the Tier 2 cap from $50 million to $75 million.

Content Structure of a Single Record

What one record represents

One record in the Form 1-A Files Dataset is a single EDGAR offering-statement submission filed under Regulation A — either an original Form 1-A (initial offering statement) or a Form 1-A/A (pre-qualification amendment) — identified by its 18-digit accession number. A record is not a single document but the full bundle EDGAR received as one submission: the EDGAR-generated dataset header, the structured Form 1-A cover-page XML, the XSL-rendered view of that cover page, the narrative offering circular, and every text-bearing exhibit attached to the submission. On disk the record is one folder named with the dash-stripped accession number (e.g. 000149315225025045 for accession 0001493152-25-025045), placed inside a monthly ZIP keyed YYYY-MM/ and shipped as form-1a-files/YYYY/YYYY-MM.zip. Every file in that folder, taken together, is the record.

Container and folder layout

Within the monthly ZIP, each record occupies one folder whose name is the accession number with all dashes removed. Representative tree:

1 form-1a-files/2025/2025-11.zip
2 └── 2025-11/
3 ├── 000149315225025045/
4 │ ├── metadata.json
5 │ ├── primary_doc.xml
6 │ ├── partiiandiii.htm
7 │ ├── ex1-1.htm
8 │ ├── ex2-2.htm
9 │ ├── ex3-1.htm
10 │ ├── ex3-2.htm
11 │ ├── ex8-1.htm
12 │ ├── ex11-1.htm
13 │ ├── ex11-2.htm
14 │ ├── ex12-1.htm
15 │ ├── ex17-1.htm
16 │ └── xsl1-A_X01/
17 │ └── primary_doc.xml
18 └── ...

Five file roles populate that folder:

FileFormatRole
metadata.jsonJSONDataset-produced submission header (always present)
primary_doc.xmlEDGAR XML (oneafiler schema)Structured Form 1-A cover page (Issuer Information / Part I)
xsl1-A_X01/primary_doc.xmlXHTML 1.0 StrictXSL-rendered human-readable view of the cover page
partiiandiii.htm (or filer-slug variant)SGML-wrapped HTMLOffering circular: Part II narrative plus Part III exhibit index
ex<N>[-<M>].htmSGML-wrapped HTMLOne file per attached exhibit

metadata.json is the one component never absent. primary_doc.xml and its XSL render appear in essentially every modern submission. The narrative offering-circular file and the exhibit set vary in count from a single document (a minimal cover-only amendment) to several dozen (a fully exhibited initial filing).

metadata.json — the dataset header

The dataset's normalized header for the submission, derived from the EDGAR submission index. Fields:

  • formType"1-A" or "1-A/A".
  • accessionNo — dashed accession (e.g. "0001493152-25-025045").
  • description — EDGAR human-readable form description (e.g. "Form 1-A/A - Offering Statement [Regulation A]: [Amend]").
  • filedAt — ISO-8601 timestamp with timezone.
  • linkToFilingDetails, linkToTxt, linkToHtml, linkToXbrl — canonical sec.gov URLs for the rendered primary doc, the SGML full-submission text file, the EDGAR filing-index page, and any XBRL data (empty for Reg A).
  • documentFormatFiles[] — one element per document in the EDGAR submission, each with sequence, size, documentUrl, optional description, and type (e.g. "1-A/A", "PART II AND III", "ADD EXHB", "GRAPHIC"). The terminating element with empty sequence/type and description: "Complete submission text file" references the SGML .txt wrapper.
  • entities[] — filer/issuer records, each with companyName, cik, irsNo, fileNo (the 024-##### Reg A offering file number), filmNo, type, act (always "33" for Reg A), sic with description, stateOfIncorporation, fiscalYearEnd (MMDD), and optional tickers[].
  • seriesAndClassesContractsInformation[] — typically empty for Form 1-A.
  • dataFiles[] — typically empty (Reg A does not carry XBRL data files).
  • id — opaque dataset record identifier.

primary_doc.xml — the structured Form 1-A cover page

An XML instance against the http://www.sec.gov/edgar/rega/oneafiler namespace. Root element <edgarSubmission> splits into <headerData> and <formData>.

<headerData> carries submission-level routing fields: submissionType, and <filerInfo> containing the issuer's CIK/CCC, liveTestFlag, and a <filer> block holding <issuerCredentials> and <issuerCik>. The Reg A offering file number is exposed at the form level (see below) rather than as a header attribute.

<formData> carries the substantive cover-page disclosures, organized into the following sub-blocks (note the schema's idiosyncratic spelling — juridication instead of jurisdiction — appears in element names exactly as shown):

  • <issuerInfo> — issuer name, address, contact telephone, jurisdiction of organization, year of incorporation, primary SIC, IRS employer-ID, full-time / part-time employee headcounts, the offering file number, and a balance-sheet/income-statement snapshot covering cash, accounts receivable, PP&E, total assets, accounts payable, long-term debt, total liabilities, stockholder equity, total revenues, total expenses, depreciation and amortization, net income, EPS basic and diluted, and the auditor's name.
  • <commonEquity>, <preferredEquity>, <debtSecurities> (each repeatable) — class name, outstanding amount, CUSIP, and listing venue for each class of issued security.
  • <issuerEligibility>, <applicationRule262> — yes/no certifications that the issuer is eligible under Rule 251 and not subject to the bad-actor disqualifications of Rule 262.
  • <summaryInfo> — the offering-summary block: indicateTier1Tier2Offering, financialStatementAuditStatus, types of securities offered, securitiesOffered count, pricePerSecurity, issuerAggregateOffering, totalAggregateOffering, named service providers (sales-commissions broker, finders, auditors, legal counsel) with their fees and CRD numbers, and estimatedNetAmount to the issuer.
  • <juridictionSecuritiesOffered> — repeating two-letter codes for both issueJuridicationSecuritiesOffering (states where the issuer offers directly) and dealersJuridicationSecuritiesOffering (states where dealer activity is contemplated). Tier 2 filings often list all 50 states plus DC and PR; Tier 1 filings tend to list only the states actually qualified.
  • <securitiesIssued>, <unregisteredSecuritiesAct> — disclosure of prior unregistered issuances within the lookback window and the exemption(s) relied upon.

This XML is the structured machine-readable counterpart to the cover-page items of the form. It does not contain the offering-circular narrative; that lives in the HTML attachment.

xsl1-A_X01/primary_doc.xml — the rendered cover page

The EDGAR XSL-rendered HTML view of the structured primary_doc.xml. Despite the .xml extension and the path under xsl1-A_X01/, this file is XHTML 1.0 Strict (declared via <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" ...>) styled with /css/REGA_1A_print.css. It presents the structured fields as a tabular cover page and adds no information beyond what is already in the XML, but it is the rendered form most users see on the EDGAR website. It is the only HTML-flavored file in the record that is not SGML-wrapped.

The offering circular (partiiandiii.htm and variants)

The narrative body of the filing — the offering circular itself — is delivered as a single SGML-wrapped HTML document, conventionally named partiiandiii.htm because it embodies both Part II (the prospectus-like Offering Circular) and Part III (the Index to Exhibits). Some filers substitute a custom filename formed as <issuer-prefix>_1a.htm or <issuer-prefix>_1aa.htm (e.g. glob_1aa.htm, spot_1aa.htm, bdcc_1a.htm, karx_1a.htm, tranquil_1a.htm, alkaline_1a.htm, bhic1a111925.htm).

The narrative typically follows the post-2015 SEC-prescribed Offering Circular item order:

  • Cover page with offering summary, price-per-share, maximum and minimum offering amounts, Tier indicator, and required Reg A legends.
  • Summary of the offering and the issuer.
  • Risk factors.
  • Dilution.
  • Plan of distribution and selling-securityholder details.
  • Use of proceeds to issuer.
  • Description of business.
  • Description of property.
  • Management's discussion and analysis of financial condition and results of operations (MD&A).
  • Directors, executive officers, and significant employees.
  • Compensation of directors and executive officers.
  • Security ownership of management and certain securityholders.
  • Interest of management and others in certain transactions.
  • Securities being offered.
  • Financial statements (Tier 2: audited; Tier 1: reviewed or unaudited; sometimes inline within the offering circular, sometimes referenced and re-attached as an exhibit).
  • Signatures of the issuer, the principal executive officer, principal financial officer, principal accounting officer, and a majority of the board.
  • Part III: Index to Exhibits, listing each exhibit by its prescribed Item 17 numeric category.

Exhibits

Exhibits are attached as separate SGML-wrapped HTML files, one per exhibit. The post-2015 Item 17 exhibit table groups exhibits by numeric category — for example: 1 (underwriting/placement-agent agreements), 2 (charter and bylaws), 3 (instruments defining rights of securityholders), 4 (subscription agreements), 6 (material contracts), 7 (acquisition-related plans), 8 (escrow agreements), 9 (letters re unaudited interim information), 10 (powers of attorney), 11 (consents of experts and auditors), 12 (opinions of legal counsel re legality), 13 (testing-the-waters materials), 15 (additional exhibits), 17 (additional schedules or correspondence).

Three exhibit-filename schemes are observed in the dataset, and they coexist within the same monthly ZIP:

  1. SEC two-level numbering matching the Item 17 index: ex1-1.htm, ex3-2.htm, ex11-1.htm, ex11-2.htm, ex12-1.htm, ex17-1.htm. The first number is the Item 17 category; the second is the within-category sequence.
  2. Flat sequential numbering that ignores the category split: ex1.htm, ex2.htm, … ex31.htm. Position within the sequence does not indicate exhibit type.
  3. Filer-slug variants prefixed with an issuer or ticker token: spot_ex6z3.htm, bdcc_ex41.htm, tranquil_ex0303.htm, bhicex11-1.htm. Some embed the category-and-sequence digits, others do not.

Exhibit content is heterogeneous: charter documents, bylaws, certificates of designation, subscription agreements, escrow agreements, broker/placement-agent agreements, material contracts, employment agreements, consents of independent registered public accounting firms, legal opinions on legality of securities, testing-the-waters communications, and supplemental financial schedules.

The SGML document wrapper inside each .htm

Every .htm file in the record except the XSL render — i.e. the offering circular and every exhibit — is not plain HTML. Each is one <DOCUMENT> segment lifted from the EDGAR full-submission .txt, with the actual HTML payload nested inside <TEXT>...</TEXT> and prefaced by an SGML header:

1 <DOCUMENT>
2 <TYPE>ADD EXHB
3 <SEQUENCE>8
4 <FILENAME>ex11-1.htm
5 <DESCRIPTION>ADD EXHB
6 <TEXT>
7 <HTML>
8 ... (full HTML payload) ...
9 </HTML>
10 </TEXT>
11 </DOCUMENT>

These tags are SGML, not XML: the preamble lines (<TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>) have no closing tags and rely on newline-as-terminator, and <TEXT> is closed by </TEXT> only at the document's end. The <TYPE> value mirrors the documentFormatFiles[].type in metadata.json (e.g. "1-A/A", "PART II AND III", "ADD EXHB"); <SEQUENCE> matches the EDGAR submission ordering. Robust extraction strips the preamble and the <TEXT>/</TEXT> envelope and parses only the inner HTML payload.

File types and formats

The file types found in the dataset are XML, HTML, JSON, TXT, and PDF. In practice:

  • JSON appears as metadata.json only.
  • XML appears as the EDGAR primary_doc.xml only.
  • HTML/HTM is the dominant format for substantive content — used for the XSL render, the offering circular, and every exhibit. Every .htm except the XSL render is SGML-wrapped.
  • TXT is referenced in metadata.json (the EDGAR full-submission .txt) but is not bundled into the ZIP; the per-document SGML segments are already broken out as the individual .htm files.
  • PDF is uncommon for Form 1-A bodies and exhibits; when present it appears as a per-document attachment in place of, or alongside, an HTML version.

What is included and excluded

Included in each record folder:

  • The dataset's metadata.json header.
  • The EDGAR structured primary_doc.xml and its XSL-rendered XHTML view.
  • Every text-bearing document the issuer attached to the submission: the offering circular, all exhibits, and any supplemental narrative.

Excluded from each record folder:

  • Image files (JPG, PNG, GIF) referenced by the filing — issuer logos, photographs, charts, diagrams, and embedded graphics. These are deliberately stripped from the on-disk record even when they appear in metadata.json -> documentFormatFiles with type: "GRAPHIC". A heavily illustrated offering circular may reference dozens of image_###.jpg and <exhibit>_###.jpg files at sec.gov URLs that have no on-disk counterpart. Renderers that need the images must fetch them via the documentUrl of each GRAPHIC entry.
  • The aggregated EDGAR full-submission .txt wrapper. Its content is preserved in decomposed form across the per-document .htm files; the consolidated wrapper itself is referenced by URL in the metadata but not duplicated locally.
  • Externally hosted exhibits or materials that were merely referenced by URL in the offering circular without being filed as attachments.

Structural evolution: pre-2015 vs post-2015 Reg A+

The dataset spans January 2002 through the present, which straddles the most consequential rewrite of Form 1-A in its modern history. On June 19, 2015, the SEC's "Regulation A+" amendments adopted under Title IV of the JOBS Act took effect, replacing Form 1-A in its entirety. Records on either side of that transition differ substantially in content structure:

  • Pre-2015 (legacy Reg A). The offering ceiling was $5 million in a 12-month period and the form was single-tier. Form 1-A was organized into Part I (Notification), Part II (Offering Circular, with two alternative formats — Model A, a question-and-answer template aimed at smaller issuers, and Model B, a free-form narrative more like an S-1 prospectus), and Part III (Exhibits). Required disclosures were materially lighter, financial statements were generally unaudited, and testing-the-waters materials and ongoing reporting were not part of the regime in their modern form. EDGAR uptake of Reg A was low (many years carry only a handful of filings) and the structured cover page used an earlier, simpler XML schema that predates the oneafiler namespace.
  • Post-2015 (Reg A+). Form 1-A was rewritten into the current Part I (Notification, captured in the structured primary_doc.xml under the oneafiler namespace), Part II (a single Offering Circular item set replacing the Model A / Model B fork), and Part III (Index to Exhibits, governed by the Item 17 numeric categories). Two tiers were established — Tier 1 up to $20 million and Tier 2 originally up to $50 million, raised to $75 million effective March 15, 2021. Tier 2 introduced audited-financials requirements, ongoing periodic reporting (Form 1-K annual, Form 1-SA semiannual, Form 1-U current), and federal preemption of state blue-sky review. Testing-the-waters communications acquired a defined exhibit slot (Item 17(13)). Reg A filing volume rose sharply after 2015, so the bulk of records reflect the post-2015 architecture described above.

The summaryInfo/indicateTier1Tier2Offering value, the breadth of juridictionSecuritiesOffered lists, and the presence of testing-the-waters exhibits are all post-2015 artifacts. Pre-2015 records use the older structured-cover layout and a Model A or Model B narrative offering circular.

Interpretation notes

  • Amendments do not replace prior filings on disk. Each Form 1-A/A is its own record under its own accession number; the original 1-A and any 1-A/A amendments must be assembled across multiple records to reconstruct the qualification history of an offering. The shared entities[].fileNo (024-#####) is the join key.
  • The structured XML and the narrative HTML are not redundant. The XML cover page carries machine-readable issuer/offering fields that are not duplicated as structured data anywhere in the offering circular. The offering circular carries the prose disclosures, risk factors, MD&A, signatures, and financial statements that are absent from the XML. Both layers must be parsed for a complete view.
  • The SGML wrapper must be stripped before HTML parsing. Standard HTML parsers will tolerate the leading <DOCUMENT>/<TYPE>/<SEQUENCE>/<FILENAME>/<DESCRIPTION>/<TEXT> lines but will misclassify them as text nodes, polluting downstream extraction. The robust approach is to detect the SGML preamble, extract the substring between <TEXT> and </TEXT>, and parse that as HTML.
  • Image references are dangling. The offering circular and exhibits frequently contain <img src="image_001.jpg"> (and similar) references. These point to files listed in metadata.json -> documentFormatFiles but absent from the ZIP. Resolve them via the sec.gov documentUrl of the matching GRAPHIC entry.
  • Filename heuristics are unreliable for exhibit identification. Because filers use three different naming conventions (SEC two-level, flat numeric, issuer-slug), exhibit category cannot be inferred from filename alone. The authoritative source is the <TYPE>/<DESCRIPTION> SGML preamble inside the file paired with the matching documentFormatFiles[] entry in metadata.json.
  • Financial statements may live in either the offering circular or as exhibits. Some issuers embed audited financials inline in partiiandiii.htm; others attach them as separate exhibits. Tier 2 filings carry audited GAAP statements; Tier 1 filings may carry unaudited statements. Extraction logic should look in both locations and disambiguate via exhibit <DESCRIPTION>/<TYPE>.
  • Tier 1 vs Tier 2 is determinative for downstream interpretation. summaryInfo/indicateTier1Tier2Offering drives whether audited financials, ongoing reporting, and federal preemption apply, and it correlates with the financialStatementAuditStatus field, the breadth of the jurisdiction lists, and the offering-amount ceilings. It is the single most important structured field in primary_doc.xml for downstream classification.
  • Pre-2015 records require different parsing assumptions than post-2015 records: legacy Model A vs Model B circular templates, lighter exhibit indexes, and a different XML cover-page schema. Code that assumes the post-2015 oneafiler namespace will not validate against pre-2015 cover pages.

Who Files or Publishes This Dataset, and When

A Form 1-A record is filed on EDGAR by an issuer seeking to qualify a securities offering under Regulation A (Securities Act of 1933, Section 3(b)(2), as added by Title IV of the JOBS Act of 2012). The filer is always the issuer itself; underwriters, placement agents, selling securityholders, and signing officers and directors appear inside the offering statement but do not file it.

Eligible filer population

To be eligible under Rule 251(a), an issuer must be organized in, and have its principal place of business in, the United States or Canada. In practice the eligible filer population includes:

  • Domestic operating companies, typically early- and growth-stage businesses raising outside Section 5 registration.
  • Real-estate sponsors and project vehicles (LLCs, limited partnerships) — a large share of post-2015 Tier 2 filers.
  • Special-purpose vehicles holding a defined asset, business line, or token-related instrument, provided they are not 1940 Act investment companies.
  • Non-listed REITs that are not investment companies and not Exchange Act reporting.
  • Canadian issuers meeting the U.S./Canada incorporation and principal-place-of-business test. No other foreign issuers qualify.

Rule 251(b) categorically excludes:

Exchange Act reporting companies were originally barred but became eligible following the 2018 Economic Growth, Regulatory Relief, and Consumer Protection Act amendment to Rule 251(b)(2).

The two tiers and what each produces

Form 1-A is a single form, but the issuer elects one of two tiers on the cover, and the election controls disclosure depth, state-law preemption, and ongoing reporting:

  • Tier 1 — up to $20 million in any rolling 12 months (no more than $6 million by affiliated selling securityholders). Subject to state blue-sky review in addition to SEC review (typically NASAA coordinated review). Audited financials are not required. No ongoing Reg A reporting beyond a final Form 1-Z.
  • Tier 2 — up to $75 million in any rolling 12 months (no more than $22.5 million by affiliated selling securityholders), reflecting the 2021 amendment that raised the prior $50 million cap. State registration and qualification are preempted under Section 18 of the Securities Act; states retain notice-filing and antifraud authority. Requires audited U.S. GAAP financials (PCAOB standards or AICPA standards). Imposes a per-purchaser investment cap for non-accredited natural persons of 10% of the greater of annual income or net worth, unless the securities are listed at qualification.

Tier election determines whether a filing chain produces downstream Form 1-K, 1-SA, 1-U, and 1-Z filings; those records live in their own form-type populations.

When a record is created

Form 1-A is filed before any sales occur: securities may not be sold in reliance on Regulation A until the SEC issues a notice of qualification under Rule 252(e). Records arise from these triggers:

  • Initial Form 1-A — issuer-driven, with no statutory deadline. Under Rule 252(d), an issuer that has not previously sold securities under Reg A or a Securities Act registration may submit a draft offering statement for non-public review before public filing.
  • Pre-qualification 1-A/A amendments — filed in response to SEC staff comments, to update stale interim financials, to reflect material developments, or to switch tiers. Each comment round typically produces a 1-A/A.
  • Post-qualification 1-A/A amendments — required for any fundamental change (new security type or terms, new selling securityholders, changed plan of distribution, material business change) and must be re-qualified before sales resume on the new terms.
  • Rule 255 testing-the-waters materials — solicitations of interest used before or after the public filing must be filed as Exhibit 13 to Form 1-A (Item 17). New or revised materials can themselves trigger a 1-A/A.
  • Withdrawal — an abandoned offering is removed by Form RW, which is not part of this dataset but explains chains that never reach qualification.

Timing logic

  • Initial filing tracks the issuer's capital-raising schedule, not a periodic calendar.
  • Qualification occurs upon SEC notice under Rule 252(e).
  • Rule 252(b)(3) / Part F/S financial-statement currency forces refresh amendments: interim financials in a not-yet-qualified offering statement may not be older than nine months at qualification, with the most recent period at least six months old.
  • Rule 251(d)(3) continuous offerings may run up to three years from initial qualification, during which 1-A/A refreshes are routine.

Regulatory framework

  • Securities Act Section 3(b)(2), added by Title IV of the JOBS Act of 2012, authorizes the modern Tier 2 framework.
  • Regulation A, Rules 251–263 govern issuer eligibility (251), offering statement content and qualification mechanics (252), offering circulars and supplements (253), Rule 255 testing-the-waters, ongoing reporting (257), suspension and revocation (258), and bad-actor disqualification (262).
  • Form 1-A itself: Part I (XML cover/notification), Part II (offering circular and financial statements), Part III (exhibits).
  • Release No. 33-9741 (Reg A+ rewrite, effective June 19, 2015) created the two-tier structure, preempted state registration for Tier 2, and introduced the 1-K / 1-SA / 1-U / 1-Z reporting regime.
  • Release No. 33-10884 (2021) raised the Tier 2 cap to $75 million and the affiliated-selling-securityholder cap to $22.5 million.

Important distinctions

  • 1-A/A vs. Rule 253(g) supplement — the boundary that most often causes confusion. Pre-qualification changes and fundamental post-qualification changes go through Form 1-A/A. Non-fundamental updates after qualification (pricing within permitted bounds, minor disclosure updates) go through an offering-circular supplement filed under Rule 253(g) and appear under separate EDGAR types 253G1, 253G2, 253G3, 253G4 — not in this dataset.
  • Filer vs. parties named — only the issuer is the filer. Officers, directors, selling securityholders, promoters, placement agents, and Rule 262 covered persons are disclosed within the filing but do not themselves file it.
  • Bad-actor waivers — an issuer otherwise disqualified under Rule 262 may obtain an SEC waiver; the waiver document itself is not in the dataset.

Historical anchor

Section 3(b) small-offering authority dates to the Securities Act of 1933, and Form 1-A in some form has existed since the SEC's earliest rulemaking. Paper-era Reg A filings predating EDGAR are not in this dataset. Electronic Form 1-A coverage on EDGAR begins in January 2002, with filing volume heavily concentrated after June 19, 2015, when the Reg A+ rewrite under Release 33-9741 made Tier 2 a viable capital-raising channel.

How This Dataset Differs From Similar Datasets or Filings

Form 1-A overlaps with several SEC offering and reporting forms, but each differs on dollar cap, eligibility, audit standard, ongoing-reporting consequences, and the legal mechanics of qualification versus registration. The closest neighbors are full Securities Act registrations (S-1, S-3, F-1), other exempt-offering filings (Form D, Form C), and the surrounding Regulation A family (1-K, 1-SA, 1-U, 1-Z, and 253(g) supplements). Form 10 sits adjacent on the Exchange Act side.

Form S-1 / Form S-1/A. Structurally the nearest cousin: both are pre-offering disclosure documents containing a prospectus or offering circular, risk factors, MD&A, business description, use of proceeds, officer and director disclosure, and financials. Differences:

  • S-1 is full Section 5 registration with no dollar cap. Form 1-A is an exemption capped at $20M (Tier 1) or $75M (Tier 2) per rolling 12 months.
  • S-1 financials must be audited under PCAOB standards. Tier 1 1-A is unaudited; Tier 2 1-A may use PCAOB or AICPA standards.
  • S-1 effectiveness triggers full Exchange Act reporting (10-K, 10-Q, 8-K). Tier 2 1-A issuers report on the lighter 1-K/1-SA/1-U track; Tier 1 issuers typically have no ongoing federal reporting.
  • S-1 is "declared effective"; 1-A is "qualified."
  • Tier 2 1-A preempts state blue-sky review under Section 18 (covered-security treatment). Tier 1 does not preempt and remains subject to state review. S-1 does not preempt unless the security is otherwise covered (e.g., listed on a national exchange).

Form S-3. A streamlined shelf registration limited to seasoned issuers meeting public-float and reporting-history tests, supporting incorporation by reference. Form 1-A has no float test and is not available for shelf takedowns; most 1-A issuers cannot incorporate by reference because they are not Exchange Act reporters. S-3 is a follow-on tool for established public companies; 1-A is an entry path for smaller and emerging issuers.

Form F-1. Foreign analog of S-1, with home-country accounting accommodations (IFRS without reconciliation in many cases). Form 1-A is restricted to issuers organized in and principally based in the U.S. or Canada, so the populations rarely intersect.

Form 10. Registers a class of securities under the Exchange Act, triggering 10-K/10-Q/8-K reporting; it does not register an offering. Form 1-A qualifies the offering itself but does not register a class. The two are sometimes paired when a Tier 2 issuer voluntarily files Form 10 or 8-A to obtain a listing. Form 10 creates a reporting company; Form 1-A creates a qualified offering.

Regulation A companion filings

These are post-qualification complements to Form 1-A, not substitutes:

  • Form 1-K (Tier 2 annual): audited financials, MD&A, business update. Lighter analog of Form 10-K.
  • Form 1-SA (Tier 2 semiannual): unaudited interim financials. Half-year cadence, lighter than Form 10-Q.
  • Form 1-U (Tier 2 current report): narrower trigger set than Form 8-K.
  • Form 1-Z: exit/termination report ending Reg A reporting obligations.
  • Form 253(g)1-4: post-qualification offering-circular supplements under Rule 253(g), updating pricing or sales results without a full post-qualification amendment. Functionally analogous to Rule 424(b) prospectus supplements; they depend on a previously qualified 1-A and contain only the deltas, not the full circular.

A full Reg A lifecycle requires the 1-A (base offering document), 1-A/A (pre-qualification amendments), 253(g) supplements (post-qualification updates), and the 1-K/1-SA/1-U stream until 1-Z exit.

Other exempt-offering filings

Form D (Regulation D). A brief XML notice filing for Rule 504 ($10M cap) and Rule 506(b)/(c) private placements. Differences from Form 1-A:

  • Content: Form D is a short notice (issuer identity, offering size, exemption claimed, basic sales data). Form 1-A is a full offering circular with risk factors, MD&A, and financials.
  • Cap: Rule 506 has no dollar cap; Rule 504 is capped at $10M. Reg A is capped at $20M / $75M.
  • Solicitation: Rule 506(b) and 504 prohibit general solicitation; Rule 506(c) permits it but requires verified accredited investors only. Reg A permits general solicitation to all investors, with Tier 2 imposing per-investor investment limits on non-accredited buyers.
  • Investor eligibility: 506(b) allows up to 35 non-accredited sophisticated investors; 506(c) is accredited-only. Reg A is open to non-accredited retail participation.
  • State law: Tier 2 Reg A preempts state blue-sky under Section 18; Rule 506 offerings are also covered securities and preempted, but Rule 504 is not.

Form D and Form 1-A cover overlapping issuer populations (small private companies raising capital) but are not interchangeable: Form D is metadata; Form 1-A is a full disclosure document.

Form C (Regulation Crowdfunding). Section 4(a)(6) exemption capped at $5M per 12 months, conducted exclusively through registered intermediaries (funding portals or broker-dealers). Like Form 1-A, Form C requires structured disclosure including financials and risk factors, but content is narrower, the cap is far lower, secondary trading is restricted for one year, and the issuer population skews earlier-stage. Form C does not preempt state blue-sky in the same Section 18 sense as Tier 2 Reg A (Reg CF has its own preemption mechanic under Section 4(a)(6)). Form C-AR, Form C-U, and Form C-TR provide ongoing/exit reporting analogous to the 1-K/1-U/1-Z stream.

Boundary summary

Form 1-A is the only SEC offering document that is (a) qualified rather than registered, (b) capped at $20M (Tier 1) or $75M (Tier 2) per 12 months, (c) able to preempt state blue-sky review under Section 18 (Tier 2 only) while permitting both general solicitation and non-accredited retail participation, and (d) followed by a purpose-built lighter ongoing-reporting regime (1-K/1-SA/1-U) rather than the full 10-K/10-Q/8-K stack. It is narrower in cap and lighter in audit and reporting consequences than S-1/F-1; far richer in narrative and financial content than Form D or Form C; and aimed at a different issuer stage and population than S-3 or Form 10. The 1-A/A amendments, 253(g) supplements, and 1-K/1-SA/1-U/1-Z reports are its true companion datasets, together describing a Reg A issuer from initial qualification through exit.

Who Uses This Dataset

Form 1-A offering statements are used as a precedent library, market dataset, diligence file, enforcement record, and research corpus. The roles below anchor on different parts of the filing, including offering-circular sections, exhibits, financial statements, metadata, and amendment chains.

Securities lawyers and disclosure counsel

Issuer's counsel preparing Tier 1 and Tier 2 offerings use the corpus as a drafting precedent. They pull the "Plan of Distribution," "Use of Proceeds," "Risk Factors," and "Business" sections from comparable issuers to benchmark hedge language and operational detail. Placement-agent counsel compare commission structures, lock-ups, and escrow arrangements in exhibits. The 1-A versus 1-A/A diff is the most useful signal: it reveals which disclosures staff push back on, which drives comment-letter prediction on a new filing.

SEC review staff

Corporation Finance reviewers focused on Regulation A use the dataset to prepare comment letters and maintain consistency across the small-issuer pipeline. They focus on financial-statement footnotes, going-concern language, related-party transactions, officer and director disclosures, and material-contract exhibits. Amendment frequency and the substance of corrections feed both individual review and aggregate program assessment.

Investment bankers and Reg A-focused broker-dealers

Capital markets desks build league tables and pricing comps from the dataset. Tier, maximum offering amount, and security-type metadata, combined with cover-page disclosures, set headline economics; the "Plan of Distribution" exhibit reveals selling-agent commissions, expense reimbursements, and warrant coverage. Coverage analysts use the pipeline to identify candidates for follow-on raises, PIPEs, or graduation to S-1.

Online capital formation platforms

Diligence and onboarding teams at funding portals and Tier 2 placement marketplaces vet prospective issuers against prior 1-A filings, financial statements, bad-actor disclosures, and the exhibit index (organizational documents, escrow agreements, subscription documents). Aggregate metrics on tier mix, sector mix, average raise size, and amendment frequency feed product, pricing, and risk dashboards.

Venture and growth equity analysts

Pre-IPO and growth analysts use Form 1-A as a window into otherwise opaque private-company fundamentals. The financial statements, capitalization disclosures, and prior-round narratives in the offering circular support diligence memos on alternative-asset sponsors and consumer brands raising under Reg A. Use-of-proceeds and risk-factor language help classify a raise as a bridge, Series B substitute, or retail-distribution play.

M&A and private-credit deal teams

Corporate development and private-credit underwriters mine the "Business" section, customer and supplier disclosures, property descriptions, and material-contract exhibits for intelligence on targets that do not file 10-Ks. Segment-level disclosures in the offering circular support revenue-mix, gross-margin, and working-capital triangulation in target memos.

Issuer-side finance and compliance officers

CFOs, controllers, general counsel, and compliance officers at Reg A issuers track ongoing obligations and align internal disclosure with peer practice. They reference the exhibit index, qualification-correspondence patterns across amendment chains, bad-actor representations, ongoing reporting commitments, and state blue-sky carve-outs to maintain internal disclosure libraries.

Academic researchers

Empirical researchers studying the JOBS Act, retail investor protection, and exempt offerings use the dataset as a primary corpus. Tier selection, offering size, time-to-qualification (from amendment timestamps), industry classification, and outcomes linked to 1-K, 1-SA, 1-U follow-ons or full registration support panel datasets. The offering-circular text supports NLP work on risk-factor evolution, readability, and disclosure tone.

Financial journalists and trade press

Reporters covering Reg A markets and small-issuer fraud source stories from the offering-circular narrative, named officers and directors, related-party disclosures, and use-of-proceeds language. Amendment history, qualification status, and withdrawal indicators support stories on stalled or pulled deals; aggregate tier, sector, and raise-size metadata supports trend coverage.

Enforcement, forensic accounting, and plaintiff-side litigation

Enforcement attorneys, forensic accountants, and plaintiff securities firms generate enforcement leads from the corpus. Forensic accountants compare financial statements across 1-A/A amendments and later 1-K, 1-SA, and 1-U filings to identify restated figures, shifting going-concern language, and emergent related-party transactions. Plaintiff teams mine biographical disclosures and bad-actor representations for inconsistencies; material-contract, escrow, and selling-agent exhibits anchor specific allegations.

Data engineering and quant research teams

Engineering teams at alternative-data vendors and systematic shops parse the XML metadata, extract structured fields (tier, offering amount, security type, dates, filer identifiers), and run document parsing over the HTML and PDF offering circulars to lift financial statements, officer lists, and risk-factor text. Outputs feed entity-resolution layers linking Reg A issuers to later registrations, name changes, and post-qualification reporting, and power models that flag anomalous amendment patterns.

Credit and private-debt analysts

Analysts covering Reg A debt, preferred, and asset-backed offerings underwrite non-traditional credits from the description of indebtedness, indenture and covenant exhibits, collateral descriptions, and sponsor or servicer disclosures. Comparative analysis across peer Reg A debt issuances supports coupon, tenor, redemption, and structural-protection benchmarks.

LLM and retrieval-system developers

Teams building retrieval-augmented systems use the corpus as a structured-yet-narrative body of small-issuer text. Each filing pairs metadata XML with offering-circular HTML, exhibits, and PDFs, supporting chunking, embedding, and retrieval indexes for small-issuer Q&A, comparable-company lookup, and disclosure-pattern detection.

Specific Use Cases

1. Building a Reg A pricing and commission comp set

Capital markets desks at Reg A-focused broker-dealers construct league tables and economics comps. They pull summaryInfo/indicateTier1Tier2Offering, pricePerSecurity, totalAggregateOffering, and the named-service-provider blocks (sales-commissions broker, finders, auditors, legal counsel, with CRD numbers and fees) from primary_doc.xml, then cross-reference selling-agent commissions, expense caps, and warrant coverage in the ex1-*.htm placement-agent agreements. Output is a quarterly Tier 2 comp table by sector and security type, used to set headline economics on new mandates.

2. Diffing 1-A vs 1-A/A to predict staff comments

Issuer's counsel preparing a Tier 2 filing assemble the qualification history of comparable offerings by joining all records sharing the same entities[].fileNo (024-#####) and ordering them by filedAt. They diff the offering-circular HTML (partiiandiii.htm) and exhibit set across the 1-A and successive 1-A/A records, focusing on revisions to risk factors, going-concern language, related-party transactions, and use-of-proceeds tables. The resulting diff log feeds a comment-letter prediction memo for new drafts.

3. Forensic comparison of financials across the Reg A lifecycle

Forensic accountants and plaintiff-side analysts pull the <issuerInfo> balance-sheet and income-statement snapshot from primary_doc.xml plus the audited statements embedded in partiiandiii.htm or attached as exhibits, then chain them to later 1-K, 1-SA, and 1-U filings via CIK and offering file number. Restated revenue, shifting going-concern language, and newly disclosed related-party transactions become enforcement and litigation leads, with specific exhibit citations to material contracts and consents (ex11-*.htm, ex6-*.htm).

4. Tier 2 issuer pipeline monitoring for placement marketplaces

Onboarding teams at funding portals and Reg A placement marketplaces ingest each new monthly ZIP, filter records where summaryInfo/indicateTier1Tier2Offering is Tier 2 and financialStatementAuditStatus indicates audited GAAP, and rank candidates by totalAggregateOffering, SIC, and stateOfIncorporation. The bad-actor certification in <applicationRule262> and the auditor name from <issuerInfo> feed a pre-screen scorecard; the juridictionSecuritiesOffered block tells sales which states are in scope.

5. Panel dataset for academic research on Reg A+ outcomes

Empirical researchers studying the 2015 and 2021 Reg A+ amendments build panels keyed on accession number and CIK. They extract tier, offering ceiling, audit status, security type, primary SIC, and filedAt from primary_doc.xml; compute time-to-qualification from the timestamp gap between an initial 1-A and the final 1-A/A in the same 024-##### chain; and link issuers forward to 1-K, 1-SA, 1-U, or full S-1 registrations. The resulting panel supports regressions on amendment count, qualification latency, and post-qualification survival.

6. Retrieval index over small-issuer disclosures

LLM application teams chunk and embed the offering-circular HTML and exhibit set after stripping the SGML <DOCUMENT>/<TEXT> wrapper from each .htm. Chunks are tagged with accessionNo, exhibit <TYPE> and <DESCRIPTION> from the SGML preamble, Item 17 category, CIK, SIC, and tier. The index powers comparable-issuer Q&A ("show risk-factor language from Tier 2 consumer-brand issuers raising more than $20M") and disclosure-pattern lookup that conventional EDGAR search cannot answer.

7. Subscription-agreement and escrow-clause precedent library

Disclosure counsel and platform legal teams build a clause library by harvesting Item 17 category 4 (subscription agreements) and category 8 (escrow agreements) exhibits across the corpus. They identify the right files via the <TYPE>/<DESCRIPTION> SGML preamble paired with documentFormatFiles[] entries (filename heuristics are unreliable due to mixed naming schemes), then cluster by issuer SIC and security class to surface standard versus aggressive terms on minimum investment, refund mechanics, escrow agent identity, and break conditions.

8. M&A target intelligence on non-10-K filers

Corporate development teams at strategic acquirers mine Form 1-A as a fundamentals source on private targets that do not file 10-Ks. They extract the "Description of Business," "Description of Property," MD&A, and customer/supplier disclosures from partiiandiii.htm, plus material-contract exhibits in Item 17 category 6, to triangulate revenue mix, segment economics, working capital, and key contractual relationships. Output is a target memo built from primary disclosure rather than vendor-summarized data.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-1a-files.json Returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total record and size counts, covered form types, container format, and file types) along with a list of all individual container files. Each container entry includes its key, size, records, updatedAt, and a downloadUrl. Use this endpoint to monitor which containers were updated in the most recent refresh and to decide which files to download incrementally. This endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-6939-bf28-d0916cc9b4df",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-1a-files.zip",
4 "name": "Form 1-A Files Dataset",
5 "updatedAt": "2026-04-25T02:56:39.138Z",
6 "earliestSampleDate": "2002-01-01",
7 "totalRecords": 56805,
8 "totalSize": 2593830705,
9 "formTypes": ["1-A", "1-A/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["XML", "HTML", "JSON", "TXT", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-1a-files/2026/2026-04.zip",
15 "key": "2026/2026-04.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-25T02:56:39.138Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-1a-files.zip?token=YOUR_API_KEY Downloads the complete dataset as a single ZIP archive containing all Form 1-A and 1-A/A filings from January 2002 to present. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-1a-files/2026/2026-04.zip?token=YOUR_API_KEY Downloads one monthly ZIP container instead of the full archive. Use the downloadUrl values from the index JSON's containers array to fetch specific months. This endpoint requires an API key.

Frequently Asked Questions

What forms does this dataset cover?

The dataset covers Form 1-A (initial Regulation A offering statements) and Form 1-A/A (pre-qualification and post-qualification amendments). Both form types share the same underlying schema and offering file number (024-#####), and amendments do not replace prior filings on disk — each is its own record under its own accession number.

What does one record represent?

One record is a single EDGAR offering-statement submission identified by its 18-digit accession number. It is the full bundle EDGAR received: the dataset's metadata.json header, the structured Form 1-A cover-page XML (primary_doc.xml), the XSL-rendered cover view, the narrative offering circular (partiiandiii.htm or a filer-slug variant), and every text-bearing exhibit attached to the submission.

Who is required to file Form 1-A?

The filer is always the issuer itself, seeking to qualify a securities offering under Regulation A before any sales occur. Eligible issuers must be organized in and have their principal place of business in the United States or Canada under Rule 251(a); investment companies, BDCs, blank-check companies, mineral-rights issuers, asset-backed issuers, and Rule 262 bad-actor disqualified issuers are categorically excluded.

What is the difference between Tier 1 and Tier 2?

Tier 1 covers offerings up to $20 million in any rolling 12 months, requires no audit, and remains subject to state blue-sky review. Tier 2 covers offerings up to $75 million, requires audited U.S. GAAP financials, preempts state registration under Section 18 of the Securities Act, and triggers ongoing reporting on Forms 1-K (annual), 1-SA (semiannual), 1-U (current), and 1-Z (exit). The election is captured in summaryInfo/indicateTier1Tier2Offering in primary_doc.xml.

What time period does the dataset cover?

Electronic Form 1-A coverage on EDGAR begins in January 2002 and runs to the present, refreshed monthly. The dataset spans the pre-2015 legacy Reg A regime, the June 19, 2015 Reg A+ rewrite (Release 33-9741) that established the two-tier structure, and the 2021 amendment (Release 33-10884) that raised the Tier 2 cap from $50 million to $75 million. Paper-era Reg A filings predating EDGAR are not included.

What file formats are inside each record?

Each record folder contains JSON (metadata.json), XML (the EDGAR primary_doc.xml under the oneafiler schema), and HTML (the XSL-rendered cover page and every offering-circular and exhibit file). PDFs appear occasionally as per-document exhibit alternatives. Every .htm except the XSL render is SGML-wrapped — the actual HTML payload is nested inside a <DOCUMENT>/<TEXT> envelope that must be stripped before HTML parsing. Image files referenced by filings are excluded from the ZIP and must be fetched from sec.gov via the documentUrl in metadata.json.

How does Form 1-A differ from Form S-1, Form D, and Form C?

Form 1-A is qualified rather than registered, capped at $20M (Tier 1) or $75M (Tier 2) per 12 months, and followed by the lighter 1-K/1-SA/1-U reporting regime rather than the full 10-K/10-Q/8-K stack. It is far narrower in cap and lighter in audit and reporting consequences than S-1; it is a full disclosure document rather than a brief notice like Form D; and it permits substantially larger raises with broader retail participation than Form C (Reg CF), which is capped at $5M and conducted exclusively through registered intermediaries.

How are post-qualification updates handled?

Fundamental post-qualification changes — new security type or terms, new selling securityholders, changed plan of distribution, or material business change — require a full Form 1-A/A and re-qualification before sales resume. Non-fundamental updates (pricing within permitted bounds, minor disclosure updates) go through an offering-circular supplement under Rule 253(g) and appear under separate EDGAR types 253G1 through 253G4, which are not part of this dataset.