Form 424B4 Files Dataset

The Form 424B4 Files Dataset is a complete corpus of final, priced prospectuses filed with the U.S. Securities and Exchange Commission under Rule 424(b)(4) of the Securities Act of 1933. Each record represents one Form 424B4 submission accepted by EDGAR — the canonical legal selling document of a registered offering, carrying the public offering price, final share count, underwriting discounts and commissions, net proceeds, and any substantive changes from the preliminary (red-herring) prospectus. Filings are made by the issuer-registrant of the underlying Securities Act registration statement (most often Form S-1, S-3, F-1, or F-3) once the SEC has declared that registration statement effective and the offering has priced.

The dataset is delivered as monthly ZIP containers, with each container holding accession-number folders that contain a structured metadata.json envelope plus the primary 424B4 prospectus document (typically a single self-contained .htm/.html file, occasionally .txt for very early ASCII filings, and rarely .pdf). Coverage begins with the earliest EDGAR 424B4 submissions in January 1994 and continues to the present, spanning IPOs, follow-on equity offerings, registered direct offerings, rights offerings, resale registrations, debt and convertible offerings, and fund and variable-product prospectuses.

Update Frequency
Daily
Updated at
2026-05-09
Earliest Sample Date
1994-01-01
Total Size
2.6 GB
Total Records
14,045
Container Format
ZIP
Content Types
TXT, JSON, PDF, HTML
Form Types
424B4

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

389 files · 2.6 GB
Download All
2026-05.zip4.9 MB28 records
2026-04.zip13.4 MB100 records
2026-03.zip7.4 MB27 records
2026-02.zip22.1 MB72 records
2026-01.zip14.0 MB52 records
2025-12.zip12.8 MB50 records
2025-11.zip10.8 MB43 records
2025-10.zip14.2 MB46 records
2025-09.zip15.0 MB53 records
2025-08.zip14.3 MB51 records
2025-07.zip17.3 MB59 records
2025-06.zip12.9 MB51 records
2025-05.zip17.6 MB75 records
2025-04.zip14.6 MB56 records
2025-03.zip8.6 MB29 records
2025-02.zip14.6 MB48 records
2025-01.zip14.2 MB49 records
2024-12.zip11.4 MB48 records
2024-11.zip10.0 MB44 records
2024-10.zip15.7 MB55 records
2024-09.zip11.3 MB46 records
2024-08.zip7.7 MB38 records
2024-07.zip8.2 MB31 records
2024-06.zip8.1 MB36 records
2024-05.zip8.0 MB49 records
2024-04.zip9.5 MB43 records
2024-03.zip6.4 MB31 records
2024-02.zip11.3 MB42 records
2024-01.zip8.0 MB28 records
2023-12.zip5.0 MB28 records
2023-11.zip8.3 MB35 records
2023-10.zip7.4 MB37 records
2023-09.zip6.9 MB34 records
2023-08.zip5.9 MB41 records
2023-07.zip7.2 MB30 records
2023-06.zip9.3 MB46 records
2023-05.zip5.3 MB35 records
2023-04.zip7.3 MB31 records
2023-03.zip8.3 MB36 records
2023-02.zip9.6 MB39 records
2023-01.zip5.2 MB22 records
2022-12.zip3.9 MB20 records
2022-11.zip3.5 MB18 records
2022-10.zip5.3 MB28 records
2022-09.zip7.7 MB28 records
2022-08.zip8.8 MB51 records
2022-07.zip4.0 MB18 records
2022-06.zip4.8 MB19 records
2022-05.zip6.8 MB36 records
2022-04.zip6.0 MB27 records
2022-03.zip4.5 MB18 records
2022-02.zip12.9 MB48 records
2022-01.zip12.8 MB47 records
2021-12.zip19.9 MB83 records
2021-11.zip36.2 MB135 records
2021-10.zip29.9 MB110 records
2021-09.zip22.0 MB87 records
2021-08.zip15.2 MB64 records
2021-07.zip33.9 MB124 records
2021-06.zip28.8 MB107 records
2021-05.zip17.0 MB78 records
2021-04.zip18.4 MB73 records
2021-03.zip46.3 MB186 records
2021-02.zip45.3 MB172 records
2021-01.zip37.0 MB141 records
2020-12.zip27.5 MB103 records
2020-11.zip16.9 MB70 records
2020-10.zip31.5 MB123 records
2020-09.zip23.6 MB92 records
2020-08.zip18.3 MB75 records
2020-07.zip18.1 MB75 records
2020-06.zip14.8 MB60 records
2020-05.zip7.3 MB37 records
2020-04.zip5.0 MB27 records
2020-03.zip2.6 MB15 records
2020-02.zip9.7 MB39 records
2020-01.zip8.2 MB31 records
2019-12.zip7.0 MB28 records
2019-11.zip10.8 MB55 records
2019-10.zip8.8 MB32 records
2019-09.zip7.1 MB35 records
2019-08.zip6.3 MB25 records
2019-07.zip8.1 MB33 records
2019-06.zip10.8 MB43 records
2019-05.zip13.8 MB56 records
2019-04.zip8.1 MB36 records
2019-03.zip7.2 MB32 records
2019-02.zip7.0 MB35 records
2019-01.zip474.8 KB2 records
2018-12.zip5.8 MB26 records
2018-11.zip7.3 MB26 records
2018-10.zip9.7 MB42 records
2018-09.zip11.0 MB40 records
2018-08.zip6.4 MB30 records
2018-07.zip10.2 MB45 records
2018-06.zip13.8 MB63 records
2018-05.zip8.9 MB45 records
2018-04.zip5.0 MB23 records
2018-03.zip5.8 MB25 records
2018-02.zip9.8 MB35 records
2018-01.zip7.2 MB28 records
2017-12.zip7.0 MB31 records
2017-11.zip12.8 MB57 records
2017-10.zip11.3 MB38 records
2017-09.zip6.4 MB31 records
2017-08.zip3.7 MB17 records
2017-07.zip5.8 MB28 records
2017-06.zip7.9 MB31 records
2017-05.zip8.6 MB40 records
2017-04.zip6.2 MB31 records
2017-03.zip7.1 MB29 records
2017-02.zip5.6 MB25 records
2017-01.zip3.5 MB16 records
2016-12.zip6.0 MB29 records
2016-11.zip3.1 MB20 records
2016-10.zip7.8 MB31 records
2016-09.zip8.0 MB35 records
2016-08.zip3.5 MB17 records
2016-07.zip5.9 MB28 records
2016-06.zip5.1 MB22 records
2016-05.zip7.7 MB33 records
2016-04.zip3.2 MB14 records
2016-03.zip3.1 MB18 records
2016-02.zip2.7 MB21 records
2016-01.zip1.1 MB11 records
2015-12.zip1.9 MB18 records
2015-11.zip9.8 MB46 records
2015-10.zip7.2 MB29 records
2015-09.zip2.0 MB14 records
2015-08.zip6.1 MB26 records
2015-07.zip8.2 MB39 records
2015-06.zip11.9 MB53 records
2015-05.zip10.3 MB48 records
2015-04.zip8.9 MB34 records
2015-03.zip6.1 MB31 records
2015-02.zip8.1 MB37 records
2015-01.zip7.3 MB29 records
2014-12.zip8.1 MB35 records
2014-11.zip11.8 MB41 records
2014-10.zip10.9 MB37 records
2014-09.zip9.4 MB32 records
2014-08.zip8.3 MB33 records
2014-07.zip11.1 MB44 records
2014-06.zip12.7 MB50 records
2014-05.zip10.8 MB42 records
2014-04.zip11.3 MB46 records
2014-03.zip11.4 MB53 records
2014-02.zip9.6 MB41 records
2014-01.zip8.8 MB35 records
2013-12.zip8.8 MB36 records
2013-11.zip14.0 MB57 records
2013-10.zip13.8 MB45 records
2013-09.zip8.7 MB37 records
2013-08.zip8.3 MB33 records
2013-07.zip8.7 MB34 records
2013-06.zip7.6 MB34 records
2013-05.zip13.4 MB58 records
2013-04.zip6.1 MB27 records
2013-03.zip4.2 MB20 records
2013-02.zip3.5 MB20 records
2013-01.zip4.3 MB17 records
2012-12.zip4.1 MB20 records
2012-11.zip4.1 MB12 records
2012-10.zip8.4 MB34 records
2012-09.zip6.6 MB21 records
2012-08.zip3.2 MB16 records
2012-07.zip4.6 MB22 records
2012-06.zip4.2 MB25 records
2012-05.zip4.2 MB19 records
2012-04.zip4.9 MB18 records
2012-03.zip7.7 MB41 records
2012-02.zip6.6 MB26 records
2012-01.zip1.7 MB12 records
2011-12.zip5.0 MB15 records
2011-11.zip5.1 MB21 records
2011-10.zip2.0 MB8 records
2011-09.zip604.3 KB3 records
2011-08.zip771.5 KB3 records
2011-07.zip5.2 MB22 records
2011-06.zip5.7 MB24 records
2011-05.zip9.1 MB38 records
2011-04.zip5.5 MB26 records
2011-03.zip4.6 MB22 records
2011-02.zip5.8 MB24 records
2011-01.zip3.3 MB22 records
2010-12.zip10.1 MB40 records
2010-11.zip8.7 MB34 records
2010-10.zip7.3 MB32 records
2010-09.zip2.5 MB14 records
2010-08.zip3.2 MB15 records
2010-07.zip4.2 MB21 records
2010-06.zip5.7 MB27 records
2010-05.zip4.9 MB25 records
2010-04.zip3.8 MB23 records
2010-03.zip4.0 MB18 records
2010-02.zip2.1 MB12 records
2010-01.zip3.4 MB16 records
2009-12.zip5.0 MB19 records
2009-11.zip4.1 MB21 records
2009-10.zip5.2 MB24 records
2009-09.zip2.9 MB13 records
2009-08.zip3.1 MB14 records
2009-07.zip1.1 MB10 records
2009-06.zip2.2 MB13 records
2009-05.zip1.2 MB7 records
2009-04.zip886.4 KB5 records
2009-03.zip252.6 KB1 records
2009-02.zip478.5 KB3 records
2009-01.zip525.6 KB2 records
2008-12.zip1.1 MB5 records
2008-11.zip550.8 KB4 records
2008-10.zip216.0 KB4 records
2008-09.zip498.3 KB13 records
2008-08.zip1.3 MB10 records
2008-07.zip1.2 MB7 records
2008-06.zip2.0 MB14 records
2008-05.zip2.6 MB15 records
2008-04.zip2.6 MB16 records
2008-03.zip1.9 MB10 records
2008-02.zip2.9 MB14 records
2008-01.zip3.4 MB17 records
2007-12.zip7.2 MB32 records
2007-11.zip10.8 MB50 records
2007-10.zip9.0 MB48 records
2007-09.zip2.6 MB16 records
2007-08.zip6.1 MB27 records
2007-07.zip6.1 MB27 records
2007-06.zip9.2 MB61 records
2007-05.zip8.4 MB42 records
2007-04.zip3.4 MB18 records
2007-03.zip6.0 MB31 records
2007-02.zip7.3 MB42 records
2007-01.zip4.7 MB32 records
2006-12.zip9.4 MB49 records
2006-11.zip7.3 MB44 records
2006-10.zip7.1 MB41 records
2006-09.zip4.9 MB26 records
2006-08.zip3.4 MB20 records
2006-07.zip5.4 MB29 records
2006-06.zip7.1 MB46 records
2006-05.zip6.2 MB36 records
2006-04.zip3.8 MB26 records
2006-03.zip4.9 MB38 records
2006-02.zip6.9 MB32 records
2006-01.zip5.0 MB25 records
2005-12.zip8.1 MB42 records
2005-11.zip5.8 MB37 records
2005-10.zip5.0 MB31 records
2005-09.zip7.4 MB38 records
2005-08.zip9.3 MB50 records
2005-07.zip5.3 MB30 records
2005-06.zip9.6 MB53 records
2005-05.zip6.1 MB37 records
2005-04.zip2.4 MB21 records
2005-03.zip4.0 MB24 records
2005-02.zip8.4 MB43 records
2005-01.zip4.1 MB21 records
2004-12.zip7.8 MB44 records
2004-11.zip6.6 MB34 records
2004-10.zip8.0 MB40 records
2004-09.zip4.8 MB33 records
2004-08.zip5.6 MB35 records
2004-07.zip6.4 MB43 records
2004-06.zip7.5 MB52 records
2004-05.zip6.5 MB47 records
2004-04.zip5.5 MB50 records
2004-03.zip6.4 MB42 records
2004-02.zip5.5 MB34 records
2004-01.zip3.3 MB23 records
2003-12.zip4.9 MB32 records
2003-11.zip6.2 MB41 records
2003-10.zip5.1 MB41 records
2003-09.zip3.6 MB38 records
2003-08.zip3.1 MB26 records
2003-07.zip2.8 MB25 records
2003-06.zip3.1 MB23 records
2003-05.zip2.6 MB24 records
2003-04.zip166.0 KB10 records
2003-03.zip696.3 KB9 records
2003-02.zip1.1 MB24 records
2003-01.zip963.4 KB15 records
2002-12.zip2.5 MB23 records
2002-11.zip3.0 MB23 records
2002-10.zip7.1 MB30 records
2002-09.zip352.8 KB5 records
2002-08.zip1.0 MB11 records
2002-07.zip2.1 MB17 records
2002-06.zip5.9 MB34 records
2002-05.zip3.6 MB43 records
2002-04.zip1.5 MB25 records
2002-03.zip2.9 MB27 records
2002-02.zip2.7 MB31 records
2002-01.zip1.2 MB17 records
2001-12.zip2.2 MB24 records
2001-11.zip2.3 MB34 records
2001-10.zip3.1 MB35 records
2001-09.zip906.0 KB12 records
2001-08.zip1.9 MB22 records
2001-07.zip3.4 MB29 records
2001-06.zip3.0 MB34 records
2001-05.zip2.3 MB31 records
2001-04.zip4.1 MB28 records
2001-03.zip2.6 MB35 records
2001-02.zip6.0 MB43 records
2001-01.zip1.0 MB22 records
2000-12.zip985.1 KB18 records
2000-11.zip3.4 MB43 records
2000-10.zip4.2 MB53 records
2000-09.zip7.7 MB48 records
2000-08.zip6.8 MB70 records
2000-07.zip5.2 MB49 records
2000-06.zip4.1 MB44 records
2000-05.zip3.3 MB36 records
2000-04.zip5.5 MB57 records
2000-03.zip7.8 MB84 records
2000-02.zip10.6 MB102 records
2000-01.zip2.9 MB31 records
1999-12.zip4.2 MB44 records
1999-11.zip7.3 MB69 records
1999-10.zip8.6 MB80 records
1999-09.zip4.4 MB49 records
1999-08.zip5.8 MB50 records
1999-07.zip6.6 MB70 records
1999-06.zip6.3 MB65 records
1999-05.zip6.5 MB65 records
1999-04.zip4.9 MB51 records
1999-03.zip4.7 MB49 records
1999-02.zip5.6 MB46 records
1999-01.zip2.5 MB28 records
1998-12.zip3.2 MB38 records
1998-11.zip1.9 MB18 records
1998-10.zip1.6 MB13 records
1998-09.zip1.2 MB10 records
1998-08.zip2.6 MB27 records
1998-07.zip5.1 MB46 records
1998-06.zip8.5 MB71 records
1998-05.zip7.9 MB77 records
1998-04.zip5.9 MB60 records
1998-03.zip4.7 MB57 records
1998-02.zip6.0 MB59 records
1998-01.zip3.3 MB30 records
1997-12.zip4.7 MB41 records
1997-11.zip9.9 MB103 records
1997-10.zip8.2 MB90 records
1997-09.zip4.8 MB54 records
1997-08.zip8.5 MB67 records
1997-07.zip5.4 MB61 records
1997-06.zip5.7 MB64 records
1997-05.zip2.4 MB31 records
1997-04.zip2.6 MB28 records
1997-03.zip4.9 MB52 records
1997-02.zip4.8 MB59 records
1997-01.zip6.2 MB53 records
1996-12.zip3.7 MB42 records
1996-11.zip8.3 MB83 records
1996-10.zip9.6 MB102 records
1996-09.zip4.6 MB48 records
1996-08.zip6.7 MB60 records
1996-07.zip4.1 MB47 records
1996-06.zip9.7 MB95 records
1996-05.zip5.5 MB65 records
1996-04.zip516.2 KB7 records
1996-03.zip988.2 KB13 records
1996-02.zip126.1 KB3 records
1996-01.zip727.2 KB12 records
1995-12.zip1.5 MB19 records
1995-11.zip1.2 MB10 records
1995-10.zip369.8 KB6 records
1995-09.zip503.0 KB11 records
1995-08.zip400.1 KB5 records
1995-07.zip453.8 KB5 records
1995-06.zip1.0 MB16 records
1995-05.zip486.0 KB10 records
1995-04.zip351.7 KB6 records
1995-03.zip275.6 KB6 records
1995-02.zip567.5 KB7 records
1995-01.zip2.8 KB1 records
1994-12.zip235.3 KB3 records
1994-11.zip631.7 KB6 records
1994-10.zip267.1 KB3 records
1994-09.zip365.8 KB3 records
1994-08.zip391.5 KB4 records
1994-07.zip216.5 KB3 records
1994-06.zip586.0 KB6 records
1994-05.zip1.1 MB12 records
1994-04.zip1.1 MB10 records
1994-03.zip633.3 KB12 records
1994-02.zip695.7 KB7 records
1994-01.zip645.9 KB7 records

What This Dataset Contains

The dataset is scoped strictly to Form 424B4 accessions: the definitive prospectus filed under Rule 424(b)(4) after the SEC has declared the underlying registration statement effective with pricing-related information omitted under Rule 430A or Rule 430B. The (b)(4) designation specifically signals that the final prospectus reflects information constituting a substantive change from, or material addition to, the preliminary prospectus filed under Rule 424(b)(1) or 424(b)(2). In substance, the 424B4 is the canonical legal selling document for the offering — the version that controls Section 11 and Section 12 liability under the Securities Act — and is used for IPOs, follow-on equity offerings, registered direct offerings, rights offerings, resale registrations on behalf of selling stockholders, debt offerings, convertible-security offerings, and fund/variable-product prospectuses.

The dataset materialises two artifacts per record: a metadata.json envelope describing the filing exactly as EDGAR received it, and the primary 424B4 prospectus document carrying the full narrative, tabular, and legal text of the priced prospectus. Records are delivered inside monthly ZIP containers keyed by year and month (for example, 2025/2025-11.zip). When unzipped, each record sits inside a top-level month folder (2025-11/) whose immediate children are the per-accession folders. Image attachments (GRAPHIC documents such as JPEG/PNG illustrations, charts, organisational diagrams, and issuer logos) referenced inside the metadata are inventoried but not materialised on disk; they remain accessible via the EDGAR archive URLs carried in each metadata entry. The file types present in the dataset are TXT, JSON, PDF, and HTML; Form 424B4 does not carry an XBRL instance, so XBRL data files are absent.

Content Structure of a Single Record

What one record represents

A single record in the Form 424B4 Files Dataset corresponds to one Form 424B4 submission accepted by EDGAR, identified by its accession number. On disk a record is a folder named after the 18-digit accession number with no dashes, and it contains exactly two artifacts: a metadata.json envelope describing the filing, and the primary 424B4 prospectus document — almost always a single self-contained .htm/.html file (rarely .txt for very old filings). The JSON gives a structured handle on filer identity, document inventory, and timestamps; the prospectus document carries the full narrative, tabular, and legal text of the priced prospectus exactly as it was filed on EDGAR.

Container layout and record envelope

Each accession folder beneath the month directory holds:

  • metadata.json — the structured filing envelope, always present.
  • A single prospectus document (.htm/.html, occasionally .txt for legacy ASCII filings) whose filename is whatever the filer or its EDGAR agent submitted. Common filename shapes include Donnelley/Edgar Agents-style (d60114d424b4.htm), Toppan Merrill-style (tm2512009-13_424b4.htm), EGS/EdgarAgents-style (ea0267241-424b4_endra.htm), Issuer Direct boilerplate (form424b4.htm), and issuer-specific names (apvo-2025-424b4_-_eloc2.htm, project_grace_424b4.htm, f424b4_110625.htm).

Image attachments — the JPEG/PNG illustrations, charts, organisational diagrams, and issuer logos that the filer also submitted as GRAPHIC documents — are not present in the ZIP. They are inventoried inside metadata.json but are not materialised locally; <IMG SRC="..."> tags inside the prospectus HTML therefore resolve only against the original EDGAR archive URLs, not against the local folder.

metadata.json — structured envelope

metadata.json is a flat JSON object describing the filing as EDGAR received it. The fields are consistent across records:

  • formType — always "424B4" for this dataset.
  • accessionNo — the dashed accession number (e.g. "0001193125-25-288774"); the surrounding folder name is the digits-only form of the same identifier.
  • filedAt — ISO 8601 timestamp with EDGAR's eastern-time offset, marking the moment EDGAR accepted the filing.
  • description — human-readable form description, typically "Form 424B4 - Prospectus [Rule 424(b)(4)]".
  • linkToFilingDetails — direct URL to the primary prospectus document on EDGAR.
  • linkToTxt — URL to the full SGML-wrapped submission text bundle.
  • linkToHtml — URL to the EDGAR filing-index page.
  • linkToXbrl — empty for 424B4 prospectuses; the form does not carry an XBRL instance.
  • id — internal record identifier (md5-shaped).
  • documentFormatFiles — ordered array of every document EDGAR received with the submission.
  • dataFiles — empty for 424B4 (there are no associated XBRL data files).
  • seriesAndClassesContractsInformation — populated for fund and variable-product 424B4s with series and class identifiers; empty for operating-company prospectuses.
  • entities — array of registrant and co-filer descriptors.

documentFormatFiles

Each entry describes one attached document with a stringified 1-based sequence, a byte size, an absolute documentUrl on www.sec.gov/Archives/edgar/..., a description, and an EDGAR type. For 424B4 submissions the dominant types are 424B4 for the prospectus itself (whose description may be either the type literal "424B4" or "PROSPECTUS") and GRAPHIC for embedded JPEG/PNG illustrations referenced from the prospectus HTML. The trailing entry — with a single space " " as sequence — is a synthetic pointer to the EDGAR full-submission text bundle (the .txt file). The list is therefore a complete inventory of what EDGAR holds; the dataset materialises only the 424B4 document.

entities

Each entry corresponds to a registrant or co-filer recorded in the EDGAR header and carries: cik, companyName (with an inline role suffix in parentheses such as (Filer) or (Subject)), fileNo (the registration file number — e.g. 333-290663 — that ties the 424B4 back to its underlying S-1/F-1/S-3), sic (EDGAR SIC code with its short label, such as "6331 Fire, Marine & Casualty Insurance" or "2834 Pharmaceutical Preparations"), tickers (array of trading symbols, with multi-class issuers listing each class — e.g. ["GLIBA","GLIBK","GLIBR","GLIBB"]), irsNo (EIN), stateOfIncorporation (which may be omitted for some entities), fiscalYearEnd in MMDD format, act (typically "33" for the Securities Act of 1933), type (form type for that entity), and filmNo (EDGAR film number). Together cik and fileNo uniquely tie the priced prospectus to a specific registration statement.

Prospectus document — wrapper and body

Every prospectus file is a single self-contained document wrapped in the SGML envelope EDGAR uses inside .txt submissions. The opening lines are a fixed wrapper of the form:

1 <DOCUMENT>
2 <TYPE>424B4
3 <SEQUENCE>1
4 <FILENAME>...
5 <DESCRIPTION>PROSPECTUS
6 <TEXT>
7 <HTML> ... </HTML>
8 </TEXT>
9 </DOCUMENT>

Inside the SGML wrapper sits the full HTML body of the prospectus. The body is paginated for print using embedded <!-- Field: Page; Sequence: N --> / <!-- Field: /Page --> comments and <DIV STYLE="break-before: page; ..."> constructs. Different filer toolchains (Edgar Agents, Toppan Merrill, Donnelley/edgarfilings, Issuer Direct, EGS) leave distinguishable markup signatures, but the section structure is essentially the same across providers.

Typical sections of a 424B4 prospectus

A 424B4 prospectus is a continuous document, navigated via a hyperlinked TABLE OF CONTENTS near the front and anchored throughout with <A NAME="..."> targets. The canonical sections, in approximate order, are:

  1. Cover page and offering terms. The first HTML page carries the EDGAR cover-page boilerplate ("Filed Pursuant to Rule 424(b)(4)", the registration number, the issuer name, the security and amount being offered) followed by a cover-page pricing table. For an underwritten cash offering the cover table presents, per share and per total, the public offering price, underwriting discount and commissions, and net proceeds to the issuer or to selling stockholders, with footnotes describing the underwriters' option to purchase additional shares (commonly 30 days, up to 15% of the base deal). For resale 424B4s the cover instead identifies the selling stockholders, the share categories being registered (PIPE shares, pre-funded warrants shares, common warrant shares, placement-agent warrant shares, etc.), the listed exchange and trading symbol, and the most recent reported sale price; for rights offerings it identifies the rights mechanics, exercise price, expiration date, and oversubscription privilege.

  2. About this prospectus and cautionary note on forward-looking statements. Short framing pages describing how to use the document and setting out the standard PSLRA-style forward-looking statements safe-harbor language.

  3. Prospectus Summary. A condensed overview of the issuer, the offering, and the strategic rationale; full-form on IPOs and follow-on offerings, often abbreviated or omitted on resale registrations and rights offerings.

  4. The Offering. A stylised table-form summary box restating in catalog form the number and type of shares offered, who is offering them, the offering terms, use of proceeds, the listing exchange and ticker, the post-offering shares-outstanding count, and a pointer to the Risk Factors page.

  5. Risk Factors. Typically anchored as <A NAME="a_004"> or similar and introduced by a short italic lead-in. On larger deals this is a multi-page enumeration of business, industry, regulatory, financial, and securities-related risks, sometimes running to dozens of pages and (since the 2020 amendments to Item 105 of Regulation S-K) prefaced by a summary of risk factors and grouped by category when the section exceeds 15 pages. On smaller deals — especially smaller-reporting-company resale registrations — Risk Factors may be brief and explicitly incorporate by reference the Risk Factors of the issuer's most recent 10-K and subsequent 10-Q filings.

  6. Use of Proceeds. Identifies the dollar amount the issuer expects to receive net of underwriting discounts and offering expenses and how it intends to deploy that capital (working capital, capital expenditures, debt repayment, acquisitions, R&D). On resale 424B4s this section instead states that the issuer will receive no proceeds from the resale itself, sometimes noting that the issuer will receive proceeds from any cash exercise of warrants underlying the resale shares. On rights offerings it states the intended use of the rights-offering proceeds.

  7. Capitalization. A tabular presentation of the issuer's cash and capitalization as of a recent balance-sheet date on an actual basis and on an as-adjusted basis giving effect to the offering. Standard for IPOs and primary follow-ons; routinely omitted on resale registrations and rights offerings.

  8. Dilution. An IPO-specific arithmetic schedule comparing the public offering price per share to pro forma net tangible book value per share before and after the offering, quantifying the per-share dilution to new investors.

  9. Management's Discussion and Analysis of Financial Condition and Results of Operations. Present in full on IPO 424B4s, where the prospectus must be a stand-alone disclosure document; typically incorporated by reference from periodic reports on follow-on prospectuses where Form S-3 eligibility permits incorporation.

  10. Business. Description of the issuer's operations, products or services, markets, competition, intellectual property, regulatory environment, employees and human capital, properties, and material legal proceedings; full-form on IPOs, frequently incorporated by reference on follow-ons.

  11. Management. Directors and executive officers with biographies, board structure, committee composition, and director independence; corporate-governance disclosures; executive-compensation summary, including (where applicable) pay-versus-performance, clawback-policy, and Item 105/Item 106 cybersecurity disclosures reflected by reference or in summary.

  12. Principal and (where applicable) Selling Stockholders. Beneficial ownership tables identifying 5%-or-greater holders, directors, and officers, and — on resale 424B4s — the named selling stockholders together with shares owned before the offering, shares offered, and shares owned after the offering.

  13. Description of Capital Stock (or Description of Securities / Description of Notes). Authorised and outstanding share counts, par value, voting rights, dividend rights, liquidation preferences, anti-takeover provisions, exchange listing, and transfer agent. For debt or convertible offerings this is replaced or supplemented by a Description of Notes / Description of the Offered Securities covering interest rate, maturity, ranking, covenants, conversion mechanics, redemption, and events of default. For multi-class structures each class is described separately.

  14. Material U.S. Federal Income Tax Considerations. Tax-counsel disclosure addressing U.S. holders, non-U.S. holders, withholding, information reporting, and (on certain offerings) FATCA.

  15. Underwriting / Plan of Distribution. On underwritten deals an UNDERWRITING section names the syndicate, sets out the firm-commitment or best-efforts structure, the underwriters' option to purchase additional shares, lock-up agreements with directors, officers, and major holders, indemnification, stabilisation and short-position mechanics, FINRA disclosures, and electronic distribution. On resale 424B4s a PLAN OF DISTRIBUTION section instead enumerates permitted resale methods (ordinary brokerage transactions, block trades, at-the-market sales through broker-dealers, privately negotiated transactions, exchange or off-exchange transactions, settlement of short sales) and the issuer's expense-reimbursement arrangement. Rights-offering 424B4s describe the subscription rights, oversubscription privilege, dealer-manager arrangement, and settlement mechanics.

  16. Legal Matters and Experts. Names of issuer counsel, underwriters' counsel, and the independent registered public accounting firm whose audit report on the financial statements is referenced.

  17. Where You Can Find More Information / Incorporation by Reference. Pointers to EDGAR and, on shelf-eligible 424B4s, an explicit list of incorporated periodic reports, current reports, and proxy materials.

  18. Financial statements. On IPO 424B4s, audited annual statements (balance sheets, statements of operations, comprehensive income, stockholders' equity, and cash flows) with notes and the auditor's report, plus unaudited interim statements where applicable; on follow-on shelf prospectuses the statements are typically incorporated by reference rather than reproduced in full. Where present, the statements are rendered as HTML tables.

  19. Back matter. Signature/dater block and any final notices.

For fund and variable-insurance-product 424B4s (signalled by a non-empty seriesAndClassesContractsInformation), the section labels follow the relevant fund-prospectus item structure rather than the S-K-driven order: fee tables, investment objectives, principal investment strategies, principal risks, performance, portfolio management, purchase and sale information, tax information, payments to broker-dealers, and similar items.

Included content

A record materialises:

  • The structured filing envelope (metadata.json) with form type, accession number, filing timestamp, EDGAR document inventory, registration file number, filer and co-filer identity, SIC, tickers, IRS number, state of incorporation, fiscal year end, film number, and (for fund filers) series-and-class identifiers.
  • The full text of the 424B4 prospectus as one HTM/HTML document with its SGML wrapper intact, including all narrative, tabular, and legal content as filed.

Excluded content

  • GRAPHIC image files referenced in metadata.json -> documentFormatFiles are not stored in the ZIP. These are the embedded JPEG/PNG illustrations, organisational charts, product photos, maps, and issuer logos that the prospectus HTML references; they remain accessible only via the original EDGAR archive URLs carried in each documentFormatFiles entry. A single IPO 424B4 may list one prospectus document plus dozens of GRAPHIC entries; only the prospectus is materialised.
  • The synthetic "Complete submission text file" pointer in documentFormatFiles is a reference to the EDGAR full-submission .txt bundle and is not stored as a separate artifact in the record.
  • Documents from the underlying registration statement (S-1, F-1, S-3, etc.), preliminary prospectuses (424B1/B2/B3), pricing-only supplements (424B5), free writing prospectuses (FWP), selling-security-holder updates (424B7), and other ancillary EDGAR filings are not included; the dataset is scoped strictly to 424B4 accessions.
  • Documents incorporated by reference (Form 10-K, Form 10-Q, Form 8-K, DEF 14A) are not materialised in the record; the prospectus body links to or names them but the underlying content lives in their own EDGAR filings.

Changes in required content over time

The structural skeleton of a 424B4 prospectus has been stable since the form's introduction, but several SEC rule changes have shifted what appears inside it:

  • Plain English Rule (1998). Rule 421(d) imposed plain-English requirements on the cover, summary, and risk factors of prospectuses, materially changing the readability and section construction of 424B4 covers and Risk Factors filed from 1998 onward.
  • Securities Offering Reform (2005). Reform of the Securities Act offering process expanded the use of free writing prospectuses, codified the well-known seasoned issuer (WKSI) regime, and broadened incorporation by reference into shelf 424B4s, allowing many follow-on 424B4s to omit full Business, MD&A, and financial-statement reproduction in favor of incorporation from periodic reports.
  • Risk Factor reform (2020). Amendments to Item 105 of Regulation S-K required risk-factor sections longer than 15 pages to begin with a summary of risk factors and grouped Risk Factors by category, reshaping the Risk Factors section of 424B4 prospectuses filed thereafter.
  • Pay-versus-performance, cybersecurity, clawback, and human-capital disclosures. Where a 424B4 contains or incorporates the issuer's executive-compensation, cybersecurity, clawback-policy, and human-capital disclosures, those subsections reflect the rule additions of 2020 (human-capital disclosures), 2022 (clawbacks, pay-versus-performance), and 2023 (cybersecurity disclosures).
  • Capital-structure conventions. Dilution and Capitalization tables have remained convention-driven rather than rule-driven, but the as-adjusted columns reflect the offering terms shown on the cover page and have evolved alongside disclosures of pre-funded warrants, PIPE arrangements, equity lines (ELOCs), and SPAC redemptions, which appear with greater frequency in 424B4 prospectuses from the 2010s onward.

Changes in data format over time

Form 424B4 filings span EDGAR from January 1994 to the present, and the on-disk format of the prospectus document evolved across that span:

  • 1994 to late 1990s: ASCII / plain text. Early 424B4 submissions are plain-text filings with hard line breaks, monospace tabular formatting, and <S> / <C> column markers. Tables are rendered as fixed-width ASCII; there are no embedded images or hyperlinks. The SGML <DOCUMENT> / <TYPE>424B4 / <TEXT> envelope is already present, but the body is text rather than HTML.
  • Late 1990s to 2000s: HTML adoption. As EDGAR began accepting HTML, 424B4 prospectuses transitioned to HTML bodies inside the same SGML wrapper, gaining proper tables (<TABLE>), hyperlinked tables of contents with <A NAME> anchors, typographic styling, and embedded GRAPHIC references for illustrations and logos.
  • 2000s to present: rich HTML with print pagination. Modern 424B4 documents are richly styled HTML with inline CSS, paginated for print using <DIV STYLE="break-before: page"> and toolchain-specific page-marker comments, frequent use of <TABLE> for capitalization, dilution, beneficial-ownership, and selling-stockholder schedules, and consistent cross-referencing via anchor IDs.

The dataset preserves each file as filed, so the era's formatting conventions are visible in each record. The file types found in the dataset are TXT, JSON, PDF, and HTML: TXT for legacy ASCII prospectuses, JSON for the per-record metadata envelope, HTML for the modern prospectus body, and PDF for rare submissions in which a filer attached a PDF rendition. Form 424B4 does not carry an XBRL instance, and linkToXbrl is correspondingly empty for every record.

Interpretation and extraction notes

  • Incorporation by reference. Many follow-on 424B4 prospectuses, particularly those filed off Form S-3 by seasoned issuers, are short documents whose Business, MD&A, and financial-statement disclosures live in incorporated periodic reports rather than in the prospectus body itself. A complete reading of such a record requires consulting the issuer's referenced 10-K and 10-Q filings, which are not part of this dataset.
  • Amendments. A 424B4 is itself a definitive prospectus, not an amendment; subsequent corrective filings appear under Form 424B7 (selling-security-holder updates) or as post-effective amendments to the underlying registration statement, neither of which is included.
  • Rule-pursuant variations. A small fraction of filings that EDGAR classifies as 424B4 carry cover language citing Rule 424(b)(1) or 424(b)(7) in addition to or instead of (b)(4), reflecting filer practice rather than data error.
  • Multi-class and co-issuer filings. The entities array can contain multiple registrants — parent and finance-subsidiary co-issuers, multi-class share structures with multiple tickers per issuer, or selling-shareholder entities tagged (Subject). Consumers should treat the array as a list rather than picking only the first entry.
  • Image references that do not resolve locally. Because GRAPHIC files are excluded, <IMG SRC="..."> tags inside the HTML resolve only against the EDGAR archive URL pattern, not against the local folder. Rendering the HTML offline therefore shows broken images while preserving all textual and tabular content.
  • Filename heterogeneity. The prospectus filename is whatever the filer or its EDGAR agent submitted; consumers should rely on the single non-metadata.json file in the folder, on the documentFormatFiles entry whose type is "424B4", or on the <TYPE>424B4 SGML header, rather than on filename pattern matching.
  • Pagination markers. Embedded page-break comments and <DIV>-based page constructs are presentation-layer artifacts and should be stripped or treated as section delimiters, not as content, by extraction pipelines.
  • Section anchors. Internal navigation relies on <A NAME="..."> targets (often opaque IDs such as a_001, a_004); the readable section name typically appears in the immediately surrounding <P>, <TR>, or <FONT> element rather than in the anchor itself, so extractors should resolve section boundaries from the table-of-contents pairing of anchor and label rather than from the anchor string.
  • Funds vs. operating companies. When seriesAndClassesContractsInformation is non-empty, the prospectus is a fund or variable-insurance-product 424B4 whose internal structure (fee tables, investment objectives, principal investment strategies, principal risks, performance) differs materially from the operating-company anatomy described above; the SGML wrapper and JSON envelope are identical, but section labels follow fund-prospectus item conventions.
  • Underwritten vs. resale vs. rights structure. The cover, Use of Proceeds, "The Offering" box, and Underwriting/Plan of Distribution sections vary materially with deal type. An underwritten cash offering shows a per-share/per-total pricing table and an UNDERWRITING section; a resale registration shows a "shares offered by selling stockholders" cover, a "we will not receive proceeds" Use of Proceeds, and a PLAN OF DISTRIBUTION enumeration of permitted methods; a rights offering shows subscription mechanics, an oversubscription privilege, and dealer-manager arrangements. Extraction logic should branch on these structural variants rather than assume a single template.

Who Files or Publishes This Dataset, and When

Who files or discloses the record

The filer of every Form 424B4 is the issuer-registrant of the underlying Securities Act registration statement. The dataset's filer population consists of entities that have an effective registration statement on file and are conducting a registered public offering whose final pricing terms were omitted from the registration statement under Rule 430A or Rule 430B.

In practice, that population includes:

  • Domestic operating company issuers filing off an effective Form S-1 (most IPOs and many non-shelf follow-on offerings) or Form S-3 (shelf and non-shelf primary or secondary offerings by short-form-eligible issuers).
  • Foreign private issuers conducting U.S. registered offerings off Form F-1 or Form F-3.
  • IPO issuers, the most characteristic users of 424(b)(4): a company files the final prospectus the morning after pricing, reflecting the price set after effectiveness.
  • Follow-on and secondary offering issuers where Rule 430A/430B information was omitted from the effective registration statement.
  • Issuers in registered resale offerings on behalf of selling security holders, where final terms are completed via a 424(b)(4) prospectus.

The legal filer is always the registrant. Underwriters set the price and sell the securities, and selling security holders may be the economic sellers, but neither files the prospectus on EDGAR. They appear inside the document, not on the cover as filer of record.

Outside this population: private issuers relying on Regulation D, Regulation S, Rule 144A, or Section 4(a)(2) (no Securities Act registration, therefore no Rule 424 prospectus); and reporting companies whose only EDGAR activity is Exchange Act periodic reporting without a contemporaneous registered offering.

When the record is created or required

Form 424B4 is event-driven, not periodic. A record exists when three conditions align:

  1. The SEC has declared the underlying registration statement (S-1, S-3, F-1, F-3, or analogous) effective with pricing-related information omitted under Rule 430A or 430B.
  2. The issuer and its underwriters have priced the offering and finalized size, public offering price, underwriting discount, and related terms.
  3. The issuer is therefore obligated under Rule 424(b) to file a final prospectus that supplies the omitted information and reflects any substantive changes from the last preliminary prospectus.

Filing deadline. Rule 424(b) requires the prospectus to be filed no later than the second business day following the earlier of (i) the date the prospectus is first used after effectiveness, or (ii) the date the offering price is determined.

Typical IPO cadence. The issuer prices after the close on day T, trading opens on T+1, and the 424B4 is filed on the morning of T+1 (or by T+2 at the latest) so the final statutory prospectus is publicly available at or before first sales.

Each priced offering generally produces one 424B4 filing. Additional 424B4 or 424B3 submissions may follow if the offering is upsized, repriced, or otherwise materially updated; each is a separate EDGAR submission with its own accession number.

What 424(b)(4) specifically captures

Rule 424(b) routes post-effectiveness prospectuses to several paragraphs. The defining features of paragraph (b)(4) are the combination of:

  • Rule 430A/430B information being completed (price, offering size, underwriting discount, net proceeds), and
  • substantive changes or additions relative to the preliminary prospectus already on file.

That combination distinguishes 424(b)(4) from neighboring codes:

  • 424(b)(1) — substantive new information but no Rule 430A pricing was omitted (price was already in the effective registration statement).
  • 424(b)(2) / 424(b)(5) — standard shelf takedown codes under Rule 415 from an effective S-3 or F-3; Rule 424(b)(5) requires no substantive changes beyond the previously omitted Rule 430B information.
  • 424(b)(3) — substantive changes or prospectus stickers without Rule 430A pricing information.
  • 424(b)(7) / 424(b)(8) — selling-security-holder shelf prospectuses and substantive supplements to prior (b)(2)/(b)(5)/(b)(7) filings.

424(b)(4) is therefore the standard EDGAR code for an IPO issuer the morning after pricing, and for many priced follow-on offerings off an S-1 or non-automatic S-3 where additional substantive disclosure accompanies the omitted pricing data.

Important distinctions

  • Issuer files, not underwriters or selling holders. The registrant is the EDGAR filer even when the syndicate prices the deal and selling stockholders are the economic sellers.
  • Not used when price was already effective. An issuer whose registration statement went effective with the price already set would file under 424(b)(1), not (b)(4).
  • Shelf debt and equity takedowns. Routine shelf takedowns from an S-3 or F-3 typically use 424(b)(2) or 424(b)(5); 424(b)(4) appears in the shelf context only when substantive changes accompany the priced takedown.
  • Foreign private issuers. FPIs use the same Rule 424(b)(4) mechanic for U.S. registered offerings off F-1 or F-3, despite reporting periodically on Form 20-F or Form 40-F rather than 10-K/10-Q.
  • No pricing, no filing. If an effective registration statement never prices (postponed or withdrawn IPO), no 424B4 is filed. The presence of a 424B4 in the dataset is direct evidence the offering went forward.
  • Adjacent documents. The 424B4 sits between the effective S-1/S-3/F-1/F-3 (with EFFECT notice), any preliminary Form 424B1/424B3 red-herrings, free writing prospectus (FWP), Form 8-A exchange registration, and any later 424B3/Form 424B5/424B7 supplements. It is the definitive Section 10(a) statutory prospectus, not a marketing document and not the registration statement.

Historical scope

The Rule 424 filing obligation traces to the Securities Act of 1933; Rule 430A, which makes the (b)(4) code necessary by allowing pricing to be omitted at effectiveness, was adopted in 1987. Mandatory EDGAR filing of Securities Act prospectuses was phased in between 1993 and May 1996 under Regulation S-T. The earliest 424B4 records in this dataset accordingly date from 1994; coverage is effectively complete for domestic registrants from May 1996 onward and for foreign private issuers from the early 2000s.

How This Dataset Differs From Similar Datasets or Filings

Form 424B4 sits inside a tightly related family of Rule 424(b) prospectus variants, the registration statements they derive from, and the marketing documents filed alongside them. These neighbors share long blocks of identical text and are often filed in close sequence, but they differ in legal trigger, timing within the offering process, and what information they finalize.

Form S-1 and S-1/A (parent registration statement)

The 424B4 prospectus is functionally a subset of the S-1 it points back to: it republishes the prospectus portion with previously omitted Rule 430A pricing terms filled in. S-1 and Form S-1/A are broader, including Part II content (unregistered sales, exhibits, undertakings, signatures) and presenting price, share count, and underwriting spread as ranges or placeholders. 424B4 is narrower, post-effective, and definitive, locking in the actual offering price, final share count, and underwriting discount.

Form 424B1 (definitive prospectus, no Rule 430A reliance)

424B1 is the definitive prospectus when the registration statement was already complete on pricing at effectiveness. 424B4 is used when the issuer relied on Rule 430A to defer pricing until after effectiveness, the standard pattern for traditional IPOs. The two are mutually exclusive for any given offering.

Form 424B5 (priced prospectus for shelf takedowns)

424B5 is the closest functional sibling: a definitive priced prospectus, but for follow-on offerings off an effective S-3 or F-3 shelf. 424B4 pairs with S-1/F-1 and dominates IPOs and first-time registrants; 424B5 pairs with shelves and dominates seasoned-issuer follow-ons, ATMs, and block trades. A complete priced-equity dataset typically requires both.

Form 424B2 (shelf takedown pricing supplement)

Form 424B2 is a thinner supplement adding offering-specific terms on top of an already-effective base prospectus, used heavily for debt and continuous programs. 424B4, by contrast, republishes a full prospectus tied to a 430A-pending S-1.

Form 424B3 (sticker and Section 10(a)(3) updates)

Form 424B3 updates an existing prospectus for material developments or required updates (Section 10(a)(3)); it generally does not establish the priced terms of an offering. 424B3 filings are shorter, more frequent across an offering's life, and oriented to amendment rather than initial pricing.

Form 424B7 and 424B8

424B7 supplements information about selling security holders in resale registrations — secondary distributions by existing holders rather than the issuer. Form 424B8 fills gaps under Rule 430C (offerings outside 430A and 430B). Both differ from 424B4 in the underlying omission rule, the role of the supplement, and the typical issuer.

Form FWP (free writing prospectus)

FWPs are Rule 433 marketing communications — term sheets, roadshow decks, press releases — filed in the lead-up to pricing. They are not the statutory prospectus and do not establish binding terms. FWP and 424B4 are complementary: FWP captures the pre-pricing trail, 424B4 captures the definitive priced document.

Form F-1 (foreign analog of S-1)

F-1 is the foreign private issuer counterpart to S-1, with extra disclosures on home-country regulation, currency, and financial-statement reconciliation. The 424B4 dataset includes filings tied to both S-1 and F-1 parents; cross-border users should join 424B4 records to their parent F-1 rather than treat foreign filings as a separate corpus.

Form 8-A12B (Section 12(b) class registration)

Form 8-A12B registers a class of securities under the Exchange Act and enables exchange listing. It is typically filed at or just before IPO pricing, paired in time with 424B4 but serving a distinct statutory purpose: 424B4 is the Securities Act prospectus describing the offering, 8-A12B triggers ongoing Exchange Act reporting and listing.

Key differences at a glance

  • Rule trigger: 424B4 is the only 424(b) variant tied specifically to Rule 430A omission of pricing.
  • Parent form: 424B4 attaches to S-1 or F-1; 424B2 and 424B5 attach to S-3 or F-3 shelves.
  • Function: 424B4 establishes definitive priced terms; 424B3 updates, 424B7 refreshes selling holders, FWP markets.
  • Typical issuer: 424B4 skews toward IPOs and first-time registrants; 424B5 toward seasoned issuers; 424B2 toward shelf debt programs.
  • Scope: 424B4 is the priced prospectus only, not the full registration record (which lives in S-1/S-1/A) and not the periodic disclosure record (10-K, 10-Q, 8-K, DEF 14A).

Boundary summary

Within the 424(b) family, Form 424B4 is uniquely the definitive priced prospectus for offerings that relied on Rule 430A to defer pricing past effectiveness — the canonical IPO pathway. It is distinct from 424B1 (no deferral), from 424B2 and 424B5 (shelf rather than S-1/F-1), from 424B3 (update rather than initial pricing), from 424B7 and 424B8 (different omission rules and roles), and from FWP (marketing, not statutory). It is a post-effective subset of the S-1 record, focused on locking in the legal and economic terms of the offering at pricing. That makes 424B4 the canonical filing for IPO event studies, underwriting-spread research, and use-of-proceeds analysis, while related datasets (S-1, FWP, 8-A12B, periodic filings) supply context but do not substitute for it.

Who Uses This Dataset

Form 424B4 is the definitive prospectus that closes out a registered securities offering. It carries the final price, share count, gross spread, syndicate, lockups, dilution math, use of proceeds, risk factors, and audited financials, so it is consumed by capital markets, legal, research, quantitative, and litigation functions.

Equity Capital Markets Bankers

ECM origination, syndicate, and pricing desks use the corpus as a precedent library for IPOs and follow-ons. They pull cover-page and underwriting fields, final price versus marketed range, primary/secondary split, over-allotment size, gross spread, and bookrunner roster, to build pricing comps, fee benchmarks, and live deal recommendations for pitch books and bake-offs.

IPO and Buy-Side Equity Analysts

Fundamental analysts and IPO desks read 424B4s as the primary source for deal-day decisions. The business overview, KPI tables, capitalization and dilution sections, financial statements, and non-boilerplate risk factors feed initiation notes, valuation models, and allocation requests sized against conviction.

Securities Lawyers and Capital Markets Counsel

Issuer and underwriter counsel mine the corpus for sector- and structure-specific risk-factor language, plan-of-distribution clauses, use-of-proceeds and dilution wording, EGC/SRC representations, and forum-selection or jury-trial waivers. Diffing the definitive 424B4 against earlier S-1/S-1A versions also surfaces the exact edits driven by SEC review, supporting first drafts, comment responses, and disclosure-committee review.

Section 11 Litigators and Expert Witnesses

Plaintiff- and defense-side securities litigators treat the 424B4 as the legally operative offering document for Section 11 and Section 12(a)(2) claims. They map alleged misstatements to specific prospectus passages, compare the 424B4 to later 8-K and 10-Q filings to build corrective-disclosure timelines, and use the underwriter list to identify defendants. The dataset feeds complaints, defense memoranda, damages models, and expert reports.

Forensic Accountants and Audit Advisers

Forensic accountants compare the audited financials, MD&A discussion, related-party notes, and contingent-liability disclosures inside 424B4 prospectuses against the registrant's later 10-K and 10-Q filings, looking for restatements, segment redefinitions, or going-concern shifts that surface post-IPO.

Pre-IPO Corporate Finance and IR Teams

In-house finance, treasury, and IR teams at private issuers preparing to go public, and at recently public issuers planning follow-ons, benchmark comparable deals: price range, free float, primary-versus-secondary mix, lockup duration, use-of-proceeds language, and dilution per share. The work shapes filing timing, deal structure, and equity-story positioning.

Quantitative and Data Science Teams

Quant researchers parse the corpus into structured event tables of IPO and follow-on pricing, share counts, and proceeds; run NLP over risk factors and business descriptions to score novelty and similarity; and join new tickers to post-IPO price and volume data to study underpricing, lockup-expiration drift, and underwriter-reputation effects in factor research and systematic strategies.

Regulators and Academic Researchers

Staff at supervisory bodies and academics in finance, accounting, and law construct longitudinal offering datasets from 1994 onward to study underpricing, gross-spread clustering at seven percent, hot-issue cycles, and the disclosure effects of regulatory changes such as the JOBS Act, producing working papers and policy reviews.

Compliance and Surveillance

Broker-dealer compliance officers, syndicate supervisors, and exchange surveillance staff use the plan-of-distribution and lockup sections to verify allocation eligibility, build lockup-expiration calendars, and check whether insider sales fall outside contractual restrictions.

Specific Use Cases

The Form 424B4 corpus supports a small set of high-value workflows where the priced prospectus is the canonical artifact. Each use case below ties to specific record fields or prospectus sections.

Build an IPO underpricing dataset from 1994 to present

Parse the cover-page pricing table on every 424B4 to extract public offering price, base share count, and over-allotment shares, then join on entities[].tickers and entities[].cik to first-day trading data. Diff the final price against the filing-range midpoint disclosed in the parent S-1/S-1A to compute pricing revisions, and aggregate by entities[].sic, filedAt year, and lead-underwriter name parsed from the UNDERWRITING section. The resulting panel feeds underpricing studies, hot-issue cycle analysis, and partial-adjustment tests.

Benchmark gross spreads and underwriting economics

Extract the per-share and aggregate "underwriting discounts and commissions" line from the cover-page pricing table and the syndicate roster from the UNDERWRITING section across every 424B4. Group by deal size buckets, SIC code, and year to build gross-spread comp tables, quantify clustering at seven percent for mid-cap IPOs, and rank bookrunners by deal count and proceeds. ECM desks plug the output directly into pitch-book fee pages and bake-off comps.

Track risk-factor language drift across sectors

Segment Risk Factors using the <A NAME="a_004">-style anchors and the post-2020 summary-of-risk-factors structure, then run topic modeling and embedding similarity over the resulting corpus. Filter entities[].sic to a sector (e.g., 2834 Pharmaceutical Preparations, 7372 Prepackaged Software) and trace how clinical-trial, supply-chain, AI-model, or cybersecurity risk language enters and evolves year over year. Output supports counsel drafting precedent and quant signals on disclosure novelty.

Construct a Section 11 mapping table for litigation

For a defendant issuer, pull the 424B4 by accessionNo and entities[].cik, then mechanically extract every numerical and forward-looking statement from Risk Factors, Business, MD&A, and the audited financials. Cross-reference each statement to subsequent 10-K, 10-Q, and 8-K disclosures to build a corrective-disclosure timeline, and use the UNDERWRITING section to enumerate Section 11 defendants. The mapping table feeds complaint drafting, motion-to-dismiss exhibits, and damages models.

Quantify use-of-proceeds patterns and dilution by deal type

Extract the Use of Proceeds dollar allocations (working capital, debt repayment, R&D, acquisitions) and the Dilution table's pro forma net tangible book value figures from each IPO 424B4. Branch the parser on deal type — underwritten cash, resale registration ("we will not receive proceeds"), or rights offering — using cover-page cues, then aggregate by sector and vintage. The output supports academic studies on capital deployment, IR-team benchmarking for follow-on filings, and lender-side analysis of post-IPO leverage trajectories.

Build a lockup-expiration and insider-sale surveillance calendar

Parse the lockup paragraphs in the UNDERWRITING section to capture lockup duration, covered parties (directors, officers, 5% holders), and carve-outs, then key the schedule by filedAt plus lockup days. Combine with the Principal Stockholders table to identify covered share counts. Compliance and surveillance teams use the resulting calendar to flag Form 4 sales that fall inside contractual restrictions and to anticipate supply overhangs.

Map IPO governance and anti-takeover provisions across vintages

Extract authorised share counts, multi-class voting structures, classified-board language, forum-selection clauses, jury-trial waivers, and supermajority provisions from the Description of Capital Stock section, keyed by entities[].stateOfIncorporation and IPO year. The resulting panel supports governance research, ISS/Glass Lewis-style scoring, and counsel precedent searches when drafting or negotiating charter provisions for new issuers.

Dataset Access

Dataset Index JSON API: https://api.sec-api.io/datasets/form-424b4-files.json Returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total records and size, form types, container format, and file types) along with the full dataset download URL and a list of individual monthly container files with per-container size, record counts, updated timestamps, and download URLs. Poll this endpoint to detect which containers were modified during the most recent refresh and download only those that changed. This endpoint does not require an API key.

Example
1 {
2 "datasetId": "1f13365b-9ae0-691e-9e25-f2510da800ec",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-424b4-files.zip",
4 "name": "Form 424B4 Files Dataset",
5 "updatedAt": "2026-04-28T02:57:27.107Z",
6 "earliestSampleDate": "1994-01-01",
7 "totalRecords": 13982,
8 "totalSize": 2628326650,
9 "formTypes": ["424B4"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "PDF", "HTML"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-424b4-files/2026/2026-04.zip",
15 "key": "2026/2026-04.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-28T02:57:27.107Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-424b4-files.zip?token=YOUR_API_KEY Downloads the complete archive of all Form 424B4 filings from January 1994 to present in a single ZIP file. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-424b4-files/2026/2026-04.zip?token=YOUR_API_KEY Downloads a single monthly container ZIP file, where each container holds accession-number subfolders containing a metadata.json file and the 424B4 prospectus HTM document along with any attached exhibits. This endpoint requires an API key.

Frequently Asked Questions

What form does this dataset cover?

The dataset covers Form 424B4 — the definitive prospectus filed under Rule 424(b)(4) of the Securities Act of 1933. A 424B4 is filed after the SEC declares the underlying registration statement effective and the offering has priced, supplying the pricing information omitted from the registration statement under Rule 430A or 430B together with any substantive changes from the preliminary prospectus.

What does one record in this dataset represent?

One record represents a single Form 424B4 submission accepted by EDGAR, identified by its 18-digit accession number. On disk the record is a folder containing exactly two artifacts: a metadata.json envelope describing the filing and the primary 424B4 prospectus document (typically a single self-contained .htm/.html file, occasionally .txt for legacy ASCII filings).

Who is required to file Form 424B4?

The issuer-registrant of the underlying Securities Act registration statement is the legal filer. This includes domestic operating companies filing off Form S-1 or S-3, foreign private issuers filing off Form F-1 or F-3, IPO issuers, follow-on and secondary offering issuers, and issuers in registered resale offerings on behalf of selling security holders. Underwriters and selling stockholders appear inside the prospectus but never file it; the registrant is always the filer of record.

When does a 424B4 have to be filed after pricing?

Rule 424(b) requires the prospectus to be filed no later than the second business day following the earlier of (i) the date the prospectus is first used after effectiveness, or (ii) the date the offering price is determined. In a typical IPO the issuer prices after the close on day T, trading opens on T+1, and the 424B4 is filed on the morning of T+1 (or by T+2 at the latest).

What time period does the dataset cover?

The earliest 424B4 records date from January 1994, when EDGAR began accepting Securities Act prospectuses electronically. Coverage is effectively complete for domestic registrants from May 1996 onward (after Regulation S-T phased in mandatory EDGAR filing) and for foreign private issuers from the early 2000s, and the dataset continues to the present.

What file format is the dataset distributed in?

The dataset is delivered as monthly ZIP containers keyed by year and month (for example, 2025/2025-11.zip). Inside each container are accession-number folders, each holding a metadata.json envelope and the primary prospectus document. The file types present are TXT, JSON, PDF, and HTML; Form 424B4 does not carry an XBRL instance, so no XBRL data files are included.

How does Form 424B4 differ from Form 424B5?

Both 424B4 and 424B5 are definitive priced prospectuses, but they attach to different parent forms. 424B4 pairs with Form S-1 or F-1 and dominates IPOs and first-time registrants where Rule 430A pricing was deferred. 424B5 pairs with the Form S-3 or F-3 shelf and dominates seasoned-issuer follow-ons, ATMs, and block trades. A complete priced-equity dataset typically requires both.

Are documents incorporated by reference (10-K, 10-Q, 8-K) included with each record?

No. The dataset materialises only the 424B4 prospectus and its metadata.json envelope. Documents from the underlying registration statement (S-1, F-1, S-3), preliminary prospectuses (424B1/B2/B3), other 424(b) supplements, free writing prospectuses, and any periodic reports incorporated by reference live in their own EDGAR filings and are outside the scope of this dataset.