The Form 424B1 Files dataset is a structured corpus of every Form 424B1 prospectus submitted to EDGAR — the final, pricing-completion prospectuses filed pursuant to Rule 424(b)(1) under the Securities Act of 1933. Each record in the dataset corresponds to one EDGAR submission of Form 424B1, identified by its SEC accession number, and packages the EDGAR submission metadata together with the original submission documents (image attachments excluded). The underlying form is filed by Securities Act registrants — most often companies pricing an initial public offering off a freshly effective Form S-1 — to deliver the final offering price, share count, underwriting discounts, and related pricing-dependent terms that Rule 430A allowed the issuer to omit at the time the registration statement became effective. Coverage runs from January 1994, the start of EDGAR, through the most recent refresh, distributed as monthly ZIP containers.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset assembles the full population of Form 424B1 filings transmitted to EDGAR. Form 424B1 is the prospectus filed under Rule 424(b) of the Securities Act of 1933 to deliver a final prospectus containing information previously omitted from the prospectus included in an effective registration statement in reliance on Rule 430A — most importantly the final public offering price, the number of securities being offered, the underwriting discounts and commissions, and any related pricing-dependent terms. Filers must submit the prospectus no later than the second business day following the earlier of the date the offering price is determined or the date the prospectus is first used in connection with a public offering.
A single record corresponds to one EDGAR submission of Form 424B1, identified by its SEC accession number. Physically, the record is one accession-number subfolder containing (a) a metadata.json file describing the EDGAR submission and (b) the original submission documents that were transmitted to EDGAR for that filing, with image attachments excluded. The dataset is partitioned into monthly ZIP archives organized under a year directory (for example, 2025/2025-12.zip), and inside each archive a top-level period folder (e.g., 2025-12/) contains one subfolder per accession number (with dashes removed, e.g., 000110465925120346/).
The record unit is the filing, not the issuer, the registration statement, or the offering. Each new 424B1 transmission to EDGAR produces its own folder, and a single registration statement that triggers multiple 424B1 filings will appear as multiple separate records. The dataset spans EDGAR submissions from January 1994 to the present and is distributed as ZIP containers; the file types delivered inside each record are TXT, JSON, HTML, and PDF.
Form 424B1 is the pricing-supplement variant of the Rule 424(b) family, so the document inside a record is a complete, ready-to-deliver final prospectus rather than a preliminary or supplementary one. This distinguishes it structurally from sister forms such as 424B2 (primary at-market offerings of debt), 424B3 (substantive change supplements), 424B4 (pricing variant where the preliminary prospectus also relied on Rule 430A but with later changes), and 424B5 (forward-incorporation supplements to shelf registrations).
The filing is not a standalone registration; it operates against an effective registration statement (typically Form S-1 or F-1 for initial offerings, or S-3/F-3 for shelf takedowns) and inherits that statement's SEC file number (the 333- series number visible on the prospectus cover and recorded in the metadata fileNo field). The prospectus itself is the disclosure document delivered to investors, written as an investor-facing offering document rather than as an internal SEC form.
Each record contains two distinct content layers:
metadata.json and in the SGML wrappers that precede the body of each document. This layer encodes filing-level facts: who filed, when, under what file number, against which act, with which CIKs, and with what list of constituent documents..htm/.html files for modern filings, as .txt for legacy filings from the 1990s and early 2000s, and occasionally as .pdf. Each substantive document is wrapped in EDGAR's <DOCUMENT> envelope with <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, and <TEXT> markers preceding the underlying body, mirroring the original SGML submission as transmitted to EDGAR.A typical 424B1 submission consists of the primary prospectus document (sequence 1, type 424B1) and one or more graphic attachments (type GRAPHIC) referenced from the prospectus. The graphic attachments are intentionally excluded from this dataset, so the folder generally contains the metadata file, the prospectus document, and any non-image exhibits or supporting documents that accompanied the submission.
The metadata.json file is a JSON object that describes the submission as EDGAR sees it. Its top-level fields include:
formType — always "424B1" for records in this dataset.accessionNo — the canonical SEC accession number with dashes (e.g., "0001104659-25-120346"); the same number, with dashes removed, names the enclosing folder.id — an opaque internal record identifier.filedAt — the ISO-8601 timestamp (with timezone offset) at which EDGAR accepted the filing.description — a human-readable form description such as "Form 424B1 - Prospectus [Rule 424(b)(1)]".linkToFilingDetails — URL to the primary 424B1 document on sec.gov.linkToTxt — URL to the complete SGML-wrapped submission text bundle.linkToHtml — URL to the EDGAR filing index page for the submission.linkToXbrl — URL to XBRL data when present; typically empty for 424B1 because Rule 424(b)(1) prospectus filings do not carry inline XBRL financial reports as their primary content.documentFormatFiles — an array describing each constituent document of the submission. Each entry contains sequence (numeric string ordering within the submission, with " " reserved for the complete submission text bundle), size (bytes, as a string), documentUrl (direct sec.gov URL), description (short label such as "424B1", "GRAPHIC", or "Complete submission text file"), and type (the EDGAR document-type tag, e.g., "424B1", "GRAPHIC").dataFiles — an array of XBRL/data file descriptors; typically empty for 424B1 filings.seriesAndClassesContractsInformation — an array used for fund-style filings to record series/class identifiers; usually empty for ordinary 424B1 prospectuses but populated for fund issuers.entities — an array of filer entities involved in the submission. Each entity object carries companyName with a role suffix in parentheses (e.g., "NextCure, Inc. (Filer)"), cik, tickers, type (matching the submission form type), act (the Securities Act number, "33" for the 1933 Act), fileNo (the 333- registration file number), filmNo (EDGAR film number), irsNo (IRS Employer Identification Number, digits only), stateOfIncorporation (two-letter jurisdiction code), fiscalYearEnd (MMDD), and sic (Standard Industrial Classification code with text description).The entities array reflects the EDGAR header roster: in addition to the issuer, it can include selling shareholders and other reporting persons with their own CIKs when the submission identifies them as filers. The role suffix on companyName is the canonical way to distinguish issuer from selling shareholders and other co-filers.
The substantive content of a 424B1 record is the prospectus itself. After the SGML envelope (<DOCUMENT>/<TYPE>424B1/<SEQUENCE>1/<FILENAME>.../<DESCRIPTION>424B1/<TEXT>), the body of the document opens with the standard prospectus cover page and proceeds through the conventional sections of a Securities Act prospectus.
The cover page carries the filing notice line "Filed Pursuant to Rule 424(b)(1)" together with the corresponding registration statement number (the same 333- file number that appears in metadata.json). It identifies the issuer by legal name, declares the type and amount of securities offered (e.g., "Common Stock — Up To 2,523,477 Shares of Common Stock Offered By The Selling Stockholders"), states the offering price or the basis for determining it, identifies the listing exchange and trading symbol where applicable, includes the last reported sale price for already-listed securities, prints the standard SEC disclaimer ("Neither the Securities and Exchange Commission nor any state securities commission has approved or disapproved..."), dates the prospectus, and points readers to the Risk Factors section.
For underwritten offerings, a price-and-proceeds table on the cover summarizes the public offering price, the underwriting discount, and the proceeds to the issuer (and, where applicable, to selling shareholders), per share and in aggregate, with separate columns for any over-allotment option.
A table of contents follows the cover, and then the Prospectus Summary opens the body. The summary distills the issuer's business, market opportunity, competitive strengths, and strategy, and recites the key terms of the offering — number of shares offered, pre- and post-offering share counts, summary use of proceeds, lock-up arrangements, listing details, and the assumed initial public offering price (or established price for follow-on offerings).
The Risk Factors section enumerates material risks across business, industry, regulatory, financial, and securities-related categories. For initial public offerings it typically includes risks specific to becoming a public company and to the dilutive impact of the offering. The section is narrative, organized under bolded sub-headings, and is among the longest sections of the document.
Use of Proceeds describes how the issuer (and selling shareholders, if any) intend to apply the offering proceeds. Dividend Policy states whether dividends are expected. Capitalization presents an as-of-date capitalization table on actual, pro forma, and as-adjusted bases that reflect the offering. Dilution computes the difference between the offering price and the post-offering net tangible book value per share for new investors.
The prospectus reproduces or incorporates by reference the Management's Discussion and Analysis of Financial Condition and Results of Operations, the Business section (products, customers, competition, intellectual property, regulation, properties, employees), and the Management section (directors and executive officers, board committees, compensation, related-party transactions, principal stockholders).
For secondary offerings, a Selling Stockholders table identifies the holders of the registered shares with their pre-offering ownership, the number of shares being offered, and their post-offering ownership. Footnotes describe each holder's affiliations and the source of their shares (for example, conversion of preferred stock, exercise of warrants, or PIPE participation).
The Description of Capital Stock section summarizes the rights of the registered class, the issuer's authorized capital structure, anti-takeover provisions in the charter and bylaws, and applicable state-law provisions. Material U.S. Federal Income Tax Considerations summarizes tax treatment for U.S. and non-U.S. holders.
For underwritten offerings, an Underwriting section names the underwriters, sets out the underwriting agreement terms, the over-allotment option, lock-up agreements, stabilization activities, indemnification, and FINRA conduct rules. For non-underwritten or selling-shareholder offerings, a Plan of Distribution section describes the methods by which the securities may be sold (market transactions, block trades, at-the-market sales, privately negotiated transactions, and similar mechanics).
Closing sections identify counsel passing on the validity of the securities, the independent registered public accounting firm whose report is included or incorporated, and instructions for accessing additional SEC filings. The "Incorporation by Reference" section lists the prior periodic and current reports (10-K, 10-Q, 8-K, proxy statements) that are incorporated into the prospectus, which matters because financial statements and other disclosures may live outside the 424B1 document itself.
For initial public offerings and other Form S-1-based filings, audited financial statements and accompanying notes appear within the prospectus, including the balance sheet, statements of operations, comprehensive income (loss), changes in stockholders' equity, cash flows, and the auditor's report. For shelf-takedown 424B1 filings, financial statements are typically incorporated by reference from the registrant's most recent Form 10-K and intervening Form 10-Qs and Form 8-Ks rather than reproduced in full.
Form 424B1 prospectuses do not carry separate signature pages within the prospectus body the way registration statements do; the filing is authorized through the underlying registration statement and the EDGAR submission process. The cover page date and the EDGAR filedAt timestamp serve as the operative dating of the document.
The file types found in the dataset are TXT, JSON, HTML, and PDF. The JSON file is always the per-record metadata.json. Modern 424B1 prospectuses are delivered almost exclusively as .htm/.html documents; legacy filings from the 1990s and early 2000s deliver the prospectus as plain .txt (with tables rendered through monospace alignment); and a minority of filings include or substitute a .pdf rendering of the prospectus. The primary prospectus is typically a single document of substantial size, and multi-document submissions occur when the issuer files separate exhibits or supporting documents alongside the prospectus.
Each record includes the metadata file describing the EDGAR submission and the full set of non-image submission documents, preserving EDGAR's SGML document envelopes around each document body. The substantive prospectus is delivered intact, so cover-page disclosures, narrative sections, tables, and (where applicable) embedded financial statements all flow through to the dataset. Cross-document context such as the underlying registration statement (Form S-1, Form S-3, Form F-1, Form F-3, etc.) is identified by fileNo in the metadata but is not bundled into the record.
Image files referenced by the prospectus — typically issuer logos, charts, and product photographs delivered as .jpg, .gif, or similar — are intentionally excluded from the ZIP, even though they appear as GRAPHIC-type entries in documentFormatFiles within metadata.json. The original registration statement and any prior or subsequent prospectus supplements are separate EDGAR submissions with their own accession numbers and are not included in a 424B1 record. Documents incorporated by reference (10-K, 10-Q, 8-K, proxy statements) are likewise separate filings, identified inside the prospectus narrative but not packaged into the record. XBRL data, when applicable, is referenced via linkToXbrl and dataFiles rather than embedded, and is generally absent for Rule 424(b)(1) prospectus filings.
Form 424B1 has been used continuously since the SEC adopted the modern Rule 424(b) numbering convention, and the dataset spans EDGAR submissions from January 1994 to the present. The substantive disclosure structure of a final prospectus has been stable for decades, but several layers of regulatory and presentational evolution affect what a modern record looks like compared with one from the mid-1990s.
On the content side, the Risk Factors, MD&A, and executive compensation sections have grown materially over time as a result of the SEC's 2005 Securities Offering Reform, the JOBS Act (2012) introduction of emerging growth company accommodations and confidential submission treatment, the 2018 Smaller Reporting Company threshold expansion, and ongoing rulemaking around cybersecurity, human capital, and pay-versus-performance disclosure. Post-2005 IPO prospectuses also reflect the SEC's free-writing-prospectus framework, which is referenced in the Plan of Distribution and Underwriting sections. Selling Stockholders disclosure expanded after the 2007 amendments to Rule 144 and related staff guidance on resale registrations.
On the format side, prospectuses filed in the 1990s and very early 2000s are predominantly plain ASCII text inside the EDGAR SGML envelope, with tabular content rendered using monospace alignment and rules. Beginning in the late 1990s and accelerating through the early 2000s, HTML became the dominant body format, enabling typography, embedded tables, and graphics. PDF renderings appear sporadically across the time range, sometimes as supplemental representations alongside an HTML or text version. The EDGAR <DOCUMENT> envelope itself has remained consistent across the period, so the SGML headers preceding each document body in the dataset look essentially the same in 1994 and in 2025.
Several nuances matter for downstream interpretation:
fileNo back to the parent registration statement and forward to any subsequent 424B-family supplements.entities array in metadata.json may list multiple filers — issuer plus selling shareholders — each with its own CIK. The role suffix on companyName (e.g., (Filer)) is the canonical way to distinguish them.documentFormatFiles array is the authoritative inventory of what EDGAR received; comparing it against the actual files on disk reveals the image exclusion and identifies multi-document submissions.<TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> tags inside that envelope mirror the corresponding entries in documentFormatFiles.seriesAndClassesContractsInformation is populated with series and class identifiers that do not appear for ordinary corporate prospectuses; consumers handling fund issuers should account for this variability.The filer of record is the Securities Act of 1933 registrant whose registration statement is already on file and effective. The 424B1 supplements that effective registration statement with the final, deliverable statutory prospectus containing the pricing-dependent information that Rule 430A allowed the issuer to omit at effectiveness — most importantly the public offering price, underwriting discounts and commissions, and any price-derived terms.
Filers typically include:
Underwriters, selling security holders, and other deal participants are described inside the prospectus but are never the EDGAR filer of the submission.
The filing is event-driven, not periodic. The deadline is mechanical: the completed prospectus must be filed no later than the second business day following the earlier of:
In practice, a 424B1 lands on EDGAR within a day or two of pricing, making it a reliable transactional marker that a registered offering (typically an IPO) has been priced and launched. There is no annual or periodic 424B1; each record is tied to one specific offering under one specific effective registration statement.
The filed prospectus is deemed part of the registration statement as of the time it is first used after effectiveness, with attendant Section 11 liability for the registrant, its directors, signing officers, and underwriters.
Form 424B1 sits inside a tightly clustered family of Rule 424(b) prospectus filings and is closely tied to the registration statements those prospectuses complete. The most useful comparison targets are the other Rule 424(b) sub-types, the preliminary Form 424A, the parent S-1 registration statement, and the Rule 497 prospectus regime used by registered investment companies. The key axis of differentiation is which Rule 430 omission is being cured, what offering structure underlies the filing, and whether the document is preliminary, pricing-completion, or update-driven.
Form 424B4 is the source of most user confusion. Both 424B1 and 424B4 deliver the final prospectus containing Rule 430A pricing information after a just-effective registration statement. The distinction is narrow but firm: 424B1 is used when the only material added is the previously omitted pricing information; 424B4 is used when the final prospectus also reflects substantive changes from the most recent preliminary prospectus. Any comprehensive analysis of priced primary offerings (especially IPOs) must combine 424B1 with 424B4.
Both are pricing-completion filings, but for shelf offerings rather than initial offerings. Form 424B2 covers primary delayed offerings under Rule 415 (typically MTNs and structured notes off an S-3); Form 424B5 covers prospectus supplements that add information not in the base prospectus or a prior supplement (typical for follow-ons, secondaries, and registered direct placements off an S-3 shelf). 424B1 closes a discrete offering off a freshly effective S-1; 424B2/B5 close individual take-downs off an existing shelf and are usually supplements layered on a base prospectus rather than full standalone prospectuses.
424B3 carries substantive changes or additions to a previously filed prospectus, including post-effective supplements and merger/exchange offer prospectuses tied to Form S-4. It is update-driven, not pricing-driven; it does not exist to cure a Rule 430A omission at the moment of pricing.
Form 424B7 supplies information previously omitted in reliance on Rule 430B, in connection with resales by selling security holders. It names selling stockholders, share amounts, and plan-of-distribution details. 424B1 is issuer primary, Rule 430A pricing; 424B7 is selling-stockholder mechanics, Rule 430B information.
Form 424B8 covers prospectuses delivered after the time of sale under access-equals-delivery timing rules. It is a procedural-timing variant, not a substantive content peer to 424B1, and does not exist to cure Rule 430A omissions.
Form 424A is the pre-effectiveness preliminary prospectus carrying changes that would require recirculation. It is the same offering as the eventual 424B1 but earlier in the timeline and without final price terms. Researchers reconstructing the path from red herring to priced deal will pair 424A with the matching 424B1 or 424B4.
Form 424H is the ABS-specific preliminary prospectus filed at least three business days before first sale under Regulation AB. It shares the 424 prefix but applies to a narrow ABS issuer population, is preliminary rather than pricing-completion, and does not serve the Rule 430A function that defines 424B1. It is also tracked as Form 424H in filing-type catalogs.
S-1 is the parent registration statement that 424B1 most often supplements. The S-1 carries the full disclosure package but is permitted under Rule 430A to omit final price-related terms; the 424B1 then supplies the price, underwriting discount, and offering size after effectiveness. Other parents pair with different 424(b) sub-types: S-3 with 424B2/B5/B7, Form S-11 with the 424(b) family for REITs, F-1/F-3 for foreign private issuers, and Form S-4 typically with 424B3. The registration statement and the 424B1 are complementary, not substitutable: the registration statement supplies the substantive disclosure, the 424B1 binds it to a real priced transaction. For programmatic access, Form S-3 and other registration-statement types are also cataloged in the SEC-API filing-type list.
Form 497 plays the role for registered investment companies (funds registered on Form N-1A, Form N-2, or Form N-3) that the 424(b) series plays for operating companies. The substantive content is fund-oriented (investment objectives, fees, share classes, performance) rather than offering-oriented (pricing, underwriting, use of proceeds). The filer populations and disclosure schemas do not overlap; a comprehensive prospectus universe must pull both but should not merge them without preserving the operating-company versus investment-company distinction.
The 424B1 dataset is defined by the conjunction of three features: it is a final prospectus, not a preliminary one; it is the specific 424(b) sub-type used when the only material added relative to the registration statement is Rule 430A omitted pricing information; and it is most often tied to discrete primary offerings (frequently IPOs) off a just-effective S-1 rather than to shelf take-downs. Adjacent 424(b) sub-types replace it when the offering structure shifts: 424B4 when post-pricing changes go beyond price, 424B2 and 424B5 for shelf take-downs, 424B7 for selling-stockholder offerings under Rule 430B, and 424B3 for substantive updates. For full coverage of priced primary offerings, 424B1 should be analyzed jointly with 424B4 and matched back to the corresponding 424A and S-1 filings.
Form 424B1 filings are the final, priced prospectus that completes information omitted under Rule 430A. Each professional group below reads a different slice of the same document.
ECM bankers use the cover page (final price, share count, gross proceeds), the underwriting section (gross spread, reallowances, over-allotment, stabilization language), and lock-up terms to reconstruct league tables, benchmark spreads by deal size and sector, and build precedent-transaction pages for pitches.
Initiating-coverage analysts treat the 424B1 as the canonical issuer description at pricing. The business section, MD&A, audited financials, use of proceeds, and risk factors feed first models, valuation frameworks, and bear-case write-ups.
IPO and post-listing investors use the filing as the primary diligence document. They focus on revenue and customer concentration, segment economics, the cap table, dilution schedules, selling-shareholder participation, and lock-up expiry to time allocations and entries.
Issuer- and underwriter-side counsel benchmark drafting across recently priced deals: risk factor framing, plan of distribution, directed-share programs, FINRA disclosures, and EGC/SRC status. The corpus supports precedent searches and comment-letter responses.
Reviewers use the financials, MD&A, and critical accounting policies as the IPO baseline, watching revenue recognition, segment reporting, non-GAAP reconciliations, ICFR disclosures, related-party items, and restatements. Used for pre-IPO accounting-quality scoring and post-IPO trend comparison.
When 424B1 filings cover debt or convertibles, credit teams pull indenture summaries, covenants, ranking, change-of-control puts, and use of proceeds, then reconcile final terms against preliminary marketing to feed spread and relative-value models.
Quant teams extract features such as offer-price-to-range positioning, upsize/downsize patterns, syndicate composition, lock-up length, insider participation, and float ratios. These power IPO underperformance models, post-lock-up drift signals, and priced-deal calendar strategies.
Diligence teams pull pricing multiples at offer, post-money capitalization, founder and sponsor stakes, and anti-dilution mechanics from the description of capital stock for valuation triangulation and exit-comparable work.
Broker-dealer and exchange compliance functions use timing and content for syndicate-conduct review, stabilization monitoring, and verification that final terms match marketed terms within the Rule 424(b)(1) two-business-day window. Metadata supports reconciliation against internal blotter and allocation systems.
Teams preparing follow-ons or future deals study peer 424B1 filings to benchmark disclosure structure, executive compensation summaries, governance language, and growth-strategy framing for disclosure committee review.
Researchers studying IPO underpricing, underwriter spreads, lock-up effects, and long-run performance use the corpus as a structured panel of priced offerings back to 1994 for cross-vintage studies of pricing, fee compression, and disclosure evolution.
Vendor data engineers normalize the filings into tables of price, shares offered, underwriting discount, and use-of-proceeds categories. RAG builders use the mix of HTML, TXT, PDF, and JSON metadata for prospectus-grounded Q&A, peer comparison, and drafting assistance for capital-markets workflows.
The Form 424B1 corpus supports a small number of high-leverage workflows that center on priced primary offerings, most often IPOs off a freshly effective S-1.
IPO league-table and gross-spread benchmarking. Parse the cover-page price-and-proceeds table and the Underwriting section to extract final offering price, shares offered (base and over-allotment), gross spread, and net proceeds across vintages. Joining on the entities array and sic code yields per-bank, per-sector league tables and gross-spread distributions for ECM pitch books and fee benchmarking.
Offer-price-vs-range and upsize/downsize signals for quant IPO models. Pair each 424B1 with its matching 424A preliminary prospectus (via fileNo) to compute price-vs-range positioning, share-count revisions, and final-versus-marketed deal size. These features feed first-day-pop, lock-up-expiry drift, and post-IPO underperformance models.
Lock-up calendar and float-release tracking. Extract lock-up length, covered holders, and early-release triggers from the Underwriting section, plus pre- and post-offering ownership from the Selling Stockholders and Principal Stockholders tables. Output is a forward calendar of unlock dates and unlocked-share counts used by event-driven desks and risk overlays.
Use-of-proceeds classification for sector and macro analysis. Run the Use of Proceeds narrative through a classifier (debt repayment, R&D, acquisitions, working capital, selling-shareholder cash-out) to build time series of how IPO capital is being deployed by sector and cohort. Useful for strategy notes and policy research on capital formation.
Risk-factor precedent search for disclosure counsel. Index the Risk Factors section across recently priced deals so issuer- and underwriter-side counsel can retrieve peer language by sector, EGC/SRC status, and risk theme (cybersecurity, supply chain, going-concern, customer concentration). Supports drafting committee review and SEC comment-letter responses.
Selling-stockholder and insider-participation mapping. Mine the Selling Stockholders table and footnotes to identify VC, PE, founder, and PIPE participants by CIK, the share counts they are monetizing at IPO, and the conversion or warrant source of those shares. Feeds cap-table reconstructions, sponsor-exit tracking, and follow-on diligence.
Prospectus-grounded RAG for capital-markets Q&A. Use the per-record metadata.json plus the HTML/TXT prospectus body as a clean retrieval corpus for issuer-specific Q&A, peer comparison ("show me cover-page economics for biotech IPOs above $200M in 2024"), and drafting assistance, with the SGML envelope stripped and section headings preserved as chunk boundaries.
The Form 424B1 Files dataset is accessible through three endpoints: a JSON metadata API, a full archive download, and per-container downloads.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-424b1-files.json
This endpoint returns dataset-level metadata, including name, description, last updated timestamp, earliest sample date, total record and size counters, covered form types (424B1), container format (ZIP), included file types (TXT, JSON, HTML, PDF), the full dataset download URL, and the list of individual container files. Each container entry includes its key, size, record count, last updated timestamp, and direct download URL. This endpoint does not require an API key and can be polled to detect which containers were updated in the most recent refresh, so downstream pipelines only re-download what changed.
1
{
2
"datasetId": "1f13365b-9ae0-6936-94a1-5039a224092a",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-424b1-files.zip",
4
"name": "Form 424B1 Files Dataset",
5
"updatedAt": "2026-04-15T06:12:45.223Z",
6
"earliestSampleDate": "1994-01-01",
7
"totalRecords": 6266,
8
"totalSize": 727466298,
9
"formTypes": ["424B1"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-424b1-files/2026/2026-04.zip",
15
"key": "2026/2026-04.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-04-15T06:12:45.223Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-424b1-files.zip?token=YOUR_API_KEY
Downloads the complete Form 424B1 Files dataset, covering filings from January 1994 to the latest refresh, as a single ZIP archive. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-424b1-files/2026/2026-04.zip?token=YOUR_API_KEY
Downloads one monthly container ZIP instead of the full archive, which is useful for incremental syncs or backfilling a specific period. Replace the year and month segment with any container key returned by the index API. This endpoint requires an API key.
The dataset covers Form 424B1, a final prospectus filed under Rule 424(b)(1) of the Securities Act of 1933 to deliver pricing information (offering price, share count, underwriting discounts, and related terms) that the issuer omitted from the effective registration statement in reliance on Rule 430A. It is most often associated with priced primary offerings such as IPOs filed off a freshly effective Form S-1.
One record corresponds to one EDGAR submission of Form 424B1, identified by its SEC accession number. Physically, each record is an accession-number subfolder containing a metadata.json file that describes the EDGAR submission and the original submission documents transmitted to EDGAR (image attachments excluded).
The filer is the Securities Act of 1933 registrant whose registration statement is already effective — typically a domestic operating company on Form S-1 conducting an IPO or follow-on, a foreign private issuer on Form F-1, a smaller reporting company or emerging growth company, or a REIT/BDC issuer. Underwriters and selling security holders are described inside the prospectus but are never the EDGAR filer of record.
The prospectus must be filed no later than the second business day following the earlier of (a) the date the offering price is determined or (b) the date the prospectus is first used in connection with the public offering after the registration statement has become effective. In practice, a 424B1 lands on EDGAR within a day or two of pricing.
Both 424B1 and 424B4 are final prospectuses delivering Rule 430A pricing information after the registration statement is effective. 424B1 is used when the only material added is the previously omitted pricing information, while 424B4 is used when the final prospectus also reflects substantive changes from the most recent preliminary prospectus. Comprehensive analysis of priced primary offerings should combine the two.
Each record includes a JSON metadata file (metadata.json) plus the original submission documents wrapped in EDGAR's SGML <DOCUMENT> envelopes. Document bodies are delivered as HTML for modern filings, plain TXT for legacy filings from the 1990s and early 2000s, and occasionally PDF. Image attachments (GRAPHIC-type entries) are intentionally excluded from the ZIP.
The dataset spans EDGAR submissions of Form 424B1 from January 1994 — the start of EDGAR — through the most recent monthly refresh. Pre-EDGAR equivalents that existed in paper under the same Rule 424(b)(1) regime are not included.