The Form 424B2 Files Dataset is a full-text archive of every final prospectus filed with the SEC under Rule 424(b)(2) of the Securities Act of 1933 — the pricing supplements that large bank holding companies, broker-dealer holding companies, and their finance-subsidiary issuers use to take specific tranches of structured notes, medium-term notes, and other shelf-registered securities off an already-effective S-3 or F-3 shelf. One record is one EDGAR accession: a single 424B2 submission uniquely identified by its 18-digit accession number, bundling a parsed metadata.json header together with the primary HTML pricing supplement, the inline-XBRL EX-FILING FEES exhibit when present, the extracted XBRL instance XML, and any PDF rendering the filer included in the same submission. Electronic coverage begins January 1, 1994 and extends through the most recent monthly refresh; records are distributed as monthly ZIP containers whose file types are TXT, JSON, HTML, and PDF. Because Rule 424(b)(2) is transactional rather than periodic, the dataset's growth tracks takedown activity on bank and finance-subsidiary shelf programs rather than any fixed reporting calendar.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset is the corpus of Form 424B2 submissions accepted by EDGAR, packaged as one folder per accession inside monthly ZIP containers. Form 424B2 is a final prospectus filed pursuant to Rule 424(b)(2) under the Securities Act of 1933. Rule 424(b)(2) applies when a prospectus is used in a primary offering of securities on a delayed basis under Rule 415 (shelf registration) and the prospectus discloses the public offering price, description of securities, or specific terms that were omitted from the effective registration statement. In practice, the overwhelming majority of 424B2 filings are pricing supplements filed under an effective S-3 by large financial institutions — money-center banks and their finance-subsidiary issuers — to launch individual tranches of structured notes, equity-linked notes, autocallable and buffered jump securities, medium-term notes, currency-linked notes, and similar programmatic debt instruments.
A 424B2 document is therefore not a standalone prospectus. It is a thin supplement that sits on top of an earlier base prospectus, product supplement, and (for structured products) underlying supplement. Its role is to complete the disclosure with pricing-date-specific terms: the exact security name, CUSIP/ISIN, aggregate principal amount, public offering price, underwriting discount and commissions, net proceeds, pricing date, trade date, original issue date, stated maturity date, coupon or contingent-coupon mechanics, payoff formula, reference asset (single stock, ETF, index, basket, currency, commodity, or rate), downside protection parameters (barrier, buffer, trigger), call and redemption features, estimated value on the pricing date, and supplemental U.S. federal income tax characterization.
The dataset is distributed in the container format ZIP, with contained file types TXT, JSON, HTML, and PDF. Electronic coverage begins January 1, 1994 and spans three distinct EDGAR format eras — ASCII, HTML, and inline-XBRL for filing fees — so each record reflects whichever format era applies to its filedAt timestamp.
One record in the Form 424B2 Files Dataset is one EDGAR accession: a single 424B2 submission accepted by the SEC at a specific date and timestamp, uniquely identified by its 18-digit accession number. On disk, each record is a folder inside a monthly ZIP container, named with the accession number with dashes stripped (accession 0001213900-25-048708 becomes folder 000121390025048708). The folder bundles the documents the filer transmitted to EDGAR for that acceptance together with a metadata.json file that reproduces the parsed EDGAR filing header.
A 424B2 accession almost always corresponds to a single takedown off an effective Form S-3 shelf — most commonly a pricing supplement for one tranche of structured notes, medium-term notes, or other shelf-registered securities — together with any filing-fees exhibit, preliminary pricing supplement, and PDF copy included in that same submission. The record is self-contained to that accession: the base prospectus, product supplement, and (for structured programs) underlying supplement that the pricing supplement incorporates by reference live in their own EDGAR filings under separate accession numbers and are therefore not present in this record.
Every record folder contains at minimum two components: a metadata.json describing the parsed EDGAR header and a primary .htm prospectus / pricing supplement carrying the substantive disclosure. Most recent records (post-2022 filing-fee rule) additionally contain a third component — the inline-XBRL EX-FILING FEES exhibit — and a fourth — the extracted XBRL instance XML produced from that exhibit and referenced under dataFiles with type: "XML". A smaller subset of records (most visibly HSBC-family filings made through filer-agent CIK 0001104659) additionally ships a PDF rendering of the prospectus alongside the HTML.
The file extensions actually present on disk are .json, .htm, and .pdf. The dataset index lists TXT as a possible type, but this refers to the EDGAR "complete submission text" file (the flat .txt rendering of the full accession); that file is referenced by URL in metadata.json but is deliberately not shipped. Graphic files (.jpg, .gif) referenced inside the HTML prospectus are likewise omitted by design — the dataset excludes image binaries.
All .htm payloads (both the primary 424B2 and the filing-fees exhibit) retain the EDGAR SGML document wrapper around the HTML body:
1
<DOCUMENT>
2
<TYPE>424B2
3
<SEQUENCE>1
4
<FILENAME>ms8136_424b2-14138.htm
5
<DESCRIPTION>PRICING SUPPLEMENT NO. 8,136
6
<TEXT>
7
<HTML>...</HTML>
8
</TEXT>
9
</DOCUMENT>
Consumers who need a clean HTML document must strip the <DOCUMENT>…<TEXT>…</TEXT></DOCUMENT> envelope before handing the body to an HTML parser; the pseudo-tags <DOCUMENT>, <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, and <TEXT> are SGML header tokens, not HTML.
A typical recent record has this shape:
1
000121390025048663/
2
├── metadata.json
3
├── ea0243835-01_424b2.htm # primary pricing supplement (SGML-wrapped HTML)
4
└── ea024383501_ex-filingfees.htm # inline-XBRL filing-fee exhibit
An HSBC-style record additionally carries a PDF:
1
000110465925047770/
2
├── metadata.json
3
├── tm2513813d91_424b2.htm
4
├── tm2513813d91_424b2.pdf
5
└── tm2513813d91_ex-filingfees.htm
metadata.jsonExactly one metadata.json sits at the root of every accession folder and mirrors the parsed EDGAR filing header. Its top-level keys include:
formType — always "424B2" in this dataset.accessionNo — canonical dash-formatted accession (e.g. "0001213900-25-048708").description — consistently "Form 424B2 - Prospectus [Rule 424(b)(2)]".filedAt — ISO-8601 timestamp with timezone offset of EDGAR acceptance.linkToFilingDetails, linkToTxt, linkToHtml, linkToXbrl — absolute sec.gov URLs to the primary document, the complete-submission text, the filing index page, and, when present, the XBRL instance.documentFormatFiles — ordered array of every artefact uploaded to EDGAR for this accession.dataFiles — machine-readable data artefacts; for post-2022 filings this is almost always the single extracted XBRL instance document produced from the filing-fees exhibit.entities — one object per filer or co-filer identified in the EDGAR header.seriesAndClassesContractsInformation — structurally present but empty for 424B2 (the slot is populated for registered investment-company forms, not shelf takedowns).id — 32-character hex identifier assigned by the publisher.Each object in documentFormatFiles carries sequence (stringified; the complete-submission text uses a single space), size in bytes (as a string), documentUrl (absolute sec.gov URL; inline-XBRL exhibits are served through the sec.gov/ix?doc=… viewer endpoint), a filer-supplied description such as PRICING SUPPLEMENT, PRICING SUPPLEMENT NO. 8,136, PRELIMINARY PRICING SUPPLEMENT, EX-FILING FEES, or GRAPHIC, and a type drawn from the EDGAR document-type vocabulary (424B2, EX-FILING FEES, GRAPHIC, XML, or a blank type used for the complete-submission text). The array preserves the filer's original sequence ordering on EDGAR.
Each object in entities describes one filer or co-filer and typically carries cik, companyName with a role suffix ("JPMORGAN CHASE & CO (Filer)"), fileNo (the S-3 shelf registration number, typically a 333-xxxxxx base value with -01, -02, … suffixes distinguishing co-registrants), filmNo, stateOfIncorporation, fiscalYearEnd, act (Securities Act "33"), sic, type ("424B2"), and a tickers array populated for publicly listed issuers.
A defining characteristic of 424B2 filings is that two-entity records are the norm: a finance-subsidiary issuer co-files with its parent guarantor. Representative pairs seen in the sample include Citigroup Global Markets Holdings Inc. / Citigroup Inc.; JPMorgan Chase Financial Company LLC / JPMorgan Chase & Co.; Morgan Stanley Finance LLC / Morgan Stanley; GS Finance Corp. / The Goldman Sachs Group, Inc.; and BofA Finance LLC / Bank of America Corporation. The two entities share a common 333-xxxxxx shelf file number with -01 and -02 suffixes distinguishing the co-registrants.
sequence: "1")The document at sequence 1 is the prospectus supplement itself — most commonly a final pricing supplement, occasionally a preliminary pricing supplement, and in some cases a longer prospectus supplement enumerating specific terms. It is stored as a single .htm file wrapped in the SGML envelope shown above. The HTML body is typically tens to hundreds of kilobytes, built from narrative paragraphs interleaved with extensive tabular content. Filename conventions trace the filer agent used:
dp228666_424b2-us2563639d.htm, tm2513813d91_424b2.htm, ms8136_424b2-14138.htm.ea0243856-01_424b2.htm.form424b2.htm._prelim.htm.The substantive content of the prospectus supplement follows a conventional order:
Prospectus genres most common in this dataset are structured notes (autocallable, buffered jump, leveraged, digital, range-accrual), medium-term notes, equity- and index-linked notes, currency-linked notes, and occasional asset-backed or trust notes.
EX-FILING FEES exhibitMost post-2022 records ship a second .htm file containing the Rule 408 filing-fee exhibit — named variously ex-filingfees.htm, exfilingfees.htm, or an agent-prefixed variant such as ea024385601_ex-filingfees.htm. This exhibit is the Filing Fee Table rendered as inline XBRL: a valid XHTML document whose visible fee-table layout is interleaved with ix: tags declaring machine-readable tagged values. The inline-XBRL header declares the relevant namespaces (http://www.xbrl.org/2013/inlineXBRL, http://xbrl.sec.gov/ffd/20XXqX, http://xbrl.sec.gov/dei/20XX) and uses an <ix:header> block with <ix:hidden> elements to emit key non-numeric facts:
ffd:FormTp — parent form type, typically S-3.ffd:SubmissnTp — 424B2.ffd:RegnFileNb — the base shelf registration number (for example 333-270004).dei:EntityCentralIndexKey and dei:EntityRegistrantName — identifying the registrant.The visible body renders the fee-table rows with ix:nonFraction values for tagged monetary amounts — maximum aggregate offering price, fee rate, fee amount, previously paid offsets, and net fee due — and ix:nonNumeric values for the final-prospectus flag and narrative descriptions. The paired extracted instance document (ex-filingfees_htm.xml) is what populates the single entry commonly seen under dataFiles with type: "XML" and description: "EXTRACTED XBRL INSTANCE DOCUMENT". The same tagged facts therefore exist in two encodings: inline-in-HTML and as a standalone XML instance.
Note on scope: inline-XBRL tagging on 424B2 filings covers only the filing-fee exhibit. The prospectus supplement body itself is not XBRL-tagged on this form.
A subset of filers — most visibly HSBC USA Inc. through filer-agent CIK 0001104659 — includes a PDF rendering of the pricing supplement alongside the HTML. The PDF is a conventional binary PDF, named with a parallel stem (tm2513813d91_424b2.pdf alongside tm2513813d91_424b2.htm), and its documentFormatFiles entry uses type: "424B2" with description: "PRICING SUPPLEMENT", matching rather than shadowing the HTML entry. The HTML and PDF are redundant renderings of the same prospectus supplement.
Some 424B2 accessions carry a PRELIMINARY PRICING SUPPLEMENT either instead of, or in addition to, the final pricing supplement. When present, it shares filename conventions with the primary document but typically appends a _prelim marker (for example …_424b2_prelim.htm), and its description in documentFormatFiles states the preliminary character explicitly. A preliminary 424B2 is itself a standalone EDGAR submission and therefore its own record in this dataset; it is not a supersession of a later final filing.
Each record contains:
metadata.json,EX-FILING FEES exhibit HTML when present,The filer's original EDGAR sequence numbers and document ordering are preserved via the documentFormatFiles array in metadata.json.
Several items referenced in metadata.json are intentionally not shipped inside the folder:
.jpg, .gif) embedded in the HTML prospectus. The HTML retains the <img> references, but the binaries are omitted per the dataset's stated exclusion of image files. Historical-price charts, issuer logos, and reference-asset diagrams therefore appear as broken image links locally even though the surrounding narrative and tabular data are fully present..txt rendering of the full submission (the document with blank sequence and blank type in documentFormatFiles) is addressable via linkToTxt but not shipped inside the ZIP.Separately, several documents that are logically required to fully read a 424B2 are structurally outside the accession and therefore not in the record:
These live in their own accession records and must be resolved via the shelf fileNo in entities and located in the matching S-3 / 424B3 / 424B5 records.
The underlying Rule 424(b)(2) obligation has been stable since shelf-registration modernization in the 1980s, but the content of the 424B2 pricing supplement has evolved substantially across the 1994-to-present range covered by the dataset:
EX-FILING FEES exhibit becomes standard in the record, accompanied by the paired extracted XBRL instance XML. Earlier filings typically carry a smaller documentFormatFiles array and an empty or absent dataFiles list.The 1994-to-present span covers three distinct EDGAR format eras, and each record reflects whichever era applies to its filedAt:
<DOCUMENT> envelopes wrapping ASCII prospectus text, with tabular content hand-laid in monospace. The SGML wrapper is identical in shape to the modern one, but <TEXT> contains text rather than HTML.<DOCUMENT> envelope. This is the dominant format across the bulk of the dataset.EX-FILING FEES exhibit is submitted as inline-XBRL XHTML, and the extracted XBRL instance XML is published alongside. The prospectus body itself remains untagged. Records from this era contain a mixture of conventional HTML (primary document) and inline-XBRL HTML plus extracted XML (fee exhibit).Across all three eras the SGML <DOCUMENT> wrapper is retained verbatim in the stored files; format detection for a given record can rely on the envelope's <TYPE> and the body's opening tokens.
.htm in the record carries the EDGAR <DOCUMENT>…<TEXT>…</TEXT></DOCUMENT> wrapper. HTML tokenizers will misparse the pseudo-tags unless the envelope is removed first.fileNo in entities and located in the corresponding S-3 / 424B3 / 424B5 records.entities is the default rather than the exception for this form type. Treating the first entity as "the issuer" and the second as "the guarantor" matches the dominant industry pattern (finance-subsidiary issuer + parent guarantor), but the EDGAR header role suffix in companyName ((Filer), (Subject)) is the authoritative indicator.seriesAndClassesContractsInformation empty: the field is structurally present but always empty for 424B2 records. It is populated for registered investment-company forms, not shelf takedowns, and should be treated as a non-signal on this form.<img> tags resolve to broken links locally because the dataset excludes graphic binaries. Narrative and table content is fully preserved..txt referenced but not shipped: pipelines that rely on linkToTxt must fetch the flat text from EDGAR if they need it.EX-FILING FEES HTML and the extracted .xml instance encode the same facts. Either one can be parsed for programmatic access to registration file number, submission type, final-prospectus flag, and aggregate offering price, but the XML is the straighter path for tooling that already speaks XBRL.0001213900), Toppan Merrill (0001839882, 0001104659), Donnelley (0000950103), and Workiva-based filers (0001918704) each impose distinct filename conventions on the primary document and exhibits, so filename patterns vary across records even when the issuer is the same.The filer is the Securities Act registrant issuing securities off an already-effective shelf registration statement, typically Form S-3 for domestic issuers or Form F-3 for eligible foreign private issuers. The shelf itself is authorized under Rule 415, which permits delayed or continuous offerings.
The population is dominated by:
Most filers are seasoned S-3/F-3 issuers, and a large share are Well-Known Seasoned Issuers (WKSIs) using automatic shelf registration. WKSI status is what enables pay-as-you-go fee payment under Rule 456(b) and Rule 457(r), allowing the registration fee to be calculated and paid at the time of each takedown rather than upfront. This is the structural reason WKSI programs can generate many 424B2 filings per day.
Underwriters, dealers, agents, trustees, calculation agents, and paying agents are disclosed inside the prospectus supplement but are not filers of the record.
A 424B2 is transactional, not periodic. It is triggered when an issuer prices a specific takedown off a pre-existing effective shelf and needs to file a prospectus supplement disclosing information that had been omitted from the base prospectus in reliance on Rule 430B (or, for post-effective pricing, Rule 430A). Typical triggers:
Filing deadline. Rule 424(b)(2) requires the prospectus to be filed no later than the second business day after the earlier of (i) the date it is first used or (ii) the date of determination of the public offering price. In practice, MTN and structured note desks file on trade date or the next business day. Rule 424(b)(8) provides a limited cure path for certain inadvertent failures to file timely, but issuers treat the two-business-day window as firm because a missed filing can raise Section 5 concerns.
For WKSI automatic shelves, each takedown typically also carries a fee table reflecting the Rules 456(b)/457(r) pay-as-you-go payment for that tranche.
There is no annual or quarterly cadence. Volumes track program activity and market conditions: dense clusters during active issuance windows, dormant stretches when the shelf is not being drawn.
Rule 424(b) has several subsections, and the dataset boundary is defined by the subsection the filer selects in the EDGAR submission type, not by the economic nature of the security:
The most frequent confusion is between (b)(2) and (b)(5). Both are shelf supplements, but convention aligns (b)(2) with debt/structured/MTN programs and (b)(5) with equity takedowns. Because the split is filer-elected, some economically similar offerings land in different subsections across issuers.
Selling-shareholder resale updates, non-shelf IPO pricing, and issuers ineligible for S-3/F-3 (for example, smaller reporting issuers limited to Form S-1) fall outside this dataset, as do any pre-EDGAR paper 424(b)(2) prospectuses from the 1980s and very early 1990s. Electronic coverage begins in January 1994, and volumes step up materially after the 2005 Securities Offering Reform introduced WKSI status, automatic shelf registration, and pay-as-you-go fees.
Form 424B2 sits inside the Rule 424(b) family of final prospectus filings under the Securities Act of 1933. The most useful comparisons are the other 424(b) variants (especially 424B5), the free writing prospectus (FWP) regime that runs alongside shelf takedowns, the S-3/F-3 base prospectuses that 424B2 supplements reference, and the EX-FILING FEES exhibits introduced under Rule 457.
424B5 is the most easily confused with 424B2. Both are final prospectus supplements off a shelf registration, and both can carry pricing. The practical split is:
Rule of thumb: high-frequency structured and MTN issuance lands in 424B2; one-off capital-raise takedowns land in 424B5. Tracking bank-issued structured products requires 424B2; tracking investment-grade new-issue debt or follow-on equity requires 424B5.
FWPs are Rule 433 written offers distributed outside the statutory prospectus. In structured-note programs they carry preliminary term sheets, indicative pricing, hypothetical payoff scenarios, and marketing material filed before or alongside the 424B2. The distinction is timing and legal status:
FWP and 424B2 cover the same deal at different stages; neither substitutes for the other. A complete deal timeline for a structured product usually requires both.
S-3 and F-3 are the shelf registration statements that establish the program and the base prospectus; 424B2 supplements must be read together with that base. The relationship is hierarchical, not parallel:
Shelf registration tells you what could be issued; 424B2 tells you what was issued.
Since the 2022 amendments, filing-fee information travels as a separate inline-XBRL EX-FILING FEES exhibit rather than inside the prospectus. The exhibit is a structured, tabular companion to the 424B2:
Use EX-FILING FEES for a quick, structured census of takedown size; use 424B2 for the actual contractual and disclosure content. They are complements with opposite structural properties (tabular XBRL vs. long-form HTML).
Choose the 424B2 dataset when the research question concerns:
Reach elsewhere when the question is about pre-pricing marketing (FWP), shelf capacity and program design (S-3/F-3), IPO or non-shelf pricing (424B1/B4), discrete follow-on equity or straight debt takedowns with substantive new disclosure (424B5), or purely quantitative fee and deal-size accounting (EX-FILING FEES).
The 424B2 Files Dataset is the corpus of final Rule 424(b)(2) prospectus supplements — narrower than a combined 424(b) collection, more complete than a structured-terms extract, downstream of S-3/F-3, parallel to FWP marketing material, and complementary to the EX-FILING FEES tabular layer. Its defining trait is that it captures the legally operative pricing supplements for high-frequency shelf takedowns, with a filer population dominated by bank and broker-dealer holding companies running structured-product and MTN programs.
Form 424B2 supplements carry the final, priced terms of shelf takedowns: structured notes, medium-term notes, preferred stock, covered bonds, and similar bank-issued securities. The users below each read different fields out of the same record to drive a specific workflow.
Analysts covering autocallables, reverse convertibles, buffered/leveraged notes, and market-linked CDs treat 424B2 as the contractual source of record. They pull the underlying reference asset, coupon or contingent-coupon formula, barrier and buffer levels, autocall observation dates, initial and final valuation dates, participation rate, cap, and principal-repayment formula, then join to CUSIP, ISIN, aggregate principal, and public offering price. Output: issuance databases used for payoff replication, secondary-mark pricing, and issuer league tables.
Rates and credit strategists track MTN programs, senior/subordinated debt, callable step-ups, and floating-rate notes from bank and finance-company issuers. They focus on coupon structure, maturity, call/put schedules, benchmark spread, underwriting discount, and net proceeds. Workflow: funding-cost surveillance, new-issue concession analysis, per-issuer curve construction, and wholesale-funding-mix monitoring.
Hedging desks use 424B2 to size and price the embedded option package they will warehouse or lay off. Relevant fields: reference asset, notional, strike, barrier terms, observation schedule, settlement mechanics. Secondary market makers use CUSIP, aggregate principal, and initial pricing-supplement terms to mark positions and quote on illiquid notes. Structurers also reverse-engineer competitor payoff profiles to infer implied vol, correlation, and dividend assumptions in rival books.
Internal control teams reconcile takedowns against the consolidated EDGAR record. They match offering terms, pricing-supplement references, underwriter syndicate, and the EX-FILING FEES exhibit to internal deal tickets and reg-fee accounting. New-issue review verifies Rule 424(b)(2) timeliness, consistent base-prospectus references, and current risk-factor language per product family.
Product review committees at broker-dealers and wealth platforms that sell third-party notes use 424B2 as the pre-approval source. They read the estimated initial value versus public offering price, underwriting discount, dealer concession, plan-of-distribution fees, scenario analyses, and structure-specific risk factors. Output: product-approval memos, Reg BI/suitability reviews, single-issuer and single-payoff concentration limits.
Transactional and disclosure counsel use the dataset as a precedent library. They benchmark risk factors, tax and ERISA language, plan of distribution, conflicts-of-interest sections, estimated-value language, and use-of-proceeds wording across peer issuers. Regulatory counsel check the EX-FILING FEES table for fee calculation, carryforward, and aggregate-offering tie-back to the shelf. Workflow: precedent search, comment-letter responses, internal drafting guides.
Credit analysts covering large bank issuers aggregate 424B2 flow to monitor senior unsecured, subordinated, and structured-note issuance as a share of wholesale funding. They focus on tenor distribution, call features, currency, and net proceeds to assess rollover risk, TLAC stack composition, and funding-mix shifts, using EX-FILING FEES totals instead of waiting for investor-day disclosures.
Quants build time series of structured-note and MTN issuance by issuer, structure, underlying, tenor, and size. They parse term-sheet text and fee exhibits for deal counts and aggregate notional. Downstream: factor research on retail demand cycles, cross-sectional studies of estimated-value-to-offer gaps, and issuer funding-behavior models across rate regimes. Coverage back to 1994 supports long-horizon work.
Researchers in household finance, disclosure quality, and bank funding use the archive to study retail-note markups, realized performance versus estimated initial value, risk-factor evolution, and diffusion of new payoff structures. Full historical coverage and all supporting exhibits make replication and longitudinal panels tractable.
Engineering teams train and evaluate extraction models for term-sheet parsing, risk-factor classification, and payoff tagging. Targets include normalizing security descriptions into structured schemas (underlying, barrier type, coupon type, autocall logic), extracting CUSIPs and pricing-supplement cross-references, linking each 424B2 to its base prospectus, and building RAG systems that answer per-note questions from source documents. HTML, TXT, JSON, and PDF formats support cross-parser testing; the EX-FILING FEES exhibit is a clean table-extraction target.
Surveillance groups at exchanges, SROs, and regulators monitor volume and composition of retail-targeted structured products, flag rapidly expanding programs, and scan disclosure patterns. Fields of interest: issuer identity, underwriter and selling group, aggregate offering amount, estimated-initial-value disclosures, and risk-factor language on conflicts, liquidity, and secondary-market pricing.
The workflows below are driven by the fields and documents that actually appear in a 424B2 accession: cover-page pricing terms, CUSIP/ISIN, payoff mechanics, risk factors, the EX-FILING FEES inline-XBRL exhibit, and the co-filer entities block.
Parse each pricing supplement to extract CUSIP, ISIN, aggregate principal, public offering price, underwriting discount, estimated value on the pricing date, reference asset, barrier/buffer/trigger levels, contingent-coupon rate, autocall observation schedule, and stated maturity. Join on the co-filer pair in entities (finance-subsidiary issuer and parent guarantor, e.g. JPMorgan Chase Financial Company LLC / JPMorgan Chase & Co.) to attribute each note to a program. Output: a queryable issuance table used for payoff replication, secondary-mark pricing, and issuer league tables by payoff type and underlying.
Pull ffd:AggregateOfferingPrice and related ix:nonFraction values from every EX-FILING FEES exhibit for a given issuer CIK and bucket by filedAt month. Cross-reference with the primary supplement to split structured notes from MTNs and senior unsecured debt. Output: near-real-time wholesale-funding-mix dashboards for credit and TLAC analysis, months ahead of investor-day disclosures.
Extract the "estimated value on the pricing date" and the public offering price from the cover page of each supplement, then compute the gap in basis points by issuer, payoff family, tenor, and underlying. Track drift over time within a single program (for example Morgan Stanley Finance LLC autocallables on SPX) and cross-sectionally across issuers. Output: inputs for retail-note markup studies, Reg BI product-approval memos, and concentration-limit calibrations at wealth platforms.
Index the risk-factor, conflicts-of-interest, estimated-value, and supplemental U.S. federal income tax sections of every pricing supplement. Query by issuer, underlying type (single stock, ETF, basket, worst-of, FX, commodity), or payoff structure to surface peer language. Output: drafting precedent libraries for capital-markets counsel, comment-letter responses, and firm-wide disclosure consistency checks across product families.
For each deal identifier (CUSIP or internal deal ticket), align the pre-pricing FWP term sheet against the final 424B2 on reference-asset level, barrier, contingent coupon, participation rate, and aggregate principal. Flag material movement between indicative and executed terms. Output: a new-issue governance workflow at issuer banks that verifies Rule 424(b)(2) timeliness, pricing consistency, and accurate base-prospectus / product-supplement cross-references.
Use the SGML-wrapped HTML body as input to term-sheet parsers that emit a normalized schema (issuer, guarantor, CUSIP, underlying, barrier type, coupon type, autocall logic, maturity). Use the paired EX-FILING FEES inline-XBRL and extracted .xml instance as a labeled gold standard for table extraction and XBRL tag recovery. Output: evaluation suites for RAG systems answering per-note questions, plus training data for risk-factor classification and payoff tagging models.
Leverage coverage back to 1994 across the ASCII, HTML, and inline-XBRL eras to panel the diffusion of new payoff structures (digital, buffered jump, contingent coupon, worst-of-basket), the post-2008 emergence of estimated-initial-value disclosures, and the post-2022 Filing Fee Exhibit mandate. Output: academic papers on disclosure quality, retail-note performance, and issuer funding behavior across rate regimes.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-424b2-files.json
This endpoint returns the dataset metadata, including name, description, last updated timestamp, earliest sample date (1994-01-01), total records and total size, covered form types (424B2), container format (ZIP), and contained file types (TXT, JSON, HTML, PDF). The response also lists every available container file with its size, record count, updated timestamp, and download URL, along with the download URL for the entire dataset. Poll this endpoint to detect which containers changed in the most recent refresh run and fetch only the updated containers day by day. No API key is required to call this endpoint.
Example response:
1
{
2
"datasetId": "1f1333bd-dbdd-6a52-aac2-8a24e6530acb",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-424b2-files.zip",
4
"name": "Form 424B2 Files Dataset",
5
"updatedAt": "2026-04-22T02:57:55.814Z",
6
"earliestSampleDate": "1994-01-01",
7
"totalRecords": 880721,
8
"totalSize": 31840793695,
9
"formTypes": ["424B2"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-424b2-files/2026/2026-04.zip",
15
"key": "2026/2026-04.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-04-22T02:57:55.814Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-424b2-files.zip?token=YOUR_API_KEY
Downloads the complete Form 424B2 archive as a single ZIP file covering all filings from January 1994 to the latest refresh. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-424b2-files/2026/2026-04.zip?token=YOUR_API_KEY
Downloads one monthly container ZIP instead of the full dataset, useful for incremental syncs or targeted time ranges. Replace the year and month segments with any container key listed in the index JSON. This endpoint requires an API key.
The dataset covers Form 424B2, a final prospectus filed pursuant to Rule 424(b)(2) under the Securities Act of 1933. In practice it is used as a pricing supplement for individual takedowns off an already-effective Form S-3 or F-3 shelf, typically for structured notes, medium-term notes, and similar programmatic debt issued by large financial institutions.
One record is one EDGAR accession — a single 424B2 submission accepted by the SEC at a specific timestamp and uniquely identified by its 18-digit accession number. Each record is a folder containing a metadata.json header, the primary HTML pricing supplement, the inline-XBRL EX-FILING FEES exhibit when present, the extracted XBRL instance XML, and any PDF rendering the filer included in the same accession.
The filer is the Securities Act registrant issuing securities off an already-effective S-3 or F-3 shelf. The population is dominated by large U.S. bank holding companies, broker-dealer holding companies, foreign banks with U.S.-registered programs, and their finance-subsidiary issuers, who typically co-file on the same accession (finance-subsidiary issuer plus parent guarantor). A large share of filers are Well-Known Seasoned Issuers using automatic shelf registration and pay-as-you-go fees under Rules 456(b)/457(r).
Rule 424(b)(2) requires the prospectus to be filed no later than the second business day after the earlier of (i) the date it is first used or (ii) the date of determination of the public offering price. The trigger is transactional — pricing a specific takedown — rather than periodic, so there is no annual or quarterly cadence; volumes track program activity and market conditions.
Electronic coverage begins January 1, 1994 and extends through the most recent monthly refresh. The archive spans three distinct EDGAR format eras: ASCII/plain-text (1994 through roughly 2001–2002), HTML (dominant from the mid-2000s), and the inline-XBRL era for filing fees introduced by the 2022 Filing Fee Exhibit mandate.
Both are final prospectus supplements off a shelf registration, but 424B2 is used when the supplement discloses information omitted from the base prospectus in reliance on Rule 430B — overwhelmingly for high-frequency structured-note, MTN, and serial debt programs. 424B5 is used for discrete, one-off takedowns such as follow-on equity, senior unsecured note deals, and convertibles, where the supplement itself carries substantive deal-specific disclosure. Tracking bank-issued structured products requires 424B2; tracking investment-grade new-issue debt or follow-on equity requires 424B5.
The file extensions actually present on disk are .json (the parsed EDGAR header), .htm (both the primary pricing supplement and the inline-XBRL filing-fee exhibit), and .pdf (a conventional binary PDF included by a subset of filers such as HSBC). Graphic files referenced inside the HTML and the EDGAR complete-submission .txt file are referenced in metadata.json but deliberately not shipped inside the ZIP container.