The Form S-6 Files Dataset is a complete electronic corpus of Form S-6 and Form S-6/A registration statements filed on EDGAR by unit investment trusts (UITs) since January 1994. Each record corresponds to a single EDGAR submission identified by its 18-digit accession number and bundles the trust's full prospectus — cover page, trust organization, investment objectives, deposited-securities schedule, fee and expense tables, distribution arrangements, risk factors, tax-matters disclosure, and certified statement of financial condition — together with a structured metadata.json sidecar describing the EDGAR submission envelope, filer entity, document manifest, and canonical source URLs. Form S-6 is the Securities Act of 1933 registration statement used to register the units offered by UITs that are themselves registered as investment companies under the Investment Company Act of 1940 on Form N-8B-2; Form S-6/A covers pre-effective and post-effective amendments. Records are delivered as monthly ZIP containers and are sourced from sponsored UIT programs that file equity, fixed-income, and hybrid trust series — but not from variable annuity (Form N-4) or variable life (Form N-6) separate accounts, which file under different forms.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The dataset captures every Form S-6 and Form S-6/A submission accepted by EDGAR from January 1994 to the present. Form S-6 is the registration statement under the Securities Act of 1933 used by unit investment trusts to register the units they offer to the public. UITs are a distinct category of registered investment company defined under the Investment Company Act of 1940; they hold a fixed, generally unmanaged portfolio of securities and issue redeemable units representing undivided interests in that portfolio. Because a UIT has no board of directors or investment adviser actively rebalancing the portfolio, its disclosure regime is purpose-built: the trust must first register itself as an investment company on Form N-8B-2 (the 1940-Act registration statement of unit investment trusts), and then register the units it sells to the public under the 1933 Act on Form S-6.
Form S-6 is structurally a "wrapper" form — it incorporates by reference much of the underlying N-8B-2 registration and supplements it with the prospectus delivered to investors, fee tables, current portfolio composition, distribution arrangements, and a certified statement of financial condition of the trust. Form S-6/A is the same instrument used to file amendments, including pre-effective amendments that add or update content before the registration becomes effective and post-effective amendments that update the prospectus for a continuously offered or rolling series of trusts.
Records are grouped into monthly ZIP containers keyed by filing month under a <year>/<year>-<month>.zip path scheme. Each container holds individual accession-numbered folders, and each folder contains a metadata.json plus the registration-statement document. The dataset distribution format set includes TXT, JSON, HTML, and PDF: JSON is the metadata sidecar; HTML is the dominant body format for modern records; TXT covers the earliest 1990s filings whose prospectus body is plain ASCII inside the SGML envelope; and PDF rows are permitted by EDGAR but rarely used by UIT filers for the primary S-6 document.
One record in the Form S-6 Files Dataset corresponds to a single Form S-6 or Form S-6/A submission accepted by EDGAR, identified end-to-end by its 18-digit SEC accession number. A record is materialized on disk as a folder named after that accession number (the dashed accessionNo with the hyphens removed) and contains exactly two artifacts: a metadata.json describing the EDGAR submission envelope, and a single .htm registration statement that carries the full prospectus text. The dataset unit is therefore the filing-as-package — one accession number, one prospectus document, one structured metadata sidecar — scoped to the form types S-6 (initial registration statements) and S-6/A (pre-effective and post-effective amendments). The unit of granularity is always the individual accession-numbered folder inside the monthly container.
Each accession-numbered record folder contains two layers of content:
metadata.json — a flat JSON object capturing EDGAR's submission-level metadata: form type, accession number, acceptance timestamp, filer entities, document manifest, and a set of canonical EDGAR URLs pointing back at the source filing..htm file wrapped in EDGAR's SGML document envelope, with the full prospectus body serialized as inline HTML inside the <TEXT> block.Image attachments that the filer originally bundled with the submission (logos, signature graphics, fund-family marks, counsel and trustee marks such as cac.jpg, chapman.jpg, morganlewis.jpg) are intentionally excluded from the dataset on disk, although their identities remain visible in the documentFormatFiles[] manifest carried by metadata.json. There are no separate exhibit files broken out in the per-record folder; Form S-6 is in practice a single-document submission, and the dataset reflects that.
metadata.json is the structured anchor of every record. Its top-level fields fall into four conceptual groups.
Submission identity and timing
formType — the literal "S-6" or "S-6/A".accessionNo — the SEC accession number in dashed form (e.g. 0001445546-25-006540); the same value with hyphens stripped is the folder name.description — the human-readable form description supplied by EDGAR (e.g. "Form S-6 - Registration statement for unit investment trusts", with [Amend] appended for S-6/A).filedAt — an ISO-8601 timestamp with timezone offset (e.g. 2025-10-01T12:17:13-04:00) capturing the EDGAR acceptance time.id — an opaque 32-character content hash used as a stable internal record identifier.Canonical EDGAR URLs back to the live submission
linkToFilingDetails — direct URL of the primary registration HTML on sec.gov.linkToTxt — URL of the full SGML/text submission bundle.linkToHtml — URL of the EDGAR filing-index page.linkToXbrl — URL of any XBRL instance; empty for S-6, which is not subject to inline-XBRL financial-statement tagging.Document manifest
documentFormatFiles[] — every primary document item EDGAR exposes for the submission. Each entry carries sequence, size (bytes, encoded as a string), documentUrl, description, and type. type is typically S-6 (or S-6/A) for the registration document itself, GRAPHIC for embedded images, and a blank value for the trailing "complete submission text file" pseudo-document EDGAR appends to every submission. GRAPHIC rows remain in the manifest for completeness even though their bytes are not materialized inside the dataset folder, allowing downstream consumers to reconstruct the original submission's file inventory or fetch the omitted images directly via documentUrl.dataFiles[] — reserved for structured data attachments (XBRL, R-files); empty for S-6.Filer-entity block
entities[] — the array of EDGAR-recognized filers on the submission. Each entity carries companyName (with the role suffix (Filer) appended), cik, fileNo (the 333-prefixed 1933-Act registration file number assigned to the offering), irsNo, stateOfIncorporation, fiscalYearEnd, act (always "33" for S-6, reflecting the 1933 Act registration nature of the form), type (the form type), and filmNo (the EDGAR film number identifying the specific submission). For sponsor-driven UIT families, the entity is typically the trust series itself rather than the sponsor; multi-entity submissions are uncommon for S-6.seriesAndClassesContractsInformation[] — reserved for filings that use EDGAR's series/class identifier system; structurally present but empty for S-6 records.The single .htm file in each record folder is not pure HTML — it is EDGAR's SGML document envelope with HTML serialized inside it. The file opens with a small SGML preamble naming the document, transitions into a standard HTML body, and closes the SGML envelope. The shape is:
1
<DOCUMENT>
2
<TYPE>S-6
3
<SEQUENCE>1
4
<FILENAME>s-6.htm
5
<DESCRIPTION>INITIAL REGISTRATION STATEMENT
6
<TEXT>
7
<HTML>
8
<HEAD>...</HEAD>
9
<BODY>
10
... full prospectus body ...
11
</BODY>
12
</HTML>
13
</TEXT>
14
</DOCUMENT>
The <TYPE> tag echoes the form type; <SEQUENCE> numbers the document inside the submission (1 for the primary S-6 document); <FILENAME> is the on-disk filename, which is not standardized across filers — common patterns include s-6.htm, s6wraps.htm and s6awraps.htm (the a infix marking an amendment), and filer-specific patterns embedding internal series codes such as b1015250s6.htm, adt2315-s6.htm, or g1021250s6.htm. <DESCRIPTION> carries the filer's free-text label for the document (typically "INITIAL REGISTRATION STATEMENT", "REGISTRATION STATEMENT", or amendment-specific phrasing). The substantive content of the filing — the prospectus — is the HTML inside <TEXT>...</TEXT>. The <HEAD> is often minimal (frequently an empty <TITLE>); document-level styling and layout live inline in the body.
Within the inline HTML body, an S-6 prospectus assembles, in approximately this order, the disclosure elements that the SEC and the prospectus-delivery rules require for a UIT.
For Form S-6/A, the same skeleton is reused but the disclosures are updated to reflect the matters being amended — typically updated portfolio data, refreshed fee tables, an updated financial-condition statement at a more recent date, or curative changes prompted by SEC staff comment.
A Form S-6 record cannot be read in isolation from the trust's Form N-8B-2. The S-6 prospectus is structured to incorporates by reference much of the detailed organizational and operational disclosure already on file under the 1940 Act on Form N-8B-2 — for example, the full text of the trust indenture, detailed descriptions of trustee duties, and standing terms of the trust. The S-6 itself focuses on offering-specific content (the present series, current portfolio, current fees, current financial condition). The dataset captures the S-6 / S-6/A submission as filed; it does not bundle the underlying N-8B-2 registration, which lives in a separate EDGAR submission and is not part of this dataset's accession-keyed record set.
Although Form S-6 is in practice a single-document filing, EDGAR's submission model still permits multiple items per accession, and metadata.json faithfully records each one in documentFormatFiles[]. The type field discriminates among them:
type = S-6 (or S-6/A) — the primary registration statement, and the only document materialized on disk.type = GRAPHIC — embedded images such as logos or signature graphics. These appear in the manifest with a documentUrl pointing to the file on sec.gov, but the bytes are deliberately omitted from the per-record folder.type and a single-space sequence — EDGAR's "complete submission text file", the SGML concatenation of every document in the submission. It is exposed by EDGAR but is not a real exhibit and should be filtered when iterating over real documents.Conventional 1933-Act exhibits (legal opinions, consents, indentures) are not typically broken out on S-6 because the underlying instruments live with the N-8B-2; when they do appear, they show up as additional rows in documentFormatFiles[] and are addressable through documentUrl.
Each record contains the structured submission metadata in metadata.json plus the full text of the registration statement in the inline-HTML document. Together these cover the cover page, full prospectus body, fee and expense tables, portfolio schedule, distribution arrangements, risk factors, tax-matters disclosure, certified statement of financial condition, signatures, and any inline tabular content. The metadata also exposes back-pointers to the original EDGAR submission URLs, the document manifest with sizes and types for every item the filer submitted, and the filer-entity record with CIK, file number, IRS number, state of incorporation, fiscal year end, film number, and 1933-Act designation.
The dataset excludes image files (JPG/PNG/GIF graphics that appear as GRAPHIC items in the manifest); they remain listed in documentFormatFiles[] but their bytes are not materialized in the per-record folder. Because Form S-6 incorporates by reference much of Form N-8B-2, substantial portions of the trust's organizational disclosure live outside this dataset in the N-8B-2 submission, which is not part of the Form S-6 Files Dataset. Subsequent amendments are stored as separate accession-numbered records under form type S-6/A rather than being merged into the original S-6 record; reconstructing the latest effective prospectus for a given series requires walking the chain of S-6 and S-6/A accessions for that filer and series. XBRL instance files are not produced for S-6 (the corresponding linkToXbrl is empty and dataFiles[] is empty), so structured financial-statement data is not part of any record. Correspondence, comment-and-response letters, and EDGAR header-only meta items are likewise outside the per-record file set.
The disclosure backbone of Form S-6 has been stable since the modern UIT regime was put in place: it has always been a 1933-Act registration form keyed to a separate 1940-Act registration on Form N-8B-2, and the core prospectus elements — trust organization, investment objective, portfolio schedule, fees, distribution, financial condition, signatures — have been required throughout. Several refinements have shaped the content of more recent records, including the standardized fee-table format and expense-example presentation that now governs UIT prospectuses, sales-charge disclosure conventions reflecting deferred and creation-and-development charges that became prevalent in unit-trust offerings, and increasingly granular risk-factor disclosure tracking the asset classes UITs have moved into (sector equity baskets, closed-end fund baskets, international exposures). For sponsors that file recurring series of similar trusts, the prospectus convention has converged on a "wrap" prospectus that incorporates most boilerplate by reference and updates only series-specific content — reflected in document filenames such as s6wraps.htm and s6awraps.htm. None of these changes altered the form's overall section ordering, but they materially changed the density and structure of the fee and risk sections within the prospectus body.
Form S-6 records span the full EDGAR era from January 1994 onward, and the on-the-wire format of the prospectus document has evolved with EDGAR's general filing technology. The earliest filings are plain ASCII text inside the SGML envelope — a single <DOCUMENT> block with <TYPE>S-6, <TEXT> containing fixed-width text rather than HTML, and tabular content rendered with whitespace-aligned columns. These records correspond to the TXT file type found in the dataset. From the late 1990s and through the 2000s, filers transitioned to HTML inside the SGML envelope, which is the dominant shape of records today: the <TEXT> block contains a complete <HTML>...</HTML> document, prospectus tables are real HTML tables, and typography is preserved. The SGML envelope itself (<DOCUMENT><TYPE>...<SEQUENCE>...<FILENAME>...<DESCRIPTION>...<TEXT>...</TEXT></DOCUMENT>) has remained constant across both eras, so a single parser keyed on the SGML markers can extract the inner document regardless of vintage. PDF attachments are permitted by EDGAR and appear in the dataset's file-type set, but UIT filers rarely use PDF for the primary S-6 document, so PDF rows are uncommon in practice. Inline XBRL and standalone XBRL never applied to Form S-6 — the form is exempt from the inline-XBRL financial-statement tagging regime — so the structured-data slots in metadata.json (linkToXbrl, dataFiles[]) are empty across the entire history of the dataset.
A handful of nuances matter when consuming records:
accessionNo with the hyphens removed; both forms map to the same EDGAR submission, and the dashed form is what EDGAR's URL scheme uses.entities[] block typically names the trust series as the filer rather than the sponsor; joining records into sponsor-level views requires resolving CIKs against EDGAR's filer hierarchy, since sponsors often operate hundreds of separate trust-series CIKs.documentFormatFiles[] includes the EDGAR "complete submission text file" entry with a blank type and a single-space sequence; filter it out when iterating over real documents..htm document is wrapped in SGML, not naked HTML — code that assumes a clean HTML root must either trim the SGML preamble and trailer or parse from the inner <HTML> tag.S-6/A records sit alongside the original S-6 in their own accession folders; linking an amendment to its base filing requires matching on fileNo and entity CIK rather than on accession number.GRAPHIC rows in the manifest whose bytes are deliberately absent from disk; if visual fidelity is required, the corresponding documentUrl on sec.gov can be used to fetch the original.The legal registrant on every Form S-6 is a unit investment trust — a category of investment company defined in Section 4(2) of the Investment Company Act of 1940, distinct from management companies (mutual funds, closed-end funds) and face-amount certificate companies. Because a UIT has no board of directors and no investment adviser, the registration statement is prepared and signed by the entities that organize and distribute the trust:
UITs filing on Form S-6 cover equity UITs (fixed strategy or index baskets), fixed-income UITs (municipal, taxable, and government bond trusts), and certain hybrid trusts. Insurance-product UITs do not file S-6: variable annuity separate accounts use Form N-4 and variable life insurance separate accounts use Form N-6, even though those accounts are themselves UITs registered under the 1940 Act.
Form S-6 is event-driven, not periodic. Filings arise from Section 5 of the Securities Act (no offers without a registration on file; no sales until effective) and from the obligation to keep the prospectus current while units are offered. The principal triggers are:
There is no fixed quarterly or annual calendar tied to fiscal year-end. Cadence is driven by offering activity and prospectus-currency obligations: an initial S-6 must be effective before any sale; pre-effective S-6/A amendments cover the staff-review period; post-effective S-6/A amendments cover annual financial-statement rollovers and any material change during secondary-market maintenance.
Form S-6 sits at the intersection of two statutes and two operative rules:
The dataset begins in January 1994, reflecting the phase-in of mandatory EDGAR electronic filing for UITs in the early-to-mid 1990s. Pre-EDGAR S-6 filings exist on paper in SEC archives but are not part of this electronic dataset.
Form S-6 sits at a narrow intersection: a Securities Act of 1933 registration statement reserved for unit investment trusts that have already registered as investment companies under the Investment Company Act of 1940 on Form N-8B-2. Its closest neighbors are therefore (a) the 1940 Act registration it leans on, (b) other 1933 Act registration forms, (c) registration forms used by other investment-company structures, and (d) the prospectus and amendment filings that follow an S-6 through its post-effective life.
The Form S-6 Files Dataset covers 1933 Act registration statements (and pre-effective amendments) for fixed-portfolio retail unit investment trusts. It is not interchangeable with any neighbor: N-8B-2 registers the trust entity (1940 Act side); S-1 serves the same statute for non-investment-company issuers; N-1A and N-2 cover managed open- and closed-end funds; N-4 and N-6 cover insurance-wrapped UITs; 24F-2 carries fee accounting only; 497/497K carry post-effective prospectus delivery; and 485APOS/485BPOS belong to the evergreen amendment regime that S-6 filers do not use. Reconstructing a UIT's full lifecycle requires pairing S-6 with N-8B-2 (formation), 497/497K (post-effective prospectuses), and 24F-2 (sales and fees); within its own slot, S-6 is the authoritative source for the prospectus-level offering record of retail unit investment trusts.
Different professions anchor to different fields of the record — the fee table, the deposited-securities schedule, the investment-objective statement, the certified statement of financial condition, the exhibit list, and the structured metadata.json keys (accession number, CIK, form type, filing date, document inventory).
Product strategists at sponsor and depositor firms track competitor S-6 and S-6/A activity to time their own series launches. They pull metadata.json filer/CIK/filing-date fields into daily monitoring pipelines, then read the prospectus investment-objective section, deposited-securities schedule (CUSIPs, tickers, weights, units), and fees-and-expenses table (initial sales charge, deferred sales charge, creation and development fee, deferred sales charges, organization costs, supervisory and bookkeeping fees) to benchmark basket themes, fee levels, and rollout cadence. Output: launch dashboards, fee-benchmark studies, and structuring inputs for the next-series prospectus.
Analysts covering UITs, model portfolios, and basket strategies extract the deposited-securities schedule, stated selection methodology, sector/country breakdowns, and the comparison-of-public-offering-price disclosures. They reverse-engineer rules-based screens, measure overlap across competing trusts, and back-test strategy performance. Output: peer-comparison notes and quantitative thematic research.
In-house and outside counsel use prior S-6 filings as drafting precedent. They read risk-factor language, RIC and grantor-trust tax sections, the certified statement of financial condition, and the exhibit list (trust indenture, distribution agreement, legal opinions, evaluator's consents, accountant consents) to benchmark disclosure for new series and novel structures (defined-outcome, buffered, actively curated). The metadata.json form type (S-6 vs. S-6/A) and accession number drive precedent searches and amendment-history reconstruction.
Sponsor compliance and product-governance teams managing rollover into successor series review fee-table evolution, sales-charge disclosure, the rollover-privilege description, and the trust agreement exhibit across the family of prior series. CIK plus accession number lets them assemble the full historical chain for disclosure-consistency review.
Product-review committees at registered investment advisors and broker-dealers approving UITs for the platform shelf compare the fee table, investment objective, portfolio construction methodology, deposit date, mandatory termination date, and secondary-market/redemption mechanics. Output: product-approval memos and fee-transparency reviews under fiduciary and Reg BI workflows.
Researchers studying packaged-product economics use the 1994-forward corpus as a panel. They parse fee tables and deposited-securities schedules and join them to metadata.json filing dates to build longitudinal series on fee dispersion, sponsor concentration, and basket-theme cycles.
Engineers ingest metadata.json (accession, CIK, form type, filing date, document inventory) to drive pipelines, then parse prospectus HTML/TXT to extract fee tables, deposited-securities schedules, and termination dates into internal UIT databases and coverage feeds.
Teams building financial retrieval and extraction systems use S-6 prospectuses as a structurally repeatable corpus (objectives, portfolio, fees, distribution, financial condition) for fine-tuning table extractors, benchmarking QA over packaged-product disclosure, and training section classifiers.
Reporters covering the UIT industry track filer name, filing date, prospectus front matter, investment-objective summary, and fee table to identify launch trends (thematic, ESG, defined-outcome, target-term) and unusual fee structures. Output: launch columns, annual industry round-ups, fee investigations.
Experts in UIT sales-practice, suitability, and trustee disputes pull the original S-6 or S-6/A by accession number to establish disclosure-as-of-date. They cite risk factors, fee disclosures, portfolio composition, and trustee/evaluator identities exactly as filed; preservation of every amendment and exhibit is essential evidentiary support.
Concrete workflows the Form S-6 Files Dataset supports, anchored to specific record components.
A product strategist at a UIT sponsor compares competing series-launch economics. The team filters metadata.json for formType == "S-6" filed in the last 12 months, joins on entities[].cik to roll trust-series CIKs up to sponsor families, then parses the fees-and-expenses table from the prospectus HTML to extract initial sales charge, deferred sales charge, and creation-and-development fee per series. Output is a sponsor-by-sponsor fee distribution used to set the fee schedule on the next-series S-6.
A buy-side analyst covering thematic UITs reconstructs basket overlap across competing trusts. From each S-6 prospectus body the workflow extracts the schedule of deposited securities (issuer, CUSIP, ticker, share count, deposit weight) and joins it to the trust name, sponsor CIK, and filedAt from metadata.json. The result is a longitudinal holdings panel used to compute pairwise basket overlap, identify crowded names, and back-test rules-based selection methodologies disclosed in the prospectus.
A testifying expert in a UIT suitability dispute needs the prospectus exactly as delivered on the deposit date. The workflow pulls every record matching the trust's entities[].cik and entities[].fileNo (the 333-prefixed registration number), orders the resulting S-6 and S-6/A accessions by filedAt, and uses each accession's HTML registration statement to cite risk factors, fee tables, and trustee/evaluator identities as of the operative date. Output is an exhibit set keyed to accession numbers for evidentiary submission.
Disclosure counsel preparing a defined-outcome or buffered UIT searches prior S-6 prospectuses for analogous risk-factor and tax-treatment language. The workflow runs full-text retrieval over the inline-HTML <TEXT> body across the corpus, filters by form type and filing era, and returns precedent paragraphs (grantor-trust vs. RIC tax sections, structured-payoff risk disclosures, sponsor-default risk language) for adaptation into the new prospectus.
An LLM team building a retrieval system over UIT disclosure indexes the prospectus HTML body section by section (cover page, trust organization, investment objective, portfolio composition, fees, distribution, risk factors, tax matters, statement of financial condition). Section anchors plus accessionNo, cik, and filedAt from metadata.json form the chunk metadata. The output supports grounded QA on questions such as "what is the C and D fee on this series" or "when does this trust mandatorily terminate" with citations back to the EDGAR source URL in linkToFilingDetails.
A broker-dealer product-review committee evaluating a UIT for the platform pulls the latest S-6 (or terminating S-6/A) by sponsor CIK, then extracts the fee table, investment-objective statement, deposit date, mandatory termination date, rollover-privilege description, and secondary-market mechanics from the prospectus. These fields populate a standardized approval memo used in Reg BI and fiduciary documentation; rejected items feed back into sponsor negotiations on fee breakpoints and dealer concessions.
The dataset is accessible through three endpoints: a public index JSON for metadata discovery, a full archive download, and per-container monthly downloads. Filings span from 1994-01-01 to present and cover form types S-6 and S-6/A, packaged as monthly ZIP containers.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-s6-files.json
This endpoint returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total records, total size, form types, container format, and file types), the full dataset download URL, and the list of all monthly container files. Each container entry includes its key (e.g. 2025/2025-10.zip), size in bytes, record count, updated timestamp, and a direct downloadUrl. Polling this endpoint allows you to detect which containers were refreshed in the latest run and selectively download only those that changed. This endpoint does not require an API key.
Example response:
1
{
2
"datasetId": "1f13365b-9ae0-6906-9c73-1fbd5eb2543d",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-s6-files.zip",
4
"name": "Form S-6 Files Dataset",
5
"updatedAt": "2026-05-08T02:54:09.628Z",
6
"earliestSampleDate": "1994-01-01",
7
"totalRecords": 56491,
8
"totalSize": 984762474,
9
"formTypes": ["S-6", "S-6/A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-s6-files/2025/2025-10.zip",
15
"key": "2025/2025-10.zip",
16
"size": 13818783,
17
"records": 154,
18
"updatedAt": "2026-05-08T02:54:09.628Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-s6-files.zip?token=YOUR_API_KEY
Downloads the complete dataset as a single ZIP archive containing every monthly container. This endpoint requires an API key passed as the token query parameter.
Download Single Container: https://api.sec-api.io/datasets/form-s6-files/2025/2025-10.zip?token=YOUR_API_KEY
Downloads one monthly container ZIP. The path after the dataset slug corresponds to the key field returned in the index JSON (e.g. 2025/2025-10.zip), so you can construct any container URL directly from the index response. This endpoint requires an API key passed as the token query parameter.
The dataset covers Form S-6 (initial 1933 Act registration statements for unit investment trusts) and Form S-6/A (pre-effective and post-effective amendments to those registration statements). Both form types are mixed together in the dataset and distinguishable via the formType field in each record's metadata.json.
One record corresponds to a single Form S-6 or Form S-6/A submission accepted by EDGAR, identified by its 18-digit accession number. On disk a record is a folder named after the accession number (with hyphens removed) containing two artifacts: a metadata.json describing the EDGAR submission envelope and a single .htm file holding the SGML-wrapped registration statement and prospectus.
Unit investment trusts that register units for public sale under the Securities Act of 1933 file Form S-6, with the registration prepared and signed by the sponsor / depositor on the trust's behalf. Sponsors are typically broker-dealers or asset managers running sequential UIT programs in equity, fixed-income, or hybrid asset classes. Insurance-product UITs (variable annuity and variable life separate accounts) file Form N-4 or Form N-6 instead and are not present in this dataset.
The dataset begins on January 1, 1994 — reflecting the phase-in of mandatory EDGAR electronic filing for UITs in the early-to-mid 1990s — and continues to the present. Pre-EDGAR S-6 filings exist on paper in SEC archives but are not part of this electronic dataset.
The dataset is delivered as monthly ZIP containers under a <year>/<year>-<month>.zip path scheme, each holding the accession-numbered record folders for that filing month. File types inside the records include JSON (the metadata sidecar), HTML (the dominant prospectus body format for modern filings), TXT (plain ASCII bodies in the earliest 1990s filings), and occasional PDF attachments.
Form N-1A registers managed open-end mutual funds and most ETFs, Form N-2 registers managed closed-end funds, and Forms N-4 and N-6 register insurance-wrapped UITs (variable annuity and variable life separate accounts respectively). Form S-6 is reserved for fixed-portfolio, unmanaged retail UITs whose units are registered under the 1933 Act on top of a 1940 Act registration on Form N-8B-2. Substantively, S-6 carries the deposited-securities schedule, sponsor and trustee disclosure, and certified statement of financial condition that those neighbor forms do not.
Image attachments (logos, signature graphics, fund-family marks, counsel and trustee marks) are intentionally excluded from the dataset on disk to keep records focused on the substantive registration content. Their identities remain visible in the documentFormatFiles[] manifest inside metadata.json with a type of GRAPHIC and a documentUrl pointing to the original file on sec.gov, so consumers needing visual fidelity can fetch them directly from EDGAR.