The Form 1-SA Files Dataset is a full-population corpus of Regulation A semiannual reports filed on EDGAR by Tier 2 issuers under Rule 257(b)(3), packaged as the original EDGAR submission with image files removed. Each record is one EDGAR accession of Form 1-SA or Form 1-SA/A — a single semiannual report or its amendment — comprising a structured metadata.json plus every issuer-authored document from the filing in its native form (HTML, PDF, or plain text). The filer is always the Tier 2 Regulation A issuer itself: a domestic operating company in the United States or Canada that has qualified an offering statement on Form 1-A and remains subject to Regulation A ongoing reporting. Form 1-SA was created by the SEC's March 2015 Regulation A+ rulemaking (Release No. 33-9741, effective June 19, 2015), so the dataset's earliest sample date is in 2016, when the first Tier 2 issuers reached their initial 90-day semiannual deadline. The dataset is distributed as monthly ZIP containers that group accession folders under a year and month-bucketed subdirectory.
Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.
Dataset Index JSON API
Download the entire dataset as a single archive file.
Download Entire Dataset:
Download a single container file (e.g. monthly archive) from the dataset.
Download Single Container:
The Form 1-SA Files Dataset captures the mid-year disclosure layer of Regulation A's continuous reporting regime. Form 1-SA is the semiannual report mandated by Rule 257; Tier 2 issuers must file it within 90 calendar days after the end of the first six months of their fiscal year. The form sits between the more comprehensive Form 1-K annual report and the event-driven Form 1-U current report, occupying the middle layer of Regulation A's continuous disclosure cadence. Form 1-SA/A is the amendment variant, used to correct, restate, or supplement a previously submitted Form 1-SA.
The form is comparatively short. Its prescribed contents are dominated by interim, unaudited financial statements covering the first six months of the fiscal year (with prior-year comparatives), management's discussion and analysis of financial condition and results of operations for that period, and any matters that would otherwise have been reportable on Form 1-U during the period that the issuer elects (or is required) to disclose here. The disclosure obligation flows from Securities Act Section 3(b)(2) and the Tier 2 conditional reporting framework adopted in the 2015 "Regulation A+" rulemaking. Form 1-SA filings have therefore existed in EDGAR only from 2016 onward, once the earliest cohort of post-amendment Tier 2 issuers reached their first semiannual reporting milestone.
The dataset covers the entire Tier 2 reporting population: every Form 1-SA and Form 1-SA/A submission accepted by EDGAR. The file types found in the dataset are HTML, JSON, PDF, and TXT; image files referenced from the HTML payload are deliberately excluded. Containers are distributed in ZIP format, with monthly buckets keyed by acceptance date.
A single record in the Form 1-SA Files Dataset is one complete EDGAR submission of Form 1-SA or Form 1-SA/A: one semiannual report, or amendment to a semiannual report, filed by a Tier 2 Regulation A issuer under Rule 257(b)(3). The record unit is the accession; every distinct EDGAR accession number maps to exactly one record. In the dataset each record materializes as a folder named after the 18-digit accession number (no dashes), holding a single metadata.json plus every document that was part of the original EDGAR submission, with image files removed. Monthly ZIP containers group accession folders under a year and a month-bucketed subdirectory, but the logical record remains the accession folder.
Each accession folder contains two structural layers:
metadata.json file capturing the EDGAR submission header and document index in structured JSON form.<DOCUMENT> envelope; auxiliary materials (PDF or plain-text exhibits) are carried in their submitted form.Most 1-SA records resolve to a metadata.json plus a single primary HTML document, occasionally accompanied by supplemental exhibits or text artifacts.
metadata.jsonmetadata.json is the canonical structured representation of the EDGAR header and document index for the filing. Its fields include:
formType — 1-SA or 1-SA/A.accessionNo — the EDGAR accession in dashed form, for example 0001641172-25-017516.filedAt — an ISO-8601 timestamp (with timezone) marking EDGAR's acceptance of the submission.periodOfReport — the end date of the semiannual reporting period, typically YYYY-06-30 or YYYY-12-31 depending on fiscal-year alignment.description — a short EDGAR description string (commonly "Form 1-SA -").linkToFilingDetails, linkToHtml, linkToTxt, linkToXbrl — URLs pointing to the primary document, the EDGAR submission index page, the complete submission .txt master file, and (when present) an XBRL instance. The XBRL link is generally empty for Form 1-SA, since Regulation A semiannual reports are not subject to the structured-data tagging mandate that applies to Exchange Act periodic reports.documentFormatFiles — an array of per-document objects, each describing one filed document with sequence, size (bytes, encoded as a string), documentUrl, description, and type (for example 1-SA, or blank for the bundled complete-submission text artifact).dataFiles — an array reserved for XBRL or other structured data files; generally empty for Form 1-SA.seriesAndClassesContractsInformation — an array for fund-style series/class identifiers; generally empty for Form 1-SA, since Regulation A issuers are typically operating companies rather than registered funds.entities — an array of entities associated with the filing. Each entity object captures companyName (with role suffix such as (Filer)), cik, irsNo, fileNo (the Regulation A file number, of the form 24R-NNNNN), filmNo, the form type repeated, act (typically 33 for Securities Act), sic (SIC code paired with its label, e.g. 6500 Real Estate), stateOfIncorporation (two-letter code), fiscalYearEnd (in MMDD form), and tickers (an array of ticker symbols when the issuer has any).id — an opaque 32-hex-character dataset record identifier.The primary document is the Form 1-SA itself. Filenames are not standardized; common patterns include form1-sa.htm, 1-sa.htm, and issuer-specific variants such as identify_1sa-123124.htm. The file is not pure HTML; it is an EDGAR SGML-wrapped envelope of the form:
1
<DOCUMENT>
2
<TYPE>1-SA
3
<SEQUENCE>1
4
<FILENAME>form1-sa.htm
5
<DESCRIPTION>1-SA
6
<TEXT>
7
<HTML>... full HTML body ...</HTML>
8
</TEXT>
9
</DOCUMENT>
The inner HTML body carries the substantive disclosure. Auxiliary documents, when present, appear as additional files in the accession folder and may include PDF exhibits, plain-text artifacts, or supplemental schedules, each wrapped in the same SGML envelope. Image files referenced from the HTML are stripped from the dataset, so figure or logo references inside the HTML may resolve to missing assets when rendered.
The HTML body of a Form 1-SA filing is short relative to Exchange Act periodic reports, but it follows a recognizable, regulator-prescribed sequence.
Cover page. Identifies the issuer (legal name, jurisdiction of incorporation, IRS Employer Identification Number, principal executive office address, and telephone number), names the form (FORM 1-SA), states the period covered (the close of the first six months of the fiscal year), provides the Commission file number (in 24R- form), and identifies the title of each class of securities issued under the Regulation A offering. Functionally analogous to the cover of a 10-Q, but tailored to the Regulation A reporting framework.
Item 1 — Management's Discussion and Analysis of Financial Condition and Results of Operations. Narrative analysis of the issuer's results for the six-month interim period and material trends, demands, commitments, events, or uncertainties known to management that are reasonably likely to have a material effect on the issuer's financial condition or operating performance. It typically covers liquidity, capital resources, results of operations, and known trends. The MD&A is shorter and less formulaic than the 10-Q analogue, reflecting the lighter Regulation A regime, but it must give a fair presentation of financial condition and results.
Item 2 — Other Information. A vehicle for the issuer to disclose any matter that would otherwise have been reportable on Form 1-U during the semiannual period. Form 1-U covers a defined catalog of current events (fundamental changes in the nature of business; bankruptcy or receivership; material modification to the rights of securityholders; changes in the issuer's certifying accountant; non-reliance on previous financial statements; changes in control; departure of certain officers; and unregistered sales of equity securities), and Item 2 lets an issuer fold any such updates into the semiannual report. In practice many filings have minimal or no Item 2 content.
Item 3 — Financial Statements. The core quantitative content. Form 1-SA requires interim financial statements for the first six months of the fiscal year, ordinarily presented as an unaudited balance sheet as of the end of the semiannual period; unaudited statements of comprehensive income, cash flows, and changes in stockholders' equity for the interim period and the comparable prior-year period; and condensed notes. Regulation A permits these interim statements to be unaudited but requires preparation in accordance with U.S. GAAP (or, where eligible, IFRS) and presentation on a basis consistent with the most recent Form 1-K annual financial statements.
Item 4 — Exhibits. A short exhibit index referencing any exhibits filed with the report, prepared in accordance with the exhibit table in Part III of Form 1-A.
Signatures. Signature blocks executed by the issuer and by specified officers (typically the principal executive officer, principal financial officer, and principal accounting officer or controller), each accompanied by name, title, and date.
For Form 1-SA/A the same structural skeleton applies, but the body is conventionally preceded by an explanatory note describing the nature and scope of the amendment, with the rest of the document re-stated to reflect the corrected or supplemented content.
Each record includes the structured EDGAR header in metadata.json, the SGML-wrapped primary 1-SA HTML document, and any other non-image documents that accompanied the original EDGAR submission (PDF exhibits, plain-text artifacts, the bundled complete-submission text file when present). Identifiers — accession number, CIK, Regulation A file number (24R-), IRS EIN, SIC code, fiscal year end, state of incorporation, and tickers where applicable — are surfaced both in metadata.json and within the cover page of the HTML body.
Image files referenced by the HTML payload are deliberately excluded from the dataset; renderings of issuer logos, charts, signatures-as-images, or graphical exhibits will not resolve from the package. Materials filed under separate accession numbers — for example a subsequent Form 1-U disclosing an event after the semiannual period, or the issuer's Form 1-K annual report — are not part of the same record; they constitute separate records, and in the case of Forms 1-K and 1-U, separate datasets entirely. Audited financial statements, which appear on Form 1-K rather than Form 1-SA, are likewise outside the scope of any individual 1-SA record.
The substantive content requirements of Form 1-SA have been stable since the form's introduction. The Regulation A+ rulemaking that took effect on June 19, 2015 created the form, and the first 1-SA filings began appearing in EDGAR in 2016. Subsequent amendments to Regulation A (including the 2021 increase of the Tier 2 offering ceiling) altered eligibility and offering parameters that govern who must file, but they did not materially change the prescribed contents of Form 1-SA itself: the cover page, MD&A, Form 1-U-style updates, interim financial statements, exhibit index, and signature page have remained the form's structural backbone throughout.
Form 1-SA has been an HTML-era form for its entire existence. Because the form was created in 2015 and only entered live filing in 2016, there is no ASCII-only or pre-HTML legacy in the dataset; all primary documents appear as SGML-wrapped HTML payloads from the earliest records onward. Auxiliary materials are carried as PDF or plain text where issuers chose those formats.
Several practical points matter when reading or extracting from these records:
<DOCUMENT> envelope; consumers that expect bare HTML must strip the leading SGML header lines (<DOCUMENT>, <TYPE>, <SEQUENCE>, <FILENAME>, <DESCRIPTION>, <TEXT>) before parsing the inner HTML.documentFormatFiles[i].type == "1-SA" and the corresponding documentUrl, rather than a fixed filename.periodOfReport is the cleanest semantic marker of which six-month interval the report covers; filedAt reflects EDGAR acceptance time and may sit substantially later in the year, up to the 90-day deadline.formType. An amendment typically restates the full filing rather than supplying just the changed sections, often with a leading explanatory note.entities array — particularly the Regulation A file number (24R-NNNNN) and the CIK — are the most reliable joins to other Regulation A datasets (Form 1-A offering statements, Form 1-K annual reports, Form 1-U current reports).Each record in the Form 1-SA Files Dataset is a single semiannual report (Form 1-SA) or amendment (Form 1-SA/A) filed on EDGAR by a Tier 2 Regulation A issuer. The filer is the issuer itself, never a placement agent, transfer agent, selling securityholder, or other offering participant.
The reporting population is narrow:
The following do not file Form 1-SA:
Form 1-SA is periodic, not event-driven. It is triggered by the passage of time, not by a corporate event:
A Form 1-SA/A is filed as needed to correct, restate, or supplement a previously filed Form 1-SA. Amendments have no independent periodic trigger.
Form 1-SA was created by the SEC's March 2015 amendments to Regulation A (Release No. 33-9741, effective June 19, 2015), implementing the mandate of Section 3(b)(2) of the Securities Act as expanded by Title IV of the JOBS Act of 2012. The 2021 amendments raised the Tier 2 offering ceiling but left the Rule 257(b)(3) semiannual trigger and 90-day deadline unchanged.
Because the form did not exist before mid-2015, the earliest Form 1-SA filings on EDGAR appear in 2016, when the first Tier 2 issuers reached their initial semiannual deadlines. There is no paper-filing history; Regulation A ongoing reports have always been electronic.
Form 1-SA sits inside the Regulation A ongoing-reporting regime for Tier 2 issuers after qualification. The most useful comparisons are the other forms in that regime (1-K, 1-U, 1-Z), the qualification-stage offering documents (1-A and 1-A POS), and the Exchange Act interim form most often confused with it (10-Q).
Form 1-K (Reg A annual report). Same filer population, same regime, but annual rather than interim. 1-K covers the full fiscal year, carries audited financial statements, and includes deeper narrative on business, related-party transactions, executive compensation, and beneficial ownership. It is due 120 days after fiscal year-end. 1-SA covers only the first six months, contains unaudited condensed financials, has a narrower required narrative, and is due within 90 days of the half-year mark. 1-K is the natural annual complement; 1-SA cannot substitute for it.
Form 1-U (Reg A current report). The event-driven analog to 8-K, triggered by fundamental changes, bankruptcy, auditor change, non-reliance on prior financials, change in control, departure of key officers, or unregistered sales. 1-SA is time-driven and periodic, but explicitly required to update any 1-U-reportable information arising during the half-year window. The two are complementary: 1-U captures events as they occur; 1-SA captures the periodic interim financial picture and may aggregate 1-U-eligible items.
Form 1-A and 1-A POS (offering statement and post-qualification amendments). Pre- and offering-period documents containing the offering circular, risk factors, and use of proceeds. 1-SA is strictly post-qualification and contains no offering mechanics. 1-A answers who is raising capital under Reg A and on what terms; 1-SA answers how those issuers perform on a half-year basis afterward. See also Form 1-A POS for post-qualification amendments to offering statements.
Form 1-Z (Reg A exit report). A one-time termination-of-reporting filing used to suspend ongoing obligations or report final Tier 1 sales. 1-SA is recurring during the active reporting period. The two bracket opposite ends of the reporting lifecycle.
Form 10-Q (Exchange Act quarterly report). The closest functional analog outside Reg A: both are unaudited interim reports with MD&A. The differences are substantial. 10-Q is filed three times per year by Exchange Act registrants, follows full Regulation S-X interim requirements, and includes Item 4 controls and procedures certifications. 1-SA is filed once per year, uses lighter Reg A interim financial standards, lacks equivalent SOX-level certifications, and is restricted to Tier 2 Reg A issuers. The two datasets cover almost entirely disjoint filer populations.
Form 1-SA/A (amendments). Restates, corrects, or supplements a prior 1-SA rather than introducing a new period. Researchers tracking restatement behavior should align 1-SA and 1-SA/A pairs by accession lineage rather than treat each as independent.
The Form 1-SA Files Dataset is the only source of mandated semiannual interim disclosure for Tier 2 Regulation A issuers, captured as the original EDGAR submission with all documents except images. It is the interim sibling of 1-K, the periodic counterpart to event-driven 1-U, the post-qualification successor to 1-A, and the predecessor to 1-Z. No adjacent dataset substitutes for 1-SA when the question concerns half-year financial condition or interim MD&A of Reg A Tier 2 issuers; conversely, 1-SA cannot substitute for audited annual results (1-K), event disclosures (1-U), offering terms (1-A), or any Exchange Act issuer population.
Form 1-SA semiannual reports are the mid-year disclosure for Tier 2 Regulation A issuers, a small-cap and pre-IPO segment thinly covered by mainstream data providers. Users work with the interim financial statements, MD&A, embedded 1-U-style updates, and the 1-SA/1-SA/A amendment trail.
Counsel advising Tier 2 issuers and offering platforms benchmark drafting against peer 1-SA filings: going-concern language, related-party transactions, executive comp updates, and the scope of 1-SA/A amendments. Supports drafting checklists, comment-letter response, and Rule 257(b)(3) completeness review.
Compliance staff at issuers and broker-dealer offering platforms monitor timeliness against the 90-day window, completeness of interim financials, and amendment frequency. Outputs include filing calendars, peer-comparison templates, and watchlists for late or repeat-amending issuers.
Used as an early-warning channel between annual 1-K reports. Focus is on revenue recognition shifts, cash trajectory, related-party balances, accrual patterns, and gaps between 1-SA mid-year figures and the subsequent 1-K. Supports restatement detection via 1-SA/A traffic and forensic memos for litigation or internal review.
Analysts sizing private-credit, vendor, or counterparty exposure to small issuers refresh views between annual reports using liquidity disclosures, debt schedules, MD&A covenant commentary, and burn-rate trends. Feeds credit memos and probability-of-default inputs for unrated issuers.
Buy-side diligence on Reg A targets uses 1-SA for six-month revenue, gross margin, customer concentration, and capital-raising activity that would otherwise be unavailable for non-fully-reporting companies. The 1-SA/A trail surfaces prior accounting issues. Supports diligence memos and quality-of-earnings analysis.
Research teams at online offering platforms track post-offering performance of issuers they hosted: use-of-proceeds vs. plan, revenue and cash six months in, and filing delinquency. Outputs include investor dashboards, cohort performance reports, and screening tools for follow-on offerings.
Academic and policy researchers study compliance behavior, restatement frequency, and Management's Discussion and Analysis completeness across the Tier 2 population. Outputs include empirical studies, policy commentary, and rulemaking comment letters.
Analysts tracking issuers likely to transition from Reg A to S-1 registration use 1-SA as the primary mid-year update on revenue trajectory, capitalization, related-party flows, and embedded 1-U-style events (auditor changes, departures, material contracts). Feeds watchlists and IPO-readiness flags.
Auditors serving Reg A clients benchmark interim disclosure practice, footnote depth, going-concern language, and subsequent-events drafting across comparable Tier 2 filers. Supports engagement planning and template QC.
Engineers build extraction pipelines for interim line items, MD&A narratives, and amendment relationships using the per-accession metadata and HTML/PDF/TXT renditions. Outputs include normalized small-issuer financial panels and linkage tables across 1-A, 1-K, 1-SA, 1-U, and 1-Z.
Teams building retrieval and Q&A systems over SEC content use the corpus for fine-tuning summarization, evaluating small-issuer extraction accuracy, and grounding agents that answer questions on Tier 2 issuer performance. MD&A, footnotes, and risk-factor updates are the primary retrieval material.
In aggregate, users cluster around the Reg A Tier 2 ecosystem: counsel and compliance staff who prepare filings, diligence and credit analysts who evaluate issuers between annual reports, forensic and audit specialists who scrutinize interim numbers, platform and academic researchers who study the market in aggregate, and data and ML teams turning the corpus into structured products. The dataset's value is systematic, full-population access to mid-year disclosures from issuers otherwise sparsely covered.
Concrete workflows the Form 1-SA Files Dataset supports for Tier 2 Regulation A research, compliance, and tooling.
Parse Item 3 HTML tables across the full corpus to extract balance-sheet, income-statement, and cash-flow line items keyed by cik, periodOfReport, and fileNo (24R-NNNNN). Joined to the Form 1-K dataset on the same identifiers, this yields a six-month interim panel that fills the gap between audited annual results and feeds credit-scoring models, burn-rate estimates, and small-issuer benchmarking dashboards.
Filter records where formType == "1-SA/A", group by issuer, and align each amendment to its preceding 1-SA accession to build a restatement lineage table. The leading explanatory note in the amended HTML pinpoints what was corrected. Outputs feed forensic-accounting watchlists, audit-quality studies, and counsel checklists for repeat amenders.
Compare filedAt against periodOfReport plus 90 days under Rule 257(b)(3) to flag filings that landed late, then aggregate per issuer to identify chronic delinquency. Compliance teams at offering platforms use the result as an issuer-level watchlist; researchers use it to study population-wide compliance with the Reg A reporting cadence.
Run targeted text extraction on Item 2 ("Other Information") to surface auditor changes, officer departures, control changes, non-reliance statements, and unregistered sales that issuers folded into the semiannual report rather than filing a standalone Form 1-U. The output augments event-driven datasets with disclosures that would otherwise be missed by 1-U-only pipelines.
Strip the SGML envelope, isolate Item 1 MD&A from each primary HTML document, and index the resulting corpus for similarity search. Counsel use it to compare going-concern, liquidity, and related-party language against industry peers (joining via sic); RAG developers use the same index as grounding material for Tier 2 issuer Q&A agents and summarization fine-tuning.
Use the entities array (CIK, 24R- file number, EIN) as the join key to stitch each issuer's Form 1-A, 1-K, 1-SA, 1-U, and 1-Z filings into a chronological lifecycle record. Buy-side diligence teams use the assembled timeline to evaluate Reg A acquisition targets; platform analytics teams use it to track use-of-proceeds versus plan and to score follow-on-offering candidates.
Dataset Index JSON API: https://api.sec-api.io/datasets/form-1sa-files.json
This endpoint returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total records and total size, form types covered, container format, and file types) along with the full dataset download URL and the list of all available container files. Each container entry includes its key, size, record count, updated timestamp, and direct download URL. Use this endpoint to monitor which containers were updated in the most recent refresh run and decide which ones to download incrementally on a daily basis.
This endpoint does not require an API key.
Example response:
1
{
2
"datasetId": "1f13365b-9ae0-6974-afbd-d1e836a441ab",
3
"datasetDownloadUrl": "https://api.sec-api.io/datasets/form-1sa-files.zip",
4
"name": "Form 1-SA Files Dataset",
5
"updatedAt": "2026-04-23T02:55:48.711Z",
6
"earliestSampleDate": "2016-04-01",
7
"totalRecords": 3432,
8
"totalSize": 93438376,
9
"formTypes": ["1-SA", "1-SA/A"],
10
"containerFormat": "ZIP",
11
"fileTypes": ["HTML", "JSON", "PDF", "TXT"],
12
"containers": [
13
{
14
"downloadUrl": "https://api.sec-api.io/datasets/form-1sa-files/2026/2026-04.zip",
15
"key": "2026/2026-04.zip",
16
"size": 1842715,
17
"records": 47,
18
"updatedAt": "2026-04-23T02:55:48.711Z"
19
}
20
]
21
}
Download Entire Dataset: https://api.sec-api.io/datasets/form-1sa-files.zip?token=YOUR_API_KEY
Downloads the complete Form 1-SA Files Dataset as a single ZIP archive covering all filings from April 2016 to present. This endpoint requires an API key.
Download Single Container: https://api.sec-api.io/datasets/form-1sa-files/2026/2026-04.zip?token=YOUR_API_KEY
Downloads one individual monthly container instead of the full dataset, which is useful for incremental updates or working with a specific time window. Replace the year and month in the path to target a different container listed in the dataset index. This endpoint requires an API key.
The dataset covers Form 1-SA, the Regulation A semiannual report, and Form 1-SA/A, its amendment variant. Both are filed on EDGAR by Tier 2 Regulation A issuers under Rule 257(b)(3).
One record is a single complete EDGAR submission of Form 1-SA or Form 1-SA/A, identified by its 18-digit accession number. The record is materialized as an accession-named folder containing one metadata.json plus every issuer-authored document from the original submission, with image files removed.
Tier 2 Regulation A issuers — domestic operating companies organized in the United States or Canada that have qualified an offering statement on Form 1-A under Tier 2 — must file Form 1-SA. Tier 1 issuers, foreign private issuers, investment companies, BDCs, blank-check companies, and Exchange Act reporting companies do not file this form.
Form 1-SA is filed once per fiscal year, within 90 calendar days after the end of the first six months of the issuer's fiscal year, under Rule 257(b)(3). It is periodic and time-driven, not event-driven, and there is no second-half semiannual filing — the next periodic report after Form 1-SA is the annual Form 1-K.
Form 1-SA was created by the SEC's March 2015 Regulation A+ rulemaking (effective June 19, 2015), and the earliest filings in EDGAR appear in 2016 when the first Tier 2 issuers reached their initial semiannual deadlines. The dataset's earliest sample date is in April 2016 and coverage runs to the most recent refresh.
Form 10-Q is filed three times per year by Exchange Act registrants under full Regulation S-X interim requirements and includes SOX-level controls and procedures certifications. Form 1-SA is filed once per fiscal year by Tier 2 Regulation A issuers under lighter interim financial standards and lacks equivalent certifications. The two datasets cover almost entirely disjoint filer populations.
The dataset is distributed as monthly ZIP containers grouped under year and month-bucketed subdirectories. Inside each container, accession folders carry the original EDGAR documents in HTML, JSON, PDF, and TXT form; image files referenced from the HTML are excluded.