Form 20-F Files Dataset

The Form 20-F Files Dataset is the document-level corpus of annual reports filed on Form 20-F and Form 20-F/A by foreign private issuers (FPIs) registered with the U.S. Securities and Exchange Commission. Each record is a single EDGAR submission identified by its accession number, materialized on disk as a flat folder containing a metadata.json inventory, the primary 20-F or 20-F/A document, and every separately-filed exhibit. The filer is always the issuer itself — a non-U.S.-organized company with securities registered under Section 12 of the Exchange Act or with a continuing reporting obligation under Section 15(d). Coverage runs from the earliest electronic 20-F filings in June 1995 through the present, spanning the full Item 1–19 architecture introduced by the 1999 IOSCO-aligned modernization, the 2007 elimination of the U.S. GAAP reconciliation for IFRS filers, and every subsequent SOX, Dodd-Frank, HFCAA, clawback, insider-trading, and cybersecurity rulemaking. Records are delivered as monthly ZIP containers partitioned by EDGAR filing month (YYYY/YYYY-MM.zip); the file types found inside a record are TXT, JSON, PDF, and HTML.

Update Frequency
Daily
Updated at
2026-05-19
Earliest Sample Date
1995-06-01
Total Size
9.1 GB
Total Records
200,999
Container Format
ZIP
Content Types
TXT, JSON, PDF, HTML
Form Types
20-F, 20-F/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

359 files · 9.1 GB
Download All
2026-05.zip27.6 MB834 records
2026-04.zip213.3 MB4,643 records
2026-03.zip190.7 MB2,609 records
2026-02.zip65.6 MB913 records
2026-01.zip10.8 MB338 records
2025-12.zip4.2 MB115 records
2025-11.zip9.6 MB266 records
2025-10.zip20.8 MB499 records
2025-09.zip7.0 MB169 records
2025-08.zip46.4 MB367 records
2025-07.zip24.8 MB618 records
2025-06.zip17.5 MB379 records
2025-05.zip47.5 MB1,263 records
2025-04.zip297.7 MB4,086 records
2025-03.zip144.6 MB2,580 records
2025-02.zip58.5 MB866 records
2025-01.zip10.8 MB269 records
2024-12.zip7.6 MB205 records
2024-11.zip9.2 MB239 records
2024-10.zip23.2 MB583 records
2024-09.zip6.8 MB160 records
2024-08.zip23.6 MB498 records
2024-07.zip19.3 MB538 records
2024-06.zip13.0 MB260 records
2024-05.zip43.5 MB1,143 records
2024-04.zip274.4 MB4,261 records
2024-03.zip173.2 MB2,665 records
2024-02.zip54.4 MB847 records
2024-01.zip8.6 MB280 records
2023-12.zip4.5 MB148 records
2023-11.zip6.6 MB152 records
2023-10.zip20.2 MB465 records
2023-09.zip41.3 MB257 records
2023-08.zip11.9 MB322 records
2023-07.zip16.1 MB304 records
2023-06.zip19.1 MB346 records
2023-05.zip57.5 MB1,347 records
2023-04.zip207.1 MB3,525 records
2023-03.zip179.3 MB2,538 records
2023-02.zip48.6 MB578 records
2023-01.zip4.8 MB132 records
2022-12.zip5.3 MB144 records
2022-11.zip7.4 MB167 records
2022-10.zip16.1 MB311 records
2022-09.zip39.9 MB242 records
2022-08.zip52.1 MB411 records
2022-07.zip12.9 MB226 records
2022-06.zip21.3 MB431 records
2022-05.zip99.8 MB1,386 records
2022-04.zip262.3 MB3,560 records
2022-03.zip190.2 MB2,362 records
2022-02.zip40.2 MB538 records
2022-01.zip3.5 MB138 records
2021-12.zip4.7 MB131 records
2021-11.zip7.6 MB154 records
2021-10.zip12.4 MB259 records
2021-09.zip8.3 MB184 records
2021-08.zip5.2 MB155 records
2021-07.zip12.0 MB280 records
2021-06.zip14.1 MB324 records
2021-05.zip22.2 MB628 records
2021-04.zip145.3 MB3,378 records
2021-03.zip127.6 MB2,409 records
2021-02.zip36.3 MB444 records
2021-01.zip1.4 MB36 records
2020-12.zip3.5 MB136 records
2020-11.zip6.1 MB136 records
2020-10.zip8.2 MB204 records
2020-09.zip5.3 MB145 records
2020-08.zip6.3 MB205 records
2020-07.zip12.1 MB294 records
2020-06.zip32.6 MB917 records
2020-05.zip21.2 MB563 records
2020-04.zip125.1 MB2,948 records
2020-03.zip90.3 MB1,685 records
2020-02.zip31.3 MB502 records
2020-01.zip1.4 MB59 records
2019-12.zip4.2 MB149 records
2019-11.zip4.0 MB112 records
2019-10.zip10.3 MB232 records
2019-09.zip4.5 MB119 records
2019-08.zip4.2 MB155 records
2019-07.zip10.7 MB271 records
2019-06.zip11.1 MB312 records
2019-05.zip19.2 MB528 records
2019-04.zip114.4 MB2,825 records
2019-03.zip104.5 MB1,827 records
2019-02.zip30.7 MB389 records
2019-01.zip2.4 MB72 records
2018-12.zip2.3 MB84 records
2018-11.zip3.4 MB106 records
2018-10.zip9.2 MB235 records
2018-09.zip5.1 MB89 records
2018-08.zip4.3 MB130 records
2018-07.zip11.6 MB303 records
2018-06.zip11.0 MB308 records
2018-05.zip20.3 MB612 records
2018-04.zip115.5 MB2,731 records
2018-03.zip89.0 MB1,623 records
2018-02.zip20.3 MB457 records
2018-01.zip1.6 MB62 records
2017-12.zip3.7 MB134 records
2017-11.zip4.3 MB125 records
2017-10.zip9.8 MB194 records
2017-09.zip2.4 MB91 records
2017-08.zip5.5 MB194 records
2017-07.zip8.3 MB223 records
2017-06.zip12.2 MB343 records
2017-05.zip25.1 MB745 records
2017-04.zip95.1 MB2,379 records
2017-03.zip75.1 MB1,632 records
2017-02.zip16.6 MB400 records
2017-01.zip2.2 MB73 records
2016-12.zip1.8 MB60 records
2016-11.zip4.9 MB115 records
2016-10.zip14.1 MB204 records
2016-09.zip6.1 MB131 records
2016-08.zip4.4 MB117 records
2016-07.zip8.2 MB233 records
2016-06.zip8.7 MB269 records
2016-05.zip31.5 MB853 records
2016-04.zip108.4 MB2,889 records
2016-03.zip72.5 MB1,528 records
2016-02.zip15.3 MB365 records
2016-01.zip3.6 MB129 records
2015-12.zip3.0 MB82 records
2015-11.zip7.3 MB135 records
2015-10.zip14.2 MB178 records
2015-09.zip2.8 MB108 records
2015-08.zip3.1 MB89 records
2015-07.zip8.1 MB239 records
2015-06.zip9.3 MB274 records
2015-05.zip18.6 MB460 records
2015-04.zip115.4 MB3,083 records
2015-03.zip89.5 MB1,679 records
2015-02.zip19.7 MB459 records
2015-01.zip2.7 MB46 records
2014-12.zip2.0 MB95 records
2014-11.zip4.2 MB106 records
2014-10.zip8.9 MB204 records
2014-09.zip5.9 MB145 records
2014-08.zip3.9 MB87 records
2014-07.zip11.7 MB327 records
2014-06.zip9.0 MB245 records
2014-05.zip15.7 MB508 records
2014-04.zip116.9 MB3,004 records
2014-03.zip65.9 MB1,513 records
2014-02.zip18.3 MB459 records
2014-01.zip1.7 MB67 records
2013-12.zip2.8 MB119 records
2013-11.zip5.2 MB171 records
2013-10.zip11.6 MB295 records
2013-09.zip3.8 MB123 records
2013-08.zip3.6 MB123 records
2013-07.zip10.9 MB295 records
2013-06.zip10.6 MB345 records
2013-05.zip13.5 MB429 records
2013-04.zip120.0 MB3,367 records
2013-03.zip66.0 MB1,308 records
2013-02.zip12.0 MB344 records
2013-01.zip4.2 MB152 records
2012-12.zip3.4 MB111 records
2012-11.zip3.8 MB100 records
2012-10.zip12.3 MB284 records
2012-09.zip5.0 MB161 records
2012-08.zip6.8 MB197 records
2012-07.zip15.7 MB351 records
2012-06.zip10.1 MB355 records
2012-05.zip20.8 MB684 records
2012-04.zip109.4 MB2,953 records
2012-03.zip70.6 MB1,599 records
2012-02.zip10.7 MB312 records
2012-01.zip3.2 MB82 records
2011-12.zip9.1 MB256 records
2011-11.zip8.3 MB323 records
2011-10.zip7.3 MB169 records
2011-09.zip15.0 MB349 records
2011-08.zip4.7 MB164 records
2011-07.zip14.4 MB435 records
2011-06.zip68.8 MB1,803 records
2011-05.zip28.3 MB808 records
2011-04.zip45.1 MB1,288 records
2011-03.zip59.8 MB1,330 records
2011-02.zip10.1 MB294 records
2011-01.zip2.9 MB115 records
2010-12.zip11.6 MB338 records
2010-11.zip10.4 MB274 records
2010-10.zip8.0 MB183 records
2010-09.zip10.7 MB293 records
2010-08.zip8.5 MB220 records
2010-07.zip14.9 MB530 records
2010-06.zip80.8 MB2,100 records
2010-05.zip24.3 MB772 records
2010-04.zip40.2 MB998 records
2010-03.zip65.0 MB1,406 records
2010-02.zip9.3 MB239 records
2010-01.zip4.2 MB140 records
2009-12.zip12.7 MB366 records
2009-11.zip7.8 MB204 records
2009-10.zip7.7 MB244 records
2009-09.zip9.6 MB298 records
2009-08.zip17.6 MB134 records
2009-07.zip20.9 MB640 records
2009-06.zip86.4 MB2,370 records
2009-05.zip33.2 MB885 records
2009-04.zip33.1 MB1,014 records
2009-03.zip49.5 MB1,086 records
2009-02.zip7.8 MB246 records
2009-01.zip5.9 MB203 records
2008-12.zip13.3 MB290 records
2008-11.zip4.4 MB139 records
2008-10.zip8.9 MB244 records
2008-09.zip11.4 MB336 records
2008-08.zip8.5 MB238 records
2008-07.zip21.3 MB541 records
2008-06.zip79.6 MB2,564 records
2008-05.zip30.2 MB800 records
2008-04.zip40.6 MB860 records
2008-03.zip46.0 MB1,166 records
2008-02.zip7.7 MB237 records
2008-01.zip7.3 MB248 records
2007-12.zip18.3 MB470 records
2007-11.zip10.0 MB242 records
2007-10.zip8.2 MB330 records
2007-09.zip10.7 MB447 records
2007-08.zip7.2 MB339 records
2007-07.zip40.9 MB1,129 records
2007-06.zip80.0 MB2,399 records
2007-05.zip24.1 MB703 records
2007-04.zip28.6 MB904 records
2007-03.zip49.7 MB1,044 records
2007-02.zip7.8 MB273 records
2007-01.zip8.4 MB145 records
2006-12.zip19.0 MB409 records
2006-11.zip18.8 MB316 records
2006-10.zip6.9 MB243 records
2006-09.zip12.4 MB431 records
2006-08.zip8.0 MB252 records
2006-07.zip22.6 MB840 records
2006-06.zip98.0 MB3,134 records
2006-05.zip25.2 MB737 records
2006-04.zip26.8 MB878 records
2006-03.zip45.6 MB1,223 records
2006-02.zip7.5 MB273 records
2006-01.zip7.6 MB228 records
2005-12.zip19.1 MB462 records
2005-11.zip9.4 MB304 records
2005-10.zip14.4 MB423 records
2005-09.zip12.5 MB449 records
2005-08.zip9.9 MB382 records
2005-07.zip36.5 MB1,006 records
2005-06.zip101.4 MB3,012 records
2005-05.zip32.1 MB838 records
2005-04.zip28.9 MB989 records
2005-03.zip41.1 MB1,139 records
2005-02.zip5.6 MB230 records
2005-01.zip8.9 MB270 records
2004-12.zip10.1 MB312 records
2004-11.zip6.3 MB237 records
2004-10.zip10.4 MB200 records
2004-09.zip13.9 MB415 records
2004-08.zip22.1 MB186 records
2004-07.zip25.4 MB759 records
2004-06.zip107.1 MB3,139 records
2004-05.zip21.8 MB723 records
2004-04.zip42.3 MB830 records
2004-03.zip57.5 MB1,261 records
2004-02.zip7.4 MB234 records
2004-01.zip5.7 MB168 records
2003-12.zip16.5 MB386 records
2003-11.zip10.0 MB267 records
2003-10.zip12.5 MB262 records
2003-09.zip13.0 MB480 records
2003-08.zip14.4 MB183 records
2003-07.zip26.4 MB665 records
2003-06.zip128.9 MB2,656 records
2003-05.zip27.7 MB584 records
2003-04.zip21.1 MB433 records
2003-03.zip43.0 MB634 records
2003-02.zip6.9 MB201 records
2003-01.zip4.5 MB89 records
2002-12.zip12.3 MB287 records
2002-11.zip4.7 MB90 records
2002-10.zip3.6 MB58 records
2002-09.zip4.9 MB113 records
2002-08.zip2.3 MB57 records
2002-07.zip14.1 MB353 records
2002-06.zip15.0 MB353 records
2002-05.zip12.5 MB239 records
2002-04.zip8.0 MB214 records
2002-03.zip12.8 MB179 records
2002-02.zip3.1 MB84 records
2002-01.zip828.3 KB33 records
2001-12.zip2.6 MB74 records
2001-11.zip1.6 MB62 records
2001-10.zip2.3 MB85 records
2001-09.zip4.0 MB52 records
2001-08.zip1.2 MB44 records
2001-07.zip11.3 MB246 records
2001-06.zip8.5 MB244 records
2001-05.zip7.9 MB198 records
2001-04.zip3.8 MB114 records
2001-03.zip4.2 MB101 records
2001-02.zip987.1 KB20 records
2001-01.zip1.1 MB38 records
2000-12.zip756.9 KB32 records
2000-11.zip743.5 KB24 records
2000-10.zip2.7 MB27 records
2000-09.zip2.6 MB69 records
2000-08.zip517.6 KB27 records
2000-07.zip4.3 MB99 records
2000-06.zip5.2 MB125 records
2000-05.zip2.9 MB103 records
2000-04.zip2.2 MB56 records
2000-03.zip3.9 MB116 records
2000-02.zip341.7 KB9 records
2000-01.zip329.4 KB6 records
1999-12.zip271.2 KB23 records
1999-11.zip348.3 KB5 records
1999-10.zip187.2 KB4 records
1999-09.zip1.0 MB32 records
1999-08.zip1.3 MB72 records
1999-07.zip1.6 MB39 records
1999-06.zip3.4 MB88 records
1999-05.zip2.0 MB74 records
1999-04.zip1.1 MB41 records
1999-03.zip1.9 MB68 records
1999-02.zip107.7 KB7 records
1999-01.zip183.7 KB6 records
1998-12.zip249.0 KB12 records
1998-11.zip241.0 KB7 records
1998-10.zip371.1 KB10 records
1998-09.zip828.8 KB17 records
1998-08.zip201.1 KB4 records
1998-07.zip339.3 KB22 records
1998-06.zip2.3 MB63 records
1998-05.zip1.4 MB46 records
1998-04.zip532.8 KB19 records
1998-03.zip764.3 KB21 records
1998-01.zip72.1 KB1 records
1997-12.zip194.8 KB12 records
1997-10.zip547.6 KB22 records
1997-09.zip184.8 KB14 records
1997-08.zip261.9 KB4 records
1997-07.zip373.1 KB17 records
1997-06.zip948.1 KB25 records
1997-05.zip597.7 KB27 records
1997-04.zip251.0 KB12 records
1997-03.zip156.1 KB15 records
1996-12.zip70.3 KB1 records
1996-11.zip72.2 KB1 records
1996-09.zip134.1 KB4 records
1996-08.zip99.2 KB6 records
1996-07.zip351.0 KB12 records
1996-06.zip296.8 KB17 records
1996-05.zip76.0 KB2 records
1996-04.zip137.9 KB9 records
1996-01.zip29.8 KB1 records
1995-06.zip3.0 KB1 records

What This Dataset Contains

The Form 20-F Files Dataset captures one foreign-private-issuer annual report (or one amendment) per record, in essentially the textual form in which it was originally accepted by EDGAR, minus image assets and standalone XBRL data files (both of which are referenced in metadata.json but not packaged on disk). On disk a record materializes as a flat folder named with the 18-digit dash-stripped accession number (for example 000119312525319187), nested under a monthly partition YYYY/YYYY-MM.zip keyed to the EDGAR filing month (filedAt), not the fiscal period being reported. Inside the folder every artifact for the filing sits as a flat sibling: a metadata.json inventory plus the primary annual report document and each separately-filed exhibit as its own file. There are no nested sub-directories.

Form 20-F is the annual report mandated by Sections 13 or 15(d) of the Securities Exchange Act of 1934 for foreign private issuers whose equity securities are registered or listed in the United States. It plays the same disclosure role as Form 10-K for domestic registrants but accommodates non-U.S. accounting frameworks (commonly IFRS as issued by the IASB), home-country governance regimes, and foreign corporate-law concepts. The form is also used as a registration statement under Section 12 of the Exchange Act when a foreign issuer first lists in the U.S., though the dataset's primary content is annual-report use. Form 20-F/A is an amendment to a previously filed 20-F: it carries the same form architecture, restates or supplements specific Items, and reuses the original periodOfReport of the report being amended — so the filing-month partition of the dataset does not coincide with the fiscal-year partition for amendments. A 20-F/A may republish the entire annual report with corrections, or it may target a small number of Items or exhibits (the dataset includes minimal amendments consisting of only the primary document plus, for example, a single clawback policy exhibit). The amendment does not replace the original record; it coexists with it under its own accession number.

The substantive disclosure schema is set by the SEC's General Instructions to Form 20-F. Parts I through III contain the substantive Items; Part IV captures supplemental information; signatures close the body; and a list of exhibits — many filed as separate documents — is governed by the Instructions as to Exhibits. The Item architecture is closely aligned with the IOSCO international disclosure standards for cross-border offerings.

Content Structure of a Single Record

Content layers of a single record

Inside an accession folder the content layers, in load order, are:

  1. metadata.json — a single JSON object describing the filing, its filer entity, and a complete inventory of every document and data file in the original EDGAR submission.
  2. The primary annual report document — the Form 20-F or 20-F/A itself, in modern filings a single large XHTML file with inline-XBRL markup. This is the document tagged type: "20-F" or type: "20-F/A" in documentFormatFiles.
  3. Exhibit documents — separate files for each exhibit numbered under the Form 20-F exhibit instructions (description of securities, articles of association, material contracts, list of subsidiaries, certifications, consents, code of ethics, clawback policy, etc.).
  4. Other supporting attachments referenced by EDGAR submission type codes (additional consents, voluntary attachments, press releases tagged as EX-99.x).

What is enumerated in metadata.json but not packaged on disk:

  • All GRAPHIC-type files (JPG, GIF, PNG embedded illustrations, charts, signatures-as-images, and inline-XBRL inline-image references) — excluded by dataset policy.
  • The XBRL data files listed under dataFiles (the standalone instance document *_htm.xml and the linkbase files EX-101.SCH/CAL/DEF/LAB/PRE) — referenced via documentUrl but not physically included.
  • The complete-submission .txt envelope, which is enumerated as the trailing row in documentFormatFiles (with a sentinel single-space sequence and type) but not bundled.

The authoritative description of what each file is comes from the documentFormatFiles array inside metadata.json — file names themselves are filer-agent-specific and do not encode role information consistently.

metadata.json — field-by-field anatomy

metadata.json is the canonical record header. Its top-level keys are:

  • formType"20-F" or "20-F/A".
  • accessionNo — the canonical dashed accession number (e.g. "0001104659-25-124926"); the folder name is the same value with dashes removed.
  • filedAt — ISO-8601 timestamp with timezone offset capturing the EDGAR acceptance moment.
  • periodOfReport — fiscal-period end date in YYYY-MM-DD form. For 20-F/A this points to the period of the original report being amended, which can predate filedAt by years.
  • description — human-readable form description such as "Form 20-F - Annual and transition report of foreign private issuers [Sections 13 or 15(d)]".
  • linkToFilingDetails — direct EDGAR URL to the primary report HTM, prefixed by the inline-XBRL viewer path https://www.sec.gov/ix?doc=... when the primary document carries iXBRL.
  • linkToTxt — URL to the complete EDGAR submission text bundle (<accessionNo>.txt).
  • linkToHtml — URL to the EDGAR submission index page (...-index.htm).
  • linkToXbrl — present but consistently an empty string, even when XBRL data clearly exists; rely on dataFiles[] and the inline tagging in the primary document instead.
  • id — internal 32-character hex content hash.
  • seriesAndClassesContractsInformation — empty array for 20-F filings (the structure is reused from investment-company filings).
  • documentFormatFiles — ordered array, one element per document in the EDGAR submission (primary report, each exhibit, each graphic, plus the trailing complete-submission text bundle).
  • dataFiles — array enumerating the XBRL schema, linkbase files, and the extracted instance document.
  • entities — array of filer descriptors, almost always one element for a 20-F.

Each documentFormatFiles[] entry carries sequence (string-typed; the complete-submission .txt row uses a literal space), size (bytes, string-typed), documentUrl, free-text description (e.g. "20-F", "ANNUAL REPORT", "CERTIFICATION", "DESCRIPTION OF SECURITIES", "LIST OF SUBSIDIARIES", "CONSENT OF ...", "GRAPHIC", "AMENDMENT NO 1", "Complete submission text file"), and the EDGAR submission type code. Type codes seen on Form 20-F filings include the primary 20-F / 20-F/A; narrative exhibits such as EX-1.2, EX-2, EX-2.1, EX-4.A(8); EX-8 / EX-8.1 (list of subsidiaries); governance and policy exhibits EX-11.1 (code of ethics), EX-14.1, EX-19.1 (insider-trading policies), EX-97 (clawback policy); SOX certifications EX-12.1 / EX-12.2 (Section 302) and EX-13.1 / EX-13.2 (Section 906); auditor and counsel consents EX-15.1 / EX-15.2 / EX-15.3 and EX-23.1; miscellaneous EX-16.1, EX-22.1, EX-99.1; and GRAPHIC rows for embedded image assets that are excluded from the on-disk package. The codes are not normalized across filers — the same conceptual exhibit may appear as EX-8 for one filer and EX-8.1 for another — so consumers should treat them as filer-supplied tags rather than an enumerated controlled vocabulary. The order roughly follows EDGAR submission sequence: primary 20-F, narrative exhibits, certifications, consents, other exhibits, graphics, and finally the complete-submission .txt bundle.

dataFiles[] entries follow the same shape and enumerate the XBRL artifacts referenced by the filing: EX-101.SCH taxonomy schema, EX-101.CAL calculation linkbase, EX-101.DEF definition linkbase, EX-101.LAB label linkbase, EX-101.PRE presentation linkbase, and an XML row labeled "EXTRACTED XBRL INSTANCE DOCUMENT" pointing at *_htm.xml. A filing whose taxonomy has no calculations may omit EX-101.CAL. Sequence numbers on dataFiles[] are typically non-contiguous because EDGAR slots them after the GRAPHIC rows.

entities[] describes the filer with companyName (with the role suffix (Filer) appended), cik (no leading zeros), fileNo (e.g. "001-14840"), filmNo, irsNo (commonly "000000000" for foreign filers), fiscalYearEnd in MMDD form (e.g. "0930", "1231", "0731"), stateOfIncorporation as the EDGAR state/country code (e.g. X0 Israel, D8 Cayman, E9 Singapore, A1 Alberta), act set to "34", type echoing the form type, sic carrying both the four-digit SIC code and its description (e.g. "7372 Services-Prepackaged Software"), and tickers as an array of trading symbols (e.g. ["DOX"], ["DFSC", "KWE", "KWEMF", "DFSCW"]).

Section-by-section structure of the primary 20-F document

The primary annual report document carries the form's substantive disclosure, organized by the General Instructions to Form 20-F into the architecture below. Filings are not required to render the headings literally, but the conceptual order is consistent.

Cover page. Identifies the registrant and the report; marks whether the filing is an annual report, transition report, or shell-company report; declares the title and number of outstanding shares of each class as of the period end; indicates whether the registrant is an emerging-growth company or a shell company; identifies the basis of accounting (U.S. GAAP, IFRS as issued by the IASB, or "Other") and, when "Other," the specific items the auditor has reviewed; states whether the registrant has filed all reports required during the preceding 12 months; and (where applicable) indicates whether the registrant has prepared an Interactive Data File. Modern cover pages are exhaustively iXBRL-tagged via the dei (Document and Entity Information) namespace.

Part I.

  • Item 1 — Identity of Directors, Senior Management and Advisers. Required only in registration-statement use; omitted or marked "Not applicable" in annual-report 20-Fs.
  • Item 2 — Offer Statistics and Expected Timetable. Same — registration-statement context only.
  • Item 3 — Key Information. Historically the home of selected financial data and dividend history; since the 2020 amendments under SEC Release 33-10890 the tabular Selected Financial Data sub-item (formerly 3.A) was eliminated, leaving capitalization and indebtedness disclosure (only required for offerings) and, importantly, Item 3.D — Risk Factors, which is the dominant narrative of this Part. The forward-looking-statements legend typically sits in Item 3.
  • Item 4 — Information on the Company. Business description, history and development, organizational structure, principal capital expenditures, property/plant/equipment, and the regulatory environment in the issuer's home jurisdiction.
  • Item 4A — Unresolved Staff Comments. A short disclosure on outstanding SEC Staff comments more than 180 days old.
  • Item 5 — Operating and Financial Review and Prospects. The 20-F equivalent of MD&A: results of operations, liquidity and capital resources, research and development policy, trend information, and (since the 2020 amendments) critical accounting estimates. The pre-2020 sub-items on off-balance-sheet arrangements and the tabular contractual-obligations disclosure were eliminated.
  • Item 6 — Directors, Senior Management and Employees. Biographies, compensation (aggregate and individual where required by home-country law or otherwise disclosed), board practices, employee headcount, share ownership, and any share-option arrangements.
  • Item 7 — Major Shareholders and Related Party Transactions. Beneficial-ownership tables, related-party transactions, and interests of experts and counsel.
  • Item 8 — Financial Information. Cross-reference to the consolidated financial statements, legal proceedings, dividend policy, and significant subsequent changes.
  • Item 9 — The Offer and Listing. Trading market, plan of distribution, markets on which the securities trade, and historical price ranges (the price-history sub-item was streamlined in the 2008 amendments).
  • Item 10 — Additional Information. Memorandum and articles of association, material contracts, exchange controls, taxation (including the U.S. federal-income-tax discussion for U.S. holders), dividends and paying agents, statements by experts, documents on display, and subsidiary information.
  • Item 11 — Quantitative and Qualitative Disclosures About Market Risk.
  • Item 12 — Description of Securities Other Than Equity Securities. Including, for ADR issuers, Item 12.D — American Depositary Shares, which since 2008 must disclose ADR fees and depositary payments to the issuer.

Part II.

  • Item 13 — Defaults, Dividend Arrearages and Delinquencies.
  • Item 14 — Material Modifications to the Rights of Security Holders and Use of Proceeds.
  • Item 15 — Controls and Procedures. Disclosure Controls (15(a)), Management's Annual Report on Internal Control over Financial Reporting (15(b)), the auditor's attestation report (15(c)) where required, and changes in internal control (15(d)) — required since the SEC's 2007 implementation of SOX Section 404 for foreign private issuers.
  • Item 16 — Audit Committee, Code of Ethics, and related governance disclosures. Item 16 has accreted significant sub-items: 16A (audit-committee financial expert), 16B (code of ethics), 16C (principal-accountant fees and services), 16D (audit-committee exemptions), 16E (purchases of equity securities by the issuer and affiliated purchasers), 16F (change in registrant's certifying accountant), 16G (corporate-governance differences from NYSE/Nasdaq home-country practice), 16H (mine-safety disclosure), 16I (HFCAA disclosure on auditors located in jurisdictions where the PCAOB cannot inspect, added 2021), 16J (insider-trading policies, added under the 2022 Item 408 alignment), and 16K (cybersecurity, added by the 2023 cybersecurity disclosure rule, effective for FPI annual reports for fiscal years ending on or after 15 December 2024).

Part III.

  • Item 17 — Financial Statements. Largely vestigial; new uses of Item 17 were eliminated by the 2007–2008 amendments removing the Item 17/18 distinction for most filers.
  • Item 18 — Financial Statements. The full audited consolidated financial statements, prepared either under U.S. GAAP, IFRS as issued by the IASB (no reconciliation required since November 2007), or another GAAP with a U.S. GAAP reconciliation. Includes the auditor's report (post-2017 with Critical Audit Matters when the auditor is U.S.-PCAOB-registered and CAMs apply) and the full notes to the financial statements.
  • Item 19 — Exhibits. The exhibit index keyed to the Instructions as to Exhibits.

Signatures. A signature block by an officer of the registrant certifying that the registrant has duly caused the report to be signed.

Inline XBRL in the primary document

For modern 20-F filings the primary document is full-fidelity XHTML carrying inline-XBRL markup. The file begins with an <?xml ... ?> prologue and an <html> element bearing many namespace declarations (xmlns:dei, xmlns:us-gaap, xmlns:ix, xmlns:xbrli, xmlns:srt, plus xmlns:ifrs-full for IFRS filers and a filer-specific extension namespace). Tagged facts are embedded in elements such as <ix:nonNumeric contextRef="..." name="dei:DocumentType">20-F</ix:nonNumeric>, with <ix:header>, <ix:hidden>, <ix:references>, and <ix:resources> blocks at the top of <body>. These primary documents are large — frequently 1–6 MB — because they carry the entire annual report plus its structured tagging. DEI tagging blankets the cover page, the financial statements are tagged against the US-GAAP taxonomy or IFRS taxonomy as applicable, and (per the 2020 amendments) risk-factor section headings are detail-tagged. Inline XBRL was phased in for FPIs by SEC Release 33-10514, with FPIs using IFRS required to comply for fiscal periods ending on or after 15 June 2021.

Exhibit-by-exhibit breakdown

Form 20-F's Instructions as to Exhibits define a numbered exhibit scheme. Each exhibit typically appears as a separate file in the accession folder. The most common exhibits and their roles:

  • Exhibit 1 — Memorandum and articles of association, charter, bylaws, or equivalent constitutional documents under home-country law. Sub-numbering (EX-1.1, EX-1.2, EX-1.3) is used when multiple constitutional documents are filed.
  • Exhibit 2 — Instruments defining the rights of security holders (indentures, deposit agreements for ADRs, share certificates). EX-2.D (or a numbered sub-exhibit) is the post-2019 home of the Description of Securities Registered under Section 12.
  • Exhibit 4 — Material contracts (including the EX-4.A series for material agreements, employment and management contracts, financing agreements). Sub-references such as EX-4.A(8) reflect the deeply nested numbering used by long-running filers.
  • Exhibit 7 — Statement explaining ratio calculations, when ratios are presented.
  • Exhibit 8 — List of subsidiaries (EX-8 or EX-8.1), required by Item 19 and analogous to a 10-K's Exhibit 21.
  • Exhibit 11 — Code of ethics (EX-11.1), required since the 2003 SOX-driven amendments.
  • Exhibit 12SOX Section 302 certifications by the principal executive and principal financial officers (EX-12.1, EX-12.2).
  • Exhibit 13SOX Section 906 certifications (EX-13.1, EX-13.2).
  • Exhibit 14 — Consent of persons named as having prepared or certified parts of the report when relied upon as experts (legacy use; mostly subsumed by Exhibit 15).
  • Exhibit 15 — Auditor consents (EX-15.1) and consents of other experts and counsel (EX-15.2, EX-15.3), required when financial statements are incorporated by reference into Securities Act registration statements.
  • Exhibit 16 — Letter from former auditor on dismissal/resignation, when applicable.
  • Exhibit 19 — Insider-trading policies (added with the SEC's 2022 amendments; analogous to the 10-K Item 408(b) disclosure).
  • Exhibit 22 — Subsidiary guarantor / affiliate-issuer disclosure under Rule 13-01 (added 2020).
  • Exhibit 23 — Auditor consents in the alternative numbering used by some filers.
  • Exhibit 97 — Compensation recovery (clawback) policy required by the 2023 listing-standards rule.
  • Exhibit 99 — Other miscellaneous exhibits at the filer's discretion (press releases, additional consents, voluntary attachments).

The exhibit type code in documentFormatFiles[].type is the canonical role tag; the filename pattern (tmb-20250930xex12d1.htm, dox-ex4_a8.htm, birk-ex2_1.htm, etc.) is filer-agent-specific and not standardized.

File formats inside an accession folder

The file types found in the dataset are TXT, JSON, PDF, and HTML, but in modern filings the on-disk content is overwhelmingly HTM plus the JSON manifest, with PDF and plain TXT appearing primarily in older filings. Two distinct HTM dialects coexist within a single accession folder:

  1. Inline-XBRL XHTML (the primary 20-F / 20-F/A document). As described above, this is full-fidelity XHTML beginning with an <?xml ... ?> declaration and a long namespace-laden <html> element, with <ix:...> tagged facts embedded throughout the body.

  2. SGML-wrapped legacy HTML (most exhibits). The file opens with the EDGAR submission-bundle SGML envelope tags and then an inner uppercase-tag HTML body:

    1 <DOCUMENT>
    2 <TYPE>EX-12.1
    3 <SEQUENCE>4
    4 <FILENAME>ea027079501ex12-1_ezgotech.htm
    5 <DESCRIPTION>CERTIFICATION
    6 <TEXT>
    7 <HTML>
    8 <HEAD>
    9 <TITLE></TITLE>
    10 </HEAD>
    11 <BODY STYLE="font: 10pt Times New Roman, Times, Serif">
    12 ...exhibit body (uppercase HTML tags, inline styles, no XBRL)...

    These files are not standalone XHTML — the <DOCUMENT> / <TYPE> / <SEQUENCE> / <FILENAME> / <DESCRIPTION> / <TEXT> lines are normally wrapped inside the complete-submission .txt envelope but here remain on each exhibit. A consumer that wants clean HTML must strip those leading lines (and any trailing </TEXT></DOCUMENT>) before passing to an HTML parser.

File-naming patterns

There is no enforced naming standard; each filer agent has its own. Patterns commonly observed include:

  • Filer-agent prefix + period/slug: tmb-20250930x20f.htm, tmb-20250930xex12d1.htm (RR Donnelley convention — x separator, d denoting sub-numbering such that 12d1 represents 12.1).
  • Edgar-Online style: ea0270795-20f_<slug>.htm, ea027079501ex12-2_<slug>.htm (sequential ea + accession suffix + ex<X-Y>_<slug>).
  • Issuer slug + numeric exhibit: dox-20250930.htm, dox-ex4_a8.htm; birk-20250930.htm, birk-ex2_1.htm; umewf_20f.htm, umewf_ex121.htm.
  • Period-of-report named primary: arqq-20250930x20f.htm, lvro-20250630.htm — the primary 20-F is frequently named after the reporting period date.

The only reliable way to map a file to its role is via documentFormatFiles[] in metadata.json (matched on the trailing path segment of documentUrl).

Included content

A record physically includes:

  • metadata.json describing the filing, its filer, and the full original-submission inventory.
  • The primary 20-F or 20-F/A document.
  • All exhibits enumerated in documentFormatFiles whose type is not GRAPHIC.
  • Any non-graphic supporting attachments included in the EDGAR submission.

Excluded or separately referenced content

  • All GRAPHIC-type files (embedded JPG/GIF/PNG illustrations, charts, signatures, and inline-image references) are excluded from the on-disk package by dataset policy. Their documentUrl values in metadata.json remain the authoritative source if they are needed.
  • The XBRL linkbase files plus the standalone extracted XBRL instance document (*_htm.xml) are listed in dataFiles but not packaged. The structured XBRL facts are nonetheless embedded inline in the primary document via the <ix:...> tags, so the structured data is fully recoverable from the on-disk material alone.
  • The complete-submission .txt envelope is enumerated (with literal-space sequence and type) but not bundled — consumers who need the raw EDGAR submission can dereference linkToTxt.

Evolution of required content over time

Form 20-F's substantive disclosure schema has been amended multiple times since 1995, and the dataset spans those eras:

  • 1999 — Modernization to align with IOSCO international disclosure standards. SEC Release 34-41936 reorganized Form 20-F into the Part I–IV / Item 1–19 architecture in use today, replacing a much older format whose Items did not align with the international standard.
  • 2003 — SOX-era additions. Code-of-ethics exhibit (Exhibit 11), audit-committee financial expert disclosure (Item 16A), principal-accountant fees and services (Item 16C), and Section 302/906 certifications (Exhibits 12 and 13).
  • 2007 — IFRS without reconciliation. Foreign private issuers preparing financial statements under IFRS as issued by the IASB were no longer required to reconcile to U.S. GAAP, materially shrinking the financial-statements footnote burden for many filers.
  • 2007–2008 — General amendments. Removal of the Item 17/Item 18 distinction for most filers; elimination of the five-year price-history sub-item; Item 16E (issuer purchases of equity securities); Item 16F (change of certifying accountant); Item 16G (corporate-governance home-country differences); Item 12.D ADR fee disclosure; and acceleration of the filing deadline to four months after fiscal year end.
  • 2010 — Mine-safety disclosure (Item 16H) added under Dodd-Frank.
  • 2017 — Critical Audit Matters introduced into the auditor's report (PCAOB AS 3101) with phased adoption.
  • 2019 — Description of Securities exhibit (Exhibit 2.D) required to consolidate the Section 12 description of registered securities.
  • 2020 — MD&A and Item 3 modernization. SEC Release 33-10890 eliminated Selected Financial Data, off-balance-sheet arrangements as a separate sub-item, and the tabular contractual-obligations disclosure; consolidated critical accounting estimates into Item 5; and streamlined risk-factor headers.
  • 2020 — Subsidiary-guarantor disclosure (Exhibit 22) added under Rule 13-01.
  • 2021 — Holding Foreign Companies Accountable Act (Item 16I) disclosure for issuers whose auditors are located in jurisdictions that do not permit PCAOB inspection.
  • 2022–2023 — Insider-trading policies (Item 16J / Exhibit 19) and clawback policy (Exhibit 97) required by Dodd-Frank rulemakings.
  • 2023 — Cybersecurity disclosure added as Item 16K, requiring disclosure of cybersecurity risk management, strategy, and governance, with separate prompt 6-K disclosure of material cybersecurity incidents — effective for FPI annual reports for fiscal years ending on or after 15 December 2024.

Evolution of file format over time

The dataset begins in June 1995, when EDGAR submissions were predominantly plain ASCII text, and extends to the present:

  • 1995–c. 2002 — Plain-text era. The primary 20-F and its exhibits were delivered as ASCII .txt files inside the SGML submission envelope, with tabular data rendered as fixed-width text.
  • Early 2000s — HTML adoption. Filers transitioned to HTML for the body of the report, while the EDGAR submission remained an SGML-wrapped bundle. This is the era from which the <DOCUMENT>/<TYPE>/<SEQUENCE>/<FILENAME>/<DESCRIPTION>/<TEXT> header style for exhibits dates, and that style persists in many exhibits even today.
  • 2008–2009 — XBRL pilot and phase-in. Financial-statement XBRL tagging arrived as separate EX-101.* exhibits filed alongside the primary document. FPIs using IFRS were phased in for fiscal years ending on or after 15 June 2011, once the IFRS taxonomy was available.
  • 2017–2021 — Inline XBRL (iXBRL). Inline-XBRL tagging was phased in by SEC Release 33-10514, with FPIs using IFRS required to comply for fiscal periods ending on or after 15 June 2021. After phase-in, the primary 20-F became a single XHTML document carrying both the human-readable narrative and the structured tagging, eliminating the need for a separate instance document for human consumption (although an EXTRACTED XBRL INSTANCE DOCUMENT continues to be listed in dataFiles).
  • Modern era. The primary 20-F is an XHTML/iXBRL document with DEI tagging on the cover page, taxonomy-tagged financial-statement facts, and detail-tagged risk-factor headings. Exhibits remain SGML-wrapped legacy HTML. PDF documents and standalone plain-text documents appear sporadically, primarily as artifacts of older filings or unusual exhibit content.

Interpretation and extraction notes

  • Filing-month vs fiscal-year partition. Records are bucketed by the YYYY-MM of filedAt, not periodOfReport. A 20-F/A in a December 2025 partition can amend a fiscal year ending years earlier; a 20-F filed in early 2026 will sit in a 2026 partition even though it covers fiscal 2025.
  • Inline-XBRL viewer URL. linkToFilingDetails carries the https://www.sec.gov/ix?doc=/... viewer prefix when the primary document is iXBRL. Drop the prefix to retrieve the raw XHTML directly.
  • linkToXbrl is consistently empty even when XBRL is plainly present. Rely on dataFiles[] and the inline tagging in the primary document.
  • Filename heuristics are unreliable. The mapping from file to role must be done through documentFormatFiles[] by matching the trailing path segment of documentUrl. Filer-agent naming patterns vary (RR Donnelley tmb-...x20f.htm, Edgar Online ea...-20f_<slug>.htm, issuer-slug dox-20250930.htm, etc.).
  • Type codes are not normalized. The same conceptual exhibit can appear under different type codes across filers (EX-8 vs EX-8.1 for list of subsidiaries; minor variation in EX-12.x / EX-13.x / EX-15.x numbering). Treat type codes as filer-supplied tags rather than a controlled vocabulary.
  • SGML-wrapped exhibits need pre-processing. Exhibit files often retain the EDGAR <DOCUMENT>...<TEXT> header outside the <HTML> body. Strip those header lines (and any trailing </TEXT></DOCUMENT>) before parsing as HTML.
  • sequence: " " sentinel. The complete-submission .txt row uses a literal space for both sequence and type; consumers should not assume sequence is numeric.
  • Image and standalone-XBRL fetching. For applications needing graphics or the standalone XBRL instance/linkbase files, documentUrl values in metadata.json resolve directly to the EDGAR-hosted originals.
  • 20-F/A scope. The body of a 20-F/A typically opens with an explanatory note describing the scope of the amendment; that note is the most reliable signal of what changed. Amendment packages range from a near-full re-publication of the annual report down to the primary document plus a single targeted exhibit (e.g. a clawback-policy update). The amendment coexists with the original record under its own accession number and does not replace it.
  • File-count is not a completeness signal. metadata.json is the canonical inventory; the on-disk file count alone does not indicate whether a filing is "minimal" or "complete," because graphics and standalone XBRL artifacts are deliberately excluded from the package.

Who Files or Publishes This Dataset, and When

Who files

Each record is an EDGAR submission of Form 20-F or Form 20-F/A by a foreign private issuer (FPI) registered with the SEC. The filer is always the issuer itself: a non-U.S.-organized company with securities registered under Section 12 of the Securities Exchange Act of 1934, or with a continuing reporting obligation under Section 15(d).

"Foreign private issuer" is defined in Rule 405 (Securities Act) and Rule 3b-4 (Exchange Act). The status test is reapplied annually as of the last business day of the second fiscal quarter:

  1. The issuer must be organized outside the United States.
  2. If 50% or less of voting securities are held of record by U.S. residents, the issuer qualifies. If more than 50% are U.S.-held, it still qualifies unless any of the following is true: a majority of executive officers or directors are U.S. citizens or residents; more than 50% of assets are in the U.S.; or the business is administered principally in the U.S.

An issuer that fails the test loses FPI status at fiscal year end and transitions to domestic forms (10-K, 10-Q, 8-K, Regulation 14A proxy rules) the following fiscal year.

The 20-F population spans operating companies, holding companies, banks, insurers, mining and resource companies, and other non-U.S. enterprises listed on NYSE, NYSE American, Nasdaq, or OTC markets, or otherwise SEC-registered. Securities may take the form of ADRs, ordinary share listings, or registered debt. Dual-listed and U.S.-only-listed FPIs both file on Form 20-F.

What triggers the record

The 20-F is a schedule-driven annual report, not an event-driven filing. The obligation arises from:

  • Section 13(a) of the Exchange Act for FPIs with a class of securities registered under Section 12 (typically through U.S. exchange listing); or
  • Section 15(d) for FPIs with a continuing reporting obligation from a registered Securities Act offering (e.g., a prior Form F-1).

Rules 13a-1 and 15d-1 require an annual report on the form prescribed for the issuer class; Form 20-F's General Instructions designate it as the FPI annual report (other than for Form 40-F MJDS filers). Form 20-F can also serve as an Exchange Act Section 12 registration statement, but the overwhelming majority of records in this dataset are annual reports.

When it is filed

  • Annual deadline: within four months after fiscal year end. The deadline is uniform across FPIs and does not vary by accelerated filer, large accelerated filer, or non-accelerated status. (The four-month rule replaced the prior six-month deadline for fiscal years ending on or after December 15, 2011.)
  • Late-filing extension: under Rule 12b-25, an FPI that files Form NT 20-F before the original due date receives an automatic 15-calendar-day extension. A 20-F filed inside that window is deemed timely; outside it, the issuer is delinquent and may lose Form F-3/S-3 short-form eligibility.
  • Transition reports: Form 20-F is also used as a transition report when an FPI changes its fiscal year (Rules 13a-10 / 15d-10).

Amendments (Form 20-F/A)

Form 20-F/A records amend a previously filed 20-F. Common triggers:

  • restatement of audited financial statements
  • correction of disclosures (risk factors, related-party items, executive compensation, material contracts)
  • inclusion of items omitted from the original filing
  • response to SEC staff comment letters
  • revised auditor consents, certifications, or exhibits

There is no fixed deadline; an amendment is filed when the need arises. A 20-F/A neither resets the original due date nor cures delinquent status for the underlying report.

Coverage and earliest records

Form 20-F was adopted in 1979 (Release No. 34-16371), consolidating prior foreign-issuer annual report forms. EDGAR availability begins around June 1995, consistent with the earliest dataset sample; mandatory electronic filing of 20-F by FPIs was substantially complete by the late 1990s. The dataset therefore reflects the full electronic record of FPI annual reports and amendments from June 1995 to the present.

Filer versus subjects of disclosure

The legal filer is always the issuer. Directors, senior management, controlling shareholders, subsidiaries, joint ventures, and counterparties are described in the report but are not filers of the 20-F record. In a corporate group, only the SEC-registered issuer files; non-registered parents and subsidiaries appear only as described entities.

How This Dataset Differs From Similar Datasets or Filings

Form 20-F sits within the SEC's foreign private issuer reporting regime. The most useful comparisons are with the domestic annual report (10-K), the MJDS Canadian annual report (40-F), the FPI interim/event report (6-K), the FPI registration statement family (F-1/F-3/F-4 and the domestic S-1), 20-F/A amendments, a hypothetical Items extraction product, and structured XBRL fact datasets. Each overlaps with 20-F in subject matter or filer population but differs sharply in scope, cadence, structure, or content type.

Form 10-K

The closest functional analog. Both are Exchange Act annual reports with audited financials, business descriptions, risk factors, and MD&A. They are mutually exclusive at the filer level: 10-K covers domestic registrants, 20-F covers FPIs.

Key differences:

  • Item schedule. 20-F follows the FPI schedule (Items 3-19, including Key Information, Operating and Financial Review, Major Shareholders and Related Party Transactions, Financial Statements at Item 17 or Item 18). 10-K uses Items 1-15 with entirely different headings.
  • Accounting framework. 20-F filers may report under U.S. GAAP or IFRS as issued by the IASB; 10-K filers must use U.S. GAAP. 20-F is therefore the appropriate corpus for IFRS research.
  • Deadline. 20-F is due four months after fiscal year end; 10-K accelerated and large accelerated filers face 60 or 75 day deadlines.
  • Internal controls. Both include SOX 302/404 attestations, but 20-F applies FPI-specific accommodations and exemptions.
  • Compensation. 20-F generally permits aggregate disclosure consistent with home-country practice; 10-K requires Item 402 named-executive-officer tables.

To assemble a complete SEC annual-report universe, 10-K and 20-F datasets must be combined.

Form 40-F

The annual report used by qualifying Canadian issuers under the Multijurisdictional Disclosure System. Same role as 20-F (annual report by a non-U.S. issuer) but a different regime:

  • Eligibility. 40-F is restricted to MJDS-eligible Canadian issuers; 20-F is the default for all other FPIs and for non-MJDS Canadian filers.
  • Content. 40-F is largely a wrapper around home-country Canadian disclosure (AIF, MD&A, audited financials). 20-F is a self-contained SEC-prescribed document built to the FPI item schedule.

A 40-F dataset is complementary to 20-F, not a substitute.

Form 6-K

The FPI interim and event-driven report. FPIs do not file Form 10-Q or Form 8-K; Form 6-K covers both functions. Differences from 20-F:

  • Cadence. 6-K is event-driven and filed throughout the year; 20-F is annual.
  • Content. 6-K wraps press releases, interim financials, shareholder communications, and exchange filings. 20-F is the comprehensive audited annual report.
  • Structure. 6-K has no SEC-prescribed item schedule and varies widely; 20-F is standardized.
  • Audit. 6-K interim financials are typically unaudited; 20-F financials are audited.

6-K is the natural year-round complement to 20-F for the same filer population.

Forms F-1, F-3, F-4 (and the S-1 distinction)

F-1, Form F-3, and Form F-4 are the FPI Securities Act registration statements (long-form, shelf, and business combinations respectively). Form S-1 is the domestic equivalent of F-1; the S/F divide tracks the same domestic/FPI line as 10-K vs 20-F.

Relationship to 20-F:

  • Trigger. F-series filings are tied to offerings or business combinations, not fiscal-year passage. 20-F is periodic.
  • Incorporation by reference. F-3 leans heavily on existing 20-F content, making 20-F the primary source even during an F-3 offering.
  • Content overlap. F-1 prospectuses share business, risk factor, MD&A, and audited financial content with 20-F but add offering-specific items (use of proceeds, plan of distribution, dilution, underwriting) absent from 20-F.

20-F is the source for periodic FPI disclosure; F-1/F-3/Form S-3/F-4 datasets are the source for offering-stage disclosure.

Form 20-F/A amendments

20-F/A filings amend a previously filed 20-F to correct errors, restate financials, add omitted exhibits, or respond to staff comments. They are included in this dataset alongside originals.

  • Point-in-time research must reconcile each original 20-F with subsequent 20-F/A filings to recover the as-amended record.
  • Restatement studies should isolate 20-F/A accessions and diff them against the original.
  • Treating originals and amendments as independent annual filings will double-count.

Section-extracted ("Items") datasets vs this Files dataset

A Form 20-F Items dataset would deliver parsed text aligned to the FPI item schedule (Item 3D Risk Factors, Item 4, Item 5, etc.). The Files dataset is different in kind:

  • Granularity. Files delivers the complete EDGAR submission, including primary document, exhibits, and supporting attachments (excluding images). Items delivers parsed, section-aligned text.
  • Format. Files preserves original HTML, TXT, PDF, and JSON metadata. Items exposes structured fields per item.
  • Use. Files supports full-text search across the entire filing, exhibit-level analysis (material contracts, subsidiaries, certifications), and downstream parsing pipelines. Items is the right product when only narrative content of specific sections is needed.

The Files dataset is the source-of-truth raw corpus from which Items extractions can be derived.

XBRL annual report data

XBRL datasets expose structured financial facts (revenues, assets, liabilities, cash flows, segments) tagged under US-GAAP or IFRS taxonomies. They overlap with 20-F because 20-F filers tag financials in Inline XBRL.

  • Content type. XBRL provides structured numeric facts; the Files dataset provides the underlying narrative documents and exhibits. Risk factor prose, MD&A, related-party narratives, and contract text are absent from XBRL.
  • Coverage. XBRL datasets typically normalize across 10-K, 20-F, 10-Q, and 6-K under one schema. The Files dataset is form-scoped and document-centric.
  • Use. XBRL supports time-series and ratio analysis; Files supports textual, qualitative, and exhibit-level work.

The two are complementary halves (numeric and textual) of the same underlying filings.

Boundary summary

Form 20-F Files is the comprehensive, document-level corpus of FPI annual reports and their amendments, delivered as the original EDGAR submission contents. It is not the domestic annual report (10-K), the MJDS Canadian annual report (40-F), the FPI interim/event report (6-K), an FPI registration statement (F-1/F-3/F-4), a section-parsed Items product, or a structured XBRL fact dataset. It is the right corpus when research requires the full text, exhibits, and metadata of FPI annual filings under the 20-F item schedule, including IFRS financial statements and 20-F/A amendment tracking. Adjacent datasets cover offering-stage disclosure, intra-year FPI events, structured numeric data, or domestic-issuer comparisons, but none substitute for the 20-F filing record itself.

Who Uses This Dataset

Form 20-F is the controlling annual disclosure for foreign private issuers listed on U.S. exchanges. Because no domestic 10-K equivalent exists for these companies, a narrow but professionally diverse set of users treat this dataset as the authoritative source.

Equity research analysts on ADR and international desks

Sell-side and buy-side analysts covering non-U.S. issuers build models from Item 5 (operating and financial review), Item 4 (business and segments), and Item 18 financial statements. They track segment redefinitions, FX exposure, dividend policy, and (for older years) IFRS-to-GAAP reconciliations. The 20-F is often the only English-language audited reference, so it anchors initiation reports, quarterly model updates, and ADR-vs-local pair trades.

Buy-side portfolio managers and credit analysts

Equity PMs use Item 3 risk factors, Item 7 related-party and controlling-shareholder disclosures, and Item 10 share-capital and exchange-control disclosures to size positions in foreign-domiciled names. Credit analysts pull debt schedules, covenant terms, and guarantor structures from the financial-statement notes to model recovery on cross-border bonds. The 1995-to-present history and 20-F/A amendments let them walk prior cycles and restructurings on the same issuer.

Securities and capital-markets lawyers

Disclosure counsel benchmark drafting against peer 20-Fs on risk factors, sanctions exposure, controlling-shareholder arrangements, ADR taxation, and exhibits such as charters and indentures. F-1 and F-3 teams use prior 20-Fs as the incorporation-by-reference base. Litigation and enforcement counsel reconstruct what was disclosed and when for Section 10(b), Section 18, and FCPA matters, using 20-F/A history to track restatements and comment-response patterns.

Technical accounting and IFRS research teams

Accounting policy groups at audit firms, standard-setter staffs, and corporate technical-accounting teams treat 20-Fs as a large comparable corpus of IFRS-as-filed-in-the-U.S. They study impairment, lease, financial-instrument, revenue, and pension choices, plus segment reporting. The longitudinal span supports tracking IFRS adoption by 20-F filers and the elimination of the GAAP reconciliation.

Cross-border M&A and corporate development

Deal teams use Item 4 segment and subsidiary descriptions, Item 7 related-party disclosures, Item 10 share-capital tables, and the exhibit set (charters, bylaws, material agreements) as the deepest public diligence on foreign targets, partners, and competitors. Corporate development groups monitor competitor capex, capacity, and capital allocation from Item 5 to source acquisition and partnership candidates.

Depositary-receipt program teams

ADR program managers and cross-listing advisors track program-related disclosures, fee arrangements, dividend tax treatment, and changes in share-capital or voting rights affecting ADR holders. The dataset supports onboarding diligence for new programs and periodic review of existing ones.

Quant, NLP, and RAG engineering teams

Quant research and financial-NLP groups use the corpus to train and back-test cross-border equity signals from risk-factor language, MD&A tone, and forward-looking statements. The TXT, HTML, and PDF mix supports both clean-text and layout-aware extraction. RAG developers ground research-assistant answers about foreign issuers in the primary filing rather than summaries.

Stewardship and ESG research

Governance analysts use Item 6 (directors, senior management, employees) and Item 7 (major shareholders, related parties) to assess board composition, independence, executive pay, and controlling-family or sovereign influence. ESG teams extract environmental, supply-chain, sanctions, and human-rights disclosures, plus referenced sustainability exhibits, to feed voting recommendations and engagement priorities for non-U.S. portfolios.

Compliance and sanctions monitoring

Compliance functions at broker-dealers, asset managers, and banks use 20-Fs for customer due diligence and counterparty review on foreign issuers. They focus on Section 13(r) Iran-related disclosures, operations in sanctioned jurisdictions, beneficial-ownership and government-proceedings disclosures, and ICFR and auditor-identification information relevant to PCAOB inspection risk. 20-F/A history supports restatement and remediation tracking.

Academic researchers in international reporting and governance

Accounting and finance researchers use the dataset for text-based studies of risk-factor evolution, readability, and tone; empirical work on accruals, impairments, and segment reporting; and studies linking disclosure to ADR liquidity, pricing, and short interest. Full-document coverage including exhibits supports hand-collected variables beyond what structured databases capture.

What unifies these users is the absence of a domestic equivalent: for U.S.-listed foreign private issuers, the 20-F is the authoritative annual record, and this dataset is the practical means of working with it at scale across analytical, legal, accounting, governance, compliance, and machine-learning workflows.

Specific Use Cases

Concrete workflows the Form 20-F Files dataset supports across research, legal, accounting, and engineering teams.

  • Building an IFRS-as-filed comparables corpus. Technical-accounting researchers filter metadata.json for filings whose primary document declares the ifrs-full namespace and harvest the inline-XBRL-tagged Item 18 financial statements to build peer panels for impairment, lease, revenue-recognition, and pension policy choices. The longitudinal span (1995 to present) supports before/after studies of the 2007 elimination of the U.S. GAAP reconciliation.

  • Benchmarking risk-factor language across FPI peers. Disclosure counsel and equity analysts pull Item 3.D from the primary 20-F across a peer set defined by SIC code (entities[].sic) and stateOfIncorporation, then diff risk-factor headings (detail-tagged since the 2020 amendments) year-over-year for a single issuer or across a peer group. Output feeds drafting markup for the next 20-F or risk-mapping decks for buy-side coverage.

  • Tracking restatements and amendment scope via 20-F/A. Litigation, enforcement, and forensic-accounting teams isolate formType: "20-F/A" records, pair each with its original 20-F (matched on filer cik plus periodOfReport), and parse the amendment's explanatory note to classify scope (full re-publication vs. single-exhibit add such as EX-97 clawback policy). The output is a restatement timeline keyed to specific Items and exhibits for Section 10(b), Section 18, and PCAOB-inspection narratives.

  • Mining the exhibit set for material contracts and subsidiary maps. M&A and credit teams index exhibits by documentFormatFiles[].type to extract EX-4.x material contracts (financing agreements, employment contracts, indentures), EX-8 / EX-8.1 subsidiary lists, and EX-2.D securities descriptions. Pipelines must strip the SGML <DOCUMENT>/<TYPE>/<TEXT> envelope from legacy exhibit HTML before parsing. Output supports diligence packs, recovery models on cross-border bonds, and entity-graph construction.

  • Detecting newly-required disclosures as rules phase in. Compliance and governance analysts watch for first appearances of Item 16I (HFCAA auditor-jurisdiction disclosure, 2021), Exhibit 19 insider-trading policies (2022), Exhibit 97 clawback policies (2023), and Item 16K cybersecurity (effective for fiscal years ending on or after 15 December 2024). Filtering on exhibit type codes and partition month (YYYY/YYYY-MM.zip) flags filers that lag rule adoption and produces a coverage map for stewardship and PCAOB-risk reviews.

  • Grounding RAG and FPI-focused agents. NLP engineering teams chunk the iXBRL primary document by Item, attach accessionNo, cik, ticker, periodOfReport, and SIC as retrieval metadata, and use the result as the retrieval corpus for analyst-assistant agents covering ADR names. Because linkToFilingDetails carries the iXBRL viewer prefix, citations resolve directly back to the SEC-hosted source for end-user verification.

  • Cohort construction by jurisdiction and fiscal calendar. Quant and academic teams build issuer cohorts by combining entities[].stateOfIncorporation (e.g. X0 Israel, D8 Cayman, E9 Singapore), entities[].fiscalYearEnd (MMDD), and SIC, then align panel observations on periodOfReport rather than filedAt to avoid mixing fiscal years across the monthly partition. The cohort feeds event studies on cross-listing, sanctions exposure, and PCAOB-inspection access.

Dataset Access

The Form 20-F Files Dataset can be accessed in three ways: through the dataset index JSON API, by downloading the full dataset archive, or by retrieving individual monthly container files.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-20f-files.json

This endpoint returns dataset-level metadata (name, description, last updated timestamp, earliest sample date, total record count, total size, form types covered, container format, and file types) along with the full list of container files. Each container entry includes its size, record count, last updated timestamp, and direct download URL. Use this endpoint to monitor which containers were refreshed in the most recent update run and to selectively download only the containers that changed. This endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-6914-8189-afd3117ffa88",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-20f-files.zip",
4 "name": "Form 20-F Files Dataset",
5 "updatedAt": "2026-04-28T02:56:43.957Z",
6 "earliestSampleDate": "1995-06-01",
7 "totalRecords": 198017,
8 "totalSize": 8987538136,
9 "formTypes": ["20-F", "20-F/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "PDF", "HTML"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-20f-files/2026/2026-04.zip",
15 "key": "2026/2026-04.zip",
16 "size": 89241733,
17 "records": 312,
18 "updatedAt": "2026-04-28T02:56:43.957Z"
19 }
20 ]
21 }

Fetch the index with curl:

1 curl -s https://api.sec-api.io/datasets/form-20f-files.json

Download Entire Dataset: https://api.sec-api.io/datasets/form-20f-files.zip?token=YOUR_API_KEY

Downloads the complete archive containing all Form 20-F and Form 20-F/A filings from June 1995 to the present in a single ZIP. This endpoint requires an API key passed via the token query parameter.

1 curl -o form-20f-files.zip \
2 "https://api.sec-api.io/datasets/form-20f-files.zip?token=YOUR_API_KEY"

Download Single Container: https://api.sec-api.io/datasets/form-20f-files/2026/2026-04.zip?token=YOUR_API_KEY

Downloads one monthly container ZIP instead of the full dataset. Containers follow the path pattern /form-20f-files/<year>/<year>-<month>.zip, and the complete list of available containers is enumerated in the dataset index JSON. This endpoint requires an API key.

1 curl -o 2026-04.zip \
2 "https://api.sec-api.io/datasets/form-20f-files/2026/2026-04.zip?token=YOUR_API_KEY"

Frequently Asked Questions

What forms does this dataset cover?

The dataset covers Form 20-F and Form 20-F/A — the annual report and amendment forms used by foreign private issuers under Sections 13 or 15(d) of the Securities Exchange Act of 1934. It does not include Form 10-K (domestic annual reports), Form 40-F (Canadian MJDS annual reports), or Form 6-K (FPI interim and event reports).

What does one record represent?

One record is a single Form 20-F or Form 20-F/A submission as accepted by EDGAR, identified by its accession number. On disk it is a flat folder named with the 18-digit dash-stripped accession number, containing a metadata.json inventory, the primary 20-F or 20-F/A document, and each separately-filed exhibit as its own file.

Who is required to file Form 20-F?

Form 20-F is filed by foreign private issuers — non-U.S.-organized companies with securities registered under Section 12 of the Exchange Act or with a continuing reporting obligation under Section 15(d). FPI status is defined in Rule 405 and Rule 3b-4 and reapplied annually as of the last business day of the second fiscal quarter. MJDS-eligible Canadian issuers may instead use Form 40-F.

When is Form 20-F due?

The annual deadline is four months after fiscal year end, uniform across FPIs regardless of accelerated-filer status. A Form NT 20-F filed before the original due date triggers an automatic 15-calendar-day extension under Rule 12b-25. Form 20-F/A amendments have no fixed deadline and are filed when needed.

What time period does the dataset cover?

The dataset begins in June 1995, the earliest period of EDGAR availability for Form 20-F filings, and runs to the present. It spans the 1999 IOSCO-aligned modernization, the 2007 elimination of the U.S. GAAP reconciliation for IFRS filers, and every subsequent Sarbanes-Oxley Act, Dodd-Frank, HFCAA, clawback, insider-trading, and cybersecurity rulemaking applicable to FPIs.

What file formats appear inside a record?

The file types are TXT, JSON, PDF, and HTML. In modern filings the on-disk content is overwhelmingly HTM (an inline-XBRL XHTML primary document plus SGML-wrapped legacy HTML exhibits) plus the metadata.json manifest; PDF and plain TXT appear primarily in older filings. The dataset is delivered as monthly ZIP containers partitioned by EDGAR filing month under the path pattern YYYY/YYYY-MM.zip.

How does this dataset differ from a Form 20-F Items dataset or an XBRL dataset?

The Files dataset delivers the complete EDGAR submission — primary document, exhibits, and supporting attachments (excluding images) — preserving the original HTML, TXT, PDF, and JSON metadata. A Form 20-F Items dataset would deliver parsed text aligned to the FPI item schedule (Item 3D Risk Factors, Item 4, Item 5, etc.), while an XBRL dataset would deliver structured numeric facts tagged under US-GAAP or IFRS taxonomies. Files is the source-of-truth raw corpus from which Items extractions and XBRL fact tables can be derived.