Form N-30D Files Dataset

The Form N-30D Files dataset is a per-filing archive of every Form N-30D and Form N-30D/A submission accepted by EDGAR, the annual and semi-annual shareholder reports filed by registered management investment companies under Rule 30e-1 of the Investment Company Act of 1940. Each record corresponds to one EDGAR submission — one accession number — and bundles a metadata.json header together with the documents that originally constituted the submission, with image attachments removed. The EDGAR filer is the registrant fund itself: open-end mutual funds, closed-end funds, business development companies, and certain insurance company separate accounts registered as management investment companies. Coverage begins on 1994-01-01, the EDGAR phase-in date for investment company filers, with the dense core of records concentrated between January 1994 and the mid-2003 cutover to Form N-CSR/N-CSRS, and a long tail of late filings and amendments thereafter. The dataset is delivered in ZIP container format with monthly per-container archives and a continuously refreshed JSON index.

Update Frequency
Daily
Updated at
2026-04-14
Earliest Sample Date
1994-01-01
Total Size
2.5 GB
Total Records
78,245
Container Format
ZIP
Content Types
TXT, JSON, HTML, PDF
Form Types
N-30D, N-30D/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

326 files · 2.5 GB
Download All
2026-03.zip156.7 KB47 records
2025-12.zip77.0 KB2 records
2025-11.zip59.9 KB1 records
2025-09.zip115.3 KB39 records
2025-08.zip109.6 KB10 records
2025-06.zip26.5 KB1 records
2025-05.zip56.9 KB1 records
2025-03.zip149.5 KB51 records
2025-02.zip25.2 KB1 records
2024-12.zip29.0 KB1 records
2024-11.zip106.4 KB2 records
2024-09.zip129.9 KB43 records
2024-08.zip93.7 KB2 records
2024-06.zip26.9 KB1 records
2024-05.zip58.0 KB1 records
2024-03.zip164.6 KB48 records
2023-12.zip76.6 KB2 records
2023-11.zip61.2 KB1 records
2023-09.zip216.9 KB67 records
2023-08.zip71.0 KB1 records
2023-06.zip27.3 KB1 records
2023-05.zip58.4 KB1 records
2023-03.zip212.9 KB66 records
2022-12.zip77.7 KB2 records
2022-11.zip61.1 KB1 records
2022-09.zip183.3 KB63 records
2022-08.zip89.9 KB4 records
2022-06.zip27.1 KB1 records
2022-05.zip58.5 KB1 records
2022-04.zip13.6 KB1 records
2022-03.zip305.5 KB99 records
2021-12.zip77.5 KB2 records
2021-11.zip61.1 KB1 records
2021-09.zip205.6 KB64 records
2021-08.zip155.5 KB34 records
2021-06.zip21.6 KB1 records
2021-05.zip47.4 KB1 records
2021-04.zip13.7 KB1 records
2021-03.zip201.8 KB59 records
2021-02.zip80.9 KB31 records
2021-01.zip48.0 KB1 records
2020-12.zip22.9 KB1 records
2020-11.zip49.0 KB1 records
2020-09.zip284.5 KB92 records
2020-08.zip91.8 KB1 records
2020-06.zip69.5 KB2 records
2020-03.zip177.6 KB58 records
2020-02.zip476.7 KB52 records
2020-01.zip45.8 KB1 records
2019-12.zip22.2 KB1 records
2019-11.zip48.7 KB1 records
2019-09.zip263.3 KB89 records
2019-08.zip186.7 KB16 records
2019-06.zip20.2 KB1 records
2019-05.zip63.2 KB2 records
2019-04.zip112.3 KB5 records
2019-03.zip220.0 KB75 records
2019-02.zip162.6 KB33 records
2019-01.zip46.9 KB1 records
2018-12.zip22.6 KB1 records
2018-11.zip239.3 KB3 records
2018-09.zip612.4 KB75 records
2018-08.zip194.0 KB35 records
2018-06.zip20.1 KB1 records
2018-05.zip306.3 KB5 records
2018-04.zip815.0 KB6 records
2018-03.zip3.8 MB107 records
2018-02.zip1.4 MB41 records
2018-01.zip47.0 KB1 records
2017-12.zip21.8 KB1 records
2017-11.zip239.7 KB3 records
2017-09.zip283.2 KB106 records
2017-08.zip552.0 KB5 records
2017-06.zip20.7 KB1 records
2017-05.zip231.2 KB3 records
2017-04.zip73.9 KB4 records
2017-03.zip5.1 MB147 records
2017-02.zip87.8 KB2 records
2017-01.zip46.4 KB1 records
2016-12.zip211.4 KB3 records
2016-11.zip45.9 KB1 records
2016-09.zip220.2 KB75 records
2016-08.zip468.9 KB36 records
2016-06.zip19.7 KB1 records
2016-05.zip236.9 KB3 records
2016-04.zip475.2 KB8 records
2016-03.zip4.8 MB110 records
2016-02.zip773.8 KB40 records
2016-01.zip46.3 KB1 records
2015-12.zip20.2 KB1 records
2015-11.zip274.0 KB3 records
2015-09.zip452.1 KB150 records
2015-08.zip359.1 KB5 records
2015-06.zip60.3 KB2 records
2015-05.zip321.5 KB6 records
2015-04.zip741.1 KB8 records
2015-03.zip5.0 MB163 records
2015-02.zip75.7 KB31 records
2015-01.zip46.2 KB1 records
2014-12.zip63.2 KB2 records
2014-11.zip238.3 KB2 records
2014-09.zip435.2 KB141 records
2014-08.zip382.9 KB8 records
2014-06.zip17.6 KB1 records
2014-05.zip232.2 KB3 records
2014-04.zip89.0 KB4 records
2014-03.zip4.7 MB160 records
2014-02.zip129.0 KB44 records
2014-01.zip45.1 KB1 records
2013-12.zip18.7 KB1 records
2013-11.zip274.8 KB3 records
2013-09.zip450.1 KB152 records
2013-08.zip364.4 KB7 records
2013-06.zip17.7 KB1 records
2013-05.zip378.6 KB8 records
2013-03.zip5.5 MB159 records
2013-02.zip11.8 KB1 records
2012-12.zip63.7 KB2 records
2012-11.zip279.1 KB3 records
2012-09.zip454.9 KB158 records
2012-08.zip347.0 KB12 records
2012-06.zip17.8 KB1 records
2012-05.zip263.0 KB3 records
2012-04.zip109.1 KB5 records
2012-03.zip6.6 MB162 records
2012-02.zip31.5 KB2 records
2012-01.zip41.0 KB1 records
2011-12.zip20.2 KB1 records
2011-11.zip303.0 KB4 records
2011-09.zip450.6 KB123 records
2011-08.zip204.3 KB20 records
2011-06.zip19.2 KB1 records
2011-05.zip259.3 KB3 records
2011-04.zip95.0 KB5 records
2011-03.zip5.3 MB193 records
2011-02.zip32.5 KB14 records
2010-12.zip56.6 KB2 records
2010-11.zip175.3 KB4 records
2010-09.zip450.7 KB150 records
2010-08.zip170.5 KB6 records
2010-06.zip60.5 KB2 records
2010-05.zip85.3 KB2 records
2010-04.zip93.0 KB5 records
2010-03.zip4.8 MB191 records
2010-02.zip101.9 KB3 records
2009-12.zip57.7 KB2 records
2009-11.zip141.3 KB4 records
2009-10.zip48.9 KB1 records
2009-09.zip601.7 KB165 records
2009-06.zip13.1 KB1 records
2009-05.zip80.1 KB2 records
2009-04.zip190.7 KB12 records
2009-03.zip4.6 MB213 records
2009-01.zip36.4 KB1 records
2008-12.zip16.1 KB2 records
2008-11.zip137.8 KB4 records
2008-10.zip49.7 KB3 records
2008-09.zip1.9 MB193 records
2008-08.zip140.4 KB4 records
2008-07.zip11.8 KB1 records
2008-05.zip78.4 KB2 records
2008-04.zip152.4 KB6 records
2008-03.zip6.5 MB253 records
2008-02.zip34.4 KB1 records
2008-01.zip12.8 KB1 records
2007-12.zip25.3 KB1 records
2007-11.zip79.6 KB2 records
2007-10.zip44.0 KB1 records
2007-09.zip1.3 MB201 records
2007-08.zip1.8 MB11 records
2007-06.zip11.3 KB1 records
2007-05.zip92.5 KB7 records
2007-04.zip27.6 KB1 records
2007-03.zip5.6 MB251 records
2007-02.zip19.1 KB1 records
2007-01.zip55.2 KB2 records
2006-11.zip100.9 KB3 records
2006-09.zip1.5 MB172 records
2006-08.zip2.5 MB26 records
2006-06.zip10.7 KB1 records
2006-05.zip73.8 KB2 records
2006-04.zip14.8 KB5 records
2006-03.zip6.0 MB237 records
2006-02.zip39.3 KB2 records
2006-01.zip12.3 KB1 records
2005-12.zip99.9 KB3 records
2005-09.zip1.2 MB125 records
2005-08.zip1.3 MB21 records
2005-06.zip75.3 KB2 records
2005-05.zip4.4 KB2 records
2005-04.zip15.2 KB5 records
2005-03.zip3.5 MB161 records
2005-02.zip483.5 KB24 records
2005-01.zip56.4 KB3 records
2004-12.zip976.8 KB5 records
2004-11.zip82.9 KB3 records
2004-10.zip33.6 KB1 records
2004-09.zip3.0 MB180 records
2004-08.zip772.4 KB30 records
2004-07.zip17.8 KB3 records
2004-06.zip832.7 KB13 records
2004-05.zip80.9 KB8 records
2004-04.zip98.5 KB8 records
2004-03.zip3.5 MB224 records
2004-02.zip2.1 MB64 records
2004-01.zip59.9 KB8 records
2003-12.zip888.9 KB12 records
2003-11.zip360.1 KB13 records
2003-10.zip361.8 KB21 records
2003-09.zip2.2 MB166 records
2003-08.zip2.3 MB64 records
2003-07.zip955.0 KB23 records
2003-06.zip12.9 MB262 records
2003-05.zip17.4 MB432 records
2003-04.zip14.3 MB357 records
2003-03.zip54.0 MB1,009 records
2003-02.zip44.7 MB928 records
2003-01.zip29.9 MB607 records
2002-12.zip33.8 MB700 records
2002-11.zip29.0 MB512 records
2002-10.zip15.9 MB388 records
2002-09.zip21.8 MB495 records
2002-08.zip67.5 MB1,426 records
2002-07.zip28.3 MB566 records
2002-06.zip29.3 MB740 records
2002-05.zip27.6 MB618 records
2002-04.zip14.3 MB393 records
2002-03.zip58.8 MB1,138 records
2002-02.zip35.4 MB786 records
2002-01.zip28.2 MB692 records
2001-12.zip28.6 MB690 records
2001-11.zip23.4 MB600 records
2001-10.zip13.7 MB371 records
2001-09.zip26.6 MB742 records
2001-08.zip51.5 MB1,192 records
2001-07.zip22.8 MB572 records
2001-06.zip28.7 MB737 records
2001-05.zip21.2 MB586 records
2001-04.zip13.4 MB381 records
2001-03.zip61.1 MB1,199 records
2001-02.zip34.2 MB917 records
2001-01.zip26.6 MB693 records
2000-12.zip26.8 MB708 records
2000-11.zip22.4 MB571 records
2000-10.zip12.1 MB334 records
2000-09.zip30.9 MB813 records
2000-08.zip40.8 MB1,154 records
2000-07.zip20.0 MB586 records
2000-06.zip26.1 MB722 records
2000-05.zip21.3 MB605 records
2000-04.zip9.9 MB324 records
2000-03.zip63.4 MB1,100 records
2000-02.zip34.6 MB1,097 records
2000-01.zip18.6 MB581 records
1999-12.zip30.3 MB834 records
1999-11.zip17.5 MB609 records
1999-10.zip11.0 MB334 records
1999-09.zip28.5 MB810 records
1999-08.zip40.9 MB1,266 records
1999-07.zip17.7 MB558 records
1999-06.zip25.2 MB890 records
1999-05.zip16.7 MB561 records
1999-04.zip12.3 MB380 records
1999-03.zip44.7 MB1,332 records
1999-02.zip26.1 MB1,010 records
1999-01.zip17.4 MB603 records
1998-12.zip24.8 MB899 records
1998-11.zip17.9 MB645 records
1998-10.zip10.4 MB367 records
1998-09.zip29.6 MB987 records
1998-08.zip34.3 MB1,261 records
1998-07.zip17.2 MB645 records
1998-06.zip25.3 MB829 records
1998-05.zip15.1 MB569 records
1998-04.zip11.7 MB376 records
1998-03.zip38.2 MB1,324 records
1998-02.zip27.2 MB1,051 records
1998-01.zip19.1 MB665 records
1997-12.zip25.9 MB918 records
1997-11.zip12.7 MB461 records
1997-10.zip11.1 MB359 records
1997-09.zip27.1 MB872 records
1997-08.zip29.5 MB1,134 records
1997-07.zip17.0 MB617 records
1997-06.zip21.3 MB771 records
1997-05.zip14.5 MB518 records
1997-04.zip10.1 MB339 records
1997-03.zip33.1 MB1,219 records
1997-02.zip23.6 MB868 records
1997-01.zip16.0 MB570 records
1996-12.zip18.0 MB666 records
1996-11.zip13.6 MB500 records
1996-10.zip9.2 MB311 records
1996-09.zip23.6 MB903 records
1996-08.zip29.1 MB1,147 records
1996-07.zip14.1 MB611 records
1996-06.zip15.2 MB594 records
1996-05.zip14.9 MB554 records
1996-04.zip7.9 MB300 records
1996-03.zip30.0 MB1,010 records
1996-02.zip23.6 MB898 records
1996-01.zip16.5 MB540 records
1995-12.zip11.3 MB511 records
1995-11.zip12.8 MB476 records
1995-10.zip9.0 MB310 records
1995-09.zip19.4 MB702 records
1995-08.zip18.1 MB772 records
1995-07.zip7.7 MB357 records
1995-06.zip12.7 MB558 records
1995-05.zip8.3 MB354 records
1995-04.zip5.0 MB187 records
1995-03.zip9.8 MB380 records
1995-02.zip5.8 MB175 records
1995-01.zip4.7 MB168 records
1994-12.zip3.2 MB138 records
1994-11.zip3.2 MB112 records
1994-10.zip2.3 MB87 records
1994-09.zip3.0 MB119 records
1994-08.zip2.7 MB129 records
1994-07.zip2.1 MB88 records
1994-06.zip3.6 MB171 records
1994-05.zip2.3 MB100 records
1994-04.zip1.4 MB71 records
1994-03.zip2.6 MB133 records
1994-02.zip2.4 MB92 records
1994-01.zip3.3 MB94 records

What This Dataset Contains

The Form N-30D Files dataset preserves three decades of as-filed fund shareholder reports submitted electronically to EDGAR. Form N-30D was the historical SEC form by which registered management investment company entities — primarily open-end mutual funds, closed-end funds, and unit investment trusts operating as management companies — delivered their periodic shareholder reports to the SEC. Rule 30e-1 (originally Rule 30d-1) requires every such company to transmit a report to shareholders at least semi-annually within 60 days of the close of the reporting period and to file a copy with the Commission no later than 10 days after that transmission. The form is a wrapper around the substantive shareholder report — itself a hybrid narrative–financial document combining the portfolio manager's discussion, audited or unaudited financial statements prepared under Regulation S-X Article 6, a complete schedule of portfolio investments, expense and per-share financial highlights, and governance disclosures concerning the fund's directors, trustees, and officers. Form N-30D/A is the amendment variant, used to correct or supplement a previously filed shareholder report; it carries the same internal structure and references the original accession being amended.

N-30D was the dominant fund shareholder-report vehicle from the inception of EDGAR filings in January 1994 until 2003, when the SEC's certified-shareholder-report rules implemented under the Sarbanes-Oxley Act (and accompanying amendments to Regulation S-X) introduced Form N-CSR and N-CSRS as the required forms for transmittal of shareholder reports. The vast majority of records in this dataset therefore cluster in the 1994–2003 window, with a long, thinning tail of late filings, stragglers, and amendments after 2003.

The dataset is distributed as ZIP-per-month containers. The file types found inside the containers are TXT, JSON, HTML, and PDF: modern filings consist of an HTML primary document plus the JSON manifest; older filings from the mid-1990s are predominantly plain-ASCII TXT submissions; PDF appears sporadically as a primary or supplementary report document.

Content Structure of a Single Record

A record has two structural layers:

  1. A filing-level metadata header — metadata.json — that captures the EDGAR submission envelope, registrant identification, period and date fields, document manifest, and reference URLs back to sec.gov.
  2. The submission documents themselves — at minimum the primary N-30D shareholder report, plus any additional non-image attachments (cover letters, certifications, supplemental schedules). Image files (GRAPHIC entries such as JPG, GIF, PNG) referenced by the report and the complete-submission .txt bundle are intentionally omitted from on-disk extraction; they remain accessible only as URL references inside metadata.json.

On disk, the record is a per-filing folder named after the 18-digit normalized EDGAR accession number (the dashed accession 0001206774-25-000089 becomes the folder 000120677425000089). Each ZIP container holds a single top-level folder matching the period (e.g., 2025-02/) with one accession-keyed subfolder per filing. Monthly volume is highly uneven: months aligned with calendar quarter-ends and common fiscal-year cutovers (December, March, June, September) carry many filings as funds cluster their semi-annual and annual transmittals, while off-cycle months may carry only a handful of records.

metadata.json

Always present, exactly one per record. The file is a single JSON object combining EDGAR header fields with dataset-specific extensions. The salient fields are:

  • formType"N-30D" for original filings, "N-30D/A" for amendments.
  • accessionNo — canonical dashed accession number, e.g., "0001206774-25-000089".
  • filedAt — ISO-8601 timestamp with timezone reflecting when EDGAR accepted the submission.
  • effectivenessDate — date the filing became effective on EDGAR.
  • periodOfReport — the reporting period the shareholder report covers, expressed as the period-end date (e.g., "2024-12-31" for an annual report through year-end 2024).
  • description — human-readable form description from the EDGAR header, typically "Form N-30D - Annual and semi-annual reports mailed to shareholders [Rule 30d-1]".
  • linkToFilingDetails, linkToTxt, linkToHtml, linkToXbrl — direct sec.gov URLs to the primary document, the complete-submission text bundle, the EDGAR filing-index page, and any XBRL data; the XBRL link is empty for N-30D since the form is not part of the XBRL regime.
  • id — opaque dataset-internal record identifier.
  • documentFormatFiles — array enumerating every document originally submitted to EDGAR. Each entry carries sequence, size, documentUrl, description, and type (e.g., N-30D, GRAPHIC, EX-99.CERT). The trailing entry uses a single space " " for both sequence and type and a description of "Complete submission text file" — a sentinel pointing at the submission-level .txt wrapper, not a missing value. GRAPHIC entries are listed here even though their bytes are not stored on disk.
  • entities — array of registrant entities. Each entry carries cik, companyName (with the EDGAR role suffix such as "(Filer)" appended), fiscalYearEnd in MMDD form, stateOfIncorporation, act (Investment Company Act = "40"), fileNo, irsNo, type (form type as filed by that entity), and filmNo. Routine fund filings produce a single-element array; joint filings produce multiple entries.
  • seriesAndClassesContractsInformation — array of series/class identifiers under the SEC's series and class taxonomy. Empty for trust-style filers and pre-series-regime filings; populated for fund families that file under the series/class scheme.
  • dataFiles — array of structured data attachments such as XBRL/XML; empty for N-30D.

Primary submission document

The substantive content of the record is the N-30D shareholder report itself. On modern filings this is an HTML document (filenames typically follow the filer's own pattern, e.g., *-n30d.htm) wrapped in EDGAR's SGML document envelope. The wrapper takes the form:

1 <DOCUMENT>
2 <TYPE>N-30D
3 <SEQUENCE>1
4 <FILENAME>...-n30d.htm
5 <DESCRIPTION>N-30D
6 <TEXT>
7 <HTML>
8 <HEAD>...</HEAD>
9 <BODY>... full report content ...</BODY>
10 </HTML>
11 </TEXT>
12 </DOCUMENT>

The <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> tags repeat the manifest entry for the document inside the file body itself; the rendered report sits inside the <TEXT> block. Pre-HTML-era filings replace the <HTML>...</HTML> payload with plain ASCII text (or, less often, an embedded PDF reference) but retain the same outer SGML wrapper.

Body of the shareholder report

Inside the HTML or text payload, the N-30D report follows a conventional mutual-fund shareholder-report ordering. Exact section names and depth vary by issuer and time period, but the canonical structure is:

  • Cover page and front matter — fund name, report type (annual or semi-annual), period covered, fiscal-year-end designation, and boilerplate disclosure cautions. Closed-end funds typically include ticker and exchange information; open-end funds include share-class identification.
  • Letter to shareholders / Manager's Report — narrative discussion by the portfolio manager or fund officer, covering period performance, market environment, attribution of returns, positioning changes, and forward outlook. This block frequently includes performance comparison tables against benchmarks and one or more performance graphs (the graphs are typically GIF/JPG image attachments referenced from the HTML and are excluded from on-disk extraction).
  • Report of Independent Registered Public Accounting Firm — auditor's opinion on the financial statements; present in annual reports, generally absent from semi-annual reports (which carry unaudited financials).
  • Statement of Assets and Liabilities — point-in-time balance-sheet equivalent showing total investments at value and at cost, receivables, payables, and net assets, broken out by component (paid-in capital, undistributed net investment income, accumulated realized gains/losses, unrealized appreciation/depreciation).
  • Statement of Operations — period income and expense statement: dividend and interest income, advisory and administration fees, distribution (12b-1 fees), custody fees, transfer-agent fees, professional fees, expense waivers and reimbursements, and net realized and unrealized gains/losses on investments.
  • Statements of Changes in Net Assets — comparative two-period reconciliation of net assets, including operations subtotals, distributions to shareholders by class, and capital share transactions.
  • Financial Highlights — per-share operating performance over five fiscal periods: net asset value at period start, net investment income, realized and unrealized gains/losses, distributions, ending NAV, total return, and ratios (expense ratio, net investment income ratio, portfolio turnover).
  • Notes to Financial Statements — accounting policies (valuation, securities transactions, federal tax status under Subchapter M, distributions policy), related-party transactions with the adviser and affiliates, capital-share transactions detail, purchases and sales of investment securities, and tax-basis information.
  • Schedule of Investments / Portfolio of Investments — line-item enumeration of every security held at period end, typically grouped by sector or asset class, listing issuer name, principal amount or share quantity, cost (sometimes), and fair value, with subtotals and percentage-of-net-assets weightings. Fixed-income funds add coupon and maturity columns; funds holding derivatives include separate schedules for futures, swaps, written options, and forward contracts.
  • Expense Example — hypothetical $1,000-investment expense disclosure required for periods after the SEC's 2004 expense-disclosure rules; appears in later N-30D filings but is not standard in the earliest 1994-era reports.
  • Information about Directors and Officers / Trustees and Officers — name, age, position, term, principal occupations during the past five years, number of portfolios overseen, and other directorships, separated into interested and independent directors. Required disclosure scope expanded materially after 2001.
  • Tax information / Other disclosures — qualified dividend income percentages, dividends-received-deduction percentages, foreign-tax-credit pass-through, and similar tax-character notices to shareholders.
  • Back matter — fund service-provider directory (adviser, custodian, transfer agent, distributor, auditor, counsel), proxy-voting policy availability statements, and shareholder account information.

Layout in HTML-era filings is achieved with inline-styled <TABLE> and <DIV> elements; financial statements and portfolio schedules are dense, multi-column tables that often span many printed pages. In ASCII-era filings, the same content is rendered with monospaced column alignment and ruled lines drawn from hyphens and equal signs.

Other documents in the submission

Beyond the primary *-n30d.htm (or .txt) report, individual filings may include supplemental documents enumerated in documentFormatFiles. These can include president/treasurer certifications (in late N-30D filings made under emerging Sarbanes-Oxley guidance prior to the formal switch to N-CSR), cover letters, and occasional exhibits. Image attachments — performance-graph GIFs, fund-logo JPGs, signature PNGs — appear in the manifest but are excluded from on-disk extraction; references to them inside the HTML body therefore appear as broken <IMG SRC="..."> links when the HTML is rendered standalone, which is expected behavior.

Included content

For each accession number, the dataset bundles:

  • Exactly one metadata.json capturing the EDGAR header, registrant identification, period and date metadata, document manifest, and back-references to sec.gov URLs.
  • All non-image documents originally filed in the EDGAR submission, preserved with their original filenames, original SGML document wrappers, and original byte content.

Excluded or separate content

The following are deliberately not stored on disk inside the per-filing folder:

  • Image files (GRAPHIC type — JPG, GIF, PNG) referenced by the report. They appear in documentFormatFiles with their documentUrl and remain retrievable from sec.gov, but the binary bytes are not extracted into the dataset. Performance graphs and fund logos are therefore absent from the on-disk content.
  • The complete-submission .txt bundle — the concatenated SGML envelope EDGAR generates that contains every document of the submission inline. It appears as the trailing sentinel entry in documentFormatFiles (with sequence and type set to a single space and description "Complete submission text file") and is reachable via linkToTxt, but it is not stored as its own file because the individual constituent documents are already extracted.
  • Attached structured-data files — any dataFiles entries are listed in metadata but not written to disk; for N-30D this array is routinely empty.

Changes in required content and structure over time

The internal structure of the N-30D shareholder report evolved meaningfully across its EDGAR-era life (1994–2003 dominant phase, with a long tail thereafter):

  • 1994–1998 — Reports follow Regulation S-X Article 6 financial-statement requirements but with relatively light narrative and governance disclosure. Director/trustee biographical disclosure is concise; no expense-example block; performance discussions are brief.
  • 2001 governance expansion — SEC rule changes on fund governance (independent-director composition, audit-committee disclosures) expanded the back-matter director and officer information into structured tables identifying interested versus independent directors, with five-year occupational history and number of portfolios overseen.
  • 2001–2002 portfolio-disclosure rules — Portfolio-schedule presentation tightens, with more consistent grouping by sector or asset class and explicit percentage-of-net-assets columns.
  • Post-2003 tail — After the Sarbanes-Oxley-driven adoption of Form N-CSR (and the semi-annual N-CSRS) for certified shareholder reports, new-style shareholder reports moved to those forms. N-30D filings after 2003 are therefore predominantly amendments (N-30D/A), late filings, or filings by issuers and fund types still permissibly using N-30D for narrower transmittal purposes. The post-2003 records are sparse and qualitatively different in volume from the dense 1994–2003 core.

Changes in data format over time

The form's source-file presentation tracks the broader EDGAR format evolution:

  • 1994 to roughly 1999 — Filings are submitted as plain-ASCII text. The shareholder report is rendered with monospaced fonts, ruled separators, and column-aligned tables. Financial statements and portfolio schedules survive purely through whitespace alignment, which complicates downstream tabular extraction. The SGML document wrapper is present, but the <TEXT> payload is plain text rather than HTML.
  • Late 1990s to early 2000s — HTML adoption begins. Filers transition to embedding <HTML>...</HTML> content inside the <TEXT> block, with inline-styled tables for financial statements and richer typography. Some filings carry both an HTML primary document and ASCII or PDF supplements.
  • Post-2000 stragglers — Late filings and amendments after 2003 are routinely HTML, occasionally PDF. The SGML document envelope remains stable across all eras, so the <DOCUMENT> / <TYPE> / <SEQUENCE> / <FILENAME> / <DESCRIPTION> / <TEXT> framing is the most reliable structural anchor for parsers regardless of payload format.

Interpretation notes

  • Amendments: N-30D/A records share the on-disk layout and metadata schema of original N-30D records. The formType field is the discriminator. Amendments may replace, supplement, or correct specific sections of the underlying report; reconciling an amendment to the original requires matching by registrant CIK and periodOfReport.
  • Manifest vs. on-disk content: documentFormatFiles is the authoritative enumeration of what was originally filed. On-disk content is a subset that excludes images and the complete-submission wrapper. A document listed in the manifest is not guaranteed to be present as a file in the folder, but its documentUrl always resolves on sec.gov.
  • Sentinel entry in documentFormatFiles: the trailing entry with single-space sequence and type denoting the complete-submission text file is a structural sentinel, not malformed data. Parsers should detect and skip it explicitly rather than treat it as a missing-value error.
  • SGML envelope inside HTML files: the modern primary document is an HTML file that begins with non-HTML SGML tags (<DOCUMENT>, <TYPE>, etc.) before the <HTML> opening tag. Strict HTML parsers may misinterpret the leading SGML lines; robust extraction either strips the SGML preamble first or operates from the inner <TEXT> block onward.
  • Broken image references: HTML reports contain <IMG> tags pointing to sibling JPG/GIF files that are absent from the on-disk folder. This is intentional dataset behavior; rendering pipelines that need the images must fetch them via the documentUrl values in documentFormatFiles.
  • Series and classes: seriesAndClassesContractsInformation is empty for filers outside the SEC series/class regime (single-trust funds, pre-2002 filers) and populated for multi-series fund-family filings. Linking a record to a specific series or share class therefore requires consulting this array first and falling back to entity-level CIK matching when it is empty.
  • Period vs. filing date: periodOfReport is the substantive temporal anchor for analytical use (it identifies the period the financials cover), while filedAt and effectivenessDate reflect when the report reached EDGAR. Annual reports typically carry periodOfReport equal to the fund's fiscal-year end and filedAt within roughly 70 days afterward (the 60-day shareholder-transmittal window plus the 10-day SEC filing window).
  • Format heterogeneity: any pipeline consuming the dataset across the full date range must handle ASCII, HTML, and occasional PDF primary documents. Section-level extraction calibrated to HTML will fail silently on the ASCII-era core of the corpus; conversely, whitespace-aware ASCII parsers will fragment modern HTML tables. Format-aware dispatch — keyed on the primary document's extension and on content sniffing of the <TEXT> payload — is the typical handling pattern.
  • Joint filings: the entities array is single-element for routine fund filings but can carry multiple entries when several registrants file the report jointly (common for fund complexes that share a registrant umbrella). All entities listed share the same accession and the same on-disk record; downstream linking should iterate the array rather than assume a single CIK.

Who Files or Publishes This Dataset, and When

Who files

Each record is a Form N-30D or N-30D/A submission filed on EDGAR by a registered management investment company under the Investment Company Act of 1940. The EDGAR filer is the registrant fund itself, identified by its registrant CIK. A single submission can carry the shareholder report for one series, multiple series sharing a fiscal period, or, in early EDGAR years, an entire family of series under one trust.

In-scope filer classes:

The shareholder report inside each filing is prepared by the fund's adviser and officers, audited by the independent registered public accountant for annual reports, and approved by the fund's board. None of those parties is the EDGAR filer; only the registrant fund is.

When the record is triggered

The obligation is periodic and tied to the fund's own fiscal calendar, not the calendar year. Section 30(e) of the Investment Company Act of 1940 and its implementing rule require every registered management investment company to transmit reports to its shareholders at least semi-annually. Two reporting periods trigger an N-30D:

  1. Annual report at fiscal year-end, with audited financial statements certified by the fund's independent registered public accountant.
  2. Semi-annual report six months into the fiscal year, with unaudited financial statements.

Because fund fiscal year-ends are staggered (commonly December 31, October 31, June 30, or March 31), N-30D submissions appear on EDGAR throughout the year, with no single calendar-year peak.

Deadlines:

  • Transmission to shareholders: within 60 days after the close of the reporting period.
  • Filing with the SEC: within 10 days after transmission to shareholders.

The EDGAR filing date therefore reflects the 10-day window after actual transmission, not a fixed offset from period-end. Late transmittals still fix the SEC filing window at 10 days from actual transmission.

Regulatory framework and the N-CSR transition

The shareholder-report obligation originates in Section 30 of the Investment Company Act of 1940. The implementing rule was originally Rule 30d-1 under the Investment Company Act, with the modern semi-annual-transmission and 60-day/10-day filing structure settled by the Commission's 1972 release IC-7113. Form N-30D was the designated form for these transmittals.

In 2002, the SEC renumbered the rule from Rule 30d-1 to Rule 30e-1, carrying over the substantive obligations.

The Sarbanes-Oxley Act of 2002 then changed the form. SEC release 33-8188 / IC-25914, "Certification of Disclosure in Certain Exchange Act Reports" (effective 2003), created Form N-CSR as the new certified shareholder-report vehicle for management investment companies. For reporting periods ending on or after the 2003 compliance date, registered management investment companies file:

instead of Form N-30D. From that point, N-30D was no longer the primary shareholder-report form.

Consequences for this dataset:

  • Records concentrate between January 1994 (when EDGAR phase-in made N-30D electronically filable) and mid-2003 to 2004 (the N-CSR cutover).
  • A long tail of N-30D and N-30D/A submissions continues after 2003 for late filings, supplemental transmittals, and amendments to pre-cutover reports.
  • Post-2003 fund shareholder reports generally live on N-CSR / N-CSRS, not N-30D.

Amendments (N-30D/A)

A Form N-30D/A is an amendment to a previously filed N-30D, filed by the same registrant fund and tied to the same fiscal reporting period. Typical reasons:

  • correcting typographical or numerical errors;
  • replacing exhibits or financial statements;
  • adding a missing auditor's consent or certification page;
  • restating part of the financial statements or schedule of investments;
  • re-filing in a corrected EDGAR submission format.

What does not belong here

Adjacent registered investment company populations use different reporting paths and are out of scope:

  • Unit investment trusts (UITs) that are not management companies historically used Form N-30B-2 and related N-30 series forms (such as N-30D-1), not N-30D.
  • Foreign issuers and non-investment-company SEC registrants never used N-30D; their annual reports flow through 10-K, 20-F, 40-F, or similar.
  • Money market fund portfolio holdings under Rule 2a-7 are reported on Form N-MFP, a separate regime unrelated to N-30D.
  • Post-2003 management investment company shareholder reports belong on Form N-CSR / N-CSRS rather than N-30D.

EDGAR availability

The Investment Company Act shareholder-report obligation dates to 1940, and the 60-day/10-day cadence was settled by 1972 under IC-7113, but pre-EDGAR N-30D reports were filed on paper and microfiche and are not in this dataset. Electronic N-30D submissions begin in January 1994 with the EDGAR phase-in for investment company filers. The dataset's earliest sample date of 1994-01-01 reflects that EDGAR boundary, not the legal origin of the obligation.

How This Dataset Differs From Similar Datasets or Filings

Form N-30D sits inside a tightly clustered family of investment-company disclosure regimes: direct successors that replaced it after 2003, siblings that carry pieces of fund disclosure into more structured channels, and adjacent forms that look similar but cover a different filer population. The comparisons below focus on the closest neighbors a researcher would confuse with, substitute for, or pair against an N-30D corpus.

Form N-CSR and N-CSRS — direct successors (2003-present)

N-CSR (annual) and N-CSRS (semi-annual) are the post-Sarbanes-Oxley replacements, adopted in SEC Release Nos. 33-8188 / IC-25914 and effective for periods ending on or after July 9, 2003. They share the Rule 30e-1 transmittal framework and the same core body: shareholder letter, financial statements, schedule of investments, financial highlights, and expense disclosure.

The differences are additive on the N-CSR side: Sections 302 and 906 CEO/CFO certifications, internal-control disclosure, audit-committee financial-expert identification, principal-accountant fee disclosure, code-of-ethics representation, and (later) the tailored shareholder report regime and inline XBRL-tagged financial exhibits.

Coverage is sequential, not parallel. N-CSR/N-CSRS is authoritative from mid-2003 forward; N-30D is the only source from 1994 through mid-2003. A long-horizon shareholder-report corpus must splice the two, accepting that the earlier segment lacks officer certifications, audit-committee disclosure, and structured tagging.

Form N-Q — quarterly portfolio holdings (2004-2019)

N-Q required management investment companies to file complete portfolio holdings as of the first and third fiscal quarter-ends, the two quarters not covered by an N-CSR/N-CSRS report. It was rescinded effective May 2019 once N-PORT public reporting matured.

Form N-Q overlaps with N-30D only on the schedule of investments. It carries no shareholder letter, no narrative, no financial statements, and no expense disclosure. As a research complement, it fills intra-year holdings gaps in the 2004-2016 window; it does not substitute for N-30D's narrative content.

Form N-PORT — monthly structured portfolio holdings (2016-present)

Form N-PORT, adopted in SEC Release IC-32314 under Rule 30b1-9, requires registered management funds (excluding money market and small UITs) to report monthly portfolio holdings, risk metrics, derivatives detail, securities-lending activity, and liquidity classifications in structured XML, with public release on a calendar quarter delay.

N-PORT is the modern structured analogue of the schedule-of-investments portion of N-30D — dramatically broader on per-position risk and liquidity, dramatically narrower on everything else (no narrative, no shareholder letter, no financial statements). For quantitative holdings analysis it is far superior to N-30D; for qualitative fund disclosure it contains nothing comparable.

Form N-CEN — annual fund census (2018-present)

Form N-CEN, adopted alongside N-PORT, is a structured (XML) annual census of fund identifiers, fund type, service providers, securities-lending agents, fees, and exemptive orders relied upon. It replaced N-SAR for fiscal periods ending on or after June 1, 2018.

N-CEN does not duplicate N-30D. It carries no performance discussion, no security-level holdings, and no financial statements — it is operational metadata. The two are complementary: N-30D tells you how a fund described results to investors; N-CEN describes its structural and operational characteristics.

Form N-SAR — semi-annual operations report (1985-2018)

Form N-SAR, the N-CEN predecessor, was a fixed-field structured filing covering transfer agents, custodians, sales loads, expenses paid, and portfolio turnover. For the 1994-2003 N-30D era, N-SAR is the natural pairing for operational and fee data, occupying the slot N-CEN now fills. It is not a substitute for N-30D content — no narrative, no shareholder letter, no audited statements.

Form N-PX — annual proxy-voting record (2003-present)

Form N-PX, filed annually under Rule 30b1-4, reports each fund's proxy votes across portfolio companies. There is no content overlap with N-30D; the only link is shared filer population. Voting research cannot use N-30D, and performance, expense, or holdings research cannot use N-PX.

Form N-30B-2 and N-30D-1 — adjacent legacy report channels

N-30B-2 collects periodic reports from registered investment companies — commonly UITs and other non-management vehicles — whose shareholder-style reports flow under Investment Company Act provisions outside the Rule 30e-1 management-company channel that drives N-30D. N-30D-1 was a related historical filing pathway.

The boundary is filer population, not content. N-30D (and successor N-CSR/N-CSRS) covers registered management investment companies — mutual funds, closed-end funds, ETFs organized as management companies. Comprehensive RIC shareholder-report coverage requires the N-30B-2 family alongside N-30D; using N-30D alone systematically omits the UIT segment.

Form 10-K and 10-Q — operating-company periodic reports

Form 10-K and Form 10-Q are the Exchange Act Sections 13(a)/15(d) periodic-reporting framework for operating companies and do not apply to registered management investment companies. The structures look parallel — periodic financials, MD&A, operational results — but the filer populations are disjoint. The comparison matters only because new users sometimes look for "fund 10-Ks"; the correct equivalents are N-30D for 1994-2003 and N-CSR for the modern era.

Boundary clarification

N-30D is the unstructured, uncertified, narrative-plus-financials shareholder report of the pre-Sarbanes-Oxley fund-disclosure era — a single form bundling the letter to shareholders, MD&A-style discussion, financial highlights, audited financial statements, and schedule of investments in free-form HTML/TXT/PDF. Its modern function has been split: the certified narrative report now lives in N-CSR/N-CSRS, while structured carve-outs of holdings, census, and voting have moved to N-PORT, N-CEN, and N-PX respectively.

That makes N-30D the only source for 1994 through mid-2003 fund shareholder-report content — N-CSR cannot reach back, N-PORT and N-CEN did not yet exist, and N-SAR carries only operational data. From mid-2003 onward, the same content is available with greater rigor and structure across the successor forms, but no single modern form replaces N-30D on its own. The practical rule: treat N-30D as the canonical fund-narrative-and-financials source for 1994-2003, pair it with N-SAR for operational data of that era, and switch to N-CSR/N-CSRS plus N-PORT, N-CEN, and N-PX for the same content from mid-2003 forward.

Who Uses This Dataset

Form N-30D shareholder reports are the main public record of how registered management investment companies described portfolios, performance, expenses, and strategy from 1994 until structured forms (N-CSR, N-Q, N-PORT) replaced them. Each accession bundles a metadata file, the primary HTML or TXT report, financial statements, the schedule of investments, and any amendments, and different professions extract different layers.

Quantitative researchers and academic finance faculty

Build survivorship-bias-free fund panels reaching back to 1994, before N-Q (2004) and N-PORT (2019). They parse the schedule of investments in the primary HTML/TXT for CUSIPs, share counts, and market values, and pull total return, expense ratio, and turnover from the financial highlights. Outputs: performance-attribution panels, fund-flow studies, and replication files for papers on window dressing, fee competition, and stale-pricing arbitrage.

Buy-side data engineers and multi-manager analysts

Data teams at multi-manager platforms, fund-of-funds, and outsourced CIOs enrich competitor and target-fund coverage where modern structured holdings are missing. They normalize the schedule of investments into a holdings table keyed by accession and period, lift expense and waiver figures from the financial highlights, and extract the Letter to Shareholders for stated strategy. Output: holdings-overlap analytics, manager-style classification feeds, and sub-adviser due-diligence dossiers.

Fund-rating and fund-research vendors

Commercial fund-analytics shops backfill performance, expense, and holdings series for funds liquidated, merged, or rebranded before structured filings. They parse the metadata file for CIK, series, and class IDs, the financial highlights for per-share data and total returns, and the schedule of investments for asset-class composition. Output: extended fund histories and historical peer-group statistics fed into commercial platforms.

Litigation-support analysts and forensic accountants

Used in 401(k) excessive-fee class actions, early-2000s market-timing and late-trading matters, soft-dollar disputes, and fiduciary-breach claims. They focus on 12b-1 and expense disclosures in the financial highlights, the schedule of investments around alleged market-timing windows, related-party and affiliated-broker notes, and the auditor's report. Output: expert reports, damages models, and exhibits keyed to as-filed contemporaneous disclosures.

Compliance and product teams at fund advisers

Fund counsel, compliance officers, and product staff diligence acquired, reorganized, or merged predecessor funds. They review the Letter to Shareholders for stated objectives and strategy changes, the schedule of investments for legacy positions with tax or compliance carryovers, and N-30D/A amendments to see what was restated. Output: 15(c) board materials, merger files, and predecessor-performance disclosure in registration statements.

Financial historians and economic-policy researchers

Study the post-1987 fund expansion, the dot-com bubble, the 2000-2002 bear market, and the rise of indexing. They read the Letter to Shareholders for contemporaneous manager narrative, the schedule of investments for sector and geographic shifts, and the financial highlights for asset-growth trajectories. Output: monographs and policy papers on industry evolution, retail participation, and monetary-policy transmission through pooled vehicles.

NLP and machine-learning teams

Use shareholder letters and Manager's Reports inside the primary HTML/TXT as training and evaluation data for tone classification, risk-language extraction, and disclosure parsing. They pair narrative passages with the same filing's schedule of investments and financial highlights to link commentary to realized portfolio changes. Output: fine-tuned summarization models, RAG benchmarks over fund disclosures, and labeled corpora for disclosure-quality research.

ESG and stewardship researchers

Reconstruct fund-family historical positioning on sectors and issuers before mainstream sustainability regimes. They mine the schedule of investments for fossil-fuel, tobacco, defense, and controversial-issuer exposures and the Letter to Shareholders for early references to social or environmental screens. Output: longitudinal greenwashing studies, SRI evolution research, and historical baselines for current ESG fund classifications.

Fund accountants, auditors, and assurance teams

Reference library when onboarding legacy series, reconciling historical NAV series, or answering inspection requests on prior-period disclosure. They pull the audited financial statements, fair-valuation and related-party notes, and the fiscal-year-end schedule of investments, using the metadata file to align accessions with reporting periods. Output: reconciliation memos, audit working papers, and historical pricing-policy reviews.

Each group reaches for a different layer of the same filing — the metadata file, the schedule of investments, the financial statements, or the Letter to Shareholders — but all depend on the dataset because no other source preserves three decades of as-filed registered investment company shareholder reports in a uniformly accessible form.

Specific Use Cases

The workflows below draw on specific record components — metadata.json, the primary HTML/TXT shareholder report, the schedule of investments, the financial highlights, or the manager's letter — and produce a defined artifact.

Building a pre-2004 survivorship-bias-free fund panel

A quantitative finance researcher reconstructing mutual-fund performance back to 1994 — the era before N-Q (2004) and N-PORT (2019) — iterates the dataset by periodOfReport and entities[].cik, pulls total return, expense ratio, net investment income ratio, and portfolio turnover from the Financial Highlights table, and parses the Schedule of Investments for asset-class weights. Filings from liquidated and merged funds, indexed by accession and CIK, are retained alongside survivors. Output: a fund-period panel feeding fee-competition, window-dressing, and stale-pricing replication studies.

Backfilling holdings histories for funds that died before structured filings

A fund-research vendor extending Morningstar-style coverage backward parses the Schedule of Investments inside each *-n30d.htm (or ASCII payload) into a normalized holdings table keyed by accessionNo, periodOfReport, and CIK, joining seriesAndClassesContractsInformation to map share classes. The Financial Highlights block supplies per-share NAV history. Output: extended historical holdings and total-return series for funds that disappeared before N-CSR/N-PORT, loaded into a commercial fund-analytics platform.

Reconstructing as-filed disclosures for 401(k) excessive-fee and market-timing litigation

A forensic accountant supporting an ERISA excessive-fee class action or an early-2000s market-timing matter pulls the audited Statement of Operations and Financial Highlights for 12b-1 fees, advisory fees, and expense waivers, the Notes to Financial Statements for affiliated-broker and related-party transactions, and the Schedule of Investments dated near alleged market-timing windows. The auditor's opinion and formType = N-30D/A amendments anchor what was restated and when. Output: damages-model exhibits and expert-report appendices citing contemporaneous as-filed disclosure rather than reconstructed data.

Predecessor-fund diligence for mergers and registration statements

A fund-adviser product or compliance team preparing a registration statement for a reorganized or merged fund reads the Letter to Shareholders for stated investment objective and strategy at the predecessor, the Schedule of Investments for legacy positions with tax-lot or compliance carryovers, the Trustees-and-Officers section for governance continuity, and any N-30D/A amendments tied to the same CIK and periodOfReport. Output: 15(c) board memos, predecessor-performance disclosure for the new prospectus, and merger-file working papers.

Training and evaluating NLP models on three decades of fund-narrative text

An ML team building a fund-disclosure summarization or risk-language model treats the Letter to Shareholders and Manager's Report sections inside the primary HTML/TXT document as a labeled corpus, paired per accession with quantitative outcomes lifted from the same filing's Financial Highlights and Schedule of Investments. Format-aware dispatch on the SGML <DOCUMENT><TYPE>N-30D</TYPE> envelope handles ASCII-era and HTML-era payloads uniformly. Output: fine-tuned tone and risk-disclosure classifiers, and a RAG benchmark linking narrative claims to realized portfolio changes.

Longitudinal ESG and stewardship baselines before mainstream sustainability regimes

An ESG researcher studying sector exposures and early SRI language across the 1994–2003 fund universe filters the dataset to relevant fund families via entities[].companyName and cik, mines each filing's Schedule of Investments for fossil-fuel, tobacco, defense, and other controversial issuers (matched on issuer name strings and, where available, CUSIPs), and scans the Letter to Shareholders for early screening or stewardship language. Output: longitudinal greenwashing studies and historical exposure baselines used to validate or challenge present-day ESG fund classifications.

Auditor and fund-accounting reference for legacy NAV and valuation reconciliations

A fund-accounting or assurance team onboarding a legacy series, or responding to an SEC inspection touching a pre-2003 period, uses metadata.json to align accessions with periodOfReport and fiscal-year ends, pulls the Statement of Assets and Liabilities and the fair-valuation footnote from the Notes to Financial Statements, and ties the period-end Schedule of Investments to historical pricing sources. Output: NAV reconciliation memos, valuation-policy review notes, and audit working papers citing the original audited filing.

Dataset Access

The Form N-30D Files dataset is accessible through three endpoints: a JSON index for metadata and container listings, a single archive containing the full dataset, and per-container archives for selective downloads. The dataset covers Form N-30D and N-30D/A filings from January 1994 onward, is delivered in ZIP container format, and is refreshed continuously. Use the updatedAt field on the index to track refreshes and decide which containers to re-download.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-n30d-files.json

Returns dataset-level metadata (name, description, updatedAt, earliestSampleDate, totalRecords, totalSize, formTypes, containerFormat, fileTypes), the full dataset download URL, and the list of all container files with per-container key, size, records, updatedAt, and downloadUrl. This endpoint does not require an API key and can be polled to determine which containers were updated in the most recent refresh.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-68f7-9e74-7824e75dadf8",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-n30d-files.zip",
4 "name": "Form N-30D Files Dataset",
5 "updatedAt": "2026-04-14T13:28:43.950Z",
6 "earliestSampleDate": "1994-01-01",
7 "totalRecords": 78245,
8 "totalSize": 2543345574,
9 "formTypes": ["N-30D", "N-30D/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-n30d-files/2026/2026-03.zip",
15 "key": "2026/2026-03.zip",
16 "size": 13818783,
17 "records": 154,
18 "updatedAt": "2026-04-14T13:28:43.950Z"
19 }
20 ]
21 }

Fetch the index with curl:

1 curl https://api.sec-api.io/datasets/form-n30d-files.json

Download Entire Dataset: https://api.sec-api.io/datasets/form-n30d-files.zip

Downloads the complete dataset as a single ZIP archive containing every container. This endpoint requires authentication via an Authorization: Bearer YOUR_API_KEY header.

1 curl -H "Authorization: Bearer YOUR_API_KEY" \
2 -o form-n30d-files.zip \
3 https://api.sec-api.io/datasets/form-n30d-files.zip

Download Single Container: https://api.sec-api.io/datasets/form-n30d-files/2026/2026-03.zip

Downloads a single container archive (typically a monthly or quarterly slice) instead of the full dataset. Use the downloadUrl values from the index to target specific containers. This endpoint also requires the Authorization: Bearer YOUR_API_KEY header.

1 curl -H "Authorization: Bearer YOUR_API_KEY" \
2 -o 2026-03.zip \
3 https://api.sec-api.io/datasets/form-n30d-files/2026/2026-03.zip

Frequently Asked Questions

What is Form N-30D?

Form N-30D is the historical SEC form by which registered management investment companies — open-end mutual funds, closed-end funds, business development companies, and certain insurance company separate accounts — delivered their annual and semi-annual shareholder reports to the SEC. It is governed by Section 30(e) of the Investment Company Act of 1940 and Rule 30e-1 (originally Rule 30d-1), which require transmission of the report to shareholders within 60 days after the close of the reporting period and filing with the Commission within 10 days after that transmission.

What does one record in the dataset represent?

One record corresponds to one EDGAR submission of Form N-30D or Form N-30D/A — a single annual or semi-annual shareholder report (or amendment) filed by a registered management investment company. On disk, each record is a per-filing folder named after the 18-digit normalized accession number, containing a metadata.json header plus all non-image documents that originally constituted the EDGAR submission.

How does Form N-30D differ from Form N-CSR?

N-CSR (annual) and N-CSRS (semi-annual) are the post-Sarbanes-Oxley successors to N-30D, effective for reporting periods ending on or after July 9, 2003. They share the Rule 30e-1 transmittal framework and the same core body — shareholder letter, financial statements, schedule of investments, financial highlights — but add CEO/CFO certifications under Section 302 and Section 906, internal-control disclosure, audit-committee financial-expert identification, principal-accountant fee disclosure, and inline-XBRL-tagged exhibits. Coverage is sequential: N-30D is the only source for 1994 through mid-2003, and N-CSR/N-CSRS takes over from there.

Is XBRL data included in the dataset?

No. Form N-30D predates the XBRL regime for fund filings, so the linkToXbrl field in metadata.json is empty and the dataFiles array is routinely empty. Structured XBRL-tagged financial exhibits only appear later under the N-CSR/N-CSRS and N-PORT regimes.

What file formats are inside each record?

The dataset contains TXT, JSON, HTML, and PDF files. Modern filings consist of an HTML primary document plus the metadata.json manifest; filings from the mid-1990s through roughly 1999 are predominantly plain-ASCII TXT submissions; PDF appears sporadically as a primary or supplementary report document. Image attachments (JPG, GIF, PNG) referenced in the manifest are intentionally excluded from on-disk extraction but remain retrievable from sec.gov via the documentUrl values in documentFormatFiles.

What time period does the dataset cover?

The dataset's earliest sample date is 1994-01-01, reflecting the EDGAR phase-in for investment company filers. Records concentrate between January 1994 and the mid-2003 to 2004 cutover to Form N-CSR, with a long, thinning tail of late filings and N-30D/A amendments after 2003. Pre-EDGAR shareholder reports filed on paper or microfiche are not in the dataset.

How do I download the data?

Three endpoints are available. The JSON index at https://api.sec-api.io/datasets/form-n30d-files.json lists all container files and their metadata and does not require an API key. The full dataset is available as a single ZIP at https://api.sec-api.io/datasets/form-n30d-files.zip, and individual monthly containers are downloadable at URLs of the form https://api.sec-api.io/datasets/form-n30d-files/YYYY/YYYY-MM.zip. Both download endpoints require an Authorization: Bearer YOUR_API_KEY header.