SEC Form N-CSR Files Dataset

The SEC Form N-CSR Files dataset is a per-accession archive of every Form N-CSR and Form N-CSR/A certified shareholder report filed on EDGAR by registered management investment companies — open-end mutual funds, ETFs organized as open-end funds, and closed-end funds — since the form's adoption in February 2003. One record is one complete EDGAR submission identified by its 18-digit accession number, packaged as a folder inside a monthly ZIP container that holds a structured metadata.json sidecar alongside the SGML-wrapped primary shareholder report and its certifying exhibits. The dataset covers both N-CSR originals and N-CSR/A amendments, preserves the filing-agent SGML envelope around every payload document, and reflects the regulatory evolution of the form including the Sarbanes-Oxley certification regime, the inline-XBRL phase-in, and the Tailored Shareholder Reports rule that took effect July 24, 2024. The form is filed under Section 30(b)(2) of the Investment Company Act of 1940 and Sections 13(a)/15(d) of the Securities Exchange Act, with the underlying shareholder report required by Rule 30e-1 and the ten-day N-CSR filing window set by Rule 30b2-1. Files are distributed in HTML, plain text, JSON, and occasional PDF, packed into year-organized monthly ZIP containers.

Update Frequency
Daily
Updated at
2026-05-14
Earliest Sample Date
2003-02-01
Total Size
17.7 GB
Total Records
319,790
Container Format
ZIP
Content Types
HTML, JSON, TXT, PDF
Form Types
N-CSR, N-CSR/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

280 files · 17.7 GB
Download All
2026-05.zip53.7 MB369 records
2026-04.zip28.2 MB475 records
2026-03.zip245.3 MB2,322 records
2026-02.zip147.3 MB1,492 records
2026-01.zip165.1 MB1,375 records
2025-12.zip197.0 MB1,902 records
2025-11.zip74.7 MB800 records
2025-10.zip88.5 MB1,009 records
2025-09.zip97.3 MB954 records
2025-08.zip58.5 MB583 records
2025-07.zip79.7 MB903 records
2025-06.zip97.5 MB1,200 records
2025-05.zip90.8 MB589 records
2025-04.zip38.1 MB448 records
2025-03.zip245.7 MB2,433 records
2025-02.zip133.5 MB1,521 records
2025-01.zip184.6 MB1,623 records
2024-12.zip178.7 MB1,704 records
2024-11.zip81.0 MB952 records
2024-10.zip87.1 MB1,093 records
2024-09.zip89.1 MB880 records
2024-08.zip77.8 MB948 records
2024-07.zip47.6 MB584 records
2024-06.zip82.6 MB1,030 records
2024-05.zip105.3 MB622 records
2024-04.zip37.3 MB422 records
2024-03.zip260.4 MB2,191 records
2024-02.zip246.9 MB1,709 records
2024-01.zip113.3 MB1,422 records
2023-12.zip185.4 MB1,742 records
2023-11.zip72.0 MB871 records
2023-10.zip82.5 MB907 records
2023-09.zip70.1 MB757 records
2023-08.zip67.4 MB649 records
2023-07.zip162.5 MB859 records
2023-06.zip93.5 MB1,085 records
2023-05.zip135.7 MB655 records
2023-04.zip39.5 MB394 records
2023-03.zip195.8 MB2,415 records
2023-02.zip201.9 MB1,848 records
2023-01.zip121.9 MB1,370 records
2022-12.zip191.2 MB1,863 records
2022-11.zip68.7 MB826 records
2022-10.zip75.2 MB945 records
2022-09.zip80.2 MB986 records
2022-08.zip40.5 MB521 records
2022-07.zip37.8 MB927 records
2022-06.zip77.4 MB1,077 records
2022-05.zip79.0 MB582 records
2022-04.zip25.1 MB497 records
2022-03.zip181.0 MB2,314 records
2022-02.zip141.1 MB1,748 records
2022-01.zip99.2 MB1,378 records
2021-12.zip137.2 MB1,745 records
2021-11.zip61.7 MB862 records
2021-10.zip50.8 MB788 records
2021-09.zip85.1 MB870 records
2021-08.zip30.5 MB443 records
2021-07.zip41.3 MB905 records
2021-06.zip67.5 MB844 records
2021-05.zip53.1 MB553 records
2021-04.zip21.3 MB378 records
2021-03.zip167.6 MB2,360 records
2021-02.zip121.6 MB1,575 records
2021-01.zip86.5 MB1,329 records
2020-12.zip140.4 MB1,751 records
2020-11.zip55.6 MB787 records
2020-10.zip53.2 MB864 records
2020-09.zip40.7 MB683 records
2020-08.zip44.8 MB584 records
2020-07.zip30.8 MB866 records
2020-06.zip54.8 MB851 records
2020-05.zip62.1 MB595 records
2020-04.zip13.7 MB362 records
2020-03.zip143.8 MB2,247 records
2020-02.zip98.1 MB1,706 records
2020-01.zip79.2 MB1,512 records
2019-12.zip121.1 MB1,846 records
2019-11.zip52.0 MB870 records
2019-10.zip43.3 MB782 records
2019-09.zip40.2 MB730 records
2019-08.zip37.4 MB588 records
2019-07.zip28.9 MB840 records
2019-06.zip36.9 MB746 records
2019-05.zip66.6 MB707 records
2019-04.zip13.9 MB376 records
2019-03.zip146.2 MB2,374 records
2019-02.zip68.2 MB1,528 records
2019-01.zip84.8 MB1,676 records
2018-12.zip82.7 MB1,690 records
2018-11.zip62.7 MB1,024 records
2018-10.zip37.9 MB861 records
2018-09.zip35.8 MB676 records
2018-08.zip41.7 MB689 records
2018-07.zip30.0 MB842 records
2018-06.zip42.4 MB882 records
2018-05.zip67.5 MB666 records
2018-04.zip14.4 MB385 records
2018-03.zip139.2 MB2,379 records
2018-02.zip64.2 MB1,599 records
2018-01.zip90.3 MB1,755 records
2017-12.zip86.4 MB1,748 records
2017-11.zip52.1 MB898 records
2017-10.zip44.6 MB908 records
2017-09.zip35.2 MB700 records
2017-08.zip38.5 MB618 records
2017-07.zip34.7 MB908 records
2017-06.zip39.8 MB931 records
2017-05.zip55.1 MB596 records
2017-04.zip17.9 MB353 records
2017-03.zip137.9 MB2,438 records
2017-02.zip68.5 MB1,613 records
2017-01.zip84.2 MB1,682 records
2016-12.zip81.7 MB1,807 records
2016-11.zip50.9 MB1,022 records
2016-10.zip37.2 MB769 records
2016-09.zip40.2 MB798 records
2016-08.zip36.4 MB671 records
2016-07.zip28.7 MB868 records
2016-06.zip48.7 MB1,040 records
2016-05.zip34.9 MB553 records
2016-04.zip16.3 MB390 records
2016-03.zip138.3 MB2,432 records
2016-02.zip67.9 MB1,751 records
2016-01.zip80.7 MB1,503 records
2015-12.zip88.8 MB1,955 records
2015-11.zip51.2 MB1,017 records
2015-10.zip36.2 MB809 records
2015-09.zip35.4 MB800 records
2015-08.zip35.2 MB655 records
2015-07.zip31.0 MB868 records
2015-06.zip46.8 MB1,105 records
2015-05.zip36.0 MB546 records
2015-04.zip14.0 MB378 records
2015-03.zip127.6 MB2,436 records
2015-02.zip75.7 MB1,724 records
2015-01.zip87.0 MB1,750 records
2014-12.zip77.1 MB1,663 records
2014-11.zip43.3 MB890 records
2014-10.zip42.4 MB927 records
2014-09.zip35.1 MB774 records
2014-08.zip29.5 MB584 records
2014-07.zip34.8 MB910 records
2014-06.zip49.2 MB1,076 records
2014-05.zip30.3 MB627 records
2014-04.zip14.9 MB396 records
2014-03.zip110.3 MB2,183 records
2014-02.zip77.5 MB1,687 records
2014-01.zip85.0 MB1,731 records
2013-12.zip67.3 MB1,594 records
2013-11.zip51.4 MB938 records
2013-10.zip35.2 MB835 records
2013-09.zip28.6 MB697 records
2013-08.zip34.9 MB734 records
2013-07.zip27.7 MB750 records
2013-06.zip45.2 MB1,075 records
2013-05.zip30.2 MB686 records
2013-04.zip17.7 MB431 records
2013-03.zip119.6 MB2,305 records
2013-02.zip65.1 MB1,663 records
2013-01.zip81.1 MB1,700 records
2012-12.zip68.3 MB1,500 records
2012-11.zip48.1 MB937 records
2012-10.zip33.4 MB830 records
2012-09.zip30.8 MB654 records
2012-08.zip30.8 MB670 records
2012-07.zip27.2 MB737 records
2012-06.zip46.4 MB1,037 records
2012-05.zip34.6 MB698 records
2012-04.zip14.1 MB581 records
2012-03.zip129.6 MB2,377 records
2012-02.zip54.1 MB1,537 records
2012-01.zip78.4 MB1,583 records
2011-12.zip64.1 MB1,515 records
2011-11.zip43.9 MB971 records
2011-10.zip31.4 MB815 records
2011-09.zip30.5 MB702 records
2011-08.zip30.0 MB673 records
2011-07.zip21.3 MB707 records
2011-06.zip49.6 MB1,027 records
2011-05.zip36.9 MB701 records
2011-04.zip19.8 MB470 records
2011-03.zip121.8 MB2,347 records
2011-02.zip49.5 MB1,554 records
2011-01.zip80.3 MB1,662 records
2010-12.zip64.3 MB1,418 records
2010-11.zip44.3 MB936 records
2010-10.zip30.7 MB877 records
2010-09.zip31.8 MB705 records
2010-08.zip24.5 MB661 records
2010-07.zip20.3 MB747 records
2010-06.zip48.3 MB950 records
2010-05.zip21.0 MB459 records
2010-04.zip19.4 MB493 records
2010-03.zip129.0 MB2,730 records
2010-02.zip42.1 MB1,326 records
2010-01.zip69.9 MB1,635 records
2009-12.zip63.4 MB1,515 records
2009-11.zip39.6 MB971 records
2009-10.zip35.7 MB955 records
2009-09.zip34.6 MB663 records
2009-08.zip23.6 MB683 records
2009-07.zip23.1 MB814 records
2009-06.zip45.9 MB1,057 records
2009-05.zip19.9 MB504 records
2009-04.zip14.6 MB511 records
2009-03.zip120.9 MB2,715 records
2009-02.zip48.5 MB1,481 records
2009-01.zip76.3 MB1,794 records
2008-12.zip55.7 MB1,552 records
2008-11.zip42.1 MB1,076 records
2008-10.zip34.6 MB946 records
2008-09.zip33.3 MB728 records
2008-08.zip27.1 MB832 records
2008-07.zip25.8 MB747 records
2008-06.zip54.7 MB1,212 records
2008-05.zip28.6 MB530 records
2008-04.zip20.6 MB494 records
2008-03.zip118.7 MB2,742 records
2008-02.zip59.5 MB1,503 records
2008-01.zip84.1 MB2,108 records
2007-12.zip55.4 MB1,504 records
2007-11.zip42.1 MB1,144 records
2007-10.zip28.0 MB822 records
2007-09.zip63.3 MB766 records
2007-08.zip24.6 MB774 records
2007-07.zip25.6 MB698 records
2007-06.zip54.9 MB1,337 records
2007-05.zip19.6 MB500 records
2007-04.zip18.3 MB462 records
2007-03.zip122.3 MB3,081 records
2007-02.zip60.5 MB1,434 records
2007-01.zip74.4 MB2,039 records
2006-12.zip52.1 MB1,489 records
2006-11.zip38.9 MB984 records
2006-10.zip27.1 MB764 records
2006-09.zip27.0 MB799 records
2006-08.zip36.3 MB852 records
2006-07.zip19.3 MB579 records
2006-06.zip51.6 MB1,310 records
2006-05.zip20.2 MB478 records
2006-04.zip21.6 MB521 records
2006-03.zip115.2 MB3,269 records
2006-02.zip58.2 MB1,390 records
2006-01.zip64.0 MB2,012 records
2005-12.zip73.4 MB1,535 records
2005-11.zip40.1 MB1,136 records
2005-10.zip22.4 MB811 records
2005-09.zip51.0 MB1,048 records
2005-08.zip32.8 MB1,029 records
2005-07.zip15.7 MB819 records
2005-06.zip52.6 MB1,298 records
2005-05.zip13.1 MB443 records
2005-04.zip13.5 MB576 records
2005-03.zip105.0 MB3,406 records
2005-02.zip81.5 MB1,469 records
2005-01.zip69.5 MB1,920 records
2004-12.zip90.6 MB1,873 records
2004-11.zip61.3 MB1,249 records
2004-10.zip23.1 MB922 records
2004-09.zip58.4 MB1,250 records
2004-08.zip98.2 MB1,214 records
2004-07.zip19.4 MB745 records
2004-06.zip47.5 MB1,432 records
2004-05.zip11.3 MB521 records
2004-04.zip11.1 MB591 records
2004-03.zip87.0 MB3,736 records
2004-02.zip32.2 MB1,184 records
2004-01.zip43.3 MB2,159 records
2003-12.zip39.4 MB2,001 records
2003-11.zip24.0 MB1,118 records
2003-10.zip18.1 MB1,037 records
2003-09.zip29.5 MB1,329 records
2003-08.zip26.1 MB1,368 records
2003-07.zip35.9 MB1,814 records
2003-06.zip22.1 MB1,022 records
2003-05.zip10.2 MB212 records
2003-04.zip1.4 MB90 records
2003-03.zip6.8 MB276 records
2003-02.zip2.4 MB156 records

What This Dataset Contains

The dataset is a complete, per-accession archive of Form N-CSR (Certified Annual Shareholder Report of Registered Management Investment Companies) and Form N-CSR/A (amendments) submissions transmitted to EDGAR. Each accession is materialized as a single folder named with the dashes-stripped accession number (for example, 000119312525190817 for accession 0001193125-25-190817), and the dataset packages both the filing payload and a structured manifest describing the filer, the fund's series and share-class structure, and every document EDGAR received — including those that are deliberately excluded from the dataset copy (image graphics and standalone XBRL data files).

Form N-CSR itself is the Certified Shareholder Report of Registered Management Investment Companies, mandated by Investment Company Act Section 30(b)(2) of the Investment Company Act of 1940 and by Sections 13(a)/15(d) of the Securities Exchange Act of 1934, and implemented by Rule 30b2-1 under the 1940 Act. A registered management investment company must file N-CSR within ten days after transmitting an annual shareholder report under Rule 30e-1. The form is largely a transmission shell: the substantive disclosure is the shareholder report itself, which is attached as the primary document and supplemented by a fixed roster of certifying exhibits required by the Sarbanes-Oxley Act of 2002Section 302 officer certifications (EX-99.CERT) and Section 906 officer certifications (EX-99.906CERT) — together with the registrant's code of ethics (EX-99.CODEETH) when newly adopted or amended, and the auditor's consent (EX-99.D) where applicable. Form N-CSR/A carries the same internal anatomy but is filed to amend a previously submitted N-CSR; the dataset layout is identical except that the primary document filename ordinarily contains ncsra rather than ncsr, formType is "N-CSR/A", and description reads "Form N-CSR/A - Certified Shareholder Report: [Amend]".

The dataset is distributed as monthly ZIP containers organized into year subdirectories, with the earliest records dating from February 2003. File types found in the archive are HTML, plain text, JSON, and occasional PDF, with HTML inside the SGML envelope accounting for the overwhelming majority of payload documents.

Content Structure of a Single Record

Container and folder layout

When a monthly ZIP container is decompressed, it expands into a flat directory of per-filing folders. Each folder is one record. Inside each folder a consumer will find:

  • exactly one metadata.json — always present, always at the folder root, and the only filename guaranteed to exist
  • one primary N-CSR HTML document — always at documentFormatFiles[] sequence "1" with type equal to N-CSR or N-CSR/A, but with no fixed filename
  • a small set of exhibit HTML files — Section 302 certification, Section 906 certification, code of ethics where applicable, and auditor consent where applicable
  • occasionally a PDF attachment or a plain-text exhibit, where the filer chose those formats

Image files (EDGAR GRAPHIC documents — .jpg, .gif, .png chart and logo images referenced inline by the shareholder report) and standalone XBRL artifacts (.xsd schema, .xml instance documents, and linkbase files) are listed inside metadata.json so the full EDGAR manifest is preserved, but they are not written to disk. Consequently the on-disk file count is always lower than the length of the metadata's document manifest — a filing whose manifest lists forty-plus documents may ship only a handful of HTML files in the folder.

The metadata.json sidecar

metadata.json is the structured anchor of every record. Its top-level keys carry filing-level descriptors, the EDGAR document manifest, the registrant (entity) blocks, and, for open-end fund trusts, the registered series and share-class tree. The fields are:

  • formType"N-CSR" for original filings, "N-CSR/A" for amendments.
  • accessionNo — dashed accession number, e.g. "0001193125-25-190817".
  • effectivenessDate — date EDGAR deemed the filing effective, YYYY-MM-DD.
  • filedAt — ISO-8601 acceptance timestamp with timezone offset, e.g. "2025-08-28T11:22:46-04:00".
  • periodOfReport — the report period-end (fund fiscal year-end, or, for legacy N-CSR records, fiscal half-year), YYYY-MM-DD.
  • description — EDGAR-style human-readable form description, e.g. "Form N-CSR - Certified Shareholder Report".
  • linkToFilingDetails — deep link to the primary document on sec.gov; for inline-XBRL primaries this is the ix?doc=... viewer URL, otherwise the bare document URL.
  • linkToTxt — URL of the full SGML submission .txt.
  • linkToHtml — URL of the EDGAR filing index page (*-index.htm).
  • linkToXbrlXBRL viewer URL; frequently empty for N-CSR records prior to the inline-XBRL phase-in.
  • id — opaque 32-character hex dataset row identifier.
  • documentFormatFiles[] — the EDGAR <DOCUMENT> manifest (see below).
  • dataFiles[] — the XBRL schema, instance, and linkbase manifest; empty for filings without inline XBRL.
  • entities[] — one entry per registrant block in the EDGAR header.
  • seriesAndClassesContractsInformation[] — series and share-class tree (open-end fund trusts only; empty for closed-end trusts and operating-company N-CSR filers).

documentFormatFiles[]

Each entry mirrors one row of the EDGAR <DOCUMENT> table and carries:

  • sequence — a string such as "1", "2", or a literal single space for the trailing complete-submission entry
  • size — byte count as a string
  • documentUrl — absolute sec.gov URL (ix?doc=... when the primary is inline XBRL)
  • description — filer's free-text label; usually the fund name on the primary and the exhibit purpose on exhibits
  • type — EDGAR document type code

Sequence numbers are strings, are not always contiguous (graphics typically introduce gaps), and the trailing complete-submission row uses " " (single space) for both sequence and type and "Complete submission text file" for description.

The type vocabulary in the dataset includes:

  • N-CSR / N-CSR/A — primary shareholder report, always at sequence "1".
  • EX-99.CERTSarbanes-Oxley Act Section 302 officer certifications.
  • EX-99.906CERT (sometimes EX-99.906 CERT with an interior space) — Sarbanes-Oxley Section 906 officer certifications.
  • EX-99.CODEETH (sometimes EX-99.CODE ETH with an interior space) — code of ethics applicable to the registrant's principal executive and senior financial officers, filed when adopted, materially amended, or when a waiver is granted.
  • EX-99.D — consent of the independent registered public accounting firm; most often present for closed-end funds and BDCs (Form N-2 registrants) where audited financials are incorporated by reference into a Securities Act registration statement.
  • EX-19.A3, EX-19.BItem 19 exhibits used by operating-company N-CSR filers (a small minority — for example, certain BDCs and other non-fund N-CSR registrants).
  • GRAPHIC — embedded chart, table, and logo images; referenced in the manifest but excluded from the on-disk folder.
  • XML, EX-101.SCH, EX-101.CAL, EX-101.DEF, EX-101.LAB, EX-101.PRE — XBRL instance document and linkbase files associated with inline-XBRL N-CSRs; also surfaced through dataFiles[] and excluded from the on-disk folder.

Because filers do not normalize exhibit-type strings, consumers should treat EX-99.906CERT/EX-99.906 CERT and EX-99.CODEETH/EX-99.CODE ETH as equivalents. The reliable way to resolve the primary document on disk is to find the documentFormatFiles[] entry whose type is N-CSR or N-CSR/A and whose sequence is "1", then take the basename of its documentUrl.

dataFiles[]

This array mirrors documentFormatFiles[] but enumerates only the XBRL attachments — taxonomy schema (EX-101.SCH), calculation/definition/label/presentation linkbases (EX-101.CAL, EX-101.DEF, EX-101.LAB, EX-101.PRE), and the XML instance document (whose description typically reads "EXTRACTED XBRL INSTANCE DOCUMENT"). For filings without inline XBRL — the majority of older N-CSRs — the array is empty.

entities[]

One entry per registrant block in the EDGAR header. Fields include:

  • cik — numeric string, no leading zeros
  • companyName — EDGAR-style label preserving the parenthetical role tag, e.g. "TOUCHSTONE STRATEGIC TRUST (Filer)" or "(Subject)"
  • type — form type from that header block
  • act"40" for Investment Company Act of 1940 filers
  • fileNo — SEC file number; generally 811-... for registered management investment companies and 814-... for BDCs
  • filmNo — EDGAR film number
  • irsNo — IRS employer identifier
  • fiscalYearEnd — four-digit MMDD (sometimes omitted)
  • stateOfIncorporation — US two-letter code, or empty for non-US registrants
  • tickers — array of exchange tickers; populated for closed-end funds, ETFs, and BDCs that are exchange-listed; commonly absent or empty for open-end mutual fund trusts whose shares are not exchange-listed at the trust level

seriesAndClassesContractsInformation[]

This array exposes the EDGAR series-and-class structure that open-end fund trusts must report. Each entry describes one registered series and carries the series identifier (S0000…), a name, and a classesContracts[] list. Each share class has its own EDGAR class-contract identifier (C0000…), a name (typically the class letter or label such as A, C, Institutional, R6), and a ticker where one is assigned. A single multi-fund trust filing routinely enumerates many series and dozens of share classes within one N-CSR. For closed-end fund trusts and for operating-company N-CSR filers, this array is empty and consumers should fall back to entities[].tickers and entities[].fileNo for issuer identification.

The primary N-CSR document and its internal sections

The primary document at sequence "1" carries the substantive content. It is delivered inside an SGML envelope of the form:

1 <DOCUMENT>
2 <TYPE>N-CSR
3 <SEQUENCE>1
4 <FILENAME>…
5 <DESCRIPTION>…
6 <TEXT>
7 … HTML payload …
8 </TEXT>
9 </DOCUMENT>

Parsers must strip this preamble before treating the inner content as HTML. The header markers <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> are authoritative and should be reconciled against the corresponding documentFormatFiles[] entry.

The Item structure that has applied across most of the dataset's lifetime contains, in order:

  • Item 1 — Reports to Stockholders. The shareholder report itself: the letter to shareholders, management's discussion of fund performance (MDFP), performance graphs and tables, schedule of investments (portfolio holdings as of period-end with security identifiers, principal amount or shares, fair value, and percentage of net assets), statement of assets and liabilities, statement of operations, statements of changes in net assets, statement of cash flows where applicable, financial highlights per share class, notes to financial statements, the report of the independent registered public accounting firm (for annual reports), and tax information disclosures.
  • Item 2 — Code of Ethics. Covers adoption, amendments, and waivers; the code itself, when newly adopted or amended, is filed as the EX-99.CODEETH exhibit.
  • Item 3 — Audit Committee Financial Expert. Identification of the designated financial expert and independence statement.
  • Item 4 — Principal Accountant Fees and Services. Audit, audit-related, tax, and all-other fees for the two most recent fiscal years, plus pre-approval policies.
  • Item 5 — Audit Committee of Listed Registrants. Applicable to listed closed-end funds and BDCs.
  • Item 6 — Investments. Historically a stub pointing to Item 1 for the schedule of investments; under the Tailored Shareholder Reports rule, this item carries the complete portfolio holdings when they are not in the streamlined shareholder report.
  • Items 7–11. Proxy voting policies and procedures, proxy voting record (closed-end), portfolio manager information, share repurchases (closed-end), and changes in nominating procedures.
  • Item 12 — Disclosure of Securities Lending Activities for Closed-End Management Investment Companies. Added by later amendments to Form N-CSR.
  • Item 13 — Exhibits. Pointers to the certifications and code-of-ethics exhibits.

The signature block at the foot of the primary document carries the principal executive officer's and principal financial officer's signatures with titles and dates. Form N-CSR is signed on behalf of the registrant and separately by each certifying officer; the standalone Section 302 and Section 906 statements live in their own exhibit files.

For filings on or after the Tailored Shareholder Reports compliance date (July 24, 2024), the open-end-fund layout shifts: Item 1 attaches a streamlined retail-facing tailored shareholder report (TSR) per share class, while Items 6 and following carry the full portfolio holdings, complete financial statements, management information, and approval-of-advisory-contract disclosures previously housed inside Item 1.

The certifying and supporting exhibits

The exhibit roster attached to a typical N-CSR is short and stable across filers:

  • EX-99.CERT — Section 302 certifications. Two near-identical certification statements, one by the principal executive officer and one by the principal financial officer, in the form prescribed by Form N-CSR's General Instructions, attesting to the officer's review, disclosure controls, and internal control over financial reporting.
  • EX-99.906CERT — Section 906 certifications. Officer statements furnished pursuant to 18 U.S.C. § 1350 affirming that the report fully complies with the requirements of Section 13(a)/15(d) and fairly presents the financial condition and operations of the registrant. Section 906 certifications are "furnished" rather than "filed" and carry different legal exposure, but ship as exhibits alongside the Section 302 certifications.
  • EX-99.CODEETH — Code of ethics. Filed when the registrant adopts a new code or materially amends an existing one; not present on every N-CSR.
  • EX-99.D — Auditor consent. A short consent letter from the registered public accounting firm to the incorporation of its audit report; most commonly present for closed-end funds and BDCs whose audited financials are incorporated by reference into a Securities Act registration statement.
  • EX-19.A3 / EX-19.B. Item 19 exhibits used only by operating-company N-CSR filers.

Each exhibit is wrapped in the same SGML envelope as the primary document, with <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> markers preserved exactly as transmitted to EDGAR.

File-naming conventions

Filenames are not standardized; they reflect the conventions of the filing agent that prepared the submission. Patterns found in the dataset include:

  • Donnelley / Toppan-style (d<docid>dncsr.htm, d<docid>dex99cert.htm, d<docid>dex99906cert.htm, d<docid>dex99d.htm).
  • Toppan Merrill-style (tm<jobid>d<rev>_ncsr.htm, tm<jobid>d<rev>_ex99-cert.htm, tm<jobid>d<rev>_ex99-906cert.htm, tm<jobid>d<rev>_ex99-codeeth.htm; amendments use _ncsra in place of _ncsr).
  • EdgarAgents-style (opaque hex-prefixed primary names such as 8dde5aa36ebc3f9.htm alongside descriptive exhibit names like JIF_6-30_N-CSR_EX-99_6.30.25.htm).
  • Generic in-trust (primary-document.htm, cert302.htm, section906.htm, coe.htm, ncsrcodeofethicsjan2022.htm).
  • In-house filer-style (formn-csr.htm, ex19-a3.htm, ex19-b.htm).

The only guaranteed filename in any record is metadata.json. Everything else must be resolved through documentFormatFiles[].

Included content

For each accession the record contains the dataset pipeline's metadata.json and every primary and exhibit document in the EDGAR submission whose payload is HTML, plain text, JSON, or PDF. The dataset thus preserves the full narrative and certifying content of each filing: the shareholder report and its embedded financials, schedule of investments, MDFP, and signature block; the Section 302 and Section 906 certifications; the code of ethics where filed; the auditor consent where filed; and any non-image, non-XBRL exhibits the filer attached.

Excluded or separate content

Two classes of attachment are deliberately omitted from the on-disk record:

  • Image and graphic files. Every GRAPHIC entry listed in documentFormatFiles[] (chart images, fund-performance graphics, logos, embedded figures) is excluded. The manifest still records their filenames, sizes, and EDGAR URLs so the original submission can be reconstructed by fetching the missing assets directly from sec.gov.
  • Standalone XBRL data files. Schema (.xsd), instance documents (.xml), and linkbase files are tracked in dataFiles[] (and the corresponding EX-101.* entries of documentFormatFiles[]) but are not written to the folder. Inline XBRL facts embedded inside the primary HTML remain present in that HTML document, since inline-XBRL tags live in the rendered HTML stream.

The complete .txt submission (the SGML envelope wrapping every document EDGAR received) is also not packaged into the folder; it remains accessible through the linkToTxt URL.

Amendments (N-CSR/A)

Form N-CSR/A records are structurally identical to N-CSR records in this dataset. The folder layout, exhibit roster, and metadata.json schema are unchanged; only formType shifts to "N-CSR/A", description reads "Form N-CSR/A - Certified Shareholder Report: [Amend]", and the primary document filename typically contains ncsra instead of ncsr. An amendment is a self-contained re-submission that supersedes the prior accession for the periods and items it addresses, and is therefore stored as its own record under its own accession folder rather than being merged with the original.

Open-end versus closed-end record divergence

The internal anatomy diverges in two practical respects depending on registrant type.

Open-end fund trusts (typically Section 8 registrants on Form N-1A, file number prefix 811-) populate seriesAndClassesContractsInformation[] with a full series-and-class tree. The primary N-CSR usually consolidates multiple series under one trust-level filing, so the financial statements and schedules of investments are stacked series-by-series within Item 1 (or, post-Tailored Shareholder Reports, within Items 6 and following). The entities[].tickers array is frequently absent because shares are not exchange-listed at the trust level.

Closed-end fund trusts and BDCs (file number prefixes 811- for registered closed-end funds, 814- for BDCs) leave seriesAndClassesContractsInformation[] empty, expose their listed ticker through entities[].tickers, almost always carry an EX-99.D auditor consent, and may carry Item 5 (audit committee listing requirements) and Item 10 (share repurchases) disclosures absent from open-end records.

Operating-company N-CSR filers — a small minority — depart further: they may include EX-19.A3 and EX-19.B item exhibits and tend to use plainer in-house filenames.

Structural evolution over time

Form N-CSR's content and structure have been adjusted by several rule changes since the form's inception in 2003:

  • Initial adoption (effective February 2003). Form N-CSR was created by Investment Company Act Release No. 25914 / Exchange Act Release No. 47262 to implement the certified shareholder reporting regime under Sections 302 and 906 of the Sarbanes-Oxley Act. The earliest dataset records begin on this effective date with the original Item set covering code of ethics, audit committee financial expert, and principal accountant fees.
  • Proxy voting record disclosure (2003). Item 7 was populated by Release No. IC-25922 requiring funds to file their full annual proxy voting records on Form N-PX, with cross-references appearing in N-CSR.
  • Audit committee and accountant disclosures. Subsequent rule packages added and renumbered Items 4–5 to align with SEC accountant pre-approval and listed-issuer audit committee requirements.
  • Securities lending disclosure (Item 12). Closed-end management investment companies acquired a dedicated Item 12 on securities lending activities, expanding the closed-end-only sections of the Item set.
  • Tailored Shareholder Reports rule (Release No. IC-34731, October 2022; compliance date July 24, 2024). The SEC adopted amendments creating a short, retail-facing tailored shareholder report (TSR) for open-end funds, while moving the previously-required schedule of investments, financial statements, and other detailed disclosures out of the shareholder report and into Form N-CSR's Items (notably Items 6, 7, 8, 9, 10, and 11). The shape of the primary N-CSR document for open-end funds changed materially as a result.
  • Separation of Form N-CSRS (semi-annual reports), effective July 24, 2024. The same rule release introduced Form N-CSRS as a distinct EDGAR form type for semi-annual reports, splitting semi-annual filings off from the annual N-CSR. This dataset includes only N-CSR and N-CSR/A form types; semi-annual reports on the new N-CSRS form type fall outside the dataset's scope. Records filed on N-CSR before the compliance date may have periodOfReport values corresponding to either fiscal year-ends or fiscal half-year-ends; after the compliance date, periodOfReport values correspond to fiscal year-ends only.
  • Inline XBRL tagging. The SEC's structured-data initiative for fund disclosures progressively required inline-XBRL tagging of specified financial and identifying data on N-CSR. As tagging is phased in, linkToFilingDetails increasingly points at the ix?doc= viewer URL, dataFiles[] populates with EX-101.* linkbase entries, and the primary HTML embeds inline-XBRL fact tags inline.

Format evolution

N-CSR has been an EDGAR-filed form throughout its life, and the dataset's earliest records (February 2003 onward) are already HTML-based, wrapped in the same SGML <DOCUMENT> envelope used today. Two material format shifts have occurred within that long-stable framing:

  • Plain HTML to inline-XBRL HTML. Earlier records consist of plain HTML payloads inside the SGML wrapper, with no associated dataFiles[] entries. Later records, as fund inline-XBRL tagging requirements have taken effect, embed inline-XBRL facts inside the primary HTML and surface taxonomy schema and linkbase files through dataFiles[]. The presence of an ix?doc= prefix on linkToFilingDetails is a fast indicator of an inline-XBRL primary.
  • Exhibit-type label drift. EDGAR has accepted slight variants of the same exhibit type code throughout the form's history. Both EX-99.906CERT and EX-99.906 CERT appear, as do EX-99.CODEETH and EX-99.CODE ETH. The dataset preserves these strings exactly as transmitted; downstream consumers must normalize them.

Interpretation and extraction notes

Several anatomy details matter for accurate extraction:

  • Strip the SGML preamble first. Every payload file is wrapped in <DOCUMENT>...<TEXT>...</TEXT></DOCUMENT> markers. The inner content can only be parsed as HTML after the preamble is removed. The preamble's <TYPE>, <SEQUENCE>, <FILENAME>, and <DESCRIPTION> lines are authoritative and should be reconciled against documentFormatFiles[].
  • Resolve the primary through the manifest, not the filename. Find the documentFormatFiles[] entry where type is N-CSR or N-CSR/A and sequence is "1", then take the basename of documentUrl (stripping the ix?doc= prefix when present).
  • Normalize exhibit-type strings. Collapse EX-99.906CERT/EX-99.906 CERT and EX-99.CODEETH/EX-99.CODE ETH before grouping.
  • Account for sequence-string quirks. Sequence values are strings, not always contiguous (graphics introduce gaps), and the trailing complete-submission row uses a single space in both sequence and type.
  • Expect a manifest-versus-disk delta. The on-disk file count is always less than documentFormatFiles.length because GRAPHIC and standalone XBRL entries are listed in the manifest but excluded from the folder. Consumers that need those assets must fetch them via each entry's documentUrl.
  • Identify funds at the right level. For open-end trusts, fund-level identification should flow through seriesAndClassesContractsInformation[] because the trust-level entities[] block aggregates multiple economically distinct funds. For closed-end funds, BDCs, and operating-company filers, identification should flow through entities[].tickers and entities[].fileNo.
  • Interpret periodOfReport in light of the July 24, 2024 bifurcation. Earlier records may report fiscal half-year-ends; later records (within this N-CSR-only dataset) report fiscal year-ends only.
  • Inline XBRL behavior. When the primary is inline XBRL, the structured facts live inside the rendered HTML stream and remain available even though the standalone .xml instance and EX-101.* linkbase files are excluded from the on-disk folder; the dataFiles[] array provides the URLs needed to fetch those companion artifacts from sec.gov.

Who Files or Publishes This Dataset, and When

Who files the record

Form N-CSR is filed by registered management investment companies — investment companies registered with the SEC under the Investment Company Act of 1940 and organized as either open-end or closed-end management investment companies. The legal EDGAR filer is the registrant entity (typically a Delaware or Massachusetts statutory trust, a Maryland corporation, or, less commonly, a limited partnership), not the individual series within it.

In scope:

  • Open-end management investment companies (mutual funds) registered on Form N-1A.
  • Closed-end management investment companies registered on Form N-2, including listed closed-end funds and non-listed/interval/tender-offer funds.
  • Exchange-traded funds organized as open-end management companies. These file N-CSR on the same basis as conventional mutual funds.
  • Series trusts. A single trust on EDGAR commonly files N-CSR covering one or more of its series at a time, depending on fiscal-year alignment.

Out of scope (do not file N-CSR):

Although the entity is the filer, the Sarbanes-Oxley Section 302 of the Sarbanes-Oxley Act and Section 906 of the Sarbanes-Oxley Act certifications inside each filing are personally signed by the registrant's principal executive officer and principal financial officer.

Statutory and rule framework

Form N-CSR rests on three statutes and two operative rules:

  • Investment Company Act Section 30(b)(2) — the primary statutory source of the filing obligation; requires registered management investment companies to file with the SEC the same reports they transmit to shareholders, on a schedule set by Commission rule.
  • Securities Exchange Act Sections 13(a) and 15(d) — the secondary filing authority. Because most registered funds are Exchange Act reporting issuers, N-CSR is also a "report filed under Sections 13(a)/15(d)," which is what makes the SOX certifications enforceable.
  • Sarbanes-Oxley Act Sections 302 and 906 — require the principal executive and financial officers to certify the report's accuracy, the effectiveness of disclosure controls and internal control over financial reporting (Section 302), and full Exchange Act compliance and fair presentation (Section 906, with criminal liability).
  • Rule 30e-1 (Investment Company Act) — requires transmission of annual (audited) and semi-annual (unaudited) shareholder reports.
  • Rule 30b2-1 (Investment Company Act) — sets the filing deadline for Form N-CSR.

The form was adopted in early 2003; the earliest EDGAR N-CSR filings date from February 2003.

When the record is created: the Rule 30e-1 trigger

A Form N-CSR filing is event-driven, not calendar-driven. The triggering event is the transmission of a Rule 30e-1 shareholder report to investors:

  • Once after the annual report (audited financial statements, full fiscal year).
  • Once after the semi-annual report (unaudited financial statements, first six months of the fiscal year).

Because the trigger is the fund's fiscal-period-end shareholder report rather than a calendar-year cycle, N-CSR filings cluster by each registrant's fiscal-year-end and the six-month half-year date. A series trust whose series have staggered fiscal year-ends generates N-CSR filings throughout the calendar year.

Timing: the ten-day filing window

Under Rule 30b2-1, the registrant must file Form N-CSR with the SEC no later than ten days after the date the shareholder report is transmitted to shareholders. The clock runs from the transmission date, not from the fiscal-period-end.

Because Rule 30e-1 requires the shareholder report itself to be transmitted within 60 days after the close of the relevant fiscal period, the outside boundary for an N-CSR filing is roughly 70 days after period-end.

Form N-CSR/A (amendments) are filed to correct, restate, or supplement a prior N-CSR — for example, restated financials, corrected schedules of investments, or re-executed certifications. Amendments have no fixed deadline; they are filed when the need is identified and inherit the filer and triggering event of the original report.

Tailored Shareholder Reports (effective July 24, 2024)

The SEC's 2022 amendments to Rule 30e-1 and Form N-CSR — the Tailored Shareholder Reports (TSR) rule — became effective with a compliance date of July 24, 2024. They restructured shareholder reporting for open-end funds only (mutual funds and ETFs on Form N-1A):

  • Shareholders now receive a concise, retail-oriented tailored shareholder report (one per share class).
  • Detailed financial statements, the full schedule of investments, and other previously bundled disclosures are moved into Form N-CSR and made available online, by request, and on EDGAR rather than mailed.
  • The tailored shareholder report itself is filed as an exhibit to N-CSR.
  • The optional Rule 30e-3 "notice and access" mailing-suppression mechanism is no longer available to open-end funds, which are now expected to deliver the tailored report directly.

The semi-annual filing remains a Form N-CSR submission (often informally labeled Form N-CSRS by filers and vendors), but its content is conformed to the TSR regime: a concise semi-annual tailored report for shareholders, with full unaudited semi-annual financials filed on N-CSR.

Closed-end funds and BDCs are not in scope of the tailored shareholder reports framework. Closed-end funds continue to file N-CSR under the pre-TSR content model; BDCs continue to report on Forms 10-K and 10-Q.

Important distinctions

  • Filer vs. series. The CIK on an N-CSR submission belongs to the registrant trust or corporation, not the individual series; one filing may cover multiple series sharing a fiscal period.
  • Annual vs. semi-annual content. The independent auditor's report appears only on the annual N-CSR; semi-annual filings contain unaudited financials.
  • N-CSR vs. N-PORT. Portfolio holdings between shareholder reports are filed on Form N-PORT (which replaced Form N-Q). N-PORT is monthly (non-public for the first two months of each quarter, public for the third); N-CSR remains the certified semi-annual/annual shareholder report.
  • N-CSR vs. N-CEN. Form N-CEN (which replaced Form N-SAR) carries annual registrant-level census data and is filed by all registered investment companies including UITs. N-CSR is limited to management companies and carries the certified shareholder report.
  • BDCs. Despite resembling closed-end funds (listed, non-redeemable), BDCs sit on the Exchange Act operating-company side; their annual report to shareholders is typically incorporated into Form 10-K, not filed on N-CSR.

How This Dataset Differs From Similar Datasets or Filings

Form N-CSR sits within a tight cluster of investment company filings. Several adjacent forms share content (holdings, fund financials, governance, performance) but differ in rule basis, cadence, structure, and filer scope. The comparisons below mark where N-CSR is the right source and where another filing is.

Form N-CSRS (legacy semi-annual)

Pre-consolidation, N-CSR covered annual certified shareholder reports and N-CSRS covered semi-annuals. The SEC retired the N-CSRS designation; semi-annuals now file on N-CSR. Historical N-CSRS filings remain in EDGAR under the old form type and are not in this dataset. A complete pre-consolidation semi-annual time series requires combining an N-CSRS dataset with this one.

Form N-PORT

Monthly position-level portfolio holdings in structured XML: identifiers, fair values, derivatives, counterparties, and risk metrics. Only every third month is made public, on a delay.

N-CSR also discloses holdings but as a period-end schedule embedded in the HTML/PDF shareholder report, not parsed rows. Use N-PORT for systematic holdings analytics and intra-period changes; use N-CSR for audited financial statements, performance narrative, and officer certifications, none of which N-PORT carries.

Form N-Q (rescinded)

The pre-2019 quarterly holdings report for Q1 and Q3, replaced by N-PORT. Not in this dataset. Relevant only for long-horizon holdings series spanning the pre-N-PORT era; N-CSR remains the source for annual and semi-annual periods throughout.

Form N-1A

The registration statement for open-end funds (mutual funds, most ETFs): prospectus, SAI, and Part C exhibits. Filed at launch and refreshed via post-effective amendments and Rule 485 filings. Prospective and registration-stage, not periodic.

N-CSR is the post-registration periodic counterpart: realized returns, audited financials, and the period-end schedule of investments. Complementary, never substitutable.

Form N-2

The closed-end fund analogue of N-1A, also covering BDCs and interval funds. Registration-stage and prospectus-oriented. Closed-end funds also file N-CSR, so lifecycle tracking pairs N-2 (offering) with this dataset (periodic).

Forms 10-K and 10-Q

Operating-company periodic reports under the Exchange Act. They share the SOX 302 certification framework with N-CSR but cover consolidated corporate financials, segments, and risk factors for operating businesses. Registered investment companies do not file 10-K or 10-Q for fund operations; N-CSR is the 1940 Act analogue. Do not merge the two regimes without explicit reconciliation.

Form N-CEN

Annual census filing in structured XML: organizational facts, service providers, securities lending, fund-of-fund relationships, classifications. No financial statements, no holdings schedule, no performance narrative. Content sets are largely disjoint from N-CSR; the two are complements on the annual cadence.

Form N-PX

Annual proxy voting record (and, post-2024 amendments, say-on-pay reports from certain institutional managers). N-CSR may reference voting policies but contains no votes. Stewardship analysis needs N-PX for the ballots and N-CSR for the financial context.

Form N-CSR/A (included in this dataset)

Amendments to prior N-CSR filings, included alongside originals. Amendments may correct certifications, restate financials, refile a missing exhibit, or revise the shareholder report. Both accessions are preserved: the latest amendment supersedes for the period, but the original remains the contemporaneous record. Version reconstruction is the user's responsibility; this dataset does not collapse amendments into a single canonical record.

Tailored Shareholder Reports (Item 27A, effective July 24, 2024)

Not a separate form, but a regime change inside N-CSR. Open-end funds now produce concise, fund-and-class-specific shareholder reports under new Item 27A, replacing the long-form report under legacy Item 27. The new reports use prescribed sections (fund expenses, MDFP, fund statistics, graphical holdings, material changes) and structured tagging. Closed-end funds and BDCs remain on the legacy regime.

Practical effect on this dataset: open-end filings before July 24, 2024 follow Item 27 and look materially different from those on or after that date in length, layout, and tagging. Longitudinal text mining and layout-dependent extraction must accommodate the break; section locations and the boundary between the shareholder report and the financial statements shift at the effective date.

Boundary summary

N-CSR is uniquely the certified, audited (or reviewed) shareholder-report package for registered investment companies. Only this dataset contains, in their filed form together:

  • Fund-level financial statements (assets and liabilities, operations, changes in net assets, financial highlights)
  • The period-end schedule of portfolio investments as shown to shareholders
  • Management's discussion of fund performance
  • SOX 302 officer certifications applied under the 1940 Act
  • The Rule 30e-1 shareholder report, including post-July 2024 Item 27A reports for open-end funds
  • Original EDGAR exhibits (excluding images), preserved per accession

N-PORT delivers finer-grained holdings, N-CEN delivers structural census data, N-PX delivers voting records, and N-1A or N-2 deliver prospectus content. None substitutes for N-CSR when the question concerns reported fund results, the shareholder communication itself, or the SOX-certified periodic record of a registered investment company.

Who Uses This Dataset

Because Form N-CSR bundles audited or unaudited financial statements, the full schedule of investments, the manager's discussion, financial highlights, officer certifications, the code-of-ethics exhibit, and the auditor consent into one record, different professions read different sections of that record.

Fund research analysts

Buy-side, sell-side, and third-party fund analysts covering mutual funds, closed-end funds, ETFs, and variable insurance trusts pull the schedule of investments for share-lot holdings, the statement of operations and financial highlights for gross vs. net expense ratios and per-share metrics, and the manager's discussion for attribution and positioning. Outputs feed fund ratings, peer screens, and style-box classifications.

Quantitative researchers

Quant teams ingest the schedule of investments and financial statements to build point-in-time holdings panels and detect style drift, factor tilts, derivatives use, and leverage. They reconcile N-CSR positions against N-PORT and 13F for cross-validation and use financial highlights and the statement of changes to drive performance-attribution and crowding models.

Allocators and manager-selection teams

Fund-of-funds, OCIO desks, and RIA due-diligence groups use N-CSR to verify what a fund actually holds versus marketing claims. They scan the schedule of investments for illiquids, 144A securities, and derivatives; the notes for Level 3 assets, securities-lending terms, and affiliate transactions; and the financial highlights for expense waivers and ratios. These feed manager scorecards and redemption or onboarding decisions.

Fund counsel and compliance officers

Legal and compliance staff at advisers and fund boards benchmark peer disclosure across complexes: principal-risk wording, expense-limitation agreements, affiliated brokerage, proxy voting summaries, and Rule 22e-4 liquidity classifications. The code-of-ethics exhibit and ICFR change disclosures inform their own filings and board advice.

Auditors and inspection reviewers

Audit teams and internal quality reviewers covering fund clients reference the financial statements, notes, schedule of investments, auditor's report, and consent exhibit to assess fair-value hierarchy classifications, related-party disclosures, and opinion consistency across a firm's fund practice. Independent valuation specialists benchmark Level 1/2/3 methodologies against the significant accounting policies.

Academic researchers

Finance, accounting, and economics researchers exploit the long N-CSR history to study expense ratios, turnover, fee competition, performance persistence, soft-dollar practices, and how disclosure language predicts returns. Governance and ESG scholars analyze the manager's discussion and code-of-ethics exhibits for changes in language over time.

Financial journalists and consumer advocates

Reporters covering asset management and retirement use the financial highlights and notes to investigate expense growth, acquired-fund fees, securities-lending revenue splits, and underperformance against benchmarks. Consumer-advocacy researchers surface fee inflation and undisclosed strategy shifts affecting retail and retirement investors.

Governance and activist researchers

Stewardship and proxy-campaign teams cross the schedule of investments with N-PX records to map which funds hold which issuers at certified semi-annual snapshots, supporting share-mobilization estimates and engagement targeting. N-CSR adds an audited layer to the more frequent N-PORT and 13F views.

AI, ML, and RAG engineering teams

ML teams treat N-CSR as a high-value training and evaluation corpus: long-form shareholder letters and manager's discussions paired with structured tables (financial statements, schedule of investments, financial highlights). Use cases include fine-tuning on manager commentary, building retrieval pipelines that link narrative to numbers, training table extractors, and evaluating models on SOX certification language and audit-opinion classification.

Risk and operations at fund service providers

Custodians, fund administrators, and prime brokers use N-CSR to reconcile custody books to public disclosures, validate fair-value methodologies, and monitor counterparty exposures in derivatives schedules and securities-lending notes. Section 302 certifications and disclosure-controls language feed vendor-risk reviews and SOC-style attestations supplied to fund clients.

Credit and counterparty analysts

Lenders and derivatives counterparties facing registered funds read the financial statements, schedule of investments, and notes on collateral and counterparty exposure to gauge fund-level liquidity, derivative leverage, and concentration. Output supports counterparty limit setting and stress modeling.

Specific Use Cases

The Form N-CSR Files dataset supports a range of concrete workflows that draw on specific parts of each filing record.

Build point-in-time fund holdings panels

Parse the schedule of investments from the primary N-CSR HTML at sequence "1" to produce per-fund, per-period-end position tables (security identifier, principal amount or shares, fair value, percent of net assets). Key the panel by seriesAndClassesContractsInformation[].seriesId for open-end trusts and by entities[].fileNo plus entities[].tickers for closed-end funds and BDCs. Reconcile against N-PORT month-end positions to validate holdings extraction and to fill the audited annual anchor in a longer holdings time series.

Track expense ratios, fee waivers, and acquired-fund fees per share class

Extract the financial highlights table from Item 1 (or Items 6 and following for post-July 2024 open-end TSR filings) at the share-class level using the classesContracts[] identifiers in metadata.json. Feed gross expense ratio, net expense ratio, waiver/recoupment lines, turnover, and per-share NAV/returns into longitudinal manager-fee studies, RIA due-diligence scorecards, or journalism on fee inflation across complexes.

Monitor auditor changes and going-concern language across a fund family

Diff the report of the independent registered public accounting firm inside Item 1 and the EX-99.D auditor consent across consecutive annual N-CSRs for the same cik. Flag changes in auditor identity, opinion modifications, critical audit matters, or absence of EX-99.D where one was previously present. Output drives audit-quality monitoring lists and PCAOB inspection prep.

Benchmark Sarbanes-Oxley certification and ICFR disclosure language

Pull every EX-99.CERT (Section 302) and EX-99.906CERT (Section 906) exhibit across the dataset, normalizing the EX-99.906CERT/EX-99.906 CERT label variants. Cluster certification text against the Form N-CSR General Instructions template to detect deviations, missing paragraphs, or disclosure-control weaknesses noted by the principal executive or financial officer. Output supports compliance peer benchmarking and inspection-readiness reviews at fund advisers.

Detect code-of-ethics adoptions, amendments, and waivers

Filter the dataset for records containing an EX-99.CODEETH (or EX-99.CODE ETH) exhibit, since the code is filed only on adoption, material amendment, or waiver. Cross with Item 2 of the primary document to classify the event type, and build a longitudinal log of governance changes by registrant. Useful for fund counsel benchmarking peer codes and for governance researchers studying ethics-regime evolution.

Train and evaluate retrieval pipelines on MDFP narrative paired with financials

Use the manager's discussion of fund performance, shareholder letter, and notes from the primary HTML as long-form text passages, anchored to the structured tables (statement of operations, statement of assets and liabilities, financial highlights) in the same record. Train RAG systems that answer "why did this fund underperform" by linking narrative attribution to numbers, and evaluate table-extraction models on the schedule of investments under both legacy Item 27 and post-July-2024 Tailored Shareholder Report layouts.

Reconstruct amendment histories per accession

Group records by cik and periodOfReport, then chain originals (formType == "N-CSR") to their amendments (formType == "N-CSR/A") using filedAt ordering. Diff financial statements, schedule of investments, and certifying exhibits between the original and the amendment to identify restatements, refiled exhibits, or corrected officer certifications. Output supports forensic-accounting reviews and contemporaneous-versus-superseded versioning for litigation support.

Map open-end trust series-and-class structures over time

Walk seriesAndClassesContractsInformation[] across every N-CSR for a given trust cik to track series launches, terminations, class additions (for example, new R6 or institutional classes), and ticker assignments. Combined with the share-class-level financial highlights, this yields a class-level lifecycle dataset for product-strategy and competitive-intelligence work at asset managers.

Dataset Access

The Form N-CSR Files Dataset is available through three access methods: a JSON metadata API, a full dataset archive download, and per-container ZIP downloads. The dataset covers filings from February 2003 to present, organized into ZIP containers grouped by year and month.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-ncsr-files.json

This endpoint returns dataset-level metadata (name, description, last update timestamp, earliest sample date, total records, total size, form types, container format, file types), the full dataset download URL, and the complete list of container files with per-container size, record count, last updated timestamp, and individual download URL. Poll this endpoint daily to detect which containers were refreshed in the most recent run and decide which ones to re-download. This endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-68f3-aa7e-ed98699545fd",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-ncsr-files.zip",
4 "name": "Form N-CSR Files Dataset",
5 "updatedAt": "2026-05-14T02:55:05.498Z",
6 "earliestSampleDate": "2003-02-01",
7 "totalRecords": 319790,
8 "totalSize": 17744322275,
9 "formTypes": ["N-CSR", "N-CSR/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["HTML", "JSON", "TXT", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-ncsr-files/2026/2026-05.zip",
15 "key": "2026/2026-05.zip",
16 "size": 138187830,
17 "records": 412,
18 "updatedAt": "2026-05-14T02:55:05.498Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-ncsr-files.zip?token=YOUR_API_KEY

Downloads the complete dataset as a single ZIP archive containing every monthly container from February 2003 through the most recent refresh. This endpoint requires an API key.

Download Single Container: https://api.sec-api.io/datasets/form-ncsr-files/2026/2026-03.zip?token=YOUR_API_KEY

Downloads one monthly container ZIP rather than the full archive. Use the downloadUrl values returned by the index API to target specific year-month containers. This endpoint requires an API key.

Frequently Asked Questions

What form types does the Form N-CSR Files dataset cover?

The dataset covers two EDGAR form types: N-CSR (the Certified Annual Shareholder Report of Registered Management Investment Companies) and N-CSR/A (amendments to a previously filed N-CSR). Semi-annual reports filed on the new N-CSRS form type (separated from N-CSR effective July 24, 2024) fall outside this dataset's scope.

What does one record in this dataset represent?

One record is one complete EDGAR submission identified by an 18-digit accession number, materialized as a folder named with the dashes-stripped accession (for example, 000119312525190817). Inside the folder are a metadata.json sidecar plus the SGML-wrapped EDGAR documents — typically the primary N-CSR HTML and a short roster of certifying exhibits.

Who is required to file Form N-CSR?

Registered management investment companies under the Investment Company Act of 1940 — open-end mutual funds and ETFs registered on Form N-1A, and closed-end management investment companies registered on Form N-2 — file Form N-CSR. Business development companies, unit investment trusts, face-amount certificate companies, private funds, and foreign funds are out of scope and do not file N-CSR.

When must Form N-CSR be filed?

Under Rule 30b2-1, the registrant must file Form N-CSR no later than ten days after the shareholder report is transmitted to shareholders under Rule 30e-1. Because the shareholder report itself must be transmitted within 60 days after the close of the relevant fiscal period, the outside boundary for an N-CSR filing is roughly 70 days after period-end.

What time period does the dataset cover, and how is it distributed?

The dataset begins with the earliest EDGAR N-CSR filings from February 2003 and extends through the most recent refresh. It is distributed as monthly ZIP containers organized into year subdirectories, with file types limited to HTML, plain text, JSON, and occasional PDF.

How is this dataset different from Form N-PORT?

Form N-PORT delivers monthly position-level portfolio holdings as structured XML (with public disclosure on a delay every third month), while Form N-CSR delivers the certified period-end shareholder report — including audited or unaudited financial statements, the schedule of investments embedded in the report, management's discussion of fund performance, and SOX 302/906 officer certifications. Use N-PORT for systematic holdings analytics and intra-period changes; use N-CSR for audited financials, performance narrative, and the certified shareholder communication itself.

How are amendments (N-CSR/A) handled in the dataset?

Each Form N-CSR/A amendment is stored as its own record under its own accession folder rather than merged into the original filing. Both accessions are preserved — the latest amendment supersedes for the affected period and items, but the original remains the contemporaneous record. To reconstruct an amendment history, group records by cik and periodOfReport and chain originals to their amendments using filedAt ordering.