Form NSAR-A Files Dataset

The Form NSAR-A Files Dataset is the complete EDGAR collection of semi-annual reports filed by registered open-end and closed-end management investment companies under Section 30 of the Investment Company Act of 1940. Each record is one EDGAR submission of a Form NSAR-A (original mid-fiscal-year report) or Form NSAR-A/A (amendment), identified by its 18-character SEC accession number and packaged as a flat folder containing a metadata.json header alongside the line-coded answer.fil questionnaire body and any accompanying exhibits. The filer is the registered fund — the trust or corporation — not its adviser, distributor, or service providers. Coverage starts on January 1, 1994 with the earliest EDGAR submissions and continues to the present, with original filings concentrated through June 1, 2018 (when the SEC's investment company reporting modernization rulemaking retired Form N-SAR in favor of Form N-CEN and Form N-PORT) and a residual tail of late originals and NSAR-A/A amendments thereafter. The dataset is distributed as ZIP containers carrying TXT, JSON, HTML, XFD, FRM, FIL, and PDF documents.

Update Frequency
Daily
Updated at
2026-04-14
Earliest Sample Date
1994-01-01
Total Size
778.9 MB
Total Records
227,628
Container Format
ZIP
Content Types
TXT, JSON, HTML, XFD, FRM, FIL, PDF
Form Types
NSAR-A, NSAR-A/A

Dataset APIs

Programmatically retrieve the full list of dataset archive files, download URLs and dataset metadata.

Dataset Index JSON API

Download the entire dataset as a single archive file.

Download Entire Dataset:

Download a single container file (e.g. monthly archive) from the dataset.

Download Single Container:

Dataset Files

298 files · 778.9 MB
Download All
2018-12.zip2.7 KB1 records
2018-10.zip3.2 KB1 records
2018-08.zip599.1 KB57 records
2018-07.zip57.8 KB16 records
2018-06.zip214.0 KB38 records
2018-05.zip2.9 MB598 records
2018-04.zip3.6 MB623 records
2018-03.zip2.7 MB509 records
2018-02.zip1.1 MB207 records
2018-01.zip1.8 MB341 records
2017-12.zip1.2 MB264 records
2017-11.zip2.4 MB524 records
2017-10.zip2.2 MB346 records
2017-09.zip751.2 KB175 records
2017-08.zip9.0 MB1,684 records
2017-07.zip1.5 MB388 records
2017-06.zip4.9 MB958 records
2017-05.zip2.7 MB621 records
2017-04.zip2.7 MB490 records
2017-03.zip2.3 MB547 records
2017-02.zip1.1 MB186 records
2017-01.zip1.6 MB320 records
2016-12.zip1.4 MB288 records
2016-11.zip2.3 MB464 records
2016-10.zip1.3 MB277 records
2016-09.zip618.0 KB138 records
2016-08.zip9.7 MB1,620 records
2016-07.zip1.5 MB384 records
2016-06.zip5.3 MB975 records
2016-05.zip2.8 MB638 records
2016-04.zip2.7 MB493 records
2016-03.zip2.4 MB545 records
2016-02.zip1.9 MB324 records
2016-01.zip1.5 MB295 records
2015-12.zip1.2 MB281 records
2015-11.zip2.2 MB461 records
2015-10.zip1.5 MB249 records
2015-09.zip665.4 KB139 records
2015-08.zip10.2 MB1,568 records
2015-07.zip2.1 MB453 records
2015-06.zip5.5 MB1,001 records
2015-05.zip2.3 MB544 records
2015-04.zip3.2 MB625 records
2015-03.zip2.2 MB474 records
2015-02.zip1.2 MB223 records
2015-01.zip1.9 MB410 records
2014-12.zip1.6 MB387 records
2014-11.zip2.2 MB408 records
2014-10.zip1.5 MB369 records
2014-09.zip660.5 KB149 records
2014-08.zip9.2 MB1,485 records
2014-07.zip2.1 MB381 records
2014-06.zip5.8 MB856 records
2014-05.zip2.5 MB605 records
2014-04.zip3.2 MB601 records
2014-03.zip2.0 MB391 records
2014-02.zip1.1 MB213 records
2014-01.zip2.0 MB355 records
2013-12.zip1.4 MB285 records
2013-11.zip2.4 MB514 records
2013-10.zip4.7 MB774 records
2013-09.zip1.2 MB224 records
2013-08.zip9.7 MB1,543 records
2013-07.zip2.3 MB500 records
2013-06.zip4.0 MB739 records
2013-05.zip3.0 MB603 records
2013-04.zip3.2 MB560 records
2013-03.zip2.1 MB425 records
2013-02.zip1.2 MB211 records
2013-01.zip1.5 MB349 records
2012-12.zip1.1 MB241 records
2012-11.zip2.4 MB497 records
2012-10.zip3.7 MB467 records
2012-09.zip745.9 KB166 records
2012-08.zip9.0 MB1,447 records
2012-07.zip1.3 MB371 records
2012-06.zip6.0 MB936 records
2012-05.zip2.3 MB544 records
2012-04.zip3.0 MB578 records
2012-03.zip1.7 MB340 records
2012-02.zip1.4 MB306 records
2012-01.zip1.8 MB359 records
2011-12.zip1.2 MB226 records
2011-11.zip2.2 MB461 records
2011-10.zip1.5 MB400 records
2011-09.zip677.0 KB175 records
2011-08.zip8.5 MB1,408 records
2011-07.zip1.4 MB377 records
2011-06.zip4.3 MB888 records
2011-05.zip3.0 MB602 records
2011-04.zip2.6 MB549 records
2011-03.zip1.6 MB351 records
2011-02.zip794.2 KB185 records
2011-01.zip1.4 MB306 records
2010-12.zip1.2 MB263 records
2010-11.zip2.7 MB562 records
2010-10.zip1.2 MB209 records
2010-09.zip823.8 KB202 records
2010-08.zip8.9 MB1,614 records
2010-07.zip1.5 MB395 records
2010-06.zip6.1 MB1,180 records
2010-05.zip2.1 MB494 records
2010-04.zip2.9 MB589 records
2010-03.zip2.4 MB418 records
2010-02.zip1.2 MB263 records
2010-01.zip6.0 MB922 records
2009-12.zip904.6 KB234 records
2009-11.zip2.7 MB538 records
2009-10.zip1.1 MB253 records
2009-09.zip629.1 KB137 records
2009-08.zip9.2 MB1,682 records
2009-07.zip1.5 MB354 records
2009-06.zip5.6 MB1,115 records
2009-05.zip2.4 MB514 records
2009-04.zip2.8 MB648 records
2009-03.zip1.7 MB343 records
2009-02.zip1.1 MB270 records
2009-01.zip1.2 MB302 records
2008-12.zip1.3 MB310 records
2008-11.zip2.0 MB391 records
2008-10.zip880.7 KB180 records
2008-09.zip687.5 KB138 records
2008-08.zip8.9 MB1,728 records
2008-07.zip1.9 MB398 records
2008-06.zip4.5 MB993 records
2008-05.zip2.3 MB560 records
2008-04.zip2.1 MB519 records
2008-03.zip1.6 MB387 records
2008-02.zip1.3 MB301 records
2008-01.zip1.9 MB507 records
2007-12.zip963.6 KB262 records
2007-11.zip2.6 MB564 records
2007-10.zip1.6 MB290 records
2007-09.zip876.3 KB166 records
2007-08.zip10.4 MB1,938 records
2007-07.zip1.8 MB426 records
2007-06.zip5.5 MB1,213 records
2007-05.zip2.5 MB584 records
2007-04.zip2.8 MB597 records
2007-03.zip2.0 MB515 records
2007-02.zip872.5 KB227 records
2007-01.zip1.5 MB432 records
2006-12.zip894.3 KB217 records
2006-11.zip2.7 MB603 records
2006-10.zip1.1 MB244 records
2006-09.zip742.7 KB168 records
2006-08.zip9.5 MB1,845 records
2006-07.zip2.0 MB473 records
2006-06.zip6.1 MB1,148 records
2006-05.zip3.5 MB760 records
2006-04.zip2.4 MB580 records
2006-03.zip2.3 MB584 records
2006-02.zip759.3 KB198 records
2006-01.zip1.4 MB448 records
2005-12.zip869.3 KB236 records
2005-11.zip3.3 MB614 records
2005-10.zip963.7 KB248 records
2005-09.zip1.2 MB312 records
2005-08.zip11.1 MB1,917 records
2005-07.zip1.7 MB436 records
2005-06.zip5.3 MB1,260 records
2005-05.zip2.8 MB751 records
2005-04.zip3.1 MB750 records
2005-03.zip1.6 MB380 records
2005-02.zip976.6 KB229 records
2005-01.zip1.4 MB369 records
2004-12.zip756.8 KB199 records
2004-11.zip2.3 MB559 records
2004-10.zip640.0 KB166 records
2004-09.zip887.9 KB230 records
2004-08.zip9.6 MB1,789 records
2004-07.zip1.5 MB394 records
2004-06.zip5.3 MB1,255 records
2004-05.zip1.7 MB430 records
2004-04.zip3.7 MB559 records
2004-03.zip3.9 MB486 records
2004-02.zip1.2 MB332 records
2004-01.zip1.6 MB336 records
2003-12.zip1.1 MB251 records
2003-11.zip1.8 MB419 records
2003-10.zip690.1 KB137 records
2003-09.zip982.2 KB165 records
2003-08.zip8.3 MB1,562 records
2003-07.zip1.4 MB314 records
2003-06.zip4.1 MB925 records
2003-05.zip2.8 MB882 records
2003-04.zip2.4 MB739 records
2003-03.zip1.8 MB607 records
2003-02.zip1.6 MB555 records
2003-01.zip1.7 MB530 records
2002-12.zip1.0 MB294 records
2002-11.zip2.5 MB753 records
2002-10.zip899.1 KB260 records
2002-09.zip665.2 KB264 records
2002-08.zip7.7 MB1,544 records
2002-07.zip2.1 MB390 records
2002-06.zip3.6 MB782 records
2002-05.zip2.0 MB481 records
2002-04.zip2.2 MB428 records
2002-03.zip1.4 MB311 records
2002-02.zip1.3 MB310 records
2002-01.zip1.2 MB259 records
2001-12.zip907.4 KB184 records
2001-11.zip1.7 MB389 records
2001-10.zip1.8 MB346 records
2001-09.zip825.3 KB158 records
2001-08.zip6.9 MB1,429 records
2001-07.zip1.7 MB347 records
2001-06.zip4.8 MB999 records
2001-05.zip1.9 MB423 records
2001-04.zip1.8 MB403 records
2001-03.zip1.7 MB469 records
2001-02.zip1.0 MB272 records
2001-01.zip1.3 MB405 records
2000-12.zip1.4 MB744 records
2000-11.zip3.5 MB1,810 records
2000-10.zip1.4 MB671 records
2000-09.zip1.7 MB901 records
2000-08.zip14.2 MB6,814 records
2000-07.zip3.0 MB1,466 records
2000-06.zip8.2 MB4,246 records
2000-05.zip4.1 MB2,323 records
2000-04.zip2.6 MB1,553 records
2000-03.zip1.9 MB1,120 records
2000-02.zip3.1 MB1,792 records
2000-01.zip1.8 MB1,041 records
1999-12.zip1.4 MB816 records
1999-11.zip2.8 MB1,718 records
1999-10.zip1.2 MB654 records
1999-09.zip1.7 MB955 records
1999-08.zip10.9 MB5,786 records
1999-07.zip2.9 MB1,582 records
1999-06.zip6.6 MB3,687 records
1999-05.zip3.2 MB1,872 records
1999-04.zip2.9 MB1,484 records
1999-03.zip2.6 MB1,485 records
1999-02.zip2.6 MB1,199 records
1999-01.zip1.3 MB718 records
1998-12.zip1.8 MB979 records
1998-11.zip2.1 MB1,143 records
1998-10.zip1.2 MB655 records
1998-09.zip1.5 MB856 records
1998-08.zip11.0 MB5,328 records
1998-07.zip2.6 MB1,443 records
1998-06.zip5.8 MB3,054 records
1998-05.zip2.9 MB1,644 records
1998-04.zip2.6 MB1,296 records
1998-03.zip2.1 MB1,207 records
1998-02.zip3.0 MB1,293 records
1998-01.zip1.3 MB736 records
1997-12.zip1.5 MB847 records
1997-11.zip2.3 MB1,309 records
1997-10.zip1.1 MB587 records
1997-09.zip1.4 MB797 records
1997-08.zip9.4 MB5,065 records
1997-07.zip2.4 MB1,307 records
1997-06.zip4.5 MB2,494 records
1997-05.zip2.6 MB1,494 records
1997-04.zip2.4 MB1,323 records
1997-03.zip1.6 MB924 records
1997-02.zip2.2 MB1,237 records
1997-01.zip1.1 MB578 records
1996-12.zip1.1 MB608 records
1996-11.zip2.1 MB1,065 records
1996-10.zip1.2 MB673 records
1996-09.zip1.3 MB717 records
1996-08.zip8.4 MB4,389 records
1996-07.zip2.3 MB1,255 records
1996-06.zip3.7 MB1,993 records
1996-05.zip2.8 MB1,495 records
1996-04.zip2.0 MB1,034 records
1996-03.zip1.7 MB970 records
1996-02.zip2.2 MB1,115 records
1996-01.zip1.1 MB572 records
1995-12.zip852.7 KB425 records
1995-11.zip1.6 MB810 records
1995-10.zip876.8 KB453 records
1995-09.zip1.1 MB568 records
1995-08.zip6.1 MB2,950 records
1995-07.zip1.6 MB796 records
1995-06.zip4.0 MB1,689 records
1995-05.zip2.0 MB992 records
1995-04.zip1.4 MB711 records
1995-03.zip1.4 MB711 records
1995-02.zip757.6 KB287 records
1995-01.zip600.9 KB260 records
1994-12.zip605.3 KB273 records
1994-11.zip773.3 KB309 records
1994-10.zip483.2 KB212 records
1994-09.zip590.5 KB202 records
1994-08.zip2.1 MB544 records
1994-07.zip494.5 KB121 records
1994-06.zip1.2 MB297 records
1994-05.zip791.3 KB203 records
1994-04.zip531.4 KB128 records
1994-03.zip478.8 KB124 records
1994-02.zip341.2 KB94 records
1994-01.zip243.3 KB62 records

What This Dataset Contains

The dataset captures every Form NSAR-A and Form NSAR-A/A submission accepted by EDGAR from January 1994 to the present. Form N-SAR was the SEC's standardized questionnaire-style report for registered management investment companies and small business investment companies; the "-A" suffix designates the mid-fiscal-year (six-month interim) installment, with the year-end installment filed on Form NSAR-B. Registrants responded to a fixed schedule of numbered questions ("items") using binary flags, single-letter codes, numeric figures, and short uppercase text strings, producing a highly structured machine-readable record of organizational identity, service-provider relationships, capital structure, sales and redemption activity, portfolio composition and transactions, expenses and fee arrangements, monthly average net assets, and compliance attestations.

The N-SAR body was always filed as a flat ASCII text document wrapped in the EDGAR SGML document envelope. The form was never converted to HTML, never wrapped in XBRL, and never adopted inline XBRL, because the fixed-column item structure already supplied the machine-readable structure those formats provide elsewhere on EDGAR. Supplementary exhibits — auditor attestations, internal-control reports under Rule 17f-2, opinion letters, and similar attachments — could accompany the primary answer file as TXT, HTM/HTML, PDF, XFD, or FRM documents. The questionnaire schedule remained substantially stable from EDGAR's adoption in the early 1990s through the form's retirement, giving the dataset unusually consistent record anatomy across roughly a quarter-century.

Content Structure of a Single NSAR-A Record

A single record in the Form NSAR-A Files dataset is one complete EDGAR submission, physically organized as an accession-named folder (dashes stripped) that holds a metadata.json describing the submission together with the document files that EDGAR received for it. Logically, the record captures one semi-annual report by a registered investment management company under Section 30(b)(1) of the Investment Company Act of 1940 covering the first six months of the registrant's fiscal year — or, in the case of NSAR-A/A, a subsequent amendment to such a report.

Each accession folder bundles two layers of content in a flat directory layout (no nested subfolders):

  1. A metadata.json derived from the EDGAR submission header, providing canonical filing identifiers, registrant identity, period of report, filer/series/class identifiers, and an enumeration of every document file in the submission.
  2. The submission documents themselves. The primary document is almost always answer.fil — the SGML-wrapped, line-coded N-SAR questionnaire body — accompanied, when present, by exhibit files (TXT, HTM/HTML, PDF, XFD, FRM) and by the rolled-up complete-submission .txt produced by EDGAR.

metadata.json structure

metadata.json is the structured spine of every record. Its meaningful fields are:

  • formType"NSAR-A" for an original semi-annual report or "NSAR-A/A" for an amendment.
  • accessionNo — the EDGAR accession number in its canonical dashed form, e.g. "0001140361-18-045903".
  • filedAt — ISO-8601 acceptance timestamp from EDGAR, including the Eastern-time offset.
  • periodOfReport — the report's period-end date in YYYY-MM-DD form, corresponding to the six-month interim cut-off.
  • description — short human label such as "Form NSAR-A - Semi-annual report for management companies".
  • linkToFilingDetails — direct URL to the primary submission document on EDGAR.
  • linkToTxt — URL to the rolled-up complete-submission .txt on EDGAR.
  • linkToHtml — URL to the EDGAR filing-index page (-index.htm).
  • linkToXbrl — present in the shared schema but unused for N-SAR filings.
  • id — internal 32-character hexadecimal identifier for the record.
  • documentFormatFiles — array of every document in the EDGAR submission. Each element carries sequence (submission ordinal as a string, blank for the rolled-up complete-submission text), size (uncompressed byte count as a string), documentUrl, type (EDGAR document-type label such as "NSAR-A"), and description.
  • dataFiles — array of supplementary structured data files; typically empty for N-SAR, since the questionnaire is delivered as a single coded text body rather than as separate data files.
  • entities — array of EDGAR entity records associated with the submission. Each entity carries companyName with a role suffix (e.g. "… (Filer)"), zero-padded ten-digit cik, type (form type recorded against the entity), act (governing statute, "40" for the Investment Company Act of 1940), fileNo (the SEC file number, generally in the 811- series for registered investment companies), filmNo (EDGAR film number), irsNo (employer identification number), and fiscalYearEnd in MMDD form.
  • seriesAndClassesContractsInformation — array of investment-company series and class-contract identifiers (CIK, series S… identifier, class-contract C… identifier). Populated when the filer is a series trust; empty for stand-alone funds and for filings predating the SEC's series/class identifier framework.

answer.fil — the line-coded NSAR-A body

The primary document is a flat ASCII text file conventionally named answer.fil, wrapped in the EDGAR SGML document envelope:

1 <DOCUMENT>
2 <TYPE>NSAR-A
3 <SEQUENCE>1
4 <FILENAME>answer.fil
5 <DESCRIPTION>NSAR-A
6 <TEXT>
7 <PAGE> PAGE 1
8 ... line-coded answers ...
9 <PAGE> PAGE 2
10 ...
11 SIGNATURE <name>
12 TITLE <role>
13 </TEXT>
14 </DOCUMENT>

Inside the <TEXT> block, each non-pagination line is a fixed-column response to one N-SAR item. The general pattern is a three-digit item number, an optional sub-item letter, an optional numeric sub-index for repeating rows or sub-fields, and a value. For example, an adviser name on row 2 of item 8 sub-item A appears as 008 A000002 followed by the value in the response column; flag responses appear as Y or N; numeric responses appear as right-justified integers or decimals; short text responses appear in uppercase. Repeating items emit successive lines sharing the same item/sub-item prefix and incrementing the trailing ordinal. <PAGE> markers and the literal PAGE n text are pagination artifacts and carry no semantic content.

Item-by-item structure

Form N-SAR is organized as a numbered questionnaire running from item 1 through item 133, with sub-items addressed by letters (and, where needed, numeric sub-indices). The recurring blocks inside an NSAR-A answer body are:

  • Header block (item 1 and preliminary identifiers). Reporting-period dates, registrant name, CIK, file number, amendment flag, fiscal-year-end indicator, interim-vs-annual indicator, audited-financials flag, and filer-type/structure flags. This block re-states the high-level filing identity inside the answer body itself.
  • Items 2–6 — registrant address and organizational structure. Street, city, state and ZIP, principal telephone number, and yes/no flags for series-company status, multiple-class status, master-feeder status, and similar structural attributes.
  • Item 7 — investment advisers (repeating). Sub-items capture adviser name, SEC file number, affiliation code, and address components for each adviser; multiple advisers are enumerated by the row index.
  • Items 8–11 — sub-advisers, principal underwriters, and affiliations. Repeated blocks following the same name/file-number/affiliation/address pattern.
  • Items 12–18 — custodians, sub-custodians, transfer agents, shareholder-servicing agents, administrators, independent public accountant, and other principal service providers. Names, addresses, and affiliation/relationship codes.
  • Items 19–27 — capital structure, share-class attributes, and holders. Number of holders of record, share classes offered, fund-of-funds and money-market indicators, and related structural flags.
  • Items 28–35 — sales, redemptions, and repurchases. Monthly share-sale and redemption activity, broken out by load/no-load class and by month-of-period; includes contingent deferred sales loads and exchanges.
  • Items 36–41 — portfolio transactions in tax-exempt obligations and other repeating issuer-level tables. Repeated per issuer with sub-items for issuer name, purchases, and sales.
  • Items 42–53 — distribution arrangements, 12b-1 plan attributes, sales loads, and brokerage allocation. Includes the 12b-1 plan attribute matrix and dealer-concession structure.
  • Items 54–61 — management fee and advisory-fee structure. Includes the management-fee breakpoint ladder with sub-items addressing each tier and the corresponding rate, plus performance-fee and waiver indicators.
  • Items 62–69 — portfolio composition, asset categories, and policy limits. Percentages by asset category, average portfolio maturity (for money-market funds), and policy-limit caps.
  • Item 70 — portfolio-transaction matrix. A two-column (purchases / sales) by multi-row (transaction-type) matrix capturing portfolio activity by instrument class.
  • Items 71–75 — aggregate portfolio activity, sales-load income, total expenses by category, NAV components, and monthly average net assets.
  • Item 76 — period-end net asset value per share.
  • Items 77–88 — compliance and disclosure flags, fidelity-bond information, errors-and-omissions coverage, exemptive-order references, and additional attestations. Several of these items are the cross-reference hooks for attached exhibits (auditor reports, internal-control reports, accountant opinion letters, fidelity-bond detail).
  • Items 89–133 — series-level and money-market-specific schedules. For series companies and money-market funds, additional repeating blocks at the bottom of the form report per-series identifiers and per-series financial detail (assets, sales, redemptions, NAV, expenses). Stand-alone funds and non-money-market filers skip the items that do not apply.
  • Signature trailer. Two uppercase lines, SIGNATURE <name> and TITLE <role>, precede the closing </TEXT> and </DOCUMENT> tags.

Items that do not apply to a given filer (for example, item ranges addressing 12b-1 plan attributes for a fund without a 12b-1 plan, money-market-only items for a non-money-market fund, or per-series schedules for a stand-alone trust) are simply absent from the answer body rather than emitted with empty values. Consumers should treat missing item lines as "not applicable" rather than as zero or false.

Included document files

The accession folder always includes metadata.json and the primary .fil answer document. Depending on the filing, it may additionally include:

  • The rolled-up complete-submission text file — a .txt that wraps every document of the submission inside the EDGAR <SEC-DOCUMENT> header — enumerated in documentFormatFiles with a blank sequence.
  • One or more text exhibits (.txt) containing auditor opinions, internal-control attestations under Rule 17f-2, fidelity-bond detail, or other narrative attachments.
  • HTML exhibits (.htm, .html) carrying the same exhibit roles when the filer attached HTML rather than plain text.
  • PDF exhibits (.pdf) for scanned or rendered documents such as signed accountant letters.
  • .xfd files (EDGAR's XFDL form-data format) used by certain N-SAR submissions for structured form-data attachments.
  • .frm files representing EDGAR FRM-format exhibits.

The file-types found in the dataset are TXT, JSON, HTML, XFD, FRM, FIL, and PDF. The overwhelming majority of NSAR-A records contain only metadata.json plus answer.fil; richer exhibit assemblages appear in a minority of filings and sit flat inside the accession folder alongside the primary document.

Excluded content

Image attachments (GIF, JPG, and similar binary image formats) that may have accompanied an EDGAR submission are not carried in the dataset. The dataset is otherwise the verbatim, decompressed copy of the EDGAR submission documents. EDGAR-side correspondence, the filer's working files, and any document not submitted as part of the original EDGAR package are likewise outside the record. There is no per-share-class or per-holding breakout beyond what the questionnaire itself encodes; portfolio detail at the individual security level lives in separate reporting regimes (Form N-Q, later Form N-PORT) and is not present here.

Amendments (NSAR-A/A)

Amendments are emitted as independent records under their own accession numbers with formType set to "NSAR-A/A". The internal structure mirrors the original NSAR-A: a metadata.json plus an answer.fil carrying the amended line-coded answer set and any re-attached exhibits. The amendment indicator inside the answer body's header block is set accordingly. Because each amendment carries its own accession number and metadata.json, the link between an amendment and the filing it amends is not encoded as an explicit foreign key inside the record; it is reconstructed by matching the registrant CIK (or series identifier) and the periodOfReport. Successive amendments to the same period appear as separate records sharing CIK and period-end but differing in accessionNo, filedAt, and formType. Point-in-time reconstructions of a registrant's most recently amended semi-annual report require sorting same-period records by filedAt and taking the latest.

Historical evolution

The body file format is stable across the dataset's span: a flat ASCII document inside an SGML envelope, line-coded by item number, signed off in uppercase. What evolved was the population of supplementary exhibit formats — early-era filings tend to ship as answer.fil plus, occasionally, plain-text exhibits, while filings from the 2000s and 2010s more often include HTML, PDF, XFD, and FRM exhibits for auditor attestations and internal-control reports. The series/class identifier system (S… for series, C… for class contracts) became meaningful only after the SEC introduced the framework in the mid-2000s; earlier filings carry an empty seriesAndClassesContractsInformation array. The discontinuation of the form after June 1, 2018 produces a sharp drop-off in record volume after mid-2018, with the residual tail comprising late originals and amendments.

Interpretation and extraction notes

  • The answer body is line-oriented and column-sensitive: parsers should treat each non-<PAGE> line as a tuple of (item number, sub-item letter, numeric sub-index, value) rather than as free text. The <PAGE> markers and accompanying PAGE n strings are pagination artifacts of the SGML envelope.
  • Repeating items (advisers, sub-advisers, underwriters, custodians, transfer agents, fee-tier breakpoints, per-issuer purchases and sales, per-series schedules) are enumerated by the numeric sub-index. Consumers reconstructing tables should group lines by the (item, sub-item) prefix and use the sub-index as the row ordinal.
  • Several items reuse the same column codes to encode month-of-period or share-class breakdowns; the meaning of those columns is item-specific and must be read against the official N-SAR instructions for the item in question.
  • Numeric values are recorded as filed, without unit annotations inside the file. Some financial figures are reported in whole dollars and others in thousands per the form's instructions; correct interpretation depends on the item.
  • The signature block at the foot of the answer body is the only natural-language content of the questionnaire; it identifies the signing officer and title but does not duplicate the registrant or service-provider data already coded into the earlier items.
  • entities[].companyName carries a parenthetical role suffix (e.g. "(Filer)") that distinguishes the registrant from other associated entities recorded by EDGAR; downstream joins on company name should strip this suffix.
  • For series trusts, the registrant identified at the top of metadata.json is the trust as a whole, while per-series identifiers are carried in seriesAndClassesContractsInformation and the per-series financial schedules sit at the tail of the answer body. Joining a specific series' financial detail therefore requires correlating the series S… identifier in metadata with the series-numbered items inside answer.fil.

Who Files or Publishes This Dataset, and When

Who files

The filer is the registered investment management company itself (the trust or corporation), not its adviser, distributor, or service providers. The CIK on the submission belongs to the fund registrant.

Two filer classes use NSAR-A:

Both filer classes use the same form, answering conditional items based on type.

Entities outside the NSAR-A population:

Regulatory framework

Authority comes from Section 30 of the Investment Company Act of 1940 (15 U.S.C. 80a-29), which empowers the SEC to require periodic reports from registered investment companies. The implementing rules during the N-SAR era were:

  • Rule 30a-1, which required management investment companies to file semi-annual reports on Form N-SAR (later amended by the reporting modernization rulemaking to require Form N-CEN instead).
  • Rule 30b1-1, governing the semi-annual filing cadence for management companies.

N-SAR was a structured, fixed-field data collection covering portfolio composition (broad categories), fees, sales loads, distribution arrangements, service providers, board composition, expense ratios, and operational metrics — not narrative disclosure.

Filing trigger and deadline

NSAR-A is schedule-driven, not event-driven. The trigger is the close of the first six months of the registrant's fiscal year (the mid-year semi-annual period). There is no materiality test or transaction threshold.

The deadline is 60 days after the end of the reporting period. For a December 31 fiscal year, the mid-year period ends June 30 and NSAR-A is due around August 29. NSAR-B (the annual N-SAR covering the full fiscal year, including audit-related items) is also due 60 days after fiscal year end.

Because deadlines track each registrant's fiscal year, NSAR-A filings appear throughout the calendar year, clustering after common fiscal year-ends (March, June, October, December).

Historical lifecycle

Form N-SAR was adopted in 1985, replacing predecessor Forms N-1R and N-1Q. EDGAR submissions begin in the mid-1990s; the dataset's earliest records date to January 1994. Pre-EDGAR paper filings sit in SEC paper archives outside this dataset.

The form was rescinded effective June 1, 2018, under the SEC's investment company reporting modernization rulemaking (Investment Company Act Release No. 32314, adopted October 13, 2016). After that date, registrants no longer file new NSAR-A or NSAR-B reports. N-SAR's data collection was replaced by:

  • Form N-CEN, an annual XML census report, and
  • Form N-PORT, a monthly XML portfolio holdings report.

The dataset is therefore historically bounded: original NSAR-A filings run from 1994 through mid-2018, with any later submissions being amendments of pre-2018 periods.

Amendments (NSAR-A/A)

Form NSAR-A/A is the amendment variant. It is filed by the same registrant to correct, restate, or supplement a previously filed NSAR-A covering the same six-month period. Amendments may be triggered by:

  • clerical or data-entry corrections shortly after the original,
  • later restatements due to audit adjustments or service-provider corrections,
  • responses to SEC staff comments.

Each NSAR-A/A is a separate EDGAR submission and a separate record in this dataset. There is no statutory cap on how late an amendment may be filed, so NSAR-A/A submissions can post-date the June 1, 2018 rescission when they correct pre-rescission periods.

Important distinctions

  • Filer versus reported parties. The registrant files; advisers, sub-advisers, custodians, transfer agents, principal underwriters, administrators, auditors, and directors are described in the form but do not file it.
  • Registrant versus series. A single registrant may comprise many series; one NSAR-A covers the registrant and answers series-level items inside.
  • NSAR-A versus NSAR-B. Same filer population; NSAR-A is the mid-year semi-annual, NSAR-B is the annual filing with audit-related items.
  • NSAR-A versus NSAR-U. NSAR-U is the UIT variant; UITs are not management companies.
  • N-SAR versus N-CSR/N-CSRS. N-CSR (annual) and N-CSRS (semi-annual) carry the certified shareholder financial statements and remain in force; N-SAR carried structured operational data and was retired in 2018.
  • N-SAR versus N-Q. N-Q (quarterly portfolio holdings, later replaced by N-PORT) is a separate operational report and not part of the N-SAR regime.

How This Dataset Differs From Similar Datasets or Filings

Form NSAR-A sits within a tightly clustered family of investment-company filings. The sharpest comparisons are with its N-SAR siblings, the post-2018 successor forms (N-CEN, N-PORT), the shareholder-report regime (N-CSR/N-CSRS, N-Q), and the registration-statement family (N-1A/N-2/N-3). The differences below are organized around timing, filer population, content scope, and format.

Form NSAR-B (annual report for management investment companies)

The single closest comparison. Same filer population, same statutory basis (Section 30 of the 1940 Act), same line-coded answer.fil format, and largely the same line-item universe.

Key differences:

  • Period: NSAR-A covers the first six months of the fiscal year; NSAR-B covers the full fiscal year.
  • Scope: NSAR-B adds annual-only items absent from NSAR-A — auditor identification, internal-control matters, fidelity-bond coverage, and certain affiliate-transaction disclosures.
  • Use together: NSAR-A and NSAR-B are complements, not substitutes; only the pair yields within-year change.

Form NSAR-U (semi-annual/annual report for unit investment trusts)

Same statute and cadence, but a disjoint filer population: UITs rather than management investment companies. Because UITs lack an active adviser and managed portfolio, NSAR-U omits or restructures items on portfolio turnover, adviser compensation, and management-company governance. NSAR-A contains no UIT data, and NSAR-U contains no management-company data — treat them as non-overlapping by filer type.

Forms NSAR-AT and NSAR-BT (transitional filings)

Same content fields as NSAR-A/NSAR-B respectively, but filed for stub periods triggered by fiscal-year changes. The reporting interval is non-standard and the filings appear sporadically. For panel analysis, NSAR-AT observations must be reconciled to the underlying fiscal-year change; mixing them into a regular semi-annual series distorts period-over-period comparisons. The same caveats apply to NSAR-BT filings.

Form N-CEN (successor annual census report, post-2018)

The direct successor to NSAR-B on the operational/census axis. Key contrasts with NSAR-A:

  • Cadence: annual, not semi-annual — N-CEN has no semi-annual counterpart, so NSAR-A's mid-year cadence simply ended.
  • Format: structured XML, not legacy answer.fil.
  • Content: reorganized census-style questions on structure, service providers, and securities lending; field mappings to NSAR-A are partial, not one-to-one.
  • Timing: N-CEN begins June 2018; NSAR-A ends June 1, 2018. The series do not overlap.

Form N-PORT (successor monthly portfolio holdings report)

Clarifies what NSAR-A is not. N-PORT is filed monthly in XML and reports position-level holdings, derivatives exposure, liquidity classifications, and risk metrics. NSAR-A contains only aggregate operational metrics (totals, ratios, turnover rates) — no security-by-security holdings. Pre-2018 holdings cannot be reconstructed from NSAR-A; researchers must use N-CSR/N-CSRS or the former N-Q.

Forms N-CSR and N-CSRS (certified shareholder reports)

N-CSRS shares NSAR-A's semi-annual cadence and filer base, but the documents differ in audience and content:

  • N-CSR/N-CSRS: shareholder-facing narrative, full financial statements, schedule of investments, MD&A, auditor's report (N-CSR only).
  • NSAR-A: regulator-facing, coded numeric line items, no narrative, no audited statements, no holdings.

They are complements for the same period: N-CSRS provides narrative and holdings, NSAR-A provides structured operating metrics. Neither substitutes for the other.

Form N-Q (former quarterly holdings report, 2004–2019)

Rescinded like NSAR-A, but content was holdings-focused, filling the off-cycle gap between N-CSR and N-CSRS. Its successor is N-PORT, not N-CEN. The comparison highlights how the legacy regime split disclosure across narrow forms (NSAR-A/B for operations, N-CSR/N-CSRS for shareholder reports, N-Q for interim holdings) — a structure consolidated post-2018 into N-CEN plus N-PORT.

Forms N-1A, N-2, and N-3 (registration statements)

Same filer universe (open-end funds, closed-end funds, and separate accounts organized as management companies), but a different disclosure stage. Registration statements are forward-looking prospectus documents describing objectives, strategies, risks, fees, and governance. NSAR-A is a backward-looking operational return reporting actual mid-year results. Fee tables in N-1A and expense ratios in NSAR-A are cross-referenceable, but N-1A states what the fund intends; NSAR-A states what it did. The same relationship holds for N-2 (closed-end registration) and N-3 (separate-account registration).

Boundary summary

NSAR-A is distinct by the combination of four properties that no other dataset reproduces together:

  1. Semi-annual cadence covering the first half of a management investment company's fiscal year.
  2. Operational/financial line-item scope — not narrative, not holdings, not registration content.
  3. Legacy answer.fil format requiring line-number-aware parsing rather than XML or XBRL.
  4. Closed historical window ending June 1, 2018 — a finite, fully bounded corpus.

NSAR-B is the closest complement within the legacy regime; N-CEN is the closest successor on the operational axis (annual only); N-PORT and the former N-Q occupy a holdings axis NSAR-A never covered; N-CSR/N-CSRS occupy the shareholder-narrative axis; and N-1A/N-2/N-3 occupy the offering-document axis. For mid-year operational data on management investment companies between 1994 and 2018, no alternative substitutes for NSAR-A.

Who Uses This Dataset

NSAR-A captures semi-annual operating data for registered open-end and closed-end funds from 1994 through mid-2018, when N-CEN and N-PORT replaced it. The dataset is used by professionals who need a continuous, series-level view of the pre-modernization U.S. fund industry — flows, fees, turnover, NAV, service providers, and compliance attestations.

Academic researchers in fund economics

Used for peer-reviewed work on fee dispersion, scale economies, flow-performance sensitivity, and after-cost returns. Researchers pull Items 028 to 031 (sales, redemptions, exchanges) for flow panels, Items 042 to 048 (management fees, 12b-1, expenses) for fee studies, Items 070 to 072 (purchases, sales, turnover) for trading-cost work, and Items 074 to 076 (NAV, total net assets) for size normalization.

Quantitative analysts at asset managers

Quants backfill fund-characteristic factors that commercial vendors only carry from later dates. They use Items 074 to 076 for long-run AUM series, Items 028 to 031 for flow-momentum and capacity models, Items 070 to 072 for turnover features, and Items 042 to 048 for net-of-fee performance reconstruction. Output feeds factor pipelines and active-strategy capacity models.

Fund compliance and outside counsel

Used to reconstruct registrant operating history for SEC examinations, fee litigation defense, and reorganization due diligence. Items 001 to 008 trace registrant, series, and class identifiers across successions and mergers. Items 010 to 020 evidence adviser, sub-adviser, underwriter, custodian, transfer agent, and auditor history. Item 077 attestations on sales loads, 12b-1 plans, Rule 17a-7, and affiliated brokerage are reviewed alongside the NSAR-A/A amendment population to verify restatements.

Forensic accountants and damages experts

Used as primary source material in excessive-fee, market-timing, and late-trading cases when contemporaneous records are unavailable. Items 042 to 048 establish expense-ratio paths for Section 36(b) damages; Items 028 to 031 quantify flow consequences of alleged disclosure failures; Items 070 to 072 support trading-behavior analysis; Items 010 to 020 identify responsible service providers; Items 074 to 076 supply the asset base for fee-damages calculations. Output feeds expert reports and deposition exhibits.

Data engineers building fund history panels

Used as the upstream feed for pre-2018 series-level panels, bridged to N-CEN and N-PORT for 2018-forward continuity. Engineers parse identification (001 to 008), service providers (010 to 020), flows (028 to 031), expenses (042 to 048), portfolio activity (070 to 072), and NAV/assets (074 to 076) into normalized tables keyed by CIK, series ID, and class ID, so downstream consumers query one longitudinal panel without a 2018 gap.

Regulatory economists and examination staff

Used in retrospective rule reviews and cost-benefit assessments of the reporting modernization rulemaking. Items 042 to 048 support long-run fee and 12b-1 trend studies; Items 070 to 072 inform turnover distributions; Items 028 to 031 quantify historical flow patterns; compliance-attestation items reveal exemptive-order reliance and affiliated-transaction activity. Feeds sweep exam targeting and economic analyses for new rule proposals.

Fund consultants and competitive-intelligence analysts

Used to fill series-level historical fields missing from commercial fund databases. Items 074 to 076 build AUM histories, Items 042 to 048 reconstruct fee schedules and 12b-1 splits, Items 028 to 031 produce organic-growth and redemption metrics, Items 070 to 072 support peer turnover comparisons, and Items 010 to 020 feed service-provider league tables. Output supports board 15(c) fee benchmarking, RFP responses, and long-history industry studies.

LLM and retrieval-system developers

Used to give models structured, citation-ready access to pre-2018 fund operating data. Item-numbered fields are indexed so queries about fee history, flow history, turnover, or service-provider succession at the series level resolve to specific filed records, supporting compliance research copilots and literature-review tools that ground answers in NSAR-A facts.

These users share one requirement: a continuous, series-level view of registered fund operations from 1994 to 2018 that N-CEN and N-PORT do not cover. They pull different item ranges — flows in 028 to 031, expenses in 042 to 048, portfolio activity in 070 to 072, assets in 074 to 076, and service providers in 010 to 020 — but all depend on NSAR-A to make the pre-modernization era analytically usable.

Specific Use Cases

The NSAR-A corpus supports a small set of concrete workflows that depend on its line-coded semi-annual operating data for registered management investment companies from 1994 through mid-2018.

Building a pre-2018 fund flow panel

Parse Items 028 to 031 from answer.fil (monthly share sales, redemptions, repurchases, and exchanges) and join to metadata.json keys (entities[].cik, seriesAndClassesContractsInformation[], periodOfReport) to assemble a series-level six-month flow panel. Combined with the NSAR-B annual counterpart, this produces a continuous flow history that bridges directly into N-CEN/N-PORT for 2018-forward analysis. Output feeds flow-performance regressions, capacity models, and organic-growth metrics in commercial fund databases.

Reconstructing fee schedules for Section 36(b) litigation

Extract the management-fee breakpoint ladder from Items 054 to 061, 12b-1 plan attributes from Items 042 to 048, and monthly average net assets from Item 075 to rebuild a fund's effective fee path across every semi-annual period. Item 076 (period-end NAV per share) and Item 074 (total net assets) supply the asset base for damages calculations. Output drops into expert reports and deposition exhibits where contemporaneous prospectus and shareholder-report data alone are insufficient.

Service-provider league tables and succession tracking

Iterate Items 007 to 018 (advisers, sub-advisers, principal underwriters, custodians, sub-custodians, transfer agents, shareholder-servicing agents, administrators, independent public accountants) across the full corpus to produce per-period rosters keyed by CIK and series identifier. Sorting same-registrant filings by periodOfReport and filedAt exposes adviser changes, custody transitions, and auditor rotations. The same workflow supports 15(c) board materials, RFP competitive intelligence, and pre-examination service-provider history packets.

Turnover and portfolio-activity benchmarking

Pull the Item 070 portfolio-transaction matrix (purchases and sales by instrument class) together with Items 071 and 072 (aggregate portfolio activity and turnover) to build mid-year turnover distributions by Lipper-style peer set. The asset-category percentages in Items 062 to 069 segment funds by mandate without relying on third-party classifications. Output supports trading-cost research, peer-turnover comparisons in 15(c) packages, and capacity ceilings for active-strategy backtests.

Amendment-aware point-in-time reconstruction

Group records sharing entities[].cik and periodOfReport, then sort by formType (NSAR-A versus NSAR-A/A) and filedAt to reconstruct the latest amended view of any six-month period. Diffing the line-coded answers between an original and its amendments isolates which items (fees, flows, attestations) were restated, supporting compliance restatement reviews and forensic accounting workups where the timing of the correction matters.

Series-trust identifier bridging

For series companies, correlate the S… series and C… class-contract identifiers in seriesAndClassesContractsInformation with the per-series schedules at the tail of the answer body (Items 089 to 133) to attribute assets, sales, redemptions, and NAV to the correct series rather than the trust as a whole. This is the only practical bridge from trust-level CIK to series-level operations in the pre-2018 record and is required to join NSAR-A history cleanly to N-CEN and N-PORT, which are natively series-keyed.

Grounding compliance copilots in pre-2018 fund operating data

Index each (accessionNo, item, sub-item, sub-index, value) tuple from answer.fil alongside metadata.json identifiers so retrieval systems answer fee-history, flow-history, turnover, and service-provider succession questions with direct citations to filed records. The deterministic item-number schema lets LLM tools cite specific lines (for example "Item 075 column 6, accession 0001140361-18-045903") rather than paraphrasing narrative disclosures.

Dataset Access

The Form NSAR-A Files dataset is available through three access methods: a JSON index endpoint for metadata and container discovery, a full archive download, and direct per-container downloads.

Dataset Index JSON API: https://api.sec-api.io/datasets/form-nsara-files.json

This endpoint returns dataset-level metadata along with the list of monthly container files. Each container entry includes its key, size in bytes, record count, updated timestamp, and download URL. Use this endpoint to monitor which containers changed in the most recent refresh run and selectively download only the updated archives. The endpoint does not require an API key.

Example response:

Example
1 {
2 "datasetId": "1f13365b-9ae0-68f8-9dbd-f708173eb663",
3 "datasetDownloadUrl": "https://api.sec-api.io/datasets/form-nsara-files.zip",
4 "name": "Form NSAR-A Files Dataset",
5 "updatedAt": "2026-05-18T03:12:44.000Z",
6 "earliestSampleDate": "1994-01-01",
7 "totalRecords": 227628,
8 "totalSize": 778876619,
9 "formTypes": ["NSAR-A", "NSAR-A/A"],
10 "containerFormat": "ZIP",
11 "fileTypes": ["TXT", "JSON", "HTML", "XFD", "FRM", "FIL", "PDF"],
12 "containers": [
13 {
14 "downloadUrl": "https://api.sec-api.io/datasets/form-nsara-files/2018/2018-06.zip",
15 "key": "2018/2018-06.zip",
16 "size": 4185293,
17 "records": 142,
18 "updatedAt": "2026-05-18T03:12:44.000Z"
19 }
20 ]
21 }

Download Entire Dataset: https://api.sec-api.io/datasets/form-nsara-files.zip?token=YOUR_API_KEY

Downloads the complete dataset as a single ZIP archive covering all NSAR-A and NSAR-A/A filings from January 1994 through June 2018 (with sparse records in the second half of 2018). This endpoint requires a valid SEC API key passed as the token query parameter.

Download Single Container: https://api.sec-api.io/datasets/form-nsara-files/2018/2018-06.zip?token=YOUR_API_KEY

Downloads one monthly container ZIP instead of the full archive. Replace the year and month path segment with any container key listed under containers[].key in the dataset index JSON. This endpoint requires a valid SEC API key.

Frequently Asked Questions

What forms does this dataset cover?

The dataset covers Form NSAR-A (the original semi-annual report for registered management investment companies) and Form NSAR-A/A (amendments to previously filed NSAR-A reports). Both form types are accepted by EDGAR under the same submission schema.

What does one record in this dataset represent?

One record is a complete EDGAR submission of a single NSAR-A or NSAR-A/A filing, identified by its 18-character SEC accession number. Each record is a flat folder containing a metadata.json header and the submission documents — most commonly the line-coded answer.fil questionnaire body plus any exhibits filed alongside it.

Who is required to file Form NSAR-A?

Open-end management investment companies (mutual funds, including ETFs organized as open-end funds) and closed-end management investment companies (including listed closed-end funds and interval funds) filed Form NSAR-A. Unit investment trusts, business development companies, investment advisers, private funds relying on Section 3(c)(1) or 3(c)(7), and unregistered foreign funds are outside the NSAR-A population.

When was Form NSAR-A discontinued, and why does the dataset extend past that date?

The SEC's investment company reporting modernization rulemaking rescinded Form N-SAR effective June 1, 2018, replacing it with Form N-CEN (annual census) and Form N-PORT (monthly portfolio holdings). Original NSAR-A filings therefore end in mid-2018, but the dataset still receives NSAR-A/A amendments to pre-rescission periods because there is no statutory cap on how late an amendment may be filed.

What time period does the dataset cover?

The dataset's earliest records date to January 1, 1994, when EDGAR submissions for Form N-SAR began. Original NSAR-A filings run continuously through June 1, 2018, with later amendments and a sparse tail of late originals continuing thereafter.

What file format is the dataset distributed in?

The dataset is distributed as ZIP containers organized by month. Inside each container, every record is a flat accession-named folder containing metadata.json plus the EDGAR submission documents in their original formats — TXT, HTML, XFD, FRM, FIL, and PDF.

How does NSAR-A differ from NSAR-B?

Both share the same filer population, statutory basis (Section 30 of the Investment Company Act of 1940), and line-coded answer.fil format. NSAR-A covers the first six months of the registrant's fiscal year; NSAR-B covers the full fiscal year and adds annual-only items such as auditor identification, internal-control matters, fidelity-bond coverage, and certain affiliate-transaction disclosures. The two forms are complements, not substitutes.