Audit Fees Dataset (2001-Present)

The Audit Fees dataset contains structured annual audit fee disclosures extracted from DEF 14A proxy statements filed on EDGAR. Each record represents a single company-year observation: the fees paid by one SEC registrant to its principal independent registered public accounting firm for a completed fiscal year. The dataset captures all four fee categories mandated by SEC disclosure rules — audit fees, audit-related fees, tax fees, and all other fees — along with the auditor name, company identifiers (ticker, CIK, company name), and filing metadata. The dataset contains more than 139,000 audit fee records extracted from over 69,000 DEF 14A filings, covering more than 10,000 unique issuers from 2001 to the present. It is updated daily and distributed as .jsonl.gz (gzip-compressed JSON Lines) files for efficient bulk access.

Update Frequency: Daily
Updated at: 2026-04-09
Earliest Sample Date: 2001-03-01
Total Size: 9.8 MB
Container Format: .jsonl.gz
Content Types: JSONL
Form Types: DEF 14A

Dataset Files

304 files · 9.8 MB
2026-04.jsonl.gz  62.2 KB
2026-03.jsonl.gz  105.5 KB
2026-02.jsonl.gz  20 B
2026-01.jsonl.gz  20 B
2025-12.jsonl.gz  20 B
2025-11.jsonl.gz  20 B
2025-10.jsonl.gz  20 B
2025-09.jsonl.gz  20 B
2025-08.jsonl.gz  20 B
2025-07.jsonl.gz  20 B
2025-06.jsonl.gz  13.7 KB
2025-05.jsonl.gz  24.0 KB
2025-04.jsonl.gz  210.4 KB
2025-03.jsonl.gz  74.1 KB
2025-02.jsonl.gz  6.1 KB
2025-01.jsonl.gz  10.5 KB
2024-12.jsonl.gz  11.1 KB
2024-11.jsonl.gz  10.2 KB
2024-10.jsonl.gz  18.4 KB
2024-09.jsonl.gz  10.1 KB
2024-08.jsonl.gz  12.4 KB
2024-07.jsonl.gz  11.9 KB
2024-06.jsonl.gz  13.9 KB
2024-05.jsonl.gz  18.9 KB
2024-04.jsonl.gz  188.0 KB
2024-03.jsonl.gz  68.7 KB
2024-02.jsonl.gz  6.5 KB
2024-01.jsonl.gz  12.3 KB
2023-12.jsonl.gz  11.5 KB
2023-11.jsonl.gz  11.4 KB
2023-10.jsonl.gz  19.3 KB
2023-09.jsonl.gz  14.5 KB
2023-08.jsonl.gz  14.1 KB
2023-07.jsonl.gz  11.4 KB
2023-06.jsonl.gz  14.3 KB
2023-05.jsonl.gz  30.7 KB
2023-04.jsonl.gz  172.3 KB
2023-03.jsonl.gz  76.8 KB
2023-02.jsonl.gz  4.3 KB
2023-01.jsonl.gz  12.0 KB
2022-12.jsonl.gz  10.0 KB
2022-11.jsonl.gz  9.0 KB
2022-10.jsonl.gz  15.7 KB
2022-09.jsonl.gz  10.9 KB
2022-08.jsonl.gz  11.5 KB
2022-07.jsonl.gz  13.8 KB
2022-06.jsonl.gz  13.4 KB
2022-05.jsonl.gz  26.8 KB
2022-04.jsonl.gz  169.3 KB
2022-03.jsonl.gz  72.0 KB
2022-02.jsonl.gz  4.6 KB
2022-01.jsonl.gz  10.2 KB
2021-12.jsonl.gz  9.2 KB
2021-11.jsonl.gz  8.7 KB
2021-10.jsonl.gz  14.5 KB
2021-09.jsonl.gz  10.8 KB
2021-08.jsonl.gz  8.8 KB
2021-07.jsonl.gz  11.8 KB
2021-06.jsonl.gz  13.2 KB
2021-05.jsonl.gz  14.9 KB
2021-04.jsonl.gz  156.2 KB
2021-03.jsonl.gz  71.8 KB
2021-02.jsonl.gz  4.3 KB
2021-01.jsonl.gz  10.2 KB
2020-12.jsonl.gz  9.1 KB
2020-11.jsonl.gz  8.0 KB
2020-10.jsonl.gz  13.7 KB
2020-09.jsonl.gz  11.9 KB
2020-08.jsonl.gz  8.3 KB
2020-07.jsonl.gz  12.5 KB
2020-06.jsonl.gz  14.7 KB
2020-05.jsonl.gz  18.5 KB
2020-04.jsonl.gz  159.4 KB
2020-03.jsonl.gz  70.4 KB
2020-02.jsonl.gz  5.0 KB
2020-01.jsonl.gz  9.0 KB
2019-12.jsonl.gz  8.3 KB
2019-11.jsonl.gz  7.7 KB
2019-10.jsonl.gz  12.5 KB
2019-09.jsonl.gz  9.5 KB
2019-08.jsonl.gz  8.3 KB
2019-07.jsonl.gz  12.5 KB
2019-06.jsonl.gz  10.3 KB
2019-05.jsonl.gz  15.2 KB
2019-04.jsonl.gz  155.8 KB
2019-03.jsonl.gz  83.4 KB
2019-02.jsonl.gz  7.3 KB
2019-01.jsonl.gz  9.3 KB
2018-12.jsonl.gz  9.5 KB
2018-11.jsonl.gz  6.9 KB
2018-10.jsonl.gz  14.9 KB
2018-09.jsonl.gz  13.2 KB
2018-08.jsonl.gz  11.8 KB
2018-07.jsonl.gz  13.9 KB
2018-06.jsonl.gz  15.7 KB
2018-05.jsonl.gz  21.2 KB
2018-04.jsonl.gz  156.1 KB
2018-03.jsonl.gz  87.7 KB
2018-02.jsonl.gz  5.2 KB
2018-01.jsonl.gz  10.4 KB
2017-12.jsonl.gz  10.3 KB
2017-11.jsonl.gz  7.6 KB
2017-10.jsonl.gz  12.1 KB
2017-09.jsonl.gz  11.1 KB
2017-08.jsonl.gz  8.7 KB
2017-07.jsonl.gz  11.3 KB
2017-06.jsonl.gz  13.1 KB
2017-05.jsonl.gz  23.3 KB
2017-04.jsonl.gz  131.6 KB
2017-03.jsonl.gz  113.0 KB
2017-02.jsonl.gz  6.4 KB
2017-01.jsonl.gz  14.3 KB
2016-12.jsonl.gz  12.7 KB
2016-11.jsonl.gz  8.9 KB
2016-10.jsonl.gz  18.3 KB
2016-09.jsonl.gz  13.6 KB
2016-08.jsonl.gz  10.7 KB
2016-07.jsonl.gz  14.5 KB
2016-06.jsonl.gz  17.0 KB
2016-05.jsonl.gz  22.0 KB
2016-04.jsonl.gz  199.4 KB
2016-03.jsonl.gz  117.3 KB
2016-02.jsonl.gz  8.6 KB
2016-01.jsonl.gz  14.3 KB
2015-12.jsonl.gz  14.0 KB
2015-11.jsonl.gz  9.6 KB
2015-10.jsonl.gz  18.3 KB
2015-09.jsonl.gz  15.0 KB
2015-08.jsonl.gz  10.5 KB
2015-07.jsonl.gz  16.4 KB
2015-06.jsonl.gz  17.6 KB
2015-05.jsonl.gz  18.0 KB
2015-04.jsonl.gz  167.2 KB
2015-03.jsonl.gz  103.4 KB
2015-02.jsonl.gz  8.0 KB
2015-01.jsonl.gz  15.1 KB
2014-12.jsonl.gz  15.6 KB
2014-11.jsonl.gz  9.0 KB
2014-10.jsonl.gz  20.9 KB
2014-09.jsonl.gz  14.9 KB
2014-08.jsonl.gz  10.4 KB
2014-07.jsonl.gz  16.6 KB
2014-06.jsonl.gz  20.8 KB
2014-05.jsonl.gz  18.8 KB
2014-04.jsonl.gz  208.7 KB
2014-03.jsonl.gz  117.4 KB
2014-02.jsonl.gz  8.1 KB
2014-01.jsonl.gz  16.6 KB
2013-12.jsonl.gz  14.3 KB
2013-11.jsonl.gz  10.6 KB
2013-10.jsonl.gz  22.0 KB
2013-09.jsonl.gz  13.0 KB
2013-08.jsonl.gz  13.1 KB
2013-07.jsonl.gz  9.9 KB
2013-06.jsonl.gz  14.7 KB
2013-05.jsonl.gz  16.2 KB
2013-04.jsonl.gz  198.6 KB
2013-03.jsonl.gz  114.0 KB
2013-02.jsonl.gz  10.7 KB
2013-01.jsonl.gz  16.8 KB
2012-12.jsonl.gz  14.2 KB
2012-11.jsonl.gz  10.2 KB
2012-10.jsonl.gz  21.9 KB
2012-09.jsonl.gz  17.1 KB
2012-08.jsonl.gz  13.5 KB
2012-07.jsonl.gz  17.4 KB
2012-06.jsonl.gz  18.8 KB
2012-05.jsonl.gz  18.2 KB
2012-04.jsonl.gz  209.9 KB
2012-03.jsonl.gz  124.4 KB
2012-02.jsonl.gz  9.8 KB
2012-01.jsonl.gz  18.1 KB
2011-12.jsonl.gz  16.3 KB
2011-11.jsonl.gz  11.5 KB
2011-10.jsonl.gz  19.4 KB
2011-09.jsonl.gz  20.2 KB
2011-08.jsonl.gz  14.7 KB
2011-07.jsonl.gz  18.0 KB
2011-06.jsonl.gz  18.3 KB
2011-05.jsonl.gz  33.3 KB
2011-04.jsonl.gz  209.5 KB
2011-03.jsonl.gz  122.3 KB
2011-02.jsonl.gz  10.8 KB
2011-01.jsonl.gz  18.6 KB
2010-12.jsonl.gz  17.7 KB
2010-11.jsonl.gz  12.9 KB
2010-10.jsonl.gz  22.7 KB
2010-09.jsonl.gz  17.5 KB
2010-08.jsonl.gz  15.6 KB
2010-07.jsonl.gz  18.7 KB
2010-06.jsonl.gz  21.6 KB
2010-05.jsonl.gz  21.6 KB
2010-04.jsonl.gz  217.8 KB
2010-03.jsonl.gz  126.0 KB
2010-02.jsonl.gz  12.4 KB
2010-01.jsonl.gz  19.1 KB
2009-12.jsonl.gz  18.0 KB
2009-11.jsonl.gz  14.7 KB
2009-10.jsonl.gz  22.3 KB
2009-09.jsonl.gz  17.4 KB
2009-08.jsonl.gz  14.3 KB
2009-07.jsonl.gz  19.2 KB
2009-06.jsonl.gz  21.3 KB
2009-05.jsonl.gz  26.2 KB
2009-04.jsonl.gz  221.0 KB
2009-03.jsonl.gz  116.9 KB
2009-02.jsonl.gz  9.8 KB
2009-01.jsonl.gz  21.3 KB
2008-12.jsonl.gz  16.7 KB
2008-11.jsonl.gz  12.3 KB
2008-10.jsonl.gz  24.8 KB
2008-09.jsonl.gz  15.7 KB
2008-08.jsonl.gz  17.1 KB
2008-07.jsonl.gz  20.6 KB
2008-06.jsonl.gz  18.1 KB
2008-05.jsonl.gz  30.2 KB
2008-04.jsonl.gz  244.0 KB
2008-03.jsonl.gz  111.5 KB
2008-02.jsonl.gz  13.6 KB
2008-01.jsonl.gz  20.7 KB
2007-12.jsonl.gz  16.2 KB
2007-11.jsonl.gz  15.7 KB
2007-10.jsonl.gz  26.1 KB
2007-09.jsonl.gz  17.8 KB
2007-08.jsonl.gz  14.4 KB
2007-07.jsonl.gz  21.9 KB
2007-06.jsonl.gz  20.6 KB
2007-05.jsonl.gz  27.1 KB
2007-04.jsonl.gz  225.9 KB
2007-03.jsonl.gz  92.9 KB
2007-02.jsonl.gz  10.9 KB
2007-01.jsonl.gz  17.0 KB
2006-12.jsonl.gz  17.5 KB
2006-11.jsonl.gz  11.3 KB
2006-10.jsonl.gz  22.8 KB
2006-09.jsonl.gz  19.3 KB
2006-08.jsonl.gz  15.1 KB
2006-07.jsonl.gz  17.2 KB
2006-06.jsonl.gz  20.6 KB
2006-05.jsonl.gz  41.9 KB
2006-04.jsonl.gz  183.2 KB
2006-03.jsonl.gz  121.2 KB
2006-02.jsonl.gz  10.0 KB
2006-01.jsonl.gz  16.7 KB
2005-12.jsonl.gz  16.2 KB
2005-11.jsonl.gz  11.5 KB
2005-10.jsonl.gz  21.8 KB
2005-09.jsonl.gz  19.4 KB
2005-08.jsonl.gz  14.2 KB
2005-07.jsonl.gz  17.5 KB
2005-06.jsonl.gz  20.7 KB
2005-05.jsonl.gz  34.8 KB
2005-04.jsonl.gz  174.2 KB
2005-03.jsonl.gz  102.4 KB
2005-02.jsonl.gz  9.9 KB
2005-01.jsonl.gz  14.2 KB
2004-12.jsonl.gz  13.0 KB
2004-11.jsonl.gz  8.2 KB
2004-10.jsonl.gz  15.6 KB
2004-09.jsonl.gz  14.9 KB
2004-08.jsonl.gz  10.2 KB
2004-07.jsonl.gz  17.2 KB
2004-06.jsonl.gz  15.9 KB
2004-05.jsonl.gz  20.0 KB
2004-04.jsonl.gz  148.8 KB
2004-03.jsonl.gz  95.5 KB
2004-02.jsonl.gz  6.9 KB
2004-01.jsonl.gz  11.0 KB
2003-12.jsonl.gz  8.3 KB
2003-11.jsonl.gz  4.4 KB
2003-10.jsonl.gz  7.8 KB
2003-09.jsonl.gz  6.2 KB
2003-08.jsonl.gz  4.8 KB
2003-07.jsonl.gz  4.2 KB
2003-06.jsonl.gz  5.1 KB
2003-05.jsonl.gz  7.5 KB
2003-04.jsonl.gz  43.8 KB
2003-03.jsonl.gz  25.7 KB
2003-02.jsonl.gz  2.4 KB
2003-01.jsonl.gz  1.7 KB
2002-12.jsonl.gz  1.6 KB
2002-11.jsonl.gz  735 B
2002-10.jsonl.gz  2.7 KB
2002-09.jsonl.gz  1.1 KB
2002-08.jsonl.gz  1.7 KB
2002-07.jsonl.gz  1.5 KB
2002-06.jsonl.gz  2.0 KB
2002-05.jsonl.gz  1.6 KB
2002-04.jsonl.gz  9.2 KB
2002-03.jsonl.gz  5.2 KB
2002-02.jsonl.gz  390 B
2002-01.jsonl.gz  761 B
2001-12.jsonl.gz  567 B
2001-11.jsonl.gz  568 B
2001-10.jsonl.gz  990 B
2001-09.jsonl.gz  1.0 KB
2001-08.jsonl.gz  1.2 KB
2001-07.jsonl.gz  819 B
2001-06.jsonl.gz  697 B
2001-05.jsonl.gz  961 B
2001-04.jsonl.gz  3.9 KB
2001-03.jsonl.gz  2.1 KB
2001-02.jsonl.gz  20 B
2001-01.jsonl.gz  20 B

What This Dataset Contains

Each record is a structured extraction from the "Principal Accountant Fees and Services" section of a DEF 14A proxy statement. Rather than preserving the entire proxy document, the dataset normalizes the specific fee data points that public companies must disclose under Item 9(e) of Schedule 14A into machine-readable JSON records. The result is an analysis-ready dataset of auditor compensation across the full EDGAR-filing population — from micro-cap registrants paying under $100,000 in audit fees to S&P 500 companies paying millions annually. Across the full dataset, the median audit fee is approximately $732,000 and the mean is approximately $1.26 million, reflecting the wide spread between small filers and the largest public companies.

Content Structure of a Single Record

Each record in the dataset represents a single DEF 14A filing and contains the following structured fields:

Filing-Level Fields

  • id — system-internal unique identifier for the filing record
  • accessionNo — unique SEC accession number of the source DEF 14A filing (e.g., 0001193125-16-543341)
  • formType — the SEC form type, typically DEF 14A
  • filedAt — timestamp when the filing was accepted by EDGAR (e.g., 2016-04-15T17:09:07-04:00)
  • periodOfReport — the reporting period covered by the proxy statement, corresponding to the fiscal year end (e.g., 2016-05-31)

Entities Array

Each filing includes an entities array identifying the registrant(s):

  • cik — Central Index Key, the SEC's unique numeric identifier for the filing entity (e.g., 1318605)
  • ticker — trading symbol of the registrant (e.g., TSLA)
  • companyName — registrant name as it appears on EDGAR (e.g., TESLA MOTORS INC (Filer))
  • irsNo — IRS Employer Identification Number
  • fiscalYearEnd — fiscal year-end in four-digit month-day format (e.g., 1231 for December 31)
  • stateOfIncorporation — two-letter state code (e.g., DE) or country name for non-U.S. entities
  • sic — Standard Industrial Classification code and description (e.g., 3711 Motor Vehicles & Passenger Car Bodies)
  • act — regulatory act under which the entity files (e.g., 34 for the Securities Exchange Act of 1934)
  • fileNo — SEC file number (e.g., 001-34756)
  • filmNo — SEC film number for tracking the filing
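
The compact encodings of fiscalYearEnd and sic are easier to work with once split into parts. The following is a minimal sketch, assuming both fields arrive as strings in the formats shown in the examples above; the helper names are illustrative and not part of the dataset or any API.

```python
from typing import Optional, Tuple


def parse_fiscal_year_end(value: Optional[str]) -> Optional[Tuple[int, int]]:
    """Split a four-digit MMDD string such as "1231" into (month, day)."""
    if not value or len(value) != 4 or not value.isdigit():
        return None  # missing or unexpected format: leave it to the caller
    month, day = int(value[:2]), int(value[2:])
    if not (1 <= month <= 12 and 1 <= day <= 31):
        return None
    return month, day


def split_sic(value: Optional[str]) -> Tuple[Optional[str], Optional[str]]:
    """Split "3711 Motor Vehicles & ..." into (code, description)."""
    if not value:
        return None, None
    code, _, description = value.strip().partition(" ")
    return code, description or None


print(parse_fiscal_year_end("1231"))
print(split_sic("3711 Motor Vehicles & Passenger Car Bodies"))
```

Returning None for unexpected input, rather than raising, keeps a bulk load running when an individual filing carries a missing or malformed value.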

Records Array (Fee Categories)

Each filing contains a records array with one or more fee records — typically two, covering the two most recent fiscal years as required by SEC disclosure rules. Each fee record includes:

  • year — the fiscal year to which the disclosed audit fees apply (e.g., 2015)
  • auditFees — fees in USD for the audit of the annual financial statements, reviews of quarterly financial statements included in Form 10-Q filings, and services normally provided in connection with statutory and regulatory filings or engagements (comfort letters, consents, SEC registration statement reviews)
  • auditRelatedFees — fees in USD for assurance and related services reasonably related to the audit but not classified as audit fees: due diligence for mergers and acquisitions, employee benefit plan audits, accounting consultations, attest services not required by statute or regulation, and internal control advisory work
  • taxFees — fees in USD for tax compliance (return preparation, claims for refunds), tax advice (planning, restructuring), and tax planning services (transfer pricing studies, tax opinion letters)
  • allOtherFees — fees in USD for any permissible non-audit services not captured above, such as licensing fees for accounting research tools, benchmarking studies, or advisory services unrelated to audit or tax
  • totalFees — the sum of all fee categories for the given year in USD
  • auditor — the name of the principal independent registered public accounting firm (e.g., PricewaterhouseCoopers LLP, Deloitte & Touche LLP, Ernst & Young LLP, KPMG LLP, Grant Thornton LLP, BDO USA LLP, or any of the hundreds of smaller and regional firms registered with the PCAOB)
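
For analysis it is often convenient to flatten the nested structure into one row per company and fiscal year. The sketch below assumes each JSON object is a filing-level record with the entities and records arrays described above, and that the first entry in entities is the registrant. The sample object and the flatten_filing helper are hypothetical, with placeholder values rather than real disclosures.

```python
import json
from typing import Dict, List

# Hypothetical sample line shaped after the field descriptions above.
# Every value is an illustrative placeholder, not a real disclosure.
SAMPLE_LINE = json.dumps({
    "accessionNo": "0000000000-16-000001",
    "formType": "DEF 14A",
    "filedAt": "2016-04-15T17:09:07-04:00",
    "entities": [
        {"cik": "1234567", "ticker": "XMPL", "companyName": "EXAMPLE CORP (Filer)"}
    ],
    "records": [
        {"year": 2015, "auditFees": 1000000, "auditRelatedFees": 50000,
         "taxFees": 200000, "allOtherFees": 0, "totalFees": 1250000,
         "auditor": "Example Auditor LLP"},
        {"year": 2014, "auditFees": 900000, "auditRelatedFees": 40000,
         "taxFees": 150000, "allOtherFees": 10000, "totalFees": 1100000,
         "auditor": "Example Auditor LLP"},
    ],
})


def flatten_filing(filing: Dict) -> List[Dict]:
    """Return one flat company-year row per entry in the records array."""
    entities = filing.get("entities") or [{}]
    registrant = entities[0]  # assumption: the first entity is the registrant
    rows = []
    for fee in filing.get("records") or []:
        row = {
            "accessionNo": filing.get("accessionNo"),
            "filedAt": filing.get("filedAt"),
            "cik": registrant.get("cik"),
            "ticker": registrant.get("ticker"),
            "companyName": registrant.get("companyName"),
        }
        row.update(fee)  # year, fee categories, totalFees, auditor
        rows.append(row)
    return rows


rows = flatten_filing(json.loads(SAMPLE_LINE))
for row in rows:
    print(row["cik"], row["year"], row["auditFees"], row["auditor"])
```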

Important Content Nuances

Two-year comparative disclosure: SEC rules require companies to disclose audit fees for the two most recent fiscal years in each proxy statement. A single DEF 14A filing may therefore produce records for both the current-year and prior-year fiscal period.
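
Because consecutive proxies overlap by one fiscal year, the same company-year can appear twice once filings are flattened. One possible way to handle this, sketched below, is to keep the figure from the most recent filing. It assumes flat rows with cik, year, and an offset-aware ISO 8601 filedAt string; the helper name and the sample rows are hypothetical.

```python
from datetime import datetime
from typing import Dict, Iterable, List, Tuple


def latest_per_company_year(rows: Iterable[Dict]) -> List[Dict]:
    """Keep one row per (cik, year): the one from the most recent filing."""
    best: Dict[Tuple[str, int], Tuple[datetime, Dict]] = {}
    for row in rows:
        key = (row["cik"], row["year"])
        filed = datetime.fromisoformat(row["filedAt"])  # offset-aware ISO 8601
        if key not in best or filed > best[key][0]:
            best[key] = (filed, row)
    return [row for _, row in best.values()]


# Placeholder rows: fiscal 2015 is reported in both the 2016 and 2017 proxies.
rows = [
    {"cik": "1234567", "year": 2015, "auditFees": 1000000,
     "filedAt": "2016-04-15T17:09:07-04:00"},
    {"cik": "1234567", "year": 2015, "auditFees": 1020000,
     "filedAt": "2017-04-14T16:30:00-04:00"},
    {"cik": "1234567", "year": 2016, "auditFees": 1100000,
     "filedAt": "2017-04-14T16:30:00-04:00"},
]
deduped = latest_per_company_year(rows)
print(len(deduped))
```

Preferring the later filing is a design choice, since a later proxy may carry revised prior-year amounts. Studies that need figures as first reported would keep the earliest filing instead.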

Aggregate principal-auditor fees only: Disclosed figures represent total fees billed by the principal auditor. They do not break down fees by individual engagement, subsidiary audit, or geographic jurisdiction. Companies using multiple audit firms disclose fees only for the principal firm in the structured fee table.

Fiscal year alignment: Fee disclosures correspond to the registrant's fiscal year, not the calendar year. A company with a fiscal year ending June 30 reports fees for the 12 months ending June 30, and its proxy typically appears in the fall rather than spring.

Auditor transitions: When a company changes auditors, the proxy may report fees paid to both the predecessor and successor firm. The dataset captures the auditor identified as the principal auditor for the disclosure period.

Fee magnitude range: Audit fees span a wide range — from under $100,000 for smaller reporting companies to millions of dollars annually for the largest public companies. The 25th percentile sits around $270,000 and the 75th percentile around $1.66 million. The dataset spans the entire filer population, creating a wide distribution useful for cross-sectional analysis.

Who Discloses Audit Fees and When

Filer Population

Audit fee disclosures originate from SEC registrants that file definitive proxy statements (DEF 14A) on EDGAR. The disclosing population spans:

  • Public operating companies across all industries and all filer categories — large accelerated filers, accelerated filers, non-accelerated filers, and smaller reporting companies
  • REITs — Real Estate Investment Trusts subject to Exchange Act proxy requirements
  • BDCs — Business Development Companies subject to Exchange Act reporting
  • Closed-end funds — registered closed-end investment companies that solicit proxies under Section 14(a)
  • SPACs — Special Purpose Acquisition Companies, though pre-combination fee disclosures are typically minimal
  • Emerging growth companies — subject to the same four-category fee disclosure framework

Companies outside the DEF 14A population are excluded: foreign private issuers (who disclose fees in Form 20-F), registered open-end investment companies (who disclose in Form N-CSR), and private companies.

Disclosure Trigger and Timing

Audit fee disclosure is not event-driven. It is a mandatory annual disclosure embedded in the proxy statement filed in connection with the company's annual meeting of shareholders. The proxy includes a "Principal Accountant Fees and Services" section that itemizes fees billed by the principal independent registered public accounting firm for the two most recent completed fiscal years.

Most companies file their proxy statements 30 to 60 days before the annual shareholder meeting. For calendar-year-end companies, the proxy filing window runs primarily between March and June, meaning the fee disclosure appears roughly 3 to 6 months after the fiscal year-end to which the fees relate. Non-calendar-year filers follow their own schedule.

Regulatory Framework

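The four-category fee disclosure requirement is governed by several interlocking rules: Item 9(e) of Schedule 14A (17 CFR 240.14a-101), which mandates the proxy fee table; SEC Release No. 33-8183 (January 2003), which established the four-category framework; and Rule 2-01 of Regulation S-X, which sets the auditor independence requirements discussed below.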

Auditor Independence Context

Fee disclosures serve a specific regulatory purpose: enabling shareholders and regulators to assess whether the auditor's non-audit fee revenue from a client creates a financial dependency that could compromise audit objectivity. The ratio of non-audit fees (audit-related + tax + all other) to audit fees is the standard quantitative metric. Rule 2-01 of Regulation S-X prohibits specific non-audit services outright — bookkeeping, financial information systems design, appraisal services, actuarial services, internal audit outsourcing, management functions, broker-dealer services, legal services, and expert services unrelated to the audit. Permissible non-audit services require audit committee pre-approval. The proxy fee table provides the quantitative evidence for evaluating compliance with these independence safeguards.

How This Dataset Differs from Similar Datasets and Sources

vs. DEF 14A Proxy Statement Full-Text Datasets

A full-text proxy dataset such as the DEF 14A Filings dataset preserves the entire DEF 14A document — executive compensation, board composition, say-on-pay proposals, related-party transactions, and dozens of other topics. The audit fees dataset extracts only the structured fee data from the "Principal Accountant Fees and Services" section. Researchers focused on auditor compensation skip document parsing entirely and work directly with structured numeric records.

vs. Form 10-K Auditor Disclosures

Form 10-K identifies the principal accounting firm and includes the auditor's report, but it does not disclose fee amounts. Fee disclosure is a proxy statement requirement under Schedule 14A, not a 10-K requirement. The 10-K reveals who the auditor is and what opinion they issued; the proxy reveals what the company paid and for which categories of service.

vs. Form AP (Auditor Reporting of Certain Audit Participants)

Form AP, filed with the PCAOB for audit reports issued on or after January 31, 2017, discloses the engagement partner name and, for reports issued on or after June 30, 2017, the participation percentages of other accounting firms involved in the audit. Form AP does not disclose fees. The two sources are complementary: Form AP identifies who performed the work; the audit fees dataset quantifies what the company paid.

vs. Form 20-F and Form N-CSR

Foreign private issuers disclose audit fees in Form 20-F annual reports; registered investment companies disclose in Form N-CSR. Both follow similar fee category structures but cover different filer populations. This dataset captures domestic registrant proxy (DEF 14A) disclosures only.

Dimension | Audit Fees Dataset | DEF 14A Full Text | Form 10-K | Form AP | Commercial (Audit Analytics)
Fee amounts | Yes (4 categories) | Embedded in narrative | No | No | Yes (enriched)
Auditor name | Yes | Yes | Yes | Yes | Yes
Engagement partner | No | No | No | Yes | Sometimes
Full document text | No | Yes | Yes | No | No
Filer population | Domestic proxy filers | Same | Domestic | All PCAOB-registered | Vendor-selected
Machine-readable | Yes (JSONL) | Requires parsing | Requires parsing | Structured XML | Structured
Historical depth | 2001-present | 2001-present | Varies | 2017-present | 2001-present

Who Uses This Dataset

Audit committee members and board directors benchmark their company's audit costs against peers matched by revenue, industry, and operational complexity. They track the ratio of non-audit fees to audit fees to evaluate auditor independence risk and use fee comparisons when evaluating proposals during auditor selection or rotation.

Institutional investors and proxy advisory firms incorporate audit fee analysis into governance assessments. Elevated non-audit fee ratios may trigger negative voting recommendations on auditor ratification proposals. Abrupt fee changes or auditor switches accompanied by fee reductions can signal audit scope disputes or opinion-shopping risk.

Academic researchers in accounting and auditing rely on the dataset for empirical studies on audit pricing determinants, the relationship between non-audit services and earnings quality, Big Four vs. non-Big Four fee premiums, auditor tenure effects, fee consequences of SOX Section 404, and the impact of PCAOB inspections on audit pricing.

Audit partners and practice leaders at accounting firms analyze public fee disclosures to calibrate engagement pricing, prepare competitive proposals, and track fee trends by industry vertical and company size tier.

Securities analysts and equity researchers treat audit fee anomalies as supplementary risk indicators. Unexpected fee spikes may precede restatements, material weakness disclosures, or going concern opinions.

Compliance and risk professionals track peer fee levels to confirm their company's fee structure is consistent with market norms and satisfies the independence safeguards of Rule 2-01 of Regulation S-X.

Regulators and policymakers (SEC staff, PCAOB, GAO) study aggregate fee data to assess audit market concentration, evaluate the economic impact of new auditing standards, and determine whether fees correlate with audit effort and quality.

Legal counsel in securities litigation use fee data to support or defend auditor independence claims, citing non-audit fee ratios as evidence of potential financial dependency.

Specific Use Cases

1. Auditor Independence Risk Screening

Compute the non-audit fee ratio (the sum of audit-related, tax, and all other fees, divided by total fees) for every company-year in the dataset. Flag registrants whose non-audit fee ratio exceeds a chosen threshold as potential independence risk cases. Cross-reference flagged companies against subsequent restatements, material weakness disclosures, and SEC enforcement actions to evaluate whether high non-audit fee ratios are predictive of audit failures. Proxy advisory firms can integrate this screening into voting recommendation models for auditor ratification proposals.
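
The ratio and the screening step can be sketched as follows. The sketch assumes fee records shaped like the records array described earlier; the function names and the 0.5 default threshold are illustrative choices, not values prescribed by the dataset or by any regulation.

```python
from typing import Dict, Optional

NON_AUDIT_KEYS = ("auditRelatedFees", "taxFees", "allOtherFees")


def non_audit_fee_ratio(fee: Dict, denominator: str = "total") -> Optional[float]:
    """Non-audit fees divided by total fees ("total") or audit fees ("audit")."""
    if denominator not in ("total", "audit"):
        raise ValueError("denominator must be 'total' or 'audit'")
    non_audit = sum(fee.get(key) or 0 for key in NON_AUDIT_KEYS)
    audit = fee.get("auditFees") or 0
    base = audit + non_audit if denominator == "total" else audit
    return non_audit / base if base else None  # None when the base is zero


def flag_high_ratio(rows, threshold: float = 0.5):
    """Return rows whose ratio exceeds an analyst-chosen threshold."""
    flagged = []
    for row in rows:
        ratio = non_audit_fee_ratio(row)
        if ratio is not None and ratio > threshold:
            flagged.append(row)
    return flagged


example = {"auditFees": 1000000, "auditRelatedFees": 50000,
           "taxFees": 200000, "allOtherFees": 0}
print(non_audit_fee_ratio(example))            # 250000 / 1250000
print(non_audit_fee_ratio(example, "audit"))   # 250000 / 1000000
```

The total is recomputed from the four categories instead of read from totalFees, so the ratio stays defined even if that field is missing for a given year.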

2. Audit Fee Benchmarking by Industry and Size Tier

Group companies by SIC or NAICS industry code and revenue decile, then compute median and percentile distributions for audit fees and total fees. Audit committees use these benchmarks to evaluate whether their company's fees are in line with peers of comparable size and complexity. Accounting firms use them to price competitive proposals. The dataset's coverage of the full EDGAR population enables benchmarking across the entire market, not just a vendor-curated subset.
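
A simple industry benchmark can be sketched with the standard library. This assumes flat company-year rows that carry the sic string from the entities array alongside auditFees. Revenue and NAICS codes are not among the fields described above, so size tiers would need a join to an external source; the function name is hypothetical.

```python
import statistics
from collections import defaultdict
from typing import Dict, Iterable


def fee_benchmarks_by_sic(rows: Iterable[Dict]) -> Dict[str, Dict]:
    """Median and quartiles of auditFees per SIC code."""
    groups = defaultdict(list)
    for row in rows:
        sic_code = (row.get("sic") or "").split(" ", 1)[0]
        fee = row.get("auditFees")
        if sic_code and fee is not None:
            groups[sic_code].append(fee)

    benchmarks = {}
    for sic_code, fees in groups.items():
        if len(fees) >= 2:
            p25, _, p75 = statistics.quantiles(fees, n=4)
        else:
            p25 = p75 = fees[0]  # a single observation has no spread
        benchmarks[sic_code] = {
            "n": len(fees),
            "p25": p25,
            "median": statistics.median(fees),
            "p75": p75,
        }
    return benchmarks


# Placeholder rows for illustration only.
sample_rows = [
    {"sic": "3711 Motor Vehicles & Passenger Car Bodies", "auditFees": 100},
    {"sic": "3711 Motor Vehicles & Passenger Car Bodies", "auditFees": 200},
    {"sic": "3711 Motor Vehicles & Passenger Car Bodies", "auditFees": 300},
]
print(fee_benchmarks_by_sic(sample_rows)["3711"]["median"])
```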

3. Audit Market Concentration and Big Four Share Analysis

Calculate the percentage of total audit fees captured by each Big Four firm (Deloitte, PwC, EY, KPMG) versus mid-tier firms (Grant Thornton, BDO, RSM, Crowe) and smaller regional firms. Track concentration trends over time and segment by filer size category to understand how market concentration varies across the filer population. Regulators and policymakers use these analyses to inform audit market competition policy.
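
As a rough sketch of the share calculation, the snippet below buckets auditor names by substring and sums audit fees per bucket. The substring patterns are a heuristic assumption and the function names are hypothetical; auditor strings in real filings may need more careful normalization.

```python
from collections import defaultdict
from typing import Dict, Iterable

# Substring patterns are a rough heuristic (an assumption of this sketch).
BIG_FOUR_PATTERNS = {
    "Deloitte": "deloitte",
    "PwC": "pricewaterhouse",
    "EY": "ernst",
    "KPMG": "kpmg",
}


def classify_auditor(name: str) -> str:
    """Map an auditor name to a Big Four label, or "Other"."""
    lowered = (name or "").lower()
    for label, pattern in BIG_FOUR_PATTERNS.items():
        if pattern in lowered:
            return label
    return "Other"


def audit_fee_share(rows: Iterable[Dict]) -> Dict[str, float]:
    """Share of total audit fees captured by each bucket."""
    totals = defaultdict(float)
    for row in rows:
        totals[classify_auditor(row.get("auditor"))] += row.get("auditFees") or 0
    grand_total = sum(totals.values())
    if not grand_total:
        return {}
    return {label: amount / grand_total for label, amount in totals.items()}


# Placeholder rows for illustration only.
sample_rows = [
    {"auditor": "PricewaterhouseCoopers LLP", "auditFees": 600},
    {"auditor": "KPMG LLP", "auditFees": 200},
    {"auditor": "Grant Thornton LLP", "auditFees": 200},
]
print(audit_fee_share(sample_rows))
```

To track concentration over time, the same function can be applied to the rows of each fiscal year separately.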

4. Auditor Change Detection and Fee Impact Analysis

Identify companies that changed auditors by comparing auditor names across consecutive fiscal years. Measure the fee change associated with transitions — how much fees rise or fall when switching between Big Four and non-Big Four, or between Big Four firms. Researchers use these patterns to study opinion-shopping hypotheses and the competitive dynamics of auditor selection.
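
A minimal sketch of the comparison follows. It assumes one flat row per company and fiscal year with cik, year, auditor, and auditFees keys. The name normalization is a simple heuristic to avoid flagging formatting differences as changes, and all helper names are hypothetical.

```python
import re
from itertools import groupby
from typing import Dict, Iterable, List

IGNORED_TOKENS = {"llp", "llc", "pc", "and"}


def normalize_auditor(name: str) -> str:
    """Lowercase, drop punctuation and legal suffixes for a tolerant comparison."""
    cleaned = re.sub(r"[^a-z0-9 ]", " ", (name or "").lower())
    return " ".join(t for t in cleaned.split() if t not in IGNORED_TOKENS)


def detect_auditor_changes(rows: Iterable[Dict]) -> List[Dict]:
    """Compare each company's auditor across consecutive fiscal years."""
    ordered = sorted(rows, key=lambda r: (r["cik"], r["year"]))
    changes = []
    for cik, group in groupby(ordered, key=lambda r: r["cik"]):
        history = list(group)
        for prev, curr in zip(history, history[1:]):
            if curr["year"] - prev["year"] != 1:
                continue  # only adjacent fiscal years are compared
            if normalize_auditor(prev["auditor"]) == normalize_auditor(curr["auditor"]):
                continue
            prev_fee, curr_fee = prev.get("auditFees"), curr.get("auditFees")
            pct = None
            if prev_fee and curr_fee is not None:
                pct = (curr_fee - prev_fee) / prev_fee
            changes.append({"cik": cik, "year": curr["year"],
                            "from": prev["auditor"], "to": curr["auditor"],
                            "auditFeeChangePct": pct})
    return changes


# Placeholder rows for illustration only.
sample_rows = [
    {"cik": "1", "year": 2014, "auditor": "Ernst & Young LLP", "auditFees": 1000},
    {"cik": "1", "year": 2015, "auditor": "ERNST AND YOUNG", "auditFees": 1100},
    {"cik": "1", "year": 2016, "auditor": "KPMG LLP", "auditFees": 880},
]
print(detect_auditor_changes(sample_rows))
```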

5. SOX 404 and Regulatory Cost Impact Studies

Track audit fee trajectories before and after major regulatory events: the initial SOX Section 404 internal control audit requirement (effective 2004), PCAOB Auditing Standard No. 5 (2007), and subsequent standard changes. Compare fee trends for affected filers against control groups (e.g., non-accelerated filers exempted from SOX 404(b)) to measure incremental regulatory cost. These analyses are central to cost-benefit evaluations of audit regulation.

6. Predictive Modeling of Audit Risk Using Fee Anomalies

Build predictive models using year-over-year audit fee changes, non-audit fee ratio shifts, and auditor changes as features to forecast restatements, material weaknesses, or going concern opinions. A sudden increase in audit fees may reflect the auditor expanding procedures in response to identified risks. Combined with financial and governance variables, fee-based features improve early-warning models for audit-related adverse events.

7. Longitudinal Study of Audit Pricing Determinants

Use the dataset's 20+ year history to estimate audit fee models controlling for company size (total assets, revenue), complexity (number of subsidiaries, segments, foreign operations), risk (leverage, losses, litigation exposure), industry, and auditor identity. These models are foundational in academic audit research and have practical applications in fee negotiation, audit planning, and regulatory impact assessment.

Dataset Access

Full Dataset Download

Download the complete dataset archive containing all audit fee records from 2001 to the present:

https://api.sec-api.io/datasets/audit-fees.zip

The full archive contains more than 139,000 structured audit fee records covering over two decades of audit fee disclosures across more than 10,000 unique issuers.

Individual Container Files

The dataset is organized into container files in .jsonl.gz (gzip-compressed JSON Lines) format. Download individual containers using the pattern:

https://api.sec-api.io/datasets/audit-fees/{container-file-name}.jsonl.gz

Each container file holds audit fee records for a specific time period. Each line in the decompressed JSONL file is a standalone JSON object representing one company-year audit fee record.
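
A downloaded container can be streamed line by line without decompressing it to disk first. The sketch below uses only the Python standard library; iter_jsonl_gz is a hypothetical helper name, and the demo writes a small throwaway file with placeholder objects so that it runs without network access.

```python
import gzip
import json
import os
import tempfile
from typing import Iterator


def iter_jsonl_gz(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a .jsonl.gz file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


# Demo on a throwaway file; the two objects are placeholders, not real records.
demo_path = os.path.join(tempfile.mkdtemp(), "demo.jsonl.gz")
with gzip.open(demo_path, "wt", encoding="utf-8") as handle:
    handle.write(json.dumps({"accessionNo": "demo-1"}) + "\n")
    handle.write(json.dumps({"accessionNo": "demo-2"}) + "\n")

objects = list(iter_jsonl_gz(demo_path))
print(len(objects))
```

Containers listed at 20 B are presumably empty gzip archives (an assumption based on their size); the generator simply yields nothing for an empty file.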

Dataset Index JSON API

Retrieve metadata about all available containers from the dataset index API:

https://api.sec-api.io/datasets/audit-fees.json

The index API returns a JSON object with dataset-level metadata and a list of all available container files with their download URLs, file sizes, record counts, and last-updated timestamps. Use this endpoint to identify which containers have been updated since your last download for incremental synchronization.
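
Incremental synchronization could be sketched as a filter over the container list. The exact key names in the index response are not documented here, so "name" and "updatedAt" below are hypothetical placeholders to be replaced with whatever keys the index actually returns; timestamps are assumed to be ISO 8601 strings.

```python
from datetime import datetime, timezone
from typing import Dict, Iterable, List


def containers_updated_since(containers: Iterable[Dict], since: datetime,
                             updated_key: str = "updatedAt") -> List[Dict]:
    """Return container entries whose timestamp is later than `since`.

    `since` must be timezone-aware; `updated_key` is a hypothetical key name.
    """
    fresh = []
    for entry in containers:
        stamp = entry.get(updated_key)
        if not stamp:
            continue
        updated = datetime.fromisoformat(stamp.replace("Z", "+00:00"))
        if updated.tzinfo is None:
            updated = updated.replace(tzinfo=timezone.utc)  # assumption: UTC
        if updated > since:
            fresh.append(entry)
    return fresh


# Placeholder entries standing in for the container list of the index response.
index_containers = [
    {"name": "2024-03.jsonl.gz", "updatedAt": "2024-04-01T03:00:00Z"},
    {"name": "2024-04.jsonl.gz", "updatedAt": "2024-05-02T03:00:00Z"},
]
last_sync = datetime(2024, 4, 15, tzinfo=timezone.utc)
stale = containers_updated_since(index_containers, last_sync)
print([entry["name"] for entry in stale])
```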

Authentication

All download endpoints require authentication. Include your API key as a query parameter (?token=YOUR_API_KEY) with each download request. The Dataset Index JSON API does not require authentication.
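
An authenticated container download can be sketched with the standard library alone. The URL pattern and the token query parameter come from the description above; the function names are hypothetical, and error handling and retries are left out.

```python
import shutil
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "https://api.sec-api.io/datasets/audit-fees"


def container_url(container_name: str, api_key: str) -> str:
    """Build the download URL for one container, e.g. container_name="2024-04"."""
    query = urlencode({"token": api_key})
    return f"{BASE_URL}/{container_name}.jsonl.gz?{query}"


def download_container(container_name: str, api_key: str, dest_path: str) -> None:
    """Stream one container file to disk (performs a network request)."""
    url = container_url(container_name, api_key)
    with urlopen(url) as response, open(dest_path, "wb") as out:
        shutil.copyfileobj(response, out)


print(container_url("2024-04", "YOUR_API_KEY"))
```

Keeping the key out of source code, for example by reading it from an environment variable, avoids leaking it through version control or logs.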

Update Frequency

The dataset is updated daily. New audit fee records extracted from recently filed DEF 14A documents are added to the bulk dataset each day between 10:30 PM and 11:30 PM ET.

Frequently Asked Questions

What are the four fee categories in the dataset?

The four categories are: (1) audit fees, for the annual audit and quarterly reviews; (2) audit-related fees, for assurance services related to but separate from the audit; (3) tax fees, for tax compliance, advice, and planning; and (4) all other fees, for any other permissible services. This four-category framework was established by SEC Release No. 33-8183 in January 2003, replacing the earlier three-category system (audit fees, financial information systems design and implementation fees, and all other fees).

Where do the audit fee disclosures come from?

The data is extracted from the "Principal Accountant Fees and Services" section of DEF 14A definitive proxy statements filed on EDGAR. This disclosure is required by Item 9(e) of Schedule 14A (17 CFR 240.14a-101).

Does the dataset include foreign private issuers?

No. Foreign private issuers disclose audit fees in Form 20-F annual reports, not in DEF 14A proxy statements. This dataset covers domestic registrants that file proxy statements.

Does the dataset include registered investment companies (mutual funds, ETFs)?

No. Open-end registered investment companies such as mutual funds and ETFs disclose audit fees in Form N-CSR, not in proxy statements. This dataset covers operating companies, REITs, BDCs, closed-end funds that solicit proxies, and other entities that file DEF 14A.

How far back does the dataset go?

Coverage begins in 2001, reflecting the earliest proxy filings with itemized fee disclosures following the SEC's initial audit fee disclosure requirement adopted in Release No. 33-7919 (November 2000).

Why do some records show two fiscal years from the same filing?

SEC rules require companies to disclose fees for the two most recent fiscal years in each proxy statement, creating a built-in year-over-year comparison. A single DEF 14A may therefore produce records for both fiscal years.

What is the non-audit fee ratio and why does it matter?

The non-audit fee ratio is the sum of audit-related fees, tax fees, and all other fees divided by total fees (or by audit fees alone, depending on the formulation). It is the standard quantitative metric for assessing auditor independence risk. Higher ratios suggest greater financial dependency of the auditor on non-audit revenue from the client, which may compromise audit objectivity.

Are delisted or bankrupt companies included?

Yes. The dataset is survivorship-bias-free. It includes fee disclosures from every registrant that has filed a DEF 14A with audit fee data, regardless of whether the company is currently publicly traded.

How is the dataset formatted?

The dataset uses .jsonl.gz (gzip-compressed JSON Lines) format. Each line in the decompressed file is a standalone JSON object representing one company-year audit fee record. This format supports efficient streaming, line-by-line parsing, and integration into data pipelines.

Can I identify auditor changes using this dataset?

Yes. By comparing the auditor name field across consecutive fiscal years for the same company (matched by CIK), you can detect auditor changes and measure the associated fee impact.