Filing Download & PDF Generator API

The Filing Download API supports downloading all 20 million SEC filings, and all 100 million exhibits, attachments, and other EDGAR files from any of the 800,000 filers between 1993/1994 to the present. All EDGAR form types are accessible and the API supports up to 60,000 downloads per API key within a five-minute window.

The Download API has a total throughput capacity of over 1 million requests per second, and provides access to more than 1,000 terabytes of SEC filings data, including original filing publications in HTML/TXT format, filing index pages, SGML headers, complete submission files, PDFs, images, XML and XBRL files, Excel files, and many more.

New EDGAR files (filings, exhibits, etc.) are accessible via the Download API in approximately 300 milliseconds after publication on EDGAR.

Mapping of EDGAR Filing URL to Download API and New Endpoint
┌──────────────────────────────────────────────────────────────────────────────┐ │ Original EDGAR File URL │ │ 10 requests / second │ └──────────────────────────────────────────────────────────────────────────────┘ https://www.sec.gov/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm │ │ ▼ ┌──────────────────────────────────────────────────────────────────────────────┐ │ Download API URL │ │ 50 requests / second │ └──────────────────────────────────────────────────────────────────────────────┘ https://archive.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm │ │ ▼ ┌──────────────────────────────────────────────────────────────────────────────┐ │ New Download API URL (Beta) │ │ Capacity of 1+ million requests / second │ └──────────────────────────────────────────────────────────────────────────────┘ https://edgar-mirror.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/0000815094/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm

All 400+ EDGAR form types are accessible, including annual and quarterly reports (Forms 10-K, 10-Q, 20-F, 40-F), event-driven disclosures (Forms 8-K, 6-K), registration statements (S-1, S-3, S-4, S-8, F-1, etc.), prospectuses (424B2, 424B4, 424B5, etc.) insider trading reports (Forms 3, 4, 5), proxy statements (DEF 14A), voting records (PRE 14A, N-PX), and any other type. Examples of available exhibits are Exhibit 1 (Underwriting Agreement), Exhibit 8 (Opinion Re Tax Matters), Exhibit 16 (Letter Re Change in Certifying Accountant), Exhibit 21 (Subsidiaries), and all other existing exhibits.

All original file formats and contents are preserved and provided as-is without modifications. Content types include HTML, XML, XSD, XBRL, TXT, PDF, Excel, Word, ZIP, and image files. In addition, HTML, XML, and TXT filings and exhibits can be converted to PDF using the PDF Generator API by supplying the source URL and receiving the generated PDF in response.

Dataset size:
All 20+ million SEC EDGAR filings, and all 100+ million exhibits, and attachments filed since 1993 to present. Includes HTML, TXT, XML, PDF, Excel files, images, and more. Covers all EDGAR form types, such as 10-K, 10-Q, 8-K, S-1, DEF 14A.
Data update frequency:
New filings, exhibits, and attachments are available in less than 600 milliseconds after they are published on EDGAR.
Survivorship bias free:
Yes. All filings, exhibits, and attachments are available from all 800,000+ EDGAR filers, including those that are active and those that have ceased to file.

Use Cases

The Filing Download and PDF Generator APIs are often used alongside the real-time filing Stream API or the pull-based filing Query API. Filings or exhibit URLs pointing to the sec.gov source are typically retrieved from the metadata returned by the Stream or Query API, and these URLs are then used to download the content via the Download API or convert it to PDF using the PDF Generator API. Use cases include:

  • Download historical filings, such as annual reports on Form 10-K, to train LLMs or perform textual analysis
  • Display EDGAR filings in different formats on investor relationship websites
  • Access and monitor specific filing exhibits, such as material contracts, for compliance
  • Download original XBRL files for financial analysis
  • Convert HTML, XML, and TXT filings to PDF for user exports

Download API

Filings, exhibits and attachments can be downloaded by calling the Download API endpoint with the path to the original file on SEC EDGAR. The Download API returns the original file content, such as HTML, XML or TXT.

API Endpoint

https://archive.sec-api.io

Supported HTTP Request Method: GET

New Endpoint (Beta)
https://edgar-mirror.sec-api.io
Rate limit:Up to 200 requests/second (60,000 files per 5-minute interval)
Bandwidth costs:Free during beta for early adopters. Users who join during beta will keep free bandwidth forever.

This endpoint is a drop-in replacement for:

  • the existing Download API. Replace https://archive.sec-api.io with https://edgar-mirror.sec-api.io to access all EDGAR filings, exhibits, and attachments with improved rate limits.
  • the original SEC EDGAR file URLs. Simply replace https://www.sec.gov with https://edgar-mirror.sec-api.io to access the same content with higher rate limits.

The response content type depends on the requested file. For instance, requesting a HTML file returns a text/html MIME content type. The API supports all MIME types, such as text/xml for XML files or images/png for images. By default, transmitted content between the Download API and a client application is compressed with gzip. Typically, the response is automatically decompressed by the client application, such as a browser or a Python/Node.js script. In case of low-level HTTP requests, the decompression needs to be handled by the client application.

Authentication

Two authentication methods are available. Choose the method that best fits your use case:

  • Authorization header: Set the API key as the value of the Authorization header. Do not add "Bearer" or any other words in front of the API key. Example: Authorization: YOUR_API_KEY
  • Query parameter: Set the API key as the token query parameter.
    Example: https://archive.sec-api.io/path/to/edgar-file?token=YOUR_API_KEY. Instead of performing requests to https://archive.sec-api.io/path/to/edgar-file, you always append ?token=YOUR_API_KEY to the end of the URL.

When using the Python or Node.js SDK, the API key is automatically added to the request headers.

Request Parameters

The Download API accepts the file path of the original SEC EDGAR URL. A generic example looks like this:

https://archive.sec-api.io/<cik>/<accession-number>/<filename>

New Endpoint (Beta)

https://edgar-mirror.sec-api.io/<cik>/<accession-number>/<filename>
  • <cik>: The filer's CIK, without leading zeros, e.g. 815094.
  • <accession-number>: The filing's accession number, using only numeric characters (hyphens removed). For example, 000156459021006205 instead of 0001564590-21-006205.
  • <filename>: The name of the file (any file type is supported).

All three values can be obtained from the original file URL and the original file path on SEC EDGAR can always be used directly with the Download API.

For example, to download the filing with the URL:

        https://www.sec.gov/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm

The path after data/ is appended to the Download API endpoint like this:

        https://archive.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm

Similarly, to download the same filing using the new beta endpoint, the URL would be:

        https://edgar-mirror.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm

Mapping of original EDGAR URL to Download API and new beta endpoint
┌───────────────────────────────────────────────────────────────────────────┐ │ Original EDGAR File URL │ └───────────────────────────────────────────────────────────────────────────┘ https://www.sec.gov/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm │ │ ▼ ┌───────────────────────────────────────────────────────────────────────────┐ │ Download API URL │ └───────────────────────────────────────────────────────────────────────────┘ https://archive.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm │ │ ▼ ┌───────────────────────────────────────────────────────────────────────────┐ │ New Endpoint (Beta) │ └───────────────────────────────────────────────────────────────────────────┘ https://edgar-mirror.sec-api.io/815094/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/0000815094/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/000156459021006205/abmd-8k_20210211.htm https://edgar-mirror.sec-api.io/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm

PDF Generator API

Download any SEC filing or exhibit as a PDF file.

Since most SEC filings, exhibits and attachments are not published in PDF format, converting the original content is necessary to download them as PDFs. The PDF Generator API offers this functionality by converting HTML, XML, or text-based filings and exhibits into PDFs while preserving the original formatting, including images and tables. The API supports downloading all EDGAR form types as PDFs, including filings and exhibits such as Form 10-K, 10-Q, 8-K, DEF14A, and more, while preserving the original formatting.

Images are optimized and scaled for high-quality printing, such as in proxy statements, and invisible inline XBRL tags are removed to reduce PDF file size and prevent unnecessary bloating. All original content is preserved without alteration. The PDFs are designed to be easily shareable, printable, and suitable for archiving.

Legacy text-based filings (.txt) were not originally designed for PDF-printable output. As a result, single table rows in these older filings may span multiple lines and may not fit within the standard A4 PDF page width. NLP, LLM or RAG-based tasks might encounter difficulties when parsing such files as PDFs. In these cases, it is recommended to use the original text content instead of the PDF version for better accuracy.

API Endpoint

Filings and exhibits can be converted to and downloaded as PDF by calling the following PDF Generator API endpoint with the URL of the original filing or exhibit as a query parameter:

https://api.sec-api.io/filing-reader

Supported HTTP Method: GET

Response content type: application/pdf

The API returns the filing or exhibit as a PDF file, preserving the original formatting, including images and tables.

Request Parameters

  • token (required) - Your API key.
  • url (required) - URL of the filing or exhibit attachement. All file types are supported (HTML, XML, TXT, etc). Must be a valid "sec.gov/Archives" URL. Remove inline XBRL parameters from the URL before using it with the API by replacing /ix?doc= with an empty string.
    Example URLs:
    https://www.sec.gov/Archives/edgar/data/1833764/000089924321006812/xslF345X02/doc3.xml
    https://www.sec.gov/Archives/edgar/data/815094/000156459021006205/abmd-8k_20210211.htm

Example

Replace YOUR_API_KEY with your actual API key, and copy and paste any of the following example URLs into a browser to download the filing as a PDF.

URL to download a Form 10-K filing as PDF:
    https://api.sec-api.io/filing-reader?
    token=YOUR_API_KEY&
    url=https://www.sec.gov/Archives/edgar/data/320193/000032019323000106/aapl-20230930.htm

URL to download a Form 4 filing in XML format as PDF:
    https://api.sec-api.io/filing-reader?
    token=YOUR_API_KEY&
    url=https://www.sec.gov/Archives/edgar/data/1833764/000089924321006812/xslF345X02/doc3.xml