HomeToolsFinancial StatementsBalance Sheet Extractor

// FINANCIAL STATEMENTS

Balance Sheet Extractor

Convert balance sheets from annual reports into a clean, line-item-level table with comparative periods.

balance-sheet-extractor.run
> extract --balance-sheet ar_FY24.pdf
BALANCE SHEET — STANDALONE FY24 FY23 ASSETS Property, plant & equipment 12,418 11,902 Right-of-use assets 642 518 Intangible assets 1,205 1,180 Investments 8,330 7,022 Trade receivables 4,180 3,945 Cash & equivalents 2,016 1,544 ... TOTAL ASSETS 38,801 34,720 source: ar_FY24.pdf · p.118 · Table 1

// EXAMPLE INPUT

command
$ extract --balance-sheet ar_FY24.pdf

// EXAMPLE OUTPUT

output
BALANCE SHEET — STANDALONE
                                  FY24       FY23
ASSETS
  Property, plant & equipment   12,418     11,902
  Right-of-use assets              642        518
  Intangible assets              1,205      1,180
  Investments                    8,330      7,022
  Trade receivables              4,180      3,945
  Cash & equivalents             2,016      1,544
  ...
TOTAL ASSETS                    38,801     34,720

source: ar_FY24.pdf · p.118 · Table 1

// EXTRACTION LOGIC

Schedule III / IFRS line items are mapped to a canonical ontology so 'Property, plant and equipment', 'Net fixed assets', and 'PP&E' resolve to the same row.

// SOURCE-LINKED OUTPUT

Every figure carries a source pointer of the form `{file, page, table_id, row_id, cell_id}` returned in the export.

anchor (per value)
{ file, page, table_id, row_id, cell_id, label, value, unit, period }

// FAQ

Are line items normalized across companies?

Yes — a canonical Schedule III / IFRS ontology maps synonyms (PP&E, Net fixed assets, etc.) to the same canonical row.

Does it pick up Notes referenced in the balance sheet?

The Note number next to each line item is captured as a separate column and links to the relevant Notes page.

Can it handle scanned PDFs?

Yes. The pipeline runs OCR on image-based PDFs before layout parsing.

// EARLY ACCESS

Get early access to the Balance Sheet Extractor

Paper Data is currently in private beta. Request access to start converting your financial documents into source-linked tables.