// FINANCIAL STATEMENTS
Balance Sheet Extractor
Convert balance sheets from annual reports into a clean, line-item-level table with comparative periods.
> extract --balance-sheet ar_FY24.pdf
BALANCE SHEET — STANDALONE FY24 FY23 ASSETS Property, plant & equipment 12,418 11,902 Right-of-use assets 642 518 Intangible assets 1,205 1,180 Investments 8,330 7,022 Trade receivables 4,180 3,945 Cash & equivalents 2,016 1,544 ... TOTAL ASSETS 38,801 34,720 source: ar_FY24.pdf · p.118 · Table 1
// EXAMPLE INPUT
$ extract --balance-sheet ar_FY24.pdf
// EXAMPLE OUTPUT
BALANCE SHEET — STANDALONE
FY24 FY23
ASSETS
Property, plant & equipment 12,418 11,902
Right-of-use assets 642 518
Intangible assets 1,205 1,180
Investments 8,330 7,022
Trade receivables 4,180 3,945
Cash & equivalents 2,016 1,544
...
TOTAL ASSETS 38,801 34,720
source: ar_FY24.pdf · p.118 · Table 1// EXTRACTION LOGIC
Schedule III / IFRS line items are mapped to a canonical ontology so 'Property, plant and equipment', 'Net fixed assets', and 'PP&E' resolve to the same row.
// SOURCE-LINKED OUTPUT
Every figure carries a source pointer of the form `{file, page, table_id, row_id, cell_id}` returned in the export.
{ file, page, table_id, row_id, cell_id, label, value, unit, period }// FAQ
Are line items normalized across companies?
Yes — a canonical Schedule III / IFRS ontology maps synonyms (PP&E, Net fixed assets, etc.) to the same canonical row.
Does it pick up Notes referenced in the balance sheet?
The Note number next to each line item is captured as a separate column and links to the relevant Notes page.
Can it handle scanned PDFs?
Yes. The pipeline runs OCR on image-based PDFs before layout parsing.
// RELATED TOOLS
Financial Statements
Profit & Loss Extractor
Turn statements of profit and loss into structured income-statement tables with comparative periods.
Financial Statements
Cash Flow Statement Extractor
Extract operating, investing, and financing cash flow sections into a clean, structured table.
Financial Statements
Financial Statement Extractor
Pull the three core financial statements — Balance Sheet, P&L, and Cash Flow — into a clean, comparable workbook.
// EARLY ACCESS
Get early access to the Balance Sheet Extractor
Paper Data is currently in private beta. Request access to start converting your financial documents into source-linked tables.
