HomeToolsInvestor PresentationsInvestor Presentation Extractor

// INVESTOR PRESENTATIONS

Investor Presentation Extractor

Convert investor presentation decks into structured slide-level data — titles, bullets, tables, and KPI tiles.

investor-presentation-extractor.run
> extract investor_deck_Q4FY24.pdf
Slides: 42 Title slide: p.1 Business overview: p.2-6 Segment performance: p.7-15 KPI tiles detected: 38 Tables detected: 12 Charts annotated: 18 Output: deck.xlsx (per-slide rows) + kpis.json

// EXAMPLE INPUT

command
$ extract investor_deck_Q4FY24.pdf

// EXAMPLE OUTPUT

output
Slides: 42
Title slide:              p.1
Business overview:        p.2-6
Segment performance:      p.7-15
KPI tiles detected:       38
Tables detected:          12
Charts annotated:         18
Output: deck.xlsx (per-slide rows) + kpis.json

// EXTRACTION LOGIC

Each slide is parsed into title, body bullets, tables, and visual tiles. KPI tiles (large number + label) are detected and pulled into a flat KPIs sheet.

// SOURCE-LINKED OUTPUT

Every extracted value carries the slide number and a bounding-box reference, so any KPI can be opened back to its exact tile in the deck.

anchor (per value)
{ file, page, table_id, row_id, cell_id, label, value, unit, period }

// FAQ

Are KPI tiles separated from regular text?

Yes. Tiles with a prominent number + label are detected and exported into a dedicated KPI sheet.

Does it work on image-heavy decks?

Yes. Text inside images and infographics is OCR'd before being attached to its slide.

Can I export by slide range?

Yes — extraction can be scoped to a slide range or to slides matching a section heading.

// EARLY ACCESS

Get early access to the Investor Presentation Extractor

Paper Data is currently in private beta. Request access to start converting your financial documents into source-linked tables.