
// PDF TO EXCEL

Shareholding Pattern Extractor

Convert shareholding pattern disclosures into a clean, category-level table.

// EXAMPLE INPUT

command
$ extract shareholding_Q4FY24.pdf

// EXAMPLE OUTPUT

output
SHAREHOLDING PATTERN — 31-Mar-2024
category                            shares (cr)   %_total
Promoters & Promoter Group                12.40     45.20
  Indian                                  10.10     36.80
  Foreign                                  2.30      8.40
Public                                    14.90     54.30
  Mutual Funds                             4.21     15.34
  Foreign Portfolio Investors              5.62     20.47
  Insurance Companies                      1.10      4.01
  Retail                                   3.97     14.48
Non-Promoter Non-Public                    0.13      0.50
TOTAL                                     27.43    100.00

source: shareholding_Q4FY24.pdf · p.1

// EXTRACTION LOGIC

The extractor recognizes the standard SEBI shareholding pattern format. Sub-category indentation is stored in a 'level' column (0 for top-level categories, 1 for their sub-categories), so the hierarchy can be queried.
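As a rough illustration of the 'level' column, the sketch below reads rows laid out like the example output and derives the level from leading spaces. It is not the extractor's actual code: the two-space indent width, the regular expression, and the names `parse_rows`, `shares_cr`, and `pct_total` are assumptions made for this example.

```python
import re

# One label followed by two decimal columns; leading spaces are captured
# separately so they can be turned into a hierarchy level.
ROW = re.compile(
    r"^(?P<indent> *)(?P<label>\S.*?)\s+(?P<shares>\d+\.\d+)\s+(?P<pct>\d+\.\d+)\s*$"
)

SAMPLE = """\
category                            shares (cr)   %_total
Promoters & Promoter Group                12.40     45.20
  Indian                                  10.10     36.80
  Foreign                                  2.30      8.40
Public                                    14.90     54.30
  Mutual Funds                             4.21     15.34
"""


def parse_rows(text, indent_width=2):
    """Return one dict per data row; header lines without numbers are skipped."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match is None:
            continue  # not a data row (for example the column header)
        rows.append({
            "level": len(match.group("indent")) // indent_width,  # assumed 2 spaces per level
            "category": match.group("label"),
            "shares_cr": float(match.group("shares")),
            "pct_total": float(match.group("pct")),
        })
    return rows


rows = parse_rows(SAMPLE)
for row in rows:
    print(row["level"], row["category"], row["shares_cr"], row["pct_total"])
```

With the level stored as a plain integer, filtering to top-level categories is a simple `level == 0` condition.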

// SOURCE-LINKED OUTPUT

Each row carries the source page and section identifier. Locked-in, pledged, and encumbered shares are kept on separate sheets.

anchor (per value)
{ file, page, table_id, row_id, cell_id, label, value, unit, period }
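One way to picture the anchor is as a small immutable record. The sketch below mirrors the nine field names listed above; the field types and the identifier formats (`t1`, `r1`, `r1c2`) are assumptions for illustration, not the tool's documented schema.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class Anchor:
    """Source anchor for a single extracted value (types are assumed)."""
    file: str
    page: int
    table_id: str
    row_id: str
    cell_id: str
    label: str
    value: float
    unit: str
    period: str


# Values taken from the example output; the id strings are hypothetical.
anchor = Anchor(
    file="shareholding_Q4FY24.pdf",
    page=1,
    table_id="t1",
    row_id="r1",
    cell_id="r1c2",
    label="Promoters & Promoter Group",
    value=12.40,
    unit="cr shares",
    period="31-Mar-2024",
)

print(asdict(anchor))
```

Keeping the anchor as a flat record means it can be written alongside each value as extra spreadsheet columns.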

// FAQ

Are pledged and encumbered shares captured?

Yes. They are kept on dedicated sheets so promoter pledge percentage can be calculated.
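A minimal sketch of that calculation, assuming the pledge percentage is defined as pledged promoter shares divided by total promoter shares. The function name is hypothetical and the pledged figure is made up for illustration; only the promoter holding comes from the example output.

```python
def pledge_pct_of_promoter_holding(pledged_shares, promoter_shares):
    """Pledged promoter shares as a percentage of total promoter shares."""
    if promoter_shares <= 0:
        raise ValueError("promoter_shares must be positive")
    return 100.0 * pledged_shares / promoter_shares


promoter_cr = 12.40  # Promoters & Promoter Group, from the example output
pledged_cr = 1.24    # hypothetical pledged holding, illustration only

pct = pledge_pct_of_promoter_holding(pledged_cr, promoter_cr)
print(f"{pct:.2f}%")  # prints 10.00%
```

Some disclosures also report pledged shares as a percentage of total share capital; that variant would divide by the TOTAL row instead.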

Is the sub-category hierarchy preserved?

Yes. The indentation level is stored as a column so parent and child rows can be reconstructed.
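Reconstructing parents from a level column can be done with a small stack, as in the sketch below. It assumes rows arrive in document order and that the level never jumps by more than one; the `row_id` and `parent_id` keys and the `attach_parents` name are hypothetical.

```python
def attach_parents(rows):
    """Add row_id and parent_id to each row, using only the 'level' column."""
    stack = []  # stack[i] holds the row_id of the latest row seen at level i
    result = []
    for row_id, row in enumerate(rows):
        level = row["level"]
        del stack[level:]  # drop entries at this level or deeper
        parent_id = stack[-1] if stack else None
        result.append({**row, "row_id": row_id, "parent_id": parent_id})
        stack.append(row_id)
    return result


# Categories and levels taken from the example output.
rows_with_level = [
    {"level": 0, "category": "Promoters & Promoter Group"},
    {"level": 1, "category": "Indian"},
    {"level": 1, "category": "Foreign"},
    {"level": 0, "category": "Public"},
    {"level": 1, "category": "Mutual Funds"},
]

linked = attach_parents(rows_with_level)
for row in linked:
    print(row["row_id"], row["parent_id"], row["category"])
```

Here "Indian" and "Foreign" point back to row 0, and "Mutual Funds" points to row 3.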

Can multiple quarters be processed together?

Yes. Multiple shareholding PDFs can be combined into a long-form table with the as-of date as a column.
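The long-form layout could be assembled roughly as follows. This is a sketch only: `to_long_form` and the `as_of` key are assumed names, and the 31-Dec-2023 rows are placeholder values invented for the example, not real disclosures.

```python
from datetime import date


def to_long_form(extracts):
    """Flatten {as_of_date: rows} into one list with an 'as_of' column."""
    long_rows = []
    for as_of, rows in sorted(extracts.items()):  # oldest quarter first
        for row in rows:
            long_rows.append({"as_of": as_of.isoformat(), **row})
    return long_rows


extracts = {
    # From the example output.
    date(2024, 3, 31): [
        {"category": "Promoters & Promoter Group", "pct_total": 45.20},
        {"category": "Public", "pct_total": 54.30},
    ],
    # Placeholder values for an earlier quarter, illustration only.
    date(2023, 12, 31): [
        {"category": "Promoters & Promoter Group", "pct_total": 45.00},
        {"category": "Public", "pct_total": 54.50},
    ],
}

long_table = to_long_form(extracts)
for row in long_table:
    print(row["as_of"], row["category"], row["pct_total"])
```

With one row per category per date, quarter-on-quarter changes become a group-by on `category` rather than a comparison across separate sheets.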

// EARLY ACCESS

Get early access to the Shareholding Pattern Extractor

Paper Data is currently in private beta. Request access to start converting your financial documents into source-linked tables.