Feb-12-2020, 11:21 AM
I have a lot of company annual reports in PDF format, and they are scanned copies (an example is in link 1 below). I need to extract data from the financial statements from the PDF, such as 'revenue' and other items on page 13 of the example.
Another task is to extract names of shareholders and their shareholding figures from documents (an example is in link 2 below).
Can anybody help or tell me what is the most efficient way to do that? Thank you very much.
Link 1:
https://beta.companieshouse.gov.uk/compa...download=0
Link 2:
https://beta.companieshouse.gov.uk/compa...download=0
Another task is to extract names of shareholders and their shareholding figures from documents (an example is in link 2 below).
Can anybody help or tell me what is the most efficient way to do that? Thank you very much.
Link 1:
https://beta.companieshouse.gov.uk/compa...download=0
Link 2:
https://beta.companieshouse.gov.uk/compa...download=0