For this project I would use an RPA (Robotic Process Automation) and PDF parsing tool that I built.
My system is currently used to automate the download of complex IT and telecom PDF invoices. It then parses and normalizes the data in each PDF, using a parsing engine built specifically for that purpose.
This software can easily be adapted to read bank statements.
Results can be stored in any file format, such as Excel, CSV, XML, TXT, or others. If desired, we can also store results in a database for further analysis.
Everything is configurable through scripts, and adding the logic to parse a new PDF normally takes between 4 and 6 hours, depending on complexity.
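To give a rough idea of what a parsing rule and a CSV export can look like, here is a minimal Python sketch. It is an illustration only and not the actual engine: the line layout, the field names, and the functions parse_line and to_csv are all hypothetical, and a real bank statement would need its own rules.

```python
import csv
import io
import re
from decimal import Decimal

# Hypothetical statement line layout: "2018-08-01  ATM WITHDRAWAL  -120.50"
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2})\s+(.+?)\s+(-?[\d,]+\.\d{2})$")


def parse_line(line):
    """Return a normalized record for a transaction line, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None  # headers, footers, page numbers, etc.
    date, description, amount = match.groups()
    return {
        "date": date,
        "description": description,
        # Drop thousands separators so the amount is a plain decimal number.
        "amount": Decimal(amount.replace(",", "")),
    }


def to_csv(rows):
    """Serialize normalized records to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["date", "description", "amount"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    text_lines = [
        "Page 1 of 3",
        "2018-08-01  ATM WITHDRAWAL  -120.50",
        "2018-08-02  SALARY  1,234.00",
    ]
    records = [r for r in map(parse_line, text_lines) if r is not None]
    print(to_csv(records))
```

Lines that do not match the rule are skipped, so page headers and footers do not end up in the output. The same records could just as well be written to Excel, XML, or a database table.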
You can see a video demonstrating this software on my profile here:
https://www.freelancer.com/u/rvaldezv?ref_project_id=17567481
I do want to clarify that I don't hack sites to steal information, since that is not ethical.
What I do is use the robot to automate the tools and options already available on the site, in order to read and normalize data for further use.
That being said, if awarded, we would need to work together to define criteria that make sense, so we can get the information in the format you are looking for.
Looking forward to working with you.
Please let me know if you have any questions for me.
Have a great day.
RV