Hello,
After reading your description, this sounds like a good fit for programmatically analyzing your files. This would let me pull the data you need out in a "table-like" format. From there, I'd be able to set any standard you want for any given data field.
Judging by the "Web Scraping" requirement, you're looking to access online webpages/databases/repositories? This can be looped through in order to handle every file/page and pull the data you need.
Therefore, you can provide as specific of a requirement for each data field you'd like and it will end up the same from every source you need the data from. Ultimately, it would appear the data will be ending up in Excel, which can also be written into programmatically.
The main benefit of a programmatic approach is speed, accuracy and standardization.
Please let me know if you have any questions as to my proposal for solving your project.
Respectfully,
Jake Thompson