We are looking for help developing a good web scraping tool and building up a database from the results.
1. LinkedIn profile & company scraper - modify to obtain all energy company info on LinkedIn
The existing scraper obviously isn't built for our purposes, but it gets most of the information we need (if you keep the company info option on).
2. Company info from Corporations Canada
I did a search for "energy" on Corporations Canada, which returned 2,622 results. Each result has registration info, corporation number, address, etc. Ideally there are further listings under other search terms like "lithium", "oil", "gas", "efficiency", "solar", etc. This can be a good source of info for Canada.
3. Getting TSX / TSXV listings, then ultimately info on all publicly listed companies from the web
You can use the TMX stock screener, as well as the Google and Yahoo stock screeners. Get company info as well as earnings.
This would go on from there, but you get the idea. I think the LinkedIn one is the easiest place to start since it already has a (somewhat purpose-built) scraper; from there, try to adapt it to other websites.
4. Scraping SEDAR for oil field data (a few steps down the road)
Obviously this project has different parts to it, and I'd like to have a price for each item.
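To give a rough idea of the database side for item 2, here is a minimal Python sketch of how results from several keyword searches could be merged into one table without duplicates. It assumes the scraper hands back each result as a dict with `corp_number`, `name` and `address` keys; those field names, the `companies` table layout and the function names are all hypothetical, and the scraping step itself is not shown.

```python
import sqlite3


def init_db(conn):
    """Create the (hypothetical) companies table if it does not exist yet."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS companies (
               corp_number TEXT PRIMARY KEY,  -- one row per corporation
               name        TEXT NOT NULL,
               address     TEXT,
               keywords    TEXT NOT NULL      -- comma-separated search terms that matched
           )"""
    )


def upsert_company(conn, record, keyword):
    """Insert a company, or add the keyword to a company that is already stored."""
    row = conn.execute(
        "SELECT keywords FROM companies WHERE corp_number = ?",
        (record["corp_number"],),
    ).fetchone()
    if row is None:
        conn.execute(
            "INSERT INTO companies (corp_number, name, address, keywords) "
            "VALUES (?, ?, ?, ?)",
            (record["corp_number"], record["name"], record.get("address"), keyword),
        )
        return
    seen = row[0].split(",")
    if keyword not in seen:
        seen.append(keyword)
        conn.execute(
            "UPDATE companies SET keywords = ? WHERE corp_number = ?",
            (",".join(seen), record["corp_number"]),
        )


def load_results(conn, results_by_keyword):
    """Store every record from every keyword search, deduplicated by corp_number."""
    for keyword, records in results_by_keyword.items():
        for record in records:
            upsert_company(conn, record, keyword)
    conn.commit()
```

A company that shows up under both "energy" and "solar" would then be stored once, with both terms recorded in `keywords`. The same pattern could be reused for the other sources, with a different unique key per source (for example a ticker symbol for TSX / TSXV listings).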