I am looking for some alternative data as there is very little information available around my topic.
Wikipedia, Amazon and Twitter are kinds of sources I am interested in. When it comes to Wikipedia, the data are available in tables and spreads over 100s of linked pages. Amazon product categories and reviews are also a complicated network of pages.
It is quite a humongous job to extract the data by just copying and pasting. Is there any way I can extract the data in Excel in a neat and clean form?
I think you need to parse/scrape it through python or some other programme language.
If it is a one-off job, you can pay a few bucks to someone to do the job. You can easily find on Fiver or Fivesquid. PPH is rubbish and expensive.
If you are going to do the task on a frequent basis, then you should buy the software. I would recommend WebHarvy. It has a simple and easy interface and can do most scraping jobs.
Well, it depends on what type of data is there and in which format. If the data is in the form for tables on Wikipedia, you can download the tables through this online tool. https://wikitable2csv.ggor.de/
Below is the top 40 tools and software for web scraping, crawling and parsing.
There are both paid and free tools available.
Make sure that you take care of copyrights.
Hello,
We are in the scraping business and can fulfil your custom requirements in the alternative data space. To know more about us visit http://www.krawlnet.com or write to us at info [at] Krawlnet [dot] com
Thanks,
Raj