Portia

“Portia is an open source visual scraping tool developed by Scrapinghub to make it easier to get data from the web without needing to write a line of code.” Link: scrapinghub.com/portia/

magic.import.io

Simple scraping tool: after you enter a URL, it returns a list of the elements found on that page, if there are any. Link: magic.import.io

ScraperWiki

“Table Xtract from ScraperWiki is a data scraper that removes the need for laborious and error prone copying and pasting of tabular data from PDFs, spreadsheet files and web pages. ScraperWiki’s free ‘Code in your browser’ tool lets you do just that. Write in whatever language you like, using built-in and third-party packages, and then […]

morph.io

Replaces the discontinued ScraperWiki “Classic”; free of charge and open source. “Write your scrapers in Ruby, Python, PHP or Perl; Simple API to grab data; Schedule scrapers or run manually; Process isolation via Docker; Trivial to move scraper code and data from ScraperWiki Classic; Email alerts for broken scrapers” Link: morph.io
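
The “simple API to grab data” could look roughly like the sketch below. It only builds a request URL and makes no network call. The URL layout (owner, scraper name, `data.json`, `key` and `query` parameters) is an assumption based on how morph.io has described its API and should be checked against the site; `build_morph_url`, the owner and scraper names, and the default query are hypothetical.

```python
from urllib.parse import urlencode

# Assumed base URL and path layout for the morph.io data API (not verified here).
MORPH_API_BASE = "https://api.morph.io"


def build_morph_url(owner, scraper, api_key,
                    query="select * from data limit 10", fmt="json"):
    """Build a data-API URL for one scraper.

    owner/scraper identify the scraper (hypothetical names in the demo below),
    api_key is the caller's personal key, query is an SQL statement that is
    run against the scraper's stored data, fmt is the response format.
    """
    params = urlencode({"key": api_key, "query": query})
    return f"{MORPH_API_BASE}/{owner}/{scraper}/data.{fmt}?{params}"


if __name__ == "__main__":
    # Placeholder values only; substitute a real scraper and key.
    print(build_morph_url("example-owner", "example-scraper", "YOUR_API_KEY"))
```

The resulting URL can be fetched with any HTTP client; keeping URL construction separate from the request makes it easy to inspect or log before any data is downloaded.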

Tabula (for PDF)

“If you’ve ever tried to do anything with data provided to you in PDFs, you know how painful it is — there’s no easy way to copy-and-paste rows of data out of PDF files. Tabula allows you to extract that data into a CSV or Microsoft Excel spreadsheet using a simple, easy-to-use interface. Tabula works […]
