Web scraping has traditionally been done using scripting languages such as Python or Perl. These scripting languages provide a good platform for web scraping with easy access to important web grabbing tools such as XPATH and regular expressions. However, most people without a strong IT background will find these tools very difficult to learn and even experienced web scrapers will find these tools very time consuming to use. When using standard scripting tools for data extraction, the web scraping process is normally implemented this way: 1. The script downloads the HTML page where you want to start extracting data. 2.… Read More
August 8th, 2010
Admin
Posted in
Tags:
