PHP Classes

Web scraper: Extract information from Web site pages

Recommend this page to a friend!
  Info   Screenshots   View files Files   Install with Composer Install with Composer   Download Download   Reputation   Support forum   Blog    
Ratings Unique User Downloads Download Rankings
StarStarStarStar 65%Total: 1,692 All time: 2,326 This week: 55Up
Version License PHP version Categories
web-scraper 1.0BSD License5.0HTML, PHP 5, Web services
Description 

Author

This class can extract information from Web site pages.

It can either retrieve a single page, a group of pages from a Web site given a base URL and a range of values that will replace template parameters in that URL, or a group of pages with given URLs.

The class can use selector path values to define elements in the page from which it will extract the relevant page content values.

Innovation Award
PHP Programming Innovation award winner
October 2011
Winner


Prize: One subscription to the PDF edition of the PHP Architect magazine
Some applications need to retrieve information that is only available to the public in Web site pages.

This class makes it easier to retrieve and parse many Web pages at once to extract information that is displayed in the same relative position of the pages.

Manuel Lemos
Picture of Jacek Lukasiewicz
Name: Jacek Lukasiewicz is available for providing paid consulting. Contact Jacek Lukasiewicz .
Classes: 6 packages by
Country: Poland Poland
Age: 49
All time rank: 2773 in Poland Poland
Week rank: 180 Up1 in Poland Poland Up
Innovation award
Innovation award
Nominee: 2x

Winner: 2x

Screenshots (1)  
  • screen.jpg
  Files folder image Files (6)  
File Role Description
Files folder imagelib (1 file)
Accessible without login Plain text file documentation.txt Doc. documentation
Accessible without login Plain text file index.php Example example using
Accessible without login Plain text file scraper.php Class scraper class
Accessible without login Plain text file test.html Data test file

  Files folder image Files (6)  /  lib  
File Role Description
  Plain text file phpQuery-onefile.php Class phpQuery library

The PHP Classes site has supported package installation using the Composer tool since 2013, as you may verify by reading this instructions page.
Install with Composer Install with Composer
 Version Control Unique User Downloads Download Rankings  
 0%
Total:1,692
This week:0
All time:2,326
This week:55Up
User Ratings User Comments (1)
 All time
Utility:83%StarStarStarStarStar
Consistency:83%StarStarStarStarStar
Documentation:75%StarStarStarStar
Examples:83%StarStarStarStarStar
Tests:-
Videos:-
Overall:65%StarStarStarStar
Rank:579
 
Easy to use, working well.
12 years ago (miron1)
77%StarStarStarStar