In this collections post, we list web crawling and indexing libraries for PHP that let you crawl, index, and parse web pages from your PHP project. As always, this list is updated regularly. Email us the URL if you would like your library to be included.

Web Spiders, Web Crawlers and Indexers

Libraries for crawling, scraping, and indexing websites.

  • Chrome PHP – Instrument headless Chrome/Chromium instances from PHP.
  • DiDOM – A simple, fast HTML parser.
  • Embed – Extracts information (such as titles, images, and embed codes) from any web page or service.
  • Goutte – A simple web scraper and parser.
  • Symfony Panther – A browser testing and web crawling library for PHP and Symfony.
  • PHP Spider – A configurable and extensible PHP web spider.
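
Whichever library you pick, the core step of any crawler is the same: parse a page and collect the links to visit next. The sketch below illustrates that step using only PHP's bundled DOM extension, not any of the libraries listed above. The `extractLinks` helper and the sample HTML are made up for illustration; the libraries above add HTTP fetching, CSS selectors, JavaScript rendering, and politeness controls on top of this idea.

```php
<?php
// Hypothetical helper (not part of any library listed above): collect the
// href of every <a> element in an HTML string using PHP's bundled DOM extension.
function extractLinks(string $html): array
{
    $doc = new DOMDocument();

    // Real-world markup is rarely valid, so buffer parse warnings
    // internally instead of printing them, then restore the old setting.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        if ($href !== '') {
            $links[] = $href;
        }
    }

    // Drop duplicates and reindex so the result is a plain list.
    return array_values(array_unique($links));
}

// Inline sample page so the sketch runs without network access.
$html = '<html><body>'
    . '<a href="/about">About</a>'
    . '<a href="/blog">Blog</a>'
    . '<a href="/about">About again</a>'
    . '<a>No href</a>'
    . '</body></html>';

$links = extractLinks($html);
print_r($links);
```

A real crawler would feed each collected link back into a queue, resolve relative URLs against the page's base URL, and keep track of pages it has already visited.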

Tags: PHP, Web Spiders, Web Indexers, Web Crawlers, DiDOM, Embed, Goutte, Symfony Panther, PHP Spider

Image Credits: Unsplash