Featured Posts

Challenges One of the best kicks I get in the writing industry is that whenever I feel like I've covered all possible subjects on topic X, the next set of 10 articles is on the exact same topic. How do y'all feel...

Quickies Tip: Set your Glype tmp, logs and cache folders outside the web root. No, chmod'ing to 700 won't work, lol. WTF: Uneven Google! Useful: Fairly decent and promising project: proxycoder.com Useful:...

Triond 1,000 challenge? Yeah, right... Over the last few months I kept bumping into a "challenge" on Triond: people are either trying to get 1,000 articles by the end of 2010 or make $1,000 in any given 30 days (there are at...

EzineArticles scraper Piece of code that queries EZA for a given string and grabs an article at random from the first results page. Again, this is slow as fuck and shouldn't be used for production sites. [php]<?php ini_set('error_reporting',...

Basic scraper with PHP and DOM Who says you need UBot to run basic scraping tasks? Here's a trivial script that scrapes centurian.org. It's unoptimized (i.e. slow as fuck) but it still does a great job. [php] <?php // DOMDocument()s...


Basic scraper with PHP and DOM

Category : Uncategorized

Who says you need UBot to run basic scraping tasks? Here’s a trivial script that scrapes centurian.org. It’s unoptimized (i.e. slow as fuck) but it still does a great job.