Find All Links on a Page

Here's the basic principle behind spiders: fetch a page, parse it, and pull out every link.

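// put the URL of the page to scan between the quotes
$html = file_get_contents('');

$dom = new DOMDocument();
// parse the fetched markup; @ hides the warnings that invalid HTML triggers
@$dom->loadHTML($html);

// grab all the links on the page
$xpath = new DOMXPath($dom);
$hrefs = $xpath->evaluate("/html/body//a");

// print the href of each link
for ($i = 0; $i < $hrefs->length; $i++) {
       $href = $hrefs->item($i);
       $url = $href->getAttribute('href');
       echo $url.'<br />';
}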
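
For anyone unsure where this goes: it is plain PHP, so it can live in any `.php` file served by a PHP-enabled server or run from the command line. As a rough sketch of how the same idea might be packaged for reuse, the version below wraps the parsing in a function that takes an HTML string and returns the links as an array. The name `extract_links` and the sample markup are made up for illustration and are not part of the original snippet; it also swaps the `@` operator for `libxml_use_internal_errors()`, which is one common way to silence parser warnings.

```php
<?php
// Hypothetical helper (not from the original snippet): returns the href
// of every <a> element found in an HTML string.
function extract_links(string $html): array
{
    if (trim($html) === '') {
        return []; // nothing to parse
    }

    // Real-world markup is rarely valid, so collect parser warnings
    // quietly instead of printing them.
    $previous = libxml_use_internal_errors(true);

    $dom = new DOMDocument();
    $dom->loadHTML($html);

    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $links = [];

    // //a[@href] matches only the anchors that actually carry an href.
    foreach ($xpath->query('//a[@href]') as $anchor) {
        $links[] = $anchor->getAttribute('href');
    }

    return $links;
}

// Sample markup, invented for this example.
$sample = '<html><body><p><a href="/about">About</a></p>'
        . '<a name="top">No href</a>'
        . '<a href="https://example.com/">Example</a></body></html>';

foreach (extract_links($sample) as $url) {
    echo $url, PHP_EOL;
}
```

Returning an array instead of echoing keeps the output format up to the caller. A spider would take each returned URL, fetch that page, and run the same function on it.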


  1. Exactly what I needed. Thanks.

  2. Perfect for affiliate sites!

  3. RONIT


  4. daniel

    I didn't quite understand how to use this. Where do I type that? I'm kinda confused. I need more explanation.


  5. This post is just too good. Thumbs up!!
    Keep up the good work ;)

  6. Thank you very much for the help!

  7. Works perfectly! Thanks!

  8. juan

    Can someone please show me, step by step, how to use this? Thank you in advance.

  9. kazi tanvir ahsan

    Perfect. I was using PHP Simple DOM, but it wasn't as good as this!

  10. shail.dw

    The unique power of PHP and DOM unleashed. cURL and regex-based techniques can never match this, though they have their own uses, of course. Many thanks.

  11. How do I follow all the other child pages?

  12. What about the performance of XPath?

  13. obliviga

    This is amazing. Thank you so much.

  14. Lorenzo

    Thanks, very simple. Great!
