Find All Links on a Page | CSS-Tricks

Here’s the basic principal behind spiders.

$html = file_get_contents('http://www.example.com');

$dom = new DOMDocument();
@$dom->loadHTML($html);

// grab all the on the page
$xpath = new DOMXPath($dom);
$hrefs = $xpath->evaluate("/html/body//a");

for ($i = 0; $i < $hrefs->length; $i++) {
       $href = $hrefs->item($i);
       $url = $href->getAttribute('href');
       echo $url.'<br />';
}

Comments

Oleg

# March 18, 2010

Exactly what I needed. Thanks.

Jens Törnell

# April 13, 2011

Perfect for affiliate sites!

RONIT

# June 3, 2011

WAAAAO ITS GREAT….I WAS SEARCHING FOR THIS ONLY

daniel

# August 15, 2011

I didnt understand quite how to use this? where to I type that? I’m kinda confused.. I need more explanation

Thanks

hey

Permalink to comment# November 13, 2012

EXACTLY, how to use all these php snippet

see, my method is more clear (hope that they teach us HOW)

View post on imgur.com

View post on imgur.com

try a fews of these already, I don’t know how to implement these php snippets

lande

# September 6, 2011

This post is just too good. thumbs up!!
keep up the good work ;)

Dario

# October 4, 2011

Muchas gracias por la ayuda!

Zbigniew

# November 17, 2011

Works perfect! thx!

juan

# November 24, 2011

Can someone please show me step by step in how to use this. Thank you in advance

kazi tanvir ahsan

# March 8, 2012

perfect.Was using php simple DOM but not good enough like this.!

shail.dw

# August 18, 2012

The unique power of PHP and DOM unleashed. cURL and REGEX based techinques can never match this. Though they have their own uses, ofcourse. Many thanx.

Milan

# December 2, 2012

how to follow all other children pages ?

Zen

# October 30, 2013

Thanks.

Whats about performance on xPath?

obliviga

# January 12, 2014

This is amazing. Thank you so much.

Lorenzo

# March 25, 2014

Thanks, very simple. Great!

Sif Eddine

# September 26, 2015

Hi, tnx it’s very helpful yet I have a question,
what if I have to get a link with a specific class
wil this do it? : (html/body//a.class)

alexander

# June 27, 2016

Hi
Is there a curl version of this?
I’ll be appreciate that if anyone write it with curl.
tnx

JoshuaFrancis

# December 26, 2016

Thanks Man. This is exactly what I need.

Satbir

# December 12, 2018

This code is for only one link, I need any link… like http://www.abc.com … or … http://www.xyz.com etc

Comments

Leave a Reply to Lorenzo Cancel reply