Extracting all the links from a webpage is quite easy with PHP's DOM extension and cURL: fetch the page with cURL, then parse it with DOMDocument and read the href attribute of every <a> tag. Here is the script.
All links will be stored in the $urls array.
$url   = "http://google.com";
$agent = "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.13) Gecko/20080311 Firefox/2.0.0.13";

// Fetch the page with cURL
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);   // return the body instead of printing it
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);   // follow redirects
curl_setopt($ch, CURLOPT_MAXREDIRS, 2);
curl_setopt($ch, CURLOPT_USERAGENT, $agent);
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, 10);
curl_setopt($ch, CURLOPT_TIMEOUT, 20);
$html = curl_exec($ch);
curl_close($ch);

// Parse the HTML and collect every <a> tag's href attribute
libxml_use_internal_errors(true);                 // silence warnings from malformed real-world markup
$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();

$urls  = array();
$hrefs = $dom->getElementsByTagName('a');
foreach ($hrefs as $href) {
    $urls[] = $href->getAttribute('href');
}

print_r($urls);
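If you need the same logic in more than one place, it can be wrapped in a reusable function. The sketch below is my own variation, not part of the original script: the function name extract_links is hypothetical, it uses curl_setopt_array and a DOMXPath query as an alternative to getElementsByTagName, skips <a> tags without an href, and removes duplicate URLs.

// A minimal sketch; extract_links is a hypothetical helper name
function extract_links($url, $agent = 'Mozilla/5.0 (compatible; LinkExtractor/1.0)')
{
    $ch = curl_init();
    curl_setopt_array($ch, array(
        CURLOPT_URL            => $url,
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => true,
        CURLOPT_MAXREDIRS      => 2,
        CURLOPT_USERAGENT      => $agent,
        CURLOPT_CONNECTTIMEOUT => 10,
        CURLOPT_TIMEOUT        => 20,
    ));
    $html = curl_exec($ch);
    curl_close($ch);

    if ($html === false) {
        return array();                       // request failed, nothing to parse
    }

    libxml_use_internal_errors(true);         // suppress warnings from malformed HTML
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();

    // XPath selects only <a> tags that actually carry an href attribute
    $xpath = new DOMXPath($dom);
    $urls  = array();
    foreach ($xpath->query('//a[@href]') as $a) {
        $urls[] = $a->getAttribute('href');
    }

    // Drop duplicates while keeping the original order
    return array_values(array_unique($urls));
}

print_r(extract_links('http://google.com'));

Note that the returned hrefs are taken as-is, so relative links stay relative; resolve them against the page URL yourself if you need absolute URLs.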