WWW::CheckSite::SpiderWWW::CheckSite::Spider is a base class for spidering the web. | |
Download |
WWW::CheckSite::Spider Ranking & Summary
Advertisement
- License:
- Perl Artistic License
- Price:
- FREE
- Publisher Name:
- Abe Timmerman
- Publisher web site:
- http://search.cpan.org/~abeltje/WWW-CheckSite-0.018/lib/WWW/CheckSite/Spider.pm
WWW::CheckSite::Spider Tags
WWW::CheckSite::Spider Description
WWW::CheckSite::Spider is a base class for spidering the web. WWW::CheckSite::Spider is a base Perl class for spidering the web.SYNOPSIS use WWW::CheckSite::Spider; my $sp = WWW::CheckSite::Spider->new( uri => 'http://www.test-smoke.org', ); while ( my $page = $sp->get_page ) { # $page is a hashref with basic information }or to spider a site behind HTTP basic authentication: package BA_Mech; use base 'WWW::Mechanize'; sub get_basic_credentials { ( 'abeltje', '********' ) } package main; use WWW::CheckSite::Spider; my $sp = WWW::CheckSite::Spider->new( ua_class => 'BA_Mech', uri => 'http://your.site.with.ba/', ); while ( my $page = $sp->get_page ) { # $page is a hashref with basic information }This module implements a basic web-spider, based on WWW::Mechanize. It takes care of putting pages on the "still-to-fetch" stack. Only uri's with the same origin will be stacked, taking the robots-rules on the server into account. Requirements: · Perl
WWW::CheckSite::Spider Related Software