WWW::Search::WebCrawler - class for searching WebCrawler |
WWW::Search::WebCrawler - class for searching WebCrawler
use WWW::Search; my $oSearch = new WWW::Search('WebCrawler'); my $sQuery = WWW::Search::escape_query("+sushi restaurant +Columbus Ohio"); $oSearch->native_query($sQuery); while (my $oResult = $oSearch->next_result()) print $oResult->url, "\n";
This class is a WebCrawler specialization of WWW::Search. It handles making and interpreting WebCrawler searches http://www.WebCrawler.com.
This class exports no public interface; all interaction should be done through the WWW::Search manpage objects.
To make new back-ends, see the WWW::Search manpage.
native_setup_search
is called (from WWW::Search::setup_search
)
before we do anything. It initializes our private variables (which
all begin with underscore) and sets up a URL to the first results
page in {_next_url}
.
native_retrieve_some
is called (from WWW::Search::retrieve_some
)
whenever more hits are needed. It calls WWW::Search::http_request
to fetch the page specified by {_next_url}
.
It then parses this page, appending any search hits it finds to
{cache}
. If it finds a ``next'' button in the text,
it sets {_next_url}
to point to the page for the next
set of results, otherwise it sets it to undef to indicate we''re done.
Please tell the author if you find any!
This module adheres to the WWW::Search
test suite mechanism.
See $TEST_CASES below.
As of 1998-03-16, WWW::Search::WebCrawler
is maintained by Martin Thurn
(MartinThurn@iname.com)
WWW::Search::WebCrawler
was originally written by Martin Thurn
based on WWW::Search::HotBot
.
THIS SOFTWARE IS PROVIDED ``AS IS'' AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.
If it's not listed here, then it wasn't a meaningful or released version.
now uses hash_to_cgi_string()
Remove extraneous HTML from description (thanks to Jim Smyser jsmyser@bigfoot.com)
Now uses split_lines function
1998-08-20: New format of www.webcrawler.com output.
\n changed to \012 for MacPerl compatibility
1998-05-29: New format of www.webcrawler.com output.
First publicly-released version.
WWW::Search::WebCrawler - class for searching WebCrawler |