用perl获取google搜索的结果

来源:互联网 发布:印度支那联邦 知乎 编辑:程序博客网 时间:2024/05/01 22:17

以前google的API还开放的时候,可以直接使用CPAN上的包来进行获取,现在只能自己码代码了。。。


代码如下

sub google_search{my $keyword = $_[0];print "keyword is $keyword\n";#we must first convert our search term into utf8, then can we use uri_escape to encode the term to search-engine like encodingmy $search = encode("utf-8",decode("gb2312", $keyword)); $search = uri_escape($search);print "keyword is $search\n";my $ua = new LWP::UserAgent; #Construct the request object$ua->agent( "Mozilla/5.0 (X11; Linux i686; rv:2.0.0) Gecko/20100101" ); #Google checks UAs, so set this to something common$ua->timeout( 10 ); #Give up after 10 seconds of waiting$ua->max_redirect( 0 ); #We only want the URL that google sends, so stop after we submit$ua->env_proxy; #Load proxy info from the environment, if any is setmy $response = $ua->get("http://www.google.com/search?q=".$search #The URL to request) or die( $! );#my $firstUrl = $response->contents; #This contains the link of the result pagereturn encode("utf8", $response->decoded_content()); #convert to utf8}
这里用到了3个package:

LWP::UserAgent 用来下载网页

URI::Escape 用来对中文进行转义,生成google链接中的奇奇怪怪的编码

use Encode 对我们输入的检索关键词编码至utf8

原创粉丝点击