用 Web::Scraper 解析 html / xml 数据

教程:http://e8y.net/mag/013-web-scraper/

注意:tag如果有大写字母,例如这种,在Web::Scraper中写解析锚点时,要用小写的message,否则会提取失败

#!/usr/bin/perl
use strict;
use warnings;
use Web::Scraper;
use Data::Dumper;
 
my $html =q[
<Message name="testn" value="testv">
<Sub_Message title="abc">just abc</Sub_Message>
<Sub_Message title="efg">just efg</Sub_Message>
</Message>];
 
my $scraper = scraper {
    process_first 'message', 'msg_name' => '@name', 'msg_value' => '@value';

    process 'sub_message' , 'sub_message[]' => {
          'title' => '@title',
          'content' => 'TEXT',
    };
};
 
my $res = $scraper->scrape(\$html);
print Dumper($res);


blog comments powered by Disqus

Published

26 February 2013

Tags