English 中文(简体)
使用困难的 Regex 从 XML 到文件 Perl 写入字符串
原标题:Perl write String from XML to File using difficult Regex
  • 时间:2012-05-25 11:48:13
  •  标签:
  • regex
  • perl

i 正在拥有一个 XML 文件, 需要将其传送到 Perl 列表( 不使用 XSLT ) 。

这是我的 XML :

...
<XMLTAG ID="1" name="NAME1" status="0" date1="24.05.2012 13:37:00" date2="25.05.2012 13:37:00" />
<XMLTAG ID="2" name="NAME2" status="1" date1="24.05.2012 13:37:00" date2="25.05.2012 13:37:00" />
<XMLTAG ID="3" name="NAME3" status="0" date1="24.05.2012 13:37:00" date2="25.05.2012 13:37:00" />
...

我到现在为止得到了什么:

my $input = in.xml;
my $output = out.txt;

# open input
open( INPUT, $input )
  || die "Can t find $input: $_";

# open output
open( OUTPUT, ">$output" )
  || die "Can t find $output: $_";

    # run until perl returns undef (at the end of the file)
    while (<INPUT>) {
        if ($_ == /date1="[0-3]?[0-9].[0-3]?[0-9].(?:[0-9]{2})?[0-9]{2} [0-5][0-9]:[0-5][0-9]:[0-5][0-9]"/) {
        print OUTPUT $_;};
    }
    close(INPUT);
    close(OUTPUT);

输出文件应该像这个样子 :

date1="24.05.2012 13:37:00"
date1="24.05.2012 13:37:01"
date1="24.05.2012 13:37:02"
...

Thanks in advance, Marley

最佳回答
use XML::LibXML qw();
my $dom = XML::LibXML->load_xml(location =>  in.xml );
printf qq(date1="%s"
), $_->getAttribute( date1 )
    for $dom->findnodes( //XMLTAG );
问题回答

您应该使用适当的 XML 解析模块。 有很多可用的模块, 但这里使用 < a href=" https:// metacpan. org/ module/ XML% 3a% 3aSmart" rel=" nofollow"\\\ code> XML:: Smart 。

这不是我愿意选择的解决方案 但我想知道你为什么注销XSLT?

use strict;
use warnings;

use XML::Smart;

my $input =  in.xml ;
my $output =  out.txt ;

open my $out,  > , $output or die qq(Can t open output file "$output": $!);

my $xml = XML::Smart->new($input);
my $text = $xml->{root}{XMLTAG};

my $xmltags = $xml->{root}{XMLTAG};

for my $tag (@$xmltags) {
  print $out qq(date1="$tag->{date1}"
);
}

<强 > 输出

date1="24.05.2012 13:37:00"
date1="24.05.2012 13:37:00"
date1="24.05.2012 13:37:00"

使用 < a href=> "http://p3rl.org/XML%3a%3aXSH2" rel="nofollow">XML::XSH2 :

open in.xml ;
ls //@date1 ;

您可以使用非贪婪的匹配方式, 像这样 :

if ($_ =~ /(date1=".*?")/ ) {
       print OUTPUT "$1
";
    }

尝试 :

date1="(.*?)"

为了你的regex, 它会是一个不贪婪的搜索。

更新:

他们警告我,没有必要逃避双重报价,所以

date1="(.*?)"

将做。





相关问题
Why does my chdir to a filehandle not work in Perl?

When I try a "chdir" with a filehandle as argument, "chdir" returns 0 and a pwd returns still the same directory. Should that be so? I tried this, because in the documentation to chdir I found: "...

How do I use GetOptions to get the default argument?

I ve read the doc for GetOptions but I can t seem to find what I need... (maybe I am blind) What I want to do is to parse command line like this myperlscript.pl -mode [sth] [inputfile] I can use ...

Object-Oriented Perl constructor syntax and named parameters

I m a little confused about what is going on in Perl constructors. I found these two examples perldoc perlbot. package Foo; #In Perl, the constructor is just a subroutine called new. sub new { #I ...

Where can I find object-oriented Perl tutorials? [closed]

A Google search yields a number of results - but which ones are the best? The Perl site appears to contain two - perlboot and perltoot. I m reading these now, but what else is out there? Note: I ve ...

热门标签