link[@rel='alternate' and @type='text/html' or not(@rel)]/@href | link/text()




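The pipe (|) in XPath is the union operator. So this expression returns the href attribute of the link elements that match the predicate, together with the text content of every link.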


  <link href="http://www.google.com">Google</link>
  <link rel="zzzz" href="http://www.stackoverflow.com">Stack Overflow</link>

Yields:

http://www.google.com
Google
Stack Overflow

The | symbol is a union: it grabs every node matched by either its left-hand side or its right-hand side.
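To make the two halves of the union concrete, here is a minimal Python sketch that emulates the expression by hand on the sample XML above. It uses only the standard library's xml.etree.ElementTree, whose limited XPath subset does not support | or text(), so each operand is evaluated in plain Python instead. The <root> wrapper element and the function name union_of_href_and_text are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Sample document from the answer; the <root> wrapper is an assumption,
# added only so that the snippet is well-formed XML.
XML = """<root>
  <link href="http://www.google.com">Google</link>
  <link rel="zzzz" href="http://www.stackoverflow.com">Stack Overflow</link>
</root>"""


def union_of_href_and_text(root):
    """Emulate, by hand, the expression
    link[@rel='alternate' and @type='text/html' or not(@rel)]/@href | link/text()
    evaluated with `root` as the context node.
    """
    results = []
    for link in root.findall("link"):
        rel = link.get("rel")
        # Left operand: 'and' binds tighter than 'or' in XPath, so the
        # predicate reads (rel is alternate AND type is text/html) OR (no rel).
        matches = (rel == "alternate" and link.get("type") == "text/html") or rel is None
        if matches and link.get("href") is not None:
            results.append(link.get("href"))
        # Right operand: the text of every link, whatever its attributes.
        # (link.text only covers text before the first child element,
        # which is enough for these simple elements.)
        if link.text:
            results.append(link.text)
    return results


print(union_of_href_and_text(ET.fromstring(XML)))
```

The second link contributes only its text, because it has a rel attribute whose value is not alternate. A full XPath 1.0 engine would evaluate the original expression directly; the loop here is just a way to see which operand each result comes from.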


  • Grab the href attribute of all link tags that have an attribute "rel=alternate" and an attribute "type=text/html", or grab the href of all link tags that do not have the rel attribute


  • Grab the inner text of all link tags on the page.


Actually, I don't understand the symbol |

This is the XPath union operator: http://www.w3.org/TR/1999/REC-xpath-19991116/#NT-UnionExpr


• The union and | operators are equivalent. They take two node sequences as operands and return a sequence containing all the nodes that occur in either of the operands.

Of course, the "union" operator was only added in XPath 2.0; in XPath 1.0 we have only its earlier synonym, denoted by the | character.

So, in your specific case:

link[@rel='alternate' and @type='text/html' or not(@rel)]/@href | link/text()

The XPath expression above selects the union of two sets:

  1. All nodes selected by: link[@rel='alternate' and @type='text/html' or not(@rel)]/@href

  2. All nodes selected by: link/text()


From Wikipedia (http://en.wikipedia.org/wiki/Union_%28set_theory%29):

The union of two sets A and B is the collection of points which are in A or in B (or in both):

A simple example:

A = {1,2,3,4,5,6}

B = {1,5,6,7,8}

A U B = {1,2,3,4,5,6,7,8}
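The same set example can be checked in a couple of lines of Python, where the built-in set type happens to use | for union as well. This is only an illustration of set union, not of XPath itself; the variable names mirror the example above.

```python
# The two sets from the example above.
A = {1, 2, 3, 4, 5, 6}
B = {1, 5, 6, 7, 8}

# For Python sets, | is the union operator: every element that is in A,
# in B, or in both, with duplicates collapsed.
union = A | B

print(sorted(union))
```

Elements that appear in both sets (1, 5 and 6 here) show up only once in the result, which is the same behaviour XPath gives when a node is selected by both operands of |.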

