This is the mail archive of the xsl-list@mulberrytech.com mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

Re: Extracting a list of unique base urls from anchors in a html document.



> The second predicate should (I think) limit the returned node-set to contain
> only anchors that have a href attribute that has a unique base-url 

No as 
[not(substring-before(@href,'#')=substring-before(preceding::a/@href,'#'))]
                                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

this takes the string value of the node set which is the string value of
the first node, so that only tests the current href attribute with the
first href attribute in the document, not with all the preceding ones.

This is really a grouping broblem see th egrouping techniques at Jeni's
site.

Basically you want to construct a key that uses
match="a[contains(@href,'#')" use="substring-before(@href,'#')"
then using key() you can get hold of all the links with the same base.

David
 

_____________________________________________________________________
This message has been checked for all known viruses by Star Internet
delivered through the MessageLabs Virus Scanning Service. For further
information visit http://www.star.net.uk/stats.asp or alternatively call
Star Internet for details on the Virus Scanning Service.

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]