Hey Chris, "Although many of the steps in a Ya...

2007-11-29T21:12:00.000+00:00

Hey Chris,

"Although many of the steps in a Yahoo pipeline can be handled within a single XSLT script, some of the processing I want to demonstrate involves processing HTML pages which are not XHTML, so I needed a tidy service too, and to be able to pipeline them together."

So here's a fun one,

http://personplacething.info/service/proxy/return-xml-from-html/?uri=http://www.xml.com//html:html/html:body//html:p[contains(.,'M.%20David%20Peterson')]

Live dynamic searching of the (X)HTML web for pipelining into whatever you might want. This uses an XSLT 2.0 extension function written in C# that accesses an SgmlReader with the URI specified in the URI query string param and then returns the XPath specified at the end of the URI using // as the delimiter between the URI and the XPath expression (the second / represents the root of the document)

Code is @ http://nuxleus.com/dev/browser/trunk/nuxleus/Web/Development/transform/controller/proxy/base.xslt which is driven by http://nuxleus.com/dev/browser/trunk/nuxleus/Web/Development/service/proxy/return-xml-from-html/service.op

Too bad you're not using .NET! :D ;-) Of course this same thing could be replicated using a servlet and John Cowan's TagSoup HTML > XHTML processor.

Comments on The Wallace Line: Pipelines

Hey Chris, "Although many of the steps in a Ya...