This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
RE: A doubt in XML
- To: "'xsl-list at mulberrytech dot com'" <xsl-list at mulberrytech dot com>
- Subject: RE: A doubt in XML
- From: Linda van den Brink <lvdbrink at baan dot nl>
- Date: Thu, 5 Oct 2000 14:24:41 +0200
- Reply-To: xsl-list at mulberrytech dot com
For HTML you could use lots of different approaches, such as Perl, Omnimark,
DSSSL, etc. What I'd maybe do is use a tool, Tidy, instead, which can
convert the HTML to XHTML for you. If that's not enough, you can then use
XSLT to transform the XHTML to the exact XML format you want.
For Word, again you can use things like Perl and Omnimark. There are scripts
freely available in those languages (at least I know of a widely used one in
Omnimark), that give you some XML format, which you can then transform to
the XML format you want using XSLT.
Take a look at Tidy: http://www.xmlsoftware.com/convert/#tidy, and other
conversion tools listed on that page.
Linda
> -----Original Message-----
> From: Girija Vijayaraghavan [mailto:v_girija_naidu@usa.net]
> Sent: Thursday, October 05, 2000 12:41 PM
> To: XSL-List@mulberrytech.com
> Subject: A doubt in XML
>
>
> Hi
> My name is Girija. I am from India.
> COuld you please tell me which scipting languages would allow
> me to convert
> HTML and Word documents into XML.
> Thanks
> Lotsa luck
> Girija
>
> ____________________________________________________________________
> Get free email and a permanent address at
> http://www.netaddress.com/?N=1
>
>
> XSL-List info and archive:
> http://www.mulberrytech.com/xsl/xsl-list
>
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list