This is the mail archive of the
mailing list for the cygwin project.
Re: cygwin uses for public document retrieval
- From: Carlo Florendo <subscribermail at gmail dot com>
- To: The Cygwin-Talk Maiming List <cygwin-talk at cygwin dot com>
- Date: Tue, 10 Oct 2006 09:09:46 +0800
- Subject: Re: cygwin uses for public document retrieval
- References: <BAY108-F3252B89E43442A8362B6BEBE110@phx.gbl>
- Reply-to: The Cygwin-Talk Maiming List <cygwin-talk at cygwin dot com>
Mike Marchywka wrote:
( this was originally rejected from main list, thought to be
marginally relevant here)
( I searched the archives, this hasn't come up before and the question is
at the bottom- sorry for the long intro. I posted this on cygwin because
I run my scripts on cygwin and cygwin illustrates the relationship between
graphically oriented things like windoze and information oriented things
like linux. )
So, my question is, are there other people who have used cygwin for
these purposes and what sites have you accessed or attempted to access
in some script based way? Has anyone approached govt sites at
any level requesting computer friendly interaction mechanisms?
What responses have you gotten?
I have, but each site has a different structure, so you have to write a
customized Perl script for each site to scrape its web pages. If the
documents are in XML, you just need an XML parser.
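
To make the difference concrete, here is a rough sketch of both cases. It
is in Python rather than Perl, purely for illustration, and everything in
it is made up: the sample XML and HTML, the element names, and the rule
"keep links ending in .pdf" are assumptions standing in for whatever a
real site serves. The XML half needs only a generic parser; the HTML half
needs a rule tailored to one particular page layout.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Hypothetical XML listing; a real site would define its own element names.
XML_DOC = """<documents>
  <document id="1"><title>Budget report</title><url>http://example.org/a.pdf</url></document>
  <document id="2"><title>Meeting minutes</title><url>http://example.org/b.pdf</url></document>
</documents>"""

# Hypothetical HTML page; the layout is invented for this sketch.
HTML_DOC = """<html><body>
<a href="/about.html">About</a>
<a href="/docs/a.pdf">Budget report</a>
<a href="/docs/b.pdf">Meeting minutes</a>
</body></html>"""


def documents_from_xml(text):
    """Return (id, title, url) tuples using a generic XML parser."""
    root = ET.fromstring(text)
    return [
        (doc.get("id"), doc.findtext("title"), doc.findtext("url"))
        for doc in root.iter("document")
    ]


class PdfLinkParser(HTMLParser):
    """Site-specific scraper: collects href values that end in .pdf."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href and href.endswith(".pdf"):
                self.links.append(href)


def pdf_links_from_html(text):
    """Run the site-specific parser over one page and return its links."""
    parser = PdfLinkParser()
    parser.feed(text)
    return parser.links


if __name__ == "__main__":
    print(documents_from_xml(XML_DOC))
    print(pdf_links_from_html(HTML_DOC))
```

The XML function would keep working if the site added more documents,
while the HTML class would likely need rewriting for every new site or
redesign, which is the extra work I mean.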
I don't know what got you here, but don't be surprised if you get tons
of hippos responding to your mail. This list is not for technical
questions. But if you want to have fun, you may want to lurk.
Thanks a lot!
Astra Philippines Inc. (www.astra.ph)
Member of the Astra Group (www.astra.co.jp)