[prev in list] [next in list] [prev in thread] [next in thread] 

List:       ruby-talk
Subject:    Re: HTML Parsing?
From:       Dave Lee <davel () canuck ! com>
Date:       2004-02-05 18:39:15
Message-ID: Pine.SGI.4.21.0402051133380.38955915-100000 () the-gimp
[Download RAW message or body]


Martin Hart wrote:
> What do people use to parse this into something useful?  Is REXML an option 
> (although the html is not likely to be valid xml)?  I have looked at the 
> html-parser on RAA but do not seem to be able to individually access the 
> components of the returned page (for example I need to see what the contents 
> of a text control are - or what the caption of the <h2> tag is.

see http://ruby-htmltools.rubyforge.org/

I used this library about a year ago, and found it pretty buggy.

Dave


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic