List:       ruby-talk
Subject:    Re: HTML parsing
From:       Emmanuel Touzery <emmanuel.touzery () wanadoo ! fr>
Date:       2004-02-02 12:48:38
Message-ID: 401E47B9.30002 () wanadoo ! fr

Emmanuel Touzery wrote:

> E) to convert it to REXML: I would use HTML tidy (which is already 
> needed for ~60% of the pages i'm parsing now)

(it was needed for many pages because of sloppy/invalid HTML, which tidy 
corrects)
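
A rough sketch of what that step could look like, assuming the `tidy` 
command-line tool is installed and on the PATH. The helper names 
(`tidy_to_xhtml`, `parse_xhtml`) are made up for illustration, and the 
tidy options shown are one plausible choice rather than a tested recipe:

```ruby
require 'open3'
require 'rexml/document'

# Hypothetical helper: pipe raw HTML through the external `tidy` program
# (assumed to be on the PATH) and return well-formed XHTML as a string.
def tidy_to_xhtml(html, tidy_cmd = 'tidy')
  # -asxml   : emit well-formed XML (XHTML)
  # -numeric : write numeric entities instead of named ones such as &nbsp;
  # -quiet   : suppress the informational banner
  out, _err, _status = Open3.capture3(
    tidy_cmd, '-quiet', '-asxml', '-numeric', '--force-output', 'yes',
    stdin_data: html
  )
  raise 'tidy produced no output' if out.strip.empty?
  out
end

# Hypothetical helper: build a REXML document from an XHTML string.
def parse_xhtml(xhtml)
  REXML::Document.new(xhtml)
end

# Intended use (needs tidy installed, so it is left as a comment):
#   doc = parse_xhtml(tidy_to_xhtml(File.read('page.html')))
#   puts doc.root.name
```

REXML raises REXML::ParseException on input that is not well-formed, 
which is the reason for running tidy first on the sloppy pages.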

emmanuel

