[prev in list] [next in list] [prev in thread] [next in thread] 

List:       perl-beginners
Subject:    Re: Match HTML <div> ...... </dv> string over multiple
From:       Lars Noodén <lars.nooden () gmail ! com>
Date:       2014-11-19 6:57:06
Message-ID: 546C3F42.7040308 () gmail ! com
[Download RAW message or body]

On Tue, Nov 18, 2014 at 12:22 PM, mimicafe@gmail.com
<mimicafe@gmail.com> wrote:
> I am trying to extract a table (<table class="xxxx"><tr><td>...... 
> until </table>) and its content from an HTML file.

Because there can be nested tables and other elements that can throw off
simple regex parsing, I would try either HTML::TokeParser,
HTML::TokeParser::Simple, or XML::TokeParser.  The latter would be only
if the input is valid, well-formed XHTML.

TokeParser can be used to find a table within a specific div.

Regards,
/Lars


-- 
To unsubscribe, e-mail: beginners-unsubscribe@perl.org
For additional commands, e-mail: beginners-help@perl.org
http://learn.perl.org/


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic