[prev in list] [next in list] [prev in thread] [next in thread]
List: perl-beginners
Subject: Re: Match HTML <div> ...... </dv> string over multiple
From: Lars Noodén <lars.nooden () gmail ! com>
Date: 2014-11-19 6:57:06
Message-ID: 546C3F42.7040308 () gmail ! com
[Download RAW message or body]
On Tue, Nov 18, 2014 at 12:22 PM, mimicafe@gmail.com
<mimicafe@gmail.com> wrote:
> I am trying to extract a table (<table class="xxxx"><tr><td>......
> until </table>) and its content from an HTML file.
Because there can be nested tables and other elements that can throw off
simple regex parsing, I would try either HTML::TokeParser,
HTML::TokeParser::Simple, or XML::TokeParser. The latter would be only
if the input is valid, well-formed XHTML.
TokeParser can be used to find a table within a specific div.
Regards,
/Lars
--
To unsubscribe, e-mail: beginners-unsubscribe@perl.org
For additional commands, e-mail: beginners-help@perl.org
http://learn.perl.org/
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic