[prev in list] [next in list] [prev in thread] [next in thread] 

List:       haskell
Subject:    Re: [Haskell] Current XML libraries status
From:       "Krasimir Angelov" <kr.angelov () gmail ! com>
Date:       2008-10-25 8:09:37
Message-ID: c97c08d20810250109x304e3f2csc1859792dafe1a69 () mail ! gmail ! com
[Download RAW message or body]

[Attachment #2 (multipart/alternative)]


Hi Marc,

Thanks for the pointer. Fortunately I don't have this problem. Probably 2GB
of memory are enough to parse 20MB file even with this space leak. HXML
still works better than the other libraries and has a nice API so I use it.
The patch is useful but it is not applied. I also did some other changes in
HXML. It seems like the library is not maintained. Should I package it and
upload to Hackage?

Regards,
  Krasimir



On Sat, Oct 25, 2008 at 2:14 AM, Marc A. Ziegert <coeus@gmx.de> wrote:

> there was a thread about xml parsing, one month ago.
> <http://thread.gmane.org/gmane.comp.lang.haskell.cafe/44708>
> well, i don't know much about xml, except what it looks like;
> but i know about that interesting parsing problem behind it.
> maybe Lev Walkin has fixed that in HXML. at least he wrote this patch...
> <http://article.gmane.org/gmane.comp.lang.haskell.cafe/45033>
>
> - marc
>
>
> -----
> sometimes i think, i should write a paper about it. but then... naah, i'm
> like haskell: non-strict.
>
>
>
>
> Am Donnerstag, 23. Oktober 2008 schrieb Krasimir Angelov:
>  > Hi,
> >
> > Does some one have made performance tests on the different XML libraries
> for
> > Haskell? I have a 20MB xml file that I want to read. I remember from my
> > earlier experiments (years ago) that all libraries were too slow and were
> > consuming too much memory. I hoped that this situation had changed but
> maybe
> > not. I looked at HaXML, libxml, HXML and HXT. HaXML eats a lot of memory
> and
> > is still very slow. libxml is unfinished binding to the C library.
> Currently
> > it only allows to create documents. HXML seems to be very promising. It
> > works fast and it doesn't eat memory. Unfortunately it is that it seems
> to
> > be rather old. It uses its own Arrow and Tree libraries instead of the
> > standard libraries. I have not jumped into HXT yet because it seems to be
> > very large library. Could someone recomend which one is the state of the
> > art?
> >
> > Best Regards,
> >   Krasimir
> >
>

[Attachment #5 (text/html)]

<div>Hi Marc,</div>
<div>&nbsp;</div>
<div>Thanks for the pointer. Fortunately I don&#39;t have this problem. Probably 2GB \
of memory are enough to parse 20MB file even with this space leak. HXML still works \
better than the other libraries and has a nice API so I use it. The patch is useful \
but it is not applied. I also did some other changes in HXML. It seems like the \
library is not maintained. Should I package it and upload to Hackage?</div>

<div>&nbsp;</div>
<div>Regards,</div>
<div>&nbsp; Krasimir</div>
<div><br><br>&nbsp;</div>
<div class="gmail_quote">On Sat, Oct 25, 2008 at 2:14 AM, Marc A. Ziegert <span \
dir="ltr">&lt;<a href="mailto:coeus@gmx.de">coeus@gmx.de</a>&gt;</span> wrote:<br> \
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; \
BORDER-LEFT: #ccc 1px solid">there was a thread about xml parsing, one month \
ago.<br>&lt;<a href="http://thread.gmane.org/gmane.comp.lang.haskell.cafe/44708" \
target="_blank">http://thread.gmane.org/gmane.comp.lang.haskell.cafe/44708</a>&gt;<br>
 well, i don&#39;t know much about xml, except what it looks like;<br>but i know \
about that interesting parsing problem behind it.<br>maybe Lev Walkin has fixed that \
in HXML. at least he wrote this patch...<br>&lt;<a \
href="http://article.gmane.org/gmane.comp.lang.haskell.cafe/45033" \
target="_blank">http://article.gmane.org/gmane.comp.lang.haskell.cafe/45033</a>&gt;<br>
 <br>- marc<br><br><br>-----<br>sometimes i think, i should write a paper about it. \
but then... naah, i&#39;m like haskell: non-strict.<br><br><br><br><br>Am Donnerstag, \
23. Oktober 2008 schrieb Krasimir Angelov:<br> <div>
<div></div>
<div class="Wj3C7c">&gt; Hi,<br>&gt;<br>&gt; Does some one have made performance \
tests on the different XML libraries for<br>&gt; Haskell? I have a 20MB xml file that \
I want to read. I remember from my<br>&gt; earlier experiments (years ago) that all \
libraries were too slow and were<br> &gt; consuming too much memory. I hoped that \
this situation had changed but maybe<br>&gt; not. I looked at HaXML, libxml, HXML and \
HXT. HaXML eats a lot of memory and<br>&gt; is still very slow. libxml is unfinished \
binding to the C library. Currently<br> &gt; it only allows to create documents. HXML \
seems to be very promising. It<br>&gt; works fast and it doesn&#39;t eat memory. \
Unfortunately it is that it seems to<br>&gt; be rather old. It uses its own Arrow and \
Tree libraries instead of the<br> &gt; standard libraries. I have not jumped into HXT \
yet because it seems to be<br>&gt; very large library. Could someone recomend which \
one is the state of the<br>&gt; art?<br>&gt;<br>&gt; Best Regards,<br>&gt; &nbsp; \
Krasimir<br> &gt;<br></div></div></blockquote></div><br>



_______________________________________________
Haskell mailing list
Haskell@haskell.org
http://www.haskell.org/mailman/listinfo/haskell


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic