[prev in list] [next in list] [prev in thread] [next in thread] 

List:       xmlrpc-user
Subject:    [jira] [Comment Edited] (AXIOM-478) Solution for parsing large XML
From:       "Andreas Veithen (JIRA)" <jira () apache ! org>
Date:       2016-01-20 20:07:39
Message-ID: JIRA.12926917.1451899223000.162644.1453320459934 () Atlassian ! JIRA
[Download RAW message or body]


    [ https://issues.apache.org/jira/browse/AXIOM-478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107557#comment-15107557 \
] 

Andreas Veithen edited comment on AXIOM-478 at 1/20/16 8:07 PM:
----------------------------------------------------------------

A snapshot version (1.2.18-SNAPSHOT) containing the fix is available. You can pull it \
from the Maven snapshot repository at \
https://repository.apache.org/content/repositories/snapshots/ or download the binary \
distribution at https://builds.apache.org/job/axiom-1.2.x/lastStableBuild/org.apache.ws.commons.axiom$distribution/. \
Note that for this to work it is mandatory to close the Reader returned by \
getTextAsStream before accessing other nodes in the tree.

Please let me know if this solves the issue.


was (Author: veithen):
A snapshot version (1.2.18-SNAPSHOT) containing the fix is available. You can pull it \
from the Maven snapshot repository at \
https://repository.apache.org/content/repositories/snapshots/ or download the binary \
distribution at https://builds.apache.org/job/axiom-trunk/lastStableBuild/org.apache.ws.commons.axiom$distribution/. \
Note that for this to work it is mandatory to close the Reader returned by \
getTextAsStream before accessing other nodes in the tree.

Please let me know if this solves the issue.

> Solution for parsing large XML
> ------------------------------
> 
> Key: AXIOM-478
> URL: https://issues.apache.org/jira/browse/AXIOM-478
> Project: Axiom
> Issue Type: Question
> Reporter: LU Jie
> 
> This is LU Jie from IBM. We use axiom to parse Atom in our project. 
> One of our CMIS API will attach file content to the XML. If the file size is large, \
> we will get a large atom. If we use Entry.getExtension(QName) to parse the content, \
> it will allocate a large memory(around 5-6 times of the file size). We need you \
> help to clarify if we can use DOM-like API of axiom to get the text of a certain \
> element as stream. That is without allocating a large object in memory. Or is there \
> an alternative solution for this use case? We DO know that we can use pull-parser \
> to parse the XML as stream. But we need help to investigate if axiom has already \
> provided an API or solution to avoid writing parser by ourselves. Here's the sample \
> XML. We need to parse the text of cmisra:base64 element: {noformat}
> <atom:entry
> xmlns:atom="http://www.w3.org/2005/Atom"
> xmlns:cmisra="http://docs.oasis-open.org/ns/cmis/restatom/200908/"
> xmlns:chemistry="http://chemistry.apache.org/"
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/">
> <atom:id
> xmlns:atom="http://www.w3.org/2005/Atom">urn:uuid:00000000-0000-0000-0000-00000000000
>  </atom:id>
> <atom:title
> xmlns:atom="http://www.w3.org/2005/Atom" type="text">doucment1446016556658.txt
> </atom:title>
> <atom:updated
> xmlns:atom="http://www.w3.org/2005/Atom">2015-10-28T07:15:57.594Z
> </atom:updated>
> <cmisra:content
> xmlns:cmisra="http://docs.oasis-open.org/ns/cmis/restatom/200908/">
> <cmisra:mediatype
> xmlns:cmisra="http://docs.oasis-open.org/ns/cmis/restatom/200908/">text/plain
> </cmisra:mediatype>
> <chemistry:filename
> xmlns:chemistry="http://chemistry.apache.org/">doucment1446016556658.txt
> </chemistry:filename>
> <cmisra:base64
> xmlns:cmisra="http://docs.oasis-open.org/ns/cmis/restatom/200908/">Base64 encoded \
> content of large file </cmisra:base64>
> </cmisra:content>
> <cmisra:object
> xmlns:cmisra="http://docs.oasis-open.org/ns/cmis/restatom/200908/">
> <cmis:properties
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/">
> <cmis:propertyId
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/" \
> propertyDefinitionId="cmis:objectTypeId"> <cmis:value
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/">snx:file
> </cmis:value>
> </cmis:propertyId>
> <cmis:propertyString
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/" \
> propertyDefinitionId="cmis:name"> <cmis:value
> xmlns:cmis="http://docs.oasis-open.org/ns/cmis/core/200908/">doucment1446016556658.txt
>  </cmis:value>
> </cmis:propertyString>
> </cmis:properties>
> </cmisra:object>
> </atom:entry>
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@ws.apache.org
For additional commands, e-mail: dev-help@ws.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic