[prev in list] [next in list] [prev in thread] [next in thread]
List: hadoop-user
Subject: Re: XML files in HDFS
From: Hyunsik Choi <c0d3h4ck () gmail ! com>
Date: 2009-07-30 12:16:27
Message-ID: 4b2c7b610907300516k5db15dc0nf08461e92f357c9a () mail ! gmail ! com
[Download RAW message or body]
Hi,
Actually, I don't know there exists any well-made XML InputFormat or
Record reader.
To the best of my knowledge, StreamXmlRecordReader (
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/streaming/StreamXmlRecordReader.html
) of Hadoop streaming is only solution.
Good luck!
--
Hyunsik Choi
Database & Information Systems Group, Korea University
http://diveintodata.org
On Thu, Jul 30, 2009 at 5:30 PM, Wasim Bari<wasimbari@msn.com> wrote:
>
>
>
> Hi All,
>
> I am looking to store some real big xml files in HDFS and then process them using \
> MapReduce.
>
>
> Do we have some utility which uploads the xml files to hdfs making sure split up \
> of file in block doen't brake an elemet ( mean half element on one block and half \
> on someother ) ?
>
>
> Any suggestions to work thos out will be appreciated greatly.
>
>
>
> Thanks
>
>
>
> Bari
>
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic