[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-dev
Subject:    [jira] [Comment Edited] (SOLR-8981) Upgrade to Tika 1.13 when it is available
From:       "Tim Allison (JIRA)" <jira () apache ! org>
Date:       2016-05-31 19:28:13
Message-ID: JIRA.12958597.1460579584000.341085.1464722893033 () Atlassian ! JIRA
[Download RAW message or body]


    [ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308355#comment-15308355 \
] 

Tim Allison edited comment on SOLR-8981 at 5/31/16 7:27 PM:
------------------------------------------------------------

I'm getting a failure on that test too.  I'm getting exactly the same output with the \
standalone Tika 1.7 and 1.13 apps on the test file...argh...

For some reason, it looks like Tika is now emitting 2 bodies, if you double the body \
in both tests, this now works: {noformat}
ExtractingParams.XPATH_EXPRESSION, \
"/xhtml:html/xhtml:body/xhtml:body/xhtml:a/descendant::node()", {noformat}
{noformat}
"xpath", "/xhtml:html/xhtml:body/xhtml:body/xhtml:div//node()",
{noformat}


was (Author: tallison@mitre.org):
I'm getting a failure on that test too.  I can't figure out what's going on.  I'm \
getting exactly the same output with the standalone Tika 1.7 and 1.13 apps on the \
test file...argh...

> Upgrade to Tika 1.13 when it is available
> -----------------------------------------
> 
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> 
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a number of \
> other upgrades and improvements.   If there are any showstoppers in 1.13 from \
> Solr's side or requests before we roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic