[prev in list] [next in list] [prev in thread] [next in thread]
List: xsl-list
Subject: Re: [xsl] Does =?windows-1256?Q?=27Lec=9Cur=27_occur_in_=24?= =?windows-1256?Q?text=3F_Do_you_have_a
From: Michael Kay <mike () saxonica ! com>
Date: 2013-01-18 22:59:57
Message-ID: 50F9D3ED.90002 () saxonica ! com
[Download RAW message or body]
If you want to write queries that handle all the nuances of natural
language text, I would strongly recommend using a text retrieval
language rather than XPath. Many XQuery implementations have free text
retrieval modules.
Michael Kay
Saxonica
On 18/01/2013 22:12, Costello, Roger L. wrote:
> Hi Folks,
>
> I want to determine if 'Lecœur' occurs in $text.
>
> A naïve solution is this XPath expression:
>
> contains($text, 'Lecœur')
>
> However, that does not take into account many important factors:
>
> 1. Perhaps 'Lecœur' occurs, but in $text it is in uppercase
>
> 2. Perhaps instead of the 'œ' ligature, $text uses 'oe'
>
> 3. Perhaps in $text 'Lecœur' is split over two lines and thus is hyphenated
>
> 4. Perhaps 'Lecœur' is slightly misspelled in $text and therefore requires fuzzy matching
>
> And there are many other important factors.
>
> Do you have an XPath solution to this problem that takes into account the many important factors?
>
> /Roger
>
> --~------------------------------------------------------------------
> XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
> To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
> or e-mail: <mailto:xsl-list-unsubscribe@lists.mulberrytech.com>
> --~--
>
>
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe@lists.mulberrytech.com>
--~--
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic