[prev in list] [next in list] [prev in thread] [next in thread] 

List:       xsl-list
Subject:    Re: [xsl] Does =?windows-1256?Q?=27Lec=9Cur=27_occur_in_=24?= =?windows-1256?Q?text=3F_Do_you_have_a
From:       Michael Kay <mike () saxonica ! com>
Date:       2013-01-18 22:59:57
Message-ID: 50F9D3ED.90002 () saxonica ! com
[Download RAW message or body]

If you want to write queries that handle all the nuances of natural 
language text, I would strongly recommend using a text retrieval 
language rather than XPath. Many XQuery implementations have free text 
retrieval modules.

Michael Kay
Saxonica

On 18/01/2013 22:12, Costello, Roger L. wrote:
> Hi Folks,
>
> I want to determine if 'Lecœur' occurs in $text.
>
> A naïve solution is this XPath expression:
>
>        contains($text, 'Lecœur')
>
> However, that does not take into account many important factors:
>
> 1. Perhaps 'Lecœur' occurs, but in $text it is in uppercase
>
> 2. Perhaps instead of the 'œ' ligature, $text uses 'oe'
>
> 3. Perhaps in $text 'Lecœur' is split over two lines and thus is hyphenated
>
> 4. Perhaps 'Lecœur' is slightly misspelled in $text and therefore requires fuzzy matching
>
> And there are many other important factors.
>
> Do you have an XPath solution to this problem that takes into account the many important factors?
>
> /Roger
>
> --~------------------------------------------------------------------
> XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
> To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
> or e-mail: <mailto:xsl-list-unsubscribe@lists.mulberrytech.com>
> --~--
>
>


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe@lists.mulberrytech.com>
--~--

[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic