
List:       aspell-user
Subject:    [Aspell-user] compile dictionary list with wordlists containing non-alphanumeric characters
From:       "Xiang, Jiayan X" <Jiayan.X.Xiang () questdiagnostics ! com>
Date:       2008-06-20 15:21:13
Message-ID: 40C59B6990FA8040B9A9D758759BA977019D5BE6 () QDCWS0116 ! us ! qdx ! com

Hi:
 
I am trying to compile a custom dictionary with words like:
 
A's
A+D
A-Fil
A-Hist
A-Hydrocort
A-Methapred
A-bomb
A-bomb's
A-bombs
A-bombs'
A-one
A.A.
A.B.
A.B.A.
A.D.
A.F.L.-C.I.O.
A.F.L.-C.I.O.'s
 
I tried to use the command:
 
aspell --lang=en create master ./meditest < mywords.txt
 
and it errors out on every word that contains a non-alphanumeric
character such as ', +, - or a period.  What is the best way to deal
with this?  Do I have to remove all of these characters for it to work?
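
To see which entries are affected before deciding anything, a rough sketch like the one below could split the list into words made only of letters and words containing anything else. The function name split_wordlist and the "ASCII letters only" rule are my own assumptions for illustration; the characters aspell really accepts depend on the language data, so this only separates the suspect entries and does not reproduce aspell's check.

```python
import re

# Assumption: a "plain" word is one made only of ASCII letters.
# aspell's actual rules come from its language data and may differ.
PLAIN_WORD = re.compile(r"[A-Za-z]+")


def split_wordlist(words):
    """Return (plain, special): letter-only words and everything else."""
    plain, special = [], []
    for raw in words:
        word = raw.strip()
        if not word:
            continue  # skip blank lines
        if PLAIN_WORD.fullmatch(word):
            plain.append(word)
        else:
            special.append(word)
    return plain, special


if __name__ == "__main__":
    # A few entries taken from the list above, plus one letter-only word.
    sample = ["A's", "A+D", "A-bomb", "A.A.", "Abomb"]
    plain, special = split_wordlist(sample)
    print("plain:  ", plain)
    print("special:", special)
```

The "special" list is then the set of entries that would need either a change on the aspell side or some rewriting of the words themselves.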
 
I am doing this to create a dictionary specific to pathologists; the
current medical dictionary for Aspell doesn't have many words in that
particular area.
 
Thanks.
 
Jon


------------------------------------------
The contents of this message, together with any attachments, are
intended only for the use of the person(s) to which they are
addressed and may contain confidential and/or privileged
information. Further, any medical information herein is
confidential and protected by law. It is unlawful for unauthorized
persons to use, review, copy, disclose, or disseminate confidential
medical information. If you are not the intended recipient,
immediately advise the sender and delete this message and any
attachments. Any distribution, or copying of this message, or any
attachment, is prohibited.