List: koffice-devel
Subject: patch: correct sentence count
From: Daniel Naber <daniel.naber () t-online ! de>
Date: 2001-06-28 12:54:11
Hi,
Trivial patch; I will apply it if nobody objects.
What should we do about abbreviations like "e.g."? Currently this is
counted as the end of a sentence (two, in fact). Can we assume that a
sentence only ends at a dot followed by a space? Are there any languages
where this would fail?
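To make the "dot and a space" question concrete, here is a minimal sketch of that rule in plain standard C++. It is not KWord code: the function name countSentences is made up for illustration, and it assumes that a run of '.', '?' or '!' ends a sentence only when whitespace or the end of the text follows.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper, not part of kwtextframeset.cc.
// A run of '.', '?' or '!' counts as one sentence end, but only if it is
// followed by whitespace or by the end of the text.
std::size_t countSentences(const std::string &text)
{
    auto isTerminator = [](char c) { return c == '.' || c == '?' || c == '!'; };

    std::size_t count = 0;
    std::size_t i = 0;
    while (i < text.size()) {
        if (isTerminator(text[i])) {
            // Skip the whole run so that "..." or "?!" is counted once.
            while (i < text.size() && isTerminator(text[i]))
                ++i;
            if (i == text.size() ||
                std::isspace(static_cast<unsigned char>(text[i])))
                ++count;
        } else {
            ++i;
        }
    }
    return count;
}
```

With this rule "3.5" no longer contributes a sentence, and "e.g." followed by a space is counted once instead of twice, so abbreviations would still need separate handling.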
regards
Daniel
--
Daniel Naber, Paul-Gerhardt-Str. 2, 33332 Guetersloh, Germany
Tel. 05241-59371, Mobil 0170-4819674
["sentence-count.diff" (text/x-c++)]
Index: kwtextframeset.cc
===================================================================
RCS file: /home/kde/koffice/kword/kwtextframeset.cc,v
retrieving revision 1.311
diff -u -r1.311 kwtextframeset.cc
--- kwtextframeset.cc 2001/06/20 17:11:06 1.311
+++ kwtextframeset.cc 2001/06/28 12:45:09
@@ -536,6 +536,8 @@
     }
     re.setPattern("[.?!]+"); // count "..." as only one "."
     s.replace(re, ".");
+    re.setPattern("\\d\\.\\d"); // don't count floating point numbers as sentences
+    s.replace(re, "0,0");
     re.setPattern("[A-Z]\\.+"); // don't count "U.S.A." as three sentences
     s.replace(re, "*");
     for ( uint i = 0 ; i < s.length() ; ++i )
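For anyone who wants to try the replacement order outside KWord, the hunk above can be approximated with std::regex. This is only a sketch under assumptions: the function name countSentencesLikePatch is hypothetical, std::regex stands in for the regexp class used in kwtextframeset.cc, and since the body of the for loop is not shown in the diff, it is assumed here to count the remaining '.' characters.

```cpp
#include <algorithm>
#include <cstddef>
#include <regex>
#include <string>

// Hypothetical stand-in for the patched code path, using std::regex.
std::size_t countSentencesLikePatch(std::string s)
{
    // Count "...", "?!" and similar runs as a single ".".
    s = std::regex_replace(s, std::regex("[.?!]+"), ".");
    // Don't count floating point numbers such as "3.5" as sentences.
    s = std::regex_replace(s, std::regex("\\d\\.\\d"), "0,0");
    // Don't count "U.S.A." as three sentences.
    s = std::regex_replace(s, std::regex("[A-Z]\\.+"), "*");
    // Assumption: the loop that follows in the original counts the dots left over.
    return static_cast<std::size_t>(std::count(s.begin(), s.end(), '.'));
}
```

Under these assumptions the text "Use e.g. this." still yields three, which is the "e.g." problem raised in the message: lowercase abbreviations are not touched by any of the three patterns.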
_______________________________________________
Koffice-devel mailing list
Koffice-devel@master.kde.org
http://master.kde.org/mailman/listinfo/koffice-devel