[prev in list] [next in list] [prev in thread] [next in thread]
List: pear-doc
Subject: [PEAR-DOC] =?utf-8?q?svn:_/pear/peardoc/trunk/en/package/text/_text-languagedetect/detect.xml_text-l
From: Christian_Weiske <cweiske () php ! net>
Date: 2012-01-16 9:28:23
Message-ID: svn-cweiske-1326706103-322358-1302531489 () svn ! php ! net
[Download RAW message or body]
cweiske Mon, 16 Jan 2012 09:28:23 +0000
Revision: http://svn.php.net/viewvc?view=revision&revision=322358
Log:
talk about setNameMode, which is new in version 0.3.0
Changed paths:
U pear/peardoc/trunk/en/package/text/text-languagedetect/detect.xml
U pear/peardoc/trunk/en/package/text/text-languagedetect/example.xml
A pear/peardoc/trunk/en/package/text/text-languagedetect/examples/
A pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.php
A pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.txt
A pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.php
A pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.txt
A pear/peardoc/trunk/en/package/text/text-languagedetect/language-names.xml
U pear/peardoc/trunk/en/package/text/text-languagedetect.xml
["svn-diffs-322358.txt" (text/x-diff)]
Modified: pear/peardoc/trunk/en/package/text/text-languagedetect/detect.xml
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/detect.xml 2012-01-16 \
09:09:18 UTC (rev 322357)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/detect.xml 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -1,10 +1,18 @@
<?xml version="1.0" encoding="utf-8"?>
-<section xmlns="http://docbook.org/ns/docbook" version="lillet" \
xml:id="package.text.text-languagedetect.detect"><info><title>Detecting the \
language</title></info>
-
+<section version="lillet"
+ xmlns="http://docbook.org/ns/docbook"
+ xmlns:phd="http://www.php.net/ns/phd"
+ xml:id="package.text.text-languagedetect.detect"
+>
+ <info>
+ <title>Detecting the language</title>
+ </info>
<para>
At first, you might want to get a list of supported languages.
- It can be retrieved by calling <function>getLanguages</function>
+ It can be retrieved by calling
+ <phd:pearapi phd:package="Text_LanguageDetect"
+ phd:linkend="Text_LanguageDetect::getLanguages">getLanguages</phd:pearapi>
on a <classname>Text_LanguageDetect</classname> object. It returns
an array of strings that represent the languages, e.g.
<literal>array('albanian', 'arabic', 'azeri')</literal>.
@@ -12,7 +20,9 @@
<para>
To actually detect the language of a piece of text, use the
- <function>detect</function> method on the
+ <phd:pearapi phd:package="Text_LanguageDetect"
+ phd:linkend="Text_LanguageDetect::detect">detect</phd:pearapi>
+ method on the
<classname>Text_LanguageDetect</classname> object. It takes
the text as first parameter, and an optional
<literal>$limit</literal> as second parameter, determining
@@ -23,7 +33,9 @@
</para>
<para>
- To get the most likely language only, use <function>detectSimple</function>
+ To get the most likely language only, use
+ <phd:pearapi phd:package="Text_LanguageDetect"
+ phd:linkend="Text_LanguageDetect::detectSimple">detectSimple</phd:pearapi>
which directly returns the string of the language, or
<literal>null</literal> if none was detected.
</para>
Modified: pear/peardoc/trunk/en/package/text/text-languagedetect/example.xml
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/example.xml 2012-01-16 \
09:09:18 UTC (rev 322357)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/example.xml 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -1,55 +1,32 @@
<?xml version="1.0" encoding="utf-8"?>
-<section xmlns="http://docbook.org/ns/docbook" version="lillet" \
xml:id="package.text.text-languagedetect.example"><info><title>Example</title></info>
-
+<section xmlns="http://docbook.org/ns/docbook" version="lillet"
+ xml:id="package.text.text-languagedetect.example"
+>
+ <info>
+ <title>Example</title>
+ </info>
- <programlisting role="php"><![CDATA[
-<?php
-require_once 'Text/LanguageDetect.php';
-$l = new Text_LanguageDetect();
+ <programlisting role="php">
+ <xi:include parse="text"
+ xmlns:xi="http://www.w3.org/2001/XInclude"
+ href="&package.text.text-languagedetect.examples.detect.php;"
+ >
+ <xi:fallback>FIXME:MISSING XINCLUDE CONTENT</xi:fallback>
+ </xi:include>
+ </programlisting>
-echo "Supported languages:\n";
-$langs = $l->getLanguages();
-if (PEAR::isError($langs)) {
- die($langs->getMessage());
-}
-sort($langs);
-echo implode(', ', $langs) . "\n\n";
-
-$text = <<<EOD
-Hallo! Das ist ein Text in deutscher Sprache.
-Mal sehen, ob die Klasse erkennt, welche Sprache das hier ist.
-EOD;
-
-$result = $l->detect($text, 4);
-if (PEAR::isError($result)) {
- echo $result->getMessage(), "\n";
-} else {
- print_r($result);
-}
-?>
-]]></programlisting>
-
<para>
The above example would give the following output:
</para>
- <screen><![CDATA[
-Supported languages:
-albanian, arabic, azeri, bengali, bulgarian, cebuano, croatian,
-czech, danish, dutch, english, estonian, farsi, finnish, french,
-german, hausa, hawaiian, hindi, hungarian, icelandic, indonesian,
-italian, kazakh, kyrgyz, latin, latvian, lithuanian, macedonian,
-mongolian, nepali, norwegian, pashto, pidgin, polish, portuguese,
-romanian, russian, serbian, slovak, slovene, somali, spanish,
-swahili, swedish, tagalog, turkish, ukrainian, urdu, uzbek,
-vietnamese, welsh
-
-Array
-(
- [german] => 0.407037037037
- [dutch] => 0.288065843621
- [english] => 0.283333333333
- [danish] => 0.234526748971
-)
-]]></screen>
+ <example>
+ <title>Output</title>
+ <screen><xi:include parse="text"
+ xmlns:xi="http://www.w3.org/2001/XInclude"
+ href="&package.text.text-languagedetect.examples.detect.txt;"
+ >
+ <xi:fallback>FIXME:MISSING XINCLUDE CONTENT</xi:fallback>
+ </xi:include>
+ </screen>
+ </example>
</section>
Added: pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.php
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.php \
(rev 0)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.php 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -0,0 +1,28 @@
+<?php
+require_once 'Text/LanguageDetect.php';
+$l = new Text_LanguageDetect();
+
+echo "Supported languages:\n";
+try {
+ $langs = $l->getLanguages();
+ sort($langs);
+ echo implode(', ', $langs) . "\n\n";
+} catch (Text_LanguageDetect_Exception $e) {
+ die($e->getMessage());
+}
+
+$text = <<<EOD
+Hallo! Das ist ein Text in deutscher Sprache.
+Mal sehen, ob die Klasse erkennt, welche Sprache das hier ist.
+EOD;
+
+try {
+ //return 2-letter language codes only
+ $l->setNameMode(2);
+
+ $result = $l->detect($text, 4);
+ print_r($result);
+} catch (Text_LanguageDetect_Exception $e) {
+ die($e->getMessage());
+}
+?>
Added: pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.txt
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.txt \
(rev 0)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/examples/detect.txt 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -0,0 +1,16 @@
+Supported languages:
+albanian, arabic, azeri, bengali, bulgarian, cebuano, croatian, czech,
+danish, dutch, english, estonian, farsi, finnish, french, german, hausa,
+hawaiian, hindi, hungarian, icelandic, indonesian, italian, kazakh, kyrgyz,
+latin, latvian, lithuanian, macedonian, mongolian, nepali, norwegian, pashto,
+pidgin, polish, portuguese, romanian, russian, serbian, slovak, slovene, somali,
+spanish, swahili, swedish, tagalog, turkish, ukrainian, urdu, uzbek, vietnamese,
+welsh
+
+Array
+(
+ [de] => 0.40703703703704
+ [nl] => 0.2880658436214
+ [en] => 0.28333333333333
+ [da] => 0.23452674897119
+)
Added: pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.php
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.php \
(rev 0)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.php 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -0,0 +1,18 @@
+<?php
+$text = 'Das ist ein kleiner deutscher Text';
+
+require_once 'Text/LanguageDetect.php';
+$ld = new Text_LanguageDetect();
+
+//default mode: full language name: "german"
+echo $ld->detectSimple($text) . "\n";
+
+//two-letter mode: "de"
+$ld->setNameMode(2);
+echo $ld->detectSimple($text) . "\n";
+
+//three-letter mode: "deu"
+$ld->setNameMode(3);
+echo $ld->detectSimple($text) . "\n";
+
+?>
\ No newline at end of file
Added: pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.txt
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.txt \
(rev 0)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/examples/setNameMode.txt 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -0,0 +1,3 @@
+german
+de
+deu
Added: pear/peardoc/trunk/en/package/text/text-languagedetect/language-names.xml
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect/language-names.xml \
(rev 0)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect/language-names.xml 2012-01-16 \
09:28:23 UTC (rev 322358) @@ -0,0 +1,51 @@
+<?xml version="1.0" encoding="utf-8"?>
+<section xmlns="http://docbook.org/ns/docbook" version="lillet"
+ xmlns:phd="http://www.php.net/ns/phd"
+ xml:id="package.text.text-languagedetect.language-names"
+>
+ <info>
+ <title>Language names</title>
+ </info>
+
+ <para>
+ <classname>Text_LanguageDetect</classname> works with language names.
+ It accepts language names for a number of methods, e.g.
+ <phd:pearapi phd:package="Text_LanguageDetect"
+ phd:linkend="Text_LanguageDetect::omitLanguages">omitLanguages()</phd:pearapi>
+ and returns language names.
+ By default, a "language name" is a lowercase english name of a language.
+ </para>
+ <para>
+ Often, applications work with ISO 639-1 or ISO 639-2 language codes
+ - two-letter or three letter codes.
+ <classname>Text_LanguageDetect</classname> supports them since version 0.3.0,
+ and you may enable them with
+ <phd:pearapi phd:package="Text_LanguageDetect"
+ phd:linkend="Text_LanguageDetect::setNameMode">setNameMode()</phd:pearapi>:
+ </para>
+
+ <programlisting role="php">
+ <xi:include parse="text"
+ xmlns:xi="http://www.w3.org/2001/XInclude"
+ href="&package.text.text-languagedetect.examples.setNameMode.php;"
+ >
+ <xi:fallback>FIXME:MISSING XINCLUDE CONTENT</xi:fallback>
+ </xi:include>
+ </programlisting>
+
+ <para>
+ The above example gives the following output:
+ </para>
+
+ <example>
+ <title>Output</title>
+ <screen><xi:include parse="text"
+ xmlns:xi="http://www.w3.org/2001/XInclude"
+ href="&package.text.text-languagedetect.examples.setNameMode.txt;"
+ >
+ <xi:fallback>FIXME:MISSING XINCLUDE CONTENT</xi:fallback>
+ </xi:include>
+ </screen>
+ </example>
+
+</section>
Modified: pear/peardoc/trunk/en/package/text/text-languagedetect.xml
===================================================================
--- pear/peardoc/trunk/en/package/text/text-languagedetect.xml 2012-01-16 09:09:18 \
UTC (rev 322357)
+++ pear/peardoc/trunk/en/package/text/text-languagedetect.xml 2012-01-16 09:28:23 \
UTC (rev 322358) @@ -21,6 +21,7 @@
<chapter>
&package.text.text-languagedetect.detect;
+ &package.text.text-languagedetect.language-names;
&package.text.text-languagedetect.example;
</chapter>
</book>
--
PEAR Documentation List Mailing List (http://pear.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic