[prev in list] [next in list] [prev in thread] [next in thread] 

List:       kde-devel
Subject:    Internship: Release of a multilingual linguistic analysis software under a Free/Libre Open Source So
From:       kleag () free ! fr
Date:       2013-02-21 10:21:28
Message-ID: 757520326.188995622.1361442088677.JavaMail.root () zimbra61-e11 ! priv ! proxad ! net
[Download RAW message or body]

Hello,

I did not post to kde-devel since a long time and I hope that this one will be well received \
even if it is not immediately related to KDE. In fact, some of you will remember that I talked \
a few years ago about the possible release under an open source licence of the natural language \
analyzer I work on at work and that could be useful to KDE, for example in Nepomuk.

Well, here we are, finally, and thus we propose a (paid) internship to help us clean up and set \
up before putting it online.

You'll find below the internship proposition.

Regards,

Gaël


Internship: Release of a multilingual linguistic analysis software under a Free/Libre Open \
Source Software Licence (Compulsory internship with internship agreement, Master 1 or 2 level)

CEA LIST
Vision and Content Engineering Laboratory

The internship will take place in the premises of LVIC at Nano-INNOV located in Palaiseau 25 km \
south of Paris, France.


TOPIC

Context

Since 2002, the LVIC develops the multilingual linguistic analyzer LIMA. It is now a very \
modular tool able to analyse (tokenization, morphological, syntactic and semantic parsing) \
texts in languages ​​as diverse as English, French, Arabic, Chinese , Spanish, German or \
Italian. LIMA currently represents more than 100,000 lines of code (excluding linguistic \
resources). LIMA is already used in several industrial products, but the CEA LIST has decided \
to distribute it under Free/Libre Open Source Software License (FLOSS) to facilitate its use, \
its dissemination and to get faster returns from a broader community of users. LIMA is coded in \
standard C++. It uses extensively boost and Qt libraries and is cross-platform (GNU/Linux and \
MS Windows so far). Its architecture makes it easily extensible and integratable into \
applications.

Objectives

This release, which is within ASFALDA project (funded by the French National Research Agency) \
                requires further improvements to the software before its distribution on \
                several aspects:
- API documentation;
- User documentation;
- Unit tests;
- Functional tests.

LIMA depends on linguistic resources to operate (dictionaries, parsing rules, ...). Even if the \
laboratory is the owner of some of them, others are from commercial resources and may not be \
distributed freely. Another goal of the intern will thus to produce alternative resources from \
freely available linguistic resources.

The intern will work on these topics in order to make available LIMA on a software forge at the \
end of the course. The selected candidate will have a good level in C++, an understanding of \
issues related to software release (testing, documentation ...) and ideally have participated \
in a free software project.


Course Duration: 4 to 6 months

Training required: Master 1 or 2.

Contact:
Gaël de Chalendar
Mail: Gael.de-Chalendar@cea.fr
Phone: +33 6 76 36 70 31
Skype: kleagg
XMPP: kleag@kdetalk.net

> > Visit http://mail.kde.org/mailman/listinfo/kde-devel#unsub to unsubscribe <<


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic