[prev in list] [next in list] [prev in thread] [next in thread] 

List:       kde-devel
Subject:    Internship: Release of a multilingual linguistic analysis software under a Free/Libre Open Source So
From:       kleag () free ! fr
Date:       2013-02-21 10:21:28
Message-ID: 757520326.188995622.1361442088677.JavaMail.root () zimbra61-e11 ! priv ! proxad ! net
[Download RAW message or body]

Hello,

I did not post to kde-devel since a long time and I hope that this one will be well received even if it \
is not immediately related to KDE. In fact, some of you will remember that I talked a few years ago about \
the possible release under an open source licence of the natural language analyzer I work on at work and \
that could be useful to KDE, for example in Nepomuk.

Well, here we are, finally, and thus we propose a (paid) internship to help us clean up and set up before \
putting it online.

You'll find below the internship proposition.

Regards,

Gaël


Internship: Release of a multilingual linguistic analysis software under a Free/Libre Open Source \
Software Licence (Compulsory internship with internship agreement, Master 1 or 2 level)

CEA LIST
Vision and Content Engineering Laboratory

The internship will take place in the premises of LVIC at Nano-INNOV located in Palaiseau 25 km south of \
Paris, France.


TOPIC

Context

Since 2002, the LVIC develops the multilingual linguistic analyzer LIMA. It is now a very modular tool \
able to analyse (tokenization, morphological, syntactic and semantic parsing) texts in languages ​​as \
diverse as English, French, Arabic, Chinese , Spanish, German or Italian. LIMA currently represents more \
than 100,000 lines of code (excluding linguistic resources). LIMA is already used in several industrial \
products, but the CEA LIST has decided to distribute it under Free/Libre Open Source Software License \
(FLOSS) to facilitate its use, its dissemination and to get faster returns from a broader community of \
users. LIMA is coded in standard C++. It uses extensively boost and Qt libraries and is cross-platform \
(GNU/Linux and MS Windows so far). Its architecture makes it easily extensible and integratable into \
applications.

Objectives

This release, which is within ASFALDA project (funded by the French National Research Agency) requires \
                further improvements to the software before its distribution on several aspects:
- API documentation;
- User documentation;
- Unit tests;
- Functional tests.

LIMA depends on linguistic resources to operate (dictionaries, parsing rules, ...). Even if the \
laboratory is the owner of some of them, others are from commercial resources and may not be distributed \
freely. Another goal of the intern will thus to produce alternative resources from freely available \
linguistic resources.

The intern will work on these topics in order to make available LIMA on a software forge at the end of \
the course. The selected candidate will have a good level in C++, an understanding of issues related to \
software release (testing, documentation ...) and ideally have participated in a free software project.


Course Duration: 4 to 6 months

Training required: Master 1 or 2.

Contact:
Gaël de Chalendar
Mail: Gael.de-Chalendar@cea.fr
Phone: +33 6 76 36 70 31
Skype: kleagg
XMPP: kleag@kdetalk.net

> > Visit http://mail.kde.org/mailman/listinfo/kde-devel#unsub to unsubscribe <<


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic