doctor in informatics
Transcription
doctor in informatics
Najeh HAJLAOUI Postdoctoral Research Fellow at university of Wolverhampton United Kingdom Mobile: + 33 (0)6 62 20 66 49 Mel : [email protected] Web site: http://www-clips.imag.fr/geta/User/najeh.hajlaoui/ DOCTOR IN INFORMATICS Experience in teaching computer science Experience in Natural Language Processing and Machine Translation Sense of teamwork Quadrilingual (Arabic, French, English, Italian) EXPERIENCES & EDUCATION 01/2011-today Postdoctoral research fellow at university of Wolverhampton – United Kingdom « Estimating the adequacy of Arabic -English statistical machine translation, supported by DARPA, under GALE scheme, as part of the IBM team New York, USA ». June 2011 Visiting researcher at Qatar University - Doha “Collaboration about Arabic Processing and preparation of project proposal to be submitted”. 12/2009-11/2010 Postdoc at France Telecom R&D Lannion France “Acquisition of adequate terminology by multilevel extension of applicative corpus”. July 2010 Visiting researcher at NII “ National Institute of Informatics” at Tokyo in Japan “systematic experimentation of the usefulness of statistical machine translations systems”. 2007-2009 Associate Lecturer at Jean Monnet University, ISEAG « Institut Supérieur d'Economie, d'Administration et de Gestion » at Saint Etienne. Researcher at LHC “Laboratoire Hubert Curien” at Saint Etienne “supervised classification of documents based on inter-documents links”. 2003-2009 Researcher at LIG lab (Laboratoire d’Informatique de Grenoble), GETALP group (Groupe d’Etude en Traduction/Traitement Automatique des Langues et de la Parole) at Grenoble. Short Postdoc 3 months « End to End evaluation of linguistic porting of Management Content systems » Ph.D in computer science “Multilingualization of e-commerce systems handling spontaneous utterances in natural language” at UJF (Université Joseph Fourier, Grenoble 1) under supervision of Professor Christian BOITET. Ph.D selected and published by European University Editions. 1/7 2001-2002 Master in "information systems" at Joseph Fourier university (Grenoble 1), with honors. Joint European Diploma MATIS (Management And Technology of Information Systems) at Geneva university and Lausanne university in Switzerland, UJF (Université Joseph Fourier) and UdS (Université de Savoie) in France, with honors. Master student at CSI (Conception des Circuits Intégrés / Integrated Circuits Design) laboratory at Grenoble. 10 months research project under the supervision of Professor Gabrielle SAUCIER: « Exchange of virtual electronic components via Intranet / Internet ». 1996-2001 Undergraduate education, BSc in Computer University, Monastir – Tunisia, with honors. Science, Science PFE « Projet de Fin d’Etudes » internship: 5 months at SITEX company (Société Industrielle de TEXtiles), Sousse, Tunisia, on production management assisted by computer. Summer Internship: 2 months at ATS “Advanced Technical of informatics Systems” company at Sidi Bouzid - Tunisia. 1989-1996 Higher education, Baccalaureate of mathematics at institute of Lessouda Sidi-Bouzid Tunisia. RESEARCH ACTIVITY Centers of interest Domains: NLP “Natural Language Processing”, MT “Machine Translation”, Multilinguisation, Localisation… • • • • • • • • • Arabic Natural Language Processing. Multilinguisation of content management systems treating spontaneous utterances in natural language. Generation of real corpora (natural corpora) for tests and SMT building. Automatic Processing of sub-languages. Machine Translation in particular Statistical MT. Management of multilingual corpora . Acquisition of adequate corpus. Information Extraction Classification of documents. Keywords: spontaneous utterances, natural language, e-commerce, linguistic porting, sub-language, content extraction, statistical translation, acquisition of corpus, analyse of corpus, acquisition of terminology Supervision 9 students: • 2009-2010 : Prem-Kumar CASIMIR, Master 2 computer science « Evaluation acquisition of terminology tools in a multimedia application », Université Pierre et Marie Curie Jussieu, Paris VI. 2/7 • • • • • • • Publications 2008-2009 : Xiang YIN, TER « Evaluation task of content management system », UJF « Université Joseph Fourier ». 2007-2008: Sana HAJLAOUI, Imen YOUSFI, PFE “Information retrieval in a numeric, multilingual and multimedia library”. ISIMS « Institut Supérieur d’Informatique et de Multimédia de Sfax». 2007-2008: Houssem ZAIR, TER “internal and external evaluation of linguistic porting of a content processing system”, UJF « Université Joseph Fourier ». 2004-2005: Yan WANG, TER “evaluation of aids for translators, locally and on the Web, and a first study of UNL”, UJF « Université Joseph Fourier ». 2003-2004: Mélanie PINEAU, TER “Comparison of translation results (automatic or not)”, UJF « Université Joseph Fourier ». 2002-2003: Song ZHICHAO, TER “Translation of a corpus of parallel sentences”, UJF « Université Joseph Fourier ». 2002-2003: Kamel AMIROU, TER “System for helping human translation assisted by machine”, UJF « Université Joseph Fourier ». 1 article of book, 1 article in international journal, 17 papers in internationals conferences (MT-Summit, COLING x 2, EGC x 3, IBIMA x 2, LREC, TOTh x 2, NLDB, MEDES…) Article of book • Hajlaoui, N. and C. Boitet (2003), A "pivot" XML-based architecture for multilingual, multiversion documents: parallel monolingual documents aligned through a central correspondence descriptor and possible use of UNL. Convergences ’03, Alexandria, 2-6 December 2003. 8p. (http://www.cicling.org/2005/UNL-book/) Internationals journals • Hajlaoui, N. (2011). Automatic Acquisition of Corpus for Multimedia Applications. Journal of CIBIMA (Communications of International Business Information Management Association). Volume 2011 (2011), Article ID 254926, 12 p. (http://www.ibimapublishing.com/journals/CIBIMA/2011/254926/a254926.html) Internationals conferences • • • Specia, L., Hajlaoui, N., Hallett, C., and Aziz, W. (2011) Predicting Machine Translation Adequacy. Proc. Machine Translation Summit XIII. Xiamen, China. 19-23 September 2011. 8 p. Hajlaoui, N. (2011). Système de recherche de musique adaptable à la perception de chaque utilisateur. Proc. EGC-2011_Demos. Démonstration de logiciels dans la 11ème Conférence Internationale Francophone sur l'Extraction et la Gestion des Connaissances. Brest, France. 25-28 janvier 2011. 3 p. Hajlaoui, N. (2011). Extraction d’information à partir des SMS. Proc. EGC-2011 (11ème Conférence Internationale Francophone 3/7 • • • • • • • • • • • • sur l'Extraction et la Gestion des Connaissances). Atelier SoS'2011 (Sources ouvertes et services). Brest, France. 25-28-janvier 2011. 10 p. Hajlaoui, N. (2011). Acquisition de terminologie adéquate par extension multiniveau de corpus. Proc. EGC-2011. (11ème Conférence Internationale Francophone sur l'Extraction et la Gestion des Connaissances) Atelier QDC'2011 (Qualité des Données et des Connaissances). Brest, France. 25-28-janvier 2011. 12 p. Hajlaoui, N. (2010). Extension multiniveau de corpus applicatif. Proc. IBIMA (15th International Business Information Management Association) Conference. Caire, Egypt. 6-7 Novembre 2010. 11 p. Hajlaoui, N. and C. Boitet (2010). Multilinguization and Personalization of NL-based Systems. Proc. COLING Workshop CLIA (Cross Lingual Information Access). Chine, août 23-28, 2010. 9 p. Hajlaoui, N. (2009). End to end evaluation of Impression based Music retrieval service. Proc. ACM MEDES'09 (The international ACM Conference on Management of Emergent Digital EcoSystems). Lyon, France. October 27-30, 2009. 5 p. Hajlaoui, N. and C. Boitet (2009). Portage linguistique de l’arabe et vers l’arabe. Proc. Infol@ngues III 2009 Conférence sur l’informatique et les langues. Tabarka, Tunisie. 20-21 février 2009. 10 p. Hajlaoui, N. and C. Boitet (2008). TA statistique à petits corpus pour des petits sous-langages. Proc. TOTh 2008 Conférence sur la Terminologie & Ontologie : Théories et Applications. Annecy, France. 5-6 juin 2008. 15 p. Hajlaoui, N., D. M. Daoud, et al. (2008). Methods for porting NLbased restricted e-commerce systems into other languages. Proc. LREC 2008 (Language Resources and Evaluation Conference). Marrakech, Maroc. june 26-31, 2008. 7 p. Hajlaoui, N. (2007), Multilinguïsation de services de gestion de contenu. Proc. IC (Ingénierie des Connaissances) dans le cadre de la plate-forme AFIA (Association Française pour l’Intelligence Artificielle), France, Grenoble 2-6 juillet 2007. Hajlaoui, N. (2007), Localizing content management application for spontaneous textual utterances in natural language, Proc. NLDB (12th international conference on Applications of Natural Language to Information Systems) France, Paris 27-29 juin 2007. Hajlaoui, N. and C. Boitet (2007). Portage linguistique d'applications de gestion de contenu. Proc. TOTh 2007 Conférence sur la Terminologie & Ontologie : Théories et Applications. Annecy, France. 1 juin 2007. 13 p. Hajlaoui, N. (2006). Recherche et production de corpus de messages pour la multilinguisation de sites de e-commerce en SMS, initialement en arabe. Proc. IBIMA (6th International Business Management Information Association) Conference. Bonn, Allemagne. 5-6 juin 2006. 11 p. Hajlaoui, N. (2006), PolyphraZ, un outil pour la gestion de corpus multilingues parallèles à travers le web, CALS (Colloque international d’Albi Langages et Signification), France, Albi, 10-14 4/7 • • Member of scientific committee • • • • • • juillet 2006. Hajlaoui, N. and C. Boitet (2004). PolyphraZ: a tool for the quantitaive and subjective evaluation of parallel corpora. Proc. IWSLT 2004: International Workshop on Spoken Language Translation. Kyoto, Japan. September 30–October 01, 2004. 8p. Hajlaoui, N. and C. Boitet (2004), PolyphraZ : a tool for the management of parallel corpora. Proc. COLING Workshop MLR2004 (on Multilingual Linguistic Resources). Geneva. August 28th 2004. 8p. RANLP 2011, RANLP SRW 2011 TALN 2010, TALN 2011. IBIMA 2010, IBIMA 2011. LREC 2010, Semitics languages workshop. CORIA 2010. COSI 2010 TEACHING: 636 H TD 2009-2010 Lecturer at the private school “Sup2i” at Grenoble: 16 H of courses. • 2008-2009 Associate Lecturer at Jean Monnet University, Saint Etienne : 96 h TD (tutorials) • 2007-2008 2003-2007 Module “real-time and embedded systems”: 16 H courses. Course and TD on “tools and software”, Licence 1 Economie et gestion : 96 h TD Associate Lecturer at Jean Monnet University, Saint Etienne: 96 h TD • Course and TD of “information system and database”, Master 1 “Economie et Management” : 48 h TD . • Course and TD “web technologies, tools and languages” : license AES « Administration Economique et Sociale : 48 h TD. Attached teacher at UPMF (Université Pierre Mendès France, Grenoble 2): 428 h TD • • • Initiation to informatics, office software, basic notions of network and database, DEUG psychology : 146 h TD = (56 h TD in 02/03 + 90 h TD in 03/04) Design and definition of a course programme in “informatics and documentation”: DEUG sociology (96 h C/TD = (32 h TD in 02/03 + 64 h TD in 03/04)), L1 sociology (120 h C/TD in 04/05) Animation of work on management of dynamic Web applications and databases (XHTML/CSS/PHP5/Postgres), 5/7 • Licence Professionnelle SIL « Systèmes Informatiques et Logiciels » at IUT2 (18 h TD) E-learning: C2I « Certificat Informatique et Internet » at UPMF (48 h TD) COLLECTIVE/COMMUNITY ACTIVITIES 2011-2012 Submission of project: following my stay as visiting researcher at university of Qatar, a proposal project is under submission. 2004-2007 Elective function: member representing by election all PhD students on the CLIPS laboratory called now LIG. 2003-2004 Responsible of project proposals: in the framework of an internal project at GETA, TraCorpeX (Traduction de Corpus d’eXemples), preparation and submission of two project proposals in the framework of the French-Tunisian cooperation to the CMCU (Comité Mixte de Coopération Universitaire). 2003-Today Elected member of the CA (Conseil d’Administration / board) of the Champollion association, a non-profit association supporting research on MT and NLP. SCIENTIFIC COMPETENCES INFORMATICS COMPETENCES Systems: Windows, Linux, Unix, Mac Os X. Environments: Visual BASIC 6, Visual C++ 6, AMC*Designer 6, Developer 2000. SGBD: ORACLE 7-8, MYSQL, ACCESS. POSTGRES… Languages: C/C++, SQL, XML, HTML,XHTML, Xpath, CSS, PHP, Java, Java script, Visual Basic, Prolog, Perl, TclTk… Architecture: Enhydra/Postgres, PHP/MySQL/Postgres Methods: UML, MERISE Specialised Language for Linguistic Processing (SLP) : EnCo (EnConversion environment), et DeCo (Deconversion environment) : two SLP diffused by UNL « Universal Networking Language » centre of Tokyo). LANGUAGES Arabic: mother tongue French: bilingual English: read (good), speak (good), write (good) Italian: read (little), speak (little), write (little) 6/7 COMPLEMENTARY INFORMATIONS CIVIL STATUS Date of birth: 01/21/1978 Nationality: Tunisian, French Driving license: category B Civil status: single INTEREST CENTRES Sport: football, swimming … Travel New technologies PROJECTS SHORT-TERM GOALS Finish my actual research fellow. MEDIUM-TERM GOALS Refine the linguistic porting technologies on which I worked during my PhD. Do further research on the types of sublanguages for which it is possible to develop efficient (useful) statistical MT systems in the context of applications extracting content from spontaneous utterances. LONG-TERM GOALS Join some establishment offering an adequate research and/or development environment. 7/7