Team : AMA
Transcription
Team : AMA
Team: AMA Scientific leader: E. Gaussier, M. B. Gordon Project for the creation of a team Apprentissage : Modèles & Algorithmes (AMA) (Machine Learning: Models & Algorithms) Web site: http://equipes-liglab.imag.fr/ Parent Organizations: Université Grenoble 1, Université Grenoble 2, Grenoble INP, CNRS Contents 1 General presentation 2 2 Team Composition 2 3 Research Themes 3.1 Fundamentals of machine learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Analysis and exploration of complex data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3 Learning and social systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4 5 6 4 Application Domains and Social, Economic or Interdisciplinary Impact 6 5 Contracts and Grants 5.1 External Contracts and Grants (Industry, European, National) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2 Research Networks (European, National, Regional, Local) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.3 Internal Funding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 7 7 7 6 Principal International Collaborations 8 7 Visibility, Scientific and Public Prominence 7.1 Contribution to the Scientific Community . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.2 Prizes and awards . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.3 Public Dissemination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 8 10 10 8 Software and Research Infrastructure 10 9 Educational Activities 10 10 Self-Assessment 10.1 Self-positioning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.2 SWOT analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 11 12 11 Perspectives for the Research Team 13 12 Publications 14 This document presents the creation of a new team within the LIG. This new team relies on 6 permanent individuals with complementary expertise. These individuals have collaborated at various degrees in the past years (through informal and formal projects, through the coordination of teaching activities in machine learning/data analysis and through the organization of events) and share a common interest for machine learning and data analysis. We believe that the time is ripe now for integrating our various expertise and common interests into a single team within the LIG, for the following reasons: 1. Our research area is anchored in theoretical computer science (how to make machines learn from, reason on and adapt to their environment) with applications in many different domains related to several LIG teams: robotics, natural language processing, information retrieval, computer vision, image processing; 2. Our research agenda directly addresses some of the major challenges on which the LIG is positioned, as (we give in parentheses the associated LIG challenge; see the LIG project for the 2011-2014 quadrennial): Page 1 on 22 • Large scale information access (conceptual challenge: Information Access) • Modeling of complex, dynamic and structured data (conceptual challenge: Information Access) • Analysis and modeling of social systems (societal challenge: Open Enterprise) 3. Our team rallies and merges activities that were previously spread over different teams and labs, and offers the opportunity for a strong positioning around machine learning, data analysis and social systems. The remainder of this document is organized as follows. We first provide in sections 1, 2 and 3 the composition of the envisaged team and the research axes we want to develop. We then give in sections 4 to 9 general background on the team members and their activities as researchers and teachers. These sections should not be read as a summary of the past activity of a team, but as a summary of the past activity of individuals, the goal here being to illustrate the fact that we are already engaged in several scientific and contractual activities which will benefit to the team. Sections 10 and 11 provide a positioning of the team within the LIG and summarize the research activity we want to develop. Lastly, section 12 lists the publications of the team members. Here again, the goal is not to present a summary of a team’s publication activity, but to provide an idea of the publications of the individual members of the envisaged team. 1 General presentation Machine learning as a research field appeared at the end of 50s, when one came to the conclusion that it was impossible to manually encode, within artificial intelligence applications, all the information and knowledge necessary to solve practical problems. Right from the start, two main research tracks were developed: (a) an activity oriented towards modeling and inference of decision functions, which corresponds to what is currently referred to as machine learning and which is close to data analysis, and (b) an activity aiming at studying cognitive mechanisms at play during a learning activity, and which is close to cognitive sciences. Nowadays, depending on the information to be extracted and the application targeted, several paradigms allow one to cluster (unsupervised learning) or discriminate (supervised, semi-supervised, transductive learning) vectorial or relational data, in a static or dynamic way (active learning), or even pilot complex systems (reinforcement learning). In each of these paradigms, several learning methods are called for in order to analyze data and build models. These methods relate to logics (e.g. inductive logic programming), statistics (e.g. support vector machines, hidden Markov models), data analysis (factorial analysis), biology (evolutionary algorithms) or statistical physics (Boltzmann machines). A distinction is traditionally made between symbolic and numerical methods, depending on their reliance on discrete or continuous models. In practice, this distinction operates as a trade-off between interpretability and effectiveness of the model learnt (the notion of interpretability is related to the fact that a user can easily comprehend the knowledge inferred by a certain model). Machine learning has been so popular in the past years that it has been applied to many different domains, to name but a few: social network analysis, Web usage mining, text clustering and categorization, ontology learning, bio- and chemo-informatics, decision making, robotics, pattern recognition, ... The AMA team The overall goal of the AMA team is to design methods and models for the adaptation of a system to its environment, based on data collected from this environment. Our research thus fits within the two axes mentioned previously: (a) inference of decision functions and (b) computational models of the behavior of individuals and groups of individuals. In the first axis, we will focus on structured and dynamic data, as graphs or networks evolving over time (most social networks and RSS flows can be formalized in this way). Several methods have been developed for learning from structured or dynamic data, but very few ones have been proposed for both structured and dynamic data. In the second axis, we will focus on the modeling of cognitive processes at play when several social agents interact, in both economic and sociological domains. This research line, which has connections with the previous one, is thus at the intersection of machine learning, cognitive sciences and the study of complex systems. Our will to develop these two aspects of machine learning, while bridging them through the study of complex and structured data/systems evolving over time, is a key differentiator of our team with respect to other machine learning teams in France and abroad. 2 Team Composition The AMA team will rely on 6 permanent individuals with complementary expertise: • Cécile Amblard, MCF, UJF, currently within the team TIMB, TIMC lab, bringing in competencies in applied mathematics, and especially statistics (26th section, CNU); Page 2 on 22 • Gilles Bisson, CR CNRS, currently within the team AMA, TIMC lab, bringing in competencies in machine learning and artificial intelligence (7th section, CNRS); • Ahlame Douzal, MCF UJF, currently within the team TIMB, TIMC lab, bringing in competencies in data analysis and machine learning (27th section, CNU); • Catherine Garbay, DR CNRS, currently within the team MAGMA, LIG lab, bringing in competencies in artificial intelligence and cognitive sciences (7th section, CNRS); • Eric Gaussier, PR UJF, currently within the team MRIM, LIG lab, bringing in competencies in machine learning and information access (27th section, CNU); • Mirta Gordon, DR CNRS, currently within the team AMA, TIMC lab, bringing in competencies in statistical physics and cognitive science (7th section, CNRS). The envisaged team is thus composed of two persons at the heart of the current AMA team existing in the TIMC lab (G. Bisson and M. B. Gordon), two additional members of the TIMC lab (C. Amblard and A. Douzal), and two members from the LIG (C. Garbay and E. Gaussier). In addition, several PhD students are currently supervised by one (or several) team members. Some of these students have just started their PhD, and will be part of the new team, provided, of course, it is created. Others will have finished before such a creation is made possible. We mention them here anyway as this gives an indication of the number of PhD students we want to have, in a continuous flow. Doctoral Students Name University Supervisors Funding (sources and dates) Date of first registration S. Clinchant UJF E. Gaussier CIFRE 09/11) Sept. 08 A. Diallo UJF C. Frambourg UJF C. Grimal UJF F. Hussain UJF C. Lagnier UJF B. Li UJF C. Metzig UJF M. B. Gordon F. Meyer UJF E. Gaussier J.-F. Patri UJF A. M. Qamar UJF A. Douzal (co-supervision F. Giroud) A. Douzal (co-supervision J. Demongeot) G. Bisson (co-supervision E. Gaussier) G. Bisson (co-supervision M. B. Gordon) E. Gaussier (co-supervision G. Bisson) E. Gaussier (co-supervision J.P. Chevallet) M. B. Gordon (co-supervision J.-P. Nadal) E. Gaussier (XRCE; 09/08- None Sept. 2005 MENRT (09/08-09/11) Sept. 08 ANR (02/09-02/12) Nov. 09 Allocation from Pakistan (as par of the French-Pakistan scientific collaboration) Jan. 07 MENRT (10/09-10/12) Oct. 09 ANR (02/09-02/12) April 09 Allocation from EDIS doctoral school in Cognitive Science (04/10-04/13) Orange Labs (full time employee; 33% of his time devoted to his PhD; 03/0803/11) Apil 10 March 08 DGA (10/09-10/12) Oct. 09 MENRT (11/07-11/10) Nov. 07 Lastly, 8 PhD theses, supervised by one or several members of the team, were successfully defended during the current quadrennial (J. Bourbeillon, T. Guyet, L. Kefi, D. Renaudie, V. Robinet, B. Scherrer, V. Semeshenko, S. Wieczorek). Page 3 on 22 3 Research Themes The research we want to develop within the AMA team can be structured into three main themes, which can in turn be decomposed into theoretical and practical axes. 1. Theme 1: Fundamentals of machine learning • Axis: Proximity indices and metrics for complex data • Axis: Phase transition in learning • Axis: Learning in high dimensions 2. Theme 2: Analysis and exploration of complex data • Axis: Classification, clustering and (co-)clustering • Axis: Visualization of relational data 3. Theme 3: Learning and social networks • Axis: Social systems of learning individuals • Axis: Agent based dynamic learning 3.1 Fundamentals of machine learning This theme is devoted to the the study of three fundamental problems in machine learning, which have impacts on the following research themes. Axis: Proximity indices and metrics for complex data (Persons involved: C. Amblard, G. Bisson, A. Douzal, E. Gaussier, M. B. Gordon) Proximity indices and metrics play a crucial role in learning as they are at the basis of similarity and distance measures used to represent datasets (many classification methods directly take as input a similarity or dissimilarity matrix). We first want to study in this axis the different measures proposed for relational and temporal data so as to (a) if possible, unify them within a common framework, (b) compare them from first principles, (c) be able to chose the appropriate measure for a targeted application or a given goal, and (d) extend the methods used for temporal relations to other contiguity relations (as spatial relations). We also want to study the learning of metrics appropriate to the data under focus, i.e. metrics reflecting the underlying geometry of the data. There have been a lot of works on metric learning, aiming at learning an appropriate distance function. We want to extend these works so as to learn appropriate similarity functions (e.g. generalized cosines) as similarity functions are sometimes preferred over distance functions. Lastly, we plan on studying inverse methods associated with Bayesian modeling so as to derive new dissimilarity functions. Axis: Phase transition in learning (Persons involved: G. Bisson, M. B. Gordon) In ILP (Inductive Logic Programming), learning often relies on algorithms of the type generate and test. A major component of such algorithms is the coverage test, based on θ-subsumption, which yields a yes/no answer: Either the current hypothesis covers the example, or it does not. Several researchers have shown that the use of θ-subsumption was associated with a phase transition phenomenon, rendering the generate and test approaches inadequate for some learning problems. In this context, we plan to generalize the standard coverage test used in ILP through the introduction of the notion of partial coverage between an hypothesis and an example. More precisely, given two clauses, h and e, we want (a) to determine whether a sub-clause h0 of h covers e, and (b) to quantify the degree, as a real value, of the partial subsumption. This work thus aims at proposing new learning methods in ILP. Axis: Learning in high dimensions (Persons involved: C. Amblard, A. Douzal, E. Gaussier) The applications we are targeting often involve high dimensional data. As an illustration, DMOZ (www.dmoz.org), the largest repository of the web, contains hundred of thousands of categories, hierarchically organized, and several million documents lying in a vector space of hundred of thousands of dimensions. It is not possible to directly deploy existing methods in such datasets, and one needs to revisit existing techniques through parallelism, model simplification and sampling (e.g. through the use of few annotated data and semi-supervised, transductive or active learning). More specifically, we want to address the following problems: Can one define criteria allowing, without resorting to classification, to compare two category systems Page 4 on 22 so as to select the best one? Can such criteria be used to infer hierarchical category systems? Can one learn parsimonious models, leading to faster algorithms, behaving well in high dimensions, a situation where non-parsimonious models are usually required? Part of these problems will be addressed first within the family of decision functions learned via regularized maximum likelihood, a family for which consistency results and limiting distributions should help establish the criteria we are looking for. 3.2 Analysis and exploration of complex data This theme is devoted to the development of models and algorithms for analyzing and visualizing complex data. It focuses on certain data types and methods. Axis: Classification, clustering and co-clustering (Persons involved: C. Amblard, G. Bisson, A. Douzal, E. Gaussier, M. B. Gordon) Members of the team have a strong experience and expertise in classification (supervised learning) and clustering (unsupervised learning). We want to extend this experience and expertise to new types of data and to new applications. • Classification of temporal data More and more applications involve variables evolving over time, for which it is necessary to develop solutions for classification and regression. We are particularly interested in this area in extending decision and regression trees, as one of our goals is to produce interpretable models, which is the case for the models associated with classification and regression trees. This problem has only recently been addressed, and very few results have been obtained. The approach we want to follow relies on two main elements: (a) proximity indices and metrics for temporal data as described in the previous theme (research theme 1), adapted to the particular case of discrimination, and (b) the identification of highly discriminant sub-sequences. This last point is crucial and opens the way to feature set selection with contiguity constraints. • Clustering of chemical data Relational data is present in many domains under various forms (logical expressions, graphs, object representations, ...). The targeted application domain here is chemo-informatics, and more particularly the study of screening data. High throughput screening of chemo-libraries is a standard experimental approach to discover effectors. In this context, it is important to provide chemists with tools to help them explore the content of chemo-libraries and to organize (semi-)automatically this content so as to better localize active molecules and better evaluate the diversity of chemical structures. We want to develop here new clustering tools based on new proximity indices and metrics, as the ones explored in the previous theme (research theme 1). • Co-clustering Co-clustering is a data mining technique which allows simultaneous clustering of the rows (e.g. individuals) and columns (e.g. variables or features) of a matrix (typically a collection matrix). This definition can be extended to the case where the data is not only represented by a matrix, but by a set of matrices or tensors (defined here as multi-dimensional extensions of matrices). Several members of the team have worked on different co-clustering models, sometimes referred to as co-occurrence models: LSA (Latent Semantic Analysis), PLSA (Probabilistic Latent Semantic Analysis), NMF (Non-negative Matrix Factorization) and χ-Sim (developed by G. Bisson and his students). All these models were originally defined for matrices, but extensions to tensors begin to appear. We plan on revisiting all these models in the case of both set of matrices and tensors, which raise different generalization problems. In particular, we would like to explore the links between generalizations of NMFs and χ-Sim with probabilistic models for n-uples. Some of us were the first ones to establish a formal relation between PLSA and NMF, and we plan to follow the same approach for generalized versions of these techniques. Axis: Data visualization (Person involved: G. Bisson) Hierarchical clustering is a valuable tool for analyzing and modeling data. However, when one works with dendrograms containing more than a few hundred leaves, it becomes difficult to visualize and explore the tree. It is indeed possible to visualize only part of the data (e.g. through zooming on a particular zone), but one lacks, in doing so, a global view of the data. One of us has introduced a new model for visualizing dendrograms (a model called Stacked Trees) allowing one to simultaneously represent the information contained in tens of thousands of objects. This work was proposed in the framework of chemical data analysis and needs be extended to cover other data types, as texts or temporal data. We plan to investigate such an extension within the new AMA team. Page 5 on 22 3.3 Learning and social systems This theme is devoted to the study of social systems so as to (a) propose accurate, computational models of such systems, and (b) be able to deploy learning algorithms when a complex system interacts with a dynamically evolving environment. The research we plan to develop within the second theme (Analysis and exploration of complex data) on co-clustering also fits within this theme as it is partly concerned with accessing information in social networks. We do not repeat it here. Axis: Social systems of learning individuals (Persons involved: G. Bisson, M. B. Gordon) This axis aims at studying the passage from individual to collective behaviors (how to model a community of individuals from a model of each individual?) as well as the macroscopic dynamics of social systems (can we elaborate models accounting for the evolution of social systems?). This question is crucial from a theoretical point of view, but also from a societal one as it can impact the definition of public policies. From a theoretical point of view, we want to go beyond models with complementary interactions (as Schelling’s and Granovetter’s models), where individuals tend to behave in conformity with others (positive interactions), as such models do not account for all the situations encountered in practice. Our goal here is to propose other models to account for both positive and negative interactions (as in fashion), non-symmetrical interactions and interactions mediated through larger networks (as for criminals). The use of models derived from dynamic systems and statistical physics, as well as the use of numerical simulation, should allow the development of social models accounting for the various phenomena observed in practice. Furthermore, additional clustering and co-clustering methods can also be used to study certain types of data, as the ones pertaining to criminality. Axis: Agent based dynamic learning (Persons involved: C. Garbay, E. Gaussier) The general context here is the one of autonomous systems interacting with a complex environment, for which the learning problem involves heterogeneous processes (human and artificial agents, constructivist and classification-based approaches, ...). Furthermore, in many situations, the available annotated data does not cover all the situations one can face (as in image processing). Having agents collaborating to dynamically construct partial yet complementary solutions to the overall problem thus seems a promising direction. In particular, agents can collaborate to incrementally construct a complete solution gradually covering all the data types and situations. The learning framework one faces here has the following characteristics: • It is multi-scale as individuals as well as communities of agents are considered; • It calls for learning paradigms dedicated to situations where few annotated data is available, as: – semi-supervised or transductive learning, which allows one to leverage small amount of annotated data with large amount of unannotated data, – and active learning, which allows the dynamic annotation of data so as to improve the solution learned so far; • It is contextual and collaborative (as agents interact to construct partial solutions which depend on the situations considered). The goal of learning in this framework is thus to articulate machine learning methods within a complex environments where solutions can only be built incrementally, by gradually covering more and more situations, in a dynamic, evolving environment. 4 Application Domains and Social, Economic or Interdisciplinary Impact The study of complex, dynamic systems through computational models has expanded to social sciences, and the modeling of the interactions between individuals has become the focus of several research lines. Within this field, we are interested in the modeling of systems of social agents, evolving over time and interacting to achieve a certain goal, be it the one of sharing information or fostering collaborations. Mining such systems so as to discover recurrent or emerging patterns will help identify communities and outliers, as well as predict their evolution over time. This research area, related to the third research theme presented in the previous section, has a direct societal impact, inasmuch as computationl models and software one can develop could be used to reduce criminality and help peolple interacting in smoother ways. On more traditional aspects, the research we envisage on large-scale classification could directly benefit to patent offices, to companies involved in intellectual property management or to organizations maintaining large category systems (as the DMOZ association). Similarly, the research related to the second theme of section 3, Analysis and exploration of complex data, can lead to new tools for exploiting data streams, as RSS flows. Such tools can have a direct economic impact, either through existing companies (TEMIS, MatrixWare, ...) or through start-ups to be created. Page 6 on 22 5 5.1 Contracts and Grants External Contracts and Grants (Industry, European, National) • [Industry]] Contract with Orange Labs on machine learning techniques for recommendation systems; 3 years, from Feb. 2008 till Feb. 2011 (15ke) • [Industry]] Contract with Xerox Research Centre Europe on machine learning for information access in connected collections; 3 years, from June 2008 till June 2011 (13ke) • [Industry] Contract with THEMIS on survival analysis of clinical data; 6 months (ca. 8ke) • [European] Resarch collaboration with the Information Retrieval Facility (IRF) on Large Scale Text Classification. This collaboration is not associated with funding but allows us to use part of the computing infrastructure available at the IRF in Vienna, Austria • [ACI IMPIO (past)] ACCAMBA project on the design of tools for analyzing chemolibraries; 3 years, from Sept. 2004 till Sept. 2007 (90ke) • [ANR] METRICC project on exploitation of comparable corpora for bilingual information access; 3 years, from Feb. 2009 till Feb. 2012 (ca. 150ke) • [ANR] FRAGRANCES project on mining and learning from social networks; 3 years, from Feb. 2009 till Feb. 2012 (ca. 120ke) • [ANR] SYSCOMM DyXi project on dynamics of social groups; 3 years, from Feb. 2009 till Feb. 2012. Our participation to this project is administered by ENS Paris (for operation costs) and EHESS (for personnel) • [ARC-INRIA] Selmic project on longitudinal and mulimodal segmentation of cerebral abnormalities with MRI; 3 years, 2007-2009 (ca. 70ke) • [DGA] Project on the physiological follow-up of a warrior; 3 years, from July 2009 till July 2012 (ca. 120ke) 5.2 Research Networks (European, National, Regional, Local) • [European NoE] Several team members are part of the Network of Excellence PASCAL (Pattern Analysis, Statistical Modeling and Computational Learning). This Network of Excellence allows us to benefit from funding to attend conferences and set up challenges, as the Large Scale Hierarchical Text Classificaiton Challenge we co-organized in 2009 (with researchers from Demokritos, Athens) • [Regional project] Several team members are part of the CLUSTER project Web Intelligence • [Regional project] IXXI project on modellization of criminality; 2008 (ca; 5ke) 5.3 Internal Funding • [MSTIC project] LASCAR project on LArge Scale CAtegoRization; 2 years, from Jan. 2008 till Dec. 2009 (10ke plus funding for a one year post-doc) • [MSH project] (past)] Project on economical analysis of the dynamics of social norms; 28 months, 2004-2006 (ca. 5.3ke) • [IPI project] (past)] Project on efficient coordination with sub-optimal choices; 2 years, 2005-2006 (ca. 5ke) • [BQR project] (past)] Project on human cooperation, BQR INPG, 2006 (ca; 5ke) Page 7 on 22 6 Principal International Collaborations In addition to some of the international projects mentioned previously, we have collaborations with researchers in different countries, on different research themes, including exploratory data analysis, proximity measures, information retrieval and large scale classification. These collaborations involve joint publications and/or joint organization of events, with funding. In particular: A. Douzal has collaborations with L. Billard from Georgia University (USA) and P. N. Nagabhushan from Mysore University (India). E. Gaussier has collaborations with J. Savoy from Neuchâtel University (Swiss), G. Paliouras from NCSR Demokritos (Greece) and J.-C. Chappelier from EPFL (Swiss). 7 7.1 Visibility, Scientific and Public Prominence Contribution to the Scientific Community Management of Scientific Organizations • M. B. Gordon, co-director of the Laboratoire Leibniz (2004-2006) • M. B. Gordon member of the Scientific Committee of the Institut des Sciences de la Complexité Paris-Ile-de France, since 2006 • M. B. Gordon member of the Scientific Committee of IXXI (Institut des Sciences de la COmplexité de Lyon), since 2007 • C. Garbay, director of the CLIPS laboratory (2006) • C. Garbay, co-director of the Laboratoire LIG (since 2006) • C. Garbay member of the scientific council of the Institut Géographique National • C. Garbay member of the scientific council of the MSH Paris Nord • C. Garbay member of the Comité de Pilotage de l’Institut des Sciences de la Communication (CNRS) • C. Garbay member of the Comité de Pilotage de l’ARP ANR PIRSTEC (Prospective Interdisciplinaire en Réseau pour les Sciences et TEchnologies Cognitives) Administration of Professional Societies • E. Gaussier member of the European Research Council, Computer Science panel, for Starting Grants, since 2006 • E. Gaussier member of the Executive board of the Euroapean Association for Computaitonal Linguistics, since 2007 • M. B. Gordon member of the Administrative Board of the Association pour la Recherche Corgnitive (ARCo), since 2006 Editorial Boards • Computational Linguistics, E. Gaussier, 2005-2007 • Traitement Automatique des Langues, E. Gaussier, 2006-2008 • AIM Journal Artificial Intelligence in Medecine (advisory board), C. Garbay, 1997-2009 • Revue I3 Information-Interaction-Intelligence (founder - Chief Editor), C. Garbay, 2001-2007 Organization of Conferences and Workshops • Co-presidency of the organization committee of AFIA 2007 (conference from the Association Française d’Intelligence Artificielle) • Co-presidency of the organization committee of the 1ière Ecole d’Eté Web Intelligence, Autrans, July 2008 • Co-presidency of the organization committee of the 2ième Ecole d’Automne sur la Recherche d’Information et ses Applications, Toulouse, Oct. 2008 Page 8 on 22 • Co-presidency of the organization committee of the LREC 2008 Workshop on Comparable Corpora, Marrakech, Maroc, 2008 • Presidency of the organization committee of the XVIièmes Rencontres de la Société Française de Classification, Sept. 2009 Program Committee Members • Co-presidency of the program committee of the International Conference on Empirical Methods in Natural Language Processing, EMNLP’2006 (E. Gaussier) • Presidency of the program committee of the INTECH seminar, INRIA Rhône-ALpes, Jan. 2008 (E. Gaussier) • International Conference on Information Retrieval (ECIR), E. Gaussier, 2006-2009 • European Conference on Information Retrieval (ECIR), E. Gaussier, 2006-2009 • European Conference on Computational Linguistics (EACL), E. Gaussier, 2006 • International Conference on Computational Linguistics (ACL), E. Gaussier, 2006 • European Conference on Machine Learning (ECML), E. Gaussier, 2007, 2009 • Conférence sur la Recherche d’Information et ses Applications (CORIA), E. Gaussier, 2006-2009 • Conférence d’Apprentissage (CAp), E. Gaussier, 2009 • European Symposium on Artificial Neural Networks, M. B. Gordon, 2003-2007 • Colloques ARCo, M. B. Gordon, 2006-2008 • Computational Methods for Modeling and leArning in Social and Human Sciences (MASHS), M. B. Gordon, 20072009 • International Conference on Tools with Artificial Intelligence (ICTAI), G. Bisson, 2008 • International Conference on Advanced Computing and Communication, A. Douzal, 2005 • International Conference on Pattern Recognition and Machine Intelligence, A. Douzal, 2005 • 15ièmes Rencontres de la Société Française de Classification, A. Douzal, 2008 International Expertise • National Swiss Foundation, E. Gaussier, 2007, 2009 • Fonds de recherche sur la nature et les technologies, Québec, M. B. Gordon, 2005 • CyT, Agentina, M. B. Gordon, 2005 • CERG - Hong-Kong Research Grants Council, M. B. Gordon, 2005, 2007, 2008 National Expertise • ANR, Masse de données et Contenus et Interactions, E. Gaussier, 2006, 2008, 2009 • Pôle DIGITEO, E. Gaussier, 2008 • AERES (présidence comité d’évaluation), E. Gaussier, 2007 • Member of the evaluation board, ANR Techlog, C. Garbay, 2005-2006 • Presidency of the evaluation committee of the ANR program DEFI, C. Garbay, 2006 • Member of the evaluation committee of the ANR program Communication, C. Garbay • Member of the Comité National (CNRS), C. Garbay, 2008-2012 Page 9 on 22 • Member then Head of the ASTI Thesis Award committee, C. Garbay, 2005-2009 • Member and Head of Laboratory Evaluation Board (CNRS), C. Garbay, 2006 • Head of Laboratory Evaluation Board, IGN, C. Garby, 2004-2008 • Member of Laboratory Evaluation Board, Institut Telecom, C. Garbay, 2008 7.2 Prizes and awards Best student paper award: B. Scherrer, F. Forbes, C. Garbay, M. Dojat. Fully Bayesian Joint Model for MR Brain Scan Tissue and Structure Segmentation. Proceedings of the 11th International Conference on Medical Image Computing and Computer Assisted Intervention, Berlin, 2008. 7.3 Public Dissemination M. B. Gordon: Interview (50mns) on Radio RCF 103.7 FM Mathématiques et sciences sociales. Broadcasted within Les Chemins de la Connaissance on Monday, May 29, 2006, and Saturday, June 3, 2006 Within L’Université Populaire de Grenoble, the aim of which is to organize seminars open to everybody (http://fr.wikipedia.org/wiki/Universite populaire de Grenoble): • G. Bisson – Intelligence Artificielle : du mythe à la réalité. 8 March 2007 – Apprentissage du raisonnement en Intelligence Artificielle. 3 May 2007 – Hasard et nécessité des normes industrielles en Informatique. 28 May 2008 • M. B. Gordon – Les réseaux de neurones (29-03-07) – De l’apprentissage artificiel à l’apprentissage humain : comment rendre compte des paradoxes sociaux ? (13-0607) 8 Software and Research Infrastructure Nothing in particular to report here. 9 Educational Activities Supervision of Educational Programs • E. Gaussier: director of the Master Informatique de l’UJF (since Sept. 2008) • E. Gaussier: director of the Magistère Inforamtique de l’UJF (Sept. 2006-Sept. 2008) Teaching Name Position C. Amblard MDC UJF G. Bisson CR CNRS A. Douzal MDC UJF Year 20062009 20062009 20062009 Number of hours to teach Academic Program University 192/year Licence, M1, M2 UJF 30/year Licence, CUEFA 218/year Licence, M1, M2 M2, Cycle C UJF UJF Page 10 on 22 E. Gaussier PR UJF M. B. Gordon DR CNRS 10 20062009 20062009 210/year Licence, M1, M2 UJF 15/year M2, Ecole doctorale UJF Self-Assessment The creation of a new team within a laboratory is a challenging task, one aspect of which lies in positioning the new team within the existing research landscape. The axes we plan on developing intersect the research axes of other teams within the LIG. We first detail here our positioning within the LIG, as well as the possible collaborations with teams which lead a research complementary to ours, prior to provide a SWOT analysis of the creation of the AMA team in the LIG. 10.1 Self-positioning The E-MOTION team conducts research on models and algorithms for the construction of intelligent systems. Three main axes are developed: (1) Perception and multimodal modeling of space and movement, (2) movement planning and autonomous navigation, and (3) Bayesian models for learning, decision making and sensory-motor systems. This last axis shares several aspects with AMA’s research axes. It however differs from them as: (a) The application domains considered in E-MOTION are linked to robotics and Bayesian modeling of cognitive systems, and (b) the learning aspect is just an element in the Bayesian approach advocated by E-MOTION. Model building and inference are two important aspects of their approach which are dealt with in the AMA research agenda. Despite these differences, the two teams do have common interests in terms of probabilistic models and related problems: regularization, feature selection, ... Collaborations can thus be developed. The GETALP team is interested in natural language processing (both written and spoken language), and makes use of probabilistic models, mainly for speech recognition and statistical machine translation. The training of such model in situations where few annotated data is available is of primary importance for this team, which tries to solve the problem through acquisition of new data and active learning. Furthermore, GETALP is also interested in in learning abstract semantic representations, for machine translation for example. This type of learning has traditionally been addressed with logical approaches (as ILP Inductive Logic Programming). Researchers now try to have a mixed approach, combining probabilities and logic, a domain on which AMA could contribute. The HADAS team conducts research on the deployment and running of data services in situations where both data and services are distributed and dynamic (ubiquitous computing) and accessible (ambient computing). Part of its activity focuses on data mining (through the works of A. Termier and M.-C. Rousset) and the discovery of (frequent) patterns in structured data, which is strongly connected to part of our research agenda. We hope to be able to benefit from their expertise in several aspects of the problem. Conversely, HADAS is also interested in using machine learning techniques (in particular case-based reasoning) for the coordination of services (in particular the optimization of a request plan). Even if AMA does not intend to work on case-based reasoning, we are interested in deploying other machine learning techniques in this context. The METAH team conducts research on human learning, and makes use of models which radically differ from the ones used in AMA. This being said, common interests exist between the two teams. On the one hand, METAH is interested in mining and analyzing student logs derived during a certain learning task, a problem which involves both data analysis and machine learning techniques. On the other hand, AMA is interested in social systems of learning individuals for which the models developed in METAH could be relevant. We thus plan on further investigating these two points with members of the METAH team. The MAGMA team conducts research on multi-agent systems, and is particularly interested in understanding the notion of ”emergence” in such systems. The research we want to develop on agent-based dynamic learning partly intersects the MAGMA research agenda. As the focus of our research is on learning models within complex environments, we are likely to be, at first, users of multi-agent systems. We however hope, in a near future, to foster stronger collaborations with the MAGMA team on learning models in multi-agent systems. The MRIM team conducts research on information retrieval models, for multilingual, multimedia data, with or without structure. Several works in this team rely on machine learning techniques. The video annotation system developed in the team, for example, relies on a cascade of SVM classifiers ( la stacking) and uses active learning to reduce the annotation cost. Similarly, the problem of collaborative filtering studied in the team can be formulated as learning to rank problem and addressed within this framework. More directly, the ANR project METRICC (see the contract section above), in which both J.-P. Chevallet and E. Gaussier are involved, will allow a direct, formal collaboration between MRIM and AMA on the probabilistic modeling of multilingual collection for information extraction. This collaboration can be extended to other aspects, as learning to rank for information retrieval. The PRIMA team is interested in using machine learning models (as reinforcement learning) for various aspects of its research. It is also interested in the integration of a temporal dimension in the analysis of the data it is dealing with. We thus Page 11 on 22 hope to be able to collaborate with members of this team, definitely open to various aspects of machine learning and data analysis. Other teams in the LIG use machine learning techniques in their research. The AMA team is definitely interested in developing collaborations in order to evaluate the models and methods developed in different application contexts, and to identify new, important problems. We thus hope that fruitful collaborations and research will come out of the creation of the AMA team. 10.2 SWOT analysis Strengths Our main strength lies in the complementary expertise/experience of the individuals composing the team: • Cécile Amblard, MCF, UJF, with expertise/experience in applied mathematics, and especially statistics (26th section, CNU); • Gilles Bisson, CR CNRS, with expertise/experoience in machine learning and artificial intelligence (7th section, CNRS); • Ahlame Douzal, MCF UJF, with expertise/experience in data analysis and data mining (27th section, CNU); • Catherine Garbay, DR CNRS, with expertise/experience in artificial intelligence and cognitive sciences (7th section, CNRS); • Eric Gaussier, PR UJF, with expertise/experience in machine learning and information access (27th section, CNU); • Mirta Gordon, DR CNRS, with expertise/experience in statistical physics and cognitive science (7th section, CNRS). and the fact that such expertise/experience is used to develop two aspects of machine learning: (a) inference of decision functions and (b) computational models of the behavior of individuals and groups of individuals. The first aspect lies at the intersection of artificial intelligence, computer science and applied mathematics, whereas the other one at the intersection of dynamic systems, computer science and cognitive sciences. We bridge these two aspects through the study of complex and structured data/systems evolving over time, which we believe makes our team unique in the French machine learning landscape. Furthermore, despite the complementary aspects or our expertise/experience, we already have collaborated on research projects as well as on teaching activities. We thus believe our functioning as a single team should not raise big problems. Weaknesses Our main weakness lies the fact that, having different backgrounds, we also face some difficulties to understand each other. Sharing a common vocabulary takes time, and that is something we have to develop in order to be able to work as a team (we are currently working in that direction through weekly seminars and reading groups). Another weakness is related to the fact that we are currently targeting publications in different domains. It will certainly be necessary to re-target some of our publications towards machine learning in order to give a stronger positioning to the team. Opportunities As we said previously, we believe that the time is ripe now for integrating our various expertise (in applied mathematics, artificial intelligence, machine learning, data analysis, cognitive sciences, information access and statistical physics) and common interests into a single team within the LIG, the largest computer science lab in Grenoble. The main reasons for this are: 1. Our research area is anchored in theoretical computer science (how to make machines learn from, reason on and adapt to their environment) with applications in many different domains related to several LIG teams: Robotics, natural language processing, information retrieval, computer vision, image processing; 2. Our research agenda directly addresses some of the major challenges on which the LIG is positioned: • Large scale information access (conceptual challenge: Information Access) • Modeling of complex, dynamic and structured data (conceptual challenge: Information Access) • Analysis and modeling of social systems (societal challenge: Open Enterprise) and fits well within the major themes put forward by the LIG: • Sustainable and ambient systems: Our goal to develop models and algorithms to render systems adaptable to their environment contributes to this theme; Page 12 on 22 • Knowledge: Our goal to mine, analyze and model complex data represents an important step of the knowledge extraction process; • Green-by-IT: Environmental data presents interesting characteristics (sources are heterogeneous, the data is dynamic and partly structured) on which we could test our approaches and thus contribute to make better decisions regarding the management of environment; our addressing this problem will mainly depend on the availability of such data and the collaborations we can develop in this area. 3. Our team rallies and merges activities that were previously spread over different teams and labs, and offers the opportunity for a strong positioning around machine learning, data analysis and social systems. Threats No major threats identified. 11 Perspectives for the Research Team As mentioned previously, machine learning has yielded, over the last years, different theoretical and practical tools, which can be efficiently used in different contexts. These two aspects, theory and applications, will be present in our work, even though we tend to position ourselves more strongly on the theoretical side than on the applied one (as can be shown from our current publications, see next section, which rather target theoretical conference and journal venues in our domains). Besides the permanent members of the team, we also plan on relying, to conduct our work, on PhD students and post-docs (9 PhD students are currently (co-)supervised by team members) as well as on existing collaborations and collaborations to be set-up. Among the other institutes we are already working with, one can cite: CEA Grenoble, INRA (Grenoble and Jouy-enJosas), INRIA, LIP6, LJK, EPFL, ENS Paris, EHESS, Neuchâtel University, Orange Labs, Exalead, Xerox Research Center, or Porto Alegre University. Section 3 provided a detailed description of the research themes we want to develop in the AMA team. These research themes are articulated around axes that will help us develop our research agenda, which we recall here while providing some of the associated perspectives (see section 3 for more details): 1. Theme 1: Fundamentals of machine learning • Axis: Proximity indices and metrics for complex data, aiming at studying and unifying different measures proposed for relational and temporal data. • Axis: Phase transition in learning, aiming at developing new learning methods for Inductive Logic Programming through the notion of partial coverage. • Axis: Learning in high dimensions, aiming at developing parsimonious models for high dimensional data, through regularization and pachinko-like approaches. 2. Theme 2: Analysis and exploration of complex data • Axis: Classification, clustering and (co-)clustering, aiming at providing new methods for classification and clustering of complex, dynamic data. • Axis: Visualization of relational data, aiming at providing better tools to visualize large dendrograms. 3. Theme 3: Learning and social networks • Axis: Social systems of learning individuals, aiming at developing models which can account for all the interaction types observed in groups of individuals. • Axis: Agent based dynamic learning, aiming at articulating machine learning methods within a complex environment where solutions can only be built incrementally. These themes are inter-related and aim at jointly developing two aspects of machine learning: One at the intersection of artificial intelligence, computer science and applied mathematics, the other one at the intersection of dynamic systems, computer science and cognitive sciences. We bridge these two aspects through the study of complex and structured data/systems evolving over time. Page 13 on 22 12 Publications International peer reviewed journal [ACL] 1. A.S. Silvent, Michel Dojat, Catherine Garbay. Multi-level temporal abstraction for medical scenario construction. International Journal of Adaptive Control and Signal Processing, 19(5):377–394, 2005. 2. Amblard C., Girard S. (2005) Estimation procedures for a semiparametric family of bivariate copulas, Journal of Computational and Graphical Statistics, vol 14(2), pp 1-15. 3. H. Déjean, E. Gaussier, J.-M. Renders, F. Sadat. Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval. Artificial Intelligence in Medecine. Volume 33, Issue 2. Feb., 2005. 4. Gordon M. B., Phan D., Nadal J-P., Vannimenus J. Seller’s dilemma due to social interactions between customers. Physica A 356, Issues 2-4 (2005), pp. 628-640. 5. Nadal J-P., Phan D., Gordon M. B., Vannimenus J. Multiple equilibria in a monopoly market with heterogeneous agents and externalities. Quantitative Finance vol 5, N6 (2005) 557-568. 6. Robinet V., Bisson G., Gordon M. B., Lemaire B. 2007. Induction of High-level Behaviors from Problem-solving Traces using Machine Learning Tools. IEEE Intelligent Systems. 22(4), pp. 22-30. 7. Douzal-Chouakria, A., Nagabhushan, P.N. (2007). Adaptive dissimilarity index for measuring time series proximity. Advances in Data Analysis and Classification Journal. 1, 5-21, Springer Berlin / Heidelberg. 8. Nathalie Richard, Michel Dojat, Catherine Garbay. Distributed Markovian segmentation: Application to MR brain scans. Pattern Recognition, 40(12):3467-3480, 2007. 9. Florence Duchêne, Catherine Garbay, Vincent Rialle. Learning recurrent behaviors from heterogeneous multivariate time-series. Artificial Intelligence in Medicine, 39(1):25-47, 2007. 10. Thomas Guyet, Catherine Garbay, Michel Dojat. Knowledge construction from time series data using a collaborative exploration system. Journal of Biomedical Informatics, 40(6):672-687, 2007. 11. Prudent R., Moucadel V., Lopez-Ramos M., Aci S,. Laudet B., Mouawad L., Barette C., Einhorn J., Einhorn C., Denis J-N., Bisson G., Schmidt F., Roy S,. Lafanechere L., Florent J-C., Cochet C. Expanding the chemical diversity of CK2 inhibitors. Molecular and Cellular Biochemistry Journal. 2008. 12. Semeshenko V., Gordon M B, Nadal J-P. Collective states in social systems with interacting learning agents. Physica A 387 (2008) 4903-4916. 13. Gordon M B, Nadal J-P, Phan D, Semeshenko V. Discrete choices under social influence: generic properties. Mathematical Models and Methods in Applied Sciences (M3AS) 19 (Supplementary Issue 1) (2009) 1441-1481. 14. Gordon M B, Iglesias J R, Nadal J-P, Semeshenko V. Crime and punishment: the economic burden of impunity. Eur. Phys. J. B 68, 133-233 (2009). 15. Ma Y P, Gonçalves S, Mignot S,Nadal J.-P. and Gordon M.B. Cycles of cooperation and free-riding in social systems. Eur. Phys. J. B 71, 597-610 (2009). 16. Semeshenko V, Garapin A, Ruffieux B, Gordon MB. Information-driven coordination: experimental results with heterogeneous individuals. Accepté pour publication dans Theory and Decision (2009) 17. Amblard C., Girard S. (2009), A new extention of bivariate FGM copulas, Metrika, vol 70, 1-17, 2009. 18. Douzal-Chouakria, A., Diallo, A., Giroud, F. (2009). Adaptive clustering for time series: application for identifying cell cycle expressed genes. Computational Statistics and Data Analysis 53 (4), 1414-1426. Elsevier. 19. Benoit Scherrer, Florence Forbes, Catherine Garbay, Michel Dojat. Distributed Local MRF Models for Tissue and Structure Brain Segmentation. IEEE transactions on medical imaging, 28(8):1296–1307, 2009. 20. Julie Bourbeillon, Catherine Garbay, Françoise Giroud. Mass data exploration in oncology: An information synthesis approach. Journal of Biomedical Informatics, 42(6):612-623, 2009. 21. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. Agentification of Markov model-based segmentation: Application to magnetic resonance brain scans. Artificial Intelligence In Medicine, 46(1):81–95, 2009. Page 14 on 22 National peer-reviewed journal [ACLN] 1. Nadal J-P., Gordon M. B. Physique statistique de phénomènes collectifs en sciences économiques et sociales. Mathématiques et Sciences Humaines (Mathematical Social Sciences) 172 (2005) 67-89. 2. Julie Bourbeillon, Catherine Garbay, Françoise Giroud. Une perspective analytique pour la recherche d’information. Application : conception et evaluation de Tissue Microarrays. Revue ISI (Ingenierie des Systèmes d’Information), 1(11):109–135, 2006. 3. Leila Kefi, Catherine Berrut, Eric Gaussier. Un modèle de recherche d’informations fondé sur des critères d’obligation et de certitude. Information Interaction Intelligence, 7(1):9–30, 2007. Note: Cépaduès - Edition. 4. Diallo, A., Douzal-Chouakria, A., Giroud, F. (2008). Classification adaptative de séries temporelles : application à l identification des gènes exprimés au cours du cycle cellulaire. Revue des Nouvelles Technologies de l Information (RNTI-E-11, ECG), 487-498, Cépaduès. 5. Stéphane Clinchant, Eric Gaussier. Modélisation probabiliste de collections textuelles et distributions de mots. Revue des Nouvelles Technologies de l’Information - Apprentissage Automatique et Fouille de Données. Eds Y. Bennani et E. Viennet, 2009. Invited conferences, seminars and tutorials [INV] 1. E. Gaussier. Groupe de travail du GdR-PRC I3, Indexation et Recherche d’Informations, 8 juillet 2005. Apprentissage statistique pour l’indexation et la recherche d’information multilingue. 2. E. Gaussier. Conférence sur la Recherche d’Information et ses Applications, CORIA 2005. Apprentissage et Recherche d’Information. 3. E. Gaussier. Seminar of the Computer Science Department, Cornell University, USA, 2005; Machine Learning for Textual Inforamtion Access 4. Amblard C., Girard S. (2006) A semiparametric family of bivariate copulas: dependence properties and estimation procedures, 69th Annual Metting of Institute of Mathematical Statistics, Rio de Janeiro, Brésil. 5. E. Gaussier. Ecole d’Automne en Recherche d’Information et ses Applications (EARIA), oct. 2006 ; Quelle langue pour la RI ? (cours partagé avec J. Savoy, de l’Université de Neuchâtel) 6. Bisson G. Distances entre données et application aux cartes de Kohonen. Rencontres RIA 2006 (Rencontre InterAssociations). 20-21 mars 2006, Lyon. 7. Gordon M. B. Interaction entre capacité d’apprentissage et évolution. Journées 2006 Génétique et Cognition du Rescif (Réseau de Sciences Cognitives Ile de France), Paris, 12-13 mai 2006 8. E. Gaussier. Recherche d’Information : fondements et modèles. Congrès INFORSID, Perros Guirec, France, 2007. 9. E. Gaussier. Seminar of the Computer Science Department, Glasgow University, UK, 2007; Bustiness and IR models 10. Gordon M. B. Emergent collective states in social systems with heterogeneous interacting agents. MASHS2007, Mai 10-11 2007, Brest, France. 11. Gordon M. B. Emergent collective states in social systems with heterogeneous interacting agents. NICO Conference: Dynamics and Complexity in People and Societies, Northwestern University, Illinois, 22-24 October 2007 12. Bisson G. Fouille de données éducatives. 5ème école thématique du CNRS sur les EIAH. 7-12 juillet 2007, Saint Quentin sur Isère. 13. Catherine Garbay. Cognition & Communication Homme-Machine : De nouveaux défis pour les sciences de la complexité. Journée Cognition : entre sciences et technologies organisée à l’occasion de la remise à Hélène Lovenbruck de la médaille de bronze du CNRS, Grenoble, France, may 2007. 14. E. Gaussier. Annotation and Ontologies in the Context of Information Retrieval. Information Retrieval Facility Symposium, Vienna, Austria, 2008. Page 15 on 22 15. E. Gaussier. Modélisation probabiliste de collections textuelles. Troisième édition du colloque Apprentissage Artificiel et Fouille de Données, Paris, France, 2008. 16. Bisson G. Clustering of Molecules and structured data. The 32th Conférence GfKL 2008 (German Classication Society), 16-18 juillet, Hambourg. 17. Gordon M. B. Crime and punishment: the economic burden of impunity. Mathematical Models for Criminality. Pisa, April 17-18, 2008. 18. Gordon M. B. Cycles in cooperation and free-riding. The 5th European Conference on Complex Systems, Jerusalem, September 14-19, 2008 19. Catherine Garbay. Exposé de synthèse. Journée satellite Enaction : une voie possible pour un dialogue entre sciences informatiques et sciences cognitives à RFIA’08, Amiens, France, jan 2008. 20. Catherine Garbay, D. Kayser, J. Lorenceau, S. Ploux. Intelligence Artificielle et Sciences Cognitives (exposé de synthèse). Journées d’Intelligence Artificielle Fondamentale 2008 (IAF’08), Paris, France, nov 2008. 21. E. Gaussier. Séminaire INRIA/LEAR, jan. 2009, INRIA Rhône-Alpes; Probabilistic Models of Text Collections for Information Access. 22. E. Gaussier. Seminar of the Computer Science Lab of NCRS Demokritos, Athens, 2009; Probabilistic Models for Textual Information Retrieval 23. Gordon M. B. Cycles in social systems without reciprocity. ANR-NSF Workshop on Dynamics in the Human Sciences: Cognitive, Behavioral & Social Complexity. 27 et 28 Avril 2009 REIMS (France) 24. Gordon M. B. Models of social systems : multiple equilibria and cycles. International Symposium on Complex Systems Science. Paris, September 17-18, 2009 25. Catherine Garbay. Computer Vision: A Plea for a Constructivist View. Proc. of the 12th Conference on Artificial Intelligence in Medicine Europe (AIME’09), Lecture Notes in Artificial Intelligence, (C. Combi, Y. Shahar & A. AbuHanna eds), 2009. International peer-reviewed conferences with proceedings [ACTI] 1. Thomas Guyet, Catherine Garbay, Michel Dojat. Silvia Miksch. Human-Computer interaction to learn scenarios from ICU multivariate time series. Proc of the 10th Conference on Artificial Intelligence in Medicine Europe (AIME’05), Lecture Notes in Artificial Intelligence, (S. Miksch, J. Hunter & E. Keravnou, eds), 2005. 2. Julie Bourbeillon, Catherine Garbay, Jolle Simony-Lafontaine, Françoise Giroud. Multimedia Data Management to Assist Tissue Microarrays Design. Proc of the 10th Conference on Artificial Intelligence In Medicine, AIME, Lecture Notes in Artificial Intelligence, (S. Miksch, J. Hunter & E. Keravnou, eds), 2005. 3. E. Gaussier, C. Goutte. Learning with partially labelled data - with confidence. International Workshop on Learning with Partially Classified Data, International Conference on Machine Learning, 2005. 4. M. Simard, N. Cancedda, B. Cavestro, M. Dymetman, E. Gaussier, C. Goutte, K. Yamada, P. Langlais, A. Mauser. Translating with non-contiguous phrases. Joint HLT/EMNLP conference, 2005. 5. C. Goutte, E. Gaussier. A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. Proceedings of the 27th European Conference on Inforamtion Retrieval 2005 (ECIR-05). 6. Gordon M. B., Phan D., Waldeck R., Nadal J-P. Cooperation and free-riding with moral cost. In Proceedings of International Conference on Cognitive Economics, Sofia-Bulgaria, August 5-8 2005. In Advances in Cognitive Economics, NBU Series in Cognitive Sciences, 2005 B. Kokinov ed. 7. Pradeep, K., P.N. Nagabhushan, Douzal-Chouakria, A. (2006). WaveSim and Adaptive Transform for subsequence Time series Clustering. IEEE 9th International Conference on Information Technology (ICIT’06), 197-202. 8. J.-M. Renders, E. Gaussier, C. Goutte, F. Pacull, G. Csurka. Categorization in multiple category systems. International Conference on Machine Learning, International Conference on Machine Learning, 2006. Page 16 on 22 9. M. Nyffenegger, J.-C. Chappelier, E. Gaussier. Revisiting Fisher Kernels for Document Similarities. European Conference on Machine Learning, European Conference on Machine Learning, 2006. 10. S. Clinchant, C. Goutte, E. Gaussier. Lexical Entailment for Information Retrieval. European Conference on Information Retrieval, European Conference on Information Retrieval, 2006. 11. Aci S., Bisson G., GordonM. B., Roy S., Wieczorek S. Partial Subsumption Test and Phase Transition. Proceedings of ILP conference. Santiago 24-27 august 2006, Spain, pp 40-44. 12. Wieczorek S., Bisson G, Gordon M. B. Guiding the Search in the NO Region of the Phase Transition Problem with a Partial Subsumption Test. European Conference on Machine Learning (ECML 2006) Berlin, September 18-22 2006, in Machine Learning: ECML 2006. Lecture Notes in Computer Science (2006) 817-824 13. Thomas Guyet, Catherine Garbay, Michel Dojat. Benoit Bardy, Denis Mottet. Computer/Human Structural Coupling for Data Interpretation. Proc ENACTIVE 2006: Third International Conference on Enactive Interfaces, Enaction and Complexity, Montpellier, France, nov 2006. 14. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. Distributed and Cooperative Markovian Segmentation of tissues and structures in MRI brain scans. Human Brain Mapping Meeting, HBM 2006, June, 2006, Florence, Italie, 2006. 15. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. Une Approche SMA pour la Segmentation Markovienne des Tissus et Structures Présents dans les IRM Cérébrales. 2èmes journées algéro-françaises en imagerie médicale, JETIM 2006, November, 2006, Alger, Algérie, 2006. 16. Douzal-Chouakria, A., A. Diallo, F. Giroud (2007). Adaptive clustering for time series: application for identifying cell cycle expressed genes. International Association for Statistical Computing, Statistics for Data Mining, Learning and Knowledge Extraction (IASC 07), Aveiro, Portugal. 17. M. Barbaiani, N. Cancedda, C. Dance, S. Fazekas, T. Gaal, Eric Gaussier. Asymmetric Term Alignment with Selective Contiguity Constraints by Multi-Tape Automata. Finite State Methods in Natural Language Processing, Postdam, Germany, 2007. 18. Marchal, H., Bianco, M., Dessus, P., Lemaire, B. The Development of Lexical Knowledge: Toward a Model of the Acquisition of Lexical Gender in French. In S. Vosniadou, D. Kayser (Eds), Proceedings of the Second European Conference on Cognitive Science. 2007. 19. Robinet V., Bisson G., Gordon M., Lemaire B. Searching for student intermediate mental steps. In Proceedings of the Workshop on Data Mining for User Modeling. 11th International Conference on User Modeling, Corfu, Greece, June 25, 2007. 20. Semeshenko V., Garapin A., Ruffieux B., Gordon M. B. Coordination and self-organization in social systems: experiments and learning models, in Multi-Agents for modeling Complex Systems (MA4CS), Dresden, Germany, October 4th, 2007. 21. Semeshenko V., Garapin A., Ruffieux B., Gordon M. B. Self-organization of social systems: empirical results and learning models. Conférence ARCo’07, Nancy, 28-30 nov 2007, in Acta-Cognitica (2007) 103-114 22. Thomas Guyet, Catherine Garbay, Michel Dojat. A Human-Machine Cooperative Approach for Time Series Data Interpretation. Proceedings of the 11th Conference on Artificial Intelligence In Medicine, AIME’07, Lecture Notes in Articial Intelligence, (Riccardo Bellazzi, Ameen Abu-Hanna Jim Hunter eds), Amsterdam, Neerderland, jul 2007. 23. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. LOCUS: LOcal Cooperative Unified Segmentation of MRI Brain Scans. 11th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), Berlin, Germany, 2007. 24. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. MRF Agent Based Segmentation: Application to MRI Brain Scans. Proc. of the 11th Conference on Artificial Intelligence in Medicine Europe (AIME’07), Lecture Notes in Artificial Intelligence, (Riccardo Bellazzi, Ameen Abu-Hanna & Jim Hunter eds), 2007. 25. Y. Kabir, Michel Dojat, Benoit Scherrer, Catherine Garbay, Florence Forbes. Multimodal MRI segmentation of ischemic stroke lesions. Proc of the 29th International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS 2007), 2007. Page 17 on 22 26. Stéphane Clinchant, Eric Gaussier. The Beta-Negative Binomial for Text Modeling. 30th European Conference on Information Retrieval (ECIR 2008), Glasgow, 2008. 27. Ali Mustafa Qamar, Eric Gaussier, Jean-Pierre Chevallet, Joo-Hwee Lim. Similarity Learning for Nearest Neighbor Classification. 8th IEEE International Conference on Data Mining (ICDM) 2008, Pisa, Italy, dec 2008. 28. Loic Maisonnasse, Eric Gaussier, Jean-Pierre Chevallet. Multiplying Concept Sources for Graph Modeling. Advances in Multilingual and Multimodal Information Retrieval, 2008. Note: C. Peters, V. Jijkoun, T. Mandl, H. Muller, D.W. Oard, A. Peas, V. Petras, D. Santos, (Eds.). Selected Proceedings of CLEF 2007. 29. Loic Maisonnasse, Philippe Mulhem, Eric Gaussier, Jean-Pierre Chevallet. LIG at ImageCLEF 2008, Evaluating Systems for Multilingual and Multimodal Information Access. Proc. of the 9th Workshop of the Cross-Language Evaluation Forum, LNCS, 2008. 30. Bisson G. Hussain F. χ-Sim: A new Similarity Measure for the Co-clustering Task. Proceedings of the 7th International conference on Machine Learning and Application (ICMLA). 11-13 décembre, San Diego, United States. 2008. 31. Bisson G., Hussain F. A Co-Similarity Measure. Proceedings of the first joint conference of SFC and CLADAG. 11-13 june, Caserta, Italy. pp 197-200. 2008. 32. Loic Maisonnasse, Eric Gaussier, Jean-Pierre Chevallet. Model Fusion in Conceptual Language Modeling. 31st European Conference on Information Retrieval (ECIR 09), Toulouse (France), 2009. 33. Trong-Ton Pham, Loic Maisonnasse, Philippe Mulhem, Eric Gaussier. Visual Language Model for Scene Recognition. Singaporean-French IPAL Symposium, SinFra 2009, 2009. 34. Stéphane Clinchant, Eric Gaussier. Bridging Language Modeling and Divergence from Randomness Approaches: a Log-Logistic Model for IR. 2nd International Conference on the Theory of Information Retrieval, Cambridge, UK, 2009. 35. Stéphane Clinchant, Eric Gaussier. A Log-Logistic Model for Information Retrieval. 18th ACM Conference on Information and Knowledge Management, Hong-Kong, China, 2009. 36. Ali Mustafa Qamar, Eric Gaussier. Online and Batch Learning of Generalized Cosine Similarities. IEEE International Conference on Data Mining, Florida, USA, 2009. 37. A Diallo, A Douzal-Chouakria, Franoise Giroud (2009). Which Distance for the Identification and the Differentiation of cell-cycle Expressed Genes?. 8th International Symposium on Intelligent Data Analysis. 273-284.Springer-Verlag (2009). National peer-reviewed conferences with proceedings [ACTN] 1. Thomas Guyet, Catherine Garbay, Michel Dojat. Apprentissage de scénarios à partir de séries temporelles multivariées. Journées francophones d’Extraction et de Gestion des Connaissances (EGC’2005), Paris, France, jan 2005. 2. Florence Duchêne, Catherine Garbay, Vincent Rialle. Apprentissage non supervisé de motifs temporels, multidimensionnels et hétérogènes. Application à la télésurveillance médicale. Conférence francophone sur l’apprentissage automatique (Cap05), Nice, France, 2005. 3. Julie Bourbeillon, Catherine Garbay, Françoise Giroud. Génération de documents multimedia adaptatifs dans une perspective analytique. INFORSID, Grenoble, may 2005. 4. Thomas Guyet, Catherine Garbay, Michel Dojat. Partenariat utilisateur-SMA pour l’apprentissage à partir de séries temporelles. Journées Françaises de Systèmes Multi-Agents (JFSMA), Calais, France, nov 2005. 5. M. Simard, N. Cancedda, B.Cavestro, M. Dymetman, E. Gaussier, C. Goutte, P. Langlais, A. Mauser, K. Yamada. Traduction automatique statistique avec des segments discontinus. Conférence sur le Traitement Automatique des Langues Naturelles, Dourdan, 2005. 6. Leila Kefi, Catherine Berrut, Eric Gaussier. Indexation complexe de documents : vers une vérification qualitative. Congrès INFORSID, Grenoble, may 2005. Page 18 on 22 7. Michelland S., Godart F., Amblard C., de Fraipont F., Moro-Sibilot D., Garin J., Seve M., Favrot MC, (2006) Mises en évidence de marqueurs du cancer du poumon non à petites cellules dans le plasma par la technique de SELDITOF (Surface Enhanced Laser Desorption Ionization - Time Of Flight), premières journées scientifique du CLARA, Vulcania, Clermont-Ferrand, France et acte dans Bulletin du Cancer 2006, 93-99. 8. Leila Kefi, Catherine Berrut, Eric Gaussier. Un modèle de RI basé sur des critères d’obligation et de certitude. Conférence en Recherche d’Information et Applications CORIA’06, Lyon France, mar 2006. 9. Robinet V., Bisson G., Lemaire B., Gordon M.B. Modélisation de profils d’élèves à partir de traces de résolution de problèmes. Actes de la 8eme conférence francophone sur l’apprentissage automatique-CAp 2006 (Trégastel, 22-24 mai 2006) 401-402. 10. Wieczorek S., Bisson G., Roy S., Gordon M.B. Etude du comportement d’un indice de subsomption sur des données relationnelles. Actes de la 8eme conférence francophone sur l’apprentissage automatique-CAp 2006 (Trégastel, 22-24 mai 2006) 399-400. 11. Benoit Scherrer, Michel Dojat, Florence Forbes, Catherine Garbay. Segmentation Markovienne distribuée et coopérative des tissus et des structures présents dans des IRM cérébrales. 15ème Congres Reconnaissance des Formes et Intelligence Artificielle, RFIA 2006, Tours, France, 2006. 12. Bisson G., Wieczorek S,. Aci S., Roy S.. classification automatique de molécules. Actes de la 9eme conférence francophone sur l’apprentissage automatique-CAp 2007 (Grenoble, 4-6 juillet 2007). 13. Loic Maisonnasse, Eric Gaussier, Jean-Pierre Chevallet. Modélisation de relations dans l’approche modèle de langue en recherche d’information. CORIA 2008, Perros Guirec, France, 2008. 14. Bisson G., Wieczorek S., Aci S., Roy S. Impact du choix de la distance sur la classification d’un ensemble de molécules. Actes de l atelier Mesures de Similarité Sémantique de la conférence EGC. 29 janvier 2008, Sophia-Antipolis, pp 109119. 15. Amblard C, S. Michelland, F. de Fraipont, D. Moro-Sibilot, F. Godard, MC Favrot, M. Seve (2009) Recherche d’une signature protéomique du cancer du poumon, Journées de Statistiques, Bordeaux. 16. Ali Mustafa Qamar, Eric Gaussier. Apprentissage de différentes classes de similarité dans les k-PPV. XVIièmes Rencontres de la Société Française de Classification, Grenoble, France, 2009. 17. Trong-Ton Pham, Loic Maisonnasse, Philippe Mulhem, Eric Gaussier. Modèle de langue visuel pour la reconnaissance de scènes. Actes de la conférence CORIA, Giens, France, 2009. Oral communications, without proceedings, in international or national events (e.g. tutorials, courses in summer schools,. . . ) [COM] 1. Chouakria-Douzal, A. (2005). On the distance measure between time series. International Association for Statistical Computing (IASC 05), Chypres. 2. Wieczorek S, Barette C, Lafanechère L, Gordon M-B, Maréchal E, Horvath D, Bisson G, Roy S. Projet Accamba : comment décrire de façon pertinente des molécules pour expliquer et prédire leur bio-activité par apprentissage artificiel? . Poster au congrès de la SFBBM (Société Française de Biochimie et Biologie Moléculaire). Nantes les 24-26 octobre 2005. 3. Phan D., Waldeck R., Gordon M.B., Nadal J-P. Adoption and cooperation in communities: mixed equilibrium in polymorphic populations. Oral presentation at Wehia05 (University of Essex, United Kingdom, June 13-15, 2005). 4. Douzal-Chouakria, A. (2006). Réduction de la dimension de séries temporelles multidimensionnelles par extraction des tendances locales. Extraction et Gestion des Connaissances (EGC’06), Atelier Fouille de données temporelles, Lille. 5. Douzal-Chouakria, A., Diallo, A., Giroud, F. (2006). Adaptive dissimilarity index for genes expression profiles classification. Integrative Post Genomics Conference. Lyon. 6. Aci S., Wieczorek S , Barette C, Lafanechère L, Gordon MB, Maréchal E, Horvath D, Bisson G, Roy S. 2006. The ACCAMBA Project: finding the proper way to describe molecules in order to explain and predict their bioactivity using machine learning methods. Workshop Chemoinformatics in Europe: Research and Teaching, Obernai, France, 29 May - 1st June, 2006. Page 19 on 22 7. Aci S, Charavay S, Wieczorek S, Lafanechère L, Maréchal E, Horvath D, Bisson G, Roy S. 2006. Using ChemAxon tools in Accamba, a project for modeling screening results based on machine learning methods. ChemAxon 2006 User Group Meeting Budapest, Hungary 7-8 June 2006. 8. Garapin A., Ruffieux B., Gordon M.B., Semeshenko V. Designing efficient coordination in games with inefficient equilibria. Présentation orale, 8ièmes Journées d’Économie Expérimentale, 1-2 juin 2006, BETA, Strasbourg. 9. Garapin A., Ruffieux B., Gordon M.B., Semeshenko V. From dying to rising seminars: Designing efficient coordination in the Schelling critical mass game. Présentation orale à IAREP-SABE Meeting on Behavioural Economics and Economic Psychology, July 5-6 2006, Paris. 10. Gordon M.B., Nadal J-P., Phan D., Semeshenko V.. Discrete choices under social influence: General properties. 1st International conference on Economic Sciences with Heterogeneous Interacting Agents (ESHIA)-WEHIA 2006, Bologna, Italy, 15-17 June 2006. 11. Ma Y., Gordon M. B., Nadal J-P. Dynamics of social systems : cooperation and free-riding. Poster presentation. Physics of Socio-Economic Systems - AKSOE (Dresden, 27-30 mars 2006). 12. Semeshenko V., Gordon M. B., Nadal J-P. Learning through social interactions. Oral presentation. Physics of SocioEconomic Systems - AKSOE (Dresden, 27-30 mars 2006). 13. Douzal-Chouakria, A., N. Hammami, C. Garbay (2007). Local Factorial Analysis of Time Series. 56th Session of the International Statistical Institute (ISI 07), Lisboa, Portugal. 14. Rizk, G., Douzal-Chouakria, A., Amblard C.(2007). Temporal Decision Trees. 56th Session of the International Statistical Institute (ISI 07), Lisboa, Portugal. 15. Giroud, F., Diallo, A., Douzal-Chouakria, A. (2007). Identification of cell cycle expressed genes. Workshop Towards Systems Biology, Grenoble. 16. Giroud, F., Diallo, A., Douzal-Chouakria, A. (2007). A new approach for molecular dynamic network analysis. Réaumur Meeting, Grenoble. 17. Giroud, F., Diallo, A., Douzal-Chouakria, A. (2007). Identification of cell cycle expressed genes: a new approach for molecular dynamic network analysis. Workshop Towards Systems Biology 2007, Grenoble, France. 18. N. Cancedda, Marc Dymetman, Eric Gaussier, C. Goutte. An Elastic-Phrase Model for Statistical Machine Translation. Journées de l’ATALA (Assocation pour le Traitement Automatique des Langues), Paris, 2007. 19. Bisson G., Aci S., Sylvaine Roy, Wieczorek S. (2007). Classification automatique de molécules. Actes de la Conférence d Apprentissage Cap 2007. Grenoble 4-6 juillet 2007. 20. Ali Mustafa Qamar, Eric Gaussier. Similarity Learning in Nearest Neighbor and Application to Information Retrieval. Symposium on Future Directions in Information Access, Padua, Italy, 2009. Posters in international or national conferences and workshops [AFF] 1. E. Gaussier, C. Goutte. Relation between PLSA and NMF and Implications. 28th Annual International ACM SIGIR Conference, Poster Session, 2005. 2. Loic Maisonnasse, Eric Gaussier, Jean-Pierre Chevallet. Revisiting the dependence language model for information retrieval. SIGIR ’07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. Poster session., New York, NY, USA, 2007. Scientific books and book chapters [OS] 1. Chouakria-Douzal, A., and P.N. Nagabhushan (2006). Improved Fréchet Distance for Time Series. In: V. Batagelj, H.-H. Bock, A. Ferligoj, A. Ziberna (eds.) Data Science and Classification, 13-20, Springer, ISBN: 3-540-34415-2. 2005. 2. Dreyfus, J-M. Martinez, M. Samuelides, M. B. Gordon, F. Badran, S. Thiria, L. Hérault. Neural Networks, Methodology and Applications. Collection : Artificial Intelligence (2005), ISBN 3-540-22980-9, Springer. Page 20 on 22 3. Viktoriya Semeshenko, Mirta B. Gordon, Jean-Pierre Nadal, Denis Phan. Choice under social influence: effects of learning behaviors on the collective dynamics. Book Chapter 8, in Contributions to Economic Analysis, volume 280, Cognitive Economics: New Trends edited by R. Topol and B. Walliser, Elsevier (2006) 4. Douzal-Chouakria, A., Diallo, A., F. Giroud (2007). Adaptive dissimilarity index for Gene Expression Profiles Classification. In: Selected Contributions in Data Analysis and Classification, Series: Studies in Classification, Data Analysis, and Knowledge Organization, Brito, P., Bertrand, P., Cucumel, G., De Carvalho, F. (Eds.). XIII, 483-494, Springer Berlin Heidelberg. 2006. 5. H. Déjean, E. Gaussier. Une nouvelle approche à l’extraction de lexiques bilingues à partir de corpus comparables. Corpus Linguistics: Critical Concepts in Linguistics, edited by W. Teubert and R. Krishnamurthy, Routledge Publishers, June 2007 (compilation in 6 volumes of papers on corpus linguistics). 6. Aci S., Bisson G., Sylvaine Roy, Wieczorek S. Clustering of Molecules: Influence of the Similarity Measures. In Selected Contributions in Data Analysis and Classification. Springer. Brito, P., Bertrand, P., Cucumel, G., De Carvalho, F. (Eds). Page 433-445, 2007. 7. Bisson G. Chapitre 14: Apprentissage artificiel et données de criblage . Chémogénomique. Des petites molécules pour explorer le vivant, Une introduction à l’usage des biologistes, chimistes et informaticiens. Maréchal E., Roy S., Lafanechère L., Cros J. (éditeurs). (2007). EDP SCIENCES. 2007. 8. E. Gaussier, F. Pacull. Distributed Categorizer for Large Category System. Mining Massive Datasets for Security Selected Proceedings of the MMDSS 2007 NATO workshop, IOS Press, 2008. 9. Dreyfus G., Martinez J-M., Samuelides M., Gordon M. B., Badran F., Thiria S., Hérault L. Apprentissage statistique (3ème édition de: Réseaux de neurones - Méthodologie et applications). Collection Algorithmes, Eyrolles (2002, 2ème édition 2004) Livre de 386 pages, ISBN 2-212-11019-7. 3ème édition : Chapitre 6 (auteur: M. B. Gordon) augmenté avec Support Vector Machines. 2008. 10. Nadal J-P., Gordon M. B. Choix sous influence sociale : heurs et malheurs de la coordination. Systèmes Complexes en SHS, Eds. Walliser et Orléan (2008). 11. Viktoriya Semeshenko. Interacting Learning Agents: Models, Simulations, Experiments. PhD Thesis published as a book. VDM Verlag Publisher. 2009. Scientific popularization [OV] None. Book or Proceedings editing [DO] None. Doctoral Dissertations and Habilitations Thesis [TH] 1. Renaudie David : ”Application de techniques d’apprentissage machine à la modélisation d’élèves en interaction avec un environnement informatique d’apprentissage humain : apprentissage de l’algèbre avec Aplusix”. Co-encadrement Mirta Gordon (50%), Gilles Bisson (50%). Thèse INPG, soutenue le 14/01/05. 2. Kefi Leila : ”Un modèle général de recherche dinformation : Application à la recherche de documents techniques par des professionnels”. Co-encadrement C. Berrut (50%), E. Gaussier (50%). Thèse UJF, soutenue le 10/10/06. 3. Semeshenko Viktoriya : ”Modèles de choix discrets de systèmes économiques et sociaux : approches par la physique statistique.” Directeur de thèse : Mirta Gordon. Thèse INPG, soutenue le 25/06/2007. 4. Bourbeillon J. ”Vers une synthèse d’information orientée tâche - Application à la conception et à l’évaluation de Tissue MicroArrays”. Co-encadrement C. Garbay (50%), F. Giroud (50%). Thèse UJF, soutenue en octobre 2007. 5. Guyet T. ”Interprétation collaborative de séries temporelles. Application à des données de réanimation médicale”. Co-encadrement C. Garbay. PhD thesis, INPG. November 2007. Page 21 on 22 6. Scherrer B. ”Segmentation des tissus et structures sur les IRM crbrales : agents markoviens locaux coopratifs et formulation baysienne”. Co-encadrement C. Garbay (50%), M. Dojat (50%). Thèse INPG, soutenue le 12/12/2008. 7. Wieczorek Samuel : ”Analyse de Chimiothèques et Construction Automatique de Modèles de Bio-Activité.” Coencadrement Gilles Bisson (80%), Mirta Gordon (20%). Thèse UJF, spécialité Informatique, soutenue le 7/7/2009. 8. Robinet Vivien : ”Modélisation de profil cognitif d’élèves à partir de traces de résolution de problèmes.” Co-encadrement Benot Lemaire (80%), Mirta Gordon (20%). Thèse EDICSE, INPG, spécialité sciences cognitives, soutenance le 7/12/2009. Summary The following table summarizes the different publications per year. ACL -International peer reviewed journal ACLN National peer-reviewed journal INV - Invited conferences, seminars and tutorials ACTI - International peer-reviewed conference proceedings ACTN - National peer-reviewed conferences with proceedings COM - Oral communications, without proceedings, in international or national events (e.g. tutorials, courses in summer schools,. . . ) AFF- Posters in international or national conferences and workshops OS – Scientific books and book chapters OV –Scientific popularization DO – Book or Proceedings editing TH –Doctoral Dissertations and Habilitations Theses Total 2005 5 1 3 6 6 2006 1 4 9 5 2007 5 1 6 13 1 2008 2 1 7 6 2 2009 9 1 5 6 3 Total 21 5 25 39 17 3 9 7 - 1 20 1 2 1 28 1 1 30 1 4 3 41 3 1 22 1 2 28 2 11 8 149 Page 22 on 22