Recent Changes - Search:

edit SideBar

Call 2014

Workshop on Interactions between Data Mining and Natural Language Processing


ECML/PKDD 2014 Workshop

September 15th 2014, Nancy, France

Program available here || Proceedings of the workshop


On the one hand, in the field of Natural Language Processing (NLP), numerical Machine Learning methods (e.g., SVM, CRF) have been intensively explored and applied. Despite the good results obtained by the numerical methods, one major drawback is that they do not provide a human readable model. A promising direction is the integration of symbolic knowledge. On the other hand, research in Data Mining has progressed significantly in the last decades, through the development of advanced algorithms and techniques to extract knowledge from data in different forms. In particular, for two decades Pattern Mining has been one of the most active field in Knowledge Discovery.

Recently, a new field has emerged taking benefit of both domains: Data Mining and NLP. The objective of DMNLP is thus to provide a forum to discuss how Data Mining can be interesting for NLP tasks, providing symbolic knowledge, but also how NLP can enhance data mining approaches by providing richer and/or more complex information to mine and by integrating linguistics knowledge directly in the mining process.

The workshop aims at bringing together researchers from both communities in order to stimulate discussions about the cross-fertilization of those two research fields. The idea of this workshop is to discuss future directions and new challenges emerging from the cross-fertilization of Data Mining and NLP and in the same time initiate collaborations between researchers of both communities.


The workshop promotes works where the two following dimensions are combined in one as symbiosis.

The first dimension is Data Mining, for instance Pattern Mining (itemsets, sequences, trees, graphs, association rules), classification (decision trees, FCA,...), inductive logic programming.

The second dimension is NLP, for example question/answering systems, translation, information extraction, linguistic analysis (lexical analysis, terminology, syntax, semantics, discourse, stylistics), classification, knowledge extraction/ontology building from texts, information retrieval, corpus annotation, social/opinion mining.

A list of non-exhaustive topics that fit the scope of the workshop is thus:

  • Pattern discovery for NLP
  • Constraint-based Pattern Mining in text
  • Data Mining query languages for expressing NLP tasks
  • Data representation (sequences, trees, graphs) for NLP
  • Modelization of text for Data Mining
  • Relationships between Data Mining and NLP
  • Modeling and visualizing Data Mining results on text
  • Integrating NLP characteristics in Data Mining
  • Data mining approaches for linguistic knowledge building
  • Knowledge Discovery for linguistic analysis (e.g. stylistics, socio-linguistics,\ldots)
  • Linguistically-informed text representations for Data Mining


  • Deadline for submissions: Friday, June 20, 2014 extended to Friday, June 27, 2014
  • Author notification: Friday, July 11, 2014
  • Final version: Friday July 25, 2014
  • Workshop date: September 15, 2014


The duration of the workshop will be 1 day. Our main goal is to stimulate discussion, collaboration and the sharing of experiences. In that respect, we would have three submission types:

  • unpublished works (max 16 pages, double submissions allowed)
  • short papers and vision statements (max 8 pages)
  • recently published works (special oral-only track, no page limits)

Authors instructions and style files follow main conference recommendations and can be downloaded at:

The submission has to be done via the DMNLP 2014 EasyChair account.


  • Martin Atzmueller, University of Kassel, Germany
  • Delphine Battistelli, MoDyCo-Université Paris Ouest, France
  • Yves Bestgen, Université catholique de Louvain, Belgium
  • Philipp Cimiano, University of Bielefeld, Germany
  • Bruno Crémilleux, Université de Caen, France
  • Beatrice, Daille, LINA, France
  • Yves Lepage, Waseda University, Japan
  • Francois Jacquenet, Laboratoire Hubert Curien, Saint-Etienne, France
  • Jiri Klema, Czech Technical University, Prague, Czech Republic
  • Amedeo Napoli, LORIA Nancy, France
  • Adeline Nazarenko, Université de Paris 13, LIPN, France
  • Claire Nédellec, Institut National de Recherche Agronomique, France
  • Pascal Poncelet, LIRMM Montpellier, France
  • Maria Teresa Pazienza, University of Roma "Tor Vergata", Italy
  • Stephen Poteet, Boeing, USA
  • Solen Quiniou, LINA-Université de Nantes, France
  • Mathieu Roche, TETIS, Montpellier, France
  • Arnaud Soulet, Université François Rabelais, Tours, France
  • Steffen Staab, University of Koblenz-Landau, Germany
  • Koichi Takeuchi, Okayama University, Japan
  • Isabelle Tellier, Lattice, Paris, France
  • Johanna Völker, University of Mannheim, Germany
  • Xifeng Yan, University of California, Santa Barbara, USA
  • Pierre Zweigenbaum, LIMSI-CNRS, Paris, France


Contact :

  • Peggy Cellier: INSA Rennes, IRISA (UMR 6074), Rennes, France (
  • Thierry Charnois: Université de Paris 13, LIPN (UMR 7030), France (
  • Andreas Hotho: University of Kassel, Germany (
  • Stan Matwin: Dalhousie University, Canada (
  • Marie-Francine Moens: Katholieke Universiteit, Leuven, Belgium (
  • Yannick Toussaint: INRIA, LORIA (UMR 7503), 54506 Vandoeuvre-les-Nancy, France (
Edit - History - Print - Recent Changes - Search
Page last modified on 2015-04-03 14:42