Workshop on Interactions between Data Mining and Natural Language Processing
ECML/PKDD 2016 Workshop
September 23th 2016, Riva del Garda, Italy
DMNLP’16 will be the third edition of the Data Mining and Natural Language Processing (DMNLP) workshop.
DMNLP’14 and DMNLP’15 was held in conjunction with the ECML-PKDD 2014 in Nancy, France and ECML-PKDD 2015 in Porto, Portugal. DMNLP is dedicated to Data Mining (DM) and Natural Language Processing (NLP) cross-fertilization, i.e a workshop where NLP brings new challenges to DM, and where DM gives future prospects to NLP. It is well-known that texts provide a very challenging context
to both NLP and DM with a huge volume of low-structured, complex, domain-dependent and task-dependent data. DMNLP’14 workshop and DMNLP’15
were a success with about 50 participants from 15 different countries. Proceedings are available on the CEUR Workshop Proceeding site (http://ceur-ws.org/Vol-1202/ and http://ceur-ws.org/Vol-1410/).
The workshop will favor the use of symbolic methods. Indeed, statistical and machine learning methods (CRF, SVM, Naive Bayes) holds a predominant position in NLP researches and ”may have been too successful (...) as there is no longer much room for anything else”. They have proved their effectiveness for some tasks but one major drawback is that they do not provide human readable models. By contrast, symbolic machine learning methods are known to provide more human-readable model that could be an end in itself (e.g., for stylistics) or improve, by combination, further methods including numerical ones. Research in Data Mining has progressed significantly in the last decades, through the development of advanced algorithms and techniques to extract knowledge from data in different forms. In particular, for two decades Pattern Mining has been one of the most active field in Knowledge Discovery.
Recently, a new field has emerged taking benefit of both domains: Data Mining and NLP. The objective of DMNLP is thus to provide a forum to discuss how Data Mining can be interesting for NLP tasks, providing symbolic knowledge, but also how NLP can enhance data mining approaches by providing richer and/or more complex information to mine and by integrating linguistic knowledge directly in the mining process. The workshop aims at bringing together researchers from both communities in order to stimulate discussions about the cross-fertilization of those two research fields. The idea of this workshop is to discuss future directions and new challenges emerging from this cross-fertilization of Data Mining and NLP and in the same time to initiate collaborations between researchers of both communities.
CALL FOR PAPERS
The workshop promotes works where the two following dimensions are combined in one as symbiosis. The first dimension is Data Mining, for instance Pattern Mining (itemsets, sequences, trees, graphs, association rules), classification (decision trees, FCA...), inductive logic programming. The second dimension is NLP, for example question/answering systems, translation, information extrac- tion, linguistic analysis (lexical analysis, terminology, syntax, semantics, discourse, stylistics), classification, knowledge extraction/ontology building from texts, information retrieval, corpus annotation,social/opinion mining. A list of non-exhaustive topics that fit the scope of the workshop is thus:
- Pattern discovery for NLP
- Constraint-based Pattern Mining in text
- Data Mining query languages for expressing NLP tasks
- Data representation (sequences, trees, graphs) for NLP
- Text modeling for Data Mining
- Mining complex data for / coming from NLP
- Representation learning in NLP (e.g., deep learning, compositional models...)
- Relationships between Data Mining and NLP
- Modeling and visualizing Data Mining results on text
- Integrating natural language features in Data Mining
- Data mining approaches for linguistic knowledge building
- Knowledge discovery for linguistic analysis (e.g. stylistics, socio-linguistics...)
- Knowledge-based data mining for NLP (e.g., ontologies, linked open data...)
- Workshop paper submission deadline:
Tuesday, June 21, 2016 (!!!! Extended deadline: Monday, June 27, 2016 !!!!)
- Workshop paper acceptance notification: Monday, July 18, 2016
- Workshop paper camera-ready deadline: Sunday, July 24, 2016
- Workshop date: September 23nd, 2016
The duration of the workshop will be 1 day.
Our main goal is to stimulate discussion, collaboration and the sharing of experiences. In that respect, we would have three submission types:
- unpublished works (max 8 pages, double submissions allowed)
- extended abstracts and vision statements (max 4 pages)
- recently published works (special oral-only track, no page limits)
As for previous editions, the papers will be published in DMNLP proceedings at CEUR. Note that multiple submissions are allowed, however before submitting to DMNLP the authors should check there is no conflict with the other publication policies.
Authors instructions and style files follow main conference recommendations and can be downloaded at: http://www.springer.de/comp/lncs/authors.html.
The submission has to be done via the DMNLP 2016 EasyChair account: https://easychair.org/conferences/?conf=dmnlp2016.
Program Commitee is still under construction. The following persons were involved in the DMNLP’14 and DMNLP’15 PC and will be contacted for participation to the next program committee :
- Martin Atzmueller, University of Kassel, Germany
- Delphine Battistelli, Université Paris X, France
- Yves Bestgen, Université catholique de Louvain, Belgium
- Bruno Crémilleux, Université de Caen, France
- Beatrice, Daille, LINA, France
- Luigi Di Caro, University of Torino, Italy
- Pierre Geurts, University of Liège, Belgium
- Jiri Kléma, Czech Technical University, Prague, Czech Republic
- Yves Lepage, Waseda University, Japan
- Amedeo Napoli, LORIA Nancy, France
- Adeline Nazarenko, Université de Paris 13, LIPN, France
- Claire Nédellec, Institut National de Recherche Agronomique, France
- Maria Teresa Pazienza, University of Roma "Tor Vergata", Italy
- Pascal Poncelet, LIRMM Montpellier, France
- Solen Quiniou, LINA-Université de Nantes, France
- Mathieu Roche, TETIS, Montpellier, France
- Christin Seifert, Universitat Passau, Germany
- Arnaud Soulet, Université François Rabelais, Tours, France
- Koichi Takeuchi, Okayama University, Japan
WORKSHOP ORGANISING COMMITTEE
Contact : email@example.com
- Peggy Cellier: INSA Rennes, IRISA (UMR 6074), Rennes, France (Peggy.Cellier@irisa.fr)
- Thierry Charnois: Université de Paris 13, LIPN (UMR 7030), France (firstname.lastname@example.org)
- Andreas Hotho: University of Kassel, Germany (email@example.com)
- Stan Matwin: Dalhousie University, Canada (firstname.lastname@example.org)
- Marie-Francine Moens: Katholieke Universiteit, Leuven, Belgium (email@example.com)
- Yannick Toussaint: INRIA, LORIA (UMR 7503), 54506 Vandoeuvre-les-Nancy, France (Yannick.Toussaint@loria.fr)