Methodology for identifying study sites in scientific corpus
Authors:
Eric Kergosien,
Marie-Noëlle Bessagnet,
Maguelonne Teisseire,
Joachim Schöpfel,
Mohammad Amin Farvardin,
Stéphane Chaudiron,
Bernard Jacquemin,
Annig Le Parc-Lacayrelle,
Mathieu Roche,
Christian Sallaberry,
Jean-Philippe Tonneau
Abstract:
The TERRE-ISTEX project aims to identify how research work has evolved in relation to study areas, disciplinary crossings and concrete research methods, based on the heterogeneous digital content available in scientific corpora. The project is divided into three main actions: (1) to identify the periods and places that have been the subject of empirical studies, as reflected in the publications of the analyzed corpus, (2) to identify the themes addressed in these works and (3) to develop a web-based geographical information retrieval (GIR) tool. The first two actions involve approaches combining Natural Language Processing patterns with text mining methods. By crossing the three dimensions (spatial, thematic and temporal) in a GIR engine, it will be possible to understand what research has been carried out on which territories and at what time. In the project, the experiments are carried out on a heterogeneous corpus including electronic theses and scientific articles from the ISTEX digital libraries and the CIRAD research center.
Submitted 13 August, 2018;
originally announced August 2018.
Automatic Identification of Research Fields in Scientific Papers
Authors:
Eric Kergosien,
Amin Farvardin,
Maguelonne Teisseire,
Marie-Noëlle Bessagnet,
Joachim Schöpfel,
Stéphane Chaudiron,
Bernard Jacquemin,
Annig Le Parc-Lacayrelle,
Mathieu Roche,
Christian Sallaberry,
Jean-Philippe Tonneau
Abstract:
The TERRE-ISTEX project aims to identify scientific research dealing with specific geographical areas, based on heterogeneous digital content available in scientific papers. The project is divided into three main work packages: (1) identification of the periods and places of empirical studies, as reflected in the publications drawn from the analyzed text samples, (2) identification of the themes which appear in these documents, and (3) development of a web-based geographical information retrieval (GIR) tool. The first two work packages combine Natural Language Processing patterns with text mining methods. The integration of the spatial, thematic and temporal dimensions in a GIR tool contributes to a better understanding of what kind of research has been carried out, of its topics and of its geographical and historical coverage. Another original feature of the TERRE-ISTEX project is the heterogeneous character of the corpus, which includes PhD theses and scientific articles from the ISTEX digital libraries and the CIRAD research center.
Submitted 8 June, 2018;
originally announced June 2018.