
Showing 1–5 of 5 results for author: Dura, B

Searching in archive cs.
  1. arXiv:2305.13817  [pdf]

    cs.CL

    Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing

    Authors: Christel Gérardin, Perceval Wajsbürt, Basile Dura, Alice Calliger, Alexandre Mouchet, Xavier Tannier, Romain Bey

    Abstract: Objective: Develop and validate an algorithm for analyzing the layout of PDF clinical documents to improve the performance of downstream natural language processing tasks. Materials and Methods: We designed an algorithm to process clinical PDF documents and extract only clinically relevant text. The algorithm consists of several steps: initial text extraction using a PDF parser, followed by classif…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 5 figures

  2. arXiv:2303.13451  [pdf]

    cs.CL

    Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse

    Authors: Xavier Tannier, Perceval Wajsbürt, Alice Calliger, Basile Dura, Alexandre Mouchet, Martin Hilka, Romain Bey

    Abstract: The objective of this study is to address the critical issue of de-identification of clinical reports in order to allow access to data for research purposes, while ensuring patient privacy. The study highlights the difficulties faced in sharing tools and resources in this domain and presents the experience of the Greater Paris University Hospitals (AP-HP) in implementing a systematic pseudonymizat…

    Submitted 23 March, 2023; originally announced March 2023.

  3. arXiv:2207.12940  [pdf, other]

    cs.CL stat.ML

    Learning structures of the French clinical language:development and validation of word embedding models using 21 million clinical reports from electronic health records

    Authors: Basile Dura, Charline Jean, Xavier Tannier, Alice Calliger, Romain Bey, Antoine Neuraz, Rémi Flicoteaux

    Abstract: Background: Clinical studies using real-world data may benefit from exploiting clinical reports, a particularly rich albeit unstructured medium. To that end, natural language processing can extract relevant information. Methods based on transfer learning using pre-trained language models have achieved state-of-the-art results in most NLP applications; however, publicly available models lack expos…

    Submitted 26 July, 2022; originally announced July 2022.

  4. arXiv:1906.03040  [pdf, other]

    cs.CY cs.LG eess.SP eess.SY stat.ML

    FASTER: Fusion AnalyticS for public Transport Event Response

    Authors: Sebastien Blandin, Laura Wynter, Hasan Poonawala, Sean Laguna, Basile Dura

    Abstract: Increasing urban concentration raises operational challenges that can benefit from integrated monitoring and decision support. Such complex systems need to leverage the full stack of analytical methods, from state estimation using multi-sensor fusion for situational awareness, to prediction and computation of optimal responses. The FASTER platform that we describe in this work, deployed at nation…

    Submitted 14 May, 2019; originally announced June 2019.

  5. arXiv:1905.12131  [pdf, other]

    cs.LG stat.ML

    Adaptive Deep Kernel Learning

    Authors: Prudencio Tossou, Basile Dura, Francois Laviolette, Mario Marchand, Alexandre Lacoste

    Abstract: Deep kernel learning provides an elegant and principled framework for combining the structural properties of deep learning algorithms with the flexibility of kernel methods. By means of a deep neural network, we learn a parametrized kernel operator that can be combined with a differentiable kernel algorithm during inference. While previous work within this framework has focused on learning a singl…

    Submitted 11 December, 2020; v1 submitted 28 May, 2019; originally announced May 2019.