-
RETSim: Resilient and Efficient Text Similarity
Authors:
Marina Zhang,
Owen Vallis,
Aysegul Bumin,
Tanay Vakharia,
Elie Bursztein
Abstract:
This paper introduces RETSim (Resilient and Efficient Text Similarity), a lightweight, multilingual deep learning model trained to produce robust metric embeddings for near-duplicate text retrieval, clustering, and dataset deduplication tasks. We demonstrate that RETSim is significantly more robust and accurate than MinHash and neural text embeddings, achieving new state-of-the-art performance on…
▽ More
This paper introduces RETSim (Resilient and Efficient Text Similarity), a lightweight, multilingual deep learning model trained to produce robust metric embeddings for near-duplicate text retrieval, clustering, and dataset deduplication tasks. We demonstrate that RETSim is significantly more robust and accurate than MinHash and neural text embeddings, achieving new state-of-the-art performance on dataset deduplication, adversarial text retrieval benchmarks, and spam clustering tasks. We also introduce the W4NT3D benchmark (Wiki-40B 4dversarial Near-T3xt Dataset) for evaluating multilingual, near-duplicate text retrieval capabilities under adversarial settings. RETSim and the W4NT3D benchmark are open-sourced under the MIT License at https://github.com/google/unisim.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Transformers in Healthcare: A Survey
Authors:
Subhash Nerella,
Sabyasachi Bandyopadhyay,
Jiaqing Zhang,
Miguel Contreras,
Scott Siegel,
Aysegul Bumin,
Brandon Silva,
Jessica Sena,
Benjamin Shickel,
Azra Bihorac,
Kia Khezeli,
Parisa Rashidi
Abstract:
With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, inclu…
▽ More
With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, including healthcare. In this survey paper, we provide an overview of how this architecture has been adopted to analyze various forms of data, including medical imaging, structured and unstructured Electronic Health Records (EHR), social media, physiological signals, and biomolecular sequences. Those models could help in clinical diagnosis, report generation, data reconstruction, and drug/protein synthesis. We identified relevant studies using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We also discuss the benefits and limitations of using transformers in healthcare and examine issues such as computational cost, model interpretability, fairness, alignment with human values, ethical implications, and environmental impact.
△ Less
Submitted 30 June, 2023;
originally announced July 2023.
-
Proceedings of eNTERFACE 2015 Workshop on Intelligent Interfaces
Authors:
Matei Mancas,
Christian Frisson,
Joëlle Tilmanne,
Nicolas d'Alessandro,
Petr Barborka,
Furkan Bayansar,
Francisco Bernard,
Rebecca Fiebrink,
Alexis Heloir,
Edgar Hemery,
Sohaib Laraba,
Alexis Moinet,
Fabrizio Nunnari,
Thierry Ravet,
Loïc Reboursière,
Alvaro Sarasua,
Mickaël Tits,
Noé Tits,
François Zajéga,
Paolo Alborno,
Ksenia Kolykhalova,
Emma Frid,
Damiano Malafronte,
Lisanne Huis in't Veld,
Hüseyin Cakmak
, et al. (49 additional authors not shown)
Abstract:
The 11th Summer Workshop on Multimodal Interfaces eNTERFACE 2015 was hosted by the Numediart Institute of Creative Technologies of the University of Mons from August 10th to September 2015. During the four weeks, students and researchers from all over the world came together in the Numediart Institute of the University of Mons to work on eight selected projects structured around intelligent interf…
▽ More
The 11th Summer Workshop on Multimodal Interfaces eNTERFACE 2015 was hosted by the Numediart Institute of Creative Technologies of the University of Mons from August 10th to September 2015. During the four weeks, students and researchers from all over the world came together in the Numediart Institute of the University of Mons to work on eight selected projects structured around intelligent interfaces. Eight projects were selected and their reports are shown here.
△ Less
Submitted 19 January, 2018;
originally announced January 2018.