Skip to main content

Showing 1–6 of 6 results for author: Anibal, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01620  [pdf

    cs.SD cs.AI cs.CY eess.AS

    Voice EHR: Introducing Multimodal Audio Data for Health

    Authors: James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Yen Minh Lam, Hang Nguyen, Phuc Hong, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton , et al. (3 additional authors not shown)

    Abstract: Large AI models trained on audio data may have the potential to rapidly classify patients, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets using expensive recording equipment in high-income, English-speaking countries. This challenges deployment in resource-constrained, high-volume settings where audio d… ▽ More

    Submitted 1 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 19 pages, 2 figures, 7 tables

  2. arXiv:2402.03484  [pdf, other

    cs.IR cs.CL

    Harnessing PubMed User Query Logs for Post Hoc Explanations of Recommended Similar Articles

    Authors: Ashley Shin, Qiao **, James Anibal, Zhiyong Lu

    Abstract: Searching for a related article based on a reference article is an integral part of scientific research. PubMed, like many academic search engines, has a "similar articles" feature that recommends articles relevant to the current article viewed by a user. Explaining recommended items can be of great utility to users, particularly in the literature search process. With more than a million biomedica… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2110.04257  [pdf, other

    cs.CL

    VieSum: How Robust Are Transformer-based Models on Vietnamese Summarization?

    Authors: Hieu Nguyen, Long Phan, James Anibal, Alec Peltekian, Hieu Tran

    Abstract: Text summarization is a challenging task within natural language processing that involves text generation from lengthy input sequences. While this task has been widely studied in English, there is very limited research on summarization for Vietnamese text. In this paper, we investigate the robustness of transformer-based encoder-decoder architectures for Vietnamese abstractive summarization. Lever… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  4. arXiv:2106.09997  [pdf, other

    cs.CL

    SPBERT: An Efficient Pre-training BERT on SPARQL Queries for Question Answering over Knowledge Graphs

    Authors: Hieu Tran, Long Phan, James Anibal, Binh T. Nguyen, Truong-Son Nguyen

    Abstract: In this paper, we propose SPBERT, a transformer-based language model pre-trained on massive SPARQL query logs. By incorporating masked language modeling objectives and the word structural objective, SPBERT can learn general-purpose representations in both natural language and SPARQL query language. We investigate how SPBERT and encoder-decoder architecture can be adapted for Knowledge-based QA cor… ▽ More

    Submitted 30 June, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  5. arXiv:2106.03598  [pdf, other

    cs.CL cs.AI cs.LG

    SciFive: a text-to-text transformer model for biomedical literature

    Authors: Long N. Phan, James T. Anibal, Hieu Tran, Shaurya Chanana, Erol Bahadroglu, Alec Peltekian, Grégoire Altan-Bonnet

    Abstract: In this report, we introduce SciFive, a domain-specific T5 model that has been pre-trained on large biomedical corpora. Our model outperforms the current SOTA methods (i.e. BERT, BioBERT, Base T5) on tasks in named entity relation, relation extraction, natural language inference, and question-answering. We show that text-generation methods have significant potential in a broad array of biomedical… ▽ More

    Submitted 28 May, 2021; originally announced June 2021.

  6. arXiv:2105.08645  [pdf, other

    cs.AI cs.PL

    CoTexT: Multi-task Learning with Code-Text Transformer

    Authors: Long Phan, Hieu Tran, Daniel Le, Hieu Nguyen, James Anibal, Alec Peltekian, Yanfang Ye

    Abstract: We present CoTexT, a pre-trained, transformer-based encoder-decoder model that learns the representative context between natural language (NL) and programming language (PL). Using self-supervision, CoTexT is pre-trained on large programming language corpora to learn a general understanding of language and code. CoTexT supports downstream NL-PL tasks such as code summarizing/documentation, code gen… ▽ More

    Submitted 21 June, 2021; v1 submitted 18 May, 2021; originally announced May 2021.