Skip to main content

Showing 1–9 of 9 results for author: Zerveas, G

.
  1. arXiv:2305.15720  [pdf, other

    cs.IR

    Enhancing the Ranking Context of Dense Retrieval Methods through Reciprocal Nearest Neighbors

    Authors: George Zerveas, Navid Rekabsaz, Carsten Eickhoff

    Abstract: Sparse annotation poses persistent challenges to training dense retrieval models; for example, it distorts the training signal when unlabeled relevant documents are used spuriously as negatives in contrastive learning. To alleviate this problem, we introduce evidence-based label smoothing, a novel, computationally efficient method that prevents penalizing the model for assigning high relevance to… ▽ More

    Submitted 22 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  2. arXiv:2302.06321  [pdf, other

    cs.CL cs.AI

    Parameter-efficient Modularised Bias Mitigation via AdapterFusion

    Authors: Deepak Kumar, Oleg Lesota, George Zerveas, Daniel Cohen, Carsten Eickhoff, Markus Schedl, Navid Rekabsaz

    Abstract: Large pre-trained language models contain societal biases and carry along these biases to downstream tasks. Current in-processing bias mitigation approaches (like adversarial training) impose debiasing by updating a model's parameters, effectively transferring the model to a new, irreversible debiased state. In this work, we propose a novel approach to develop stand-alone debiasing functionalities… ▽ More

    Submitted 18 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Post EACL 2023 version

  3. Unsupervised Multivariate Time-Series Transformers for Seizure Identification on EEG

    Authors: İlkay Yıldız Potter, George Zerveas, Carsten Eickhoff, Dominique Duncan

    Abstract: Epilepsy is one of the most common neurological disorders, typically observed via seizure episodes. Epileptic seizures are commonly monitored through electroencephalogram (EEG) recordings due to their routine and low expense collection. The stochastic nature of EEG makes seizure identification via manual inspections performed by highly-trained experts a tedious endeavor, motivating the use of auto… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: IEEE Conference on Machine Learning and Applications (ICMLA) 2022

  4. arXiv:2201.11990  [pdf, other

    cs.CL

    Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model

    Authors: Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro

    Abstract: Pretrained general-purpose language models can achieve state-of-the-art accuracies in various natural language processing domains by adapting to downstream tasks via zero-shot, few-shot and fine-tuning techniques. Because of their success, the size of these models has increased rapidly, requiring high-performance hardware, software, and algorithmic techniques to enable training such large models.… ▽ More

    Submitted 4 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Shaden Smith and Mostofa Patwary contributed equally

  5. CODER: An efficient framework for improving retrieval through COntextual Document Embedding Reranking

    Authors: George Zerveas, Navid Rekabsaz, Daniel Cohen, Carsten Eickhoff

    Abstract: Contrastive learning has been the dominant approach to training dense retrieval models. In this work, we investigate the impact of ranking context - an often overlooked aspect of learning dense retrieval models. In particular, we examine the effect of its constituent parts: jointly scoring a large number of negatives per query, using retrieved (query-specific) instead of random negatives, and a fu… ▽ More

    Submitted 3 November, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  6. arXiv:2010.05757  [pdf

    cs.AI cs.CL

    Extracting Angina Symptoms from Clinical Notes Using Pre-Trained Transformer Architectures

    Authors: Aaron S. Eisman, Nishant R. Shah, Carsten Eickhoff, George Zerveas, Elizabeth S. Chen, Wen-Chih Wu, Indra Neil Sarkar

    Abstract: Anginal symptoms can connote increased cardiac risk and a need for change in cardiovascular management. This study evaluated the potential to extract these symptoms from physician notes using the Bidirectional Encoder from Transformers language model fine-tuned on a domain-specific corpus. The history of present illness section of 459 expert annotated primary care physician notes from consecutive… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Journal ref: AMIA Annual Symposium 2020

  7. arXiv:2010.02803  [pdf, other

    cs.LG cs.AI

    A Transformer-based Framework for Multivariate Time Series Representation Learning

    Authors: George Zerveas, Srideepika Jayaraman, Dhaval Patel, Anuradha Bhamidipaty, Carsten Eickhoff

    Abstract: In this work we propose for the first time a transformer-based framework for unsupervised representation learning of multivariate time series. Pre-trained models can be potentially used for downstream tasks such as regression and classification, forecasting and missing value imputation. By evaluating our models on several benchmark datasets for multivariate time series regression and classificatio… ▽ More

    Submitted 8 December, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  8. arXiv:2009.04016  [pdf, ps, other

    cs.IR cs.CL cs.LG

    Brown University at TREC Deep Learning 2019

    Authors: George Zerveas, Ruochen Zhang, Leila Kim, Carsten Eickhoff

    Abstract: This paper describes Brown University's submission to the TREC 2019 Deep Learning track. We followed a 2-phase method for producing a ranking of passages for a given input query: In the the first phase, the user's query is expanded by appending 3 queries generated by a transformer model which was trained to rephrase an input query into semantically similar queries. The expanded query can exhibit g… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the Twenty-Eighth Text REtrieval Conference, TREC 2019, Gaithersburg, Maryland, USA, November 13-15, 2019. NIST Special Publication 1250, National Institute of Standards and Technology (NIST) 2019

  9. arXiv:1812.00490  [pdf, other

    cs.LG stat.ML

    Improving Clinical Predictions through Unsupervised Time Series Representation Learning

    Authors: Xinrui Lyu, Matthias Hueser, Stephanie L. Hyland, George Zerveas, Gunnar Raetsch

    Abstract: In this work, we investigate unsupervised representation learning on medical time series, which bears the promise of leveraging copious amounts of existing unlabeled data in order to eventually assist clinical decision making. By evaluating on the prediction of clinically relevant outcomes, we show that in a practical setting, unsupervised representation learning can offer clear performance benefi… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/171