Skip to main content

Showing 1–12 of 12 results for author: Stanojevic, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01981  [pdf, ps, other

    cs.LG cs.SD eess.AS

    Zero-Shot Multi-Lingual Speaker Verification in Clinical Trials

    Authors: Ali Akram, Marija Stanojevic, Malikeh Ehghaghi, Jekaterina Novikova

    Abstract: Due to the substantial number of clinicians, patients, and data collection environments involved in clinical trials, gathering data of superior quality poses a significant challenge. In clinical trials, patients are assessed based on their speech data to detect and monitor cognitive and mental health disorders. We propose using these speech recordings to verify the identities of enrolled patients… ▽ More

    Submitted 5 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2401.18046  [pdf, other

    cs.CL

    Multipath parsing in the brain

    Authors: Berta Franzluebbers, Donald Dunagan, Miloš Stanojević, Jan Buys, John T. Hale

    Abstract: Humans understand sentences word-by-word, in the order that they hear them. This incrementality entails resolving temporary ambiguities about syntactic relationships. We investigate how humans process these syntactic ambiguities by correlating predictions from incremental generative dependency parsers with timecourse data from people undergoing functional neuroimaging while listening to an audiobo… ▽ More

    Submitted 6 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL2024, main conference. 15 pages

  3. arXiv:2308.03291  [pdf, other

    cs.LG cs.AI cs.CL

    SynJax: Structured Probability Distributions for JAX

    Authors: Miloš Stanojević, Laurent Sartran

    Abstract: The development of deep learning software libraries enabled significant progress in the field by allowing users to focus on modeling, while letting the library to take care of the tedious and time-consuming task of optimizing execution for modern hardware accelerators. However, this has benefited only particular types of deep learning models, such as Transformers, whose primitives map easily to th… ▽ More

    Submitted 15 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  4. arXiv:2306.12444  [pdf, other

    eess.AS cs.AI cs.LG cs.SD

    Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials

    Authors: Malikeh Ehghaghi, Marija Stanojevic, Ali Akram, Jekaterina Novikova

    Abstract: Detecting duplicate patient participation in clinical trials is a major challenge because repeated patients can undermine the credibility and accuracy of the trial's findings and result in significant health and financial risks. Develo** accurate automated speaker verification (ASV) models is crucial to verify the identity of enrolled individuals and remove duplicates, but the size and quality o… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to the 5th Clinical Natural Language Processing Workshop (ClinicalNLP) at ACL 2023

  5. arXiv:2212.14490  [pdf, other

    cs.SD cs.CL cs.MM eess.AS

    Multi-modal deep learning system for depression and anxiety detection

    Authors: Brian Diep, Marija Stanojevic, Jekaterina Novikova

    Abstract: Traditional screening practices for anxiety and depression pose an impediment to monitoring and treating these conditions effectively. However, recent advances in NLP and speech modelling allow textual, acoustic, and hand-crafted language-based features to jointly form the basis of future mental health screening and condition detection. Speech is a rich and readily available source of insight into… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: accepted to the PAI4MH workshop at NeurIPS 2022

  6. arXiv:2210.16147  [pdf, other

    cs.CL

    Modeling structure-building in the brain with CCG parsing and large language models

    Authors: Miloš Stanojević, Jonathan R. Brennan, Donald Dunagan, Mark Steedman, John T. Hale

    Abstract: To model behavioral and neural correlates of language comprehension in naturalistic environments researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFG), yet such formalisms are not sufficiently expressive for human languages. Combinator… ▽ More

    Submitted 16 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

  7. arXiv:2205.12621  [pdf, other

    cs.CL cs.DS

    Unbiased and Efficient Sampling of Dependency Trees

    Authors: Miloš Stanojević

    Abstract: Most computational models of dependency syntax consist of distributions over spanning trees. However, the majority of dependency treebanks require that every valid dependency tree has a single edge coming out of the ROOT node, a constraint that is not part of the definition of spanning trees. For this reason all standard inference algorithms for spanning trees are suboptimal for inference over dep… ▽ More

    Submitted 28 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: 16 pages, 4 algorithms, 7 figures

  8. Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale

    Authors: Laurent Sartran, Samuel Barrett, Adhiguna Kuncoro, Miloš Stanojević, Phil Blunsom, Chris Dyer

    Abstract: We introduce Transformer Grammars (TGs), a novel class of Transformer language models that combine (i) the expressive power, scalability, and strong performance of Transformers and (ii) recursive syntactic compositions, which here are implemented through a special attention mask and deterministic transformation of the linearized tree. We find that TGs outperform various strong baselines on sentenc… ▽ More

    Submitted 6 December, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 17 pages, 5 figures, 2 tables and 1 algorithm. To appear in TACL, to be presented at EMNLP 2022

  9. Modality and Negation in Event Extraction

    Authors: Sander Bijl de Vroe, Liane Guillou, Miloš Stanojević, Nick McKenna, Mark Steedman

    Abstract: Language provides speakers with a rich system of modality for expressing thoughts about events, without being committed to their actual occurrence. Modality is commonly used in the political news domain, where both actual and possible courses of events are discussed. NLP systems struggle with these semantic phenomena, often incorrectly extracting events which did not happen, which can lead to issu… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: S. Bijl de Vroe, L. Guillou, M. Stanojević, N. McKenna, and M. Steedman. 2021. Modality and Negation in Event Extraction. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 31-42, online. Association for Computational Linguistics

    Journal ref: In Proceedings of CASE 2021, pages 31-42, online. Association for Computational Linguistics

  10. arXiv:2103.06130  [pdf, other

    cs.IR cs.CL cs.LG

    Stay on Topic, Please: Aligning User Comments to the Content of a News Article

    Authors: Jumanah Alshehri, Marija Stanojevic, Eduard Dragut, Zoran Obradovic

    Abstract: Social scientists have shown that up to 50% if the content posted to a news article have no relation to its journalistic content. In this study we propose a classification algorithm to categorize user comments posted to a new article base don their alignment to its content. The alignment seek to match user comments to an article based on similarity off content, entities in discussion, and topic. W… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted as a full paper at the 43rd European Conference on Information Retrieval

  11. arXiv:2005.00950  [pdf

    cs.IR cs.CL

    Extracting Entities and Topics from News and Connecting Criminal Records

    Authors: Quang Pham, Marija Stanojevic, Zoran Obradovic

    Abstract: The goal of this paper is to summarize methodologies used in extracting entities and topics from a database of criminal records and from a database of newspapers. Statistical models had successfully been used in studying the topics of roughly 300,000 New York Times articles. In addition, these models had also been used to successfully analyze entities related to people, organizations, and places (… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: This is a report submitted by an undergraduate student as preliminary work on this problem

  12. arXiv:1508.02445  [pdf, ps, other

    cs.CL

    Removing Biases from Trainable MT Metrics by Using Self-Training

    Authors: Miloš Stanojević

    Abstract: Most trainable machine translation (MT) metrics train their weights on human judgments of state-of-the-art MT systems outputs. This makes trainable metrics biases in many ways. One of them is preferring longer translations. These biased metrics when used for tuning are evaluating different types of translations -- n-best lists of translations with very diverse quality. Systems tuned with these met… ▽ More

    Submitted 10 August, 2015; originally announced August 2015.