Skip to main content

Showing 1–12 of 12 results for author: Sicilia, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.03284  [pdf, other

    cs.CL cs.AI cs.LG

    Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

    Authors: Anthony Sicilia, Hyunwoo Kim, Khyathi Raghavi Chandu, Malihe Alikhani, Jack Hessel

    Abstract: Effective interlocutors account for the uncertain goals, beliefs, and emotions of others. But even the best human conversationalist cannot perfectly anticipate the trajectory of a dialogue. How well can language models represent inherent uncertainty in conversations? We propose FortUne Dial, an expansion of the long-standing "conversation forecasting" task: instead of just accuracy, evaluation is… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 2 Figures; 7 Tables; 27 pages

  2. arXiv:2307.04303  [pdf, other

    cs.CL cs.AI

    Learning to Generate Equitable Text in Dialogue from Biased Training Data

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: The ingrained principles of fairness in a dialogue system's decision-making process and generated responses are crucial for user engagement, satisfaction, and task achievement. Absence of equitable and inclusive principles can hinder the formation of common ground, which in turn negatively impacts the overall performance of the system. For example, misusing pronouns in a user interaction may cause… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  3. arXiv:2305.14195  [pdf, other

    cs.CL cs.AI

    HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations

    Authors: Anthony Sicilia, Jennifer C. Gates, Malihe Alikhani

    Abstract: While demographic factors like age and gender change the way people talk, and in particular, the way people talk to machines, there is little investigation into how large pre-trained language models (LMs) can adapt to these changes. To remedy this gap, we consider how demographic factors in LM language skills can be measured to determine compatibility with a target demographic. We suggest clinical… ▽ More

    Submitted 5 February, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 9 figures, 5 tables

  4. arXiv:2210.07777  [pdf, other

    cs.CL cs.LG

    LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: Algorithms for text-generation in dialogue can be misguided. For example, in task-oriented settings, reinforcement learning that optimizes only task-success can lead to abysmal lexical diversity. We hypothesize this is due to poor theoretical understanding of the objectives in text-generation and their relation to the learning process (i.e., model training). To this end, we propose a new theoretic… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  5. arXiv:2207.07255  [pdf, other

    cs.CL cs.LG

    Modeling Non-Cooperative Dialogue: Theoretical and Empirical Insights

    Authors: Anthony Sicilia, Tristan Maidment, Pat Healy, Malihe Alikhani

    Abstract: Investigating cooperativity of interlocutors is central in studying pragmatics of dialogue. Models of conversation that only assume cooperative agents fail to explain the dynamics of strategic conversations. Thus, we investigate the ability of agents to identify non-cooperative interlocutors while completing a concurrent visual-dialogue task. Within this novel setting, we study the optimality of c… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  6. arXiv:2207.05685  [pdf, other

    cs.LG

    PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners

    Authors: Anthony Sicilia, Katherine Atwell, Malihe Alikhani, Seong Jae Hwang

    Abstract: Multiclass neural networks are a common tool in modern unsupervised domain adaptation, yet an appropriate theoretical description for their non-uniform sample complexity is lacking in the adaptation literature. To fill this gap, we propose the first PAC-Bayesian adaptation bounds for multiclass learners. We facilitate practical use of our bounds by also proposing the first approximation techniques… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  7. arXiv:2205.06427  [pdf, other

    cs.CV cs.AI cs.LG

    Test-time Fourier Style Calibration for Domain Generalization

    Authors: Xingchen Zhao, Chang Liu, Anthony Sicilia, Seong Jae Hwang, Yun Fu

    Abstract: The topic of generalizing machine learning models learned on a collection of source domains to unknown target domains is challenging. While many domain generalization (DG) methods have achieved promising results, they primarily rely on the source domains at train-time without manipulating the target domains at test-time. Thus, it is still possible that those methods can overfit to source domains a… ▽ More

    Submitted 18 May, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: 31st International Joint Conference on Artificial Intelligence (IJCAI) 2022

  8. arXiv:2203.11317  [pdf, other

    cs.CL cs.LG

    The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser Error

    Authors: Katherine Atwell, Anthony Sicilia, Seong Jae Hwang, Malihe Alikhani

    Abstract: Discourse analysis allows us to attain inferences of a text document that extend beyond the sentence-level. The current performance of discourse models is very low on texts outside of the training distribution's coverage, diminishing the practical utility of existing models. There is need for a measure that can inform us to what extent our model generalizes from the training to the test sample whe… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  9. arXiv:2104.05600  [pdf, other

    cs.LG cs.CV stat.ML

    PAC Bayesian Performance Guarantees for Deep (Stochastic) Networks in Medical Imaging

    Authors: Anthony Sicilia, Xingchen Zhao, Anastasia Sosnovskikh, Seong Jae Hwang

    Abstract: Application of deep neural networks to medical imaging tasks has in some sense become commonplace. Still, a "thorn in the side" of the deep learning movement is the argument that deep networks are prone to overfitting and are thus unable to generalize well when datasets are small (as is common in medical imaging tasks). One way to bolster confidence is to provide mathematical guarantees, or bounds… ▽ More

    Submitted 8 July, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: MICCAI 2021

  10. arXiv:2102.13147  [pdf, other

    cs.CV cs.LG eess.IV

    Multi-Domain Learning by Meta-Learning: Taking Optimal Steps in Multi-Domain Loss Landscapes by Inner-Loop Learning

    Authors: Anthony Sicilia, Xingchen Zhao, Davneet Minhas, Erin O'Connor, Howard Aizenstein, William Klunk, Dana Tudorascu, Seong Jae Hwang

    Abstract: We consider a model-agnostic solution to the problem of Multi-Domain Learning (MDL) for multi-modal applications. Many existing MDL techniques are model-dependent solutions which explicitly require nontrivial architectural changes to construct domain-specific modules. Thus, properly applying these MDL techniques for new problems with well-established models, e.g. U-Net for semantic segmentation, m… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: IEEE International Symposium on Biomedical Imaging 2021

  11. arXiv:2102.06650  [pdf, other

    cs.CV

    Robust White Matter Hyperintensity Segmentation on Unseen Domain

    Authors: Xingchen Zhao, Anthony Sicilia, Davneet Minhas, Erin O'Connor, Howard Aizenstein, William Klunk, Dana Tudorascu, Seong Jae Hwang

    Abstract: Typical machine learning frameworks heavily rely on an underlying assumption that training and test data follow the same distribution. In medical imaging which increasingly begun acquiring datasets from multiple sites or scanners, this identical distribution assumption often fails to hold due to systematic variability induced by site or scanner dependent factors. Therefore, we cannot simply expect… ▽ More

    Submitted 16 February, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: IEEE International Symposium on Biomedical Imaging 2021

  12. arXiv:2102.03924  [pdf, other

    cs.LG cs.CV

    Domain Adversarial Neural Networks for Domain Generalization: When It Works and How to Improve

    Authors: Anthony Sicilia, Xingchen Zhao, Seong Jae Hwang

    Abstract: Theoretically, domain adaptation is a well-researched problem. Further, this theory has been well-used in practice. In particular, we note the bound on target error given by Ben-David et al. (2010) and the well-known domain-aligning algorithm based on this work using Domain Adversarial Neural Networks (DANN) presented by Ganin and Lempitsky (2015). Recently, multiple variants of DANN have been pro… ▽ More

    Submitted 18 March, 2022; v1 submitted 7 February, 2021; originally announced February 2021.