Skip to main content

Showing 1–14 of 14 results for author: Gallagher, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.17207  [pdf, other

    cs.CV

    Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection Using Text-image Models

    Authors: Yunhao Ge, Jie Ren, Jia** Zhao, Kaifeng Chen, Andrew Gallagher, Laurent Itti, Balaji Lakshminarayanan

    Abstract: We focus on the challenge of out-of-distribution (OOD) detection in deep learning models, a crucial aspect in ensuring reliability. Despite considerable effort, the problem remains significantly challenging in deep learning models due to their propensity to output over-confident predictions for OOD inputs. We propose a novel one-class open-set OOD detector that leverages text-image pre-trained mod… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 16 pages (including appendix and references), 3 figures

  2. arXiv:2212.01758  [pdf, other

    cs.CV

    Improving Zero-shot Generalization and Robustness of Multi-modal Models

    Authors: Yunhao Ge, Jie Ren, Andrew Gallagher, Yuxiao Wang, Ming-Hsuan Yang, Hartwig Adam, Laurent Itti, Balaji Lakshminarayanan, Jia** Zhao

    Abstract: Multi-modal image-text models such as CLIP and LiT have demonstrated impressive performance on image classification benchmarks and their zero-shot generalization ability is particularly exciting. While the top-5 zero-shot accuracies of these models are very high, the top-1 accuracies are much lower (over 25% gap in some cases). We investigate the reasons for this performance gap and find that many… ▽ More

    Submitted 25 May, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: CVPR 2023

  3. arXiv:2211.05183  [pdf, other

    cs.CV cs.LG

    An Empirical Study on Clustering Pretrained Embeddings: Is Deep Strictly Better?

    Authors: Tyler R. Scott, Ting Liu, Michael C. Mozer, Andrew C. Gallagher

    Abstract: Recent research in clustering face embeddings has found that unsupervised, shallow, heuristic-based methods -- including $k$-means and hierarchical agglomerative clustering -- underperform supervised, deep, inductive methods. While the reported improvements are indeed impressive, experiments are mostly limited to face datasets, where the clustered embeddings are highly discriminative or well-separ… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  4. arXiv:2103.15718  [pdf, other

    cs.LG cs.CV

    von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning

    Authors: Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer

    Abstract: Recent work has argued that classification losses utilizing softmax cross-entropy are superior not only for fixed-set classification tasks, but also by outperforming losses developed specifically for open-set tasks including few-shot learning and retrieval. Softmax classifiers have been studied using different embedding geometries -- Euclidean, hyperbolic, and spherical -- and claims have been mad… ▽ More

    Submitted 3 December, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: ICCV 2021

  5. arXiv:2006.09273  [pdf, other

    cs.LG stat.ML

    Density of States Estimation for Out-of-Distribution Detection

    Authors: Warren R. Morningstar, Cusuh Ham, Andrew G. Gallagher, Balaji Lakshminarayanan, Alexander A. Alemi, Joshua V. Dillon

    Abstract: Perhaps surprisingly, recent studies have shown probabilistic model likelihoods have poor specificity for out-of-distribution (OOD) detection and often assign higher likelihoods to OOD data than in-distribution data. To ameliorate this issue we propose DoSE, the density of states estimator. Drawing on the statistical physics notion of ``density of states,'' the DoSE decision rule avoids direct com… ▽ More

    Submitted 22 June, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS. Corrected footnote from: "34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada" to "Preprint. Under review."

  6. arXiv:2005.07545  [pdf, other

    eess.IV cs.CV

    3D deformable registration of longitudinal abdominopelvic CT images using unsupervised deep learning

    Authors: Maureen van Eijnatten, Leonardo Rundo, K. Joost Batenburg, Felix Lucka, Emma Beddowes, Carlos Caldas, Ferdia A. Gallagher, Evis Sala, Carola-Bibiane Schönlieb, Ramona Woitek

    Abstract: This study investigates the use of the unsupervised deep learning framework VoxelMorph for deformable registration of longitudinal abdominopelvic CT images acquired in patients with bone metastases from breast cancer. The CT images were refined prior to registration by automatically removing the CT table and all other extra-corporeal components. To improve the learning capabilities of VoxelMorph w… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  7. arXiv:2003.01687  [pdf, other

    cs.LG stat.ML

    Automatic Differentiation Variational Inference with Mixtures

    Authors: Warren R. Morningstar, Sharad M. Vikram, Cusuh Ham, Andrew Gallagher, Joshua V. Dillon

    Abstract: Automatic Differentiation Variational Inference (ADVI) is a useful tool for efficiently learning probabilistic models in machine learning. Generally approximate posteriors learned by ADVI are forced to be unimodal in order to facilitate use of the reparameterization trick. In this paper, we show how stratified sampling may be used to enable mixture distributions as the approximate posterior, and d… ▽ More

    Submitted 24 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Submitted to NeurIPS 2020, Corrected footnote from: "34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada" to "Preprint. Under review."

  8. arXiv:1901.01342  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

    Authors: Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

    Abstract: Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled audio-visual dataset for this task has constrained algorithm evaluations with respect to data diversity, environments, and accuracy. This has made com… ▽ More

    Submitted 24 May, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

  9. arXiv:1810.00319  [pdf, other

    cs.LG cs.CV stat.ML

    Modeling Uncertainty with Hedged Instance Embedding

    Authors: Seong Joon Oh, Kevin Murphy, Jiyan Pan, Joseph Roth, Florian Schroff, Andrew Gallagher

    Abstract: Instance embeddings are an efficient and versatile image representation that facilitates applications like recognition, verification, retrieval, and clustering. Many metric learning methods represent the input as a single point in the embedding space. Often the distance between points is used as a proxy for match confidence. However, this can fail to represent uncertainty arising when the input is… ▽ More

    Submitted 26 August, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: 15 pages, 11 figures, updated version of ICLR'19

  10. arXiv:1808.00606  [pdf, other

    cs.SD eess.AS

    AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

    Authors: Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin Wilson, Zhonghua Xi

    Abstract: Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization. Both audio- and vision-based approaches have been used for this task in various settings, often tailored toward end applications. However, much of the prior work reports results in synthetic settings, on task-specific datasets, or… ▽ More

    Submitted 23 August, 2018; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Interspeech, 2018

  11. arXiv:1806.05252  [pdf, other

    cs.CV

    Finding your Lookalike: Measuring Face Similarity Rather than Face Identity

    Authors: Amir Sadovnik, Wassim Gharbi, Thanh Vu, Andrew Gallagher

    Abstract: Face images are one of the main areas of focus for computer vision, receiving on a wide variety of tasks. Although face recognition is probably the most widely researched, many other tasks such as kinship detection, facial expression classification and facial aging have been examined. In this work we propose the new, subjective task of quantifying perceived face similarity between a pair of faces.… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: Accepted to the 1st CVPR Workshop on Visual Understanding of Subjective Attributes of Data 2018

  12. arXiv:1502.05678  [pdf, other

    cs.CV

    VIP: Finding Important People in Images

    Authors: Clint Solomon Mathialagan, Andrew C. Gallagher, Dhruv Batra

    Abstract: People preserve memories of events such as birthdays, weddings, or vacations by capturing photos, often depicting groups of people. Invariably, some individuals in the image are more important than others given the context of the event. This paper analyzes the concept of the importance of individuals in group photographs. We address two specific questions -- Given an image, who are the most import… ▽ More

    Submitted 16 April, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

  13. arXiv:1205.6867  [pdf, other

    q-bio.PE cs.DM

    Minimizing the average distance to a closest leaf in a phylogenetic tree

    Authors: Frederick A. Matsen, Aaron Gallagher, Connor McCoy

    Abstract: When performing an analysis on a collection of molecular sequences, it can be convenient to reduce the number of sequences under consideration while maintaining some characteristic of a larger collection of sequences. For example, one may wish to select a subset of high-quality sequences that represent the diversity of a larger collection of sequences. One may also wish to specialize a large datab… ▽ More

    Submitted 31 August, 2012; v1 submitted 30 May, 2012; originally announced May 2012.

    Comments: Please contact us with any comments or questions!

  14. arXiv:1109.5423  [pdf, other

    q-bio.PE cs.DS

    Reconciling taxonomy and phylogenetic inference: formalism and algorithms for describing discord and inferring taxonomic roots

    Authors: Frederick A. Matsen, Aaron Gallagher

    Abstract: Although taxonomy is often used informally to evaluate the results of phylogenetic inference and find the root of phylogenetic trees, algorithmic methods to do so are lacking. In this paper we formalize these procedures and develop algorithms to solve the relevant problems. In particular, we introduce a new algorithm that solves a "subcoloring" problem for expressing the difference between the tax… ▽ More

    Submitted 1 October, 2011; v1 submitted 25 September, 2011; originally announced September 2011.

    Comments: Version submitted to Algorithms for Molecular Biology. A number of fixes from previous version