Skip to main content

Showing 1–4 of 4 results for author: Vouitsis, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.13744  [pdf, other

    cs.LG cs.HC stat.ML

    Conformal Prediction Sets Improve Human Decision Making

    Authors: Jesse C. Cresswell, Yi Sui, Bhargava Kumar, Noël Vouitsis

    Abstract: In response to everyday queries, humans explicitly signal uncertainty and offer alternative answers when they are unsure. Machine learning models that output calibrated prediction sets through conformal prediction mimic this human behaviour; larger sets signal greater uncertainty while providing alternatives. In this work, we study the usefulness of conformal prediction sets as an aid for human de… ▽ More

    Submitted 9 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Published at ICML 2024. Code available at https://github.com/layer6ai-labs/hitl-conformal-prediction

  2. arXiv:2312.10144  [pdf, other

    cs.LG cs.AI cs.CV

    Data-Efficient Multimodal Fusion on a Single GPU

    Authors: Noël Vouitsis, Zhaoyan Liu, Satya Krishna Gorti, Valentin Villecroze, Jesse C. Cresswell, Guangwei Yu, Gabriel Loaiza-Ganem, Maksims Volkovs

    Abstract: The goal of multimodal alignment is to learn a single latent space that is shared between multimodal inputs. The most powerful models in this space have been trained using massive datasets of paired inputs and large-scale computational resources, making them prohibitively expensive to train in many practical scenarios. We surmise that existing unimodal encoders pre-trained on large amounts of unim… ▽ More

    Submitted 10 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 (Highlight)

  3. arXiv:2304.13742  [pdf, other

    cs.LG cs.AI stat.ML

    TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

    Authors: Zhaoyan Liu, Noel Vouitsis, Satya Krishna Gorti, Jimmy Ba, Gabriel Loaiza-Ganem

    Abstract: We propose TR0N, a highly general framework to turn pre-trained unconditional generative models, such as GANs and VAEs, into conditional models. The conditioning can be highly arbitrary, and requires only a pre-trained auxiliary model. For example, we show how to turn unconditional models into class-conditional ones with the help of a classifier, and also into text-to-image models by leveraging CL… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICML 2023

  4. arXiv:2203.15086  [pdf, other

    cs.CV

    X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval

    Authors: Satya Krishna Gorti, Noel Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu

    Abstract: In text-video retrieval, the objective is to learn a cross-modal similarity function between a text and a video that ranks relevant text-video pairs higher than irrelevant pairs. However, videos inherently express a much wider gamut of information than texts. Instead, texts often capture sub-regions of entire videos and are most semantically similar to certain frames within videos. Therefore, for… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: CVPR 2022