Showing 1–8 of 8 results for author: Ma, M Q

Searching in archive cs.
  1. arXiv:2306.04898  [pdf, other]

    cs.LG cs.CV

    Understanding Masked Autoencoders via Hierarchical Latent Variable Models

    Authors: Lingjing Kong, Martin Q. Ma, Guangyi Chen, Eric P. Xing, Yuejie Chi, Louis-Philippe Morency, Kun Zhang

    Abstract: Masked autoencoder (MAE), a simple and effective self-supervised learning framework based on the reconstruction of masked image regions, has recently achieved prominent success in a variety of vision tasks. Despite the emergence of intriguing empirical observations on MAE, a theoretically principled understanding is still lacking. In this work, we formally characterize and justify existing empiric…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: CVPR 2023 Highlight

  2. arXiv:2208.01036  [pdf, other]

    cs.LG cs.AI cs.CV

    Face-to-Face Contrastive Learning for Social Intelligence Question-Answering

    Authors: Alex Wilf, Martin Q. Ma, Paul Pu Liang, Amir Zadeh, Louis-Philippe Morency

    Abstract: Creating artificial social intelligence - algorithms that can understand the nuances of multi-person interactions - is an exciting and emerging challenge in processing facial expressions and gestures from multimodal videos. Recent multimodal methods have set the state of the art on many tasks, but have difficulty modeling the complex face-to-face conversational dynamics across speaking turns in so…

    Submitted 27 October, 2022; v1 submitted 29 July, 2022; originally announced August 2022.

  3. arXiv:2202.05458  [pdf, other]

    cs.LG

    Conditional Contrastive Learning with Kernel

    Authors: Yao-Hung Hubert Tsai, Tianqin Li, Martin Q. Ma, Han Zhao, Kun Zhang, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: Conditional contrastive learning frameworks consider the conditional sampling procedure that constructs positive or negative data pairs conditioned on specific variables. Fair contrastive learning constructs negative pairs, for example, from the same gender (conditioning on sensitive information), which in turn reduces undesirable information from the learned representations; weakly supervised con…

    Submitted 15 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  4. arXiv:2106.02866  [pdf, other]

    cs.LG

    Conditional Contrastive Learning for Improving Fairness in Self-Supervised Learning

    Authors: Martin Q. Ma, Yao-Hung Hubert Tsai, Paul Pu Liang, Han Zhao, Kun Zhang, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Contrastive self-supervised learning (SSL) learns an embedding space that maps similar data pairs closer and dissimilar data pairs farther apart. Despite its success, one issue has been overlooked: the fairness aspect of representations learned using contrastive SSL. Without mitigation, contrastive SSL techniques can incorporate sensitive information such as gender or race and cause potentially un…

    Submitted 27 June, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

  5. arXiv:2104.01422  [pdf, other]

    cs.LG

    A Large-scale Study on Unsupervised Outlier Model Selection: Do Internal Strategies Suffice?

    Authors: Martin Q. Ma, Yue Zhao, Xiaorong Zhang, Leman Akoglu

    Abstract: Given an unsupervised outlier detection task, how should one select a detection algorithm as well as its hyperparameters (jointly called a model)? Unsupervised model selection is notoriously difficult, in the absence of hold-out validation data with ground-truth labels. Therefore, the problem is vastly understudied. In this work, we study the feasibility of employing internal model evaluation stra…

    Submitted 12 April, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

  6. arXiv:2103.11275  [pdf, other]

    cs.LG cs.IT

    Self-supervised Representation Learning with Relative Predictive Coding

    Authors: Yao-Hung Hubert Tsai, Martin Q. Ma, Muqiao Yang, Han Zhao, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: This paper introduces Relative Predictive Coding (RPC), a new contrastive representation learning objective that maintains a good balance among training stability, minibatch size sensitivity, and downstream task performance. The key to the success of RPC is two-fold. First, RPC introduces the relative parameters to regularize the objective for boundedness and low variance. Second, RPC contains no…

    Submitted 12 April, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

  7. arXiv:2004.14198  [pdf, other]

    cs.CL

    Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis

    Authors: Yao-Hung Hubert Tsai, Martin Q. Ma, Muqiao Yang, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: The human language can be expressed through multiple sources of information known as modalities, including tones of voice, facial gestures, and spoken language. Recent multimodal learning with strong performances on human-centric tasks such as sentiment analysis and emotion recognition are often black-box, with very limited interpretability. In this paper we propose Multimodal Routing, which dynam…

    Submitted 5 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

  8. arXiv:1910.10202  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Complex Transformer: A Framework for Modeling Complex-Valued Sequence

    Authors: Muqiao Yang, Martin Q. Ma, Dongyu Li, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov

    Abstract: While deep learning has received a surge of interest in a variety of fields in recent years, major deep learning models barely use complex numbers. However, speech, signal and audio data are naturally complex-valued after Fourier Transform, and studies have shown a potentially richer representation of complex nets. In this paper, we propose a Complex Transformer, which incorporates the transformer…

    Submitted 6 August, 2021; v1 submitted 22 October, 2019; originally announced October 2019.