Showing 1–26 of 26 results for author: Sabour, S

  1. arXiv:2406.20055

    cs.CV cs.LG

    SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting

    Authors: Sara Sabour, Lily Goli, George Kopanas, Mark Matthews, Dmitry Lagun, Leonidas Guibas, Alec Jacobson, David J. Fleet, Andrea Tagliasacchi

    Abstract: 3D Gaussian Splatting (3DGS) is a promising technique for 3D reconstruction, offering efficient training and rendering speeds, making it suitable for real-time applications. However, current methods require highly controlled environments (no moving people or wind-blown elements, and consistent lighting) to meet the inter-view consistency assumption of 3DGS. This makes reconstruction of real-world c…

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2402.12071

    cs.CL cs.AI

    EmoBench: Evaluating the Emotional Intelligence of Large Language Models

    Authors: Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M. C. Lee, Rada Mihalcea, Minlie Huang

    Abstract: Recent advances in Large Language Models (LLMs) have highlighted the need for robust, comprehensive, and challenging benchmarks. Yet, research on evaluating their Emotional Intelligence (EI) is considerably limited. Existing benchmarks have two major shortcomings: first, they mainly focus on emotion recognition, neglecting essential EI capabilities such as emotion regulation and thought facilitati…

    Submitted 7 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Main Conference

  3. arXiv:2311.16832

    cs.CL cs.AI

    CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

    Authors: Jinfeng Zhou, Zhuang Chen, Dazhen Wan, Bosi Wen, Yi Song, Jifan Yu, Yongkang Huang, Libiao Peng, Jiaming Yang, Xiyao Xiao, Sahand Sabour, Xiaohan Zhang, Wenjing Hou, Yijia Zhang, Yuxiao Dong, Jie Tang, Minlie Huang

    Abstract: In this paper, we present CharacterGLM, a series of models built upon ChatGLM, with model sizes ranging from 6B to 66B parameters. Our CharacterGLM is designed for generating Character-based Dialogues (CharacterDial), which aims to equip a conversational AI system with character customization for satisfying people's inherent social desires and emotional needs. On top of CharacterGLM, we can custom…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Work in progress

  4. arXiv:2310.05317

    cs.CL cs.AI

    Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond

    Authors: Siyang Liu, Naihao Deng, Sahand Sabour, Yilin Jia, Minlie Huang, Rada Mihalcea

    Abstract: We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive tokenizer samples variable segmentations from multiple outcomes, with sampling probabilities optimized based on task-specific data. We introduce a strategy for building…

    Submitted 13 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted at the main conference of The 2023 Conference on Empirical Methods in Natural Language Processing; 8 pages

    MSC Class: 68

    ACM Class: I.2.7

    Journal ref: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  5. arXiv:2305.05390

    cs.CL

    COKE: A Cognitive Knowledge Graph for Machine Theory of Mind

    Authors: Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen Meng, Minlie Huang

    Abstract: Theory of mind (ToM) refers to humans' ability to understand and infer the desires, beliefs, and intentions of others. The acquisition of ToM plays a key role in humans' social cognition and interpersonal relations. Though indispensable for social intelligence, ToM is still lacking for modern AI and NLP systems since they cannot access the human mental state and cognitive process beneath the train…

    Submitted 18 May, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: ACL 2024

  6. arXiv:2302.00833

    cs.CV cs.LG

    RobustNeRF: Ignoring Distractors with Robust Losses

    Authors: Sara Sabour, Suhani Vora, Daniel Duckworth, Ivan Krasin, David J. Fleet, Andrea Tagliasacchi

    Abstract: Neural radiance fields (NeRF) excel at synthesizing new views given multi-view, calibrated images of a static scene. When scenes include distractors, which are not persistent during image capture (moving objects, lighting variations, shadows), artifacts appear as view-dependent effects or 'floaters'. To cope with distractors, we advocate a form of robust estimation for NeRF training, modeling dist…

    Submitted 1 February, 2023; originally announced February 2023.

  7. arXiv:2212.09235

    cs.CL

    PAL: Persona-Augmented Emotional Support Conversation Generation

    Authors: Jiale Cheng, Sahand Sabour, Hao Sun, Zhuang Chen, Minlie Huang

    Abstract: Due to the lack of human resources for mental health support, there is an increasing demand for employing conversational agents for support. Recent work has demonstrated the effectiveness of dialogue models in providing emotional support. As previous studies have demonstrated that seekers' persona is an important factor for effective support, we investigate whether there are benefits to modeling s…

    Submitted 29 May, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023 findings

  8. arXiv:2211.16564

    cs.CV cs.LG

    Testing GLOM's ability to infer wholes from ambiguous parts

    Authors: Laura Culp, Sara Sabour, Geoffrey E. Hinton

    Abstract: The GLOM architecture proposed by Hinton [2021] is a recurrent neural network for parsing an image into a hierarchy of wholes and parts. When a part is ambiguous, GLOM assumes that the ambiguity can be resolved by allowing the part to make multi-modal predictions for the pose and identity of the whole to which it belongs and then using attention to similar predictions coming from other possibly am…

    Submitted 29 November, 2022; originally announced November 2022.

  9. arXiv:2211.01600

    cs.CV cs.AI cs.RO

    nerf2nerf: Pairwise Registration of Neural Radiance Fields

    Authors: Lily Goli, Daniel Rebain, Sara Sabour, Animesh Garg, Andrea Tagliasacchi

    Abstract: We introduce a technique for pairwise registration of neural fields that extends classical optimization-based local registration (i.e. ICP) to operate on Neural Radiance Fields (NeRF) -- neural 3D scene representations trained from collections of calibrated images. NeRF does not decompose illumination and color, so to make registration invariant to illumination, we introduce the concept of a ''sur…

    Submitted 3 November, 2022; originally announced November 2022.

  10. arXiv:2209.10183

    cs.CL cs.HC

    Chatbots for Mental Health Support: Exploring the Impact of Emohaa on Reducing Mental Distress in China

    Authors: Sahand Sabour, Wen Zhang, Xiyao Xiao, Yuwei Zhang, Yinhe Zheng, Jiaxin Wen, Jialu Zhao, Minlie Huang

    Abstract: The growing demand for mental health support has highlighted the importance of conversational agents as human supporters worldwide and in China. These agents could increase availability and reduce the relative costs of mental health support. The provided support can be divided into two main types: cognitive and emotional support. Existing work on this topic mainly focuses on constructing agents th…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Work Under Review

  11. arXiv:2203.03570

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  12. arXiv:2202.13587

    cs.CL cs.AI

    Rethinking and Refining the Distinct Metric

    Authors: Siyang Liu, Sahand Sabour, Yinhe Zheng, Pei Ke, Xiaoyan Zhu, Minlie Huang

    Abstract: Distinct-$n$ score\cite{Li2016} is a widely used automatic metric for evaluating diversity in language generation tasks. However, we observed that the original approach for calculating distinct scores has evident biases that tend to assign higher penalties to longer sequences. We refine the calculation of distinct scores by scaling the number of distinct tokens based on their expectations. We prov…

    Submitted 3 April, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: 4 pages, to be published at ACL2022

    ACM Class: I.2.7

  13. arXiv:2202.13047

    cs.CL

    AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation

    Authors: Chujie Zheng, Sahand Sabour, Jiaxin Wen, Zheng Zhang, Minlie Huang

    Abstract: Crowdsourced dialogue corpora are usually limited in scale and topic coverage due to the expensive cost of data curation. This would hinder the generalization of downstream dialogue models to open-domain topics. In this work, we leverage large language models for dialogue augmentation in the task of emotional support conversation (ESC). By treating dialogue augmentation as a dialogue completion ta…

    Submitted 18 May, 2023; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Findings of ACL 2023

  14. arXiv:2111.12594

    cs.CV cs.LG stat.ML

    Conditional Object-Centric Learning from Video

    Authors: Thomas Kipf, Gamaleldin F. Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff

    Abstract: Object-centric representations are a promising path toward more systematic generalization by providing flexible abstractions upon which compositional world models can be built. Recent work on simple 2D and 3D datasets has shown that models with object-centric inductive biases can learn to segment and represent meaningful objects from the statistical structure of the data alone without the need for…

    Submitted 15 March, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Published at ICLR 2022. Project page at https://slot-attention-video.github.io/

  15. arXiv:2109.05739

    cs.CL cs.AI

    CEM: Commonsense-aware Empathetic Response Generation

    Authors: Sahand Sabour, Chujie Zheng, Minlie Huang

    Abstract: A key trait of daily conversations between individuals is the ability to express empathy towards others, and exploring ways to implement empathy is a crucial step towards human-like dialogue systems. Previous approaches on this topic mainly focus on detecting and utilizing the user's emotion for generating empathetic responses. However, since empathy includes both aspects of affection and cognitio…

    Submitted 6 December, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted to AAAI 2022

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence 2022

  16. arXiv:2108.12016

    cs.LG

    DeepFlow: Abnormal Traffic Flow Detection Using Siamese Networks

    Authors: Sepehr Sabour, Sanjeev Rao, Majid Ghaderi

    Abstract: Nowadays, many cities are equipped with surveillance systems and traffic control centers to monitor vehicular traffic for road safety and efficiency. The monitoring process is mostly done manually which is inefficient and expensive. In recent years, several data-driven solutions have been proposed in the literature to automatically analyze traffic flow data using machine learning techniques. Howev…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 7 pages, 12 figures, 3 tables

  17. arXiv:2106.01144

    cs.CL

    Towards Emotional Support Dialog Systems

    Authors: Siyang Liu, Chujie Zheng, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong Jiang, Minlie Huang

    Abstract: Emotional support is a crucial ability for many conversation scenarios, including social interactions, mental health support, and customer service chats. Following reasonable procedures and using various support skills can help to effectively provide support. However, due to the lack of a well-designed task and corpora of effective emotional support conversations, research on building emotional su…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL 2021 (Long Paper)

  18. arXiv:2012.04718

    cs.CV cs.LG

    Canonical Capsules: Self-Supervised Capsules in Canonical Pose

    Authors: Weiwei Sun, Andrea Tagliasacchi, Boyang Deng, Sara Sabour, Soroosh Yazdani, Geoffrey Hinton, Kwang Moo Yi

    Abstract: We propose a self-supervised capsule architecture for 3D point clouds. We compute capsule decompositions of objects through permutation-equivariant attention, and self-supervise the process by training with pairs of randomly rotated objects. Our key idea is to aggregate the attention masks into semantic keypoints, and use these to supervise a decomposition that satisfies the capsule invariance/equ…

    Submitted 24 November, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: NeurIPS 2021; The first two authors contributed equally; Project website: https://canonical-capsules.github.io

  19. arXiv:2011.13920

    cs.CV cs.LG

    Unsupervised part representation by Flow Capsules

    Authors: Sara Sabour, Andrea Tagliasacchi, Soroosh Yazdani, Geoffrey E. Hinton, David J. Fleet

    Abstract: Capsule networks aim to parse images into a hierarchy of objects, parts and relations. While promising, they remain limited by an inability to learn effective low level part descriptions. To address this issue we propose a way to learn primary capsule encoders that detect atomic parts from a single image. During training we exploit motion as a powerful perceptual cue for part definition, with an e…

    Submitted 19 February, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

  20. arXiv:1907.02957

    cs.LG cs.CR cs.CV stat.ML

    Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

    Authors: Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

    Abstract: Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. In this paper, we first detect adversarial examples or otherwise corrupted images based on a class-conditional reconstruction of the input. To specifically attack our detection mechanism, we propose the Reconstructive Attack which seeks both to cause a misclassification and…

    Submitted 18 February, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

    Journal ref: ICLR 2020

  21. arXiv:1906.06818

    stat.ML cs.CV cs.LG cs.NE

    Stacked Capsule Autoencoders

    Authors: Adam R. Kosiorek, Sara Sabour, Yee Whye Teh, Geoffrey E. Hinton

    Abstract: Objects are composed of a set of geometrically organized parts. We introduce an unsupervised capsule autoencoder (SCAE), which explicitly uses geometric relationships between parts to reason about objects. Since these relationships do not depend on the viewpoint, our model is robust to viewpoint changes. SCAE consists of two stages. In the first stage, the model predicts presences and poses of par…

    Submitted 2 December, 2019; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; 14 pages, 7 figures, 4 tables, code is available at https://github.com/google-research/google-research/tree/master/stacked_capsule_autoencoders

  22. arXiv:1902.08295

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w…

    Submitted 21 February, 2019; originally announced February 2019.

  23. arXiv:1811.06969

    cs.LG cs.CR cs.CV stat.ML

    DARCCC: Detecting Adversaries by Reconstruction from Class Conditional Capsules

    Authors: Nicholas Frosst, Sara Sabour, Geoffrey Hinton

    Abstract: We present a simple technique that allows capsule models to detect adversarial images. In addition to being trained to classify images, the capsule model is trained to reconstruct the images from the pose parameters and identity of the correct top-level capsule. Adversarial images do not look like a typical member of the predicted class and they have much larger reconstruction errors when the reco…

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: To be presented at NIPS 2018 Workshop on Security in Machine Learning

  24. arXiv:1810.01398

    cs.LG cs.AI cs.CL stat.ML

    Optimal Completion Distillation for Sequence Learning

    Authors: Sara Sabour, William Chan, Mohammad Norouzi

    Abstract: We present Optimal Completion Distillation (OCD), a training procedure for optimizing sequence to sequence models based on edit distance. OCD is efficient, has no hyper-parameters of its own, and does not require pretraining or joint optimization with conditional log-likelihood. Given a partial sequence generated by the model, we first identify the set of optimal suffixes that minimize the total e…

    Submitted 14 January, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

  25. arXiv:1710.09829

    cs.CV

    Dynamic Routing Between Capsules

    Authors: Sara Sabour, Nicholas Frosst, Geoffrey E Hinton

    Abstract: A capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity such as an object or an object part. We use the length of the activity vector to represent the probability that the entity exists and its orientation to represent the instantiation parameters. Active capsules at one level make predictions, via transformation matrices, for the…

    Submitted 7 November, 2017; v1 submitted 26 October, 2017; originally announced October 2017.

  26. arXiv:1511.05122

    cs.CV cs.LG cs.NE

    Adversarial Manipulation of Deep Representations

    Authors: Sara Sabour, Yanshuai Cao, Fartash Faghri, David J. Fleet

    Abstract: We show that the representation of an image in a deep neural network (DNN) can be manipulated to mimic those of other natural images, with only minor, imperceptible perturbations to the original image. Previous methods for generating adversarial images focused on image perturbations designed to produce erroneous class labels, while we concentrate on the internal layers of DNN representations. In t…

    Submitted 4 March, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: Accepted as a conference paper at ICLR 2016