Skip to main content

Showing 151–200 of 591 results for author: XIong, C

.
  1. arXiv:2205.05675  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, **gyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, **shan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang , et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  2. arXiv:2205.03284  [pdf, other

    cs.IR

    Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder

    Authors: Zhenghao Liu, Han Zhang, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Xiaohua Li

    Abstract: Dense retrievers encode queries and documents and map them in an embedding space using pre-trained language models. These embeddings need to be high-dimensional to fit training signals and guarantee the retrieval effectiveness of dense retrievers. However, these high-dimensional embeddings lead to larger index storage and higher retrieval latency. To reduce the embedding dimensions of dense retrie… ▽ More

    Submitted 22 October, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted by EMNLP 2022

  3. P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning

    Authors: Xiaomeng Hu, Shi Yu, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu, Ge Yu

    Abstract: Compared to other language tasks, applying pre-trained language models (PLMs) for search ranking often requires more nuances and training signals. In this paper, we identify and study the two mismatches between pre-training and ranking fine-tuning: the training schema gap regarding the differences in training objectives and model architectures, and the task knowledge gap considering the discrepanc… ▽ More

    Submitted 4 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted by SIGIR 2022

    ACM Class: H.3.3

  4. arXiv:2205.01730  [pdf, other

    cs.CL cs.HC

    Quiz Design Task: Hel** Teachers Create Quizzes with Automated Question Generation

    Authors: Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Wenhao Liu, Caiming Xiong

    Abstract: Question generation (QGen) models are often evaluated with standardized NLG metrics that are based on n-gram overlap. In this paper, we measure whether these metric improvements translate to gains in a practical setting, focusing on the use case of hel** teachers automate the generation of reading comprehension quizzes. In our study, teachers building a quiz receive question suggestions, which t… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL 2022 Special HCI Theme (Findings, short paper), 10 pages, 6 figures

  5. arXiv:2204.13207  [pdf, other

    cs.CV cs.AI cs.LG

    Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework

    Authors: Shu Zhang, Ran Xu, Caiming Xiong, Chetan Ramaiah

    Abstract: Current contrastive learning frameworks focus on leveraging a single supervisory signal to learn representations, which limits the efficacy on unseen data and downstream tasks. In this paper, we present a hierarchical multi-label representation learning framework that can leverage all available labels and preserve the hierarchical relationship between classes. We introduce novel hierarchy preservi… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR, 2022

  6. arXiv:2204.06644  [pdf, other

    cs.LG cs.AI cs.CL

    METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

    Authors: Payal Bajaj, Chenyan Xiong, Guolin Ke, Xiaodong Liu, Di He, Saurabh Tiwary, Tie-Yan Liu, Paul Bennett, Xia Song, Jianfeng Gao

    Abstract: We present an efficient method of pretraining large-scale autoencoding language models using training signals generated by an auxiliary model. Originated in ELECTRA, this training strategy has demonstrated sample-efficiency to pretrain models at the scale of hundreds of millions of parameters. In this work, we conduct a comprehensive empirical study, and propose a recipe, namely "Model generated d… ▽ More

    Submitted 16 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Update details in scaled initialization and add acknowledgement

  7. arXiv:2204.05356  [pdf, other

    cs.CL

    A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis

    Authors: Ehsan Hosseini-Asl, Wenhao Liu, Caiming Xiong

    Abstract: Sentiment analysis is an important task in natural language processing. In recent works, pre-trained language models are often used to achieve state-of-the-art results, especially when training data is scarce. It is common to fine-tune on the downstream task, usually by adding task-specific layers on top of the model. In this paper, we focus on aspect-based sentiment analysis, which involves extra… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to Findings of NAACL 2022

  8. arXiv:2204.03329  [pdf

    cs.RO eess.SY

    Information-driven Path Planning for Hybrid Aerial Underwater Vehicles

    Authors: Zheng Zeng, Chengke Xiong, Xinyi Yuan, Yulin Bai, Yufei **, Di Lu, Lian Lian

    Abstract: This paper presents a novel Rapidly-exploring Adaptive Sampling Tree (RAST) algorithm for the adaptive sampling mission of a hybrid aerial underwater vehicle (HAUV) in an air-sea 3D environment. This algorithm innovatively combines the tournament-based point selection sampling strategy, the information heuristic search process and the framework of Rapidly-exploring Random Tree (RRT) algorithm. Hen… ▽ More

    Submitted 8 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  9. arXiv:2204.03243  [pdf, other

    cs.CL cs.LG

    Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

    Authors: Yu Meng, Chenyan Xiong, Payal Bajaj, Saurabh Tiwary, Paul Bennett, Jiawei Han, Xia Song

    Abstract: We present a new framework AMOS that pretrains text encoders with an Adversarial learning curriculum via a Mixture Of Signals from multiple auxiliary generators. Following ELECTRA-style pretraining, the main encoder is trained as a discriminator to detect replaced tokens generated by auxiliary masked language models (MLMs). Different from ELECTRA which trains one MLM as the generator, we jointly t… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: ICLR 2022. (Code and Models: https://github.com/microsoft/AMOS)

  10. arXiv:2204.02011  [pdf, other

    cs.AI

    ELECRec: Training Sequential Recommenders as Discriminators

    Authors: Yongjun Chen, Jia Li, Caiming Xiong

    Abstract: Sequential recommendation is often considered as a generative task, i.e., training a sequential encoder to generate the next item of a user's interests based on her historical interacted items. Despite their prevalence, these methods usually require training with more meaningful samples to be effective, which otherwise will lead to a poorly trained model. In this work, we propose to train the sequ… ▽ More

    Submitted 21 July, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted to SIGIR 2022

  11. arXiv:2203.15508  [pdf, other

    cs.LG cs.AI cs.IR

    Improving Contrastive Learning with Model Augmentation

    Authors: Zhiwei Liu, Yongjun Chen, Jia Li, Man Luo, Philip S. Yu, Caiming Xiong

    Abstract: The sequential recommendation aims at predicting the next items in user behaviors, which can be solved by characterizing item relationships in sequences. Due to the data sparsity and noise issues in sequences, a new self-supervised learning (SSL) paradigm is proposed to improve the performance, which employs contrastive learning between positive and negative views of sequences. However, existing… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Preprint. Still under reivew

  12. arXiv:2203.13474  [pdf, other

    cs.LG cs.CL cs.PL

    CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis

    Authors: Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong

    Abstract: Program synthesis strives to generate a computer program as a solution to a given problem specification, expressed with input-output examples or natural language descriptions. The prevalence of large language models advances the state-of-the-art for program synthesis, though limited training resources and data impede open access to such models. To democratize this, we train and release a family of… ▽ More

    Submitted 27 February, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  13. arXiv:2203.12187  [pdf, other

    cs.CL cs.AI

    Converse: A Tree-Based Modular Task-Oriented Dialogue System

    Authors: Tian Xie, Xinyi Yang, Angela S. Lin, Feihong Wu, Kazuma Hashimoto, ** Qu, Young Mo Kang, Wenpeng Yin, Huan Wang, Semih Yavuz, Gang Wu, Michael Jones, Richard Socher, Yingbo Zhou, Wenhao Liu, Caiming Xiong

    Abstract: Creating a system that can have meaningful conversations with humans to help accomplish tasks is one of the ultimate goals of Artificial Intelligence (AI). It has defined the meaning of AI since the beginning. A lot has been accomplished in this area recently, with voice assistant products entering our daily lives and chat bot systems becoming commonplace in customer service. At first glance there… ▽ More

    Submitted 9 May, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

  14. arXiv:2203.08512  [pdf, other

    cs.CL

    ConTinTin: Continual Learning from Task Instructions

    Authors: Wenpeng Yin, Jia Li, Caiming Xiong

    Abstract: The mainstream machine learning paradigms for NLP often work with two underlying presumptions. First, the target task is predefined and static; a system merely needs to learn to solve it exclusively. Second, the supervision of a task mainly comes from a set of labeled examples. A question arises: how to build a system that can keep learning new tasks from their instructions? This work defines a ne… ▽ More

    Submitted 18 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: ACL'2022 camera-ready

  15. arXiv:2203.07586  [pdf, other

    cs.CL

    Long Document Summarization with Top-down and Bottom-up Inference

    Authors: Bo Pang, Erik Nijkamp, Wojciech Kryściński, Silvio Savarese, Yingbo Zhou, Caiming Xiong

    Abstract: Text summarization aims to condense long documents and retain key information. Critical to the success of a summarization model is the faithful inference of latent representations of words or tokens in the source documents. Most recent models infer the latent representations with a transformer encoder, which is purely bottom-up. Also, self-attention-based inference models face the challenge of qua… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 21 pages

  16. arXiv:2203.00073  [pdf, other

    cs.CL cs.AI

    Structure Extraction in Task-Oriented Dialogues with Slot Clustering

    Authors: Liang Qiu, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong

    Abstract: Extracting structure information from dialogue data can help us better understand user and system behaviors. In task-oriented dialogues, dialogue structure has often been considered as transition graphs among dialogue states. However, annotating dialogue states manually is expensive and time-consuming. In this paper, we propose a simple yet effective approach for structure extraction in task-orien… ▽ More

    Submitted 15 March, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

  17. arXiv:2202.11091  [pdf, other

    cs.LG cs.AI math.ST stat.ME stat.ML

    Efficient and Differentiable Conformal Prediction with General Function Classes

    Authors: Yu Bai, Song Mei, Huan Wang, Yingbo Zhou, Caiming Xiong

    Abstract: Quantifying the data uncertainty in learning tasks is often done by learning a prediction interval or prediction set of the label given the input. Two commonly desired properties for learned prediction sets are \emph{valid coverage} and \emph{good efficiency} (such as low length or low cardinality). Conformal prediction is a powerful technique for learning prediction sets with valid coverage, yet… ▽ More

    Submitted 29 May, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Appearing at ICLR 2022

  18. Intent Contrastive Learning for Sequential Recommendation

    Authors: Yongjun Chen, Zhiwei Liu, Jia Li, Julian McAuley, Caiming Xiong

    Abstract: Users' interactions with items are driven by various intents (e.g., preparing for holiday gifts, shop** for fishing equipment, etc.).However, users' underlying intents are often unobserved/latent, making it challenging to leverage such latent intents forSequentialrecommendation(SR). To investigate the benefits of latent intents and leverage them effectively for recommendation, we proposeIntentCo… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

  19. arXiv:2201.12086  [pdf, other

    cs.CV

    BLIP: Bootstrap** Language-Image Pre-training for Unified Vision-Language Understanding and Generation

    Authors: Junnan Li, Dongxu Li, Caiming Xiong, Steven Hoi

    Abstract: Vision-Language Pre-training (VLP) has advanced the performance for many vision-language tasks. However, most existing pre-trained models only excel in either understanding-based tasks or generation-based tasks. Furthermore, performance improvement has been largely achieved by scaling up the dataset with noisy image-text pairs collected from the web, which is a suboptimal source of supervision. In… ▽ More

    Submitted 15 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  20. arXiv:2201.05966  [pdf, other

    cs.CL

    UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

    Authors: Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu

    Abstract: Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases. Since the inputs and outputs of SKG tasks are heterogeneous, they have been studied separately by different communities, which limits systematic and compatible research on SKG. In this paper, we overcome this limitation… ▽ More

    Submitted 18 October, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  21. arXiv:2201.05176  [pdf, other

    cs.IR cs.CL

    Neural Approaches to Conversational Information Retrieval

    Authors: Jianfeng Gao, Chenyan Xiong, Paul Bennett, Nick Craswell

    Abstract: A conversational information retrieval (CIR) system is an information retrieval (IR) system with a conversational interface which allows users to interact with the system to seek information via multi-turn conversations of natural language, in spoken or written form. Recent progress in deep learning has brought tremendous improvements in natural language processing (NLP) and conversational AI, lea… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: Book Draft

  22. arXiv:2201.04399  [pdf, other

    cs.IR cs.AI cs.LG

    RGRecSys: A Toolkit for Robustness Evaluation of Recommender Systems

    Authors: Zohreh Ovaisi, Shelby Heinecke, Jia Li, Yongfeng Zhang, Elena Zheleva, Caiming Xiong

    Abstract: Robust machine learning is an increasingly important topic that focuses on develo** models resilient to various forms of imperfect data. Due to the pervasiveness of recommender systems in online technologies, researchers have carried out several robustness studies focusing on data sparsity and profile injection attacks. Instead, we propose a more holistic view of robustness for recommender syste… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Journal ref: In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM 22), February 2022, ACM, 4 pages

  23. arXiv:2201.01427  [pdf, other

    cs.CV eess.IV

    Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation

    Authors: Yang Zhang, Yang Yang, Chenyun Xiong, Guodong Sun, Yanwen Guo

    Abstract: Encoder-decoder models have been widely used in RGBD semantic segmentation, and most of them are designed via a two-stream network. In general, jointly reasoning the color and geometric information from RGBD is beneficial for semantic segmentation. However, most existing approaches fail to comprehensively utilize multimodal information in both the encoder and decoder. In this paper, we propose a n… ▽ More

    Submitted 14 March, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: 12 pages, 6 figures

  24. Explaining with Examples: Lessons Learned from Crowdsourced Introductory Description of Information Visualizations

    Authors: Leni Yang, Cindy Xiong, Jason K. Wong, Aoyu Wu, Huamin Qu

    Abstract: Data visualizations have been increasingly used in oral presentations to communicate data patterns to the general public. Clear verbal introductions of visualizations to explain how to interpret the visually encoded information are essential to convey the takeaways and avoid misunderstandings. We contribute a series of studies to investigate how to effectively introduce visualizations to the audie… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 12 pages, 5 figures, accepted to IEEE Transaction on Visualization and Graphics

  25. arXiv:2112.08542  [pdf, other

    cs.CL

    QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization

    Authors: Alexander R. Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong

    Abstract: Factual consistency is an essential quality of text summarization models in practical settings. Existing work in evaluating this dimension can be broadly categorized into two lines of research, entailment-based and question answering (QA)-based metrics, and different experimental setups often lead to contrasting conclusions as to which paradigm performs the best. In this work, we conduct an extens… ▽ More

    Submitted 29 April, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  26. arXiv:2112.07820  [pdf, other

    cs.CV cs.AI

    Value Retrieval with Arbitrary Queries for Form-like Documents

    Authors: Mingfei Gao, Le Xue, Chetan Ramaiah, Chen Xing, Ran Xu, Caiming Xiong

    Abstract: We propose value retrieval with arbitrary queries for form-like documents to reduce human effort of processing forms. Unlike previous methods that only address a fixed set of field items, our method predicts target value for an arbitrary query based on the understanding of the layout and semantics of a form. To further boost model performance, we propose a simple document language modeling (Simple… ▽ More

    Submitted 15 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  27. arXiv:2111.10497  [pdf, ps, other

    cs.CL

    Combining Data-driven Supervision with Human-in-the-loop Feedback for Entity Resolution

    Authors: Wenpeng Yin, Shelby Heinecke, Jia Li, Nitish Shirish Keskar, Michael Jones, Shouzhong Shi, Stanislav Georgiev, Kurt Milich, Joseph Esposito, Caiming Xiong

    Abstract: The distribution gap between training datasets and data encountered in production is well acknowledged. Training datasets are often constructed over a fixed period of time and by carefully curating the data to be labeled. Thus, training datasets may not contain all possible variations of data that could be encountered in real-world production environments. Tasked with building an entity resolution… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Camera-ready for Data-Centric AI Workshop at NeurIPS 2021

  28. arXiv:2111.10292  [pdf

    physics.optics physics.app-ph

    Ultrahigh-$Q$ on-chip silicon-germanium microresonators

    Authors: Ryan Schilling, Chi Xiong, Swetha Kamlapurkar, Abram Falk, Nathan Marchack, Stephen Bedell, Richard Haight, Christopher Scerbo, Hanhee Paik, Jason S. Orcutt

    Abstract: We demonstrate fully crystalline, single-mode ultrahigh quality factor integrated microresonators comprising epitaxially grown Si$_{0.86}$Ge$_{0.14}$ waveguide cores with silicon claddings. These waveguides support resonances with internal $Q >10^8$ for both polarization modes, a nearly order-of-magnitude improvement over that seen in prior integrated Si photonics platforms. The maximum $Q$ is… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

  29. arXiv:2111.09452  [pdf, other

    cs.CV

    Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

    Authors: Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong

    Abstract: Despite great progress in object detection, most existing methods work only on a limited set of object categories, due to the tremendous human effort needed for bounding-box annotations of training data. To alleviate the problem, recent open vocabulary and zero-shot detection methods attempt to detect novel object categories beyond those seen during training. They achieve this goal by training on… ▽ More

    Submitted 13 July, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  30. arXiv:2110.15439  [pdf, other

    cs.IR

    Dense Hierarchical Retrieval for Open-Domain Question Answering

    Authors: Ye Liu, Kazuma Hashimoto, Yingbo Zhou, Semih Yavuz, Caiming Xiong, Philip S. Yu

    Abstract: Dense neural text retrieval has achieved promising results on open-domain Question Answering (QA), where latent representations of questions and passages are exploited for maximum inner product search in the retrieval process. However, current dense retrievers require splitting documents into short passages that usually contain local, partial, and sometimes biased context, and highly depend on the… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 Findings

  31. arXiv:2110.10832  [pdf, other

    cs.LG cs.CV

    Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization

    Authors: Devansh Arpit, Huan Wang, Yingbo Zhou, Caiming Xiong

    Abstract: In Domain Generalization (DG) settings, models trained independently on a given set of training domains have notoriously chaotic performance on distribution shifted test domains, and stochasticity in optimization (e.g. seed) plays a big role. This makes deep learning models unreliable in real world settings. We first show that this chaotic behavior exists even along the training optimization traje… ▽ More

    Submitted 10 October, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2022

  32. arXiv:2110.10303  [pdf, other

    cs.CV cs.LG

    Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE

    Authors: Devansh Arpit, Aadyot Bhatnagar, Huan Wang, Caiming Xiong

    Abstract: Wasserstein autoencoder (WAE) shows that matching two distributions is equivalent to minimizing a simple autoencoder (AE) loss under the constraint that the latent space of this AE matches a pre-specified prior distribution. This latent space distribution matching is a core component of WAE, and a challenging task. In this paper, we propose to use the contrastive learning framework that has been s… ▽ More

    Submitted 15 February, 2023; v1 submitted 19 October, 2021; originally announced October 2021.

  33. arXiv:2110.10293  [pdf, other

    cs.LG cs.CV

    Learning Rich Nearest Neighbor Representations from Self-supervised Ensembles

    Authors: Bram Wallace, Devansh Arpit, Huan Wang, Caiming Xiong

    Abstract: Pretraining convolutional neural networks via self-supervision, and applying them in transfer learning, is an incredibly fast-growing field that is rapidly and iteratively improving performance across practically all image domains. Meanwhile, model ensembling is one of the most universally applicable techniques in supervised learning literature and practice, offering a simple solution to reliably… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  34. arXiv:2110.10048  [pdf, other

    cs.CV

    Improving Tail-Class Representation with Centroid Contrastive Learning

    Authors: Anthony Meng Huat Tiong, Junnan Li, Guosheng Lin, Boyang Li, Caiming Xiong, Steven C. H. Hoi

    Abstract: In vision domain, large-scale natural datasets typically exhibit long-tailed distribution which has large class imbalance between head and tail classes. This distribution poses difficulty in learning good representations for tail classes. Recent developments have shown good long-tailed model can be learnt by decoupling the training into representation learning and classifier balancing. However, th… ▽ More

    Submitted 4 May, 2023; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Add in acknowledgment

  35. arXiv:2110.08222  [pdf, other

    cs.CL

    DialFact: A Benchmark for Fact-Checking in Dialogue

    Authors: Prakhar Gupta, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong

    Abstract: Fact-checking is an essential tool to mitigate the spread of misinformation and disinformation. We introduce the task of fact-checking in dialogue, which is a relatively unexplored area. We construct DialFact, a testing benchmark dataset of 22,245 annotated conversational claims, paired with pieces of evidence from Wikipedia. There are three sub-tasks in DialFact: 1) Verifiable claim detection tas… ▽ More

    Submitted 24 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022

  36. arXiv:2110.08175  [pdf, other

    cs.CL

    MixQG: Neural Question Generation with Mixed Answer Types

    Authors: Lidiya Murakhovs'ka, Chien-Sheng Wu, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong

    Abstract: Asking good questions is an essential ability for both human and machine intelligence. However, existing neural question generation approaches mainly focus on the short factoid type of answers. In this paper, we propose a neural question generator, MixQG, to bridge this gap. We combine 9 question answering datasets with diverse answer types, including yes/no, multiple-choice, extractive, and abstr… ▽ More

    Submitted 31 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: camera-ready version

  37. arXiv:2110.07581  [pdf, other

    cs.IR cs.CL cs.LG

    Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

    Authors: Ji Xin, Chenyan Xiong, Ashwin Srinivasan, Ankita Sharma, Damien Jose, Paul N. Bennett

    Abstract: Dense retrieval (DR) methods conduct text retrieval by first encoding texts in the embedding space and then matching them by nearest neighbor search. This requires strong locality properties from the representation space, i.e, the close allocations of each small group of relevant texts, which are hard to generalize to domains without sufficient training data. In this paper, we aim to improve the g… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  38. arXiv:2110.05367  [pdf, other

    cs.CL cs.CY cs.LG

    Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting

    Authors: Zahra Fatemi, Chen Xing, Wenhao Liu, Caiming Xiong

    Abstract: Existing studies addressing gender bias of pre-trained language models, usually build a small gender-neutral data set and conduct a second phase pre-training on the model with such data. However, given the limited size and concentrated focus of the gender-neutral data, catastrophic forgetting would occur during second-phase pre-training. Forgetting information in the original training data may dam… ▽ More

    Submitted 30 June, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: This paper has been accepted at the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  39. arXiv:2110.04889  [pdf, other

    cs.CL

    Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence Annotation

    Authors: Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber, Hal Daumé III

    Abstract: Open-domain question answering answers a question based on evidence retrieved from a large corpus. State-of-the-art neural approaches require intermediate evidence annotations for training. However, such intermediate annotations are expensive, and methods that rely on them cannot transfer to the more common setting, where only question-answer pairs are available. This paper investigates whether mo… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  40. arXiv:2110.04413  [pdf, other

    cs.CV cs.AI

    Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks

    Authors: Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong, Ran Xu

    Abstract: We propose a novel framework to evaluate the robustness of transformer-based form field extraction methods via form attacks. We introduce 14 novel form transformations to evaluate the vulnerability of the state-of-the-art field extractors against form attacks from both OCR level and form level, including OCR location/order rearrangement, form background manipulation and form field-value augmentati… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  41. arXiv:2110.04282  [pdf, other

    cs.CV cs.AI

    Field Extraction from Forms with Unlabeled Data

    Authors: Mingfei Gao, Zeyuan Chen, Nikhil Naik, Kazuma Hashimoto, Caiming Xiong, Ran Xu

    Abstract: We propose a novel framework to conduct field extraction from forms with unlabeled data. To bootstrap the training process, we develop a rule-based method for mining noisy pseudo-labels from unlabeled forms. Using the supervisory signal from the pseudo-labels, we extract a discriminative token representation from a transformer-based model by modeling the interaction between text in the form. To pr… ▽ More

    Submitted 11 April, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Spa-NLP@ACL2022

  42. arXiv:2109.11654  [pdf, other

    cs.AI cs.IR

    Modeling Dynamic Attributes for Next Basket Recommendation

    Authors: Yongjun Chen, Jia Li, Chenghao Liu, Chenxi Li, Markus Anderle, Julian McAuley, Caiming Xiong

    Abstract: Traditional approaches to next item and next basket recommendation typically extract users' interests based on their past interactions and associated static contextual information (e.g. a user id or item category). However, extracted interests can be inaccurate and become obsolete. Dynamic attributes, such as user income changes, item price changes (etc.), change over time. Such dynamics can intri… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

  43. arXiv:2109.09265  [pdf, other

    cs.LG cs.MS stat.ML

    Merlion: A Machine Learning Library for Time Series

    Authors: Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K C Krithika, Sukumar Maddineni, Daeki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven Hoi, Huan Wang

    Abstract: We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve in… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 22 pages, 1 figure, 14 tables

  44. arXiv:2109.08678  [pdf, other

    cs.CL

    RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering

    Authors: Xi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou, Caiming Xiong

    Abstract: Existing KBQA approaches, despite achieving strong performance on i.i.d. test data, often struggle in generalizing to questions involving unseen KB schema items. Prior ranking-based approaches have shown some success in generalization, but suffer from the coverage issue. We present RnG-KBQA, a Rank-and-Generate approach for KBQA, which remedies the coverage issue with a generation model while pres… ▽ More

    Submitted 21 March, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: ACL 2022 Camera-ready

  45. arXiv:2109.04562  [pdf, other

    cs.CL

    TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling

    Authors: Huiyuan Xie, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Ann Copestake

    Abstract: Human conversations naturally evolve around different topics and fluently move between them. In research on dialog systems, the ability to actively and smoothly transition to new topics is often ignored. In this paper we introduce TIAGE, a new topic-shift aware dialog benchmark constructed utilizing human annotations on topic shifts. Based on TIAGE, we introduce three tasks to investigate differen… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to appear in Findings of EMNLP 2021

  46. arXiv:2108.13454  [pdf, other

    cs.IR cs.AI

    Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback

    Authors: HongChien Yu, Chenyan Xiong, Jamie Callan

    Abstract: Dense retrieval systems conduct first-stage retrieval using embedded representations and simple similarity metrics to match a query to documents. Its effectiveness depends on encoded embeddings to capture the semantics of queries and documents, a challenging task due to the shortness and ambiguity of search queries. This paper proposes ANCE-PRF, a new query encoder that uses pseudo relevance feedb… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted at CIKM 2021

  47. arXiv:2108.13270  [pdf, ps, other

    cs.HC

    Making the Invisible Visible: Risks and Benefits of Disclosing Metadata in Visualization

    Authors: Alyxander Burns, Thai On, Christiana Lee, Rachel Shapiro, Cindy Xiong, Narges Mahyar

    Abstract: Accompanying a data visualization with metadata may benefit readers by facilitating content understanding, strengthening trust, and providing accountability. However, providing this kind of information may also have negative, unintended consequences, such as biasing readers' interpretations, a loss of trust as a result of too much transparency, and the possibility of opening visualization creators… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: To appear in the Visualization for Social Good Workshop at VIS 2021

  48. arXiv:2108.08407  [pdf, other

    cs.HC

    Show or Tell? Visual and Verbal Representations Bias Position Recall

    Authors: Cristina R. Ceja, Cindy Xiong

    Abstract: When we view visualizations, we not only have a visual representation of the data, but also a verbal one. Recent work has shown that these visual representations of data can be biased, such that the position of a line in a chart will be consistently underestimated. But are the verbal representations of position encodings also biased in the same manner, or is this a purely visual bias that can be m… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

  49. arXiv:2108.06479  [pdf, other

    cs.IR cs.AI

    Contrastive Self-supervised Sequential Recommendation with Robust Augmentation

    Authors: Zhiwei Liu, Yongjun Chen, Jia Li, Philip S. Yu, Julian McAuley, Caiming Xiong

    Abstract: Sequential Recommendationdescribes a set of techniques to model dynamic user behavior in order to predict future interactions in sequential user data. At their core, such approaches model transition probabilities between items in a sequence, whether through Markov chains, recurrent networks, or more recently, Transformers. However both old and new issues remain, including data-sparsity and noisy d… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: Under-review. Work done during Zhiwei's intern at Salesforce. Inc

  50. arXiv:2108.06370  [pdf, other

    cs.HC

    Visual Arrangements of Bar Charts Influence Comparisons in Viewer Takeaways

    Authors: Cindy Xiong, Vidya Setlur, Benjamin Bach, Kylie Lin, Eunyee Koh, Steven Franconeri

    Abstract: Well-designed data visualizations can lead to more powerful and intuitive processing by a viewer. To help a viewer intuitively compare values to quickly generate key takeaways, visualization designers can manipulate how data values are arranged in a chart to afford particular comparisons. Using simple bar charts as a case study, we empirically tested the comparison affordances of four common arran… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: 12 pages (9 + 2 pages of references), 9 figures, for IEEE VIS conference (TVCG journal)