Skip to main content

Showing 1–11 of 11 results for author: Zhao, V Y

.
  1. arXiv:2311.08422  [pdf

    cs.LG

    k-Parameter Approach for False In-Season Anomaly Suppression in Daily Time Series Anomaly Detection

    Authors: Vincent Yuansang Zha, Vaishnavi Kommaraju, Okenna Obi-Njoku, Vijay Dakshinamoorthy, Anirudh Agnihotri, Nantes Kirsten

    Abstract: Detecting anomalies in a daily time series with a weekly pattern is a common task with a wide range of applications. A typical way of performing the task is by using decomposition method. However, the method often generates false positive results where a data point falls within its weekly range but is just off from its weekday position. We refer to this type of anomalies as "in-season anomalies",… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 5 pages, 7 figures

  2. arXiv:2305.14128  [pdf, other

    cs.CL cs.AI

    Dr.ICL: Demonstration-Retrieved In-context Learning

    Authors: Man Luo, Xin Xu, Zhuyun Dai, Panupong Pasupat, Mehran Kazemi, Chitta Baral, Vaiva Imbrasaite, Vincent Y Zhao

    Abstract: In-context learning (ICL), teaching a large language model (LLM) to perform a task with few-shot demonstrations rather than adjusting the model parameters, has emerged as a strong paradigm for using LLMs. While early studies primarily used a fixed or random set of demonstrations for all test queries, recent research suggests that retrieving semantically similar demonstrations to the input from a p… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  3. arXiv:2304.04947  [pdf, other

    cs.CL

    Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

    Authors: Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang

    Abstract: We propose Conditional Adapter (CoDA), a parameter-efficient transfer learning method that also improves inference efficiency. CoDA generalizes beyond standard adapter approaches to enable a new way of balancing speed and accuracy using conditional computation. Starting with an existing dense pretrained model, CoDA adds sparse activation together with a small number of new parameters and a light-w… ▽ More

    Submitted 26 November, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: NeurIPS camera ready version

  4. arXiv:2304.01982  [pdf, other

    cs.CL cs.IR

    Rethinking the Role of Token Retrieval in Multi-Vector Retrieval

    Authors: **hyuk Lee, Zhuyun Dai, Sai Meher Karthik Duddu, Tao Lei, Iftekhar Naim, Ming-Wei Chang, Vincent Y. Zhao

    Abstract: Multi-vector retrieval models such as ColBERT [Khattab and Zaharia, 2020] allow token-level interactions between queries and documents, and hence achieve state of the art on many information retrieval benchmarks. However, their non-linear scoring function cannot be scaled to millions of documents, necessitating a three-stage process for inference: retrieving initial candidates via token retrieval,… ▽ More

    Submitted 8 April, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2023. Code available at https://github.com/google-deepmind/xtr

  5. arXiv:2211.01267  [pdf, other

    cs.CL cs.IR

    Multi-Vector Retrieval as Sparse Alignment

    Authors: Yujie Qian, **hyuk Lee, Sai Meher Karthik Duddu, Zhuyun Dai, Siddhartha Brahma, Iftekhar Naim, Tao Lei, Vincent Y. Zhao

    Abstract: Multi-vector retrieval models improve over single-vector dual encoders on many information retrieval tasks. In this paper, we cast the multi-vector retrieval problem as sparse alignment between query and document tokens. We propose AligneR, a novel multi-vector retrieval model that learns sparsified pairwise alignments between query and document tokens (e.g. `dog' vs. `puppy') and per-token unary… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  6. arXiv:2210.08726  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    RARR: Researching and Revising What Language Models Say, Using Language Models

    Authors: Luyu Gao, Zhuyun Dai, Panupong Pasupat, Anthony Chen, Arun Tejasvi Chaganty, Yicheng Fan, Vincent Y. Zhao, Ni Lao, Hongrae Lee, Da-Cheng Juan, Kelvin Guu

    Abstract: Language models (LMs) now excel at many tasks such as few-shot learning, question answering, reasoning, and dialog. However, they sometimes generate unsupported or misleading content. A user cannot easily determine whether their outputs are trustworthy or not, because most LMs do not have any built-in mechanism for attribution to external evidence. To enable attribution while still preserving all… ▽ More

    Submitted 31 May, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  7. arXiv:2209.11755  [pdf, other

    cs.CL cs.IR

    Promptagator: Few-shot Dense Retrieval From 8 Examples

    Authors: Zhuyun Dai, Vincent Y. Zhao, Ji Ma, Yi Luan, Jianmo Ni, **g Lu, Anton Bakalov, Kelvin Guu, Keith B. Hall, Ming-Wei Chang

    Abstract: Much recent research on information retrieval has focused on how to transfer from one task (typically with abundant supervised data) to various other tasks where supervision is limited, with the implicit assumption that it is possible to generalize from one task to all the rest. However, this overlooks the fact that there are many diverse and unique retrieval tasks, each targeting different search… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

  8. arXiv:2204.06159  [pdf

    physics.bio-ph q-bio.BM

    Systematic conformation-to-phenotype map** via limited deep-sequencing of proteins

    Authors: Eugene Serebryany, Victor Y. Zhao, Kibum Park, Amir Bitran, Sunia A. Trauger, Bogdan Budnik, Eugene I. Shakhnovich

    Abstract: Non-native conformations drive protein misfolding diseases, complicate bioengineering efforts, and fuel molecular evolution. No current experimental technique is well-suited for elucidating them and their phenotypic effects. Especially intractable are the transient conformations populated by intrinsically disordered proteins. We describe an approach to systematically discover, stabilize, and purif… ▽ More

    Submitted 29 January, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

  9. arXiv:2112.07899  [pdf, other

    cs.IR cs.CL

    Large Dual Encoders Are Generalizable Retrievers

    Authors: Jianmo Ni, Chen Qu, **g Lu, Zhuyun Dai, Gustavo Hernández Ábrego, Ji Ma, Vincent Y. Zhao, Yi Luan, Keith B. Hall, Ming-Wei Chang, Yinfei Yang

    Abstract: It has been shown that dual encoders trained on one domain often fail to generalize to other domains for retrieval tasks. One widespread belief is that the bottleneck layer of a dual encoder, where the final score is simply a dot-product between a query vector and a passage vector, is too limited to make dual encoders an effective retrieval model for out-of-domain generalization. In this paper, we… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  10. arXiv:2112.06646  [pdf

    econ.GN

    The Burst Market: the Next Leap for Humanity

    Authors: Vincent Yuansang Zha

    Abstract: Contemporary society grapples with a critical challenge in knowledge sharing: the scarcity of rapid, yet specific advice from relevant individuals. This situation underscores a deficiency in the existing labor market, hereafter referred to as the Conventional Market (CM), which struggles to support "Burst Jobs" - brief conversational tasks. This paper identifies high transaction costs, owing to te… ▽ More

    Submitted 10 January, 2024; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 19 pages, no figure

  11. arXiv:2109.01652  [pdf, other

    cs.CL

    Finetuned Language Models Are Zero-Shot Learners

    Authors: Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le

    Abstract: This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning -- finetuning language models on a collection of tasks described via instructions -- substantially improves zero-shot performance on unseen tasks. We take a 137B parameter pretrained language model and instruction-tune it on over 60 NLP tasks verbalized via natur… ▽ More

    Submitted 8 February, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Version 5. Find list of changes in Appendix F (page 35)