Showing 1–23 of 23 results for author: Kao, J

Searching in archive cs.
  1. arXiv:2406.01641  [pdf, other]

    cs.MA cs.AI

    Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

    Authors: John L. Zhou, Weizhe Hong, Jonathan C. Kao

    Abstract: Emergent cooperation among self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naïve reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging class of opponent-shaping methods have demonstrated the ability to re… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures

  2. arXiv:2406.01538  [pdf, other]

    cs.CL cs.AI

    What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores

    Authors: Ebrahim Feghhi, Nima Hadidi, Bryan Song, Idan A. Blank, Jonathan C. Kao

    Abstract: Given the remarkable capabilities of large language models (LLMs), there has been a growing interest in evaluating their similarity to the human brain. One approach towards quantifying this similarity is by measuring how well a model predicts neural signals, also called "brain score". Internal representations from LLMs achieve state-of-the-art brain scores, leading to speculation that they share c… ▽ More

    Submitted 20 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures in the main paper

  3. arXiv:2405.16557  [pdf, other]

    cs.LG cs.AI

    Scalable Numerical Embeddings for Multivariate Time Series: Enhancing Healthcare Data Representation Learning

    Authors: Chun-Kai Huang, Yi-Hsien Hsieh, Ta-Jung Chien, Li-Cheng Chien, Shao-Hua Sun, Tung-Hung Su, Jia-Horng Kao, Che Lin

    Abstract: Multivariate time series (MTS) data, when sampled irregularly and asynchronously, often present extensive missing values. Conventional methodologies for MTS analysis tend to rely on temporal embeddings based on timestamps that necessitate subsequent imputations, yet these imputed values frequently deviate substantially from their actual counterparts, thereby compromising prediction accuracy. Furth… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2312.05187  [pdf, other]

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  5. arXiv:2310.08795  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Mitigating Bias for Question Answering Models by Tracking Bias Influence

    Authors: Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng

    Abstract: Models of various NLP tasks have been shown to exhibit stereotypes, and the bias in the question answering (QA) models is especially harmful as the output answers might be directly consumed by the end users. There have been datasets to evaluate bias in QA models, while bias mitigation technique for the QA models is still under-explored. In this work, we propose BMBI, an approach to mitigate the bi… ▽ More

    Submitted 17 June, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: To appear at NAACL 2024 main conference

  6. Existence of Pauli-like stabilizers for every quantum error-correcting code

    Authors: Jhih-Yuan Kao, Hsi-Sheng Goan

    Abstract: The Pauli stabilizer formalism is perhaps the most thoroughly studied means of procuring quantum error-correcting codes, whereby the code is obtained through commutative Pauli operators and "stabilized" by them. In this work we will show that every quantum error-correcting code, including Pauli stabilizer codes and subsystem codes, has a similar structure, in that the code can be stabilized by c… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 20 pages (including 7 appendices); to appear in Phys. Rev. A

    Journal ref: Physical Review A 108, 032414 (2023)

  7. arXiv:2308.11596  [pdf, other]

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s… ▽ More

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  8. arXiv:2301.10915  [pdf, other]

    cs.CL cs.AI

    Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning

    Authors: Mingyu Derek Ma, Jiun-Yu Kao, Shuyang Gao, Arpit Gupta, Di Jin, Tagyoung Chung, Nanyun Peng

    Abstract: Dialogue state tracking (DST) is an important step in dialogue management to keep track of users' beliefs. Existing works fine-tune all language model (LM) parameters to tackle the DST task, which requires significant data and computing resources for training and hosting. The cost grows exponentially in the real-world deployment where dozens of fine-tuned LM are used for different domains and task… ▽ More

    Submitted 29 May, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: In the INTERSPEECH 2023, and the Second Workshop on Efficient Natural Language and Speech Processing (ENLSP) at NeurIPS 2022

  9. arXiv:2301.10606  [pdf, other]

    cs.CL cs.SD eess.AS

    A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

    Authors: Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen

    Abstract: Expressive speech-to-speech translation (S2ST) aims to transfer prosodic attributes of source speech to target speech while maintaining translation accuracy. Existing research in expressive S2ST is limited, typically focusing on a single expressivity aspect at a time. Likewise, this research area lacks standard evaluation protocols and well-curated benchmark datasets. In this work, we propose a ho… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: This is the full version of our submission to ICASSP 2023

  10. arXiv:2212.08486  [pdf, other]

    cs.CL

    BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric

    Authors: Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà

    Abstract: End-to-End speech-to-speech translation (S2ST) is generally evaluated with text-based metrics. This means that generated speech has to be automatically transcribed, making the evaluation dependent on the availability and quality of automatic speech recognition (ASR) systems. In this paper, we propose a text-free evaluation metric for end-to-end S2ST, named BLASER, to avoid the dependency on ASR sy… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    ACM Class: I.2.7

  11. arXiv:2211.06474  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech-to-Speech Translation For A Real-world Unwritten Language

    Authors: Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

    Abstract: We study speech-to-speech translation (S2ST) that translates speech from one language into another language and focuses on building systems to support languages without standard text writing systems. We use English-Taiwanese Hokkien as a case study, and present an end-to-end solution from training data collection, modeling choices to benchmark dataset release. First, we present efforts on creating… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  12. arXiv:2205.12239  [pdf, other]

    cs.LG cs.CV cs.IT

    Gacs-Korner Common Information Variational Autoencoder

    Authors: Michael Kleinman, Alessandro Achille, Stefano Soatto, Jonathan Kao

    Abstract: We propose a notion of common information that allows one to quantify and separate the information that is shared between two random variables from the information that is unique to each. Our notion of common information is defined by an optimization problem over a family of functions and recovers the Gács-Körner common information as a special case. Importantly, our notion can be approximated emp… ▽ More

    Submitted 5 November, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2109.12211  [pdf, other]

    cs.CL

    Style Control for Schema-Guided Natural Language Generation

    Authors: Alicia Y. Tsai, Shereen Oraby, Vittorio Perera, Jiun-Yu Kao, Yuheng Du, Anjali Narayan-Chen, Tagyoung Chung, Dilek Hakkani-Tur

    Abstract: Natural Language Generation (NLG) for task-oriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously accomplish specific stylistic goals, such as response length, point-of-view, descriptiveness, sentiment, formality, and empathy. In this work, we focu… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at the 3rd Workshop on NLP for ConvAI at EMNLP '21

  14. arXiv:2104.09088  [pdf, other]

    cs.CL cs.LG

    Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems

    Authors: Anish Acharya, Suranjit Adhikari, Sanchit Agarwal, Vincent Auvray, Nehal Belgamwar, Arijit Biswas, Shubhra Chandra, Tagyoung Chung, Maryam Fazel-Zarandi, Raefer Gabriel, Shuyang Gao, Rahul Goel, Dilek Hakkani-Tur, Jan Jezabek, Abhay Jha, Jiun-Yu Kao, Prakash Krishnan, Peter Ku, Anuj Goyal, Chien-Wei Lin, Qing Liu, Arindam Mandal, Angeliki Metallinou, Vishal Naik, Yi Pan, et al. (6 additional authors not shown)

    Abstract: Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new domain, limiting scalability of such systems. Similarly, rule-based dialogue systems require extensive writing and maintenance of rules and… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Journal ref: NAACL 2021 System Demonstrations Track

  15. arXiv:2010.02459  [pdf, other]

    cs.LG cs.IT stat.ML

    Usable Information and Evolution of Optimal Representations During Training

    Authors: Michael Kleinman, Alessandro Achille, Daksh Idnani, Jonathan C. Kao

    Abstract: We introduce a notion of usable information contained in the representation learned by a deep network, and use it to study how optimal representations for the task emerge during training. We show that the implicit regularization coming from training with Stochastic Gradient Descent with a high learning-rate and small batch size plays an important role in learning minimal sufficient representations… ▽ More

    Submitted 28 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: ICLR 2021

  16. Noise Robust Named Entity Understanding for Voice Assistants

    Authors: Deepak Muralidharan, Joel Ruben Antony Moniz, Sida Gao, Xiao Yang, Justine Kao, Stephen Pulman, Atish Kothari, Ray Shen, Yinying Pan, Vivek Kaul, Mubarak Seyed Ibrahim, Gang Xiang, Nan Dun, Yidan Zhou, Andy O, Yuan Zhang, Pooja Chitkara, Xuan Wang, Alkesh Patel, Kushal Tayal, Roger Zheng, Peter Grasch, Jason D. Williams, Lin Li

    Abstract: Named Entity Recognition (NER) and Entity Linking (EL) play an essential role in voice assistant interaction, but are challenging due to the special difficulties associated with spoken user queries. In this paper, we propose a novel architecture that jointly solves the NER and EL tasks by combining them in a joint reranking module. We show that our proposed framework improves NER accuracy by up to… ▽ More

    Submitted 10 August, 2021; v1 submitted 29 May, 2020; originally announced May 2020.

    Comments: NAACL 2021 Industry Track

    MSC Class: 68T50 ACM Class: I.2.7

  17. arXiv:1910.00458  [pdf, other]

    cs.CL cs.LG

    MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension

    Authors: Di Jin, Shuyang Gao, Jiun-Yu Kao, Tagyoung Chung, Dilek Hakkani-tur

    Abstract: Machine Reading Comprehension (MRC) for question answering (QA), which aims to answer a question given the relevant context passages, is an important way to test the ability of intelligence systems to understand human language. Multiple-Choice QA (MCQA) is one of the most difficult tasks in MRC because it often requires more advanced reading comprehension skills such as logical reasoning, summariz… ▽ More

    Submitted 18 November, 2019; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Accepted by AAAI 2020

  18. arXiv:1909.09143  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Leveraging User Engagement Signals For Entity Labeling in a Virtual Assistant

    Authors: Deepak Muralidharan, Justine Kao, Xiao Yang, Lin Li, Lavanya Viswanathan, Mubarak Seyed Ibrahim, Kevin Luikens, Stephen Pulman, Ashish Garg, Atish Kothari, Jason Williams

    Abstract: Personal assistant AI systems such as Siri, Cortana, and Alexa have become widely used as a means to accomplish tasks through natural language commands. However, components in these systems generally rely on supervised machine learning algorithms that require large amounts of hand-annotated training data, which is expensive and time consuming to collect. The ability to incorporate unsupervised, we… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2018 Conversational AI Workshop

  19. arXiv:1908.11404  [pdf]

    cs.LG stat.ML

    Active Learning for Domain Classification in a Commercial Spoken Personal Assistant

    Authors: Xi C. Chen, Adithya Sagar, Justine T. Kao, Tony Y. Li, Christopher Klein, Stephen Pulman, Ashish Garg, Jason D. Williams

    Abstract: We describe a method for selecting relevant new training data for the LSTM-based domain selection component of our personal assistant system. Adding more annotated training data for any ML system typically improves accuracy, but only if it provides examples not already adequately covered in the existing data. However, obtaining, selecting, and labeling relevant data is expensive. This work present… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

  20. arXiv:1702.06780  [pdf, other]

    cs.NI

    Joint Spectrum Reuse and Power Control for Multi-Sharing Device-to-Device Communication

    Authors: Kuo-Yi Chen, Jung-Chun Kao, Si-An Ciou, Shih-Han Lin

    Abstract: Compared to current mobile networks, next-generation mobile networks are expected to support higher numbers of simultaneously connected devices and to achieve higher system spectrum efficiency and lower power consumption. To achieve these goals, we study the multi-sharing device-to-device (D2D) communication, which allows any cellular user equipment to share its radio resource with multiple D2D de… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

  21. arXiv:0808.2417  [pdf, ps, other]

    cs.CC cs.FL

    On NFAs Where All States are Final, Initial, or Both

    Authors: Jui-Yi Kao, Narad Rampersad, Jeffrey Shallit

    Abstract: We examine questions involving nondeterministic finite automata where all states are final, initial, or both initial and final. First, we prove hardness results for the nonuniversality and inequivalence problems for these NFAs. Next, we characterize the languages accepted. Finally, we discuss some state complexity problems involving such automata.

    Submitted 3 July, 2009; v1 submitted 18 August, 2008; originally announced August 2008.

    Comments: submitted

  22. arXiv:0710.4728  [pdf]

    cs.AR

    Energy-Aware Routing for E-Textile Applications

    Authors: Jung-Chun Kao, Radu Marculescu

    Abstract: As the scale of electronic devices shrinks, "electronic textiles" (e-textiles) will make possible a wide variety of novel applications which are currently unfeasible. Due to the wearability concerns, low-power techniques are critical for e-textile applications. In this paper, we address the issue of the energy-aware routing for e-textile platforms and propose an efficient algorithm to solve it.… ▽ More

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

    Journal ref: Dans Design, Automation and Test in Europe - DATE'05, Munich : Allemagne (2005)

  23. arXiv:0708.3224  [pdf, ps, other]

    cs.DM math.CO

    The Frobenius Problem in a Free Monoid

    Authors: Jui-Yi Kao, Jeffrey Shallit, Zhi Xu

    Abstract: The classical Frobenius problem is to compute the largest number g not representable as a non-negative integer linear combination of non-negative integers x_1, x_2, ..., x_k, where gcd(x_1, x_2, ..., x_k) = 1. In this paper we consider generalizations of the Frobenius problem to the noncommutative setting of a free monoid. Unlike the commutative case, where the bound on g is quadratic, we are ab… ▽ More

    Submitted 23 August, 2007; originally announced August 2007.

    Comments: 19 pages; preliminary announcement

    ACM Class: F.4.3