Skip to main content

Showing 1–13 of 13 results for author: Kung, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13892  [pdf, other

    cs.CL

    Adaptable Logical Control for Large Language Models

    Authors: Honghua Zhang, Po-Nien Kung, Masahiro Yoshida, Guy Van den Broeck, Nanyun Peng

    Abstract: Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Mark… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2404.04763  [pdf, other

    cs.CV cs.AI

    GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling

    Authors: Hritik Bansal, Po-Nien Kung, P. Jeffrey Brantingham, Kai-Wei Chang, Nanyun Peng

    Abstract: Multimodal event argument role labeling (EARL), a task that assigns a role for each event participant (object) in an image is a complex challenge. It requires reasoning over the entire image, the depicted event, and the interactions between various objects participating in the event. Existing models heavily rely on high-quality event-annotated training data to understand the event semantics and st… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 20 pages, 15 Figures, 13 figures

  3. arXiv:2403.02586  [pdf, other

    cs.CL

    Improving Event Definition Following For Zero-Shot Event Detection

    Authors: Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng

    Abstract: Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types, and prompt them with unseen event definitions. These approaches yield sporadic successes, yet generally fall short of expectations. In this work, we aim to improve zero-shot event detection by training models to better follow event definitions. We hypothesize that a diverse set of ev… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2311.00288  [pdf, other

    cs.CL cs.AI

    Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks

    Authors: Po-Nien Kung, Fan Yin, Di Wu, Kai-Wei Chang, Nanyun Peng

    Abstract: Instruction tuning (IT) achieves impressive zero-shot generalization results by training large language models (LLMs) on a massive amount of diverse tasks with instructions. However, how to select new tasks to improve the performance and generalizability of IT models remains an open question. Training on all existing tasks is impractical due to prohibiting computation requirements, and randomly se… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Main

  5. arXiv:2310.02529  [pdf, other

    cs.SI cs.AI cs.HC

    MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways

    Authors: Mingyu Derek Ma, Alexander K. Taylor, Nuan Wen, Yanchen Liu, Po-Nien Kung, Wenna Qin, Shicheng Wen, Azure Zhou, Diyi Yang, Xuezhe Ma, Nanyun Peng, Wei Wang

    Abstract: We present MIDDAG, an intuitive, interactive system that visualizes the information propagation paths on social media triggered by COVID-19-related news articles accompanied by comprehensive insights, including user/community susceptibility level, as well as events and popular opinions raised by the crowd while propagating the information. Besides discovering information flow patterns among users,… ▽ More

    Submitted 20 February, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: To appear at AAAI'24. System demo video and more info: info-pathways.github.io

  6. arXiv:2309.04937  [pdf, other

    cs.RO

    LONER: LiDAR Only Neural Representations for Real-Time SLAM

    Authors: Seth Isaacson, Pou-Chun Kung, Mani Ramanagopal, Ram Vasudevan, Katherine A. Skinner

    Abstract: This paper proposes LONER, the first real-time LiDAR SLAM algorithm that uses a neural implicit scene representation. Existing implicit map** methods for LiDAR show promising results in large-scale reconstruction, but either require groundtruth poses or run slower than real-time. In contrast, LONER uses LiDAR data to train an MLP to estimate a dense map in real-time, while simultaneously estimat… ▽ More

    Submitted 23 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: First two authors equally contributed. Webpage: https://umautobots.github.io/loner

  7. arXiv:2305.15090  [pdf, other

    cs.CL cs.AI

    STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models

    Authors: Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung, P. Jeffrey Brantingham, Nanyun Peng, Wei Wang

    Abstract: Information extraction tasks such as event extraction require an in-depth understanding of the output structure and sub-task dependencies. They heavily rely on task-specific training data in the form of (passage, target structure) pairs to obtain reasonable performance. However, obtaining such data through human annotation is costly, leading to a pressing need for low-resource information extracti… ▽ More

    Submitted 20 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear at AAAI'24. More info is at https://derek.ma/STAR

  8. arXiv:2305.11383  [pdf, other

    cs.AI

    Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning

    Authors: Po-Nien Kung, Nanyun Peng

    Abstract: Recent works on instruction tuning (IT) have achieved great performance with zero-shot generalizability to unseen tasks. With additional context (e.g., task definition, examples) provided to models for fine-tuning, they achieved much higher performance than untuned models. Despite impressive performance gains, what models learn from IT remains understudied. In this work, we analyze how models util… ▽ More

    Submitted 25 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

  9. arXiv:2112.04282  [pdf, other

    cs.RO cs.LG

    Radar Occupancy Prediction with Lidar Supervision while Preserving Long-Range Sensing and Penetrating Capabilities

    Authors: Pou-Chun Kung, Chieh-Chih Wang, Wen-Chieh Lin

    Abstract: Radar shows great potential for autonomous driving by accomplishing long-range sensing under diverse weather conditions. But radar is also a particularly challenging sensing modality due to the radar noises. Recent works have made enormous progress in classifying free and occupied spaces in radar images by leveraging lidar label supervision. However, there are still several unsolved issues. Firstl… ▽ More

    Submitted 17 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in RA-L 2022

  10. arXiv:2110.05221  [pdf, other

    cs.CL cs.AI

    Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems

    Authors: Po-Nien Kung, Chung-Cheng Chang, Tse-Hsuan Yang, Hsin-Kai Hsu, Yu-Jia Liou, Yun-Nung Chen

    Abstract: Task-oriented dialogue systems have been a promising area in the NLP field. Previous work showed the effectiveness of using a single GPT-2 based model to predict belief states and responses via causal language modeling. In this paper, we leverage multi-task learning techniques to train a GPT-2 based model on a more challenging dataset with multiple domains, multiple modalities, and more diversity… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  11. arXiv:2103.07908  [pdf, other

    cs.RO

    A Normal Distribution Transform-Based Radar Odometry Designed For Scanning and Automotive Radars

    Authors: Pou-Chun Kung, Chieh-Chih Wang, Wen-Chieh Lin

    Abstract: Existing radar sensors can be classified into automotive and scanning radars. While most radar odometry (RO) methods are only designed for a specific type of radar, our RO method adapts to both scanning and automotive radars. Our RO is simple yet effective, where the pipeline consists of thresholding, probabilistic submap building, and an NDT-based radar scan matching. The proposed RO has been tes… ▽ More

    Submitted 30 March, 2023; v1 submitted 14 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in ICRA 2021. Code is available: For scanning RO, see https://github.com/kungfrank/pw_ndt_radar_scan_matching . For automotive RO, see https://github.com/kungfrank/pw_ndt_automotive_radar_scan_matching

  12. arXiv:1512.05840   

    cs.CY cs.LG stat.ML

    Deep Poisson Factorization Machines: factor analysis for map** behaviors in journalist ecosystem

    Authors: Pau Perng-Hwa Kung

    Abstract: Newsroom in online ecosystem is difficult to untangle. With prevalence of social media, interactions between journalists and individuals become visible, but lack of understanding to inner processing of information feedback loop in public sphere leave most journalists baffled. Can we provide an organized view to characterize journalist behaviors on individual level to know better of the ecosystem?… ▽ More

    Submitted 29 December, 2017; v1 submitted 17 December, 2015; originally announced December 2015.

    Comments: Incomplete work, will re-upload once the details and implementations are straightened out

  13. arXiv:1511.05842  [pdf, other

    cs.CY

    Measuring Responsiveness in the Online Public Sphere for the 2016 U.S. Election: Concepts

    Authors: Pau Perng-Hwa Kung, Deb Roy

    Abstract: The election narrative is formed under the competitions of ideas among critical players involving politicians, news media, public influentials, and the general public. Untangling the complex process of narrative formation, however, is no easy task due to implicit influences among the key players. This paper outlines a conceptual framework to untangle this complex process. We propose the problem of… ▽ More

    Submitted 9 December, 2015; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: 7 pages, Workshop on Networks in the Social and Information Sciences NIPS 2015