Skip to main content

Showing 1–50 of 124 results for author: Chan, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08414  [pdf, other

    cs.LG

    Discovering Preference Optimization Algorithms with and for Large Language Models

    Authors: Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob Foerster, Mihaela van der Schaar, Robert Tjarko Lange

    Abstract: Offline preference optimization is a key method for enhancing and controlling the quality of Large Language Model (LLM) outputs. Typically, preference optimization is approached as an offline supervised learning task using manually-crafted convex loss functions. While these methods are based on theoretical insights, they are inherently constrained by human creativity, so the large search space of… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.06523  [pdf, other

    cs.CV

    NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing

    Authors: Ting-Hsuan Chen, Jiewen Chan, Hau-Shiang Shiu, Shih-Han Yen, Chang-Han Yeh, Yu-Lun Liu

    Abstract: We propose a video editing framework, NaRCan, which integrates a hybrid deformation field and diffusion prior to generate high-quality natural canonical images to represent the input video. Our approach utilizes homography to model global motion and employs multi-layer perceptrons (MLPs) to capture local residual deformations, enhancing the model's ability to handle complex video dynamics. By intr… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://koi953215.github.io/NaRCan_page/

  3. arXiv:2406.03109  [pdf, other

    cs.IR

    CAPRI-FAIR: Integration of Multi-sided Fairness in Contextual POI Recommendation Framework

    Authors: Francis Zac dela Cruz, Flora D. Salim, Yonchanok Khaokaew, Jeffrey Chan

    Abstract: Point-of-interest (POI) recommendation, a form of context-aware recommendation, takes into account spatio-temporal constraints and contexts like distance, peak business hours, and previous user check-ins. Given the ability of these kinds of systems to influence not just the consumer's travel experience, but also the POI's business, it is important to consider fairness from multiple perspectives. U… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.00031  [pdf, other

    cs.CL cs.LG

    AMGPT: a Large Language Model for Contextual Querying in Additive Manufacturing

    Authors: Achuth Chandrasekhar, Jonathan Chan, Francis Ogoke, Olabode Ajenifujah, Amir Barati Farimani

    Abstract: Generalized large language models (LLMs) such as GPT-4 may not provide specific answers to queries formulated by materials science researchers. These models may produce a high-level outline but lack the capacity to return detailed instructions on manufacturing and material properties of novel alloys. Enhancing a smaller model with specialized domain knowledge may provide an advantage over large la… ▽ More

    Submitted 24 May, 2024; originally announced June 2024.

    Comments: 54 pages, 4 figures

  5. Promoting Two-sided Fairness in Dynamic Vehicle Routing Problem

    Authors: Yufan Kang, Rongsheng Zhang, Wei Shao, Flora D. Salim, Jeffrey Chan

    Abstract: Dynamic Vehicle Routing Problem (DVRP), is an extension of the classic Vehicle Routing Problem (VRP), which is a fundamental problem in logistics and transportation. Typically, DVRPs involve two stakeholders: service providers that deliver services to customers and customers who raise requests from different locations. Many real-world applications can be formulated as DVRP such as ridesharing and… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.06786  [pdf, other

    eess.IV cs.CV

    SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

    Authors: Trevor J. Chan, Aarush Sahni, Jie Li, Alisha Luthra, Amy Fang, Alison Pouch, Chamith S. Rajapakse

    Abstract: We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model. We achieve fast and accurate segmentations in 3D images with a four-step strategy comprising: volume slicing along non-orthogonal axes, efficient prompting in 3D, slice-wise inference using the pretrained SAM, and recoposition and refinement in 3D. We evaluated… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  7. arXiv:2404.12361  [pdf, other

    cs.AI physics.med-ph

    Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models

    Authors: Trevor J. Chan, Chamith S. Rajapakse

    Abstract: Deep learning methods for accelerated MRI achieve state-of-the-art results but largely ignore additional speedups possible with noncartesian sampling trajectories. To address this gap, we created a generative diffusion model-based reconstruction algorithm for multi-coil highly undersampled spiral MRI. This model uses conditioning during training as well as frequency-based guidance to ensure consis… ▽ More

    Submitted 10 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  8. arXiv:2402.08812  [pdf, other

    cs.HC cs.AI

    Intelligent Canvas: Enabling Design-Like Exploratory Visual Data Analysis with Generative AI through Rapid Prototy**, Iteration and Curation

    Authors: Zijian Ding, Joel Chan

    Abstract: Complex data analysis inherently seeks unexpected insights through exploratory visual analysis methods, transcending logical, step-by-step processing. However, existing interfaces such as notebooks and dashboards have limitations in exploration and comparison for visual data analysis. Addressing these limitations, we introduce a "design-like" intelligent canvas environment integrating generative A… ▽ More

    Submitted 21 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  9. arXiv:2402.03104  [pdf, other

    stat.ML cs.LG

    High-dimensional Bayesian Optimization via Covariance Matrix Adaptation Strategy

    Authors: Lam Ngo, Huong Ha, Jeffrey Chan, Vu Nguyen, Hongyu Zhang

    Abstract: Bayesian Optimization (BO) is an effective method for finding the global optimum of expensive black-box functions. However, it is well known that applying BO to high-dimensional optimization problems is challenging. To address this issue, a promising solution is to use a local search strategy that partitions the search domain into local regions with high likelihood of containing the global optimum… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 31 pages, 17 figures

    Journal ref: Transactions on Machine Learning Research 2024

  10. arXiv:2402.00782  [pdf, other

    cs.LG

    Dense Reward for Free in Reinforcement Learning from Human Feedback

    Authors: Alex J. Chan, Hao Sun, Samuel Holt, Mihaela van der Schaar

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has been credited as the key advance that has allowed Large Language Models (LLMs) to effectively follow instructions and produce useful assistance. Classically, this involves generating completions from the LLM in response to a query before using a separate reward model to assign a score to the full completion. As an auto-regressive process, the L… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  11. arXiv:2401.11022  [pdf, other

    cs.HC

    Formulating or Fixating: Effects of Examples on Problem Solving Vary as a Function of Example Presentation Interface Design

    Authors: Joel Chan, Zijian Ding, Eesh Kamrah, Mark Fuge

    Abstract: Interactive systems that facilitate exposure to examples can augment problem solving performance. However designers of such systems are often faced with many practical design decisions about how users will interact with examples, with little clear theoretical guidance. To understand how example interaction design choices affect whether/how people benefit from examples, we conducted an experiment w… ▽ More

    Submitted 23 January, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  12. arXiv:2401.01753   

    cs.AI

    A Generative AI Assistant to Accelerate Cloud Migration

    Authors: Amal Vaidya, Mohan Krishna Vankayalapati, Jacky Chan, Senad Ibraimoski, Sean Moran

    Abstract: We present a tool that leverages generative AI to accelerate the migration of on-premises applications to the cloud. The Cloud Migration LLM accepts input from the user specifying the parameters of their migration, and outputs a migration strategy with an architecture diagram. A user study suggests that the migration LLM can assist inexperienced users in finding the right cloud migration profile,… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

  13. arXiv:2312.12681  [pdf, other

    cs.CL cs.AI

    Imitation of Life: A Search Engine for Biologically Inspired Design

    Authors: Hen Emuna, Nadav Borenstein, Xin Qian, Hyeonsu Kang, Joel Chan, Aniket Kittur, Dafna Shahaf

    Abstract: Biologically Inspired Design (BID), or Biomimicry, is a problem-solving methodology that applies analogies from nature to solve engineering challenges. For example, Speedo engineers designed swimsuits based on shark skin. Finding relevant biological solutions for real-world problems poses significant challenges, both due to the limited biological knowledge engineers and designers typically possess… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: To be published in the AAAI 2024 Proceedings Main Track

  14. arXiv:2312.02401  [pdf, other

    stat.ML cs.LG cs.SI

    Harmonizing Global Voices: Culturally-Aware Models for Enhanced Content Moderation

    Authors: Alex J. Chan, José Luis Redondo García, Fabrizio Silvestri, Colm O'Donnel, Konstantina Palla

    Abstract: Content moderation at scale faces the challenge of considering local cultural distinctions when assessing content. While global policies aim to maintain decision-making consistency and prevent arbitrary rule enforcement, they often overlook regional variations in interpreting natural language as expressed in content. In this study, we are looking into how moderation systems can tackle this issue b… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 12 pages, 8 Figures. Supplementary material

  15. arXiv:2311.14110  [pdf, other

    cs.LG cs.AI

    When is Off-Policy Evaluation Useful? A Data-Centric Perspective

    Authors: Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Evaluating the value of a hypothetical target policy with only a logged dataset is important but challenging. On the one hand, it brings opportunities for safe policy improvement under high-stakes scenarios like clinical guidelines. On the other hand, such opportunities raise a need for precise off-policy evaluation (OPE). While previous work on OPE focused on improving the algorithm in value esti… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Off-Policy Evaluation, Data-Centric AI, Data-Centric Reinforcement Learning, Reinforcement Learning

  16. arXiv:2311.07426  [pdf, other

    cs.LG cs.CV cs.HC

    Optimising Human-AI Collaboration by Learning Convincing Explanations

    Authors: Alex J. Chan, Alihan Huyuk, Mihaela van der Schaar

    Abstract: Machine learning models are being increasingly deployed to take, or assist in taking, complicated and high-impact decisions, from quasi-autonomous vehicles to clinical decision support systems. This poses challenges, particularly when models have hard-to-detect failure modes and are able to take actions without oversight. In order to handle this challenge, we propose a method for a collaborative s… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  17. arXiv:2311.03866  [pdf, other

    cs.CV

    SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation

    Authors: Iman Abbasnejad, Fabio Zambetta, Flora Salim, Timothy Wiley, Jeffrey Chan, Russell Gallagher, Ehsan Abbasnejad

    Abstract: SCONE-GAN presents an end-to-end image translation, which is shown to be effective for learning to generate realistic and diverse scenery images. Most current image-to-image translation approaches are devised as two map**s: a translation from the source to target domain and another to represent its inverse. While successful in many applications, these approaches may suffer from generating trivia… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures

  18. arXiv:2311.00320  [pdf, other

    cs.SD cs.LG eess.AS

    Semantic Hearing: Programming Acoustic Scenes with Binaural Hearables

    Authors: Bandhav Veluri, Malek Itani, Justin Chan, Takuya Yoshioka, Shyamnath Gollakota

    Abstract: Imagine being able to listen to the birds chir** in a park without hearing the chatter from other hikers, or being able to block out traffic noise on a busy street while still being able to hear emergency sirens and car honks. We introduce semantic hearing, a novel capability for hearable devices that enables them to, in real-time, focus on, or ignore, specific sounds from real-world environment… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  19. arXiv:2310.17894  [pdf, other

    cs.CL cs.AI

    Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey

    Authors: Weixu Zhang, Yifei Wang, Yuanfeng Song, Victor Junqiu Wei, Yuxing Tian, Yiyan Qi, Jonathan H. Chan, Raymond Chi-Wing Wong, Haiqin Yang

    Abstract: The emergence of natural language processing has revolutionized the way users interact with tabular data, enabling a shift from traditional query languages and manual plotting to more intuitive, language-based interfaces. The rise of large language models (LLMs) such as ChatGPT and its successors has further advanced this field, opening new avenues for natural language processing techniques. This… ▽ More

    Submitted 19 May, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 20 pages, 4 figures, 5 tables. Accepted by IEEE TKDE

  20. arXiv:2310.06322  [pdf, other

    cs.LG cs.AI

    Predicting Three Types of Freezing of Gait Events Using Deep Learning Models

    Authors: Wen Tao Mo, Jonathan H. Chan

    Abstract: Freezing of gait is a Parkinson's Disease symptom that episodically inflicts a patient with the inability to step or turn while walking. While medical experts have discovered various triggers and alleviating actions for freezing of gait, the underlying causes and prediction models are still being explored today. Current freezing of gait prediction models that utilize machine learning achieve high… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 5 pages

  21. arXiv:2309.15840  [pdf, other

    cs.CL cs.AI cs.LG

    How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

    Authors: Lorenzo Pacchiardi, Alex J. Chan, Sören Mindermann, Ilan Moscovitz, Alexa Y. Pan, Yarin Gal, Owain Evans, Jan Brauner

    Abstract: Large language models (LLMs) can "lie", which we define as outputting false statements despite "knowing" the truth in a demonstrable sense. LLMs might "lie", for example, when instructed to output misinformation. Here, we develop a simple lie detector that requires neither access to the LLM's activations (black-box) nor ground-truth knowledge of the fact in question. The detector works by asking a… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  22. arXiv:2309.14474  [pdf

    eess.IV cs.CV

    Gastro-Intestinal Tract Segmentation Using an Explainable 3D Unet

    Authors: Kai Li, Jonathan Chan

    Abstract: In treating gastrointestinal cancer using radiotherapy, the role of the radiation oncologist is to administer high doses of radiation, through x-ray beams, toward the tumor while avoiding the stomach and intestines. With the advent of precise radiation treatment technology such as the MR-Linac, oncologists can visualize the daily positions of the tumors and intestines, which may vary day to day. B… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 5 pages, 8 figures, 13th Joint Symposium on Computational Intelligence (JSCI13)

  23. arXiv:2309.12164  [pdf, other

    cs.PL

    Stratified Type Theory

    Authors: Jonathan Chan, Stephanie Weirich

    Abstract: A hierarchy of type universes is a rudimentary ingredient in the type theories of many proof assistants to prevent the logical inconsistency resulting from combining dependent functions and the type-in-type rule. In this work, we argue that a universe hierarchy is not the only option for a type theory with a type universe. Taking inspiration from Leivant's Stratified System F, we introduce Stratif… ▽ More

    Submitted 7 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 26 pages, 4 figures

    ACM Class: D.3.1; F.4.1

  24. arXiv:2309.04211  [pdf, other

    cs.LG cs.CY

    Counterfactual Explanations via Locally-guided Sequential Algorithmic Recourse

    Authors: Edward A. Small, Jeffrey N. Clark, Christopher J. McWilliams, Kacper Sokol, Jeffrey Chan, Flora D. Salim, Raul Santos-Rodriguez

    Abstract: Counterfactuals operationalised through algorithmic recourse have become a powerful tool to make artificial intelligence systems explainable. Conceptually, given an individual classified as y -- the factual -- we seek actions such that their prediction becomes the desired class y' -- the counterfactual. This process offers algorithmic recourse that is (1) easy to customise and interpret, and (2) d… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 7 pages, 5 figures, 3 appendix pages

  25. arXiv:2309.01214  [pdf

    cs.HC

    Immersive Technologies in Virtual Companions: A Systematic Literature Review

    Authors: Ziaullah Momand, Jonathan H. Chan, Pornchai Mongkolnam

    Abstract: The emergence of virtual companions is transforming the evolution of intelligent systems that effortlessly cater to the unique requirements of users. These advanced systems not only take into account the user present capabilities, preferences, and needs but also possess the capability to adapt dynamically to changes in the environment, as well as fluctuations in the users emotional state or behavi… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  26. i-Align: an interpretable knowledge graph alignment model

    Authors: Bayu Distiawan Trisedya, Flora D Salim, Jeffrey Chan, Damiano Spina, Falk Scholer, Mark Sanderson

    Abstract: Knowledge graphs (KGs) are becoming essential resources for many downstream applications. However, their incompleteness may limit their potential. Thus, continuous curation is needed to mitigate this problem. One of the strategies to address this problem is KG alignment, i.e., forming a more complete KG by merging two or more KGs. This paper proposes i-Align, an interpretable KG alignment model. U… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Data Min Knowl Disc (2023)

  27. arXiv:2307.11263  [pdf, other

    cs.NI eess.SP

    Underwater 3D positioning on smart devices

    Authors: Tuochao Chen, Justin Chan, Shyamnath Gollakota

    Abstract: The emergence of water-proof mobile and wearable devices (e.g., Garmin Descent and Apple Watch Ultra) designed for underwater activities like professional scuba diving, opens up opportunities for underwater networking and localization capabilities on these devices. Here, we present the first underwater acoustic positioning system for smart devices. Unlike conventional systems that use floating buo… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Journal ref: ACM SIGCOMM 2023

  28. arXiv:2305.13275  [pdf

    cs.LG

    A Machine Learning Approach to Detect Dehydration in Afghan Children

    Authors: Ziaullah Momand, Debajyoti Pal, Pornchai Mongkolnam, Jonathan H. Chan

    Abstract: Child dehydration is a significant health concern, especially among children under 5 years of age who are more susceptible to diarrhea and vomiting. In Afghanistan, severe diarrhea contributes to child mortality due to dehydration. However, there is no evidence of research exploring the potential of machine learning techniques in diagnosing dehydration in Afghan children under five. To fill this g… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  29. arXiv:2305.05279   

    cs.IR cs.AI

    Learning to Personalize Recommendation based on Customers' Shop** Intents

    Authors: Xin Shen, Jiaying Shi, Sungro Yoon, Jon Katzur, Hanbo Wang, Jim Chan, ** Li

    Abstract: Understanding the customers' high level shop** intent, such as their desire to go cam** or hold a birthday party, is critically important for an E-commerce platform; it can help boost the quality of shop** experience by enabling provision of more relevant, explainable, and diversified recommendations. However, such high level shop** intent has been overlooked in the industry due to pract… ▽ More

    Submitted 10 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: This article has been removed by arXiv administrators in response to a copyright claim by a 3rd party

  30. arXiv:2304.09779  [pdf, other

    cs.LG cs.CY math.OC math.PR

    Equalised Odds is not Equal Individual Odds: Post-processing for Group and Individual Fairness

    Authors: Edward A. Small, Kacper Sokol, Daniel Manning, Flora D. Salim, Jeffrey Chan

    Abstract: Group fairness is achieved by equalising prediction distributions between protected sub-populations; individual fairness requires treating similar individuals alike. These two objectives, however, are incompatible when a scoring model is calibrated through discontinuous probability functions, where individuals can be randomly assigned an outcome determined by a fixed probability. This procedure ma… ▽ More

    Submitted 19 April, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 25 pages, 9 figures, 4 tables

  31. arXiv:2304.08721  [pdf, other

    cs.SI cs.CY

    Are footpaths encroached by shared e-scooters? Spatio-temporal Analysis of Micro-mobility Services

    Authors: Hiruni Kegalle, Danula Hettiachchi, Jeffrey Chan, Flora Salim, Mark Sanderson

    Abstract: Micro-mobility services (e.g., e-bikes, e-scooters) are increasingly popular among urban communities, being a flexible transport option that brings both opportunities and challenges. As a growing mode of transportation, insights gained from micro-mobility usage data are valuable in policy formulation and improving the quality of services. Existing research analyses patterns and features associated… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE International Conference on Mobile Data Management

  32. arXiv:2304.07487  [pdf, other

    cs.IR

    More Is Less: When Do Recommenders Underperform for Data-rich Users?

    Authors: Yueqing Xuan, Kacper Sokol, Jeffrey Chan, Mark Sanderson

    Abstract: Users of recommender systems tend to differ in their level of interaction with these algorithms, which may affect the quality of recommendations they receive and lead to undesirable performance disparity. In this paper we investigate under what conditions the performance for data-rich and data-poor users diverges for a collection of popular evaluation metrics applied to ten benchmark datasets. We… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  33. arXiv:2304.03279  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark

    Authors: Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Steven Basart, Thomas Woodside, Jonathan Ng, Hanlin Zhang, Scott Emmons, Dan Hendrycks

    Abstract: Artificial agents have traditionally been trained to maximize reward, which may incentivize power-seeking and deception, analogous to how next-token prediction in language models (LMs) may incentivize toxicity. So do agents naturally learn to be Machiavellian? And how do we measure these behaviors in general-purpose models such as GPT-4? Towards answering these questions, we introduce MACHIAVELLI,… ▽ More

    Submitted 12 June, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: ICML 2023 Oral (camera-ready); 31 pages, 5 figures

  34. arXiv:2303.16755  [pdf, other

    cs.CL cs.AI cs.LG

    Training Language Models with Language Feedback at Scale

    Authors: Jérémy Scheurer, Jon Ander Campos, Tomasz Korbak, Jun Shern Chan, Angelica Chen, Kyunghyun Cho, Ethan Perez

    Abstract: Pretrained language models often generate outputs that are not in line with human preferences, such as harmful text or factually incorrect summaries. Recent work approaches the above issues by learning from a simple form of human feedback: comparisons between pairs of model-generated outputs. However, comparison feedback only conveys limited information about human preferences. In this paper, we i… ▽ More

    Submitted 22 February, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: Published in TMLR: https://openreview.net/forum?id=xo3hI5MwvU

  35. arXiv:2303.16749  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Improving Code Generation by Training with Natural Language Feedback

    Authors: Angelica Chen, Jérémy Scheurer, Tomasz Korbak, Jon Ander Campos, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez

    Abstract: The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback at training time instead, which we call Imitation learning from Language Feedback (ILF). ILF requires only a small amount of human-written feedbac… ▽ More

    Submitted 22 February, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: Published in (and superceded by) TMLR: https://openreview.net/forum?id=xo3hI5MwvU

  36. arXiv:2303.06430  [pdf, other

    cs.AI

    Map** the Design Space of Interactions in Human-AI Text Co-creation Tasks

    Authors: Zijian Ding, Joel Chan

    Abstract: Large Language Models (LLMs) have demonstrated impressive text generation capabilities, prompting us to reconsider the future of human-AI co-creation and how humans interact with LLMs. In this paper, we present a spectrum of content generation tasks and their corresponding human-AI interaction patterns. These tasks include: 1) fixed-scope content curation tasks with minimal human-AI interactions,… ▽ More

    Submitted 14 March, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

  37. arXiv:2302.13532  [pdf, other

    cs.CR

    Detection and Amelioration of Social Engineering Vulnerability in Contingency Table Data using an Orthogonalised Log-linear Analysis

    Authors: Glynn Rogers, Malcolm Crompton, Gaurav Sapre, Jonathan Chan

    Abstract: Social Engineering has emerged as a significant threat in cyber security. In a dialog based attack, by having enough of a potential victim's personal data to be convincing, a social engineer impersonates the victim in order to manipulate the attack's target into revealing sufficient information for accessing the victim's accounts etc. We utilise the develo** understanding of human information pr… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 28 pages, 1 figure

  38. arXiv:2302.12832  [pdf, other

    cs.CL cs.AI

    Fluid Transformers and Creative Analogies: Exploring Large Language Models' Capacity for Augmenting Cross-Domain Analogical Creativity

    Authors: Zijian Ding, Arvind Srinivasan, Stephen MacNeil, Joel Chan

    Abstract: Cross-domain analogical reasoning is a core creative ability that can be challenging for humans. Recent work has shown some proofs-of concept of Large language Models' (LLMs) ability to generate cross-domain analogies. However, the reliability and potential usefulness of this capacity for augmenting human creative work has received little systematic exploration. In this paper, we systematically ex… ▽ More

    Submitted 1 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  39. arXiv:2212.05435  [pdf, other

    cs.CY

    Wireless earbuds for low-cost hearing screening

    Authors: Justin Chan, Antonio Glenn, Malek Itani, Lisa R. Mancl, Emily Gallagher, Randall Bly, Shwetak Patel, Shyamnath Gollakota

    Abstract: We present the first wireless earbud hardware that can perform hearing screening by detecting otoacoustic emissions. The conventional wisdom has been that detecting otoacoustic emissions, which are the faint sounds generated by the cochlea, requires sensitive and expensive acoustic hardware. Thus, medical devices for hearing screening cost thousands of dollars and are inaccessible in low and middl… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  40. arXiv:2211.06138  [pdf, other

    cs.LG cs.CY stat.ML

    Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes

    Authors: Tennison Liu, Alex J. Chan, Boris van Breugel, Mihaela van der Schaar

    Abstract: It is important to guarantee that machine learning algorithms deployed in the real world do not result in unfairness or unintended social consequences. Fair ML has largely focused on the protection of single attributes in the simpler setting where both attributes and target outcomes are binary. However, the practical application in many a real-world problem entails the simultaneous protection of m… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  41. arXiv:2211.02250  [pdf, other

    cs.SD cs.LG eess.AS

    Real-Time Target Sound Extraction

    Authors: Bandhav Veluri, Justin Chan, Malek Itani, Tuochao Chen, Takuya Yoshioka, Shyamnath Gollakota

    Abstract: We present the first neural network model to achieve real-time and streaming target sound extraction. To accomplish this, we propose Waveformer, an encoder-decoder architecture with a stack of dilated causal convolution layers as the encoder, and a transformer decoder layer as the decoder. This hybrid architecture uses dilated causal convolutions for processing large receptive fields in a computat… ▽ More

    Submitted 19 April, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: ICASSP 2023 camera-ready

  42. arXiv:2210.10039  [pdf, other

    cs.CV cs.CY cs.LG

    How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios

    Authors: Mantas Mazeika, Eric Tang, Andy Zou, Steven Basart, Jun Shern Chan, Dawn Song, David Forsyth, Jacob Steinhardt, Dan Hendrycks

    Abstract: In recent years, deep neural networks have demonstrated increasingly strong abilities to recognize objects and activities in videos. However, as video understanding becomes widely used in real-world applications, a key consideration is develo** human-centric systems that understand not only the content of the video but also how it would affect the wellbeing and emotional state of viewers. To fac… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022; datasets available at https://github.com/hendrycks/emodiversity/

  43. arXiv:2210.09034  [pdf, other

    cs.CY cs.SI

    Analysing Donors' Behaviour in Non-profit Organisations for Disaster Resilience: The 2019--2020 Australian Bushfires Case Study

    Authors: Dilini Rajapaksha, Kacper Sokol, Jeffrey Chan, Flora Salim, Mukesh Prasad, Mahendra Samarawickrama

    Abstract: With the advancement and proliferation of technology, non-profit organisations have embraced social media platforms to improve their operational capabilities through brand advocacy, among many other strategies. The effect of such social media campaigns on these institutions, however, remains largely underexplored, especially during disaster periods. This work introduces and applies a quantitative… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  44. arXiv:2210.05320  [pdf, other

    cs.LG

    Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning

    Authors: Alex J. Chan, Mihaela van der Schaar

    Abstract: Consider making a prediction over new test data without any opportunity to learn from a training set of labelled data - instead given access to a set of expert models and their predictions alongside some limited information about the dataset used to train them. In scenarios from finance to the medical sciences, and even consumer practice, stakeholders have developed models on private data they eit… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  45. arXiv:2209.13836  [pdf, other

    cs.LG

    Mutual Information Assisted Ensemble Recommender System for Identifying Critical Risk Factors in Healthcare Prognosis

    Authors: Abhishek Dey, Debayan Goswami, Rahul Roy, Susmita Ghosh, Yu Shrike Zhang, Jonathan H. Chan

    Abstract: Purpose: Health recommenders act as important decision support systems, aiding patients and medical professionals in taking actions that lead to patients' well-being. These systems extract the information which may be of particular relevance to the end-user, hel** them in making appropriate decisions. The present study proposes a feature recommender, as a part of a disease management system, tha… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

  46. arXiv:2209.01780  [pdf, other

    cs.NI

    Underwater Acoustic Ranging Between Smartphones

    Authors: Tuochao Chen, Justin Chan, Shyamnath Gollakota

    Abstract: We present a novel underwater system that can perform acoustic ranging between commodity smartphones. To achieve this, we design a real-time underwater ranging protocol that computes the time-of-flight between smartphones. To address the severe underwater multipath, we present a dual-microphone optimization algorithm that can more reliably identify the direct path. Our underwater evaluations show… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

  47. Traversability analysis with vision and terrain probing for safe legged robot navigation

    Authors: Garen Haddeler, Meng Yee, Chuah, Yangwei You, Jianle Chan, Albertus H. Adiwahono, Wei Yun Yau, Chee-Meng Chew

    Abstract: Inspired by human behavior when traveling over unknown terrain, this study proposes the use of probing strategies and integrates them into a traversability analysis framework to address safe navigation on unknown rough terrain. Our framework integrates collapsibility information into our existing traversability analysis, as vision and geometric information alone could be misled by unpredictable no… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

  48. Underwater Messaging Using Mobile Devices

    Authors: Tuochao Chen, Justin Chan, Shyamnath Gollakota

    Abstract: Since its inception, underwater digital acoustic communication has required custom hardware that neither has the economies of scale nor is pervasive. We present the first acoustic system that brings underwater messaging capabilities to existing mobile devices like smartphones and smart watches. Our software-only solution leverages audio sensors, i.e., microphones and speakers, ubiquitous in today'… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Journal ref: SIGCOMM 2022

  49. arXiv:2208.01009  [pdf, other

    cs.CL cs.AI cs.LG

    Few-shot Adaptation Works with UnpredicTable Data

    Authors: Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer, Ethan Perez

    Abstract: Prior work on language models (LMs) shows that training on a large number of diverse tasks improves few-shot learning (FSL) performance on new tasks. We take this to the extreme, automatically extracting 413,299 tasks from internet tables - orders of magnitude more than the next-largest public datasets. Finetuning on the resulting dataset leads to improved FSL performance on Natural Language Proce… ▽ More

    Submitted 7 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Code at https://github.com/JunShern/few-shot-adaptation

  50. arXiv:2207.04581  [pdf, other

    cs.LG cs.CY

    How Robust is your Fair Model? Exploring the Robustness of Diverse Fairness Strategies

    Authors: Edward Small, Wei Shao, Zeliang Zhang, Peihan Liu, Jeffrey Chan, Kacper Sokol, Flora Salim

    Abstract: With the introduction of machine learning in high-stakes decision making, ensuring algorithmic fairness has become an increasingly important problem to solve. In response to this, many mathematical definitions of fairness have been proposed, and a variety of optimisation techniques have been developed, all designed to maximise a defined notion of fairness. However, fair solutions are reliant on th… ▽ More

    Submitted 31 May, 2024; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: 27 pages, 7 figures