Skip to main content

Showing 51–100 of 329 results for author: Chang, E

.
  1. arXiv:2306.03566  [pdf, other

    cs.LG stat.ML

    Memory-Based Dual Gaussian Processes for Sequential Learning

    Authors: Paul E. Chang, Prakhar Verma, S. T. John, Arno Solin, Mohammad Emtiyaz Khan

    Abstract: Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dua… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  2. arXiv:2306.01400  [pdf, other

    cs.LG cs.CR

    Adaptive Attractors: A Defense Strategy against ML Adversarial Collusion Attacks

    Authors: Jiyi Zhang, Han Fang, Ee-Chien Chang

    Abstract: In the seller-buyer setting on machine learning models, the seller generates different copies based on the original model and distributes them to different buyers, such that adversarial samples generated on one buyer's copy would likely not work on other copies. A known approach achieves this using attractor-based rewriter which injects different attractors to different copies. This induces differ… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  3. arXiv:2305.17888  [pdf, other

    cs.CL

    LLM-QAT: Data-Free Quantization Aware Training for Large Language Models

    Authors: Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra

    Abstract: Several post-training quantization methods have been applied to large language models (LLMs), and have been shown to perform well down to 8-bits. We find that these methods break down at lower bit precision, and investigate quantization aware training for LLMs (LLM-QAT) to push quantization levels even further. We propose a data-free distillation method that leverages generations produced by the p… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  4. arXiv:2305.05869  [pdf, other

    cs.LG cs.CV

    Finding Meaningful Distributions of ML Black-boxes under Forensic Investigation

    Authors: Jiyi Zhang, Han Fang, Hwee Kuan Lee, Ee-Chien Chang

    Abstract: Given a poorly documented neural network model, we take the perspective of a forensic investigator who wants to find out the model's data domain (e.g. whether on face images or traffic signs). Although existing methods such as membership inference and model inversion can be used to uncover some information about an unknown model, they still require knowledge of the data domain to start with. In th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  5. Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

    Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe , et al. (977 additional authors not shown)

    Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71

  6. arXiv:2304.02438  [pdf, other

    cs.OH

    CoCoMo: Computational Consciousness Modeling for Generative and Ethical AI

    Authors: Edward Y. Chang

    Abstract: The CoCoMo model proposes a computational solution to the challenge of incorporating ethical and emotional intelligence considerations into AI systems, with the aim of creating AI agents that combine knowledge with compassion. To achieve this goal, CoCoMo prioritizes fairness, beneficence, non-maleficence, empathy, adaptability, transparency, and critical and exploratory thinking abilities. The mo… ▽ More

    Submitted 8 April, 2023; v1 submitted 17 March, 2023; originally announced April 2023.

    Comments: 10 pages, 3 figures, 5 tables

    ACM Class: I.2.7

  7. arXiv:2303.10998  [pdf, other

    quant-ph physics.optics

    The maximum refractive index of an atomic crystal $\unicode{x2013}$ from quantum optics to quantum chemistry

    Authors: Francesco Andreoli, Bennet Windt, Stefano Grava, Gian Marcello Andolina, Michael J. Gullans, Alexander A. High, Darrick E. Chang

    Abstract: All known optical materials have an index of refraction of order unity. Despite the tremendous implications that an ultrahigh index could have for optical technologies, little research has been done on why the refractive index of materials is universally small, and whether this observation is fundamental. Here, we investigate the index of an ordered arrangement of atoms, as a function of atomic de… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 31 pages, 12 figures

  8. arXiv:2303.08769  [pdf, other

    cs.LG

    Prompting Large Language Models With the Socratic Method

    Authors: Edward Y. Chang

    Abstract: This paper presents a systematic approach to using the Socratic method in develo** prompt templates that effectively interact with large language models, including GPT-3. Various methods are examined, and those that yield precise answers and justifications while fostering creativity and imagination to enhance creative writing are identified. Techniques such as {\em definition}, {\em elenchus}, {… ▽ More

    Submitted 15 March, 2023; v1 submitted 17 February, 2023; originally announced March 2023.

    Comments: 10 pages

    ACM Class: I.2.7

    Journal ref: IEEE Computing and Communication Workshop and Conference (CCWC), March 2023

  9. arXiv:2303.03486  [pdf, other

    cs.RO

    Sampling-based Exploration for Reinforcement Learning of Dexterous Manipulation

    Authors: Gagan Khandate, Siqi Shang, Eric T. Chang, Tristan Luca Saidi, Yang Liu, Seth Matthew Dennis, Johnson Adams, Matei Ciocarlie

    Abstract: In this paper, we present a novel method for achieving dexterous manipulation of complex objects, while simultaneously securing the object without the use of passive support surfaces. We posit that a key difficulty for training such policies in a Reinforcement Learning framework is the difficulty of exploring the problem state space, as the accessible regions of this space form a complex structure… ▽ More

    Submitted 23 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures, accepted at Robotics Science & Systems 2023

  10. On the Origin of Dust Structures in Protoplanetary Disks: Constraints from the Rossby Wave Instability

    Authors: Eonho Chang, Andrew N. Youdin, Leonardo Krapp

    Abstract: High resolution sub-mm observations of protoplanetary disks with ALMA have revealed that dust rings are common in large, bright disks. The leading explanation for these structures is dust-trap** in a local gas pressure maximum, caused by an embedded planet or other dynamical process. Independent of origin, such dust traps should be stable for many orbits to collect significant dust. However, rin… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 11 pages, 5 figures, accepted to ApJL

  11. arXiv:2301.05884  [pdf

    physics.optics q-bio.TO

    Metasurface-enhanced mid-infrared spectrochemical imaging of tissues

    Authors: S. Rosas, K. A. Schoeller, E. Chang, H. Mei, M. A. Kats, K. W. Eliceiri, X. Zhao, F. Yesilkoy

    Abstract: Label-free and nondestructive mid-infrared vibrational hyperspectral imaging is emerging as an important ex-vivo tissue analysis tool, providing spatially resolved biochemical information critical to understanding physiological and pathological processes. However, the chemically complex and spatially heterogeneous composition of tissue specimens and the inherently weak interaction of infrared ligh… ▽ More

    Submitted 26 April, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

  12. arXiv:2301.01218  [pdf, other

    cs.CR cs.CV cs.LG

    Tracing the Origin of Adversarial Attack for Forensic Investigation and Deterrence

    Authors: Han Fang, Jiyi Zhang, Yupeng Qiu, Ke Xu, Chengfang Fang, Ee-Chien Chang

    Abstract: Deep neural networks are vulnerable to adversarial attacks. In this paper, we take the role of investigators who want to trace the attack and identify the source, that is, the particular model which the adversarial examples are generated from. Techniques derived would aid forensic investigation of attack incidents and serve as deterrence to potential attacks. We consider the buyers-seller setting… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

  13. arXiv:2212.13591  [pdf, other

    cs.AI cs.LG

    Knowledge-Guided Data-Centric AI in Healthcare: Progress, Shortcomings, and Future Directions

    Authors: Edward Y. Chang

    Abstract: The success of deep learning is largely due to the availability of large amounts of training data that cover a wide range of examples of a particular concept or meaning. In the field of medicine, having a diverse set of training data on a particular disease can lead to the development of a model that is able to accurately predict the disease. However, despite the potential benefits, there have not… ▽ More

    Submitted 30 April, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: 21 pages, 13 figures, 4 tables. arXiv admin note: text overlap with arXiv:1707.09873

    ACM Class: I.2.7; K.3.2

  14. arXiv:2212.00612  [pdf, other

    cs.LG cs.CR

    Purifier: Defending Data Inference Attacks via Transforming Confidence Scores

    Authors: Ziqi Yang, Li** Wang, Da Yang, Jie Wan, Ziming Zhao, Ee-Chien Chang, Fan Zhang, Kui Ren

    Abstract: Neural networks are susceptible to data inference attacks such as the membership inference attack, the adversarial model inversion attack and the attribute inference attack, where the attacker could infer useful information such as the membership, the reconstruction or the sensitive attributes of a data sample from the confidence scores predicted by the target classifier. In this paper, we propose… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: accepted by AAAI 2023

  15. arXiv:2211.01053  [pdf, other

    cs.LG stat.ML

    Fantasizing with Dual GPs in Bayesian Optimization and Active Learning

    Authors: Paul E. Chang, Prakhar Verma, ST John, Victor Picheny, Henry Moss, Arno Solin

    Abstract: Gaussian processes (GPs) are the main surrogate functions used for sequential modelling such as Bayesian Optimization and Active Learning. Their drawbacks are poor scaling with data and the need to run an optimization loop when using a non-Gaussian likelihood. In this paper, we focus on `fantasizing' batch acquisition functions that need the ability to condition on new fantasized data computationa… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  16. arXiv:2210.15493  [pdf, other

    q-fin.CP cs.LG

    Projecting Non-Fungible Token (NFT) Collections: A Contextual Generative Approach

    Authors: Wesley Joon-Wie Tann, Akhil Vuputuri, Ee-Chien Chang

    Abstract: Non-fungible tokens (NFTs) are digital assets stored on a blockchain representing real-world objects such as art or collectibles. An NFT collection comprises numerous tokens; each token can be transacted multiple times. It is a multibillion-dollar market where the number of collections has more than doubled in 2022. In this paper, we want to obtain a generative model that, given the early transact… ▽ More

    Submitted 4 February, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  17. arXiv:2208.13078  [pdf, other

    cs.CL

    MDIA: A Benchmark for Multilingual Dialogue Generation in 46 Languages

    Authors: Qingyu Zhang, Xiaoyu Shen, Ernie Chang, Jidong Ge, Pengke Chen

    Abstract: Owing to the lack of corpora for low-resource languages, current works on dialogue generation have mainly focused on English. In this paper, we present mDIA, the first large-scale multilingual benchmark for dialogue generation across low- to high-resource languages. It covers real-life conversations in 46 languages across 19 language families. We present baseline results obtained by fine-tuning th… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: The dataset and processing scripts are available in https://github.com/DoctorDream/mDIA

  18. Unscented Kalman filter with stable embedding for simple, accurate and computationally efficient state estimation of systems on manifolds in Euclidean space

    Authors: Jae-Hyeon Park, Dong Eui Chang

    Abstract: This paper proposes a simple, accurate and computationally efficient method to apply the ordinary unscented Kalman filter developed in Euclidean space to systems whose dynamics evolve on manifolds.We use the mathematical theory called stable embedding to make a variant of unscented Kalman filter that keeps state estimates in closeproximity to the manifold while exhibiting excellent estimation perf… ▽ More

    Submitted 30 November, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: This paper is published in International Journal of Robust and Nonliner Control

    Journal ref: International Journal of Robust and Nonlinear Control (2022)

  19. arXiv:2208.01946  [pdf, other

    cs.DC cs.CR

    Mixed Fault Tolerance Protocols with Trusted Execution Environment

    Authors: Mingyuan Gao, Hung Dang, Ee-Chien Chang, Jialin Li

    Abstract: Blockchain systems are designed, built and operated in the presence of failures. There are two dominant failure models, namely crash fault and Byzantine fault. Byzantine fault tolerance (BFT) protocols offer stronger security guarantees, and thus are widely used in blockchain systems. However, their security guarantees come at a dear cost to their performance and scalability. Several works have im… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 12 pages, 3 figures

  20. arXiv:2207.05122  [pdf, other

    quant-ph cond-mat.mes-hall physics.optics

    Nonlinear quantum logic with colliding graphene plasmons

    Authors: Giuseppe Calajò, Philipp K. Jenke, Lee A. Rozema, Philip Walther, Darrick E. Chang, Joel D. Cox

    Abstract: Graphene has emerged as a promising platform to bring nonlinear quantum optics to the nanoscale, where a large intrinsic optical nonlinearity enables long-lived and actively tunable plasmon polaritons to strongly interact. Here we theoretically study the collision between two counter-propagating plasmons in a graphene nanoribbon, where transversal subwavelength confinement endows propagating plasm… ▽ More

    Submitted 18 March, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 13 pages, 5 figures

    Journal ref: Phys. Rev. Research 5, 013188, (2023)

  21. arXiv:2206.13032  [pdf, other

    cs.MM

    De-END: Decoder-driven Watermarking Network

    Authors: Han Fang, Zhaoyang Jia, Yupeng Qiu, Jiyi Zhang, Weiming Zhang, Ee-Chien Chang

    Abstract: With recent advances in machine learning, researchers are now able to solve traditional problems with new solutions. In the area of digital watermarking, deep-learning-based watermarking technique is being extensively studied. Most existing approaches adopt a similar encoder-driven scheme which we name END (Encoder-NoiseLayer-Decoder) architecture. In this paper, we revamp the architecture and cre… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  22. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  23. arXiv:2205.14611  [pdf

    cs.CR

    Forensic Artefact Discovery and Attribution from Android Cryptocurrency Wallet Applications

    Authors: Eugene Chang, Paul Darcy, Kim-Kwang Raymond Choo, Nhien-An Le-Khac

    Abstract: Cryptocurrency has been (ab)used to purchase illicit goods and services such as drugs, weapons and child pornography (also referred to as child sexual abuse materials), and thus mobile devices (where cryptocurrency wallet applications are installed) are a potential source of evidence in a criminal investigation. Not surprisingly, there has been increased focus on the security of cryptocurrency wal… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

  24. arXiv:2205.14149  [pdf, other

    physics.ins-det physics.atom-ph physics.optics

    Low-Drift-Rate External Cavity Diode Laser

    Authors: Eddie H. Chang, Jared Rivera, Brian Bostwick, Christian Schneider, Peter Yu, Eric R. Hudson

    Abstract: We present the design, construction, and simulation of a simple, low-cost external cavity diode laser with a measured free-running frequency drift rate of 1.4(1)~MHz/h at 852 nm. This performance is achieved via a compact, nearly monolithic aluminum structure to minimize temperature gradients across the laser cavity. We present thermal finite element method simulations which quantify the effects o… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  25. 3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

    Authors: Yang Zhou, Zhiwen Yang, Hui Zhang, Eric I-Chao Chang, Yubo Fan, Yan Xu

    Abstract: Potential radioactive hazards in full-dose positron emission tomography (PET) imaging remain a concern, whereas the quality of low-dose images is never desirable for clinical use. So it is of great interest to translate low-dose PET images into full-dose. Previous studies based on deep learning methods usually directly extract hierarchical features for reconstruction. We notice that the importance… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TMI.2022.3156614, IEEE Transactions on Medical Imaging

    Journal ref: IEEE Transactions on Medical Imaging, 2022, 41(8): 2092-2104

  26. arXiv:2205.08878  [pdf, other

    cs.CV

    Transformer based multiple instance learning for weakly supervised histopathology image segmentation

    Authors: Ziniu Qian, Kailu Li, Maode Lai, Eric I-Chao Chang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Hispathological image segmentation algorithms play a critical role in computer aided diagnosis technology. The development of weakly supervised segmentation algorithm alleviates the problem of medical image annotation that it is time-consuming and labor-intensive. As a subset of weakly supervised learning, Multiple Instance Learning (MIL) has been proven to be effective in segmentation. However, t… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Provisional accepted for MICCAI 2022

  27. Feedback Gradient Descent: Efficient and Stable Optimization with Orthogonality for DNNs

    Authors: Fanchen Bu, Dong Eui Chang

    Abstract: The optimization with orthogonality has been shown useful in training deep neural networks (DNNs). To impose orthogonality on DNNs, both computational efficiency and stability are important. However, existing methods utilizing Riemannian optimization or hard constraints can only ensure stability while those using soft constraints can only improve efficiency. In this paper, we propose a novel metho… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Journal ref: AAAI 2022

  28. arXiv:2205.02022  [pdf, other

    cs.CL

    A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

    Authors: David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles Hacheme, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi , et al. (20 additional authors not shown)

    Abstract: Recent advances in the pre-training of language models leverage large-scale datasets to create multilingual models. However, low-resource languages are mostly left out in these datasets. This is primarily because many widely spoken languages are not well represented on the web and therefore excluded from the large-scale crawls used to create datasets. Furthermore, downstream users of these models… ▽ More

    Submitted 22 August, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022 (added evaluation data for amh, kin, nya, sna, xho)

  29. arXiv:2205.01838  [pdf, other

    q-bio.PE

    A model for malaria treatment evaluation in the presence of multiple species

    Authors: Camelia R. Walker, Roslyn I. Hickson, Edmond Chang, Pengby Ngor, Siv Sovannaroth, Julie A. Simpson, David J. Price, James M. McCaw, Ric N. Price, Jennifer A. Flegg, Angela Devine

    Abstract: Plasmodium (P.) falciparum and P. vivax are the two most common causes of malaria. While the majority of deaths and severe morbidity are due to P. falciparum, P. vivax poses a greater challenge to eliminating malaria outside of Africa due to its ability to form latent liver stage parasites (hypnozoites), which can cause relapsing episodes within an individual patient. In areas where P. falciparum… ▽ More

    Submitted 21 July, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  30. Scalable Private Decision Tree Evaluation with Sublinear Communication

    Authors: Jianli Bai, Xiangfu Song, Shujie Cui, Ee-Chien Chang, Giovanni Russello

    Abstract: Private decision tree evaluation (PDTE) allows a decision tree holder to run a secure protocol with a feature provider. By running the protocol, the feature provider will learn a classification result. Nothing more is revealed to either party. In most existing PDTE protocols, the required communication grows exponentially with the tree's depth $d$, which is highly inefficient for large trees. This… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  31. arXiv:2204.00870  [pdf, ps, other

    cs.PL

    Differential Cost Analysis with Simultaneous Potentials and Anti-potentials

    Authors: Đorđe Žikelić, Bor-Yuh Evan Chang, Pauline Bolignano, Franco Raimondi

    Abstract: We present a novel approach to differential cost analysis that, given a program revision, attempts to statically bound the difference in resource usage, or cost, between the two program versions. Differential cost analysis is particularly interesting because of the many compelling applications for it, such as detecting resource-use regressions at code-review time or proving the absence of certain… ▽ More

    Submitted 7 April, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Extended version of the PLDI 2022 paper

    ACM Class: D.3; F.3.1; F.3.2

  32. arXiv:2201.11867  [pdf, other

    cs.CL cs.SD eess.AS

    Neural-FST Class Language Model for End-to-End Speech Recognition

    Authors: Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer

    Abstract: We propose Neural-FST Class Language Model (NFCLM) for end-to-end speech recognition, a novel method that combines neural network language models (NNLMs) and finite state transducers (FSTs) in a mathematically consistent framework. Our method utilizes a background NNLM which models generic background text together with a collection of domain-specific entities modeled as individual FSTs. Each outpu… ▽ More

    Submitted 31 January, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted for publication at ICASSP 2022

  33. arXiv:2111.15160  [pdf, other

    cs.CR cs.LG

    Mitigating Adversarial Attacks by Distributing Different Copies to Different Users

    Authors: Jiyi Zhang, Han Fang, Wesley Joon-Wie Tann, Ke Xu, Chengfang Fang, Ee-Chien Chang

    Abstract: Machine learning models are vulnerable to adversarial attacks. In this paper, we consider the scenario where a model is distributed to multiple buyers, among which a malicious buyer attempts to attack another buyer. The malicious buyer probes its copy of the model to search for adversarial samples and then presents the found samples to the victim's copy of the model in order to replicate the attac… ▽ More

    Submitted 26 May, 2023; v1 submitted 30 November, 2021; originally announced November 2021.

  34. Tenodesis Grasp Emulator: Kinematic Assessment of Wrist-Driven Orthotic Control

    Authors: Erin Y. Chang, Raghid Mardini, Andrew I. W. McPherson, Yuri Gloumakov, Hannah S. Stuart

    Abstract: Wrist-driven orthotics have been designed to assist people with C6-7 spinal cord injury, however, the kinematic constraint imposed by such a control strategy can impede mobility and lead to abnormal body motion. This study characterizes body compensation using the novel Tenodesis Grasp Emulator, an adaptor orthotic that allows for the investigation of tenodesis gras** in subjects with unimpaired… ▽ More

    Submitted 9 November, 2023; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: 7 pages, 11 figures, submitted to International Conference on Robotics and Automation (ICRA) 2022. Video Supplement: https://youtu.be/NIgKg5R3Roc

  35. arXiv:2111.03412  [pdf, other

    cs.LG stat.ML

    Dual Parameterization of Sparse Variational Gaussian Processes

    Authors: Vincent Adam, Paul E. Chang, Mohammad Emtiyaz Khan, Arno Solin

    Abstract: Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up in… ▽ More

    Submitted 19 January, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2021)

  36. Whole Brain Segmentation with Full Volume Neural Network

    Authors: Yeshu Li, Jonathan Cui, Yilun Sheng, Xiao Liang, **gdong Wang, Eric I-Chao Chang, Yan Xu

    Abstract: Whole brain segmentation is an important neuroimaging task that segments the whole brain volume into anatomically labeled regions-of-interest. Convolutional neural networks have demonstrated good performance in this task. Existing solutions, usually segment the brain image by classifying the voxels, or labeling the slices or the sub-volumes separately. Their representation learning is based on par… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted to CMIG

    Journal ref: Computerized Medical Imaging and Graphics, Volume 93, October 2021, 101991

  37. arXiv:2110.05577  [pdf, other

    cond-mat.mes-hall cond-mat.other quant-ph

    Engineering the Radiative Dynamics of Thermalized Excitons with Metal Interfaces

    Authors: Grace H. Chen, David Z. Li, Amy Butcher, Alexander A. High, Darrick E. Chang

    Abstract: As a platform for optoelectronic devices based on exciton dynamics, monolayer transition metal dichalcogenides (TMDCs) are often placed near metal interfaces or inside planar cavities. While the radiative properties of point dipoles at metal interfaces has been studied extensively, those of excitons, which are delocalized and exhibit a temperature-dependent momentum distribution, lack a thorough t… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 30 pages, 7 figures

  38. arXiv:2110.00050  [pdf, other

    quant-ph cond-mat.quant-gas physics.optics

    Emergence of solitons from many-body photon bound states in quantum nonlinear media

    Authors: Giuseppe Calajo, Darrick E. Chang

    Abstract: Solitons are known to occur in the context of atom-light interaction via the well-known semi-classical phenomenon of self-induced transparency (SIT). Separately, in the regime where both light and atoms are fully treated quantum mechanically, quantum few-photon bound states are known to be a ubiquitous phenomenon that arises in different systems such as atoms coupled to chiral or bidirectional wav… ▽ More

    Submitted 9 April, 2022; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: 20 pages, 11 figures

    Journal ref: Phys. Rev. Research 4, 023026 (2022)

  39. arXiv:2109.12242  [pdf, other

    cs.CL

    Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation

    Authors: An Yan, Zexue He, Xing Lu, Jiang Du, Eric Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

    Abstract: Radiology report generation aims at generating descriptive text from radiology images automatically, which may present an opportunity to improve radiology reporting and interpretation. A typical setting consists of training encoder-decoder models on image-report pairs with a cross entropy loss, which struggles to generate informative sentences for clinical diagnoses since normal findings dominate… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021

  40. arXiv:2109.11898  [pdf, other

    cs.IR

    Graph Learning Augmented Heterogeneous Graph Neural Network for Social Recommendation

    Authors: Yiming Zhang, Lingfei Wu, Qi Shen, Yitong Pang, Zhihua Wei, Fangli Xu, Ethan Chang, Bo Long

    Abstract: Social recommendation based on social network has achieved great success in improving the performance of recommendation system. Since social network (user-user relations) and user-item interactions are both naturally represented as graph-structured data, Graph Neural Networks (GNNs) have thus been widely applied for social recommendation. In this work, we propose an end-to-end heterogeneous global… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: 10 pages, 5 figures

  41. arXiv:2108.09268  [pdf, other

    physics.atom-ph quant-ph

    Renormalization group analysis of near-field induced dephasing of optical spin waves in an atomic medium

    Authors: Stefano Grava, Yizun He, Saijun Wu, Darrick E. Chang

    Abstract: While typical theories of atom-light interactions treat the atomic medium as being smooth, it is well-known that microscopic optical effects driven by atomic granularity, dipole-dipole interactions, and multiple scattering can lead to important effects. Recently, for example, it was experimentally observed that these ingredients can lead to a fundamental, density-dependent dephasing of optical spi… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: 15 pages, 5 figures

  42. arXiv:2108.08263  [pdf, other

    cs.PL

    Selectively-Amortized Resource Bounding (Extended Version)

    Authors: Tianhan Lu, Bor-Yuh Evan Chang, Ashutosh Trivedi

    Abstract: We consider the problem of automatically proving resource bounds. That is, we study how to prove that an integer-valued resource variable is bounded by a given program expression. Automatic resource-bound analysis has recently received significant attention because of a number of important applications (e.g., detecting performance bugs, preventing algorithmic-complexity attacks, identifying side-c… ▽ More

    Submitted 13 October, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: This is an extended version of SAS'21 paper (with appendices)

  43. arXiv:2108.06614  [pdf, other

    cs.CL

    The SelectGen Challenge: Finding the Best Training Samples for Few-Shot Neural Text Generation

    Authors: Ernie Chang, Xiaoyu Shen, Alex Marin, Vera Demberg

    Abstract: We propose a shared task on training instance selection for few-shot neural text generation. Large-scale pretrained language models have led to dramatic improvements in few-shot text generation. Nonetheless, almost all previous work simply applies random sampling to select the few-shot training instances. Little to no attention has been paid to the selection strategies and how they would affect mo… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: Accepted at GenChal @ INLG 2021. arXiv admin note: text overlap with arXiv:2107.03176

  44. arXiv:2108.04912  [pdf

    physics.med-ph

    Quantitative Parametric Map** of Tissues Properties from Standard Magnetic Resonance Imaging Enabled by Deep Learning

    Authors: Yan Wu, Yajun Ma, Youngwook Kee, Nataliya Kovalchuk, Dante Capaldi, Hongyi Ren, Steven Hancock, Eric Chang, Marcus Alley, John Pauly, Jiang Du, Shreyas Vasanawala, Lei Xing

    Abstract: Magnetic resonance imaging (MRI) offers superior soft tissue contrast and is widely used in biomedicine. However, conventional MRI is not quantitative, which presents a bottleneck in image analysis and digital healthcare. Typically, additional scans are required to disentangle the effect of multiple parameters of MR and extract quantitative tissue properties. Here we investigate a data-driven stra… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

  45. arXiv:2108.03526  [pdf, other

    quant-ph physics.atom-ph

    Optomechanical strong coupling between a single cavity photon and a single atom

    Authors: Javier Argüello-Luengo, Darrick E. Chang

    Abstract: Single atoms coupled to a cavity offer unique opportunities as quantum optomechanical devices because of their small mass and strong interaction with light. A particular regime of interest in optomechanics is that of "single-photon strong coupling," where motional displacements on the order of the zero-point uncertainty are sufficient to shift the cavity resonance frequency by more than its linewi… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: 14 pages, 6 figures

    Journal ref: New J. Phys. 24 023006 (2022)

  46. arXiv:2107.12612  [pdf, other

    cs.CR

    Poisoning Online Learning Filters: DDoS Attacks and Countermeasures

    Authors: Wesley Joon-Wie Tann, Ee-Chien Chang

    Abstract: The recent advancements in machine learning have led to a wave of interest in adopting online learning-based approaches for long-standing attack mitigation issues. In particular, DDoS attacks remain a significant threat to network service availability even after more than two decades. These attacks have been well studied under the assumption that malicious traffic originates from a single attack p… ▽ More

    Submitted 19 January, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

  47. arXiv:2107.03978  [pdf, other

    astro-ph.SR physics.flu-dyn

    Modeling coexisting GSF and shear instabilities in rotating stars

    Authors: Eonho Chang, Pascale Garaud

    Abstract: Zahn's widely-used model for turbulent mixing induced by rotational shear has recently been validated (with some caveats) in non-rotating shear flows. It is not clear, however, whether his model remains valid in the presence of rotation, even though this was its original purpose. Furthermore, new instabilities arise in rotating fluids, such as the Goldreich-Schubert-Fricke (GSF) instability. Which… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 19 pages, 11 figures

  48. Heterogeneous Global Graph Neural Networks for Personalized Session-based Recommendation

    Authors: Yitong Pang, Lingfei Wu, Qi Shen, Yiming Zhang, Zhihua Wei, Fangli Xu, Ethan Chang, Bo Long, Jian Pei

    Abstract: Predicting the next interaction of a short-term interaction session is a challenging task in session-based recommendation. Almost all existing works rely on item transition patterns, and neglect the impact of user historical sessions while modeling user preference, which often leads to non-personalized recommendation. Additionally, existing personalized session-based recommenders capture user pref… ▽ More

    Submitted 26 February, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 9 pages, 4 figures

  49. arXiv:2107.03179  [pdf, other

    cs.CL

    Time-Aware Ancient Chinese Text Translation and Inference

    Authors: Ernie Chang, Yow-Ting Shiue, Hui-Syuan Yeh, Vera Demberg

    Abstract: In this paper, we aim to address the challenges surrounding the translation of ancient Chinese text: (1) The linguistic gap due to the difference in eras results in translations that are poor in quality, and (2) most translations are missing the contextual information that is often very crucial to understanding the text. To this end, we improve upon past translation techniques by proposing the fol… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted at LChange at ACL 2021

  50. arXiv:2107.03176  [pdf, other

    cs.CL cs.LG

    On Training Instance Selection for Few-Shot Neural Text Generation

    Authors: Ernie Chang, Xiaoyu Shen, Hui-Syuan Yeh, Vera Demberg

    Abstract: Large-scale pretrained language models have led to dramatic improvements in text generation. Impressive performance can be achieved by finetuning only on a small number of instances (few-shot setting). Nonetheless, almost all previous work simply applies random sampling to select the few-shot training instances. Little to no attention has been paid to the selection strategies and how they would af… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted at ACL 2021