Skip to main content

Showing 101–150 of 333 results for author: Yang, E

.
  1. arXiv:2204.11996  [pdf, other

    cond-mat.soft cond-mat.mtrl-sci

    Understanding Creep Suppression Mechanism in Polymer Nanocomposites through Machine Learning

    Authors: Entao Yang, James F. Pressly, Bharath Natarajan, Robert Colby, Karen I. Winey, Robert A. Riggleman

    Abstract: While recent efforts have shown how local structure plays an essential role in the dynamic heterogeneity of homogeneous glass-forming materials, systems containing interfaces such as thin films or composite materials remain poorly understood. It is known that interfaces perturb the molecular packing nearby, however, numerous studies show the dynamics are modified over a much larger range. Here, we… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  2. C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval

    Authors: Eugene Yang, Suraj Nair, Ramraj Chandradevan, Rebecca Iglesias-Flores, Douglas W. Oard

    Abstract: Pretrained language models have improved effectiveness on numerous tasks, including ad-hoc retrieval. Recent work has shown that continuing to pretrain a language model with auxiliary objectives before fine-tuning on the retrieval task can further improve retrieval effectiveness. Unlike monolingual retrieval, designing an appropriate auxiliary task for cross-language map**s is challenging. To ad… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures, accepted as a SIGIR 2022 Short Paper

  3. arXiv:2204.11087  [pdf, other

    cs.CL

    LitMind Dictionary: An Open-Source Online Dictionary

    Authors: Cunliang Kong, Xuezhi Fang, Liner Yang, Yun Chen, Erhong Yang

    Abstract: Dictionaries can help language learners to learn vocabulary by providing definitions of words. Since traditional dictionaries present word senses as discrete items in predefined inventories, they fall short of flexibility, which is required in providing specific meanings of words in particular contexts. In this paper, we introduce the LitMind Dictionary (https://dictionary.litmind.ink), an open-so… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

  4. arXiv:2204.07701  [pdf, other

    cs.CL

    BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition Modeling

    Authors: Cunliang Kong, Yujie Wang, Ruining Chong, Liner Yang, Hengyuan Zhang, Erhong Yang, Ya** Huang

    Abstract: This paper describes the BLCU-ICALL system used in the SemEval-2022 Task 1 Comparing Dictionaries and Word Embeddings, the Definition Modeling subtrack, achieving 1st on Italian, 2nd on Spanish and Russian, and 3rd on English and French. We propose a transformer-based multitasking framework to explore the task. The framework integrates multiple embedding architectures through the cross-attention m… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  5. arXiv:2203.12926  [pdf, other

    cs.CL

    Multitasking Framework for Unsupervised Simple Definition Generation

    Authors: Cunliang Kong, Yun Chen, Hengyuan Zhang, Liner Yang, Erhong Yang

    Abstract: The definition generation task can help language learners by providing explanations for unfamiliar words. This task has attracted much attention in recent years. We propose a novel task of Simple Definition Generation (SDG) to help language learners and low literacy readers. A significant challenge of this task is the lack of learner's dictionaries in many languages, and therefore the lack of data… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022 (main conference)

  6. arXiv:2203.04424  [pdf, other

    cs.RO

    SLAM-Supported Self-Training for 6D Object Pose Estimation

    Authors: Ziqi Lu, Yihao Zhang, Kevin Doherty, Odin Severinsen, Ethan Yang, John Leonard

    Abstract: Recent progress in object pose prediction provides a promising path for robots to build object-level scene representations during navigation. However, as we deploy a robot in novel environments, the out-of-distribution data can degrade the prediction performance. To mitigate the domain gap, we can potentially perform self-training in the target domain, using predictions on robot-captured images as… ▽ More

    Submitted 15 August, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

  7. TARexp: A Python Framework for Technology-Assisted Review Experiments

    Authors: Eugene Yang, David D. Lewis

    Abstract: Technology-assisted review (TAR) is an important industrial application of information retrieval (IR) and machine learning (ML). While a small TAR research community exists, the complexity of TAR software and workflows is a major barrier to entry. Drawing on past open source TAR efforts, as well as design patterns from the IR and ML open source software, we present an open source Python framework… ▽ More

    Submitted 24 April, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures, accepted as a SIGIR 2022 demo paper

  8. arXiv:2201.09996  [pdf, ps, other

    cs.IR

    Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

    Authors: Cash Costello, Eugene Yang, Dawn Lawrie, James Mayfield

    Abstract: While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many langu… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 5 pages, accepted at ECIR 2022 as a demo paper

  9. arXiv:2201.09992  [pdf, other

    cs.IR cs.CL

    HC4: A New Suite of Test Collections for Ad Hoc CLIR

    Authors: Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang

    Abstract: HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval (CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in English and in the document languages, and graded relevance judgments. New test collections are needed because existing CLIR test collections built using pooling of traditional CLIR runs have systematic gaps in their relevance j… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 16 pages, 2 figures, accepted at ECIR 2022

  10. arXiv:2201.08603  [pdf, other

    cs.AR

    Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

    Authors: Georgios Zacharopoulos, Adel Ejjeh, Ying **g, En-Yu Yang, Tianyu Jia, Iulian Brumar, Jeremy Intan, Muhammad Huzaifa, Sarita Adve, Vikram Adve, Gu-Yeon Wei, David Brooks

    Abstract: The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 20 pages

  11. arXiv:2201.08471  [pdf, other

    cs.IR cs.CL

    Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models

    Authors: Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard

    Abstract: The advent of transformer-based models such as BERT has led to the rise of neural ranking models. These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25. While monolingual retrieval tasks have benefited from large-scale training collections such as MS MARCO and advances in neural architectures, cross-language retrieval tasks… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted at ECIR 2022 (Full paper)

  12. arXiv:2201.01849  [pdf, other

    cs.CG

    Approximation Algorithms for Maximum Matchings in Geometric Intersection Graphs

    Authors: Sariel Har-Peled, Everett Yang

    Abstract: We present a $(1- \varepsilon)$-approximation algorithms for maximum cardinality matchings in disk intersection graphs -- all with near linear running time. We also present estimation algorithm that returns $(1\pm \varepsilon)$-approximation to the size of such matchings -- this algorithms run in linear time for unit disks, and $O(n \log n)$ for general disks (as long as the density is relatively… ▽ More

    Submitted 15 March, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

  13. arXiv:2112.15043  [pdf, other

    cs.CL

    YACLC: A Chinese Learner Corpus with Multidimensional Annotation

    Authors: Yingying Wang, Cunliang Kong, Liner Yang, Yijun Wang, Xiaorong Lu, Renfen Hu, Shan He, Zhenghao Liu, Yun Chen, Erhong Yang, Maosong Sun

    Abstract: Learner corpus collects language data produced by L2 learners, that is second or foreign-language learners. This resource is of great relevance for second language acquisition research, foreign-language teaching, and automatic grammatical error correction. However, there is little focus on learner corpus for Chinese as Foreign Language (CFL) learners. Therefore, we propose to construct a large-sca… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

    Comments: 4 pages, 3 figures

  14. arXiv:2112.13610  [pdf, other

    cs.CL

    CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

    Authors: Yuan Yao, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang, Chaojun Xiao, Xiaozhi Wang, Fanchao Qi, Junwei Bao, **ran Nie, Zheni Zeng, Yuxian Gu, Kun Zhou, Xuancheng Huang, Wenhao Li, Shuhuai Ren, **liang Lu, Chengqiang Xu, Huadong Wang, Guoyang Zeng, Zile Zhou, Jiajun Zhang, Juanzi Li, Minlie Huang, Rui Yan , et al. (10 additional authors not shown)

    Abstract: Realizing general-purpose language intelligence has been a longstanding goal for natural language processing, where standard evaluation benchmarks play a fundamental and guiding role. We argue that for general-purpose language intelligence evaluation, the benchmark itself needs to be comprehensive and systematic. To this end, we propose CUGE, a Chinese Language Understanding and Generation Evaluat… ▽ More

    Submitted 14 June, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: We add two new datasets, including grammatical error correction dataset YACLC from Bei**g Language and Culture University, and reading comprehension dataset GCRC from Shanxi University, and also improve the description consistency of all datasets

  15. arXiv:2112.08796  [pdf, other

    cs.CV cs.AI cs.LG

    Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing

    Authors: Joonhyung Park, June Yong Yang, **woo Shin, Sung Ju Hwang, Eunho Yang

    Abstract: The Mixup scheme suggests mixing a pair of samples to create an augmented training sample and has gained considerable attention recently for improving the generalizability of neural networks. A straightforward and widely used extension of Mixup is to combine with regional dropout-like methods: removing random patches from a sample and replacing it with the features from another sample. Albeit thei… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 12 pages; Accepted to AAAI2022

  16. arXiv:2112.02772  [pdf, other

    cs.CV

    ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation

    Authors: Isabella Liu, Edward Yang, Jianyu Tao, Rui Chen, Xiaoshuai Zhang, Qing Ran, Zhu Liu, Hao Su

    Abstract: Traditional depth sensors generate accurate real world depth estimates that surpass even the most advanced learning approaches trained only on simulation domains. Since ground truth depth is readily available in the simulation domain but quite difficult to obtain in the real domain, we propose a method that leverages the best of both worlds. In this paper we present a new framework, ActiveZero, wh… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

  17. arXiv:2112.01021  [pdf, other

    cs.LG

    Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation

    Authors: Yeonsung Jung, Ha** Shim, June Yong Yang, Eunho Yang

    Abstract: Deep neural networks (DNNs), despite their impressive ability to generalize over-capacity networks, often rely heavily on malignant bias as shortcuts instead of task-related information for discriminative tasks. To address this problem, recent studies utilize auxiliary information related to the bias, which is rarely obtainable in practice, or sift through a handful of bias-free samples for debias… ▽ More

    Submitted 5 July, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

  18. arXiv:2111.14785  [pdf, other

    astro-ph.IM astro-ph.CO

    BICEP Array: 150 GHz detector module development

    Authors: A. Schillaci, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, C. Giannakopoulos, N. Goeckner-Wald, D. Goldfinger, J. A. Grayson , et al. (59 additional authors not shown)

    Abstract: The BICEP/Keck Collaboration is currently leading the quest to the highest sensitivity measurements of the polarized CMB anisotropies on degree scale with a series of cryogenic telescopes, of which BICEP Array is the latest Stage-3 upgrade with a total of $\sim32,000$ detectors. The instrument comprises 4 receivers spanning 30 to 270 GHz, with the low-frequency 30/40 GHz deployed to the South Pole… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 9 pages, 5 figure, Proceeding of LTD19 submitted to Journal of Low Temperature Physics

  19. arXiv:2111.11316  [pdf, ps, other

    math.PR cs.DM math.CO

    Testing thresholds for high-dimensional sparse random geometric graphs

    Authors: Siqi Liu, Sidhanth Mohanty, Tselil Schramm, Elizabeth Yang

    Abstract: In the random geometric graph model $\mathsf{Geo}_d(n,p)$, we identify each of our $n$ vertices with an independently and uniformly sampled vector from the $d$-dimensional unit sphere, and we connect pairs of vertices whose vectors are ``sufficiently close'', such that the marginal probability of an edge is $p$. We investigate the problem of testing for this latent geometry, or in other words, d… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 54 pages

  20. arXiv:2111.06621  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Radiative Pattern of Intralayer and Interlayer Excitons in Two-Dimensional WS2/WSe2 Heterostructure

    Authors: Mohammed Adel Aly, Manan Shah, Lorenz Maximilian Schneider, Kyungnam Kang, Martin Koch, Eui-Hyeok Yang, Arash Rahimi-Iman

    Abstract: Two-dimensional (2D) heterostructures (HS) formed by transition-metal dichalcogenide (TMDC) monolayers offer a unique platform for the study of intralayer and interlayer excitons as well as moiré-pattern-induced features. Particularly, the dipolar charge-transfer exciton comprising an electron and a hole, which are confined to separate layers of 2D semiconductors and Coulomb-bound across the heter… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  21. arXiv:2111.05639  [pdf, other

    cs.LG

    Graph Transplant: Node Saliency-Guided Graph Mixup with Local Structure Preservation

    Authors: Joonhyung Park, Ha** Shim, Eunho Yang

    Abstract: Graph-structured datasets usually have irregular graph sizes and connectivities, rendering the use of recent data augmentation techniques, such as Mixup, difficult. To tackle this challenge, we present the first Mixup-like graph augmentation method at the graph-level called Graph Transplant, which mixes irregular graphs in data space. To be well defined on various scales of the graph, our method i… ▽ More

    Submitted 19 December, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: 19 pages; Accepted to AAAI2022

  22. arXiv:2111.05066  [pdf

    cs.CV

    Deep Convolution Network Based Emotion Analysis for Automatic Detection of Mild Cognitive Impairment in the Elderly

    Authors: Zixiang Fei, Erfu Yang, Leijian Yu, Xia Li, Huiyu Zhou, Wenju Zhou

    Abstract: A significant number of people are suffering from cognitive impairment all over the world. Early detection of cognitive impairment is of great importance to both patients and caregivers. However, existing approaches have their shortages, such as time consumption and financial expenses involved in clinics and the neuroimaging stage. It has been found that patients with cognitive impairment show abn… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: 17 pages

  23. arXiv:2110.15018  [pdf, other

    eess.AS cs.SD

    TorchAudio: Building Blocks for Audio and Speech Processing

    Authors: Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Anjali Chourdia, Artyom Astafurov, Caroline Chen, Ching-Feng Yeh, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jay Mahadeokar, Jeff Hwang, Ji Chen, Peter Goldsborough, Prabhat Roy, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair, Yangyang Shi

    Abstract: This document describes version 0.10 of TorchAudio: building blocks for machine learning applications in the audio and speech processing domain. The objective of TorchAudio is to accelerate the development and deployment of machine learning applications for researchers and engineers by providing off-the-shelf building blocks. The building blocks are designed to be GPU-compatible, automatically dif… ▽ More

    Submitted 16 February, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted by ICASSP 2022

  24. arXiv:2110.02508  [pdf, other

    cs.LG

    Online Hyperparameter Meta-Learning with Hypergradient Distillation

    Authors: Hae Beom Lee, Hayeon Lee, Jaewoong Shin, Eunho Yang, Timothy Hospedales, Sung Ju Hwang

    Abstract: Many gradient-based meta-learning methods assume a set of parameters that do not participate in inner-optimization, which can be considered as hyperparameters. Although such hyperparameters can be optimized using the existing gradient-based hyperparameter optimization (HO) methods, they suffer from the following issues. Unrolled differentiation methods do not scale well to high-dimensional hyperpa… ▽ More

    Submitted 11 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

  25. arXiv:2110.00892  [pdf, ps, other

    math.CO

    Cyclic Base Ordering of Graphs

    Authors: Jessica Li, Eric Yang, William Zhang

    Abstract: A cyclic base ordering of a connected graph $G$, is a cyclic ordering of $E(G)$ such that every cyclically consecutive $|V(G)|-1$ edges form a spanning tree. In this project, we study cyclic base ordering of various families of graphs, including square of cycles, wheel graphs, generalized wheel graphs and broken wheel graphs, fan and broken fan graphs, prism graphs, and maximal 2-degenerate graphs… ▽ More

    Submitted 2 October, 2021; originally announced October 2021.

  26. arXiv:2109.08359  [pdf, other

    cs.CL cs.AI cs.LG

    Distilling Linguistic Context for Language Model Compression

    Authors: Geondo Park, Gyeongman Kim, Eunho Yang

    Abstract: A computationally expensive and memory intensive neural network lies behind the recent success of language representation learning. Knowledge distillation, a major technique for deploying such a vast language model in resource-scarce environments, transfers the knowledge on individual word representations learned without restrictions. In this paper, inspired by the recent observations that languag… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021. Code: https://github.com/GeondoPark/CKD

  27. arXiv:2109.06442  [pdf, ps, other

    cs.DS math.PR

    Domain Sparsification of Discrete Distributions using Entropic Independence

    Authors: Nima Anari, Michał Dereziński, Thuy-Duong Vuong, Elizabeth Yang

    Abstract: We present a framework for speeding up the time it takes to sample from discrete distributions $μ$ defined over subsets of size $k$ of a ground set of $n$ elements, in the regime $k\ll n$. We show that having estimates of marginals $\mathbb{P}_{S\sim μ}[i\in S]$, the task of sampling from $μ$ can be reduced to sampling from distributions $ν$ supported on size $k$ subsets of a ground set of only… ▽ More

    Submitted 14 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

  28. arXiv:2109.02100  [pdf, other

    cs.LG cs.CV

    Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss

    Authors: Jung Hyun Lee, Jihun Yun, Sung Ju Hwang, Eunho Yang

    Abstract: Network quantization, which aims to reduce the bit-lengths of the network weights and activations, has emerged for their deployments to resource-limited devices. Although recent studies have successfully discretized a full-precision network, they still incur large quantization errors after training, thus giving rise to a significant performance gap between a full-precision network and its quantize… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: Accepted to ICCV 2021

  29. arXiv:2108.12752  [pdf, other

    cs.IR

    TAR on Social Media: A Framework for Online Content Moderation

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Content moderation (removing or limiting the distribution of posts based on their contents) is one tool social networks use to fight problems such as harassment and disinformation. Manually screening all content is usually impractical given the scale of social media data, and the need for nuanced human interpretations makes fully automated approaches infeasible. We consider content moderation from… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: 9 pages, 2 figures, accepted at DESIRES 2021

  30. Certifying One-Phase Technology-Assisted Reviews

    Authors: David D. Lewis, Eugene Yang, Ophir Frieder

    Abstract: Technology-assisted review (TAR) workflows based on iterative active learning are widely used in document review applications. Most stop** rules for one-phase TAR workflows lack valid statistical guarantees, which has discouraged their use in some legal contexts. Drawing on the theory of quantile estimation, we provide the first broadly applicable and statistically valid sample-based stop** ru… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: 10 pages, 4 figures, accepted at CIKM 2021

  31. The Role of Local Structure in the Enhanced Dynamics of Deformed Glasses

    Authors: Entao Yang, Robert A. Riggleman

    Abstract: External stress can accelerate molecular mobility of amorphous solids by several orders of magnitude. The changes in mobility are commonly interpreted through the Eyring model, which invokes an empirical activation volume whose origin remains poorly understood. Here, we analyze constant-stress molecular dynamics simulations and propose an extension of the Eyring model with a machine-learned field,… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  32. arXiv:2107.08603  [pdf, ps, other

    math.AG math.KT

    Some results on the motivic nearby cycle

    Authors: Fangzhou **, Enlin Yang

    Abstract: We extend Ayoub's formalism of motivic nearby cycle functor to the $\infty$-categorical level, and prove some desired cohomological properties by relating the motivic nearby cycle functor to the notion of local acyclicity in motivic homotopy.

    Submitted 20 August, 2022; v1 submitted 18 July, 2021; originally announced July 2021.

    MSC Class: 14F42; 14F45; 19E15; 32S30

  33. arXiv:2107.00233  [pdf, other

    cs.LG cs.AI cs.CV cs.DC

    FedMix: Approximation of Mixup under Mean Augmented Federated Learning

    Authors: Tehrim Yoon, Sumin Shin, Sung Ju Hwang, Eunho Yang

    Abstract: Federated learning (FL) allows edge devices to collectively learn a model without directly sharing data within each device, thus preserving privacy and eliminating the need to store data globally. While there are promising results under the assumption of independent and identically distributed (iid) local data, current state-of-the-art algorithms suffer from performance degradation as the heteroge… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Journal ref: ICLR 2021

  34. arXiv:2106.15853  [pdf, other

    cs.LG

    Understanding and Improving Early Stop** for Learning with Noisy Labels

    Authors: Yingbin Bai, Erkun Yang, Bo Han, Yanhua Yang, Jiatong Li, Yinian Mao, Gang Niu, Tongliang Liu

    Abstract: The memorization effect of deep neural network (DNN) plays a pivotal role in many state-of-the-art label-noise learning methods. To exploit this property, the early stop** trick, which stops the optimization at the early stage of training, is usually adopted. Current methods generally decide the early stop** point by considering a DNN as a whole. However, a DNN can be considered as a compositi… ▽ More

    Submitted 26 December, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  35. Heuristic Stop** Rules For Technology-Assisted Review

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to human-in-the-loop active learning workflows for finding relevant documents in large collections. These workflows often must meet a target for the proportion of relevant documents found (i.e. recall) while also holding down costs. A variety of heuristic stop** rules have been suggested for striking this tradeoff in particular settings, but none have been… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 10 pages, 2 figures. Accepted at DocEng 21

  36. On Minimizing Cost in Legal Document Review Workflows

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to human-in-the-loop machine learning workflows for document review in legal discovery and other high recall review tasks. Attorneys and legal technologists have debated whether review should be a single iterative process (one-phase TAR workflows) or whether model training and review should be separate (two-phase TAR workflows), with implications for the cho… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 10 pages, 3 figures. Accepted at DocEng 21

  37. arXiv:2106.03153  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

    Authors: Dongchan Min, Dong Bok Lee, Eunho Yang, Sung Ju Hwang

    Abstract: With rapid progress in neural text-to-speech (TTS) models, personalized speech generation is now in high demand for many applications. For practical applicability, a TTS model should generate high-quality speech with only a few audio samples from the given speaker, that are also short in length. However, existing methods either require to fine-tune the model or achieve low adaptation quality witho… ▽ More

    Submitted 16 June, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted by ICML 2021

  38. arXiv:2106.01085  [pdf, other

    cs.LG cs.CV

    Online Coreset Selection for Rehearsal-based Continual Learning

    Authors: Jaehong Yoon, Divyam Madaan, Eunho Yang, Sung Ju Hwang

    Abstract: A dataset is a shred of crucial evidence to describe a task. However, each data point in the dataset does not have the same potential, as some of the data points can be more representative or informative than others. This unequal importance among the data points may have a large impact in rehearsal-based continual learning, where we store a subset of the training examples (coreset) to be replayed… ▽ More

    Submitted 18 March, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  39. arXiv:2105.13001  [pdf, other

    cs.LG

    Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network

    Authors: Shuo Yang, Erkun Yang, Bo Han, Yang Liu, Min Xu, Gang Niu, Tongliang Liu

    Abstract: In label-noise learning, estimating the transition matrix is a hot topic as the matrix plays an important role in building statistically consistent classifiers. Traditionally, the transition from clean labels to noisy labels (i.e., clean-label transition matrix (CLTM)) has been widely exploited to learn a clean label classifier by employing the noisy data. Motivated by that classifiers mostly outp… ▽ More

    Submitted 14 July, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: ICML 22 camera ready

  40. arXiv:2105.01044  [pdf, other

    cs.IR cs.CL

    Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review

    Authors: Eugene Yang, Sean MacAvaney, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to iterative active learning workflows for document review in high recall retrieval (HRR) tasks. TAR research and most commercial TAR software have applied linear models such as logistic regression to lexical features. Transformer-based models with supervised tuning are known to improve effectiveness on many text classification tasks, suggesting their use in… ▽ More

    Submitted 19 January, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 6 pages, 1 figure, accepted at ECIR 2022

  41. arXiv:2105.00795  [pdf, other

    cs.LG

    RetCL: A Selection-based Approach for Retrosynthesis via Contrastive Learning

    Authors: Hankook Lee, Sungsoo Ahn, Seung-Woo Seo, You Young Song, Eunho Yang, Sung-Ju Hwang, **woo Shin

    Abstract: Retrosynthesis, of which the goal is to find a set of reactants for synthesizing a target product, is an emerging research area of deep learning. While the existing approaches have shown promising results, they currently lack the ability to consider availability (e.g., stability or purchasability) of the reactants or generalize to unseen reaction templates (i.e., chemical reaction rules). In this… ▽ More

    Submitted 3 June, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted to IJCAI 2021. Short version was accepted to Machine Learning for Molecules Workshop at NeurIPS 2020

  42. arXiv:2104.08314  [pdf, other

    cs.CV

    High Performance Convolution Using Sparsity and Patterns for Inference in Deep Convolutional Neural Networks

    Authors: Hossam Amer, Ahmed H. Salamah, Ahmad Sajedi, En-hui Yang

    Abstract: Deploying deep Convolutional Neural Networks (CNNs) is impacted by their memory footprint and speed requirements, which mainly come from convolution. Widely-used convolution algorithms, im2col and MEC, produce a lowered matrix from an activation map by redundantly storing the map's elements included at horizontal and/or vertical kernel overlap**s without considering the sparsity of the map. Usin… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 34 pages

  43. arXiv:2103.14302  [pdf, other

    cs.CL cs.SD eess.AS

    Mutually-Constrained Monotonic Multihead Attention for Online ASR

    Authors: Jaeyun Song, Ha** Shim, Eunho Yang

    Abstract: Despite the feature of real-time decoding, Monotonic Multihead Attention (MMA) shows comparable performance to the state-of-the-art offline methods in machine translation and automatic speech recognition (ASR) tasks. However, the latency of MMA is still a major issue in ASR and should be combined with a technique that can reduce the test latency at inference time, such as head-synchronous beam sea… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at IEEE ICASSP 2021

  44. arXiv:2103.13151  [pdf, other

    cs.CV

    Learning Polar Encodings for Arbitrary-Oriented Ship Detection in SAR Images

    Authors: Yishan He, Fei Gao, Jun Wang, Amir Hussain, Erfu Yang, Huiyu Zhou

    Abstract: Common horizontal bounding box (HBB)-based methods are not capable of accurately locating slender ship targets with arbitrary orientations in synthetic aperture radar (SAR) images. Therefore, in recent years, methods based on oriented bounding box (OBB) have gradually received attention from researchers. However, most of the recently proposed deep learning-based methods for OBB detection encounter… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  45. arXiv:2103.01328  [pdf, other

    cs.CL

    ToxCCIn: Toxic Content Classification with Interpretability

    Authors: Tong Xiang, Sean MacAvaney, Eugene Yang, Nazli Goharian

    Abstract: Despite the recent successes of transformer-based models in terms of effectiveness on a variety of tasks, their decisions often remain opaque to humans. Explanations are particularly important for tasks like offensive language or toxicity detection on social media because a manual appeal process is often in place to dispute automatically flagged content. In this work, we propose a technique to imp… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Long paper accepted to WASSA2021@EACL

  46. arXiv:2102.03866  [pdf, other

    cs.LG

    Model-Augmented Q-learning

    Authors: Youngmin Oh, **woo Shin, Eunho Yang, Sung Ju Hwang

    Abstract: In recent years, $Q$-learning has become indispensable for model-free reinforcement learning (MFRL). However, it suffers from well-known problems such as under- and overestimation bias of the value, which may adversely affect the policy learning. To resolve this issue, we propose a MFRL framework that is augmented with the components of model-based RL. Specifically, we propose to estimate not only… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  47. arXiv:2102.02386  [pdf, other

    astro-ph.CO astro-ph.IM

    Analysis of Temperature-to-Polarization Leakage in BICEP3 and Keck CMB Data from 2016 to 2018

    Authors: The BICEP/Keck Collaboration, :, T. St. Germaine, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. R. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, S. Fliescher , et al. (64 additional authors not shown)

    Abstract: The BICEP/Keck Array experiment is a series of small-aperture refracting telescopes observing degree-scale Cosmic Microwave Background polarization from the South Pole in search of a primordial $B$-mode signature. As a pair differencing experiment, an important systematic that must be controlled is the differential beam response between the co-located, orthogonally polarized detectors. We use high… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 9 pages, 4 figures

    Journal ref: Proc. SPIE 11453, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy X, 114532E (15 December 2020)

  48. arXiv:2101.12409  [pdf, other

    cs.CL

    Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

    Authors: Shengsheng Zhang, Ya** Huang, Yun Chen, Liner Yang, Chencheng Wang, Erhong Yang

    Abstract: Most existing Grammatical Error Correction (GEC) methods based on sequence-to-sequence mainly focus on how to generate more pseudo data to obtain better performance. Few work addresses few-shot GEC domain adaptation. In this paper, we treat different GEC domains as different GEC tasks and propose to extend meta-learning to few-shot GEC domain adaptation without using any pseudo data. We exploit a… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

  49. arXiv:2101.12149  [pdf, other

    physics.comp-ph cs.DC physics.acc-ph

    Porting WarpX to GPU-accelerated platforms

    Authors: A. Myers, A. Almgren, L. D. Amorim, J. Bell, L. Fedeli, L. Ge, K. Gott, D. P. Grote, M. Hogan, A. Huebl, R. Jambunathan, R. Lehe, C. Ng, M. Rowan, O. Shapoval, M. Thévenet, J. -L. Vay, H. Vincenti, E. Yang, N. Zaïm, W. Zhang, Y. Zhao, E. Zoni

    Abstract: WarpX is a general purpose electromagnetic particle-in-cell code that was originally designed to run on many-core CPU architectures. We describe the strategy followed to allow WarpX to use the GPU-accelerated nodes on OLCF's Summit supercomputer, a strategy we believe will extend to the upcoming machines Frontier and Aurora. We summarize the challenges encountered, lessons learned, and give curren… ▽ More

    Submitted 2 September, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 figures, accepted by Parallel Computing. Minor revisions, results unchanged

    Journal ref: Parallel Computing, Volume 108, 2021, 102833

  50. arXiv:2101.09294  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Censorship of Online Encyclopedias: Implications for NLP Models

    Authors: Eddie Yang, Margaret E. Roberts

    Abstract: While artificial intelligence provides the backbone for many tools people use around the world, recent work has brought to attention that the algorithms powering AI are not free of politics, stereotypes, and bias. While most work in this area has focused on the ways in which AI can exacerbate existing inequalities and discrimination, very little work has studied how governments actively shape trai… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ACM FAccT 2021