Showing 1–50 of 163 results for author: Chung, H

  1. arXiv:2406.18561

    cs.CV cs.LG

    SelMatch: Effectively Scaling Up Dataset Distillation via Selection-Based Initialization and Partial Updates by Trajectory Matching

    Authors: Yongmin Lee, Hye Won Chung

    Abstract: Dataset distillation aims to synthesize a small number of images per class (IPC) from a large dataset to approximate full dataset training with minimal performance loss. While effective in very small IPC ranges, many distillation methods become less effective, even underperforming random sample selection, as IPC increases. Our examination of state-of-the-art trajectory-matching based distillation…

    Submitted 28 May, 2024; originally announced June 2024.

    Comments: ICML 2024

  2. arXiv:2406.13144

    cs.CL cs.AI

    DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents

    Authors: Jiho Kim, Woosog Chay, Hyeonji Hwang, Daeun Kyung, Hyunseung Chung, Eunbyeol Cho, Yohan Jo, Edward Choi

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly enhanced the capabilities of conversational agents, making them applicable to various fields (e.g., education). Despite their progress, the evaluation of the agents often overlooks the complexities of real-world conversations, such as real-time interactions, multi-party dialogues, and extended contextual dependencies. To bridge…

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.12632

    eess.IV cs.CV

    Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Image Synthesis: T1 MRI to Tau-PET

    Authors: Symac Kim, Junho Moon, Haejun Chung, Ikbeom Jang

    Abstract: Alzheimer's Disease (AD) is the most common form of dementia, characterised by cognitive decline and biomarkers such as tau-proteins. Tau-positron emission tomography (tau-PET), which employs a radiotracer to selectively bind, detect, and visualise tau protein aggregates within the brain, is valuable for early AD diagnosis but is less accessible due to high costs, limited availability, and its inv…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 24 pages, 5 figures

  4. arXiv:2406.08070

    cs.CV cs.AI cs.LG

    CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

    Authors: Hyungjin Chung, Jeongsol Kim, Geon Yeong Park, Hyelin Nam, Jong Chul Ye

    Abstract: Classifier-free guidance (CFG) is a fundamental tool in modern diffusion models for text-guided generation. Although effective, CFG has notable drawbacks. For instance, DDIM with CFG lacks invertibility, complicating image editing; furthermore, high guidance scales, essential for high-quality outputs, frequently result in issues like mode collapse. Contrary to the widespread belief that these are…

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2406.03057

    cs.LG stat.ML

    BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

    Authors: Hoyong Choi, Nohyun Ki, Hye Won Chung

    Abstract: Data subset selection aims to find a smaller yet informative subset of a large dataset that can approximate the full-dataset training, addressing challenges associated with training neural networks on large-scale datasets. However, existing methods tend to specialize in either high or low selection ratio regimes, lacking a universal approach that consistently achieves competitive performance acros…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  6. arXiv:2405.18698

    cs.LG cs.AI

    Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

    Authors: Dohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee, Songhwai Oh

    Abstract: The field of risk-constrained reinforcement learning (RCRL) has been developed to effectively reduce the likelihood of worst-case scenarios by explicitly handling risk-measure-based constraints. However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality. To overcome the difficulties posed by the nonlinearity, we propose a spectral risk measure-constrained…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 26 pages

  7. arXiv:2405.03820

    cs.CY cs.AI cs.HC

    False Sense of Security in Explainable Artificial Intelligence (XAI)

    Authors: Neo Christopher Chung, Hongkyou Chung, Hearim Lee, Lennart Brocki, Hongbeom Chung, George Dyer

    Abstract: A cautious interpretation of AI regulations and policy in the EU and the USA place explainability as a central deliverable of compliant AI systems. However, from a technical perspective, explainable AI (XAI) remains an elusive and complex target where even state of the art methods often reach erroneous, misleading, and incomplete explanations. "Explainability" has multiple meanings which are often…

    Submitted 13 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: AI Governance Workshop at the 2024 International Joint Conference on Artificial Intelligence (IJCAI)

  8. arXiv:2404.13645

    cs.CL

    PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure

    Authors: Feiqi Cao, Caren Han, Hyunsuk Chung

    Abstract: In this work, we propose a novel tree-based explanation technique, PEACH (Pretrained-embedding Explanation Across Contextual and Hierarchical Structure), that can explain how text-based documents are classified by using any pretrained contextual embeddings in a tree-based human-interpretable manner. Note that PEACH can adopt any contextual embeddings of the PLMs as a training input for the decisio…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted at IJCAI 2024

  9. arXiv:2404.01216

    cs.LG cs.SI stat.ML

    Novel Node Category Detection Under Subpopulation Shift

    Authors: Hsing-Huan Chung, Shravan Chaudhari, Yoav Wald, Xing Han, Joydeep Ghosh

    Abstract: In real-world graph data, distribution shifts can manifest in various ways, such as the emergence of new categories and changes in the relative proportions of existing categories. It is often important to detect nodes of novel categories under such distribution shifts for safety or insight discovery purposes. We introduce a new approach, Recall-Constrained Optimization with Selective Link Predicti…

    Submitted 30 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to ECML-PKDD 2024

  10. arXiv:2402.18362

    cs.CV cs.AI

    Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model

    Authors: Sangjoon Park, Yong Bae Kim, Jee Suk Chang, Seo Hee Choi, Hyungjin Chung, Ik Jae Lee, Hwa Kyung Byun

    Abstract: As advancements in the field of breast cancer treatment continue to progress, the assessment of post-surgical cosmetic outcomes has gained increasing significance due to its substantial impact on patients' quality of life. However, evaluating breast cosmesis presents challenges due to the inherently subjective nature of expert labeling. In this study, we present a novel automated approach, Attenti…

    Submitted 28 February, 2024; originally announced February 2024.

  11. arXiv:2402.17896

    cs.CL cs.AI

    Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

    Authors: Corby Rosset, Ho-Lam Chung, Guanghui Qin, Ethan C. Chau, Zhuo Feng, Ahmed Awadallah, Jennifer Neville, Nikhil Rao

    Abstract: Existing question answering (QA) datasets are no longer challenging to most powerful Large Language Models (LLMs). Traditional QA benchmarks like TriviaQA, NaturalQuestions, ELI5 and HotpotQA mainly study "known unknowns" with clear indications of both what information is missing, and how to find it to answer the question. Hence, good performance on these benchmarks provides a false sense of sec…

    Submitted 27 February, 2024; originally announced February 2024.

  12. arXiv:2402.13236

    eess.AS cs.SD

    Towards audio language modeling -- an overview

    Authors: Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-wei Chang, Ho-Lam Chung, Alexander H. Liu, Hung-yi Lee

    Abstract: Neural audio codecs are initially introduced to compress audio data into compact codes to reduce transmission latency. Researchers recently discovered the potential of codecs as suitable tokenizers for converting continuous audio into discrete codes, which can be employed to develop audio language models (LMs). Numerous high-performance neural audio codecs and codec-based LMs have been developed.…

    Submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2402.13071

    eess.AS cs.SD

    Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

    Authors: Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee

    Abstract: The sound codec's dual roles in minimizing data transmission latency and serving as tokenizers underscore its critical importance. Recent years have witnessed significant developments in codec models. The ideal sound codec should preserve content, paralinguistics, speakers, and audio information. However, the question of which codec achieves optimal sound information preservation remains unanswere…

    Submitted 7 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/voidful/Codec-SUPERB

  14. arXiv:2402.13061

    cs.CV

    Toward Fairness via Maximum Mean Discrepancy Regularization on Logits Space

    Authors: Hao-Wei Chung, Ching-Hao Chiu, Yu-Jen Chen, Yiyu Shi, Tsung-Yi Ho

    Abstract: Fairness has become increasingly pivotal in machine learning for high-risk applications such as machine learning in healthcare and facial recognition. However, we see the deficiency in the previous logits space constraint methods. Therefore, we propose a novel framework, Logits-MMD, that achieves the fairness condition by imposing constraints on output logits with Maximum Mean Discrepancy. Moreove…

    Submitted 20 February, 2024; originally announced February 2024.

  15. arXiv:2402.10482

    cs.LG stat.ML

    Understanding Self-Distillation and Partial Label Learning in Multi-Class Classification with Label Noise

    Authors: Hyeonsu Jeong, Hye Won Chung

    Abstract: Self-distillation (SD) is the process of training a student model using the outputs of a teacher model, with both models sharing the same architecture. Our study theoretically examines SD in multi-class classification with cross-entropy loss, exploring both multi-round SD and SD with refined teacher outputs, inspired by partial label learning (PLL). By deriving a closed-form solution for the stude…

    Submitted 16 February, 2024; originally announced February 2024.

  16. Collusion-Resilience in Transaction Fee Mechanism Design

    Authors: Hao Chung, Tim Roughgarden, Elaine Shi

    Abstract: Users bid in a transaction fee mechanism (TFM) to get their transactions included and confirmed by a blockchain protocol. Roughgarden (EC'21) initiated the formal treatment of TFMs and proposed three requirements: user incentive compatibility (UIC), miner incentive compatibility (MIC), and a form of collusion-resilience called OCA-proofness. Ethereum's EIP-1559 mechanism satisfies all three proper…

    Submitted 19 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  17. arXiv:2401.00773

    cs.LG cs.AI stat.ML

    Unsupervised Outlier Detection using Random Subspace and Subsampling Ensembles of Dirichlet Process Mixtures

    Authors: Dongwook Kim, Juyeon Park, Hee Cheol Chung, Seonghyun Jeong

    Abstract: Probabilistic mixture models are acknowledged as a valuable tool for unsupervised outlier detection owing to their interpretability and intuitive grounding in statistical principles. Within this framework, Dirichlet process mixture models emerge as a compelling alternative to conventional finite mixture models for both clustering and outlier detection tasks. However, despite their evident advantag…

    Submitted 13 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  18. arXiv:2312.09781

    cs.CL cs.AI

    GSQA: An End-to-End Model for Generative Spoken Question Answering

    Authors: Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai, Ming-Hao Hsu, Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee

    Abstract: In recent advancements in spoken question answering (QA), end-to-end models have made significant strides. However, previous research has primarily focused on extractive span selection. While this extractive-based approach is effective when answers are present directly within the input, it falls short in addressing abstractive questions, where answers are not directly extracted but inferred from t…

    Submitted 25 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, submitted to ICASSP 2024

  19. arXiv:2312.02480

    cs.CV

    Differentiable Point-based Inverse Rendering

    Authors: Hoon-Gyu Chung, Seokjun Choi, Seung-Hwan Baek

    Abstract: We present differentiable point-based inverse rendering, DPIR, an analysis-by-synthesis method that processes images captured under diverse illuminations to estimate shape and spatially-varying BRDF. To this end, we adopt point-based rendering, eliminating the need for multiple samplings per ray, typical of volumetric rendering, thus significantly enhancing the speed of inverse rendering. To reali…

    Submitted 25 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  20. arXiv:2311.15658

    cs.CV cs.AI cs.LG

    Regularization by Texts for Latent Diffusion Inverse Solvers

    Authors: Jeongsol Kim, Geon Yeong Park, Hyungjin Chung, Jong Chul Ye

    Abstract: The recent advent of diffusion models has led to significant progress in solving inverse problems, leveraging these models as effective generative priors. Nonetheless, there remain challenges related to the ill-posed nature of such problems, often due to inherent ambiguities in measurements or intrinsic system symmetries. To address this, drawing inspiration from the human ability to resolve visua…

    Submitted 16 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  21. arXiv:2311.12328

    quant-ph astro-ph.IM cs.AI cs.PF

    Quantum-Enhanced Support Vector Machine for Large-Scale Stellar Classification with GPU Acceleration

    Authors: Kuan-Cheng Chen, Xiaotian Xu, Henry Makhanov, Hui-Hsuan Chung, Chen-Yu Liu

    Abstract: In this study, we introduce an innovative Quantum-enhanced Support Vector Machine (QSVM) approach for stellar classification, leveraging the power of quantum computing and GPU acceleration. Our QSVM algorithm significantly surpasses traditional methods such as K-Nearest Neighbors (KNN) and Logistic Regression (LR), particularly in handling complex binary and multi-class scenarios within the Harvar…

    Submitted 20 November, 2023; originally announced November 2023.

  22. arXiv:2310.01110

    cs.LG cs.AI cs.CV stat.ML

    Prompt-tuning latent diffusion models for inverse problems

    Authors: Hyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio

    Abstract: We propose a new method for solving imaging inverse problems using text-to-image latent diffusion models as general priors. Existing methods using latent diffusion models for inverse problems typically rely on simple null text prompts, which can lead to suboptimal performance. To address this limitation, we introduce a method for prompt tuning, which jointly optimizes the text embedding on-the-fly…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 22 pages, 10 figures

  23. arXiv:2309.17347

    cs.LG cs.CY

    Demographic Parity: Mitigating Biases in Real-World Data

    Authors: Orestis Loukas, Ho-Ryun Chung

    Abstract: Computer-based decision systems are widely used to automate decisions in many aspects of everyday life, which include sensitive areas like hiring, loaning and even criminal sentencing. A decision pipeline heavily relies on large volumes of historical real-world data for training its models. However, historical training data often contains gender, racial or other biases which are propagated to the…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 24 pages, 16 Figures, Python code attached

  24. arXiv:2309.14324

    eess.AS cs.CL cs.LG cs.SD

    Towards General-Purpose Text-Instruction-Guided Voice Conversion

    Authors: Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-yiin Chang, Hung-yi Lee

    Abstract: This paper introduces a novel voice conversion (VC) model, guided by text instructions such as "articulate slowly with a deep tone" or "speak in a cheerful boyish voice". Unlike traditional methods that rely on reference utterances to determine the attributes of the converted speech, our model adds versatility and specificity to voice conversion. The proposed VC model is a neural codec language mo…

    Submitted 16 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to ASRU 2023

  25. arXiv:2309.05182

    cs.IT cs.DS

    Graph Matching in Correlated Stochastic Block Models for Improved Graph Clustering

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: We consider community detection from multiple correlated graphs sharing the same community structure. The correlated graphs are generated by independent subsampling of a parent graph sampled from the stochastic block model. The vertex correspondence between the correlated graphs is assumed to be unknown. We consider the two-step procedure where the vertex correspondence between the correlated grap…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Allerton Conference 2023

  26. arXiv:2308.14409

    cs.CV cs.LG

    Steerable Conditional Diffusion for Out-of-Distribution Adaptation in Imaging Inverse Problems

    Authors: Riccardo Barbano, Alexander Denker, Hyungjin Chung, Tae Hoon Roh, Simon Arridge, Peter Maass, Bangti Jin, Jong Chul Ye

    Abstract: Denoising diffusion models have emerged as the go-to framework for solving inverse problems in imaging. A critical concern regarding these models is their performance on out-of-distribution (OOD) tasks, which remains an under-explored challenge. Realistic reconstructions inconsistent with the measured data can be generated, hallucinating image features that are uniquely present in the training dat…

    Submitted 28 August, 2023; originally announced August 2023.

  27. arXiv:2308.11140

    cs.CV

    High Dynamic Range Imaging of Dynamic Scenes with Saturation Compensation but without Explicit Motion Compensation

    Authors: Haesoo Chung, Nam Ik Cho

    Abstract: High dynamic range (HDR) imaging is a highly challenging task since a large amount of information is lost due to the limitations of camera sensors. For HDR imaging, some methods capture multiple low dynamic range (LDR) images with altering exposures to aggregate more information. However, these approaches introduce ghosting artifacts when significant inter-frame motions are present. Moreover, alth…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: WACV 2022

  28. arXiv:2308.11116

    cs.CV

    LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction

    Authors: Haesoo Chung, Nam Ik Cho

    Abstract: As demands for high-quality videos continue to rise, high-resolution and high-dynamic range (HDR) imaging techniques are drawing attention. To generate an HDR video from low dynamic range (LDR) images, one of the critical steps is the motion compensation between LDR frames, for which most existing works employed the optical flow algorithm. However, these methods suffer from flow estimation errors…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  29. arXiv:2307.15208

    eess.IV cs.CV

    Generative AI for Medical Imaging: extending the MONAI Framework

    Authors: Walter H. L. Pinaya, Mark S. Graham, Eric Kerfoot, Petru-Daniel Tudosiu, Jessica Dafflon, Virginia Fernandez, Pedro Sanchez, Julia Wolleb, Pedro F. da Costa, Ashay Patel, Hyungjin Chung, Can Zhao, Wei Peng, Zelong Liu, Xueyan Mei, Oeslle Lucena, Jong Chul Ye, Sotirios A. Tsaftaris, Prerna Dogra, Andrew Feng, Marc Modat, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Recent advances in generative AI have brought incredible breakthroughs in several areas, including medical imaging. These generative models have tremendous potential not only to help safely share medical data via synthetic datasets but also to perform an array of diverse applications, such as anomaly detection, image-to-image translation, denoising, and MRI reconstruction. However, due to the comp…

    Submitted 27 July, 2023; originally announced July 2023.

  30. Toward Fairness Through Fair Multi-Exit Framework for Dermatological Disease Diagnosis

    Authors: Ching-Hao Chiu, Hao-Wei Chung, Yu-Jen Chen, Yiyu Shi, Tsung-Yi Ho

    Abstract: Fairness has become increasingly pivotal in medical image recognition. However, without mitigating bias, deploying unfair medical AI systems could harm the interests of underprivileged populations. In this paper, we observe that while features extracted from the deeper layers of neural networks generally offer higher accuracy, fairness conditions deteriorate as we extract features from deeper laye…

    Submitted 1 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: MICCAI2023

  31. arXiv:2306.09600

    cs.RO

    Learning to Assist and Communicate with Novice Drone Pilots for Expert Level Performance

    Authors: Kal Backman, Dana Kulić, Hoam Chung

    Abstract: Multi-task missions for unmanned aerial vehicles (UAVs) involving inspection and landing tasks are challenging for novice pilots due to the difficulties associated with depth perception and the control interface. We propose a shared autonomy system, alongside supplementary information displays, to assist pilots to successfully complete multi-task missions without any pilot training. Our approach c…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 18 pages, 11 figures. Submitted to IEEE Transactions on Robotics (T-RO)

  32. arXiv:2305.19809

    cs.CV cs.AI cs.LG stat.ML

    Direct Diffusion Bridge using Data Consistency for Inverse Problems

    Authors: Hyungjin Chung, Jeongsol Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have shown impressive performance, but are limited in speed, mostly as they require reverse diffusion sampling starting from noise. Several recent works have tried to alleviate this problem by building a diffusion process, directly bridging the clean and the corrupted for specific inverse problems. In this paper, we first unify these existing works und…

    Submitted 24 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready. 16 pages, 6 figures

  33. arXiv:2305.19666

    cs.DS cs.LG cs.SI stat.ML

    Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation

    Authors: Joonhyuk Yang, Dongpil Shin, Hye Won Chung

    Abstract: We consider the problem of graph matching, or learning vertex correspondence, between two correlated stochastic block models (SBMs). The graph matching problem arises in various fields, including computer vision, natural language processing and bioinformatics, and in particular, matching graphs with inherent community structure has significance related to de-anonymization of correlated social netw…

    Submitted 2 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  34. arXiv:2305.16465

    eess.IV cs.CV q-bio.QM

    An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate Characterization of Tumor Immune Microenvironment

    Authors: Parmida Ghahremani, Joseph Marino, Juan Hernandez-Prera, Janis V. de la Iglesia, Robbert JC Slebos, Christine H. Chung, Saad Nadeem

    Abstract: We introduce a new AI-ready computational pathology dataset containing restained and co-registered digitized images from eight head-and-neck squamous cell carcinoma patients. Specifically, the same tumor sections were stained with the expensive multiplex immunofluorescence (mIF) assay first and then restained with cheaper multiplex immunohistochemistry (mIHC). This is a first public dataset that d…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: MICCAI'23 (Early Accept). First two authors contributed equally. Forward correspondence to last two authors

  35. arXiv:2305.14705

    cs.CL

    Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models

    Authors: Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

    Abstract: Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnable parameters to Large Language Models (LLMs) without increasing inference cost. Instruction tuning is a technique for training LLMs to follow instructions. We advocate combining these two approaches, as we find that MoE models benefit more from instruction tuning than dense models. In particular, we…

    Submitted 5 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint

  36. arXiv:2305.10615

    cs.SD cs.CL eess.AS

    ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

    Authors: Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe

    Abstract: Speech processing Universal PERformance Benchmark (SUPERB) is a leaderboard to benchmark the performance of Self-Supervised Learning (SSL) models on various speech processing tasks. However, SUPERB largely considers English speech in its evaluation. This paper presents multilingual SUPERB (ML-SUPERB), covering 143 languages (ranging from high-resource to endangered), and considering both automatic…

    Submitted 11 August, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech

  37. arXiv:2305.01506

    cs.CV cs.AI cs.LG

    Discovering the Effectiveness of Pre-Training in a Large-scale Car-sharing Platform

    Authors: Kyung Ho Park, Hyunhee Chung

    Abstract: Recent progress of deep learning has empowered various intelligent transportation applications, especially in car-sharing platforms. While the traditional operations of the car-sharing service highly relied on human engagements in fleet management, modern car-sharing platforms let users upload car images before and after their use to inspect the cars without a physical visit. To automate the afore…

    Submitted 2 May, 2023; originally announced May 2023.

  38. arXiv:2304.09151

    cs.CL

    UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining

    Authors: Hyung Won Chung, Noah Constant, Xavier Garcia, Adam Roberts, Yi Tay, Sharan Narang, Orhan Firat

    Abstract: Pretrained multilingual large language models have typically used heuristic temperature-based sampling to balance between different languages. However previous work has not systematically evaluated the efficacy of different pretraining language distributions across model scales. In this paper, we propose a new sampling method, UniMax, that delivers more uniform coverage of head languages while mit…

    Submitted 18 April, 2023; originally announced April 2023.

  39. arXiv:2304.06447

    cs.CV cs.CL

    PDFVQA: A New Dataset for Real-World VQA on PDF Documents

    Authors: Yihao Ding, Siwen Luo, Hyunsuk Chung, Soyeon Caren Han

    Abstract: Document-based Visual Question Answering examines the document understanding of document images in conditions of natural language questions. We proposed a new document-based VQA dataset, PDF-VQA, to comprehensively examine the document understanding from various aspects, including document element recognition, document layout structural understanding as well as contextual understanding and key inf…

    Submitted 5 June, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted by ECML-PKDD 2023

  40. arXiv:2304.01577

    cs.IR

    Form-NLU: Dataset for the Form Natural Language Understanding

    Authors: Yihao Ding, Siqu Long, Jiabin Huang, Kaixuan Ren, Xingxiang Luo, Hyunsuk Chung, Soyeon Caren Han

    Abstract: Compared to general document analysis tasks, form document structure understanding and retrieval are challenging. Form documents are typically made by two types of authors; A form designer, who develops the form structure and keys, and a form user, who fills out form values based on the provided keys. Hence, the form values may not be aligned with the form designer's intention (structure and keys)…

    Submitted 2 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR 2023

  41. arXiv:2303.09395

    cs.CL cs.LG eess.SP

    Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports

    Authors: Hyunseung Chung, Jiho Kim, Joon-myoung Kwon, Ki-Hyun Jeon, Min Sung Lee, Edward Choi

    Abstract: Electrocardiogram (ECG) synthesis is the area of research focused on generating realistic synthetic ECG signals for medical use without concerns over annotation costs or clinical data privacy restrictions. Traditional ECG generation models consider a single ECG lead and utilize GAN-based generative models. These models can only generate single lead samples and require separate training for each di…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023 (5 pages, 3 figures, 4 tables)

  42. arXiv:2303.08774

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  43. arXiv:2303.08440  [pdf, other]

    eess.IV cs.CV cs.LG

    Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models

    Authors: Suhyeon Lee, Hyungjin Chung, Minyoung Park, Jonghyuk Park, Wi-Sun Ryu, Jong Chul Ye

    Abstract: Diffusion models have become a popular approach for image generation and reconstruction due to their numerous advantages. However, most diffusion-based inverse problem-solving methods only deal with 2D images, and even recently published 3D methods do not fully exploit the 3D distribution prior. To address this, we propose a novel approach using two perpendicular pre-trained 2D diffusion models to…

    Submitted 1 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: ICCV23 poster. 15 pages, 9 figures

  44. arXiv:2303.05754  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

    Authors: Hyungjin Chung, Suhyeon Lee, Jong Chul Ye

    Abstract: Krylov subspace, which is generated by multiplying a given vector by the matrix of a linear transformation and its successive powers, has been extensively studied in classical optimization literature to design algorithms that converge quickly for large linear inverse problems. For example, the conjugate gradient method (CG), one of the most popular Krylov subspace methods, is based on the idea of…

    Submitted 19 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: ICLR 2024; 28 pages, 9 figures

  45. arXiv:2302.12895  [pdf, ps, other]

    cs.GT

    Maximizing Miner Revenue in Transaction Fee Mechanism Design

    Authors: Ke Wu, Elaine Shi, Hao Chung

    Abstract: Transaction fee mechanism design is a new decentralized mechanism design problem where users bid for space on the blockchain. Several recent works showed that the transaction fee mechanism design fundamentally departs from classical mechanism design. They then systematically explored the mathematical landscape of this new decentralized mechanism design problem in two settings: in the plain setting…

    Submitted 21 April, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

  46. arXiv:2302.00836  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition

    Authors: HoLam Chung, Junan Li, Pengfei Liu, Wai-Kim Leung, Xixin Wu, Helen Meng

    Abstract: Homophone characters are common in tonal syllable-based languages, such as Mandarin and Cantonese. The data-intensive end-to-end Automatic Speech Recognition (ASR) systems are more likely to mis-recognize homophone characters and rare words under low-resource settings. For the problem of low-resource Cantonese speech recognition, this paper presents a novel homophone extension method to integrate h…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: The 13th International Symposium on Chinese Spoken Language Processing (ISCSLP 2022)

    Journal ref: Published in ISCSLP 2022

  47. arXiv:2301.13688  [pdf, other]

    cs.AI cs.CL cs.LG

    The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

    Authors: Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts

    Abstract: We study the design decisions of publicly available instruction tuning methods, and break down the development of Flan 2022 (Chung et al., 2022). Through careful ablation studies on the Flan Collection of tasks and methods, we tease apart the effect of design decisions which enable Flan-T5 to outperform prior work by 3-17%+ across evaluation settings. We find task balancing and enrichment techniqu…

    Submitted 14 February, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

  48. arXiv:2301.05331  [pdf, other]

    math.ST cs.LG math.PR stat.ML

    Detection problems in the spiked matrix models

    Authors: Ji Hyung Jung, Hye Won Chung, Ji Oon Lee

    Abstract: We study the statistical decision process of detecting the low-rank signal from various signal-plus-noise type data matrices, known as the spiked random matrix models. We first show that the principal component analysis can be improved by entrywise pre-transforming the data matrix if the noise is non-Gaussian, generalizing the known results for the spiked random matrix models with rank-1 signals.…

    Submitted 16 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: 80 pages, 6 figures. arXiv admin note: text overlap with arXiv:2104.13517

    MSC Class: 62H25; 62H15; 60B20

  49. arXiv:2301.02989  [pdf, other]

    cs.CV cs.AI cs.LG

    Fair Multi-Exit Framework for Facial Attribute Classification

    Authors: Ching-Hao Chiu, Hao-Wei Chung, Yu-Jen Chen, Yiyu Shi, Tsung-Yi Ho

    Abstract: Fairness has become increasingly pivotal in facial recognition. Without bias mitigation, deploying unfair AI would harm the interest of the underprivileged population. In this paper, we observe that although features from the deeper layers of a neural network generally offer higher accuracy, fairness conditions deteriorate as we extract features from deeper layers. This phenomenon motivates…

    Submitted 8 January, 2023; originally announced January 2023.

  50. arXiv:2301.00930  [pdf, other]

    cs.LG

    Data Valuation Without Training of a Model

    Authors: Nohyun Ki, Hoyong Choi, Hye Won Chung

    Abstract: Many recent works on understanding deep learning try to quantify how much individual data instances influence the optimization and generalization of a model. Such attempts reveal characteristics and importance of individual instances, which may provide useful information in diagnosing and improving deep learning. However, most of the existing works on data valuation require actual training of a mo…

    Submitted 7 March, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: ICLR 2023