Showing 1–25 of 25 results for author: Bang, J

  1. arXiv:2406.07007  [pdf, other]

    cs.CL

    Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference

    Authors: Jihwan Bang, Juntae Lee, Kyuhong Shim, Seunghan Yang, Simyung Chang

    Abstract: The customization of large language models (LLMs) for user-specified tasks is becoming increasingly important. However, maintaining all the customized LLMs on cloud servers incurs substantial memory and computational overheads, and uploading user data can also lead to privacy concerns. On-device LLMs can offer a promising solution by mitigating these issues. Yet, the performance of on-device LLMs is inherently constr…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Main

  2. arXiv:2403.05814  [pdf, other]

    cs.CL cs.AI

    MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs

    Authors: Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung

    Abstract: Despite advancements in on-topic dialogue systems, effectively managing topic shifts within dialogues remains a persistent challenge, largely attributed to the limited availability of training datasets. To address this issue, we propose Multi-Passage to Dialogue (MP2D), a data generation framework that automatically creates conversational question-answering datasets with natural topic transitions.…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 20 pages

  3. arXiv:2312.12391  [pdf, other]

    cs.LG cs.AI cs.AR

    vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training

    Authors: Jehyeon Bang, Yujeong Choi, Myeongwoo Kim, Yongdeok Kim, Minsoo Rhu

    Abstract: As large language models (LLMs) become widespread in various application domains, a critical challenge the AI community is facing is how to train these large AI models in a cost-effective manner. Existing LLM training plans typically employ a heuristic-based parallel training strategy that rests on empirical observations rather than a thorough examination of the search space of L…

    Submitted 27 November, 2023; originally announced December 2023.

  4. arXiv:2312.08677  [pdf, other]

    cs.LG cs.AI

    Adaptive Shortcut Debiasing for Online Continual Learning

    Authors: Doyoung Kim, Dongmin Park, Yooju Shin, Jihwan Bang, Hwanjun Song, Jae-Gil Lee

    Abstract: We propose a novel framework, DropTop, that suppresses the shortcut bias in online continual learning (OCL) while being adaptive to the varying degree of the shortcut bias incurred by a continuously changing environment. By the observed high-attention property of the shortcut bias, highly-activated features are considered candidates for debiasing. More importantly, resolving the limitation of the onli…

    Submitted 14 December, 2023; originally announced December 2023.

  5. arXiv:2311.12048  [pdf, other]

    cs.LG

    One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning

    Authors: Doyoung Kim, Susik Yoon, Dongmin Park, Youngjun Lee, Hwanjun Song, Jihwan Bang, Jae-Gil Lee

    Abstract: In real-world continual learning scenarios, tasks often exhibit intricate and unpredictable semantic shifts, posing challenges for fixed prompt management strategies. We identify the inadequacy of universal and specific prompting in handling these dynamic shifts. Universal prompting is ineffective for tasks with abrupt semantic changes, while specific prompting struggles with overfitting under mil…

    Submitted 18 November, 2023; originally announced November 2023.

  6. arXiv:2311.11178  [pdf, other]

    cs.CV

    Active Prompt Learning in Vision Language Models

    Authors: Jihwan Bang, Sumyeong Ahn, Jae-Gil Lee

    Abstract: Pre-trained Vision Language Models (VLMs) have demonstrated notable progress in various zero-shot tasks, such as classification and retrieval. Despite their performance, because improving performance on new tasks requires task-specific knowledge, their adaptation is essential. While labels are needed for the adaptation, acquiring them is typically expensive. To overcome this challenge, active lear…

    Submitted 21 March, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: accepted at CVPR 2024

  7. arXiv:2311.07589  [pdf, other]

    cs.CL cs.AI

    Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources

    Authors: Yerin Hwang, Yongil Kim, Hyunkyung Bae, Jeesoo Bang, Hwanhee Lee, Kyomin Jung

    Abstract: To address the data scarcity issue in Conversational question answering (ConvQA), a dialog inpainting method, which utilizes documents to generate ConvQA datasets, has been proposed. However, the original dialog inpainting model is trained solely on the dialog reconstruction task, resulting in the generation of questions with low contextual relevance due to insufficient learning of question-answer…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 main conference

  8. arXiv:2308.15053  [pdf, other]

    cs.CL cs.AI

    Adapting Text-based Dialogue State Tracker for Spoken Dialogues

    Authors: Jaeseok Yoon, Seunghyun Hwang, Ran Han, Jeonguk Bang, Kee-Eung Kim

    Abstract: Although there have been remarkable advances in dialogue systems through the dialogue systems technology competition (DSTC), building a robust task-oriented dialogue system with a speech interface remains one of the key challenges. Most of the progress has been made for text-based dialogue systems since there are abundant datasets with written corpora while those with spoken dialogues are ve…

    Submitted 9 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures, In Proceedings of The Eleventh Dialog System Technology Challenge, Association for Computational Linguistics

  9. arXiv:2303.14386  [pdf, other]

    cs.CV

    Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection

    Authors: Hwanjun Song, Jihwan Bang

    Abstract: Prompt-OVD is an efficient and effective framework for open-vocabulary object detection that utilizes class embeddings from CLIP as prompts, guiding the Transformer decoder to detect objects in both base and novel classes. Additionally, our novel RoI-based masked attention and RoI pruning techniques help leverage the zero-shot classification ability of the Vision Transformer-based CLIP, resulting…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: version 1

  10. arXiv:2212.11151  [pdf, other]

    cs.LO

    Template-Based Conjecturing for Automated Induction in Isabelle/HOL

    Authors: Yutaka Nagashima, Zijin Xu, Ningli Wang, Daniel Sebastian Goc, James Bang

    Abstract: Proof by induction plays a central role in formal verification. However, its automation remains a formidable challenge in Computer Science. To solve inductive problems, human engineers often have to provide auxiliary lemmas manually. We automate this laborious process with template-based conjecturing, a novel approach to generate auxiliary lemmas and use them to prove final goals. Our evaluatio…

    Submitted 19 January, 2023; v1 submitted 20 November, 2022; originally announced December 2022.

    Comments: To appear at Fundamentals of Software engineering 2023 (http://fsen.ir/2023/)

  11. arXiv:2210.07805  [pdf, other]

    cs.LG

    Meta-Query-Net: Resolving Purity-Informativeness Dilemma in Open-set Active Learning

    Authors: Dongmin Park, Yooju Shin, Jihwan Bang, Youngjun Lee, Hwanjun Song, Jae-Gil Lee

    Abstract: Unlabeled data examples awaiting annotations contain open-set noise inevitably. A few active learning studies have attempted to deal with this open-set noise for sample selection by filtering out the noisy examples. However, because focusing on the purity of examples in a query set leads to overlooking the informativeness of the examples, the best balancing of purity and informativeness remains an…

    Submitted 11 January, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: published in NeurIPS 2022

  12. arXiv:2207.03858  [pdf, other]

    cs.CL

    DSTEA: Improving Dialogue State Tracking via Entity Adaptive Pre-training

    Authors: Yukyung Lee, Takyoung Kim, Hoonsang Yoon, Pilsung Kang, Junseong Bang, Misuk Kim

    Abstract: Dialogue State Tracking (DST) is critical for comprehensively interpreting user and system utterances, thereby forming the cornerstone of efficient dialogue systems. Despite past research efforts focused on enhancing DST performance through alterations to the model structure or integrating additional features like graph relations, they often require additional pre-training with external dialogue c…

    Submitted 23 July, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Journal ref: KnowledgeNLP@KDD2023

  13. arXiv:2203.15355  [pdf, other]

    cs.CV cs.AI cs.LG

    Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries

    Authors: Jihwan Bang, Hyunseo Koh, Seulki Park, Hwanjun Song, Jung-Woo Ha, Jonghyun Choi

    Abstract: Learning under a continuously changing data distribution with incorrect labels is a desirable yet challenging real-world problem. A large body of continual learning (CL) methods, however, assumes data streams with clean labels, and online learning scenarios under noisy data streams are as yet underexplored. We consider a more practical CL task setup of online learning from a blurry data stream with…

    Submitted 30 March, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted paper at CVPR 2022

  14. arXiv:2112.07208  [pdf]

    cs.NE

    Interpretable Convolutional Neural Networks for Subject-Independent Motor Imagery Classification

    Authors: Ji-Seon Bang, Seong-Whan Lee

    Abstract: Deep learning frameworks have become increasingly popular in brain-computer interface (BCI) studies thanks to their outstanding performance. However, in terms of the classification model alone, they are treated as a black box, as they do not provide any information on what led them to reach a particular decision. In other words, we cannot be sure whether the high performance was caused by the neuro-p…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Submitted to IEEE 10th International Winter Conference on Brain-Computer Interface (BCI 2022)

  15. arXiv:2109.01212  [pdf, other]

    cs.DC cs.CV

    A Reliable, Self-Adaptive Face Identification Framework via Lyapunov Optimization

    Authors: Dohyeon Kim, Joongheon Kim, Jae young Bang

    Abstract: Realtime face identification (FID) from a video feed is highly computation-intensive, and may exhaust computation resources if performed on a device with a limited amount of resources (e.g., a mobile device). In general, FID performs better when images are sampled at a higher rate, minimizing false negatives. However, performing it at an overwhelmingly high rate exposes the system to the risk of a…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: This paper was presented at ACM Symposium on Operating Systems Principles (SOSP) Workshop on AI Systems (AISys), Shanghai, China, October 2017

  16. arXiv:2108.12637  [pdf, other]

    cs.CL

    Oh My Mistake!: Toward Realistic Dialogue State Tracking including Turnback Utterances

    Authors: Takyoung Kim, Yukyung Lee, Hoonsang Yoon, Pilsung Kang, Junseong Bang, Misuk Kim

    Abstract: The primary purpose of dialogue state tracking (DST), a critical component of an end-to-end conversational system, is to build a model that responds well to real-world situations. Although we often change our minds from time to time during ordinary conversations, current benchmark datasets do not adequately reflect such occurrences and instead consist of over-simplified conversations, in which no…

    Submitted 12 October, 2022; v1 submitted 28 August, 2021; originally announced August 2021.

    Comments: SereTOD Workshop at EMNLP 2022

  17. arXiv:2107.07062  [pdf]

    cs.NE

    Motor Imagery Classification based on CNN-GRU Network with Spatio-Temporal Feature Representation

    Authors: Ji-Seon Bang, Seong-Whan Lee

    Abstract: Recently, various deep neural networks have been applied to classify electroencephalogram (EEG) signals. EEG is a brain signal that can be acquired in a non-invasive way and has a high temporal resolution. It can be used to decode the intention of users. As the EEG signal has a high-dimensional feature space, appropriate feature extraction methods are needed to improve classification performance.…

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: Submitted to IAPR 6th Asian Conference on Pattern Recognition (ACPR 2021)

  18. Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning

    Authors: Pramod Chunduri, Jaeho Bang, Yao Lu, Joy Arulraj

    Abstract: Detection and localization of actions in videos is an important problem in practice. State-of-the-art video analytics systems are unable to efficiently and effectively answer such action queries because actions often involve a complex interaction between objects and are spread across a sequence of frames; detecting and localizing them requires computationally expensive deep neural networks. It is…

    Submitted 27 September, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

    Journal ref: In Proceedings of the 2022 International Conference on Management of Data (SIGMOD '22). Philadelphia, PA, USA, 545-558

  19. arXiv:2104.01671  [pdf, other]

    cs.DB

    EKO: Adaptive Sampling of Compressed Video Data

    Authors: Jaeho Bang, Pramod Chunduri, Joy Arulraj

    Abstract: Researchers have presented systems for efficiently analysing video data at scale using sampling algorithms. While these systems effectively leverage the temporal redundancy present in videos, they suffer from three limitations. First, they use traditional video storage formats that are tailored for human consumption. Second, they load and decode the entire compressed video in memory before applying the…

    Submitted 4 April, 2021; originally announced April 2021.

  20. arXiv:2103.17230  [pdf, other]

    cs.CV cs.LG

    Rainbow Memory: Continual Learning with a Memory of Diverse Samples

    Authors: Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi

    Abstract: Continual learning is a realistic learning scenario for AI models. The prevalent scenario of continual learning, however, assumes disjoint sets of classes as tasks and is artificial rather than realistic. Instead, we focus on the 'blurry' task boundary, where tasks share classes, which is more realistic and practical. To address such a task, we argue the importance of diversity of samples in an episodic memor…

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted paper at CVPR 2021

  21. arXiv:2006.11021  [pdf, other]

    eess.AS cs.LG

    Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples

    Authors: Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha

    Abstract: The cost of annotating transcriptions for large speech corpora becomes a bottleneck to fully exploiting the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training pipeline boosting the conventional active learning approach targeting label-efficient learning to resolve the mentioned problem. Existing active learning methods only…

    Submitted 5 November, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 8 pages, 4 figures, 2 tables

  22. arXiv:1911.09099  [pdf, other]

    cs.CV

    SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder

    Authors: Hyojin Park, Lars Lowe Sjösund, YoungJoon Yoo, Nicolas Monet, Jihwan Bang, Nojun Kwak

    Abstract: Designing a lightweight and robust portrait segmentation algorithm is an important task for a wide range of face applications. However, the problem has been considered as a subset of the object segmentation problem and less handled in the semantic segmentation field. Obviously, portrait segmentation has its unique requirements. First, because the portrait segmentation is performed in the middle of…

    Submitted 9 February, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: https://github.com/HYOJINPARK/ExtPortraitSeg. arXiv admin note: text overlap with arXiv:1908.03093

  23. arXiv:1908.03093  [pdf, other]

    cs.CV

    ExtremeC3Net: Extreme Lightweight Portrait Segmentation Networks using Advanced C3-modules

    Authors: Hyojin Park, Lars Lowe Sjösund, YoungJoon Yoo, Jihwan Bang, Nojun Kwak

    Abstract: Designing a lightweight and robust portrait segmentation algorithm is an important task for a wide range of face applications. However, the problem has been considered as a subset of the object segmentation problem. Obviously, portrait segmentation has its unique requirements. First, because the portrait segmentation is performed in the middle of a whole process of many real-world applications, it r…

    Submitted 9 December, 2019; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: https://github.com/HYOJINPARK/ExtPortraitSeg

  24. arXiv:1603.08604  [pdf, other]

    cs.LG cs.CE

    Classification-based Financial Markets Prediction using Deep Neural Networks

    Authors: Matthew Dixon, Diego Klabjan, Jin Hoon Bang

    Abstract: Deep neural networks (DNNs) are powerful types of artificial neural networks (ANNs) that use several hidden layers. They have recently gained considerable attention in the speech transcription and image recognition community (Krizhevsky et al., 2012) for their superior predictive properties including robustness to overfitting. However, their application to algorithmic trading has not been previousl…

    Submitted 13 June, 2017; v1 submitted 28 March, 2016; originally announced March 2016.

  25. arXiv:0911.0971  [pdf, ps, other]

    cs.IT

    Multicell Zero-Forcing and User Scheduling on the Downlink of a Linear Cell Array

    Authors: H. J. Bang, D. Gesbert

    Abstract: Coordinated base station (BS) transmission has attracted much interest for its potential to increase the capacity of wireless networks. Yet at the same time, the achievable sum-rate with single-cell processing (SCP) scales optimally with the number of users under Rayleigh fading conditions. One may therefore ask if the value of BS coordination is limited in the many-user regime from a sum-rate p…

    Submitted 6 November, 2009; v1 submitted 5 November, 2009; originally announced November 2009.

    Comments: 15 pages, 3 figures