Showing 1–19 of 19 results for author: Heo, S

  1. arXiv:2406.16521  [pdf, other]

    cs.CL cs.AI

    Carrot and Stick: Inducing Self-Motivation with Positive & Negative Feedback

    Authors: Jimin Sohn, Jeihee Cho, Junyong Lee, Songmu Heo, Ji-Eun Han, David R. Mortensen

    Abstract: Positive thinking is thought to be an important component of self-motivation in various practical fields such as education and the workplace. Previous work, including sentiment transfer and positive reframing, has focused on the positive side of language. However, self-motivation that drives people to reach their goals has not yet been studied from a computational perspective. Moreover, negative f…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages, 8 figures

  2. TinySeg: Model Optimizing Framework for Image Segmentation on Tiny Embedded Systems

    Authors: Byungchul Chae, Jiae Kim, Seonyeong Heo

    Abstract: Image segmentation is one of the major computer vision tasks, which is applicable in a variety of domains, such as autonomous navigation of an unmanned aerial vehicle. However, image segmentation cannot easily materialize on tiny embedded systems because image segmentation models generally have high peak memory usage due to their architectural characteristics. This work finds that image segmentati…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: LCTES 2024

  3. arXiv:2311.10792  [pdf]

    cs.LG cs.AI stat.AP

    Enhancing Data Efficiency and Feature Identification for Lithium-Ion Battery Lifespan Prediction by Deciphering Interpretation of Temporal Patterns and Cyclic Variability Using Attention-Based Models

    Authors: Jaewook Lee, Seongmin Heo, Jay H. Lee

    Abstract: Accurately predicting the lifespan of lithium-ion batteries is crucial for optimizing operational strategies and mitigating risks. While numerous studies have aimed at predicting battery lifespan, few have examined the interpretability of their models or how such insights could improve predictions. Addressing this gap, we introduce three innovative models that integrate shallow attention layers in…

    Submitted 11 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2311.07163  [pdf, other]

    cs.CV cs.AI

    Enhancing Lightweight Neural Networks for Small Object Detection in IoT Applications

    Authors: Liam Boyle, Nicolas Baumann, Seonyeong Heo, Michele Magno

    Abstract: Advances in lightweight neural networks have revolutionized computer vision in a broad range of IoT applications, encompassing remote monitoring and process automation. However, the detection of small objects, which is crucial for many of these applications, remains an underexplored area in current computer vision research, particularly for embedded devices. To address this gap, the paper proposes…

    Submitted 13 November, 2023; originally announced November 2023.

  5. arXiv:2211.00437  [pdf, other]

    eess.AS cs.SD

    Disentangled representation learning for multilingual speaker recognition

    Authors: Kihyun Nam, Youkyum Kim, Jaesung Huh, Hee Soo Heo, Jee-weon Jung, Joon Son Chung

    Abstract: The goal of this paper is to learn robust speaker representation for bilingual speaking scenario. The majority of the world's population speak at least two languages; however, most speaker recognition systems fail to recognise the same speaker when speaking in different languages. Popular speaker recognition evaluation sets do not consider the bilingual scenario, making it difficult to analyse t…

    Submitted 6 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Interspeech 2023

  6. arXiv:2210.01126  [pdf]

    cs.LG

    Wheel Impact Test by Deep Learning: Prediction of Location and Magnitude of Maximum Stress

    Authors: Seungyeon Shin, Ah-hyeon Jin, Soyoung Yoo, Sunghee Lee, ChangGon Kim, Sungpil Heo, Namwoo Kang

    Abstract: For ensuring vehicle safety, the impact performance of wheels during wheel development must be ensured through a wheel impact test. However, manufacturing and testing a real wheel requires a significant time and money because developing an optimal wheel design requires numerous iterative processes to modify the wheel design and verify the safety performance. Accordingly, wheel impact tests have be…

    Submitted 18 December, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

  7. Enjoy the Ride Consciously with CAWA: Context-Aware Advisory Warnings for Automated Driving

    Authors: Erfan Pakdamanian, Erzhen Hu, Shili Sheng, Sarit Kraus, Seongkook Heo, Lu Feng

    Abstract: In conditionally automated driving, drivers decoupled from driving while immersed in non-driving-related tasks (NDRTs) could potentially either miss the system-initiated takeover request (TOR) or a sudden TOR may startle them. To better prepare drivers for a safer takeover in an emergency, we propose novel context-aware advisory warnings (CAWA) for automated driving to gently inform drivers. This…

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: Proceeding of the 14th International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI '22)

  8. Continuous Facial Motion Deblurring

    Authors: Tae Bok Lee, Sujy Han, Yong Seok Heo

    Abstract: We introduce a novel framework for continuous facial motion deblurring that restores the continuous sharp moment latent in a single motion-blurred face image via a moment control factor. Although a motion-blurred image is the accumulated signal of continuous sharp moments during the exposure time, most existing single image deblurring approaches aim to restore a fixed number of frames using multip…

    Submitted 13 July, 2022; originally announced July 2022.

    Journal ref: IEEE Access (Early Access), 12 July 2022

  9. arXiv:2204.03896  [pdf, other]

    cs.CL

    Advancing Semi-Supervised Learning for Automatic Post-Editing: Data-Synthesis by Mask-Infilling with Erroneous Terms

    Authors: Wonkee Lee, Seong-Hwan Heo, Jong-Hyeok Lee

    Abstract: Semi-supervised learning that leverages synthetic data for training has been widely adopted for developing automatic post-editing (APE) models due to the lack of training data. With this aim, we focus on data-synthesis methods to create high-quality synthetic data. Given that APE takes as input a machine-translation result that might include errors, we present a data-synthesis method by which the…

    Submitted 3 June, 2024; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted to LREC-COLING 2024

  10. arXiv:2203.12940  [pdf, other]

    cs.CL cs.AI cs.LG

    mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling

    Authors: Seong-Hwan Heo, WonKee Lee, Jong-Hyeok Lee

    Abstract: Zero-shot slot filling has received considerable attention to cope with the problem of limited available data for the target domain. One of the important factors in zero-shot learning is to make the model learn generalized and reliable representations. For this purpose, we present mcBERT, which stands for momentum contrastive learning with BERT, to develop a robust zero-shot slot filling model. mc…

    Submitted 28 June, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to INTERSPEECH 2022

  11. hSDB-instrument: Instrument Localization Database for Laparoscopic and Robotic Surgeries

    Authors: Jihun Yoon, Jiwon Lee, Sunghwan Heo, Hayeong Yu, Jayeon Lim, Chi Hyun Song, SeulGi Hong, Seungbum Hong, Bokyung Park, SungHyun Park, Woo Jin Hyung, Min-Kook Choi

    Abstract: Automated surgical instrument localization is an important technology to understand the surgical process and in order to analyze them to provide meaningful guidance during surgery or surgical index after surgery to the surgeon. We introduce a new dataset that reflects the kinematic characteristics of surgical instruments for automated surgical instrument localization of surgical videos. The hSDB(h…

    Submitted 25 October, 2021; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: https://hsdb-instrument.github.io

    Journal ref: MICCAI 2021 pp 393-402

  12. arXiv:2110.12172  [pdf, other]

    cs.LG cs.DC

    Scalable Smartphone Cluster for Deep Learning

    Authors: Byunggook Na, Jaehee Jang, Seongsik Park, Seijoon Kim, Joonoo Kim, Moon Sik Jeong, Kwang Choon Kim, Seon Heo, Yoonsang Kim, Sungroh Yoon

    Abstract: Various deep learning applications on smartphones have been rapidly rising, but training deep neural networks (DNNs) has too large computational burden to be executed on a single smartphone. A portable cluster, which connects smartphones with a wireless network and supports parallel computation using them, can be a potential approach to resolve the issue. However, by our findings, the limitations…

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: 6 pages

  13. DeepTake: Prediction of Driver Takeover Behavior using Multimodal Data

    Authors: Erfan Pakdamanian, Shili Sheng, Sonia Baee, Seongkook Heo, Sarit Kraus, Lu Feng

    Abstract: Automated vehicles promise a future where drivers can engage in non-driving tasks without hands on the steering wheels for a prolonged period. Nevertheless, automated vehicles may still need to occasionally hand the control back to drivers due to technology limitations and legal requirements. While some systems determine the need for driver takeover using driver context and road condition to initi…

    Submitted 15 January, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Accepted to CHI 2021

    ACM Class: I.2.6; J.4

  14. arXiv:2011.14885  [pdf, ps, other]

    cs.SD eess.AS

    Look who's not talking

    Authors: Youngki Kwon, Hee Soo Heo, Jaesung Huh, Bong-Jin Lee, Joon Son Chung

    Abstract: The objective of this work is speaker diarisation of speech recordings 'in the wild'. The ability to determine speech segments is a crucial part of diarisation systems, accounting for a large proportion of errors. In this paper, we present a simple but effective solution for speech activity detection based on the speaker embeddings. In particular, we discover that the norm of the speaker embedding…

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: SLT 2021

  15. arXiv:2009.14153  [pdf, other]

    eess.AS cs.SD

    Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020

    Authors: Hee Soo Heo, Bong-Jin Lee, Jaesung Huh, Joon Son Chung

    Abstract: This report describes our submission to the VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020. We perform a careful analysis of speaker recognition models based on the popular ResNet architecture, and train a number of variants using a range of loss functions. Our results show significant improvements over most existing works without the use of model ensemble or post-processing.…

    Submitted 29 September, 2020; originally announced September 2020.

  16. arXiv:2007.12085  [pdf, other]

    cs.SD cs.LG eess.AS

    Augmentation adversarial training for self-supervised speaker recognition

    Authors: Jaesung Huh, Hee Soo Heo, Jingu Kang, Shinji Watanabe, Joon Son Chung

    Abstract: The goal of this work is to train robust speaker recognition models without speaker labels. Recent works on unsupervised speaker representations are based on contrastive learning in which they encourage within-utterance embeddings to be similar and across-utterance embeddings to be dissimilar. However, since the within-utterance segments share the same acoustic characteristics, it is difficult to…

    Submitted 30 October, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: Workshop on Self-Supervised Learning for Speech and Audio Processing, NeurIPS

  17. arXiv:2005.08606  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    End-to-End Lip Synchronisation Based on Pattern Classification

    Authors: You Jin Kim, Hee Soo Heo, Soo-Whan Chung, Bong-Jin Lee

    Abstract: The goal of this work is to synchronise audio and video of a talking face using deep neural network models. Existing works have trained networks on proxy tasks such as cross-modal similarity learning, and then computed similarities between audio and video frames using a sliding window approach. While these methods demonstrate satisfactory performance, the networks are not trained directly on the t…

    Submitted 19 March, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: slt 2021 accepted

  18. In defence of metric learning for speaker recognition

    Authors: Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, Icksang Han

    Abstract: The objective of this paper is 'open-set' speaker recognition of unseen speakers, where ideal embeddings should be able to condense information into a compact utterance-level representation that has small intra-speaker and large inter-speaker distance. A popular belief in speaker recognition is that networks trained with classification objectives outperform metric learning methods. In this paper…

    Submitted 24 April, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: The code can be found at https://github.com/clovaai/voxceleb_trainer

  19. You Watch, You Give, and You Engage: A Study of Live Streaming Practices in China

    Authors: Zhicong Lu, Haijun Xia, Seongkook Heo, Daniel Wigdor

    Abstract: Despite gaining traction in North America, live streaming has not reached the popularity it has in China, where livestreaming has a tremendous impact on the social behaviors of users. To better understand this socio-technological phenomenon, we conducted a mixed methods study of live streaming practices in China. We present the results of an online survey of 527 live streaming users, focusing on t…

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: Published at ACM CHI Conference on Human Factors in Computing Systems (CHI 2018). Please cite the CHI version

    ACM Class: H.5.m

    Journal ref: Zhicong Lu, Haijun Xia, Seongkook Heo, and Daniel Wigdor. 2018. You Watch, You Give, and You Engage: A Study of Live Streaming Practices in China. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18)