Showing 1–50 of 73 results for author: Jo, S

  1. arXiv:2407.00573  [pdf, other]

    cs.DS cs.DB

    A Simple Representation of Tree Covering Utilizing Balanced Parentheses and Efficient Implementation of Average-Case Optimal RMQs

    Authors: Kou Hamada, Sankardeep Chakraborty, Seungbum Jo, Takuto Koriyama, Kunihiko Sadakane, Srinivasa Rao Satti

    Abstract: Tree covering is a technique for decomposing a tree into smaller-sized trees with desirable properties, and has been employed in various succinct data structures. However, significant hurdles stand in the way of a practical implementation of tree covering: a lot of pointers are used to maintain the tree-covering hierarchy and many indices for tree navigational queries consume theoretically negligi… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: To appear in ESA 2024

  2. arXiv:2406.06134  [pdf, other]

    cs.CV cs.AI cs.LG

    DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

    Authors: Donggeun Ko, Sangwoo Jo, Dongjun Lee, Namjun Park, Jaekwang Kim

    Abstract: Dataset bias is a significant challenge in machine learning, where specific attributes, such as texture or color of the images are unintentionally learned resulting in detrimental performance. To address this, previous efforts have focused on debiasing models either by developing novel debiasing algorithms or by generating synthetic data to mitigate the prevalent dataset biases. However, generativ… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 10 pages (including supplementary), 3 figures, SynData4CV@CVPR 24 (Workshop)

  3. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  4. arXiv:2405.20649  [pdf, other]

    cs.CL cs.LG

    Reward-based Input Construction for Cross-document Relation Extraction

    Authors: Byeonghu Na, Suhyeon Jo, Yeongmin Kim, Il-Chul Moon

    Abstract: Relation extraction (RE) is a fundamental task in natural language processing, aiming to identify relations between target entities in text. While many RE methods are designed for a single sentence or document, cross-document RE has emerged to address relations across multiple long documents. Given the nature of long documents in cross-document RE, extracting document embeddings is challenging due… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 main conference

  5. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  6. arXiv:2404.00384  [pdf, other]

    cs.CV

    TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias

    Authors: Sanghyun Jo, Soohyun Ryu, Sungyub Kim, Eunho Yang, Kyungsu Kim

    Abstract: We identify a critical bias in contemporary CLIP-based models, which we denote as single tag bias. This bias manifests as a disproportionate focus on a singular tag (word) while neglecting other pertinent tags, stemming from CLIP's text embeddings that prioritize one specific tag in image-text relationships. When deconstructing text into individual tags, only one tag tends to have high relevancy w… ▽ More

    Submitted 20 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  7. arXiv:2404.00380  [pdf, other]

    cs.CV

    DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation

    Authors: Sanghyun Jo, Fei Pan, In-Jae Yu, Kyungsu Kim

    Abstract: Weakly-supervised semantic segmentation (WSS) ensures high-quality segmentation with limited data and excels when employed as input seed masks for large-scale vision models such as Segment Anything. However, WSS faces challenges related to minor classes since those are overlooked in images with adjacent multiple classes, a limitation originating from the overfitting of traditional expansion method… ▽ More

    Submitted 19 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  8. arXiv:2403.13835  [pdf, other]

    cs.LG cs.AI cs.CL cs.DB

    SMART: Automatically Scaling Down Language Models with Accuracy Guarantees for Reduced Processing Fees

    Authors: Saehan Jo, Immanuel Trummer

    Abstract: The advancement of Large Language Models (LLMs) has significantly boosted performance in natural language processing (NLP) tasks. However, the deployment of high-performance LLMs incurs substantial costs, primarily due to the increased number of parameters aimed at enhancing model performance. This has made the use of state-of-the-art LLMs more expensive for end-users. AI service providers, such a… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  9. arXiv:2403.09024  [pdf, other]

    cs.CL cs.AI

    Semiparametric Token-Sequence Co-Supervision

    Authors: Hyunji Lee, Doyoung Kim, Jihoon Jun, Sejune Joo, Joel Jang, Kyoung-Woon On, Minjoon Seo

    Abstract: In this work, we introduce a semiparametric token-sequence co-supervision training method. It trains a language model by simultaneously leveraging supervision from the traditional next token prediction loss which is calculated over the parametric token embedding space and the next sequence prediction loss which is calculated over the nonparametric sequence embedding space. The nonparametric sequen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  10. arXiv:2402.15162  [pdf, other]

    cs.CL cs.AI cs.LG

    Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models

    Authors: Jongyoon Song, Nohil Park, Bongkyu Hwang, Jaewoong Yun, Seongho Joe, Youngjune L. Gwon, Sungroh Yoon

    Abstract: Abstractive summarization models often generate factually inconsistent content particularly when the parametric knowledge of the model conflicts with the knowledge in the input document. In this paper, we analyze the robustness of fine-tuning based summarization models to the knowledge conflict, which we call factual adaptiveness. We utilize pre-trained language models to construct evaluation sets… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  11. arXiv:2402.09450  [pdf, other]

    eess.SP cs.AI cs.LG

    Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram

    Authors: Yeongyeon Na, Minje Park, Yunwon Tae, Sunghoon Joo

    Abstract: Electrocardiograms (ECG) are widely employed as a diagnostic tool for monitoring electrical signals originating from a heart. Recent machine learning research efforts have focused on the application of screening various diseases using ECG signals. However, adapting to the application of screening disease is challenging in that labeled ECG data are limited. Achieving general representation through… ▽ More

    Submitted 19 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. The first three authors contribute equally

  12. arXiv:2402.08359  [pdf, other]

    cs.CV

    Learning to Produce Semi-dense Correspondences for Visual Localization

    Authors: Khang Truong Giang, Soohwan Song, Sungho Jo

    Abstract: This study addresses the challenge of performing visual localization in demanding conditions such as night-time scenarios, adverse weather, and seasonal changes. While many prior studies have focused on improving image-matching performance to facilitate reliable dense keypoint matching between images, existing methods often heavily rely on predefined feature points on a reconstructed 3D model. Con… ▽ More

    Submitted 20 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted at CVPR 2024

  13. arXiv:2311.09069  [pdf, other]

    cs.CL cs.AI

    How Well Do Large Language Models Truly Ground?

    Authors: Hyunji Lee, Sejune Joo, Chaeeun Kim, Joel Jang, Doyoung Kim, Kyoung-Woon On, Minjoon Seo

    Abstract: To reduce issues like hallucinations and lack of control in Large Language Models (LLMs), a common method is to generate responses by grounding on external contexts given as input, known as knowledge-augmented models. However, previous research often narrowly defines "grounding" as just having the correct answer, which does not ensure the reliability of the entire response. To overcome this, we pr… ▽ More

    Submitted 29 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: published at NAACL 2024

  14. arXiv:2311.02839  [pdf, ps, other]

    cs.DS

    Cell-Probe Lower Bound for Accessible Interval Graphs

    Authors: Sankardeep Chakraborty, Christian Engels, Seungbum Jo, Mingmou Liu

    Abstract: We spot a hole in the area of succinct data structures for graph classes from a universe of size at most $n^n$. Very often, the input graph is labeled by the user in an arbitrary and easy-to-use way, and the data structure for the graph relabels the input graph in some way. For any access, the user needs to store these labels or compute the new labels in an online manner. This might require more b… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    MSC Class: 68P05; 68P30; 68Q17 ACM Class: E.1; F.1.3; F.2.3

  15. arXiv:2311.02427  [pdf, other]

    cs.DS

    Succinct Data Structure for Graphs with $d$-Dimensional $t$-Representation

    Authors: Girish Balakrishnan, Sankardeep Chakraborty, Seungbum Jo, N S Narayanaswamy, Kunihiko Sadakane

    Abstract: Erdős and West (Discrete Mathematics'85) considered the class of $n$ vertex intersection graphs which have a {\em $d$-dimensional} {\em $t$-representation}, that is, each vertex of a graph in the class has an associated set consisting of at most $t$ $d$-dimensional axis-parallel boxes. In particular, for a graph $G$ and for each $d \geq 1$, they consider $i_d(G)$ to be the minimum $t$ for which… ▽ More

    Submitted 6 February, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 21 pages, 5 figures

  16. arXiv:2310.14663  [pdf, other]

    eess.AS cs.CL

    DPP-TTS: Diversifying prosodic features of speech via determinantal point processes

    Authors: Seongho Joo, Hyukhun Koh, Kyomin Jung

    Abstract: With the rapid advancement in deep generative models, recent neural Text-To-Speech (TTS) models have succeeded in synthesizing human-like speech. There have been some efforts to generate speech with various prosody beyond monotonous prosody patterns. However, previous works have several limitations. First, typical TTS models depend on the scaled sampling temperature for boosting the diversity of pr… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  17. arXiv:2308.03251  [pdf, ps, other]

    eess.SP cs.IT

    Joint Precoding and Fronthaul Compression for Cell-Free MIMO Downlink With Radio Stripes

    Authors: Sangwon Jo, Hoon Lee, Seok-Hwan Park

    Abstract: A sequential fronthaul network, referred to as radio stripes, is a promising fronthaul topology of cell-free MIMO systems. In this setup, a single cable suffices to connect access points (APs) to a central processor (CP). Thus, radio stripes are more effective than conventional star fronthaul topology which requires dedicated cables for each of APs. Most of works on radio stripes focused on the up… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: To be presented at IEEE Globecom 2023, Kuala Lumpur, Malaysia, Dec. 2023

  18. arXiv:2307.05916  [pdf, other]

    cs.CV

    SwiFT: Swin 4D fMRI Transformer

    Authors: Peter Yongho Kim, Junbeom Kwon, Sunghwan Joo, Sangyoon Bae, Donggyu Lee, Yoonho Jung, Shinjae Yoo, Jiook Cha, Taesup Moon

    Abstract: Modeling spatiotemporal brain dynamics from high-dimensional data, such as functional Magnetic Resonance Imaging (fMRI), is a formidable task in neuroscience. Existing approaches for fMRI analysis utilize hand-crafted features, but the process of feature extraction risks losing essential information in fMRI scans. To address this challenge, we present SwiFT (Swin 4D fMRI Transformer), a Swin Trans… ▽ More

    Submitted 31 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  19. arXiv:2307.00485  [pdf, other]

    cs.CV

    TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching

    Authors: Khang Truong Giang, Soohwan Song, Sungho Jo

    Abstract: This study tackles the challenge of image matching in difficult scenarios, such as scenes with significant variations or limited texture, with a strong emphasis on computational efficiency. Previous studies have attempted to address this challenge by encoding global scene contexts using Transformers. However, these approaches suffer from high computational costs and may not capture sufficient high… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Paper extension of TopicFM (arXiv:2207.00328)

  20. arXiv:2306.00227  [pdf]

    cs.CY

    From Human-Centered to Social-Centered Artificial Intelligence: Assessing ChatGPT's Impact through Disruptive Events

    Authors: Skyler Wang, Ned Cooper, Margaret Eby, Eun Seo Jo

    Abstract: Large language models (LLMs) and dialogue agents have existed for years, but the release of recent GPT models has been a watershed moment for artificial intelligence (AI) research and society at large. Immediately recognized for its generative capabilities and versatility, ChatGPT's impressive proficiency across technical and creative domains led to its widespread adoption. While society grapples… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  21. arXiv:2305.14045  [pdf, other]

    cs.CL cs.AI cs.LG

    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

    Authors: Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo

    Abstract: Language models (LMs) with less than 100B parameters are known to perform poorly on chain-of-thought (CoT) reasoning in contrast to large LMs when solving unseen tasks. In this work, we aim to equip smaller LMs with the step-by-step reasoning capability by instruction tuning with CoT rationales. In order to achieve this goal, we first introduce a new instruction-tuning dataset called the CoT Colle… ▽ More

    Submitted 14 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (Main Conference)

  22. arXiv:2305.13046  [pdf, other]

    cs.CV cs.AI cs.LG

    POEM: Polarization of Embeddings for Domain-Invariant Representations

    Authors: Sang-Yeong Jo, Sung Whan Yoon

    Abstract: Handling out-of-distribution samples is a long-lasting challenge for deep visual models. In particular, domain generalization (DG) is one of the most relevant tasks that aims to train a model with a generalization capability on novel domains. Most existing DG approaches share the same philosophy to minimize the discrepancy between domains by finding the domain-invariant representations. On the con… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI) 2023, Washington D.C. USA

  23. arXiv:2304.11095  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.IR

    Is Cross-modal Information Retrieval Possible without Training?

    Authors: Hyunjin Choi, Hyunjae Lee, Seongho Joe, Youngjune L. Gwon

    Abstract: Encoded representations from a pretrained deep learning model (e.g., BERT text embeddings, penultimate CNN layer activations of an image) convey a rich set of features beneficial for information retrieval. Embeddings for a particular modality of data occupy a high-dimensional space of its own, but it can be semantically aligned to another by a simple mapping without training a deep neural net. In… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Journal ref: Advances in Information Retrieval: 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, Proceedings, Part II

  24. arXiv:2304.09913  [pdf, other]

    cs.CV cs.AI

    MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation

    Authors: Sanghyun Jo, In-Jae Yu, Kyungsu Kim

    Abstract: Weakly-supervised semantic segmentation aims to reduce labeling costs by training semantic segmentation models using weak supervision, such as image-level class labels. However, most approaches struggle to produce accurate localization maps and suffer from false predictions in class-related backgrounds (i.e., biased objects), such as detecting a railroad with the train class. Recent methods that r… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  25. Shuffle & Divide: Contrastive Learning for Long Text

    Authors: Joonseok Lee, Seongho Joe, Kyoungwon Park, Bogun Kim, Hoyoung Kang, Jaeseon Park, Youngjune Gwon

    Abstract: We propose a self-supervised learning method for long text documents based on contrastive learning. A key to our method is Shuffle and Divide (SaD), a simple text augmentation algorithm that sets up a pretext task required for contrastive updates to BERT-based document embedding. SaD splits a document into two sub-documents containing randomly shuffled words in the entire documents. The sub-docume… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted at ICPR 2022

    Journal ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 2935-2941

  26. ContraCluster: Learning to Classify without Labels by Contrastive Self-Supervision and Prototype-Based Semi-Supervision

    Authors: Seongho Joe, Byoungjip Kim, Hoyoung Kang, Kyoungwon Park, Bogun Kim, Jaeseon Park, Joonseok Lee, Youngjune Gwon

    Abstract: The recent advances in representation learning inspire us to take on the challenging problem of unsupervised image classification tasks in a principled way. We propose ContraCluster, an unsupervised image classification method that combines clustering with the power of contrastive self-supervised learning. ContraCluster consists of three stages: (1) contrastive self-supervised pre-training (CPT),… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted at ICPR 2022

    Journal ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 4685-4692

  27. arXiv:2303.08329  [pdf, other]

    cs.SD cs.CL eess.AS

    Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

    Authors: Suhee Jo, Younggun Lee, Yookyung Shin, Yeongtae Hwang, Taesu Kim

    Abstract: In recent years, emotional text-to-speech has shown considerable progress. However, it requires a large amount of labeled data, which is not easily accessible. Even if it is possible to acquire an emotional speech dataset, there is still a limitation in controlling emotion intensity. In this work, we propose a novel method for cross-speaker emotion transfer and manipulation using vector arithmetic… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: accepted to ICASSP 2023

  28. arXiv:2303.03628  [pdf, other]

    cs.CL cs.LG

    CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification

    Authors: Seungone Kim, Se June Joo, Yul Jang, Hyungjoo Chae, Jinyoung Yeo

    Abstract: Chain-of-thought (CoT) prompting enables large language models (LLMs) to solve complex reasoning tasks by generating an explanation before the final prediction. Despite its promising ability, a critical downside of CoT prompting is that the performance is greatly affected by the factuality of the generated explanation. To improve the correctness of the explanations, fine-tuning language models wi… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at EACL 2023 Demo

  29. arXiv:2302.00319  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Development of deep biological ages aware of morbidity and mortality based on unsupervised and semi-supervised deep learning approaches

    Authors: Seong-Eun Moon, Ji Won Yoon, Shinyoung Joo, Yoohyung Kim, Jae Hyun Bae, Seokho Yoon, Haanju Yoo, Young Min Cho

    Abstract: Background: While deep learning technology, which has the capability of obtaining latent representations based on large-scale data, can be a potential solution for the discovery of a novel aging biomarker, existing deep learning methods for biological age estimation usually depend on chronological ages and lack of consideration of mortality and morbidity that are the most significant outcomes of a… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  30. arXiv:2211.15900  [pdf, other]

    cs.CV

    Towards More Robust Interpretation via Local Gradient Alignment

    Authors: Sunghwan Joo, Seokhyeon Jeong, Juyeon Heo, Adrian Weller, Taesup Moon

    Abstract: Neural network interpretation methods, particularly feature attribution methods, are known to be fragile with respect to adversarial input perturbations. To address this, several methods for enhancing the local smoothness of the gradient while training have been proposed for attaining \textit{robust} feature attributions. However, the lack of considering the normalization of the attributions, whic… ▽ More

    Submitted 7 December, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 22 pages (9 pages in paper, 13 pages in Appendix), 9 figures, 6 tables Accepted in AAAI 23 (Association for the Advancement of Artificial Intelligence)

  31. arXiv:2209.11055  [pdf, other]

    cs.CL

    Efficient Few-Shot Learning Without Prompts

    Authors: Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

    Abstract: Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high variability from manually crafted prompts, and typically require billion-parameter language models to achieve high accuracy. To address these shortcomings, we pr… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  32. arXiv:2209.00930  [pdf, other]

    cs.CL cs.AI cs.LG

    Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization

    Authors: Seungone Kim, Se June Joo, Hyungjoo Chae, Chaehyeong Kim, Seung-won Hwang, Jinyoung Yeo

    Abstract: In this paper, we propose to leverage the unique characteristics of dialogues sharing commonsense knowledge across participants, to resolve the difficulties in summarizing them. We present SICK, a framework that uses commonsense inferences as additional context. Compared to previous work that solely relies on the input dialogue, SICK uses an external knowledge model to generate a rich set of commo… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  33. Enhancing Semantic Understanding with Self-supervised Methods for Abstractive Dialogue Summarization

    Authors: Hyunjae Lee, Jaewoong Yun, Hyunjin Choi, Seongho Joe, Youngjune L. Gwon

    Abstract: Contextualized word embeddings can lead to state-of-the-art performances in natural language understanding. Recently, a pre-trained deep contextualized text encoder such as BERT has shown its potential in improving natural language tasks including abstractive summarization. Existing approaches in dialogue summarization focus on incorporating a large language model into summarization task trained o… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 5 pages, 3 figures, INTERSPEECH 2021

    Journal ref: Proc. Interspeech 2021, 796-800 (2021)

  34. arXiv:2209.00158  [pdf, other]

    cs.DS

    Space-efficient data structure for next/previous larger/smaller value queries

    Authors: Seungbum Jo, Geunho Kim

    Abstract: Given an array of size $n$ from a total order, we consider the problem of constructing a data structure that supports various queries (range minimum/maximum queries with their variants and next/previous larger/smaller queries) efficiently. In the encoding model (i.e., the queries can be answered without the input array), we propose a $(3.701n + o(n))$-bit data structure, which supports all these q… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

  35. arXiv:2207.06000  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS

    Authors: Yookyung Shin, Younggun Lee, Suhee Jo, Yeongtae Hwang, Taesu Kim

    Abstract: Expressive text-to-speech has shown improved performance in recent years. However, the style control of synthetic speech is often restricted to discrete emotion categories and requires training data recorded by the target speaker in the target style. In many practical situations, users may not have reference speech recorded in target emotion but still be interested in controlling speech style just… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022

  36. TopicFM: Robust and Interpretable Topic-Assisted Feature Matching

    Authors: Khang Truong Giang, Soohwan Song, Sungho Jo

    Abstract: This study addresses an image-matching problem in challenging cases, such as large scene variations or textureless scenes. To gain robustness to such situations, most previous studies have attempted to encode the global contexts of a scene via graph neural networks or transformers. However, these contexts do not explicitly represent high-level contextual information, such as structural shapes or s… ▽ More

    Submitted 29 November, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted at AAAI-23. This version includes both main text and supplementary materials

  37. arXiv:2205.09185  [pdf, other]

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann, et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  38. arXiv:2204.06754  [pdf, other]

    cs.CV cs.AI

    RecurSeed and EdgePredictMix: Pseudo-Label Refinement Learning for Weakly Supervised Semantic Segmentation across Single- and Multi-Stage Frameworks

    Authors: Sanghyun Jo, In-Jae Yu, Kyungsu Kim

    Abstract: Although weakly supervised semantic segmentation using only image-level labels (WSSS-IL) is potentially useful, its low performance and implementation complexity still limit its application. The main causes are (a) non-detection and (b) false-detection phenomena: (a) The class activation maps refined from existing WSSS-IL methods still only represent partial regions for large-scale objects, and (b… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

  39. arXiv:2202.10967  [pdf, other]

    cs.CL

    Learning Cluster Patterns for Abstractive Summarization

    Authors: Sung-Guk Jo, Jeong-Jae Kim, Byung-Won On

    Abstract: Nowadays, pre-trained sequence-to-sequence models such as BERTSUM and BART have shown state-of-the-art results in abstractive summarization. In these models, during fine-tuning, the encoder transforms sentences to context vectors in the latent space and the decoder learns the summary generation task based on the context vectors. In our approach, we consider two clusters of salient and non-salient… ▽ More

    Submitted 22 February, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 11 pages, 4 figures, 4 tables

  40. arXiv:2202.07862  [pdf]

    cs.DL physics.soc-ph

    See further upon the giants: Quantifying intellectual lineage in science

    Authors: Woo Seong Jo, Lu Liu, Dashun Wang

    Abstract: Newton's centuries-old wisdom of standing on the shoulders of giants raises a crucial yet underexplored question: Out of all the prior works cited by a discovery, which one is its giant? Here, we develop a novel, discipline-independent method to identify the giant for any individual paper, allowing us to systematically examine the role and characteristics of giants in science. We find that across… ▽ More

    Submitted 14 March, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  41. arXiv:2112.05999  [pdf, other]

    cs.CV cs.AI cs.GR

    Curvature-guided dynamic scale networks for Multi-view Stereo

    Authors: Khang Truong Giang, Soohwan Song, Sungho Jo

    Abstract: Multi-view stereo (MVS) is a crucial task for precise 3D reconstruction. Most recent studies tried to improve the performance of matching cost volume in MVS by designing aggregated 3D cost volumes and their regularization. This paper focuses on learning a robust feature extraction network to enhance the performance of matching costs without heavy computation in the other steps. In particular, we p… ▽ More

    Submitted 9 March, 2022; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: Accepted to ICLR 2022

  42. Hierarchical Text Classification As Sub-Hierarchy Sequence Generation

    Authors: SangHun Im, Gibaeg Kim, Heung-Seon Oh, Seongung Jo, Donghwan Kim

    Abstract: Hierarchical text classification (HTC) is essential for various real applications. However, HTC models are challenging to develop because they often require processing a large volume of documents and labels with hierarchical taxonomy. Recent HTC models based on deep learning have attempted to incorporate hierarchy information into a model structure. Consequently, these models are challenging to im…

    Submitted 6 November, 2023; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: 9 pages, 5 figures, Published at AAAI23

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 12933-12941 (2023)

  43. arXiv:2109.00911  [pdf, other

    cs.CV cs.LG

    BiHPF: Bilateral High-Pass Filters for Robust Deepfake Detection

    Authors: Yonghyun Jeong, Doyeon Kim, Seungjai Min, Seongho Joe, Youngjune Gwon, Jongwon Choi

    Abstract: The advancement in numerous generative models has a two-fold effect: a simple and easy generation of realistic synthesized images, but also an increased risk of malicious abuse of those images. Thus, it is important to develop a generalized detector for synthesized images of any GAN model or object category, including those unseen during the training phase. However, the conventional methods heavil…

    Submitted 16 August, 2021; originally announced September 2021.

  44. arXiv:2108.10776  [pdf, other

    cs.DS

    Succinct Data Structures for Series-Parallel, Block-Cactus and 3-Leaf Power Graphs

    Authors: Sankardeep Chakraborty, Seungbum Jo, Kunihiko Sadakane, Srinivasa Rao Satti

    Abstract: We design succinct encodings of series-parallel, block-cactus and 3-leaf power graphs while supporting the basic navigational queries such as degree, adjacency and neighborhood optimally in the RAM model with logarithmic word size. One salient feature of our representation is that it can achieve optimal space even though the exact space lower bound for these graph classes is not…

    Submitted 26 August, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

  45. arXiv:2101.11363  [pdf, other

    cs.CL cs.LG

    KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding

    Authors: Hyunjae Lee, Jaewoong Yoon, Bonggyu Hwang, Seongho Joe, Seungjai Min, Youngjune Gwon

    Abstract: A Lite BERT (ALBERT) has been introduced to scale up deep bidirectional representation learning for natural languages. Due to the lack of pretrained ALBERT models for the Korean language, the best available practice is to use the multilingual model or to fall back on any other BERT-based model. In this paper, we develop and pretrain KoreALBERT, a monolingual ALBERT model specifically for Korean languag…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 7 pages, 1 figure, to be published in 25th International Conference on Pattern Recognition, ICPR 2020

  46. Puzzle-CAM: Improved localization via matching partial and full features

    Authors: Sanghyun Jo, In-Jae Yu

    Abstract: Weakly-supervised semantic segmentation (WSSS) is introduced to narrow the gap for semantic segmentation performance from pixel-level supervision to image-level supervision. Most advanced approaches are based on class activation maps (CAMs) to generate pseudo-labels to train the segmentation network. The main limitation of WSSS is that the process of generating pseudo-labels from CAMs that use an…

    Submitted 23 September, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted to ICIP 2021

  47. arXiv:2101.10649  [pdf, other

    cs.CL cs.AI

    Analyzing Zero-shot Cross-lingual Transfer in Supervised NLP Tasks

    Authors: Hyunjin Choi, Judong Kim, Seongho Joe, Seungjai Min, Youngjune Gwon

    Abstract: In zero-shot cross-lingual transfer, a supervised NLP task trained on a corpus in one language is directly applicable to another language without any additional training. A source of cross-lingual transfer can be as straightforward as lexical overlap between languages (e.g., use of the same scripts, shared subwords) that naturally forces text embeddings to occupy a similar representation space. Re…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 6 pages, 4 figures, to be published in 25th International Conference on Pattern Recognition, ICPR 2020

  48. arXiv:2101.10642  [pdf, other

    cs.CL cs.AI

    Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks

    Authors: Hyunjin Choi, Judong Kim, Seongho Joe, Youngjune Gwon

    Abstract: Contextualized representations from a pre-trained language model are central to achieving high performance on downstream NLP tasks. The pre-trained BERT and A Lite BERT (ALBERT) models can be fine-tuned to give state-of-the-art results in sentence-pair regressions such as semantic textual similarity (STS) and natural language inference (NLI). Although BERT-based models yield the [CLS] token vector a…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 6 pages, 2 figures, to be published in 25th International Conference on Pattern Recognition, ICPR 2020

  49. arXiv:2101.06480  [pdf, other

    cs.LG cs.CV

    SelfMatch: Combining Contrastive Self-Supervision and Consistency for Semi-Supervised Learning

    Authors: Byoungjip Kim, Jinho Choo, Yeong-Dae Kwon, Seongho Joe, Seungjai Min, Youngjune Gwon

    Abstract: This paper introduces SelfMatch, a semi-supervised learning method that combines the power of contrastive self-supervised learning and consistency regularization. SelfMatch consists of two stages: (1) self-supervised pre-training based on contrastive learning and (2) semi-supervised fine-tuning based on augmentation consistency regularization. We empirically demonstrate that SelfMatch achieves the…

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: 4 pages, NeurIPS 2020 Workshop: Self-Supervised Learning - Theory and Practice

  50. arXiv:2011.11855  [pdf

    cs.RO cs.CL

    A Robotic Dating Coaching System Leveraging Online Communities Posts

    Authors: Sihyeon Jo, Donghwi Jung, Keonwoo Kim, Eun Gyo Joung, Giulia Nespoli, Seungryong Yoo, Minseob So, Seung-Woo Seo, Seong-Woo Kim

    Abstract: Can a robot be a personal dating coach? Even with the increasing amount of conversational data on the internet, the implementation of conversational robots remains a challenge. In particular, a detailed and professional counseling log is expensive and not publicly accessible. In this paper, we develop a robot dating coaching system leveraging corpus from online communities. We examine people's per…

    Submitted 23 November, 2020; originally announced November 2020.