Skip to main content

Showing 1–12 of 12 results for author: Cha, S K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.01548  [pdf, other

    cs.LG cs.AI

    INGREX: An Interactive Explanation Framework for Graph Neural Networks

    Authors: Tien-Cuong Bui, Van-Duc Le, Wen-Syan Li, Sang Kyun Cha

    Abstract: Graph Neural Networks (GNNs) are widely used in many modern applications, necessitating explanations for their decisions. However, the complexity of GNNs makes it difficult to explain predictions. Even though several methods have been proposed lately, they can only provide simple and static explanations, which are difficult for users to understand in many scenarios. Therefore, we introduce INGREX,… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 4 pages, 5 figures, This paper is under review for IEEE ICDE 2023

  2. arXiv:2210.11094  [pdf, other

    cs.LG cs.AI

    Toward Multiple Specialty Learners for Explaining GNNs via Online Knowledge Distillation

    Authors: Tien-Cuong Bui, Van-Duc Le, Wen-syan Li, Sang Kyun Cha

    Abstract: Graph Neural Networks (GNNs) have become increasingly ubiquitous in numerous applications and systems, necessitating explanations of their predictions, especially when making critical decisions. However, explaining GNNs is challenging due to the complexity of graph data and model execution. Despite additional computational costs, post-hoc explanation approaches have been widely adopted due to the… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 13 pages, 11 figures, A preliminary paper under review of IEEE ICDE 2023

  3. arXiv:2011.14344  [pdf, other

    cs.CL

    Generative Pre-training for Paraphrase Generation by Representing and Predicting Spans in Exemplars

    Authors: Tien-Cuong Bui, Van-Duc Le, Hai-Thien To, Sang Kyun Cha

    Abstract: Paraphrase generation is a long-standing problem and serves an essential role in many natural language processing problems. Despite some encouraging results, recent methods either confront the problem of favoring generic utterance or need to retrain the model from scratch for each new dataset. This paper presents a novel approach to paraphrasing sentences, extended from the GPT-2 model. We develop… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

    Comments: 8 pages, 4 figures, Accepted to IEEE International Conference on Big Data and Smart Computing 2021

  4. Revisiting Binary Code Similarity Analysis using Interpretable Feature Engineering and Lessons Learned

    Authors: Dongkwan Kim, Eunsoo Kim, Sang Kil Cha, Sooel Son, Yongdae Kim

    Abstract: Binary code similarity analysis (BCSA) is widely used for diverse security applications, including plagiarism detection, software license violation detection, and vulnerability discovery. Despite the surging research interest in BCSA, it is significantly challenging to perform new research in this field for several reasons. First, most existing approaches focus only on the end results, namely, inc… ▽ More

    Submitted 6 July, 2022; v1 submitted 21 November, 2020; originally announced November 2020.

    Comments: 23 pages, accepted to IEEE Transactions on Software Engineering (June 2022)

  5. arXiv:2009.11543  [pdf, other

    cs.DB

    Compressed Key Sort and Fast Index Reconstruction

    Authors: Yongsik Kwon, Cheol Ryu, Sang Kyun Cha, Arthur H. Lee, Kunsoo Park, Bongki Moon

    Abstract: In this paper we propose an index key compression scheme based on the notion of distinction bits by proving that the distinction bits of index keys are sufficient information to determine the sorted order of the index keys correctly. While the actual compression ratio may vary depending on the characteristics of datasets (an average of 2.76 to one compression ratio was observed in our experiments)… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 26 pages and 13 figures

  6. arXiv:2007.03169  [pdf, other

    cs.CV

    Spatial Semantic Embedding Network: Fast 3D Instance Segmentation with Deep Metric Learning

    Authors: Dongsu Zhang, Junha Chun, Sang Kyun Cha, Young Min Kim

    Abstract: We propose spatial semantic embedding network (SSEN), a simple, yet efficient algorithm for 3D instance segmentation using deep metric learning. The raw 3D reconstruction of an indoor environment suffers from occlusions, noise, and is produced without any meaningful distinction between individual entities. For high-level intelligent tasks from a large scale scene, 3D instance segmentation recogniz… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  7. arXiv:2001.04107  [pdf, other

    cs.CR cs.LG

    Montage: A Neural Network Language Model-Guided JavaScript Engine Fuzzer

    Authors: Suyoung Lee, HyungSeok Han, Sang Kil Cha, Sooel Son

    Abstract: JavaScript (JS) engine vulnerabilities pose significant security threats affecting billions of web browsers. While fuzzing is a prevalent technique for finding such vulnerabilities, there have been few studies that leverage the recent advances in neural network language models (NNLMs). In this paper, we present Montage, the first NNLM-guided fuzzer for finding JS engine vulnerabilities. The key as… ▽ More

    Submitted 14 January, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 18 pages, accepted at USENIX Security '20

  8. arXiv:1912.00649  [pdf, other

    cs.MM cs.CV eess.AS

    An Attention-Based Speaker Naming Method for Online Adaptation in Non-Fixed Scenarios

    Authors: Jungwoo Pyo, Joohyun Lee, Youngjune Park, Tien-Cuong Bui, Sang Kyun Cha

    Abstract: A speaker naming task, which finds and identifies the active speaker in a certain movie or drama scene, is crucial for dealing with high-level video analysis applications such as automatic subtitle labeling and video summarization. Modern approaches have usually exploited biometric features with a gradient-based method instead of rule-based algorithms. In a certain situation, however, a naive grad… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: AAAI 2020 Workshop on Interactive and Conversational Recommendation Systems(WICRS)

  9. arXiv:1911.12919  [pdf, other

    cs.LG eess.SP stat.ML

    Spatiotemporal deep learning model for citywide air pollution interpolation and prediction

    Authors: Van-Duc Le, Tien-Cuong Bui, Sang Kyun Cha

    Abstract: Recently, air pollution is one of the most concerns for big cities. Predicting air quality for any regions and at any time is a critical requirement of urban citizens. However, air pollution prediction for the whole city is a challenging problem. The reason is, there are many spatiotemporal factors affecting air pollution throughout the city. Collecting as many of them could help us to forecast ai… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted at BigComp2020

  10. arXiv:1907.00957  [pdf

    physics.app-ph cs.ET

    Magnetic skyrmion artificial synapse for neuromorphic computing

    Authors: Kyung Mee Song, Jae-Seung Jeong, Biao Pan, Xichao Zhang, **g Xia, Sun Kyung Cha, Tae-Eon Park, Kwangsu Kim, Simone Finizio, Joerg Raabe, Joonyeon Chang, Yan Zhou, Weisheng Zhao, Wang Kang, Hyunsu Ju, Seonghoon Woo

    Abstract: Since the experimental discovery of magnetic skyrmions achieved one decade ago, there have been significant efforts to bring the virtual particles into all-electrical fully functional devices, inspired by their fascinating physical and topological properties suitable for future low-power electronics. Here, we experimentally demonstrate such a device: electrically-operating skyrmion-based artificia… ▽ More

    Submitted 30 September, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: 11 pages, 4 figures

    Journal ref: Nature Electronics 3, 148 (2020)

  11. arXiv:1812.00140  [pdf, ps, other

    cs.CR cs.SE

    The Art, Science, and Engineering of Fuzzing: A Survey

    Authors: Valentin J. M. Manes, HyungSeok Han, Choongwoo Han, Sang Kil Cha, Manuel Egele, Edward J. Schwartz, Maverick Woo

    Abstract: Among the many software vulnerability discovery techniques available today, fuzzing has remained highly popular due to its conceptual simplicity, its low barrier to deployment, and its vast amount of empirical evidence in discovering real-world software vulnerabilities. At a high level, fuzzing refers to a process of repeatedly running a program with generated inputs that may be syntactically or s… ▽ More

    Submitted 7 April, 2019; v1 submitted 30 November, 2018; originally announced December 2018.

    Comments: 29 pages, under submission to ACM Computing Surveys (July 2018) - 2018.12.10 update: correct minor mistakes in overview table - 2019.02.16 update: source clean - 2019.04.08: submission to TSE, 21 pages

  12. arXiv:1805.00432  [pdf

    cs.CY cs.IR cs.LG

    Real-time Air Pollution prediction model based on Spatiotemporal Big data

    Authors: V. Duc Le, Sang Kyun Cha

    Abstract: Air pollution is one of the most concerns for urban areas. Many countries have constructed monitoring stations to hourly collect pollution values. Recently, there is a research in Daegu city, Korea for real-time air quality monitoring via sensors installed on taxis running across the whole city. The collected data is huge (1-second interval) and in both Spatial and Temporal format. In this paper,… ▽ More

    Submitted 9 August, 2018; v1 submitted 5 April, 2018; originally announced May 2018.

    Comments: 6 pages

    Journal ref: The International Conference on Big data, IoT, and Cloud Computing (BIC 2018)