Showing 1–50 of 213 results for author: An, S

  1. arXiv:2407.03014  [pdf]

    physics.optics physics.app-ph quant-ph

    Dielectric Fano Nanoantennas for Enabling Sub-Nanosecond Lifetimes in NV-based Single Photon Emitters

    Authors: Shu An, Dmitry Kalashnikov, Wenqiao Shi, Zackaria Mahfoud, Ah Bian Chew, Yan Liu, **g Wu, Di Zhu, Weibo Gao, Cheng-Wei Qiu, Victor Leong, Zhaogang Dong

    Abstract: Solid-state quantum emitters are essential sources of single photons, and enhancing their emission rates is of paramount importance for applications in quantum communications, computing, and metrology. One approach is to couple quantum emitters with resonant photonic nanostructures, where the emission rate is enhanced due to the Purcell effect. Dielectric nanoantennas are promising as they provide…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages, 4 figures

  2. arXiv:2407.02536  [pdf, other]

    cs.LG cs.IR econ.GN stat.AP

    Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

    Authors: Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, Shashi Shekhar

    Abstract: Given a set \emph{S} of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs $<$a region ($r_{g}$), a subset \emph{C} of \emph{S}$>$ such that \emph{C} is a statistically significant regional-colocation pattern in $r_{g}$. This problem is important for applications in various domains including ecology, economics, and sociology. The prob…

    Submitted 1 July, 2024; originally announced July 2024.

    ACM Class: E.m; F.2; E.1; H.3; I.5; J.0

  3. arXiv:2407.00256  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts

    Authors: Ruochen Wang, Sohyun An, Minhao Cheng, Tianyi Zhou, Sung Ju Hwang, Cho-Jui Hsieh

    Abstract: Large Language Models (LLMs) exhibit strong generalization capabilities to novel tasks when prompted with language instructions and in-context demos. Since this ability sensitively depends on the quality of prompts, various methods have been explored to automate the instruction design. While these methods demonstrated promising results, they also restricted the searched prompt to one instruction.…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: ICML 2024. code available at https://github.com/ruocwang/mixture-of-prompts

    MSC Class: 68T01

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria, 2024

  4. arXiv:2406.17424  [pdf, other]

    cs.CG cs.DS

    Sparse Outerstring Graphs Have Logarithmic Treewidth

    Authors: Shinwoo An, Eunjin Oh, Jie Xue

    Abstract: An outerstring graph is the intersection graph of curves lying inside a disk with one endpoint on the boundary of the disk. We show that an outerstring graph with $n$ vertices has treewidth $O(α\log n)$, where $α$ denotes the arboricity of the graph, with an almost matching lower bound of $Ω(α\log (n/α))$. As a corollary, we show that a $t$-biclique-free outerstring graph has treewidth…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 17 pages, in ESA'24

  5. arXiv:2406.16003  [pdf]

    physics.optics

    Unidirectional Chiral Emission via Twisted Bi-layer Metasurfaces

    Authors: Dmitrii Gromyko, Shu An, Sergey Gorelik, Jiahui Xu, Li Jun Lim, Henry Yit Loong Lee, Febiana Tjiptoharsono, Zhi-Kuang Tan, Cheng-Wei Qiu, Zhaogang Dong, Lin Wu

    Abstract: Controlling and channelling light emissions from unpolarized quantum dots into specific directions with chiral polarization remains a key challenge in modern photonics. Stacked metasurface designs offer a potential compact solution for chirality and directionality engineering. However, experimental observations of directional chiral radiation from resonant metasurfaces with quantum emitters remain…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  6. arXiv:2406.12430  [pdf, other]

    cs.CL cs.AI cs.LG

    PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers

    Authors: Myeonghwa Lee, Seonho An, Min-Soo Kim

    Abstract: In this paper, we conduct a study to utilize LLMs as a solution for decision making that requires complex data analysis. We define Decision QA as the task of answering the best decision, $d_{best}$, for a decision-making question $Q$, business rules $R$ and a database $D$. Since there is no benchmark that can examine Decision QA, we propose Decision QA benchmark, DQA. It has two scenarios, Locatin…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: NAACL 2024

    ACM Class: I.2.7

  7. arXiv:2406.10957  [pdf, other]

    cs.CL

    Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

    Authors: Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun

    Abstract: Direct Preference Optimization (DPO) has emerged as a prominent algorithm for the direct and robust alignment of Large Language Models (LLMs) with human preferences, offering a more straightforward alternative to the complex Reinforcement Learning from Human Feedback (RLHF). Despite its promising efficacy, DPO faces a notable drawback: "verbosity", a common over-optimization phenomenon also observ…

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.10744   

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Jose Alvarez, Coert van Gemeren, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Sheng** Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou , et al. (77 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a…

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: The author list and contents need to be verified by all authors

  9. arXiv:2406.06379  [pdf, other]

    cs.CE

    FinVerse: An Autonomous Agent System for Versatile Financial Analysis

    Authors: Siyu An, Qin Li, Junru Lu, Di Yin, Xing Sun

    Abstract: With the significant advancements in cognitive intelligence driven by LLMs, autonomous agent systems have attracted extensive attention. Despite this growing interest, the development of stable and efficient agent systems poses substantial practical challenges. In this paper, we introduce FinVerse, a meticulously crafted agent system designed for a broad range of financial topics. FinVerse integra…

    Submitted 10 June, 2024; originally announced June 2024.

  10. arXiv:2406.04867  [pdf, other]

    cs.LG cs.AI cs.CV

    Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting

    Authors: Sojung An, Tae-** Oh, Eunha Sohn, Donghyun Kim

    Abstract: Deep learning-based time series forecasting has dominated the short-term precipitation forecasting field with the help of its ability to estimate motion flow in high-resolution datasets. The growing interest in precipitation nowcasting offers substantial opportunities for the advancement of current forecasting technologies. Nevertheless, there has been a scarcity of in-depth surveys of time series…

    Submitted 13 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 21 pages, 7 figures, 5 tables

  11. arXiv:2405.20602  [pdf, other]

    cs.LG cs.CL

    Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis

    Authors: Seunghwan An, Gyeongdong Woo, Jaesung Lim, ChangHyun Kim, Sungchul Hong, Jong-June Jeon

    Abstract: In this paper, our goal is to generate synthetic data for heterogeneous (mixed-type) tabular datasets with high machine learning utility (MLu). Given that the MLu performance relies on accurately approximating the conditional distributions, we focus on devising a synthetic data generation method based on conditional distribution estimation. We propose a novel synthetic data generation method, MaCo…

    Submitted 30 May, 2024; originally announced May 2024.

  12. arXiv:2405.19757  [pdf, other]

    cs.LG cs.AI

    Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering

    Authors: Sungchul Hong, Seunghwan An, Jong-June Jeon

    Abstract: Recent advances in a generative neural network model extend the development of data augmentation methods. However, the augmentation methods based on the modern generative models fail to achieve notable performance for class imbalance data compared to the conventional model, the SMOTE. We investigate the problem of the generative model for imbalanced classification and introduce a framework to enha…

    Submitted 30 May, 2024; originally announced May 2024.

  13. arXiv:2405.19346  [pdf, other]

    eess.SP cs.AI cs.LG

    Subject-Adaptive Transfer Learning Using Resting State EEG Signals for Cross-Subject EEG Motor Imagery Classification

    Authors: Sion An, Myeongkyun Kang, Soopil Kim, Philip Chikontwe, Li Shen, Sang Hyun Park

    Abstract: Electroencephalography (EEG) motor imagery (MI) classification is a fundamental, yet challenging task due to the variation of signals between individuals i.e., inter-subject variability. Previous approaches try to mitigate this using task-specific (TS) EEG signals from the target subject in training. However, recording TS EEG signals requires time and limits its applicability in various fields. In…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Early Accepted at MICCAI 2024

  14. arXiv:2405.16013  [pdf, other]

    cs.LG

    Convergence Behavior of an Adversarial Weak Supervision Method

    Authors: Steven An, Sanjoy Dasgupta

    Abstract: Labeling data via rules-of-thumb and minimal label supervision is central to Weak Supervision, a paradigm subsuming subareas of machine learning such as crowdsourced learning and semi-supervised ensemble learning. By using this labeled data to train modern machine learning methods, the cost of acquiring large amounts of hand labeled data can be ameliorated. Approaches to combining the rules-of-thu…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 49 pages, 16 figures, to be published in UAI 2024

  15. arXiv:2405.03439  [pdf, other]

    cond-mat.mes-hall

    Anomalous Inverse Spin Hall Effect (AISHE) due to Unconventional Spin Currents in Ferromagnetic Films with Tailored Interfacial Magnetic Anisotropy

    Authors: Soumik Aon, Harekrishna Bhunia, Pratap Kumar Pal, Abu Bakkar Miah, Dhananjaya Mahapatra, Anjan Barman, Partha Mitra

    Abstract: A single layer ferromagnetic film magnetized in the plane of an ac current flow, exhibits a characteristic Hall voltage with harmonic and second harmonic components, which is attributed to the presence of spin currents with polarization non-collinear with the magnetization. A set of 30 nm thick permalloy (Py) films used in this study are deposited at an oblique angle with respect to the substrate…

    Submitted 6 May, 2024; originally announced May 2024.

  16. arXiv:2405.02955  [pdf, other]

    quant-ph

    Minimizing Kinetic Inductance in Tantalum-Based Superconducting Coplanar Waveguide Resonators for Alleviating Frequency Fluctuation Issues

    Authors: Dengfeng Li, **g**g Hu, Yuan Li, Shuoming An

    Abstract: Advancements in the fabrication of superconducting quantum devices have highlighted tantalum as a promising material, owing to its low surface oxidation microwave loss at low temperatures. However, tantalum films exhibit significantly larger kinetic inductances compared to materials such as aluminum or niobium. Given the inevitable variations in film thickness, this increased kinetic inductance le…

    Submitted 5 May, 2024; originally announced May 2024.

  17. 3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

    Authors: Xuri Ge, Songpei Xu, Fuhai Chen, Jie Wang, Guoxin Wang, Shan An, Joemon M. Jose

    Abstract: In this paper, we propose a novel visual Semantic-Spatial Self-Highlighting Network (termed 3SHNet) for high-precision, high-efficiency and high-generalization image-sentence retrieval. 3SHNet highlights the salient identification of prominent objects and their spatial locations within the visual modality, thus allowing the integration of visual semantics-spatial interactions and maintaining indep…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted Information Processing and Management (IP&M), 10 pages, 9 figures and 8 tables

    Journal ref: Information Processing & Management, Volume 61, Issue 4, July 2024, 103716

  18. arXiv:2404.16898  [pdf, other]

    cs.LG cs.AI

    How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training

    Authors: Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nage

    Abstract: This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma. We perform a comprehensive comparative analysis of these parameterizations' influence on quantization-aware training, using both controlled experiments and real-world large language models. Our particula…

    Submitted 25 April, 2024; originally announced April 2024.

  19. arXiv:2404.16811  [pdf, other]

    cs.CL cs.AI

    Make Your LLM Fully Utilize the Context

    Authors: Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou

    Abstract: While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, known as the lost-in-the-middle challenge. We hypothesize that it stems from insufficient explicit supervision during the long-context training, which fails to emphasize that any position in a long context can hold crucial information. Based on t…

    Submitted 26 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures, 3 tables, 9 examples

  20. arXiv:2404.05680  [pdf, other]

    cs.CV

    SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

    Authors: Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, Guanying Chen, Xiaoguang Han

    Abstract: While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often caus…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: project page: https://lhyfst.github.io/spherehead

  21. arXiv:2404.03934  [pdf, other]

    cond-mat.mes-hall

    Direct Electrical Detection of Spin Chemical Potential Due to Spin Hall Effect in $β$-Tungsten and Platinum Using a Pair of Ferromagnetic and Normal Metal Voltage probes

    Authors: Soumik Aon, Abu Bakkar Miah, Arpita Mandal, Harekrishna Bhunia, Dhananjaya Mahapatra, Partha Mitra

    Abstract: The phenomenon of Spin Hall Effect (SHE) generates a pure spin current transverse to an applied current in materials with strong spin-orbit coupling, although not detectable through conventional electrical measurement. An intuitive Hall effect like measurement configuration is implemented to directly measure pure spin chemical potential of the accumulated spins at the edges of heavy metal (HM) cha…

    Submitted 5 April, 2024; originally announced April 2024.

  22. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  23. arXiv:2403.17188  [pdf, other]

    cs.CV cs.CR

    LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

    Authors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Backdoor attack poses a significant security threat to Deep Learning applications. Existing attacks are often not evasive to established backdoor detection techniques. This susceptibility primarily stems from the fact that these attacks typically leverage a universal trigger pattern or transformation function, such that the trigger can cause misclassification for any input. In response to this, re…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  24. arXiv:2403.11426  [pdf, other]

    cs.DS cs.CG

    ETH-Tight Algorithm for Cycle Packing on Unit Disk Graphs

    Authors: Shinwoo An, Eunjin Oh

    Abstract: In this paper, we consider the Cycle Packing problem on unit disk graphs defined as follows. Given a unit disk graph G with n vertices and an integer k, the goal is to find a set of $k$ vertex-disjoint cycles of G if it exists. Our algorithm runs in time $2^{O(\sqrt k)}n^{O(1)}$. This improves the $2^{O(\sqrt k\log k)}n^{O(1)}$-time algorithm by Fomin et al. [SODA 2012, ICALP 2017]. Moreover, our…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: In SoCG'24

  25. arXiv:2403.10141  [pdf, other]

    cond-mat.mes-hall

    Anisotropic magneto-photothermal voltage in Sb2Te3 topological insulator thin films

    Authors: Subhadip Manna, Sambhu G Nath, Samrat Roy, Soumik Aon, Sayani Pal, Kanav Sharma, Dhananjaya Mahapatra, Partha Mitra, Sourin Das, Bipul Pal, Chiranjib Mitra

    Abstract: We studied longitudinal and Hall photothermal voltages under a planar magnetic field scan in epitaxial thin films of the Topological Insulator (TI) Sb2Te3, grown using pulsed laser deposition (PLD). Unlike prior research that utilised polarised light-induced photocurrent to investigate the TI, our study introduces advancements based on unpolarized light-induced local heating. This method yields a…

    Submitted 15 March, 2024; originally announced March 2024.

  26. arXiv:2402.19431  [pdf, other]

    cs.SE cs.AI cs.CL

    Compositional API Recommendation for Library-Oriented Code Generation

    Authors: Zexiong Ma, Shengnan An, Bing Xie, Zeqi Lin

    Abstract: Large language models (LLMs) have achieved exceptional performance in code generation. However, the performance remains unsatisfactory in generating library-oriented code, especially for the libraries not present in the training data of LLMs. Previous work utilizes API recommendation technology to help LLMs use libraries: it retrieves APIs related to the user requirements, then leverages them as c…

    Submitted 29 February, 2024; originally announced February 2024.

    Journal ref: 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024), Apr 2024, Lisboa, Portugal

  27. arXiv:2402.11811  [pdf, other]

    cs.CL

    FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema

    Authors: Junru Lu, Siyu An, Min Zhang, Yulan He, Di Yin, Xing Sun

    Abstract: When the quality of naive prompts is carefully optimized by human experts, the task performance of large language models (LLMs) can be significantly improved. However, expert-based prompt optimizations are expensive. Herein, some works have proposed Automatic Prompt Optimization (APO), to optimize naive prompts according to task outputs of given in-box testing models, with the help of advanced LLM…

    Submitted 16 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  28. arXiv:2402.05467  [pdf, other]

    cs.AI cs.CL cs.CR

    Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia

    Authors: Guangyu Shen, Siyuan Cheng, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Lu Yan, Zhuo Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Large Language Models (LLMs) have become prevalent across diverse sectors, transforming human life with their extraordinary reasoning and comprehension abilities. As they find increased use in sensitive tasks, safety concerns have gained widespread attention. Extensive efforts have been dedicated to aligning LLMs with human moral principles to ensure their safe deployment. Despite their potential,…

    Submitted 8 February, 2024; originally announced February 2024.

  29. arXiv:2401.07571  [pdf, other]

    cs.CV

    A Bi-Pyramid Multimodal Fusion Method for the Diagnosis of Bipolar Disorders

    Authors: Guoxin Wang, Sheng Shi, Shan An, Fengmei Fan, Wenshu Ge, Qi Wang, Feng Yu, Zhiren Wang

    Abstract: Previous research on the diagnosis of Bipolar disorder has mainly focused on resting-state functional magnetic resonance imaging. However, their accuracy can not meet the requirements of clinical diagnosis. Efficient multimodal fusion strategies have great potential for applications in multimodal data and can further improve the performance of medical diagnosis models. In this work, we utilize bot…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE ICASSP 2024

  30. arXiv:2401.03850  [pdf, other]

    eess.AS cs.SD

    Inverse Nonlinearity Compensation of Hyperelastic Deformation in Dielectric Elastomer for Acoustic Actuation

    Authors: ** Woo Lee, Gwang Seok An, Jeong-Yun Sun, Kyogu Lee

    Abstract: This paper delves into the analysis of nonlinear deformation induced by dielectric actuation in pre-stressed ideal dielectric elastomers. It formulates a nonlinear ordinary differential equation governing this deformation based on the hyperelastic model under dielectric stress. Through numerical integration and neural network approximations, the relationship between voltage and stretch is establis…

    Submitted 8 January, 2024; originally announced January 2024.

  31. arXiv:2401.01457  [pdf, other]

    math.CO

    Flip Graphs on Self-Complementary Ideals of Chain Products

    Authors: Serena An, Holden Mui

    Abstract: In this paper, we introduce a flip operation on self-complementary ideals of chain product posets and study the resulting flip graphs. We give asymptotics for the number of vertices in these graphs, compute their diameters, and give bounds for their radii. We also define similar flip operations on self-complementary ideals of the chain product $[2r]\times [2r]\times [2r]$ satisfying additional sym…

    Submitted 3 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: 39 pages, 32 figures

  32. arXiv:2401.00496  [pdf, other]

    cs.CV cs.AI cs.LG

    SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

    Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi , et al. (25 additional authors not shown)

    Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme…

    Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  33. arXiv:2312.14492  [pdf, other]

    cs.CV

    Context Enhanced Transformer for Single Image Object Detection

    Authors: Seungjun An, Seonghoon Park, Gyeongnyeon Kim, Jeongyeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: With the increasing importance of video data in real-world applications, there is a rising need for efficient object detection methods that utilize temporal information. While existing video object detection (VOD) techniques employ various strategies to address this challenge, they typically depend on locally adjacent frames or randomly sampled images within a clip. Although recent Transformer-bas…

    Submitted 26 December, 2023; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Project page: https://ku-cvlab.github.io/CETR

  34. arXiv:2312.13783  [pdf, other]

    cs.CV cs.AI cs.LG

    Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection

    Authors: Soopil Kim, Sion An, Philip Chikontwe, Myeongkyun Kang, Ehsan Adeli, Kilian M. Pohl, Sang Hyun Park

    Abstract: Logical anomalies (LA) refer to data violating underlying logical constraints, e.g., the quantity, arrangement, or composition of components within an image. Accurately detecting such anomalies requires models to reason about various component types through segmentation. However, curation of pixel-level annotations for semantic segmentation is both time-consuming and expensive. Although there are s…

    Submitted 15 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI2024

  35. arXiv:2312.06871  [pdf, other]

    cs.AI cs.LG cs.MA

    Using Analytics on Student Created Data to Content Validate Pedagogical Tools

    Authors: John Kos, Kenneth Eaton, Sareen Zhang, Rahul Dass, Stephen Buckley, Sungeun An, Ashok Goel

    Abstract: Conceptual and simulation models can function as useful pedagogical tools; however, it is important to categorize different outcomes when evaluating them in order to more meaningfully interpret results. VERA is an ecology-based conceptual modeling software that enables users to simulate interactions between biotics and abiotics in an ecosystem, allowing users to form and then verify hypothesis throu…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, preprint

  36. arXiv:2312.06405  [pdf, other]

    quant-ph

    Optimizing Resonator Frequency Stability in Flip-Chip Architectures: A Novel Experimental Design Approach

    Authors: Yuan Li, Tianhui Wang, **g**g Hu, Dengfeng Li, Shuoming An

    Abstract: In multi-qubit superconducting systems utilizing flip-chip technology, achieving high accuracy in resonator frequencies is of paramount importance, particularly when multiple resonators share a common Purcell filter with restricted bandwidth. Nevertheless, variations in inter-chip spacing can considerably influence these frequencies. To tackle this issue, we present and experimentally validate the…

    Submitted 11 December, 2023; originally announced December 2023.

  37. arXiv:2312.03307  [pdf, other]

    stat.ML cs.LG

    Balanced Marginal and Joint Distributional Learning via Mixture Cramer-Wold Distance

    Authors: Seunghwan An, Sungchul Hong, Jong-June Jeon

    Abstract: In the process of training a generative model, it becomes essential to measure the discrepancy between two high-dimensional probability distributions: the generative distribution and the ground-truth distribution of the observed dataset. Recently, there has been growing interest in an approach that involves slicing high-dimensional distributions, with the Cramer-Wold distance emerging as a promisi…

    Submitted 6 December, 2023; originally announced December 2023.

  38. arXiv:2312.01729  [pdf, other]

    cs.LG

    EdgeConvFormer: Dynamic Graph CNN and Transformer based Anomaly Detection in Multivariate Time Series

    Authors: Jie Liu, Qilin Li, Senjian An, Bradley Ezard, Ling Li

    Abstract: Transformer-based models for anomaly detection in multivariate time series can benefit from the self-attention mechanism due to its advantage in modeling long-term dependencies. However, Transformer-based anomaly detection models have problems such as a large amount of data being required for training, standard positional encoding is not suitable for multivariate time series data, and the interdep…

    Submitted 4 December, 2023; originally announced December 2023.

  39. arXiv:2312.00050  [pdf, other]

    cs.CR cs.AI cs.LG

    Elijah: Eliminating Backdoors Injected in Diffusion Models via Distribution Shift

    Authors: Shengwei An, Sheng-Yen Chou, Kaiyuan Zhang, Qiuling Xu, Guanhong Tao, Guangyu Shen, Siyuan Cheng, Shiqing Ma, Pin-Yu Chen, Tsung-Yi Ho, Xiangyu Zhang

    Abstract: Diffusion models (DM) have become state-of-the-art generative models because of their capability to generate high-quality images from noises without adversarial training. However, they are vulnerable to backdoor attacks as reported by recent studies. When a data input (e.g., some Gaussian noise) is stamped with a trigger (e.g., a white patch), the backdoored model always generates the target image…

    Submitted 4 February, 2024; v1 submitted 27 November, 2023; originally announced December 2023.

    Comments: AAAI 2024

  40. arXiv:2311.03665  [pdf, other]

    cs.CG cs.DS

    Faster Algorithms for Cycle Hitting Problems on Disk Graphs

    Authors: Shinwoo An, Kyungjin Cho, Eunjin Oh

    Abstract: In this paper, we consider three hitting problems on a disk intersection graph: Triangle Hitting Set, Feedback Vertex Set, and Odd Cycle Transversal. Given a disk intersection graph $G$, our goal is to compute a set of vertices hitting all triangles, all cycles, or all odd cycles, respectively. Our algorithms run in time $2^{\tilde O(k^{4/5})}n^{O(1)}$, $2^{\tilde O(k^{9/10})}n^{O(1)}$, and…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: WADS 2023

  41. arXiv:2310.20689  [pdf, other]

    cs.CL cs.AI

    Learning From Mistakes Makes LLM Better Reasoner

    Authors: Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

    Abstract: Large language models (LLMs) recently exhibited remarkable reasoning capabilities on solving math problems. To further improve their reasoning capabilities, this work explores whether LLMs can LEarn from MistAkes (LEMA), akin to the human learning process. Consider a human student who failed to solve a math problem, he will learn from what mistake he has made and how to correct it. Mimicking this…

    Submitted 29 March, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 23 pages, 13 figures, 6 tables

  42. arXiv:2310.20187  [pdf, other]

    cs.LG cs.AI

    Self-Supervised Pre-Training for Precipitation Post-Processor

    Authors: Sojung An, Junha Lee, Jiyeon Jang, Inchae Na, Wooyeon Park, Sujeong You

    Abstract: Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation…

    Submitted 19 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted to NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning at [this http URL](https://www.climatechange.ai/papers/neurips2023/18)

  43. arXiv:2310.16374  [pdf, other]

    cs.LG stat.ML

    Joint Distributional Learning via Cramer-Wold Distance

    Authors: Seunghwan An, Jong-June Jeon

    Abstract: The assumption of conditional independence among observed variables, primarily used in the Variational Autoencoder (VAE) decoder modeling, has limitations when dealing with high-dimensional datasets or complex correlation structures among observed variables. To address this issue, we introduced the Cramer-Wold distance regularization, which can be computed in a closed-form, to facilitate joint dis…

    Submitted 25 October, 2023; originally announced October 2023.

  44. arXiv:2310.15179  [pdf, other]

    physics.ao-ph cs.AI cs.LG math.DS stat.OT

    Reducing Uncertainty in Sea-level Rise Prediction: A Spatial-variability-aware Approach

    Authors: Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian

    Abstract: Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such a…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures, I-GUIDE 2023 conference

    ACM Class: J.2; I.2.m; I.2.6; I.2.1; I.2

  45. arXiv:2310.11650  [pdf, other]

    cs.IR cs.CV cs.MM

    VKIE: The Application of Key Information Extraction on Video Text

    Authors: Siyu An, Ye Liu, Haoyuan Peng, Di Yin

    Abstract: Extracting structured information from videos is critical for numerous downstream applications in the industry. In this paper, we define a significant task of extracting hierarchical key information from visual texts on videos. To fulfill this task, we decouple it into four subtasks and introduce two implementation solutions called PipVKIE and UniVKIE. PipVKIE sequentially completes the four subta…

    Submitted 9 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  46. arXiv:2310.06011  [pdf, other]

    astro-ph.GA

    On the root cause of the host `mass-step' in the Hubble residuals of type Ia supernovae

    Authors: Chul Chung, Suk-Jin Yoon, Seunghyun Park, Seunghyeon An, Junhyuk Son, Hyejeon Cho, Young-Wook Lee

    Abstract: It is well established that the Hubble residuals of type Ia supernovae (SNe Ia) show the luminosity step with respect to their host galaxy stellar masses. This `mass-step' is taken as an additional correction factor for the SN Ia luminosity standardization. Here we investigate the root cause of the mass-step and propose that the bimodal nature of the host $age$ distribution is responsible for the…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in ApJ, 10 pages, 5 figures, 1 table

  47. arXiv:2310.02690  [pdf, other]

    eess.IV cs.CV

    Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Clasification

    Authors: Guoxin Wang, Xuyang Cao, Shan An, Fengmei Fan, Chao Zhang, Jinsong Wang, Feng Yu, Zhiren Wang

    Abstract: Deep learning approaches, together with neuroimaging techniques, play an important role in psychiatric disorders classification. Previous studies on psychiatric disorders diagnosis mainly focus on using functional connectivity matrices of resting-state functional magnetic resonance imaging (rs-fMRI) as input, which still needs to fully utilize the rich temporal information of the time series of rs…

    Submitted 4 October, 2023; originally announced October 2023.

  48. arXiv:2309.13450  [pdf]

    cs.SE

    Conducting A/B Experiments with a Scalable Architecture

    Authors: Andrew Hornback, Sungeun An, Scott Bunin, Stephen Buckley, John Kos, Ashok Goel

    Abstract: A/B experiments are commonly used in research to compare the effects of changing one or more variables in two different experimental groups - a control group and a treatment group. While the benefits of using A/B experiments are widely known and accepted, there is less agreement on a principled approach to creating software infrastructure systems to assist in rapidly conducting such experiments. W…

    Submitted 23 September, 2023; originally announced September 2023.

  49. arXiv:2309.05590  [pdf, other]

    cs.CV cs.AI cs.MM

    Temporal Action Localization with Enhanced Instant Discriminability

    Authors: Dingfeng Shi, Qiong Cao, Yujie Zhong, Shan An, Jian Cheng, Haogang Zhu, Dacheng Tao

    Abstract: Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by existing methods. To resolve this issue, we propose a one-stage framework named TriDet. First, we propose a Trident-head to model the action boundary via an estimated…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: An extended version of the CVPR paper arXiv:2303.07347, submitted to IJCV

  50. arXiv:2308.15779  [pdf, other]

    cond-mat.mtrl-sci physics.app-ph

    Exploring GaN crystallographic orientation disparity and its origin on bare and partly graphene-covered $m$-plane sapphire substrates

    Authors: Hyunkyu Lee, Hyeonoh Jo, Jae Hun Kim, Jongwoo Ha, Su Young An, Jaewu Choi, Chinkyo Kim

    Abstract: The crystallographic orientation of 3D materials grown over 2D material-covered substrates is one of the critical factors in discerning the true growth mechanism among competing possibilities, including remote epitaxy, van der Waals epitaxy, and pinhole-seeded lateral epitaxy also known as thru-hole epitaxy. However, definitive identification demands meticulous investigation to accurately interpre…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 15 pages, 5 figures