Skip to main content

Showing 1–50 of 281 results for author: Song, M

Searching in archive cs. Search in all archives.
.
  1. Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks

    Authors: Yuwen Wang, Shunyu Liu, Tongya Zheng, Kaixuan Chen, Mingli Song

    Abstract: Graph Neural Networks (GNNs) have emerged as a prominent framework for graph mining, leading to significant advances across various domains. Stemmed from the node-wise representations of GNNs, existing explanation studies have embraced the subgraph-specific viewpoint that attributes the decision results to the salient features and local structures of nodes. However, graph-level tasks necessitate l… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted in KDD2024

  2. Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks

    Authors: Feiyang Xu, Shunyu Liu, Yunpeng Qing, Yihe Zhou, Yuwen Wang, Mingli Song

    Abstract: Active Voltage Control (AVC) on the Power Distribution Networks (PDNs) aims to stabilize the voltage levels to ensure efficient and reliable operation of power systems. With the increasing integration of distributed energy resources, recent efforts have explored employing multi-agent reinforcement learning (MARL) techniques to realize effective AVC. Existing methods mainly focus on the acquisition… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 12 pages, 8 figures

  3. arXiv:2406.15695  [pdf, other

    cs.CL

    SS-Bench: A Benchmark for Social Story Generation and Evaluation

    Authors: Yi Feng, Mingyang Song, Jiaqi Wang, Mao Zheng, Li** **g, Jian Yu

    Abstract: Children with Autism Spectrum Disorder (ASD) often misunderstand social situations and struggle to participate in daily routines. Psychology experts write Social Stories under strict constraints of structural clarity, descriptive orientation, and situational safety to enhance their abilities in these regimes. However, Social Stories are costly in creation and often limited in diversity and timelin… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.12315  [pdf, other

    cs.AI

    PruningBench: A Comprehensive Benchmark of Structural Pruning

    Authors: Haoling Li, Changhao Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Yong Wang, Lechao Cheng, Mingli Song, Jie Song

    Abstract: Structural pruning has emerged as a promising approach for producing more efficient models. Nevertheless, the community suffers from a lack of standardized benchmarks and metrics, leaving the progress in this area not fully comprehended. To fill this gap, we present the first comprehensive benchmark, termed \textit{PruningBench}, for structural pruning. PruningBench showcases the following three c… ▽ More

    Submitted 28 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS 2024 Datasets and Benchmarks Track

  5. arXiv:2406.12117  [pdf, other

    cs.CL

    Decoding the Narratives: Analyzing Personal Drug Experiences Shared on Reddit

    Authors: Layla Bouzoubaa, Elham Aghakhani, Max Song, Minh Trinh, Rezvaneh Rezapour

    Abstract: Online communities such as drug-related subreddits serve as safe spaces for people who use drugs (PWUD), fostering discussions on substance use experiences, harm reduction, and addiction recovery. Users' shared narratives on these forums provide insights into the likelihood of develo** a substance use disorder (SUD) and recovery potential. Our study aims to develop a multi-level, multi-label cla… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Findings of the Association for Computational Linguistics: ACL 2024

  6. arXiv:2406.11629  [pdf, other

    cs.CL

    Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

    Authors: Mingyang Song, Mao Zheng, Xuan Luo

    Abstract: Leveraging Large Language Models (LLMs) as judges for judging the performance of LLMs has recently garnered attention. However, this type of approach is affected by the potential biases in LLMs, raising concerns about the reliability of the evaluation results. To mitigate this issue, we propose and study two versions of many-shot in-context prompts, which rely on two existing settings of many-shot… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: work in progress

  7. arXiv:2406.09799  [pdf, other

    cs.CY

    GeoSEE: Regional Socio-Economic Estimation With a Large Language Model

    Authors: Sungwon Han, Donghyun Ahn, Seungeon Lee, Minhyuk Song, Sungwon Park, Sangyoon Park, Jihee Kim, Meeyoung Cha

    Abstract: Moving beyond traditional surveys, combining heterogeneous data sources with AI-driven inference models brings new opportunities to measure socio-economic conditions, such as poverty and population, over expansive geographic areas. The current research presents GeoSEE, a method that can estimate various socio-economic indicators using a unified pipeline powered by a large language model (LLM). Pre… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2406.09181  [pdf, other

    cs.CV cs.AI

    A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

    Authors: Yijun Bei, Hengrui Lou, **song Geng, Erteng Liu, Lechao Cheng, Jie Song, Mingli Song, Zunlei Feng

    Abstract: With the rapid development of AI-generated content (AIGC) technology, the production of realistic fake facial images and videos that deceive human visual perception has become possible. Consequently, various face forgery detection techniques have been proposed to identify such fake facial content. However, evaluating the effectiveness and generalizability of these detection techniques remains a si… ▽ More

    Submitted 13 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: This is a paper about constructing a large-scale universal evaluation benchmark for face forgery detection.The full text is 30 pages

  9. arXiv:2406.08829  [pdf, other

    cs.CV cs.CR

    Improving Adversarial Robustness via Feature Pattern Consistency Constraint

    Authors: Jiacong Hu, **gwen Ye, Zunlei Feng, Jiazhen Yang, Shunyu Liu, Xiaotian Yu, Lingxiang Jia, Mingli Song

    Abstract: Convolutional Neural Networks (CNNs) are well-known for their vulnerability to adversarial attacks, posing significant security concerns. In response to these threats, various defense methods have emerged to bolster the model's robustness. However, most existing methods either focus on learning from adversarial perturbations, leading to overfitting to the adversarial examples, or aim to eliminate… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.03574  [pdf, ps, other

    cs.DS cs.LG math.OC

    A Simple Learning-Augmented Algorithm for Online Packing with Concave Objectives

    Authors: Elena Grigorescu, Young-San Lin, Maoyuan Song

    Abstract: Learning-augmented algorithms has been extensively studied recently in the computer-science community, due to the potential of using machine learning predictions in order to improve the performance of algorithms. Predictions are especially useful for online algorithms making irrevocable decisions without knowledge of the future. Such learning-augmented algorithms aim to overcome the limitations of… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 13 pages, 2 figures. Abstract shortened to fit arXiv limit

  11. arXiv:2406.01647  [pdf, other

    cs.LG cs.AI

    An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints

    Authors: Mooho Song, Jay-Yoon Lee

    Abstract: Neural networks (NN) perform well in diverse tasks, but sometimes produce nonsensical results to humans. Most NN models "solely" learn from (input, output) pairs, occasionally conflicting with human knowledge. Many studies indicate injecting human knowledge by reducing output constraints during training can improve model performance and reduce constraint violations. While there have been several a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  12. arXiv:2405.16571  [pdf, other

    cs.CL

    A Preliminary Empirical Study on Prompt-based Unsupervised Keyphrase Extraction

    Authors: Mingyang Song, Yi Feng, Li** **g

    Abstract: Pre-trained large language models can perform natural language processing downstream tasks by conditioning on human-designed prompts. However, a prompt-based approach often requires "prompt engineering" to design different prompts, primarily hand-crafted through laborious trial and error, requiring human intervention and expertise. It is a challenging problem when constructing a prompt-based keyph… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: work in progress

  13. arXiv:2405.16002  [pdf, other

    cs.LG math.OC stat.ML

    Does SGD really happen in tiny subspaces?

    Authors: Minhak Song, Kwangjun Ahn, Chulhee Yun

    Abstract: Understanding the training dynamics of deep neural networks is challenging due to their high-dimensional nature and intricate loss landscapes. Recent studies have revealed that, along the training trajectory, the gradient approximately aligns with a low-rank top eigenspace of the training loss Hessian, referred to as the dominant subspace. Given this alignment, this paper explores whether neural n… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 22 pages

  14. arXiv:2405.15831  [pdf, other

    eess.SY cs.AI cs.LG

    Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map

    Authors: Shunyu Liu, Wei Luo, Yanzhen Zhou, Kaixuan Chen, Quan Zhang, Huating Xu, Qinglai Guo, Mingli Song

    Abstract: Transmission interface power flow adjustment is a critical measure to ensure the security and economy operation of power systems. However, conventional model-based adjustment schemes are limited by the increasing variations and uncertainties occur in power systems, where the adjustment problems of different transmission interfaces are often treated as several independent tasks, ignoring their coup… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE Transactions on Power Systems

  15. arXiv:2405.14280  [pdf, other

    cs.IR

    ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval

    Authors: Yuxuan Liu, Tianchi Yang, Zihan Zhang, Minghui Song, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: Generative retrieval, a promising new paradigm in information retrieval, employs a seq2seq model to encode document features into parameters and decode relevant document identifiers (IDs) based on search queries. Existing generative retrieval solutions typically rely on a preprocessing stage to pre-define document IDs, which can suffer from a semantic gap between these IDs and the retrieval task.… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  16. arXiv:2405.06063  [pdf, other

    cs.LG

    A Minimalist Prompt for Zero-Shot Policy Learning

    Authors: Meng Song, Xuezhi Wang, Tanay Biradar, Yao Qin, Manmohan Chandraker

    Abstract: Transformer-based methods have exhibited significant generalization ability when prompted with target-domain demonstrations or example solutions during inference. Although demonstrations, as a way of task specification, can capture rich information that may be hard to specify by language, it remains unclear what information is extracted from the demonstrations to help generalization. Moreover, ass… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  17. arXiv:2405.00476  [pdf, other

    cs.LG

    A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges

    Authors: ZhengZhao Feng, Rui Wang, TianXing Wang, Mingli Song, Sai Wu, Shuibing He

    Abstract: Dynamic Graph Neural Networks (GNNs) combine temporal information with GNNs to capture structural, temporal, and contextual relationships in dynamic graphs simultaneously, leading to enhanced performance in various applications. As the demand for dynamic GNNs continues to grow, numerous models and frameworks have emerged to cater to different application needs. There is a pressing need for a compr… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Under review of PVLDB2025

  18. arXiv:2404.07554  [pdf, other

    cs.CV cs.AI

    CAT: Contrastive Adapter Training for Personalized Image Generation

    Authors: Jae Wan Park, Sang Hyun Park, Jun Young Koh, Junha Lee, Min Song

    Abstract: The emergence of various adapters, including Low-Rank Adaptation (LoRA) applied from the field of natural language processing, has allowed diffusion models to personalize image generation at a low cost. However, due to the various challenges including limited datasets and shortage of regularization and computation resources, adapter training often results in unsatisfactory outcomes, leading to the… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPRW 2024

  19. arXiv:2404.03272  [pdf, other

    cs.LG cs.CC cs.CR math.ST stat.ML

    Cryptographic Hardness of Score Estimation

    Authors: Min Jae Song

    Abstract: We show that $L^2$-accurate score estimation, in the absence of strong assumptions on the data distribution, is computationally hard even when sample complexity is polynomial in the relevant problem parameters. Our reduction builds on the result of Chen et al. (ICLR 2023), who showed that the problem of generating samples from an unknown data distribution reduces to $L^2$-accurate score estimation… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 28 pages

  20. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  21. arXiv:2404.01620  [pdf

    cs.SD cs.AI cs.CY eess.AS

    Voice EHR: Introducing Multimodal Audio Data for Health

    Authors: James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Yen Minh Lam, Hang Nguyen, Phuc Hong, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton , et al. (3 additional authors not shown)

    Abstract: Large AI models trained on audio data may have the potential to rapidly classify patients, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets using expensive recording equipment in high-income, English-speaking countries. This challenges deployment in resource-constrained, high-volume settings where audio d… ▽ More

    Submitted 1 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 19 pages, 2 figures, 7 tables

  22. arXiv:2403.14951  [pdf, other

    cs.LG cs.AI cs.SI

    Simple Graph Condensation

    Authors: Zhenbang Xiao, Yu Wang, Shunyu Liu, Huiqiong Wang, Mingli Song, Tongya Zheng

    Abstract: The burdensome training costs on large-scale graphs have aroused significant interest in graph condensation, which involves tuning Graph Neural Networks (GNNs) on a small condensed graph for use on the large-scale original graph. Existing methods primarily focus on aligning key metrics between the condensed and original graphs, such as gradients, distribution and trajectory of GNNs, yielding satis… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Under review

  23. arXiv:2403.14349  [pdf, other

    cs.CV

    On the Concept Trustworthiness in Concept Bottleneck Models

    Authors: Qihan Huang, Jie Song, **gwen Hu, Haofei Zhang, Yong Wang, Mingli Song

    Abstract: Concept Bottleneck Models (CBMs), which break down the reasoning process into the input-to-concept map** and the concept-to-label prediction, have garnered significant attention due to their remarkable interpretability achieved by the interpretable concept bottleneck. However, despite the transparency of the concept-to-label prediction, the map** from the input to the intermediate concept rema… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  24. arXiv:2403.11802  [pdf, other

    cs.CL

    Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models

    Authors: Mingyang Song, Mao Zheng, Xuan Luo

    Abstract: While recent research endeavors have focused on develo** Large Language Models (LLMs) with robust long-context capabilities, due to the lack of long-context benchmarks, relatively little is known about how well the performance of long-context LLMs. To address this gap, we propose a multi-evidence, position-aware, and scalable benchmark for evaluating long-context LLMs, named Counting-Stars, whic… ▽ More

    Submitted 17 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: work in progress

  25. arXiv:2403.10875  [pdf, other

    cs.LG

    Probabilistic World Modeling with Asymmetric Distance Measure

    Authors: Meng Song

    Abstract: Representation learning is a fundamental task in machine learning, aiming at uncovering structures from data to facilitate subsequent tasks. However, what is a good representation for planning and reasoning in a stochastic world remains an open problem. In this work, we posit that learning a distance function is essential to allow planning and reasoning in the representation space. We show that a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  26. arXiv:2403.07262  [pdf, other

    cs.LG cs.AI

    A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

    Authors: Yunpeng Qing, Shunyu liu, **gyuan Cong, Kaixuan Chen, Yihe Zhou, Mingli Song

    Abstract: Offline reinforcement learning endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the out-of-distribution problem. However, existing works often suffer from the constraint conflict issue when offline datasets are collected from multiple behavior policies, i.… ▽ More

    Submitted 30 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  27. COLA: Cross-city Mobility Transformer for Human Trajectory Simulation

    Authors: Yu Wang, Tongya Zheng, Yuxuan Liang, Shunyu Liu, Mingli Song

    Abstract: Human trajectory data produced by daily mobile devices has proven its usefulness in various substantial fields such as urban planning and epidemic prevention. In terms of the individual privacy concern, human trajectory simulation has attracted increasing attention from researchers, targeting at offering numerous realistic mobility data for downstream tasks. Nevertheless, the prevalent issue of da… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024

  28. arXiv:2403.01753  [pdf, other

    cs.CV

    Training-Free Pretrained Model Merging

    Authors: Zhengqi Xu, Ke Yuan, Huiqiong Wang, Yong Wang, Mingli Song, Jie Song

    Abstract: Recently, model merging techniques have surfaced as a solution to combine multiple single-talent models into a single multi-talent model. However, previous endeavors in this field have either necessitated additional training or fine-tuning processes, or require that the models possess the same pre-trained initialization. In this work, we identify a common drawback in prior works w.r.t. the inconsi… ▽ More

    Submitted 15 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: CVPR2024 accepted

  29. arXiv:2402.18264  [pdf, other

    cs.CL

    Retrieval-based Full-length Wikipedia Generation for Emergent Events

    Authors: Jiebin Zhang, Eugene J. Yu, Qinyu Chen, Chenhao Xiong, Dawei Zhu, Han Qian, Mingbo Song, Xiaoguang Li, Qun Liu, Sujian Li

    Abstract: In today's fast-paced world, the growing demand to quickly generate comprehensive and accurate Wikipedia documents for emerging events is both crucial and challenging. However, previous efforts in Wikipedia generation have often fallen short of meeting real-world requirements. Some approaches focus solely on generating segments of a complete Wikipedia document, while others overlook the importance… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  30. arXiv:2402.18039  [pdf, other

    cs.CL cs.AI

    ResLoRA: Identity Residual Map** in Low-Rank Adaption

    Authors: Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

    Abstract: As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs). However, updating the weights of LoRA blocks effectively and expeditiously is challenging due to the long calculation path in the original model. To address this, we propose ResLoRA, an improved framework of LoRA. By adding residual pa… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  31. arXiv:2402.10002  [pdf, other

    cs.CV cs.AI cs.MM

    MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding

    Authors: Hai-Tao Yu, Mofei Song

    Abstract: In perception, multiple sensory information is integrated to map visual information from 2D views onto 3D objects, which is beneficial for understanding in 3D environments. But in terms of a single 2D view rendered from different angles, only limited partial information can be provided.The richness and value of Multi-view 2D information can provide superior self-supervised signals for 3D objects.… ▽ More

    Submitted 25 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

    Journal ref: AAAI 2024

  32. arXiv:2402.09173  [pdf, other

    cs.LG

    Nearly Optimal Regret for Decentralized Online Convex Optimization

    Authors: Yuanyu Wan, Tong Wei, Mingli Song, Lijun Zhang

    Abstract: We investigate decentralized online convex optimization (D-OCO), in which a set of local learners are required to minimize a sequence of global loss functions using only local computations and communications. Previous studies have established $O(n^{5/4}ρ^{-1/2}\sqrt{T})$ and ${O}(n^{3/2}ρ^{-1}\log T)$ regret bounds for convex and strongly convex functions respectively, where $n$ is the number of l… ▽ More

    Submitted 23 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  33. arXiv:2402.09152  [pdf, other

    cs.LG

    Improved Regret for Bandit Convex Optimization with Delayed Feedback

    Authors: Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang

    Abstract: We investigate bandit convex optimization (BCO) with delayed feedback, where only the loss value of the action is revealed under an arbitrary delay. Let $n,T,\bar{d}$ denote the dimensionality, time horizon, and average delay, respectively. Previous studies have achieved an $O(\sqrt{n}T^{3/4}+(n\bar{d})^{1/3}T^{2/3})$ regret bound for this problem, whose delay-independent part matches the regret o… ▽ More

    Submitted 23 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  34. arXiv:2402.02405  [pdf, other

    cs.RO cs.CV

    Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios

    Authors: Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Song

    Abstract: Due to the inability to receive signals from the Global Navigation Satellite System (GNSS) in extreme conditions, achieving accurate and robust navigation for Unmanned Aerial Vehicles (UAVs) is a challenging task. Recently emerged, vision-based navigation has been a promising and feasible alternative to GNSS-based navigation. However, existing vision-based techniques are inadequate in addressing f… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  35. arXiv:2402.02315  [pdf, other

    cs.CL q-fin.GN

    A Survey of Large Language Models in Finance (FinLLMs)

    Authors: Jean Lee, Nicholas Stevens, Soyeon Caren Han, Minseok Song

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities across a wide variety of Natural Language Processing (NLP) tasks and have attracted attention from multiple domains, including financial services. Despite the extensive research into general-domain LLMs, and their immense potential in finance, Financial LLM (FinLLM) research remains limited. This survey provides a comprehensive overvi… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: More information on https://github.com/adlnlp/FinLLMs

  36. arXiv:2401.12987  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation

    Authors: Taeyang Yun, Hyunkuk Lim, Jeonghwan Lee, Min Song

    Abstract: Emotion Recognition in Conversation (ERC) plays a crucial role in enabling dialogue systems to effectively respond to user requests. The emotions in a conversation can be identified by the representations from various modalities, such as audio, visual, and text. However, due to the weak contribution of non-verbal modalities to recognize emotions, multimodal ERC has always been considered a challen… ▽ More

    Submitted 31 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: NAACL 2024 main conference

  37. arXiv:2401.12231  [pdf, other

    cs.SI cs.LG

    Disentangled Condensation for Large-scale Graphs

    Authors: Zhenbang Xiao, Shunyu Liu, Yu Wang, Tongya Zheng, Mingli Song

    Abstract: Graph condensation has emerged as an intriguing technique to provide Graph Neural Networks for large-scale graphs with a more compact yet informative small graph to save the expensive costs of large-scale graph learning. Despite the promising results achieved, previous graph condensation methods often employ an entangled condensation strategy that involves condensing nodes and edges simultaneously… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Under Review

  38. arXiv:2401.11902  [pdf, other

    eess.IV cs.CV

    A Training-Free Defense Framework for Robust Learned Image Compression

    Authors: Myungseo Song, **young Choi, Bohyung Han

    Abstract: We study the robustness of learned image compression models against adversarial attacks and present a training-free defense technique based on simple image transform functions. Recent learned image compression models are vulnerable to adversarial attacks that result in poor compression rate, low reconstruction quality, or weird artifacts. To address the limitations, we propose a simple but effecti… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 10 pages and 14 figures

  39. arXiv:2401.10257  [pdf, other

    cs.NE cs.LG

    Curriculum Design Helps Spiking Neural Networks to Classify Time Series

    Authors: Chenxi Sun, Hongyan Li, Moxian Song, Derun Can, Shenda Hong

    Abstract: Spiking Neural Networks (SNNs) have a greater potential for modeling time series data than Artificial Neural Networks (ANNs), due to their inherent neuron dynamics and low energy consumption. However, it is difficult to demonstrate their superiority in classification accuracy, because current efforts mainly focus on designing better network structures. In this work, enlighten by brain-inspired sci… ▽ More

    Submitted 25 December, 2023; originally announced January 2024.

    Comments: 11 pages, 3 figures

  40. arXiv:2401.02771  [pdf, other

    cs.LG eess.SY

    Powerformer: A Section-adaptive Transformer for Power Flow Adjustment

    Authors: Kaixuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song

    Abstract: In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections. Specifically, our proposed approach, named Powerformer, develops a dedicated section-adaptive attention mechanism, separating itself from the self-attention used in… ▽ More

    Submitted 30 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 8 figures

  41. arXiv:2312.15853  [pdf, other

    cs.LG cs.AI

    Curricular and Cyclical Loss for Time Series Learning Strategy

    Authors: Chenxi Sun, Hongyan Li, Moxian Song, Derun Cai, Shenda Hong

    Abstract: Time series widely exists in real-world applications and many deep learning models have performed well on it. Current research has shown the importance of learning strategy for models, suggesting that the benefit is the order and size of learning samples. However, no effective strategy has been proposed for time series due to its abstract and dynamic construction. Meanwhile, the existing one-shot… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 23 pages, 5 figures

  42. arXiv:2312.15156  [pdf, other

    cs.CL

    Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study

    Authors: Mingyang Song, Xuelian Geng, Songfang Yao, Shilong Lu, Yi Feng, Li** **g

    Abstract: Zero-shot keyphrase extraction aims to build a keyphrase extractor without training by human-annotated data, which is challenging due to the limited human intervention involved. Challenging but worthwhile, zero-shot setting efficiently reduces the time and effort that data labeling takes. Recent efforts on pre-trained large language models (e.g., ChatGPT and ChatGLM) show promising performance on… ▽ More

    Submitted 10 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Technical Report, 6 pages

  43. arXiv:2312.10698  [pdf, other

    cs.CR

    HE-DKSAP: Privacy-Preserving Stealth Address Protocol via Additively Homomorphic Encryption

    Authors: Yu** Yan, George Shao, Dennis Song, Mason Song, Yaochu **

    Abstract: Blockchain transactions have gained widespread adoption across various industries, largely attributable to their unparalleled transparency and robust security features. Nevertheless, this technique introduces various privacy concerns, including pseudonymity, Sybil attacks, and potential susceptibilities to quantum computing, to name a few. In response to these challenges, innovative privacy-enhanc… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  44. arXiv:2312.08916  [pdf, other

    cs.CV

    Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation

    Authors: **gxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song

    Abstract: Compared to conventional semantic segmentation with pixel-level supervision, Weakly Supervised Semantic Segmentation (WSSS) with image-level labels poses the challenge that it always focuses on the most discriminative regions, resulting in a disparity between fully supervised conditions. A typical manifestation is the diminished precision on the object boundaries, leading to a deteriorated accurac… ▽ More

    Submitted 17 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  45. arXiv:2312.04168  [pdf, other

    cs.CV cs.AI cs.LG

    Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation

    Authors: Jiawei Fan, Chao Li, Xiaolong Liu, Meina Song, Anbang Yao

    Abstract: In recent years, knowledge distillation methods based on contrastive learning have achieved promising results on image classification and object detection tasks. However, in this line of research, we note that less attention is paid to semantic segmentation. Existing methods heavily rely on data augmentation and memory buffer, which entail high computational resource demands when applying them to… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: The paper of Af-DCD is accepted to NeurIPS 2023. Code and models are available at https://github.com/OSVAI/Af-DCD

  46. arXiv:2312.03490  [pdf, other

    eess.IV cs.CV

    PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis

    Authors: Meiyue Song, Zhihua Yu, Jiaxin Wang, Jiarui Wang, Yuting Lu, Baicun Li, Xiaoxu Wang, Qinghua Huang, Zhijun Li, Nikolaos I. Kanellakis, Jiangfeng Liu, **g Wang, Binglu Wang, Juntao Yang

    Abstract: The conventional pretraining-and-finetuning paradigm, while effective for common diseases with ample data, faces challenges in diagnosing data-scarce occupational diseases like pneumoconiosis. Recently, large language models (LLMs) have exhibits unprecedented ability when conducting multiple tasks in dialogue, bringing opportunities to diagnosis. A common strategy might involve using adapter layer… ▽ More

    Submitted 28 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Medical Image Analysis

  47. Rethinking Object Saliency Ranking: A Novel Whole-flow Processing Paradigm

    Authors: Mengke Song, Linfeng Li, Dunquan Wu, Wenfeng Song, Chenglizhao Chen

    Abstract: Existing salient object detection methods are capable of predicting binary maps that highlight visually salient regions. However, these methods are limited in their ability to differentiate the relative importance of multiple objects and the relationships among them, which can lead to errors and reduced accuracy in downstream tasks that depend on the relative importance of multiple objects. To con… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages, 14 figures, accepted by IEEE Transactions on Image Processing

  48. arXiv:2311.16807  [pdf, other

    cs.AI

    Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

    Authors: Yaoquan Wei, Shunyu Liu, Jie Song, Tongya Zheng, Kaixuan Chen, Yong Wang, Mingli Song

    Abstract: Action advising endeavors to leverage supplementary guidance from expert teachers to alleviate the issue of sampling inefficiency in Deep Reinforcement Learning (DRL). Previous agent-specific action advising methods are hindered by imperfections in the agent itself, while agent-agnostic approaches exhibit limited adaptability to the learning agent. In this study, we propose a novel framework calle… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  49. arXiv:2311.14496  [pdf, other

    cs.CR

    RTPS Attack Dataset Description

    Authors: Dong Young Kim, Dongsung Kim, Yuchan Song, Gang Min Kim, Min Geun Song, Jeong Do Yoo, Huy Kang Kim

    Abstract: This paper explains all about our RTPS datasets. We collect malicious/benign packet data by injecting attack data in an Unmanned Ground Vehicle (UGV) in the normal state. We assembled the testbed, consisting of UGV, Controller, PC, and Router. We collect this dataset in the UGV part of our testbed. We conducted two types of attack "Command Injection" and "Command Injection with ARP Spoofing" on… ▽ More

    Submitted 2 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: This manuscript is written in Korean. You can download our dataset through our lab: https://ocslab.hksecurity.net/Datasets/rtps-attack-dataset We welcome your comments or feedback. Contact INFO: Dong Young Kim ([email protected]), Huy Kang Kim ([email protected])

  50. arXiv:2311.14342  [pdf, other

    cs.CR

    AI-based Attack Graph Generation

    Authors: Sangbeom Park, Jaesung Lee, Jeong Do Yoo, Min Geun Song, Hyosun Lee, Jaewoong Choi, Chaeyeon Sagong, Huy Kang Kim

    Abstract: With the advancement of IoT technology, many electronic devices are interconnected through networks, communicating with each other and performing specific roles. However, as numerous devices join networks, the threat of cyberattacks also escalates. Preventing and detecting cyber threats are crucial, and one method of preventing such threats involves using attack graphs. Attack graphs are widely us… ▽ More

    Submitted 27 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: in Korean Language, 8 Figures, 14 Pages