Skip to main content

Showing 1–50 of 604 results for author: Na

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01330  [pdf, other

    cs.CV

    Learning Unsigned Distance Fields from Local Shape Functions for 3D Surface Reconstruction

    Authors: Jiangbei Hu, Yanggeng Li, Fei Hou, Junhui Hou, Zhebin Zhang, Shengfa Wang, Na Lei, Ying He

    Abstract: Unsigned distance fields (UDFs) provide a versatile framework for representing a diverse array of 3D shapes, encompassing both watertight and non-watertight geometries. Traditional UDF learning methods typically require extensive training on large datasets of 3D shapes, which is costly and often necessitates hyperparameter adjustments for new datasets. This paper presents a novel neural framework,… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 14 pages, 11 figures

    ACM Class: I.3.5

  2. arXiv:2406.18815  [pdf, other

    cs.LG

    MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation

    Authors: Sanggeon Yun, Ryozo Masukawa, Minhyoung Na, Mohsen Imani

    Abstract: In the context of escalating safety concerns across various domains, the tasks of Video Anomaly Detection (VAD) and Video Anomaly Recognition (VAR) have emerged as critically important for applications in intelligent surveillance, evidence investigation, violence alerting, etc. These tasks, aimed at identifying and classifying deviations from normal behavior in video data, face significant challen… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.17923  [pdf, other

    cs.CL

    PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

    Authors: Shiva Kumar Pentyala, Zhichao Wang, Bin Bi, Kiran Ramnath, Xiang-Bo Mao, Regunathan Radhakrishnan, Sitaram Asur, Na, Cheng

    Abstract: Large language models (LLMs) have shown remarkable abilities in diverse natural language processing (NLP) tasks. The LLMs generally undergo supervised fine-tuning (SFT) followed by preference alignment to be usable in downstream applications. However, this sequential training pipeline leads to alignment tax that degrades the LLM performance. This paper introduces PAFT, a new PArallel training pa… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.16560  [pdf

    cs.SI physics.soc-ph

    GNNTAL:A Novel Model for Identifying Critical Nodes in Complex Networks

    Authors: Hao Wang, Ting Luo, Shuang-** Yang, Ming **g, Jian Wang, Na Zhao

    Abstract: Identification of critical nodes is a prominent topic in the study of complex networks. Numerous methods have been proposed, yet most exhibit inherent limitations. Traditional approaches primarily analyze specific structural features of the network; however, node influence is typically the result of a combination of multiple factors. Machine learning-based methods struggle to effectively represent… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  5. arXiv:2406.11311  [pdf, other

    cs.CV

    Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection

    Authors: Yunsong Wang, Na Zhao, Gim Hee Lee

    Abstract: The use of synthetic data in indoor 3D object detection offers the potential of greatly reducing the manual labor involved in 3D annotations and training effective zero-shot detectors. However, the complicated domain shifts across syn-to-real indoor datasets remains underexplored. In this paper, we propose a novel Object-wise Hierarchical Domain Alignment (OHDA) framework for syn-to-real unsupervi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.11283  [pdf, other

    cs.CV

    Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding

    Authors: Yunsong Wang, Na Zhao, Gim Hee Lee

    Abstract: The field of self-supervised 3D representation learning has emerged as a promising solution to alleviate the challenge presented by the scarcity of extensive, well-annotated datasets. However, it continues to be hindered by the lack of diverse, large-scale, real-world 3D scene datasets for source data. To address this shortfall, we propose Generalizable Representation Learning (GRL), where we devi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.09946  [pdf, other

    cs.LG eess.SY

    Finite-Time Analysis of Simultaneous Double Q-learning

    Authors: Hyunjun Na, Donghwan Lee

    Abstract: $Q$-learning is one of the most fundamental reinforcement learning (RL) algorithms. Despite its widespread success in various applications, it is prone to overestimation bias in the $Q$-learning update. To address this issue, double $Q$-learning employs two independent $Q$-estimators which are randomly selected and updated during the learning process. This paper proposes a modified double $Q… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures

  8. arXiv:2406.08844  [pdf, other

    cs.GT math.OC

    Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework

    Authors: Runyu Zhang, Jeff Shamma, Na Li

    Abstract: While there are numerous works in multi-agent reinforcement learning (MARL), most of them focus on designing algorithms and proving convergence to a Nash equilibrium (NE) or other equilibrium such as coarse correlated equilibrium. However, NEs can be non-unique and their performance varies drastically. Thus, it is important to design algorithms that converge to Nash equilibrium with better rewards… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  9. arXiv:2406.08152  [pdf, other

    cs.CV

    CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer

    Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jie** Ye

    Abstract: The field of 3D object detection from point clouds is rapidly advancing in computer vision, aiming to accurately and efficiently detect and localize objects in three-dimensional space. Current 3D detectors commonly fall short in terms of flexibility and scalability, with ample room for advancements in performance. In this paper, our objective is to address these limitations by introducing two fram… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  10. arXiv:2406.06744  [pdf

    cs.LG cs.CR eess.SY

    A Multi-module Robust Method for Transient Stability Assessment against False Label Injection Cyberattacks

    Authors: Hanxuan Wang, Na Lu, Yinhong Liu, Zhuqing Wang, Zixuan Wang

    Abstract: The success of deep learning in transient stability assessment (TSA) heavily relies on high-quality training data. However, the label information in TSA datasets is vulnerable to contamination through false label injection (FLI) cyberattacks, resulting in degraded performance of deep TSA models. To address this challenge, a Multi-Module Robust TSA method (MMR) is proposed to rectify the supervised… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  11. arXiv:2406.05325  [pdf, other

    eess.AS cs.SD

    LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance

    Authors: Shihao Chen, Yu Gu, Jie Zhang, Na Li, Rilin Chen, Li** Chen, Lirong Dai

    Abstract: Any-to-any singing voice conversion (SVC) is an interesting audio editing technique, aiming to convert the singing voice of one singer into that of another, given only a few seconds of singing data. However, during the conversion process, the issue of timbre leakage is inevitable: the converted singing voice still sounds like the original singer's voice. To tackle this, we propose a latent diffusi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  12. arXiv:2406.04690  [pdf, other

    cs.LG stat.ML

    Higher-order Structure Based Anomaly Detection on Attributed Networks

    Authors: Xu Yuan, Na Zhou, Shuo Yu, Huafei Huang, Zhikui Chen, Feng Xia

    Abstract: Anomaly detection (such as telecom fraud detection and medical image detection) has attracted the increasing attention of people. The complex interaction between multiple entities widely exists in the network, which can reflect specific human behavior patterns. Such patterns can be modeled by higher-order network structures, thus benefiting anomaly detection on attributed networks. However, due to… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  13. arXiv:2406.04115  [pdf, other

    cs.CV cs.GR

    Global Parameterization-based Texture Space Optimization

    Authors: Wei Chen, Yuxue Ren, Na Lei, Zhongxuan Luo, Xianfeng Gu

    Abstract: Texture map** is a common technology in the area of computer graphics, it maps the 3D surface space onto the 2D texture space. However, the loose texture space will reduce the efficiency of data storage and GPU memory addressing in the rendering process. Many of the existing methods focus on repacking given textures, but they still suffer from high computational cost and hardly produce a wholly… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint submitted to Comput. Math. Math. Phys

  14. arXiv:2406.03274  [pdf, other

    eess.AS cs.AI cs.SD

    Enhancing CTC-based speech recognition with diverse modeling units

    Authors: Shiyi Han, Zhihong Lei, Mingbin Xu, Xingyu Na, Zhen Huang

    Abstract: In recent years, the evolution of end-to-end (E2E) automatic speech recognition (ASR) models has been remarkable, largely due to advances in deep learning architectures like transformer. On top of E2E systems, researchers have achieved substantial accuracy improvement by rescoring E2E model's N-best hypotheses with a phoneme-based model. This raises an interesting question about where the improvem… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  15. arXiv:2405.20649  [pdf, other

    cs.CL cs.LG

    Reward-based Input Construction for Cross-document Relation Extraction

    Authors: Byeonghu Na, Suhyeon Jo, Yeongmin Kim, Il-Chul Moon

    Abstract: Relation extraction (RE) is a fundamental task in natural language processing, aiming to identify relations between target entities in text. While many RE methods are designed for a single sentence or document, cross-document RE has emerged to address relations across multiple long documents. Given the nature of long documents in cross-document RE, extracting document embeddings is challenging due… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 main conference

  16. arXiv:2405.20216  [pdf, other

    cs.CV cs.AI cs.LG

    Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback

    Authors: Sanghyeon Na, Yonggyu Kim, Hyunjoon Lee

    Abstract: The generation of high-quality human images through text-to-image (T2I) methods is a significant yet challenging task. Distinct from general image generation, human image synthesis must satisfy stringent criteria related to human pose, anatomy, and alignment with textual prompts, making it particularly difficult to achieve realistic results. Recent advancements in T2I generation based on diffusion… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 28 pages, 18 figures

  17. arXiv:2405.19998  [pdf, other

    cs.MA

    LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning

    Authors: Hyungho Na, Il-chul Moon

    Abstract: In cooperative multi-agent reinforcement learning (MARL), agents collaborate to achieve common goals, such as defeating enemies and scoring a goal. However, learning goal-reaching paths toward such a semantic goal takes a considerable amount of time in complex tasks and the trained model often fails to find such paths. To address this, we present LAtent Goal-guided Multi-Agent reinforcement learni… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  18. arXiv:2405.18003  [pdf, other

    cs.CV cs.AI

    MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling

    Authors: Bowen Zhang, Xiaofei Xie, Haotian Lu, Na Ma, Tianlin Li, Qing Guo

    Abstract: Diffusion-based video generation has achieved significant progress, yet generating multiple actions that occur sequentially remains a formidable task. Directly generating a video with sequential actions can be extremely challenging due to the scarcity of fine-grained action annotations and the difficulty in establishing temporal semantic correspondences and maintaining long-term consistency. To ta… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  19. arXiv:2405.17880  [pdf, other

    cs.LG

    Diffusion Rejection Sampling

    Authors: Byeonghu Na, Yeongmin Kim, Minsang Park, Donghyeok Shin, Wanmo Kang, Il-Chul Moon

    Abstract: Recent advances in powerful pre-trained diffusion models encourage the development of methods to improve the sampling performance under well-trained diffusion models. This paper introduces Diffusion Rejection Sampling (DiffRS), which uses a rejection sampling scheme that aligns the sampling transition kernels with the true ones at each timestep. The proposed method can be viewed as a mechanism tha… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  20. arXiv:2405.17111  [pdf, other

    cs.LG

    Diffusion Bridge AutoEncoders for Unsupervised Representation Learning

    Authors: Yeongmin Kim, Kwanghyeon Lee, Minsang Park, Byeonghu Na, Il-Chul Moon

    Abstract: Diffusion-based representation learning has achieved substantial attention due to its promising capabilities in latent representation and sample generation. Recent studies have employed an auxiliary encoder to identify a corresponding representation from a sample and to adjust the dimensionality of a latent variable z. Meanwhile, this auxiliary structure invokes information split problem because t… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  21. arXiv:2405.17109  [pdf, other

    cs.LO

    Left-Linear Completion with AC Axioms

    Authors: Johannes Niederhauser, Nao Hirokawa, Aart Middeldorp

    Abstract: We revisit completion modulo equational theories for left-linear term rewrite systems where unification modulo the theory is avoided and the normal rewrite relation can be used in order to decide validity questions. To that end, we give a new correctness proof for finite runs and establish a simulation result between the two inference systems known from the literature. Given a concrete reduction o… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  22. arXiv:2405.16516  [pdf, other

    eess.IV cs.CV

    Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models

    Authors: Kun Huang, Xiao Ma, Yuhan Zhang, Na Su, Songtao Yuan, Yong Liu, Qiang Chen, Huazhu Fu

    Abstract: Optical coherence tomography (OCT) image analysis plays an important role in the field of ophthalmology. Current successful analysis models rely on available large datasets, which can be challenging to be obtained for certain tasks. The use of deep generative models to create realistic data emerges as a promising approach. However, due to limitations in hardware resources, it is still difficulty t… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Provisionally accepted for medical image computing and computer-assisted intervention (MICCAI) 2024

  23. arXiv:2405.16099  [pdf, other

    cs.CV

    Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation

    Authors: Huizhou Chen, Jiangyi Wang, Yuxin Li, Na Zhao, Jun Cheng, Xulei Yang

    Abstract: 3D environment recognition is essential for autonomous driving systems, as autonomous vehicles require a comprehensive understanding of surrounding scenes. Recently, the predominant approach to define this real-life problem is through 3D occupancy prediction. It attempts to predict the occupancy states and semantic labels for all voxels in 3D space, which enhances the perception capability. Birds-… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures, accepted by IEEE CAI 2024

  24. arXiv:2405.09045  [pdf, other

    cs.CV

    AMSNet: Netlist Dataset for AMS Circuits

    Authors: Zhuofu Tao, Yichen Shi, Yiru Huo, Rui Ye, Zonghang Li, Li Huang, Chen Wu, Na Bai, Zhi** Yu, Ting-Jung Lin, Lei He

    Abstract: Today's analog/mixed-signal (AMS) integrated circuit (IC) designs demand substantial manual intervention. The advent of multimodal large language models (MLLMs) has unveiled significant potential across various fields, suggesting their applicability in streamlining large-scale AMS IC design as well. A bottleneck in employing MLLMs for automatic AMS circuit generation is the absence of a comprehens… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  25. arXiv:2405.08852  [pdf, other

    cs.LG cs.AI cs.IR

    A Click-Through Rate Prediction Method Based on Cross-Importance of Multi-Order Features

    Authors: Hao Wang, Nao Li

    Abstract: Most current click-through rate prediction(CTR)models create explicit or implicit high-order feature crosses through Hadamard product or inner product, with little attention to the importance of feature crossing; only few models are either limited to the second-order explicit feature crossing, implicitly to high-order feature crossing, or can learn the importance of high-order explicit feature cro… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  26. arXiv:2405.07845  [pdf, other

    cs.CV

    Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

    Authors: Shulei Qu, Zhenguo Gao, Xiaowei Chen, Na Li, Yakai Wang, Xiaoxiao Wu

    Abstract: In driving scenarios, automobile active safety systems are increasingly incorporating deep learning technology. These systems typically need to handle multiple tasks simultaneously, such as detecting fatigue driving and recognizing the driver's identity. However, the traditional parallel-style approach of combining multiple single-task models tends to waste resources when dealing with similar task… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  27. arXiv:2405.07516  [pdf, other

    cs.CV

    Support-Query Prototype Fusion Network for Few-shot Medical Image Segmentation

    Authors: Xiaoxiao Wu, Zhenguo Gao, Xiaowei Chen, Yakai Wang, Shulei Qu, Na Li

    Abstract: In recent years, deep learning based on Convolutional Neural Networks (CNNs) has achieved remarkable success in many applications. However, their heavy reliance on extensive labeled data and limited generalization ability to unseen classes pose challenges to their suitability for medical image processing tasks. Few-shot learning, which utilizes a small amount of labeled data to generalize to unsee… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 19 pages, 7 figures, 4 tables

  28. arXiv:2405.06089  [pdf, other

    eess.SY cs.IT cs.LG

    Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds

    Authors: Yuyang Zhang, Shahriar Talebi, Na Li

    Abstract: In this paper, we focus on learning a linear time-invariant (LTI) model with low-dimensional latent variables but high-dimensional observations. We provide an algorithm that recovers the high-dimensional features, i.e. column space of the observer, embeds the data into low dimensions and learns the low-dimensional model parameters. Our algorithm enjoys a sample complexity guarantee of order… ▽ More

    Submitted 25 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  29. arXiv:2405.05908  [pdf, other

    physics.plasm-ph cs.AI

    Discovering hidden physics using ML-based multimodal super-resolution measurement and its application to fusion plasmas

    Authors: Azarakhsh Jalalvand, SangKyeun Kim, Jaemin Seo, Qiming Hu, Max Curie, Peter Steiner, Andrew Oakleigh Nelson, Yong-Su Na, Egemen Kolemen

    Abstract: A non-linear complex system governed by multi-spatial and multi-temporal physics scales cannot be fully understood with a single diagnostic, as each provides only a partial view and much information is lost during data extraction. Combining multiple diagnostics also results in imperfect projections of the system's physics. By identifying hidden inter-correlations between diagnostics, we can levera… ▽ More

    Submitted 27 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  30. arXiv:2405.05787  [pdf, other

    cs.RO cs.CV eess.SY

    Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study

    Authors: Tianpeng Zhang, Sekeun Kim, Jerome Charton, Haitong Ma, Kyungsang Kim, Na Li, Quanzheng Li

    Abstract: The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate map** between CT image and robot, and (iii) ta… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  31. arXiv:2405.04844  [pdf, ps, other

    cs.IR

    Full Stage Learning to Rank: A Unified Framework for Multi-Stage Systems

    Authors: Kai Zheng, Haijun Zhao, Rui Huang, Beichuan Zhang, Na Mou, Yanan Niu, Yang Song, Hongning Wang, Kun Gai

    Abstract: The Probability Ranking Principle (PRP) has been considered as the foundational standard in the design of information retrieval (IR) systems. The principle requires an IR module's returned list of results to be ranked with respect to the underlying user interests, so as to maximize the results' utility. Nevertheless, we point out that it is inappropriate to indiscriminately apply PRP through eve… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted by WWW 2024

  32. arXiv:2405.03950  [pdf, other

    cs.LG cs.AI

    Relating-Up: Advancing Graph Neural Networks through Inter-Graph Relationships

    Authors: Qi Zou, Na Yu, Daoliang Zhang, Wei Zhang, Rui Gao

    Abstract: Graph Neural Networks (GNNs) have excelled in learning from graph-structured data, especially in understanding the relationships within a single graph, i.e., intra-graph relationships. Despite their successes, GNNs are limited by neglecting the context of relationships across graphs, i.e., inter-graph relationships. Recognizing the potential to extend this capability, we introduce Relating-Up, a p… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 16 pages, 6 figures, 9 tables

  33. arXiv:2404.08131  [pdf, other

    cs.LG cs.IT stat.ML

    Frame Quantization of Neural Networks

    Authors: Wojciech Czaja, Sanghoon Na

    Abstract: We present a post-training quantization algorithm with error estimates relying on ideas originating from frame theory. Specifically, we use first-order Sigma-Delta ($ΣΔ$) quantization for finite unit-norm tight frames to quantize weight matrices and biases in a neural network. In our scenario, we derive an error bound between the original neural network and the quantized neural network in terms of… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 20 pages, 2 figures

  34. arXiv:2404.08089  [pdf, other

    cs.LG math.OC

    Efficient Duple Perturbation Robustness in Low-rank MDPs

    Authors: Yang Hu, Haitong Ma, Bo Dai, Na Li

    Abstract: The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation. In this paper, we introduce duple perturbation robustness, i.e. perturbation on both the feature and factor vectors for low-rank Markov decision processes (MDPs), via a novel characteriza… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 25 pages, 8 figures, in submission to ICML'24

  35. arXiv:2404.06661  [pdf, other

    cs.CV

    Efficient Denoising using Score Embedding in Score-based Diffusion Models

    Authors: Andrew S. Na, William Gao, Justin W. L. Wan

    Abstract: It is well known that training a denoising score-based diffusion models requires tens of thousands of epochs and a substantial number of image data to train the model. In this paper, we propose to increase the efficiency in training score-based diffusion models. Our method allows us to decrease the number of epochs needed to train the diffusion model. We accomplish this by solving the log-density… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  36. arXiv:2404.06078  [pdf, other

    cs.IR

    End-to-end training of Multimodal Model and ranking Model

    Authors: Xiuqi Deng, Lu Xu, Xiyao Li, **kai Yu, Erpeng Xue, Zhongyuan Wang, Di Zhang, Zhaojie Liu, Guorui Zhou, Yang Song, Na Mou, Shen Jiang, Han Li

    Abstract: Traditional recommender systems heavily rely on ID features, which often encounter challenges related to cold-start and generalization. Modeling pre-extracted content features can mitigate these issues, but is still a suboptimal solution due to the discrepancies between training tasks and model parameters. End-to-end training presents a promising solution for these problems, yet most of the existi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 8 figures

  37. arXiv:2404.05415  [pdf

    cs.CL cs.AI

    Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations

    Authors: Yiming Li, Xueqing Peng, Jianfu Li, Xu Zuo, Suyuan Peng, Donghong Pei, Cui Tao, Hua Xu, Na Hong

    Abstract: In acupuncture therapy, the accurate location of acupoints is essential for its effectiveness. The advanced language understanding capabilities of large language models (LLMs) like Generative Pre-trained Transformers (GPT) present a significant opportunity for extracting relations related to acupoint locations from textual knowledge sources. This study aims to compare the performance of GPT with t… ▽ More

    Submitted 14 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  38. arXiv:2404.05051  [pdf, other

    cs.LG cs.RO

    Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

    Authors: Haitong Ma, Zhaolin Ren, Bo Dai, Na Li

    Abstract: We study sim-to-real skill transfer and discovery in the context of robotics control using representation learning. We draw inspiration from spectral decomposition of Markov decision processes. The spectral decomposition brings about representation that can linearly represent the state-action value function induced by any policies, thus can be regarded as skills. The skill representations are tran… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures. Project page: https://congharvard.github.io/steady-sim-to-real/

  39. arXiv:2404.03819  [pdf, other

    cs.CV

    Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer

    Authors: Qinji Yu, Yirui Wang, Ke Yan, Haoshen Li, Dazhou Guo, Li Zhang, Le Lu, Na Shen, Qifeng Wang, Xiaowei Ding, Xianghua Ye, Dakai **

    Abstract: Lymph node (LN) assessment is a critical, indispensable yet very challenging task in the routine clinical workflow of radiology and oncology. Accurate LN analysis is essential for cancer diagnosis, staging, and treatment planning. Finding scatteredly distributed, low-contrast clinically relevant LNs in 3D CT is difficult even for experienced physicians under high inter-observer variations. Previou… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Technical report

  40. arXiv:2404.02555  [pdf

    eess.SY cs.LG

    An Interpretable Power System Transient Stability Assessment Method with Expert Guiding Neural-Regression-Tree

    Authors: Hanxuan Wang, Na Lu, Zixuan Wang, Jiacheng Liu, Jun Liu

    Abstract: Deep learning based transient stability assessment (TSA) has achieved great success, yet the lack of interpretability hinders its industrial application. Although a great number of studies have tried to explore the interpretability of network solutions, many problems still remain unsolved: (1) the difference between the widely accepted power system knowledge and the generated interpretive rules is… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  41. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  42. arXiv:2404.01104  [pdf, other

    cs.CL

    SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity

    Authors: Jaemin Kim, Yohan Na, Kangmin Kim, Sang Rak Lee, Dong-Kyu Chae

    Abstract: Recently, sentiment-aware pre-trained language models (PLMs) demonstrate impressive results in downstream sentiment analysis tasks. However, they neglect to evaluate the quality of their constructed sentiment representations; they just focus on improving the fine-tuning performance, which overshadows the representation quality. We argue that without guaranteeing the representation quality, their d… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 14 pages, 8 figures

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: LREC-COLING2024

  43. arXiv:2404.00670  [pdf, other

    cs.CV q-bio.QM stat.AP

    Statistical Analysis by Semiparametric Additive Regression and LSTM-FCN Based Hierarchical Classification for Computer Vision Quantification of Parkinsonian Bradykinesia

    Authors: Youngseo Cho, In Hee Kwak, Dohyeon Kim, **hee Na, Hanjoo Sung, Jeongjae Lee, Young Eun Kim, Hyeo-il Ma

    Abstract: Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  44. arXiv:2403.17833  [pdf, other

    cs.LG cs.DC

    GPFL: A Gradient Projection-Based Client Selection Framework for Efficient Federated Learning

    Authors: Shijie Na, Yuzhi Liang, Siu-Ming Yiu

    Abstract: Federated learning client selection is crucial for determining participant clients while balancing model accuracy and communication efficiency. Existing methods have limitations in handling data heterogeneity, computational burdens, and independent client treatment. To address these challenges, we propose GPFL, which measures client value by comparing local and global descent directions. We also e… ▽ More

    Submitted 26 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  45. arXiv:2403.17428  [pdf, other

    cs.AI cs.CL

    Aligning Large Language Models for Enhancing Psychiatric Interviews through Symptom Delineation and Summarization

    Authors: Jae-hee So, Joonhwan Chang, Eunji Kim, Junho Na, JiYeon Choi, Jy-yong Sohn, Byung-Hoon Kim, Sang Hui Chu

    Abstract: Recent advancements in Large Language Models (LLMs) have accelerated their usage in various domains. Given the fact that psychiatric interviews are goal-oriented and structured dialogues between the professional interviewer and the interviewee, it is one of the most underexplored areas where LLMs can contribute substantial value. Here, we explore the use of LLMs for enhancing psychiatric interview… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  46. arXiv:2403.17303  [pdf, other

    cs.CR

    Two Birds with One Stone: Differential Privacy by Low-power SRAM Memory

    Authors: Jianqing Liu, Na Gong, Hritom Das

    Abstract: The software-based implementation of differential privacy mechanisms has been shown to be neither friendly for lightweight devices nor secure against side-channel attacks. In this work, we aim to develop a hardware-based technique to achieve differential privacy by design. In contrary to the conventional software-based noise generation and injection process, our design realizes local differential… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 15 pages, with 2 pages of Appendix

    Journal ref: IEEE Transactions on Dependable and Secure Computing, 2024

  47. arXiv:2403.17216  [pdf, other

    cs.CL

    Ontology Completion with Natural Language Inference and Concept Embeddings: An Analysis

    Authors: Na Li, Thomas Bailleux, Zied Bouraoui, Steven Schockaert

    Abstract: We consider the problem of finding plausible knowledge that is missing from a given ontology, as a generalisation of the well-studied taxonomy expansion task. One line of work treats this task as a Natural Language Inference (NLI) problem, thus relying on the knowledge captured by language models to identify the missing knowledge. Another line of work uses concept embeddings to identify what diffe… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  48. arXiv:2403.16984  [pdf, other

    cs.AI cs.CL

    Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings

    Authors: Hanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert

    Abstract: Concept embeddings offer a practical and efficient mechanism for injecting commonsense knowledge into downstream tasks. Their core purpose is often not to predict the commonsense properties of concepts themselves, but rather to identify commonalities, i.e.\ sets of concepts which share some property of interest. Such commonalities are the basis for inductive generalisation, hence high-quality conc… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  49. arXiv:2403.16008  [pdf, other

    cs.CL

    CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering

    Authors: Hongbin Na

    Abstract: The recent advancements in artificial intelligence highlight the potential of language models in psychological health support. While models trained on data from mental health service platform have achieved preliminary success, challenges persist in areas such as data scarcity, quality, and ensuring a solid foundation in psychological techniques. To address these challenges, this study introduces a… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted at COLING 2024

  50. arXiv:2403.14425  [pdf, other

    cs.LG math.OC

    Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization

    Authors: Daniel Mayfrank, Na Young Ahn, Alexander Mitsos, Manuel Dahmen

    Abstract: We present a method for end-to-end learning of Koopman surrogate models for optimal performance in control. In contrast to previous contributions that employ standard reinforcement learning (RL) algorithms, we use a training algorithm that exploits the potential differentiability of environments based on mechanistic simulation models. We evaluate the performance of our method by comparing it to th… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, 1 table