Showing 51–100 of 902 results for author: Shen, L

  1. arXiv:2404.15598  [pdf, other]

    cs.LG cs.CR

    Federated Learning with Only Positive Labels by Exploring Label Correlations

    Authors: Xuming An, Dui Wang, Li Shen, Yong Luo, Han Hu, Bo Du, Yonggang Wen, Dacheng Tao

    Abstract: Federated learning aims to collaboratively learn a model by using the data from multiple users under privacy constraints. In this paper, we study the multi-label classification problem under the federated learning setting, where a trivial solution and extremely poor performance may be obtained, especially when only positive data w.r.t. a single class label are provided for each client. This issue ca…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: To be published in IEEE Transactions on Neural Networks and Learning Systems

  2. arXiv:2404.14828  [pdf, other]

    cs.IT

    GLDPC-PC Codes: Channel Coding Towards 6G Communications

    Authors: Li Shen, Yongpeng Wu, Yin Xu, Xiaohu You, Xiqi Gao, Wenjun Zhang

    Abstract: The sixth generation (6G) wireless communication system will improve the key technical indicators by one to two orders of magnitude, and come with some new features. As a crucial technique to enhance the reliability and efficiency of data transmission, the next generation channel coding is not only required to satisfy the stringent requirements of 6G, but also expected to be backward compatible to…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE Communications Magazine

  3. arXiv:2404.09125  [pdf]

    physics.app-ph

    Achieving High Yield of Perpendicular SOT-MTJ Manufactured on 300 mm Wafers

    Authors: Wenlong Yang, Zhenghui Ji, Yang Gao, Kaiyuan Zhou, Qijun Guo, Dinggui Zeng, Shasha Wang, Ming Wang, Lijie Shen, Guilin Chen, Yihui Sun, Enlong Liu, Shikun He

    Abstract: The large-scale fabrication of three-terminal magnetic tunnel junctions (MTJs) with high yield is becoming increasingly crucial, especially with the growing interest in spin-orbit torque (SOT) magnetic random access memory (MRAM) as the next generation of MRAM technology. To achieve high yield and consistent device performance in MTJs with perpendicular magnetic anisotropy, an integration flow has…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures

    ACM Class: J.2.6

  4. arXiv:2404.06443  [pdf, other]

    cs.CV

    Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

    Authors: Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen

    Abstract: Human facial action units (AUs) are mutually related in a hierarchical manner, as not only are they associated with each other in both spatial and temporal domains, but AUs located in the same/close facial regions also show stronger relationships than those of different facial regions. While no existing approach thoroughly models such hierarchical inter-dependencies among AUs, this paper propos…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  5. arXiv:2404.06258  [pdf]

    cs.CV

    Robust feature knowledge distillation for enhanced performance of lightweight crack segmentation models

    Authors: Zhaohui Chen, Elyas Asadi Shamsabadi, Sheng Jiang, Luming Shen, Daniel Dias-da-Costa

    Abstract: Vision-based crack detection faces deployment challenges due to the size of robust models and edge device limitations. These can be addressed with lightweight models trained with knowledge distillation (KD). However, state-of-the-art (SOTA) KD methods compromise anti-noise robustness. This paper develops Robust Feature Knowledge Distillation (RFKD), a framework to improve robustness while retainin…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 24 pages, 13 figures

  6. arXiv:2404.05253  [pdf, other]

    cs.CV

    CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement

    Authors: Xu Wu, XianXu Hou, Zhihui Lai, Jie Zhou, Ya-nan Zhang, Witold Pedrycz, Linlin Shen

    Abstract: Low-light image enhancement (LLIE) aims to improve low-illumination images. However, existing methods face two challenges: (1) uncertainty in restoration from diverse brightness degradations; (2) loss of texture and color information caused by noise suppression and light enhancement. In this paper, we propose a novel enhancement approach, CodeEnhance, by leveraging quantized priors and image refin…

    Submitted 30 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 10 pages, 13 figures

  7. arXiv:2404.05208  [pdf, other]

    cond-mat.mes-hall quant-ph

    Proximity-Induced Exchange Interaction: a New Pathway for Quantum Sensing using Spin Centers in Hexagonal Boron Nitride

    Authors: Lingnan Shen, Di Xiao, Ting Cao

    Abstract: Defects in hexagonal boron nitride (hBN), a two-dimensional van der Waals material, have attracted wide-ranging interest for their potential in various quantum applications. Due to hBN's 2D nature, spin centers in hBN can be engineered in close proximity to a target material, providing advantages over their 3D counterparts, such as the nitrogen-vacancy (NV) center in diamond. Here we propose a novel quantum sen…

    Submitted 8 April, 2024; originally announced April 2024.

  8. arXiv:2404.03854  [pdf, other]

    cs.LG cs.CL cs.CV

    Align as Ideal: Cross-Modal Alignment Binding for Federated Medical Vision-Language Pre-training

    Authors: Zitao Shuai, Liyue Shen

    Abstract: Vision-language pre-training (VLP) has arisen as an efficient scheme for multimodal representation learning, but it requires large-scale multimodal data for pre-training, which is an obstacle, especially for medical applications. To overcome the data limitation, federated learning (FL) can be a promising strategy to scale up the dataset for medical VLP while protecting data privacy. However, clien…

    Submitted 24 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  9. arXiv:2404.01897  [pdf, other]

    cs.NE cs.AI cs.LG

    Continuous Spiking Graph Neural Networks

    Authors: Nan Yin, Mengzhu Wan, Li Shen, Hitesh Laxmichand Patel, Baopu Li, Bin Gu, Huan Xiong

    Abstract: Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODE). However, the implementation of CGNNs req…

    Submitted 2 April, 2024; originally announced April 2024.

  10. arXiv:2404.01200  [pdf, other]

    stat.ML cs.LG

    Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

    Authors: Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

    Abstract: Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing studies on constrained DRO mostly focus on convex loss functions, and exclude the practical and challenging case of non-convex loss functions, e.g., neural netw…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: We have corrected Theorem 1 in Sec 4 for AAAI 2024 version, where the order of $n_z$ changes from $ε^{-k_*}$ to $ε^{-2k_*-2}$

  11. arXiv:2404.00865  [pdf, other]

    cond-mat.mtrl-sci

    Scaling Crystal Structure Relaxation with a Universal Trustworthy Deep Generative Model

    Authors: Ziduo Yang, Yiming Zhao, Xiaoqing Liu, Xiuying Zhang, Yifan Li, Qiujie Lyu, Calvin Yu-Chian Chen, Lei Shen

    Abstract: The evolution of AI and high-throughput technologies has boosted a rapid increase in the number of new materials, challenging our computational ability to comprehensively analyze their properties. Relaxed crystal structures often serve as the foundational basis for further property calculations. However, determining equilibrium structures traditionally involves computationally expensive iterative…

    Submitted 31 March, 2024; originally announced April 2024.

  12. arXiv:2404.00764  [pdf, ps, other]

    math.OC

    Sparse Recovery: The Square of $\ell_1/\ell_2$ Norms

    Authors: Jianqing Jia, Ashley Prater-Bennette, Lixin Shen, Erin E. Tripp

    Abstract: This paper introduces a nonconvex approach to the problem of recovering sparse signals. We propose a novel model, termed the $τ_2$-model, which utilizes the square of $\ell_1/\ell_2$ norms for sparse recovery. This model is an advancement over the $\ell_0$ norm, which is often computationally intractable and less effective in handling practical scenarios. Our approach is grounded in the concept of…

    Submitted 31 March, 2024; originally announced April 2024.

  13. arXiv:2404.00713  [pdf, ps, other]

    math.OC

    Computing Proximity Operators of Scale and Signed Permutation Invariant Functions

    Authors: Jianqing Jia, Ashley Prater-Bennette, Lixin Shen

    Abstract: This paper investigates the computation of proximity operators for scale and signed permutation invariant functions. A scale-invariant function remains unchanged under uniform scaling, while a signed permutation invariant function retains its structure despite permutations and sign changes applied to its input variables. Noteworthy examples include the $\ell_0$ function and the ratios of…

    Submitted 31 March, 2024; originally announced April 2024.

  14. arXiv:2403.18176  [pdf, other]

    cs.LG cs.GT math.OC

    Mistake, Manipulation and Margin Guarantees in Online Strategic Classification

    Authors: Lingqing Shen, Nam Ho-Nguyen, Khanh-Hung Giang-Tran, Fatma Kılınç-Karzan

    Abstract: We consider an online strategic classification problem where each arriving agent can manipulate their true feature vector to obtain a positive predicted label, while incurring a cost that depends on the amount of manipulation. The learner seeks to predict the agent's true label given access to only the manipulated features. After the learner releases their prediction, the agent's true label is rev…

    Submitted 26 March, 2024; originally announced March 2024.

  15. arXiv:2403.18014  [pdf, ps, other]

    math.AP

    Generalized Chern-Simons-Schrodinger system with critical exponential growth: the zero mass case

    Authors: Liejun Shen, Marco Squassina

    Abstract: We consider the existence of ground state solutions for a class of zero-mass Chern-Simons-Schrödinger systems \[ \left\{ \begin{array}{ll} \displaystyle -Δu +A_0 u+\sum\limits_{j=1}^2A_j^2 u=f(u)-a(x)|u|^{p-2}u, \newline \displaystyle \partial_1A_2-\partial_2A_1=-\frac{1}{2}|u|^2,~\partial_1A_1+\partial_2A_2=0, \newline \displaystyle \partial_1A_0=A_2|u|^2,~ \partial_2A_0=-A_1|u|^2, \end…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 20 pages

    MSC Class: 35J20; 58E50; 35B06

  16. arXiv:2403.17375  [pdf]

    physics.med-ph

    Compensating for charge sharing by a deep-learning method: a preliminary experimental study

    Authors: Shengzi Zhao, Le Shen, Yuxing Xing

    Abstract: Photon counting detectors (PCDs) bring valuable advantages to diagnostic computed tomography (CT), including lower noise and higher resolution than energy integrating detectors. However, there are still several nonideal factors preventing PCDs from meeting people's expectations, for example, charge sharing and pile up. In this paper, we did some preliminary work on charge sharing and conducted an…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 3 pages, 2 figures

  17. arXiv:2403.16578  [pdf, other]

    cs.CV cs.AI

    SegICL: A Multimodal In-context Learning Framework for Enhanced Segmentation in Medical Imaging

    Authors: Lingdong Shen, Fangxin Shang, Xiaoshuang Huang, Yehui Yang, Haifeng Huang, Shiming Xiang

    Abstract: In the field of medical image segmentation, tackling Out-of-Distribution (OOD) segmentation tasks in a cost-effective manner remains a significant challenge. Universal segmentation models, which aim to generalize across the diverse modalities of medical images, are one solution, yet their effectiveness often diminishes when applied to OOD data modalities and tasks, requiring intricate fine-tuning of mod…

    Submitted 29 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  18. arXiv:2403.16050  [pdf, other]

    cs.CV

    Heterogeneous Federated Learning with Splited Language Model

    Authors: Yifan Shi, Yuhui Zhang, Ziyue Huang, Xiaofeng Yang, Li Shen, Wei Chen, Xueqian Wang

    Abstract: Federated Split Learning (FSL) is a promising distributed learning paradigm in practice, which gathers the strengths of both Federated Learning (FL) and Split Learning (SL) paradigms, to ensure model privacy while diminishing the resource overhead of each client, especially on large transformer models in a resource-constrained environment, e.g., Internet of Things (IoT). However, almost all works…

    Submitted 19 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  19. arXiv:2403.14399  [pdf, other]

    cs.CL cs.AI

    Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning

    Authors: Changtong Zan, Liang Ding, Li Shen, Yibing Zhen, Weifeng Liu, Dacheng Tao

    Abstract: Translation-tailored large language models (LLMs) exhibit remarkable translation capabilities, even competing with supervised-trained commercial translation systems. However, off-target translation remains an unsolved problem, especially for low-resource languages, hindering us from developing accurate LLM-based translation models. To mitigate the off-target translation problem and enhance the pe…

    Submitted 21 March, 2024; originally announced March 2024.

  20. arXiv:2403.13249  [pdf, ps, other]

    cs.LG cs.AI cs.CV

    A Unified and General Framework for Continual Learning

    Authors: Zhenyi Wang, Yan Li, Li Shen, Heng Huang

    Abstract: Continual Learning (CL) focuses on learning from dynamic and changing data distributions while retaining previously acquired knowledge. Various methods have been developed to address the challenge of catastrophic forgetting, including regularization-based, Bayesian-based, and memory-replay-based techniques. However, these methods lack a unified framework and common terminology for describing their…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  21. arXiv:2403.11384  [pdf, other]

    cs.HC cs.RO

    Towards Massive Interaction with Generalist Robotics: A Systematic Review of XR-enabled Remote Human-Robot Interaction Systems

    Authors: Xian Wang, Luyao Shen, Lik-Hang Lee

    Abstract: The rising interest in generalist robots seeks to create versatile robots that can handle multiple tasks in a variety of environments, and humans will interact with such robots through immersive interfaces. In the context of human-robot interaction (HRI), this survey provides an exhaustive review of the applications of extended reality (XR) technologies in the field of remote HRI. We developed a sy…

    Submitted 26 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  22. arXiv:2403.10103  [pdf, other]

    cs.CV

    DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video

    Authors: Huiqiang Sun, Xingyi Li, Liao Shen, Xinyi Ye, Ke Xian, Zhiguo Cao

    Abstract: Recent advancements in dynamic neural radiance field methods have yielded remarkable outcomes. However, these approaches rely on the assumption of sharp input images. When faced with motion blur, existing dynamic NeRF methods often struggle to generate high-quality novel views. In this paper, we propose DyBluRF, a dynamic radiance field approach that synthesizes sharp novel views from a monocular…

    Submitted 19 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024. Project page: https://huiqiang-sun.github.io/dyblurf/

  23. arXiv:2403.09366  [pdf, ps, other]

    math.AP

    Existence and concentration of normalized solutions for $p$-Laplacian equations with logarithmic nonlinearity

    Authors: Liejun Shen, Marco Squassina

    Abstract: We investigate the existence and concentration of normalized solutions for a $p$-Laplacian problem with logarithmic nonlinearity of type \[ \left\{ \begin{array}{ll} \displaystyle -\varepsilon^pΔ_p u+V(x)|u|^{p-2}u=λ|u|^{p-2}u+|u|^{p-2}u\log|u|^p ~\text{in}~\mathbb R^N,\newline \displaystyle \int_{\mathbb R^N}|u|^pdx=a^p\varepsilon^N, \end{array} \right. \] where $a,\varepsilon> 0$,…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 35 pages

    MSC Class: 35J10; 35J20; 35B06

  24. arXiv:2403.07289  [pdf, other]

    cs.CV

    Rediscovering BCE Loss for Uniform Classification

    Authors: Qiufu Li, Xi Jia, Jiancan Zhou, Linlin Shen, Jinming Duan

    Abstract: This paper introduces the concept of uniform classification, which employs a unified threshold to classify all samples rather than an adaptive threshold for each individual sample. We also propose the uniform classification accuracy as a metric to measure the model's performance in uniform classification. Furthermore, beginning with a naive loss, we mathematically derive a loss function suitable…

    Submitted 11 March, 2024; originally announced March 2024.

  25. arXiv:2403.05433  [pdf, other]

    cs.CV

    Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation

    Authors: Chenhui Zhao, Liyue Shen

    Abstract: Precision medicine, such as patient-adaptive treatments utilizing medical images, poses new challenges for image segmentation algorithms due to (1) the large variability across different patients and (2) the limited availability of annotated data for each patient. In this work, we propose a data-efficient segmentation method to address these challenges, namely Part-aware Personalized Segment Anyth…

    Submitted 8 March, 2024; originally announced March 2024.

  26. arXiv:2402.15983  [pdf]

    cond-mat.mtrl-sci

    Topological skyrmions in monolayer multiferroic MoPtGe2S6

    Authors: Zuxin Fu, Kuanrong Hao, Min Guo, Jingjing He, Xiaohong Yan, Yangbo Zhou, Lei Shen, Jiaren Yuan

    Abstract: Two-dimensional (2D) multiferroic materials with coexisting ferroelectricity and ferromagnetism have garnered substantial attention for their intriguing physical properties and diverse promising applications in spintronics. For example, multiferroic materials with electronically controlled broken central symmetry provide a versatile platform for designing and manipulating topological skyrmions and…

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 24 pages, 7 figures

  27. Fingerprint Presentation Attack Detector Using Global-Local Model

    Authors: Haozhe Liu, Wentian Zhang, Feng Liu, Haoqian Wu, Linlin Shen

    Abstract: The vulnerability of automated fingerprint recognition systems (AFRSs) to presentation attacks (PAs) promotes the vigorous development of PA detection (PAD) technology. However, PAD methods have been limited by information loss and poor generalization ability, resulting in new PA materials and fingerprint sensors. This paper thus proposes a global-local model-based PAD (RTK-PAD) method to overcome…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper was accepted by IEEE Transactions on Cybernetics. Current version is updated with minor revisions on introduction and related works

    Journal ref: IEEE TRANSACTIONS ON CYBERNETICS, VOL. 52, NO. 11, 12315-12328, November 2022

  28. arXiv:2402.12370  [pdf, other]

    cs.CL cs.AI

    AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

    Authors: Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Tiyyala, Nicholas Andrews, Daniel Khashabi

    Abstract: Humans regularly engage in analogical thinking, relating personal experiences to current situations ($X$ is analogous to $Y$ because of $Z$). Analogical thinking allows humans to solve problems in creative ways, grasp difficult concepts, and articulate ideas more effectively. Can language models (LMs) do the same? To answer this question, we propose ANALOBENCH, a benchmark to determine analogical…

    Submitted 19 February, 2024; originally announced February 2024.

  29. arXiv:2402.12006  [pdf, other]

    math.AP

    Infinitely many solutions for a class of fractional Schrodinger equations coupled with neutral scalar field

    Authors: Liejun Shen, Marco Squassina, Xiaoyu Zeng

    Abstract: We study the fractional Schrödinger equations coupled with a neutral scalar field $$ (-Δ)^s u+V(x)u=K(x)φu +g(x)|u|^{q-2}u, \quad x\in \mathbb{R}^3,\qquad (I-Δ)^t φ=K(x)u^2, \quad x\in \mathbb{R}^3, $$ where $(-Δ)^s$ and $(I-Δ)^t$ denote the fractional Laplacian and Bessel operators with $\frac{3}{4} <s<1$ and $0<t<1$, respectively. Under some suitable assumptions for the external potentia…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 15 pages

    MSC Class: 35J60; 35Q55; 53C35

  30. arXiv:2402.11890  [pdf, other]

    cs.CL

    Revisiting Knowledge Distillation for Autoregressive Language Models

    Authors: Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: Knowledge distillation (KD) is a common approach to compress a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, in the context of autoregressive language models (LMs), we empirically find that larger teacher LMs might dramatically result in a poorer student. In response to this problem, we conduct a series of analyses and reveal that di…

    Submitted 16 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL2024 Main Conference

  31. arXiv:2402.11857  [pdf, other]

    cs.LG cs.DC

    Communication-Efficient Distributed Learning with Local Immediate Error Compensation

    Authors: Yifei Cheng, Li Shen, Linli Xu, Xun Qian, Shiwei Wu, Yiming Zhou, Tie Zhang, Dacheng Tao, Enhong Chen

    Abstract: Gradient compression with error compensation has attracted significant attention with the target of reducing the heavy communication overhead in distributed learning. However, existing compression methods either perform only unidirectional compression in one iteration with higher communication cost, or bidirectional compression with slower convergence rate. In this work, we propose the Local Immed…

    Submitted 19 February, 2024; originally announced February 2024.

  32. arXiv:2402.11217  [pdf, other]

    cs.CL cs.CV

    Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

    Authors: Wenxuan Wang, Yihang Su, Jingyuan Huan, Jie Liu, Wenting Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, Michael R. Lyu

    Abstract: The significant breakthroughs of Medical Multi-Modal Large Language Models (Med-MLLMs) renovate modern healthcare with robust information synthesis and medical decision support. However, these models are often evaluated on benchmarks that are unsuitable for the Med-MLLMs due to the intricate nature of the real-world diagnostic frameworks, which encompass diverse medical specialties and involve com…

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 20 pages, 15 figures

  33. arXiv:2402.07610  [pdf, other]

    cs.CL cs.AI

    Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

    Authors: Haoyu Wang, Guozheng Ma, Ziqiao Meng, Zeyu Qin, Li Shen, Zhong Zhang, Bingzhe Wu, Liu Liu, Yatao Bian, Tingyang Xu, Xueqian Wang, Peilin Zhao

    Abstract: Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a single round, which may overlook the continuously improving ability of self-aligned models. This gives rise to a key query: What if we do multi-time bootstrapping self-alignment? Does this strategy en…

    Submitted 27 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  34. arXiv:2402.05773  [pdf, other]

    cs.CV

    UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial Imagery

    Authors: Wenhui Chang, Hongming Chen, Xin He, Xiang Chen, Liangduo Shen

    Abstract: Raindrops adhering to the lens of UAVs can obstruct visibility of the background scene and degrade image quality. Despite recent progress in image deraining methods and datasets, there is a lack of focus on raindrop removal from UAV aerial imagery due to the unique challenges posed by varying angles and rapid movement during drone flight. To fill the gap in this research, we first construct a new…

    Submitted 12 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024

  35. arXiv:2402.03951  [pdf, other]

    cs.CV cs.AI

    Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping

    Authors: Qinliang Lin, Cheng Luo, Zenghao Niu, Xilin He, Weicheng Xie, Yuanbo Hou, Linlin Shen, Siyang Song

    Abstract: Adversarial examples generated by a surrogate model typically exhibit limited transferability to unknown target systems. To address this problem, many transferability enhancement approaches (e.g., input transformation and model augmentation) have been proposed. However, they show poor performances in attacking systems having different model genera from the surrogate model. In this paper, we propos…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: AAAI 2024

  36. arXiv:2402.02705  [pdf, other]

    cs.LG cs.AI cs.CV

    Representation Surgery for Multi-Task Model Merging

    Authors: Enneng Yang, Li Shen, Zhenyi Wang, Guibing Guo, Xiaojun Chen, Xingwei Wang, Dacheng Tao

    Abstract: Multi-task learning (MTL) compresses the information from multiple tasks into a unified backbone to improve computational efficiency and generalization. Recent work directly merges multiple independently trained models to perform MTL instead of collecting their raw data for joint training, greatly expanding the application scenarios of MTL. However, by visualizing the representation distribution o…

    Submitted 28 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Forty-first International Conference on Machine Learning (ICML 2024)

  37. arXiv:2402.02003  [pdf, other]

    cs.CV

    GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning

    Authors: Yaning Zhang, Zitong Yu, Xiaobin Huang, Linlin Shen, Jianfeng Ren

    Abstract: The rapid advancement of photorealistic generators has reached a critical juncture where the discrepancy between authentic and manipulated images is increasingly indistinguishable. Thus, benchmarking and advancing techniques detecting digital manipulation become an urgent issue. Although there have been a number of publicly available face forgery datasets, the forgery faces are mostly generated us…

    Submitted 2 February, 2024; originally announced February 2024.

  38. arXiv:2402.00433  [pdf, other]

    cs.LG cs.CV

    Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

    Authors: Anke Tang, Li Shen, Yong Luo, Nan Yin, Lefei Zhang, Dacheng Tao

    Abstract: Merging various task-specific Transformer-based models trained on different tasks into a single unified model can execute all the tasks concurrently. Previous methods, exemplified by task arithmetic, have been proven to be both effective and scalable. Existing methods have primarily focused on seeking a static optimal solution within the original model parameter space. A notable challenge is mitig…

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  39. arXiv:2402.00137  [pdf, other]

    cs.LG cs.CV

    Multimodal Neurodegenerative Disease Subtyping Explained by ChatGPT

    Authors: Diego Machado Reyes, Hanqing Chao, Juergen Hahn, Li Shen, Pingkun Yan

    Abstract: Alzheimer's disease (AD) is the most prevalent neurodegenerative disease; yet its currently available treatments are limited to stopping disease progression. Moreover, effectiveness of these treatments is not guaranteed due to the heterogeneity of the disease. Therefore, it is essential to be able to identify the disease subtypes at a very early stage. Current data-driven approaches are able to cl…

    Submitted 31 January, 2024; originally announced February 2024.

  40. arXiv:2401.13212  [pdf, other]

    cs.CV cs.AI cs.LG

    AdCorDA: Classifier Refinement via Adversarial Correction and Domain Adaptation

    Authors: Lulan Shen, Ali Edalati, Brett Meyer, Warren Gross, James J. Clark

    Abstract: This paper describes a simple yet effective technique for refining a pretrained classifier network. The proposed AdCorDA method is based on modification of the training set and making use of the duality between network weights and layer inputs. We call this input space training. The method consists of two stages - adversarial correction followed by domain adaptation. Adversarial correction uses ad…

    Submitted 23 January, 2024; originally announced January 2024.

  41. arXiv:2401.13136  [pdf, other]

    cs.CL cs.AI

    The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

    Authors: Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi

    Abstract: As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research. This paper examines the variations in safety challenges faced by LLMs across different languages and discusses approaches to alleviating such concerns. By comparing how state-of-the-art LLMs respond to the same set of malicious…

    Submitted 23 January, 2024; originally announced January 2024.

  42. arXiv:2401.12402  [pdf, other]

    astro-ph.GA

    Characterizing the Average Interstellar Medium Conditions of Galaxies at $z\sim$ 5.6-9 with UV and Optical Nebular Lines

    Authors: Weida Hu, Casey Papovich, Mark Dickinson, Robert Kennicutt, Lu Shen, Ricardo O. Amorín, Pablo Arrabal Haro, Micaela B. Bagley, Rachana Bhatawdekar, Nikko J. Cleri, Justin W. Cole, Avishai Dekel, Alexander de la Vega, Steven L. Finkelstein, Norman A. Grogin, Nimish P. Hathi, Michaela Hirschmann, Benne W. Holwerda, Taylor A. Hutchison, Intae Jung, Anton M. Koekemoer, Jeyhan S. Kartaltepe, Ray A. Lucas, Mario Llerena, S. Mascia, et al. (8 additional authors not shown)

    Abstract: Ultraviolet (UV; rest-frame $\sim1200-2000$ Å) spectra provide a wealth of diagnostics to characterize fundamental galaxy properties, such as their chemical enrichment, the nature of their stellar populations, and their amount of Lyman-continuum (LyC) radiation. In this work, we leverage publicly released JWST data to construct the rest-frame UV-to-optical composite spectrum of a sample of 63 gala…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 21 pages, 7 figures, 4 tables. Submitted. Comments are welcome

  43. arXiv:2401.12014  [pdf, other]

    cs.LG cs.AI cs.CV

    Robustness to distribution shifts of compressed networks for edge devices

    Authors: Lulan Shen, Ali Edalati, Brett Meyer, Warren Gross, James J. Clark

    Abstract: It is necessary to develop efficient DNNs deployed on edge devices with limited computation resources. However, the compressed networks often execute new tasks in the target domain, which is different from the source domain where the original network is trained. It is important to investigate the robustness of compressed networks in two types of data distribution shifts: domain shifts and adversar…

    Submitted 22 January, 2024; originally announced January 2024.

  44. arXiv:2401.10663  [pdf, ps, other]

    math.AP

    Planar Schrödinger-Poisson system with steep potential well: supercritical exponential case

    Authors: Liejun Shen, Marco Squassina

    Abstract: We study a class of planar Schrödinger-Poisson systems $$ -\Delta u+\lambda V(x)u+\phi u=f(u), \quad x\in{\mathbb R}^2,\qquad \Delta\phi=u^2, \quad x\in{\mathbb R}^2, $$ where $\lambda>0$ is a parameter, $V\in C({\mathbb R}^2,{\mathbb R}^+)$ has a potential well $\Omega\triangleq\text{int}\, V^{-1}(0)$ and the nonlinearity $f$ fulfills the supercritical exponential growth at infinity in the Trudinger-Moser sense. By exploitin…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 33 pages

    MSC Class: 35J15; 35J20; 35B06

  45. arXiv:2401.10021  [pdf, other]

    eess.SP

    Interference Cancellation for UWA Random Access Data Packet Transmission

    Authors: Yuriy Zakharov, Lu Shen, Benjamin Henson, Nils Morozs, Paul D. Mitchell

    Abstract: In underwater acoustic (UWA) random access communication networks with multiple users and data packet transmissions, the packet collisions are the main cause of the network performance degradation. The aim of this paper is to investigate interference cancellation (IC) techniques capable of resolving such collisions in a low-complexity modem with single-carrier modulation and single transducer. Mor…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 13 pages, 13 figures

  46. arXiv:2401.09951  [pdf, other]

    eess.SP

    Performance Evaluation of a Full-Duplex UWA System in Lake Experiments

    Authors: Lu Shen, Benjamin Henson, Long Shi, Yuriy Zakharov

    Abstract: In this work we present a full-duplex (FD) underwater acoustic (UWA) communication system simultaneously transmitting and receiving acoustic signals in the same frequency bandwidth. To simplify the FD hardware, the system exploits a recently designed transducer capable of simultaneously transmitting and receiving signals. The key challenge of implementing an FD system is to cancel at the near-end…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 9 pages, 15 figures

  47. arXiv:2401.09883  [pdf, other]

    cs.CV

    Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation

    Authors: Songhe Deng, Wei Zhuo, Jinheng Xie, Linlin Shen

    Abstract: Class Activation Map (CAM) has emerged as a popular tool for weakly supervised semantic segmentation (WSSS), allowing the localization of object regions in an image using only image-level labels. However, existing CAM methods suffer from under-activation of target object regions and false-activation of background regions due to the fact that a lack of detailed supervision can hinder the model's ab…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: ACM MM 2023

  48. arXiv:2401.08478  [pdf, other]

    cs.LG cs.AI

    Solving Continual Offline Reinforcement Learning with Decision Transformer

    Authors: Kaixin Huang, Li Shen, Chen Zhao, Chun Yuan, Dacheng Tao

    Abstract: Continuous offline reinforcement learning (CORL) combines continuous and offline reinforcement learning, enabling agents to learn multiple tasks from static datasets without forgetting prior tasks. However, CORL faces challenges in balancing stability and plasticity. Existing methods, employing Actor-Critic structures and experience replay (ER), suffer from distribution shifts, low efficiency, and…

    Submitted 7 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 11 pages, 6 figures

  49. arXiv:2401.08417  [pdf, other]

    cs.CL

    Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

    Authors: Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim

    Abstract: Moderate-sized large language models (LLMs) -- those with 7B or 13B parameters -- exhibit promising machine translation (MT) performance. However, even the top-performing 13B LLM-based translation models, like ALMA, do not match the performance of state-of-the-art conventional encoder-decoder translation models or larger-scale LLMs such as GPT-4. In this study, we bridge this performance gap. We…

    Submitted 2 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at ICML 2024

  50. arXiv:2401.06659  [pdf, other]

    cs.CL

    WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge

    Authors: Wenbin Wang, Liang Ding, Li Shen, Yong Luo, Han Hu, Dacheng Tao

    Abstract: Sentiment analysis is rapidly advancing by utilizing various data modalities (e.g., text, image). However, most previous works relied on superficial information, neglecting the incorporation of contextual world knowledge (e.g., background information derived from but beyond the given image and text pairs) and thereby restricting their ability to achieve better multimodal sentiment analysis (MSA).…

    Submitted 20 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.