Skip to main content

Showing 51–100 of 1,127 results for author: Shi, Z

.
  1. arXiv:2404.12638  [pdf, other

    cs.AI

    Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

    Authors: Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

    Abstract: Cutting planes (cuts) play an important role in solving mixed-integer linear programs (MILPs), which formulate many important real-world applications. Cut selection heavily depends on (P1) which cuts to prefer and (P2) how many cuts to select. Although modern MILP solvers tackle (P1)-(P2) by human-designed heuristics, machine learning carries the potential to learn more effective heuristics. Howev… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.00244

  2. arXiv:2404.09814  [pdf, other

    cs.IT

    A Novel HARQ-CC Assisted SCMA Scheme

    Authors: Man Wang, Zheng Shi, Yunfei Li, Xianda Wu, Weiqiang Tan, Xinrong Ye

    Abstract: This letter proposes a novel hybrid automatic repeat request with chase combining assisted sparse code multiple access (HARQ-CC-SCMA) scheme. Depending on whether the same superimposed packet are retransmitted, synchronous and asynchronous modes are considered for retransmissions. Moreover, factor graph aggregation (FGA) and Log-likelihood ratio combination (LLRC) are proposed for multi-user detec… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  3. arXiv:2404.08211  [pdf, other

    cond-mat.mes-hall cond-mat.quant-gas quant-ph

    Fractal spectrum in twisted bilayer optical lattice

    Authors: Xu-Tao Wan, Chao Gao, Zhe-Yu Shi

    Abstract: The translation symmetry of a lattice is greatly modified when subjected to a perpendicular magnetic field [Zak, Phys. Rev. \textbf{134}, A1602 (1964)]. This change in symmetry can lead to magnetic unit cells that are substantially larger than the original ones. Similarly, the translation properties of a double-layered lattice alters drastically while two monolayers are relatively twisted by a sma… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 6 pages, 3 figures. Comments are welcome!

  4. arXiv:2404.07956  [pdf, other

    cs.LG cs.AI cs.RO eess.SY math.OC

    Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

    Authors: Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

    Abstract: Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Paper accepted by ICML 2024

  5. arXiv:2404.04155  [pdf, other

    cs.CV

    MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector

    Authors: Junbo Li, Keyan Chen, Gengju Tian, Lu Li, Zhenwei Shi

    Abstract: The segmentation and interpretation of the Martian surface play a pivotal role in Mars exploration, providing essential data for the trajectory planning and obstacle avoidance of rovers. However, the complex topography, similar surface features, and the lack of extensive annotated data pose significant challenges to the high-precision semantic segmentation of the Martian surface. To address these… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  6. arXiv:2404.04102  [pdf, other

    cs.LG cs.AI cs.CL

    ROPO: Robust Preference Optimization for Large Language Models

    Authors: Xize Liang, Chao Chen, Shuang Qiu, Jie Wang, Yue Wu, Zhihang Fu, Zhihao Shi, Feng Wu, Jie** Ye

    Abstract: Preference alignment is pivotal for empowering large language models (LLMs) to generate helpful and harmless responses. However, the performance of preference alignment is highly sensitive to the prevalent noise in the preference data. Recent efforts for this problem either marginally alleviate the impact of noise without the ability to actually reduce its presence, or rely on costly teacher LLMs… ▽ More

    Submitted 28 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  7. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  8. arXiv:2404.00938  [pdf, ps, other

    cs.HC cs.CL cs.CV cs.RO

    How Can Large Language Models Enable Better Socially Assistive Human-Robot Interaction: A Brief Survey

    Authors: Zhonghao Shi, Ellen Landrum, Amy O' Connell, Mina Kian, Leticia Pinto-Alva, Kaleen Shrestha, Xiaoyuan Zhu, Maja J Matarić

    Abstract: Socially assistive robots (SARs) have shown great success in providing personalized cognitive-affective support for user populations with special needs such as older adults, children with autism spectrum disorder (ASD), and individuals with mental health challenges. The large body of work on SAR demonstrates its potential to provide at-home support that complements clinic-based interventions deliv… ▽ More

    Submitted 5 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 2 pages, accepted to the Proceedings of the AAAI Symposium Series, 2024

  9. arXiv:2404.00863  [pdf, other

    eess.AS

    Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

    Authors: Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li

    Abstract: Modern speaker recognition system relies on abundant and balanced datasets for classification training. However, diverse defective datasets, such as partially-labelled, small-scale, and imbalanced datasets, are common in real-world applications. Previous works usually studied specific solutions for each scenario from the algorithm perspective. However, the root cause of these problems lies in data… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 5 pages

  10. arXiv:2404.00299  [pdf, other

    cs.CV

    HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment

    Authors: Juze Zhang, **gyan Zhang, Zining Song, Zhanhe Shi, Chengfeng Zhao, Ye Shi, **gyi Yu, Lan Xu, **gya Wang

    Abstract: Humans naturally interact with both others and the surrounding multiple objects, engaging in various social activities. However, recent advances in modeling human-object interactions mostly focus on perceiving isolated individuals and objects, due to fundamental data scarcity. In this paper, we introduce HOI-M3, a novel large-scale dataset for modeling the interactions of Multiple huMans and Multi… ▽ More

    Submitted 2 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  11. arXiv:2403.20198  [pdf, other

    cs.IT eess.SY

    Minimizing End-to-End Latency for Joint Source-Channel Coding Systems

    Authors: Kaiyi Chi, Qianqian Yang, Yuanchao Shu, Zhaohui Yang, Zhiguo Shi

    Abstract: While existing studies have highlighted the advantages of deep learning (DL)-based joint source-channel coding (JSCC) schemes in enhancing transmission efficiency, they often overlook the crucial aspect of resource management during the deployment phase. In this paper, we propose an approach to minimize the transmission latency in an uplink JSCC-based system. We first analyze the correlation betwe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 Pages, 5 Figures, accepted by 2024 IEEE ICC Workshop

  12. arXiv:2403.19654  [pdf, other

    cs.CV

    RSMamba: Remote Sensing Image Classification with State Space Model

    Authors: Keyan Chen, Bowen Chen, Chenyang Liu, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

    Abstract: Remote sensing image classification forms the foundation of various understanding tasks, serving a crucial function in remote sensing image interpretation. The recent advancements of Convolutional Neural Networks (CNNs) and Transformers have markedly enhanced classification accuracy. Nonetheless, remote sensing scene classification remains a significant challenge, especially given the complexity a… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  13. arXiv:2403.19646  [pdf, other

    cs.CV

    Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis

    Authors: Chenyang Liu, Keyan Chen, Haotian Zhang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi

    Abstract: Monitoring changes in the Earth's surface is crucial for understanding natural processes and human impacts, necessitating precise and comprehensive interpretation methodologies. Remote sensing satellite imagery offers a unique perspective for monitoring these changes, leading to the emergence of remote sensing image change interpretation (RSICI) as a significant research focus. Current RSICI techn… ▽ More

    Submitted 1 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  14. arXiv:2403.19622  [pdf, other

    cs.RO cs.CV

    RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents

    Authors: Zeren Chen, Zhelun Shi, Xiaoya Lu, Lehan He, Sucheng Qian, Hao Shu Fang, Zhenfei Yin, Wanli Ouyang, **g Shao, Yu Qiao, Cewu Lu, Lu Sheng

    Abstract: The ultimate goals of robotic learning is to acquire a comprehensive and generalizable robotic system capable of performing both seen skills within the training distribution and unseen skills in novel environments. Recent progress in utilizing language models as high-level planners has demonstrated that the complexity of tasks can be reduced through decomposing them into primitive-level plans, mak… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 24 pages, 12 figures, 6 tables

  15. arXiv:2403.19446  [pdf, other

    cs.LO

    EDA-Driven Preprocessing for SAT Solving

    Authors: Zhengyuan Shi, Tiebing Tang, Sadaf Khan, Hui-Ling Zhen, Mingxuan Yuan, Zhufei Chu, Qiang Xu

    Abstract: Effective formulation of problems into Conjunctive Normal Form (CNF) is critical in modern Boolean Satisfiability (SAT) solving for optimizing solver performance. Addressing the limitations of existing methods, our Electronic Design Automation (EDA)-driven preprocessing framework introduces a novel methodology for preparing SAT instances, leveraging both circuit and CNF formats for enhanced flexib… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  16. arXiv:2403.18134  [pdf, other

    eess.IV cs.CV

    Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification

    Authors: Zhan Shi, **gwei Zhang, Jun Kong, Fusheng Wang

    Abstract: In digital pathology, the multiple instance learning (MIL) strategy is widely used in the weakly supervised histopathology whole slide image (WSI) classification task where giga-pixel WSIs are only labeled at the slide level. However, existing attention-based MIL approaches often overlook contextual information and intrinsic spatial relationships between neighboring tissue tiles, while graph-based… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  17. arXiv:2403.17830  [pdf, other

    cs.CV

    Assessment of Multimodal Large Language Models in Alignment with Human Values

    Authors: Zhelun Shi, Zhipin Wang, Hongxing Fan, Zaibin Zhang, Lijun Li, Yongting Zhang, Zhenfei Yin, Lu Sheng, Yu Qiao, **g Shao

    Abstract: Large Language Models (LLMs) aim to serve as versatile assistants aligned with human values, as defined by the principles of being helpful, honest, and harmless (hhh). However, in terms of Multimodal Large Language Models (MLLMs), despite their commendable performance in perception and reasoning tasks, their alignment with human values remains largely unexplored, given the complexity of defining h… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.02692

  18. arXiv:2403.15726  [pdf, other

    cs.LG

    Convection-Diffusion Equation: A Theoretically Certified Framework for Neural Networks

    Authors: Tangjun Wang, Chenglong Bao, Zuoqiang Shi

    Abstract: In this paper, we study the partial differential equation models of neural networks. Neural network can be viewed as a map from a simple base model to a complicate function. Based on solid analysis, we show that this map can be formulated by a convection-diffusion equation. This theoretically certified framework gives mathematical foundation and more understanding of neural networks. Moreover, bas… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  19. arXiv:2403.14621  [pdf, other

    cs.CV

    GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

    Authors: Yinghao Xu, Zifan Shi, Wang Yifan, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein

    Abstract: We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from sparse-view images in around 0.1s. GRM is a feed-forward transformer-based model that efficiently incorporates multi-view information to translate the input pixels into pixel-aligned Gaussians, which are unprojected to create a set of densely distributed 3D Gaussians representing a scene. Together, our transformer… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Project page: https://justimyhxu.github.io/projects/grm/ Code: https://github.com/justimyhxu/GRM

  20. arXiv:2403.13562  [pdf, other

    eess.SY

    Augmented Labeled Random Finite Sets and Its Application to Group Target Tracking

    Authors: Chaoqun Yang, Mengdie Xu, Xiaowei Liang, Zhiguo Shi, Heng Zhang, Xianghui Cao

    Abstract: This paper addresses the problem of group target tracking (GTT), wherein multiple closely spaced targets within a group pose a coordinated motion. To improve the tracking performance, the labeled random finite sets (LRFSs) theory is adopted, and this paper develops a new kind of LRFSs, i.e., augmented LRFSs, which introduces group information into the definition of LRFSs. Specifically, for each el… ▽ More

    Submitted 16 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  21. arXiv:2403.11751  [pdf, other

    cs.CV

    Relational Representation Learning Network for Cross-Spectral Image Patch Matching

    Authors: Chuang Yu, Yunpeng Liu, **miao Zhao, Dou Quan, Zelin Shi

    Abstract: Recently, feature relation learning has drawn widespread attention in cross-spectral image patch matching. However, existing related research focuses on extracting diverse relations between image patch features and ignores sufficient intrinsic feature representations of individual image patches. Therefore, an innovative relational representation learning idea is proposed for the first time, which… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  22. arXiv:2403.11465  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Ultra-Long Homochiral Graphene Nanoribbons Grown Within h-BN Stacks for High-Performance Electronics

    Authors: Bosai Lyu, Jiajun Chen, Sen Wang, Shuo Lou, Peiyue Shen, **gxu Xie, Lu Qiu, Izaac Mitchell, Can Li, Cheng Hu, Xianliang Zhou, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, **feng Jia, Qi Liang, Guorui Chen, Tingxin Li, Shiyong Wang, Wengen Ouyang, Oded Hod, Feng Ding, Michael Urbakh, Zhiwen Shi

    Abstract: Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unsca… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  23. arXiv:2403.09960  [pdf, other

    math.ST math.PR stat.ML

    Multivariate Gaussian Approximation for Random Forest via Region-based Stabilization

    Authors: Zhaoyang Shi, Chinmoy Bhattacharjee, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We derive Gaussian approximation bounds for random forest predictions based on a set of training points given by a Poisson process, under fairly mild regularity assumptions on the data generating process. Our approach is based on the key observation that the random forest predictions satisfy a certain geometric property called region-based stabilization. In the process of develo** our results fo… ▽ More

    Submitted 25 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  24. arXiv:2403.07712  [pdf, ps, other

    math.AP math.NA

    Nonlocal Stokes equation with relaxation on the divergence free equation

    Authors: Yajie Zhang, Qiang Du, Zuoqiang Shi

    Abstract: In this paper, we consider a new nonlocal approximation to the linear Stokes system with periodic boundary conditions in two and three dimensional spaces . A relaxation term is added to the equation of nonlocal divergence free equation, which is reminiscent to the relaxation of local Stokes equation with small artificial compressibility. Our analysis shows that the well-posedness of the nonlocal s… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  25. arXiv:2403.07436  [pdf, other

    cs.CV

    JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection

    Authors: Hanyu Zhou, Zhiwei Shi, Hao Dong, Shihan Peng, Yi Chang, Luxin Yan

    Abstract: Event-based moving object detection is a challenging task, where static background and moving object are mixed together. Typically, existing methods mainly align the background events to the same spatial coordinate system via motion compensation to distinguish the moving object. However, they neglect the potential spatial tailing effect of moving object events caused by excessive motion, which may… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  26. arXiv:2403.07432  [pdf, other

    cs.CV

    Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow

    Authors: Hanyu Zhou, Yi Chang, Zhiwei Shi, Luxin Yan

    Abstract: Single RGB or LiDAR is the mainstream sensor for the challenging scene flow, which relies heavily on visual features to match motion features. Compared with single modality, existing methods adopt a fusion strategy to directly fuse the cross-modal complementary knowledge in motion space. However, these direct fusion methods may suffer the modality gap due to the visual intrinsic heterogeneous natu… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  27. arXiv:2403.07257  [pdf, other

    cs.AR cs.ET

    The Dawn of AI-Native EDA: Opportunities and Challenges of Large Circuit Models

    Authors: Lei Chen, Yiqi Chen, Zhufei Chu, Wenji Fang, Tsung-Yi Ho, Ru Huang, Yu Huang, Sadaf Khan, Min Li, Xingquan Li, Yu Li, Yun Liang, **wei Liu, Yi Liu, Yibo Lin, Guojie Luo, Zhengyuan Shi, Guangyu Sun, Dimitrios Tsaras, Runsheng Wang, Ziyi Wang, Xinming Wei, Zhiyao Xie, Qiang Xu, Chenhao Xue , et al. (14 additional authors not shown)

    Abstract: Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domains, such as vision, text, and graph analytics, applying them to circuit design without tailoring to the unique complexities of electronic circuits. Suc… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: The authors are ordered alphabetically. Contact: qxu@cse[dot]cuhk[dot]edu[dot]hk, gluo@pku[dot]edu[dot]cn, yuan.mingxuan@huawei[dot]com

  28. arXiv:2403.05888  [pdf, ps, other

    math.NA

    A Second-Order Nonlocal Approximation to Manifold Poisson Models with Neumann Boundary

    Authors: Yajie Zhang, Zuoqiang Shi

    Abstract: In this paper, we propose a class of nonlocal models to approximate the Poisson model on manifolds with homogeneous Neumann boundary condition, where the manifolds are assumed to be embedded in high dimensional Euclid spaces. In comparison to the existing nonlocal approximation of Poisson models with Neumann boundary, we optimize the truncation error of model by adding an augmented term along the… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.02120

  29. arXiv:2403.05430  [pdf, other

    cs.CE

    MambaLithium: Selective state space model for remaining-useful-life, state-of-health, and state-of-charge estimation of lithium-ion batteries

    Authors: Zhuangwei Shi

    Abstract: Recently, lithium-ion batteries occupy a pivotal position in the realm of electric vehicles and the burgeoning new energy industry. Their performance is heavily dependent on three core states: remaining-useful-life (RUL), state-of-health (SOH), and state-of-charge (SOC). Given the remarkable success of Mamba (Structured state space sequence models with selection mechanism and scan module, S6) in s… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.18959; text overlap with arXiv:2204.02623

  30. arXiv:2403.04194  [pdf, other

    cs.CV

    SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

    Authors: Tao Zhou, Wenhan Luo, Qi Ye, Zhiguo Shi, Jiming Chen

    Abstract: Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images. These promptable models exhibit denoising abilities for imprecise prompt inputs, such as imprecise bounding boxes. In this paper, we explore the potential of applying SAM to track and segment objects in videos where we recognize the tr… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  31. arXiv:2403.03031  [pdf, other

    cs.CL

    Learning to Use Tools via Cooperative and Interactive Agents

    Authors: Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Pengjie Ren, Suzan Verberne, Zhaochun Ren

    Abstract: Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility. Existing methods employ one single LLM-based agent to iteratively select and execute tools, thereafter incorporating execution results into the next action prediction. Despite their progress, these methods suffer from performance degradation when addressing practical tasks due to: (1) the… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: working in process

  32. arXiv:2403.02714  [pdf, other

    cs.CV

    DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

    Authors: Feng Hou, ** Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao Shi, Jian** Fan, Yong Rui, Zhiqiang He

    Abstract: Traditional cross-domain tasks, including domain adaptation and domain generalization, rely heavily on training model by source domain data. With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Ada… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Currently in review for ICML 2024

  33. arXiv:2403.01093  [pdf, other

    eess.SP

    Variational Bayesian Learning Based Localization and Channel Reconstruction in RIS-aided Systems

    Authors: Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang

    Abstract: The emerging immersive and autonomous services have posed stringent requirements on both communications and localization. By considering the great potential of reconfigurable intelligent surface (RIS), this paper focuses on the joint channel estimation and localization for RIS-aided wireless systems. As opposed to existing works that treat channel estimation and localization independently, this pa… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  34. arXiv:2403.00496  [pdf

    physics.optics physics.app-ph

    Benchmarking reconstructive spectrometer with multi-resonant cavities

    Authors: Chunhui Yao, Kangning Xu, Tianhua Lin, Jie Ma, Chumeng Yao, Peng Bao, Zhitian Shi, Richard Penty, Qixiang Cheng

    Abstract: Recent years have seen the rapid development of miniaturized reconstructive spectrometers (RSs), yet they still confront a range of technical challenges, such as bandwidth/resolution ratio, sensing speed, and/or power efficiency. Reported RS designs often suffer from insufficient decorrelation between sampling channels, which results in limited compressive sampling efficiency, in essence, due to i… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  35. arXiv:2402.18959  [pdf, other

    cs.CE q-fin.ST

    MambaStock: Selective state space model for stock prediction

    Authors: Zhuangwei Shi

    Abstract: The stock market plays a pivotal role in economic development, yet its intricate volatility poses challenges for investors. Consequently, research and accurate predictions of stock price movements are crucial for mitigating risks. Traditional time series models fall short in capturing nonlinearity, leading to unsatisfactory stock predictions. This limitation has spurred the widespread adoption of… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.02623

  36. arXiv:2402.16459  [pdf, other

    cs.CL cs.AI

    Defending LLMs against Jailbreaking Attacks via Backtranslation

    Authors: Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh

    Abstract: Although many large language models (LLMs) have been trained to refuse harmful requests, they are still vulnerable to jailbreaking attacks which rewrite the original prompt to conceal its harmful intent. In this paper, we propose a new method for defending LLMs against jailbreaking attacks by ``backtranslation''. Specifically, given an initial response generated by the target LLM from an input pro… ▽ More

    Submitted 6 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  37. arXiv:2402.15048  [pdf, other

    cs.CL cs.AI

    Unlocking the Power of Large Language Models for Entity Alignment

    Authors: Xuhui Jiang, Yinghan Shen, Zhichao Shi, Cheng** Xu, Wei Li, Zixuan Li, Jian Guo, Huawei Shen, Yuanzhuo Wang

    Abstract: Entity Alignment (EA) is vital for integrating diverse knowledge graph (KG) data, playing a crucial role in data-driven AI applications. Traditional EA methods primarily rely on comparing entity embeddings, but their effectiveness is constrained by the limited input KG data and the capabilities of the representation learning techniques. Against this backdrop, we introduce ChatEA, an innovative fra… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  38. arXiv:2402.15017  [pdf, other

    cs.LG cs.AI cs.CL

    Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

    Authors: Zhuoyan Xu, Zhenmei Shi, Junyi Wei, Fangzhou Mu, Yin Li, Yingyu Liang

    Abstract: Foundation models have emerged as a powerful tool for many AI problems. Despite the tremendous success of foundation models, effective adaptation to new tasks, particularly those with limited labels, remains an open question and lacks theoretical understanding. An emerging solution with recent success in vision and NLP involves finetuning a foundation model on a selection of relevant tasks, before… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Published at ICLR 2024. 54 pages

  39. arXiv:2402.14985  [pdf, other

    math.ST stat.ML

    Nonsmooth Nonparametric Regression via Fractional Laplacian Eigenmaps

    Authors: Zhaoyang Shi, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We develop nonparametric regression methods for the case when the true regression function is not necessarily smooth. More specifically, our approach is using the fractional Laplacian and is designed to handle the case when the true regression function lies in an $L_2$-fractional Sobolev space with order $s\in (0,1)$. This function class is a Hilbert space lying between the space of square-integra… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  40. arXiv:2402.14000  [pdf, other

    cs.CV

    Real-time 3D-aware Portrait Editing from a Single Image

    Authors: Qingyan Bai, Zifan Shi, Yinghao Xu, Hao Ouyang, Qiuyu Wang, Ceyuan Yang, Xuan Wang, Gordon Wetzstein, Yujun Shen, Qifeng Chen

    Abstract: This work presents 3DPE, a practical method that can efficiently edit a face image following given prompts, like reference images or text descriptions, in a 3D-aware manner. To this end, a lightweight module is distilled from a 3D portrait generator and a text-to-image model, which provide prior knowledge of face geometry and superior editing capability, respectively. Such a design brings two comp… ▽ More

    Submitted 2 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  41. arXiv:2402.12436  [pdf, other

    cond-mat.str-el cond-mat.stat-mech hep-th

    Excitonic quantum criticality: from bilayer graphene to narrow Chern bands

    Authors: Zhengyan Darius Shi, Hart Goldman, Zhihuan Dong, T. Senthil

    Abstract: We study a family of excitonic quantum phase transitions describing the evolution of a bilayer metallic state to an inter-layer coherent state where excitons condense. We argue that such transitions can be continuous and exhibit a non-Fermi liquid counterflow response ${ρ_{\mathrm{counterflow}}(ω)\simω^{2/z}}$ that directly encodes the dynamical critical exponent $z$. Our calculations are performe… ▽ More

    Submitted 26 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 30 pages + 26 pages appendices, 8 figures. (v2) Includes the effect of momentum dependence in the fermion self energy, which was neglected in (v1). Appendix A has been expanded to clarify the relationship between this work and the existing literature on ferromagnetic transitions in metals

  42. arXiv:2402.11988  [pdf

    physics.optics physics.app-ph

    Photonic Chiplet Interconnection via 3D-Nanoprinted Interposer

    Authors: Huiyu Huang, Zhitian Shi, Giuseppe Talli, Maxim Kuschnerov, Richard Penty, Qixiang Cheng

    Abstract: Photonic integrated circuits utilize various waveguide materials, each excelling in specific metrics like efficient light emission, low propagation loss, high electro-optic efficiency, and potential for mass production. Inherent shortcomings in each platform push exploration of hybrid and heterogeneous integration, which demands specialized designs and extra fabrication processes for each material… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  43. arXiv:2402.11818  [pdf, other

    cs.CL cs.AI cs.CY

    Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

    Authors: Sameer Jain, Sedrick Scott Keh, Shova Chettri, Karun Dewan, Pablo Izquierdo, Johanna Prussman, Pooja Shreshtha, Cesar Suarez, Zheyuan Ryan Shi, Lei Li, Fei Fang

    Abstract: Environmental conservation organizations routinely monitor news content on conservation in protected areas to maintain situational awareness of developments that can have an environmental impact. Existing automated media monitoring systems require large amounts of data labeled by domain experts, which is only feasible at scale for high-resource languages like English. However, such tools are most… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: AAAI 2024: AI for Social Impact Track

  44. arXiv:2402.09469  [pdf, other

    cs.LG stat.ML

    Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic

    Authors: Jiuxiang Gu, Chenyang Li, Yingyu Liang, Zhenmei Shi, Zhao Song, Tianyi Zhou

    Abstract: In the evolving landscape of machine learning, a pivotal challenge lies in deciphering the internal representations harnessed by neural networks and Transformers. Building on recent progress toward comprehending how networks execute distinct target functions, our study embarks on an exploration of the underlying reasons behind networks adopting specific computational strategies. We direct our focu… ▽ More

    Submitted 24 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Update Section 5.3; clean up problem setup

  45. arXiv:2402.05940  [pdf

    cs.LG cs.AI stat.ME

    Causal Relationship Network of Risk Factors Impacting Workday Loss in Underground Coal Mines

    Authors: Shangsi Ren, Cameron A. Beeche, Zhiyi Shi, Maria Acevedo Garcia, Katherine Zychowski, Shuguang Leng, Pedram Roghanchi, Jiantao Pu

    Abstract: This study aims to establish the causal relationship network between various factors leading to workday loss in underground coal mines using a novel causal artificial intelligence (AI) method. The analysis utilizes data obtained from the National Institute for Occupational Safety and Health (NIOSH). A total of 101,010 injury records from 3,982 unique underground coal mines spanning the years from… ▽ More

    Submitted 24 January, 2024; originally announced February 2024.

    Comments: 5 figures 5 tables

  46. arXiv:2402.03569  [pdf, other

    cs.CR

    The Invisible Game on the Internet: A Case Study of Decoding Deceptive Patterns

    Authors: Zewei Shi, Ruoxi Sun, Jieshan Chen, Jiamou Sun, Minhui Xue

    Abstract: Deceptive patterns are design practices embedded in digital platforms to manipulate users, representing a widespread and long-standing issue in the web and mobile software development industry. Legislative actions highlight the urgency of globally regulating deceptive patterns. However, despite advancements in detection tools, a significant gap exists in assessing deceptive pattern risks. In this… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  47. arXiv:2402.01647  [pdf, other

    cs.CY cs.AI cs.HC cs.LG cs.RO

    Build Your Own Robot Friend: An Open-Source Learning Module for Accessible and Engaging AI Education

    Authors: Zhonghao Shi, Allison O'Connell, Zongjian Li, Siqi Liu, Jennifer Ayissi, Guy Hoffman, Mohammad Soleymani, Maja J. Matarić

    Abstract: As artificial intelligence (AI) is playing an increasingly important role in our society and global economy, AI education and literacy have become necessary components in college and K-12 education to prepare students for an AI-powered society. However, current AI curricula have not yet been made accessible and engaging enough for students and schools from all socio-economic backgrounds with diffe… ▽ More

    Submitted 6 January, 2024; originally announced February 2024.

    Comments: Accepted to the Proceedings of the AAAI Conference on Artificial Intelligence (2024)

  48. arXiv:2402.00379  [pdf, other

    quant-ph

    Error-Tolerant Amplification and Simulation of the Ultrastrong-Coupling Quantum Rabi Model

    Authors: Ye-Hong Chen, Zhi-Cheng Shi, Franco Nori, Yan Xia

    Abstract: Cat-state qubits formed by photonic cat states show great promise for hardware-efficient universal quantum computing. We demonstrate that cat-state qubits are also promising for error-tolerant simulations of the quantum Rabi model (and its varieties) to enhance the coupling strength between the cat-state qubit and a cavity, to reach the ultrastrong-coupling regime. This allows us to explore severa… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, comments are welcome

  49. arXiv:2401.17642  [pdf, other

    cs.CV

    Exploring the Common Appearance-Boundary Adaptation for Nighttime Optical Flow

    Authors: Hanyu Zhou, Yi Chang, Haoyue Liu, Wending Yan, Yuxing Duan, Zhiwei Shi, Luxin Yan

    Abstract: We investigate a challenging task of nighttime optical flow, which suffers from weakened texture and amplified noise. These degradations weaken discriminative visual features, thus causing invalid motion feature matching. Typically, existing methods employ domain adaptation to transfer knowledge from auxiliary domain to nighttime domain in either input visual space or output motion space. However,… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Journal ref: International Conference on Learning Representations (ICLR), 2024

  50. arXiv:2401.15619  [pdf, ps, other

    eess.SP

    A semidefinite programming approach for robust elliptic localization

    Authors: Wenxin Xiong, Jiajun He, Zhang-Lei Shi, Keyuan Hu, Hing Cheung So, Chi-Sing Leung

    Abstract: This short communication addresses the problem of elliptic localization with outlier measurements, whose occurrences are prevalent in various location-enabled applications and can significantly compromise the positioning performance if not adequately handled. In contrast to the reliance on $M$-estimation adopted in the majority of existing solutions, we take a different path, specifically explorin… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.