Skip to main content

Showing 1–50 of 328 results for author: Yuan, M

.
  1. arXiv:2407.00312  [pdf, other

    cs.AI cs.NE

    UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems

    Authors: Zhi Zheng, Changliang Zhou, Tong Xialiang, Mingxuan Yuan, Zhenkun Wang

    Abstract: Single-stage neural combinatorial optimization solvers have achieved near-optimal results on various small-scale combinatorial optimization (CO) problems without needing expert knowledge. However, these solvers exhibit significant performance degradation when applied to large-scale CO problems. Recently, two-stage neural methods with divide-and-conquer strategies have shown superiorities in addres… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.18360  [pdf, other

    cs.CV

    XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis

    Authors: Hao Li, Ming Yuan, Yan Zhang, Chenming Wu, Chen Zhao, Chunyu Song, Haocheng Feng, Errui Ding, Dingwen Zhang, **gdong Wang

    Abstract: Thoroughly testing autonomy systems is crucial in the pursuit of safe autonomous driving vehicles. It necessitates creating safety-critical scenarios that go beyond what can be safely collected from real-world data, as many of these scenarios occur infrequently on public roads. However, the evaluation of most existing NVS methods relies on sporadic sampling of image frames from the training data,… ▽ More

    Submitted 26 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: project page: https://3d-aigc.github.io/XLD/

  3. arXiv:2406.14868  [pdf, other

    cs.CL cs.LG

    Direct Multi-Turn Preference Optimization for Language Agents

    Authors: Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng

    Abstract: Adapting Large Language Models (LLMs) for agent tasks is critical in develo** language agents. Direct Preference Optimization (DPO) is a promising technique for this adaptation with the alleviation of compounding errors, offering a means to directly optimize Reinforcement Learning (RL) objectives. However, applying DPO to multi-turn tasks presents challenges due to the inability to cancel the pa… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.05010  [pdf, other

    stat.ME

    Testing common invariant subspace of multilayer networks

    Authors: Mingao Yuan, Qianqian Yao

    Abstract: Graph (or network) is a mathematical structure that has been widely used to model relational data. As real-world systems get more complex, multilayer (or multiple) networks are employed to represent diverse patterns of relationships among the objects in the systems. One active research problem in multilayer networks analysis is to study the common invariant subspace of the networks, because such c… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  5. arXiv:2406.04699  [pdf, other

    cs.LO cs.AI

    Logic Synthesis with Generative Deep Neural Networks

    Authors: Xihan Li, Xing Li, Lei Chen, Xing Zhang, Mingxuan Yuan, Jun Wang

    Abstract: While deep learning has achieved significant success in various domains, its application to logic circuit design has been limited due to complex constraints and strict feasibility requirement. However, a recent generative deep neural model, "Circuit Transformer", has shown promise in this area by enabling equivalence-preserving circuit transformation on a small scale. In this paper, we introduce a… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: In IWLS 2024

  6. arXiv:2406.04594  [pdf, other

    cs.DC cs.AI cs.LG

    Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

    Authors: Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Huang Zhong, Dennis Cai, Yuan Xie, Binzhang Fu

    Abstract: The emergence of Large Language Models (LLMs) has necessitated the adoption of parallel training techniques, involving the deployment of thousands of GPUs to train a single model. Unfortunately, we have found that the efficiency of current parallel training is often suboptimal, largely due to the following two main issues. Firstly, hardware failures are inevitable, leading to interruptions in the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2405.19548  [pdf, other

    cs.LG

    RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning

    Authors: Mingqi Yuan, Roger Creus Castanyer, Bo Li, Xin **, Glen Berseth, Wenjun Zeng

    Abstract: Extrinsic rewards can effectively guide reinforcement learning (RL) agents in specific tasks. However, extrinsic rewards frequently fall short in complex environments due to the significant human effort needed for their design and annotation. This limitation underscores the necessity for intrinsic rewards, which offer auxiliary and dense signals and can enable agents to learn in an unsupervised ma… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 25 pages, 19 figures

  8. arXiv:2405.19531  [pdf, other

    cs.RO cs.LG

    Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives

    Authors: Mingqi Yuan, Huijiang Wang, Kai-Fung Chu, Fumiya Iida, Bo Li, Wenjun Zeng

    Abstract: Advances in artificial intelligence (AI) have been propelling the evolution of human-robot interaction (HRI) technologies. However, significant challenges remain in achieving seamless interactions, particularly in tasks requiring physical contact with humans. These challenges arise from the need for accurate real-time perception of human actions, adaptive control algorithms for robots, and the eff… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 8 pages, 10 figures

  9. arXiv:2405.18748  [pdf

    physics.soc-ph econ.GN

    Equity Implications of Net-Zero Emissions: A Multi-Model Analysis of Energy Expenditures Across Income Classes Under Economy-Wide Deep Decarbonization Policies

    Authors: John Bistlinea, Chikara Onda, Morgan Browning, Johannes Emmerling, Gokul Iyer, Megan Mahajan, Jim McFarland, Haewon McJeon, Robbie Orvis, Francisco Ralston Fonseca, Christopher Roney, Noah Sandoval, Luis Sarmiento, John Weyant, Jared Woollacott, Mei Yuan

    Abstract: With companies, states, and countries targeting net-zero emissions around midcentury, there are questions about how these targets alter household welfare and finances, including distributional effects across income groups. This paper examines the distributional dimensions of technology transitions and net-zero policies with a focus on welfare impacts across household incomes. The analysis uses a m… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Journal ref: 2024, Energy and Climate Change, 5: 100118

  10. arXiv:2405.18412  [pdf, other

    math.ST math.NA stat.ME stat.ML

    Tensor Methods in High Dimensional Data Analysis: Opportunities and Challenges

    Authors: Arnab Auddy, Dong Xia, Ming Yuan

    Abstract: Large amount of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and comput… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  11. arXiv:2405.17525  [pdf, ps, other

    cs.LG

    SmoothGNN: Smoothing-based GNN for Unsupervised Node Anomaly Detection

    Authors: Xiangyu Dong, Xingyi Zhang, Yanni Sun, Lei Chen, Mingxuan Yuan, Sibo Wang

    Abstract: The smoothing issue leads to indistinguishable node representations, which poses a significant challenge in the field of graph learning. However, this issue also presents an opportunity to reveal underlying properties behind different types of nodes, which have been overlooked in previous studies. Through empirical and theoretical analysis of real-world node anomaly detection (NAD) datasets, we ob… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.17272  [pdf, other

    cs.LG cs.AI

    DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems

    Authors: Zhi Zheng, Shunyu Yao, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Ke Tang

    Abstract: The min-max vehicle routing problem (min-max VRP) traverses all given customers by assigning several routes and aims to minimize the length of the longest route. Recently, reinforcement learning (RL)-based sequential planning methods have exhibited advantages in solving efficiency and optimality. However, these methods fail to exploit the problem-specific properties in learning representations, re… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  13. arXiv:2405.12262  [pdf, other

    cs.LG cs.AI

    Prompt Learning for Generalized Vehicle Routing

    Authors: Fei Liu, Xi Lin, Weiduo Liao, Zhenkun Wang, Qingfu Zhang, Xialiang Tong, Mingxuan Yuan

    Abstract: Neural combinatorial optimization (NCO) is a promising learning-based approach to solving various vehicle routing problems without much manual algorithm design. However, the current NCO methods mainly focus on the in-distribution performance, while the real-world problem instances usually come from different distributions. A costly fine-tuning approach or generalized model retraining from scratch… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  14. arXiv:2405.11051  [pdf, ps, other

    math.PR

    Darboux transformation of diffusion processes

    Authors: Alexey Kuznetsov, Minjian Yuan

    Abstract: Darboux transformation of a second order linear differential operator is a well-known technique with many applications in mathematics and physics. We study Darboux transformation from the point of view of Markov semigroups of diffusion processes. We construct a Darboux transform of a diffusion process through a combination of Doob's $h$-transform and a version of Siegmund duality. Our main result… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 25 pages

    MSC Class: 60J60; 60J35

  15. arXiv:2405.11024  [pdf, other

    cs.LG cs.AI

    GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection

    Authors: Zhanguang Zhang, Didier Chetelat, Joseph Cotnareanu, Amur Ghose, Wenyi Xiao, Hui-Ling Zhen, Yingxue Zhang, Jianye Hao, Mark Coates, Mingxuan Yuan

    Abstract: Boolean satisfiability (SAT) problems are routinely solved by SAT solvers in real-life applications, yet solving time can vary drastically between solvers for the same instance. This has motivated research into machine learning models that can predict, for a given SAT instance, which solver to select among several options. Existing SAT solver selection methods all rely on some hand-picked instance… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  16. arXiv:2405.09024  [pdf, other

    cs.CV

    Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

    Authors: Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui Liao

    Abstract: The ambiguous appearance, tiny scale, and fine-grained classes of objects in remote sensing imagery inevitably lead to the noisy annotations in category labels of detection dataset. However, the effects and treatments of the label noises are underexplored in modern oriented remote sensing object detectors. To address this issue, we propose a robust oriented remote sensing object detection method t… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  17. arXiv:2405.07131  [pdf, other

    cs.HC cs.MA

    MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototy**

    Authors: Mingyue Yuan, Jieshan Chen, Aaron Quigley

    Abstract: In automated user interactive design, designers face key challenges, including accurate representation of user intent, crafting high-quality components, and ensuring both aesthetic and semantic consistency. Addressing these challenges, we introduce MAxPrototyper, our human-centered, multi-agent system for interactive design generation. The core of MAxPrototyper is a theme design agent. It coordina… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  18. arXiv:2405.04733  [pdf, other

    cs.IT

    One-Bit Phase Retrieval: Optimal Rates and Efficient Algorithms

    Authors: Junren Chen, Ming Yuan

    Abstract: In this paper, we study the sample complexity and develop efficient optimal algorithms for 1-bit phase retrieval: recovering a signal $\mathbf{x}\in\mathbb{R}^n$ from $m$ phaseless bits $\{\mathrm{sign}(|\mathbf{a}_i^\top\mathbf{x}|-τ)\}_{i=1}^m$ generated by standard Gaussian $\mathbf{a}_i$s. By investigating a phaseless version of random hyperplane tessellation, we show that (constrained) hammin… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  19. arXiv:2405.01906  [pdf, other

    cs.AI cs.LG

    Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization

    Authors: Changliang Zhou, Xi Lin, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Qingfu Zhang

    Abstract: The neural combinatorial optimization (NCO) approach has shown great potential for solving routing problems without the requirement of expert knowledge. However, existing constructive NCO methods cannot directly solve large-scale instances, which significantly limits their application prospects. To address these crucial shortcomings, this work proposes a novel Instance-Conditioned Adaptation Model… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 17 pages, 6 figures

  20. arXiv:2404.17360  [pdf, other

    cs.CV

    UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning

    Authors: Maoxun Yuan, Bo Cui, Tianyi Zhao, Xingxing Wei

    Abstract: Semantic analysis on visible (RGB) and infrared (IR) images has gained attention for its ability to be more accurate and robust under low-illumination and complex weather conditions. Due to the lack of pre-trained foundation models on the large-scale infrared image datasets, existing methods prefer to design task-specific frameworks and directly fine-tune them with pre-trained foundation models on… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  21. arXiv:2404.12638  [pdf, other

    cs.AI

    Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

    Authors: Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

    Abstract: Cutting planes (cuts) play an important role in solving mixed-integer linear programs (MILPs), which formulate many important real-world applications. Cut selection heavily depends on (P1) which cuts to prefer and (P2) how many cuts to select. Although modern MILP solvers tackle (P1)-(P2) by human-designed heuristics, machine learning carries the potential to learn more effective heuristics. Howev… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.00244

  22. arXiv:2404.07951  [pdf, other

    physics.data-an hep-ex

    Visualization for physics analysis improvement and applications in BESIII

    Authors: Zhi-Jun Li, Ming-Kuan Yuan, Yun-Xuan Song, Yan-Gu Li, **g-Shu Li, Sheng-Sen Sun, Xiao-Long Wang, Zheng-Yun You, Ya-Jun Mao

    Abstract: Modern particle physics experiments usually rely on highly complex and large-scale spectrometer devices. In high energy physics experiments, visualization helps detector design, data quality monitoring, offline data processing, and has great potential for improving physics analysis. In addition to the traditional physics data analysis based on statistical methods, visualization provides unique int… ▽ More

    Submitted 19 March, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures

  23. arXiv:2404.05404  [pdf, other

    eess.SY

    Contouring Error Bounded Control for Biaxial Switched Linear Systems

    Authors: Meng Yuan, Ye Wang, Chris Manzie, Zhezhuang Xu, Tianyou Chai

    Abstract: Biaxial motion control systems are used extensively in manufacturing and printing industries. To improve throughput and reduce machine cost, lightweight materials are being proposed in structural components but may result in higher flexibility in the machine links. This flexibility is often position dependent and compromises precision of the end effector of the machine. To address the need for imp… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  24. arXiv:2404.05168  [pdf, other

    cs.LG

    Adapting to Covariate Shift in Real-time by Encoding Trees with Motion Equations

    Authors: Tham Yik Foong, Heng Zhang, Mao Po Yuan, Danilo Vasconcellos Vargas

    Abstract: Input distribution shift presents a significant problem in many real-world systems. Here we present Xenovert, an adaptive algorithm that can dynamically adapt to changes in input distribution. It is a perfect binary tree that adaptively divides a continuous input space into several intervals of uniform density while receiving a continuous stream of input. This process indirectly maps the source di… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 7 figures, 2 tables

  25. arXiv:2404.04878  [pdf, other

    eess.IV cs.CV

    CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data

    Authors: Wei Fang, Yuxing Tang, Heng Guo, Mingze Yuan, Tony C. W. Mok, Ke Yan, Jiawen Yao, Xin Chen, Zaiyi Liu, Le Lu, Ling Zhang, Minfeng Xu

    Abstract: In the realm of medical 3D data, such as CT and MRI images, prevalent anisotropic resolution is characterized by high intra-slice but diminished inter-slice resolution. The lowered resolution between adjacent slices poses challenges, hindering optimal viewing experiences and impeding the development of robust downstream analysis algorithms. Various volumetric super-resolution algorithms aim to sur… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: CVPR accepted paper

  26. arXiv:2403.19561  [pdf, other

    cs.LG cs.AI

    Self-Improved Learning for Scalable Neural Combinatorial Optimization

    Authors: Fu Luo, Xi Lin, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Qingfu Zhang

    Abstract: The end-to-end neural combinatorial optimization (NCO) method shows promising performance in solving complex combinatorial optimization problems without the need for expert design. However, existing methods struggle with large-scale problems, hindering their practical applicability. To overcome this limitation, this work proposes a novel Self-Improved Learning (SIL) method for better scalability o… ▽ More

    Submitted 2 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  27. arXiv:2403.19446  [pdf, other

    cs.LO

    EDA-Driven Preprocessing for SAT Solving

    Authors: Zhengyuan Shi, Tiebing Tang, Sadaf Khan, Hui-Ling Zhen, Mingxuan Yuan, Zhufei Chu, Qiang Xu

    Abstract: Effective formulation of problems into Conjunctive Normal Form (CNF) is critical in modern Boolean Satisfiability (SAT) solving for optimizing solver performance. Addressing the limitations of existing methods, our Electronic Design Automation (EDA)-driven preprocessing framework introduces a novel methodology for preparing SAT instances, leveraging both circuit and CNF formats for enhanced flexib… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  28. arXiv:2403.18768  [pdf, other

    quant-ph

    Efficient Generation of Multi-partite Entanglement between Non-local Superconducting Qubits using Classical Feedback

    Authors: Akel Hashim, Ming Yuan, Pranav Gokhale, Larry Chen, Christian Juenger, Neelay Fruitwala, Yilun Xu, Gang Huang, Liang Jiang, Irfan Siddiqi

    Abstract: Quantum entanglement is one of the primary features which distinguishes quantum computers from classical computers. In gate-based quantum computing, the creation of entangled states or the distribution of entanglement across a quantum processor often requires circuit depths which grow with the number of entangled qubits. However, in teleportation-based quantum computing, one can deterministically… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  29. arXiv:2403.17828  [pdf, other

    astro-ph.HE

    The Relativistic Spin Precession in the Compact Double Neutron Star System PSR~J1946+2052

    Authors: Lingqi Meng, Weiwei Zhu, Michael Kramer, Xueli Miao, Gregory Desvignes, Li**g Shao, Huanchen Hu, Paulo C. C. Freire, Yongkun Zhang, Mengyao Xue, Ziyao Fang, David J. Champion, Mao Yuan, Chenchen Miao, Jiarui Niu, Qiuyang Fu, Jumei Yao, Yanjun Guo, Chengmin Zhang

    Abstract: We observe systematic profile changes in the visible pulsar of the compact double neutron star system PSR~J1946+2052 using observations with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The interpulse of PSR~J1946+2052 changed from single-peak to double-peak shape from 2018 to 2021. We attribute this evolution as the result of the relativistic spin precession of the pulsar. Wi… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures, accepted for publication in ApJ

  30. arXiv:2403.13838  [pdf, other

    cs.LG cs.AR

    Circuit Transformer: End-to-end Circuit Design by Predicting the Next Gate

    Authors: Xihan Li, Xing Li, Lei Chen, Xing Zhang, Mingxuan Yuan, Jun Wang

    Abstract: Language, a prominent human ability to express through sequential symbols, has been computationally mastered by recent advances of large language models (LLMs). By predicting the next word recurrently with huge neural models, LLMs have shown unprecedented capabilities in understanding and reasoning. Circuit, as the "language" of electronic design, specifies the functionality of an electronic devic… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  31. arXiv:2403.11671  [pdf, other

    cs.AR cs.AI cs.CE cs.LG cs.SE

    HDLdebugger: Streamlining HDL debugging with Large Language Models

    Authors: Xufeng Yao, Haoyang Li, Tsz Ho Chan, Wenyi Xiao, Mingxuan Yuan, Yu Huang, Lei Chen, Bei Yu

    Abstract: In the domain of chip design, Hardware Description Languages (HDLs) play a pivotal role. However, due to the complex syntax of HDLs and the limited availability of online resources, debugging HDL codes remains a difficult and time-intensive task, even for seasoned engineers. Consequently, there is a pressing need to develop automated HDL code debugging models, which can alleviate the burden on har… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 13 pages,5 figures

  32. arXiv:2403.07257  [pdf, other

    cs.AR cs.ET

    The Dawn of AI-Native EDA: Opportunities and Challenges of Large Circuit Models

    Authors: Lei Chen, Yiqi Chen, Zhufei Chu, Wenji Fang, Tsung-Yi Ho, Ru Huang, Yu Huang, Sadaf Khan, Min Li, Xingquan Li, Yu Li, Yun Liang, **wei Liu, Yi Liu, Yibo Lin, Guojie Luo, Zhengyuan Shi, Guangyu Sun, Dimitrios Tsaras, Runsheng Wang, Ziyi Wang, Xinming Wei, Zhiyao Xie, Qiang Xu, Chenhao Xue , et al. (14 additional authors not shown)

    Abstract: Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domains, such as vision, text, and graph analytics, applying them to circuit design without tailoring to the unique complexities of electronic circuits. Suc… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: The authors are ordered alphabetically. Contact: qxu@cse[dot]cuhk[dot]edu[dot]hk, gluo@pku[dot]edu[dot]cn, yuan.mingxuan@huawei[dot]com

  33. arXiv:2403.05280  [pdf, other

    cs.CV

    ContrastDiagnosis: Enhancing Interpretability in Lung Nodule Diagnosis Using Contrastive Learning

    Authors: Chenglong Wang, Yinqiao Yi, Yida Wang, Chengxiu Zhang, Yun Liu, Kensaku Mori, Mei Yuan, Guang Yang

    Abstract: With the ongoing development of deep learning, an increasing number of AI models have surpassed the performance levels of human clinical practitioners. However, the prevalence of AI diagnostic products in actual clinical practice remains significantly lower than desired. One crucial reason for this gap is the so-called `black box' nature of AI models. Clinicians' distrust of black box models has d… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  34. arXiv:2403.04914  [pdf

    cs.CE stat.OT

    Improving the Equation of Exchange for Cryptoasset Valuation Using Empirical Data

    Authors: Stylianos Kampakis, Melody Yuan, Oritsebawo Paul Ikpobe, Linas Stankevicius

    Abstract: In the evolving domain of cryptocurrency markets, accurate token valuation remains a critical aspect influencing investment decisions and policy development. Whilst the prevailing equation of exchange pricing model offers a quantitative valuation approach based on the interplay between token price, transaction volume, supply, and either velocity or holding time, it exhibits intrinsic shortcomings.… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  35. arXiv:2403.03517  [pdf, other

    cs.AI

    IB-Net: Initial Branch Network for Variable Decision in Boolean Satisfiability

    Authors: Tsz Ho Chan, Wenyi Xiao, Junhua Huang, Huiling Zhen, Guangji Tian, Mingxuan Yuan

    Abstract: Boolean Satisfiability problems are vital components in Electronic Design Automation, particularly within the Logic Equivalence Checking process. Currently, SAT solvers are employed for these problems and neural network is tried as assistance to solvers. However, as SAT problems in the LEC context are distinctive due to their predominantly unsatisfiability nature and a substantial proportion of UN… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 7 pages, 12 figures

  36. arXiv:2403.00012  [pdf, other

    cs.LG cs.AR

    PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling

    Authors: Ruizhe Zhong, Junjie Ye, Zhentao Tang, Shixiong Kai, Mingxuan Yuan, Jianye Hao, Junchi Yan

    Abstract: Pre-routing timing prediction has been recently studied for evaluating the quality of a candidate cell placement in chip design. It involves directly estimating the timing metrics for both pin-level (slack, slew) and edge-level (net delay, cell delay), without time-consuming routing. However, it often suffers from signal decay and error accumulation due to the long timing paths in large-scale indu… ▽ More

    Submitted 12 March, 2024; v1 submitted 26 February, 2024; originally announced March 2024.

    Comments: 13 pages, 5 figures, The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  37. arXiv:2402.18849  [pdf

    cs.CV cs.AI cs.CL

    Enhancing Steganographic Text Extraction: Evaluating the Impact of NLP Models on Accuracy and Semantic Coherence

    Authors: Mingyang Li, Maoqin Yuan, Luyao Li, Han Pengsihua

    Abstract: This study discusses a new method combining image steganography technology with Natural Language Processing (NLP) large models, aimed at improving the accuracy and robustness of extracting steganographic text. Traditional Least Significant Bit (LSB) steganography techniques face challenges in accuracy and robustness of information extraction when dealing with complex character encoding, such as Ch… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  38. arXiv:2402.16891  [pdf, other

    cs.LG cs.AI

    Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot Generalization

    Authors: Fei Liu, Xi Lin, Zhenkun Wang, Qingfu Zhang, Xialiang Tong, Mingxuan Yuan

    Abstract: Vehicle routing problems (VRPs), which can be found in numerous real-world applications, have been an important research topic for several decades. Recently, the neural combinatorial optimization (NCO) approach that leverages a learning-based model to solve VRPs without manual algorithm design has gained substantial attention. However, current NCO methods typically require building one model for e… ▽ More

    Submitted 12 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  39. arXiv:2402.14232  [pdf, other

    hep-ph hep-ex

    The quark flavor-violating ALPs in light of B mesons and hadron colliders

    Authors: Tong Li, Zhuoni Qian, Michael A. Schmidt, Man Yuan

    Abstract: The axion-like particle (ALP) may induce flavor-changing neutral currents (FCNCs) when their Peccei-Quinn charges are not generation universal. The search for flavor-violating ALP couplings with a bottom quark so far focused on FCNC processes of $B$ mesons at low energies. The recent measurements of $B\to K +X$ rare decays place stringent bounds on the quark flavor violations of a light ALP in dif… ▽ More

    Submitted 26 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 46 pages, 20 figures, 11 tables. version accepted for publication in JHEP

    Report number: CPPC-2024-03

  40. arXiv:2402.12143  [pdf, other

    eess.SP

    Joint mode switching and resource allocation in wireless-powered RIS-aided multiuser communication systems

    Authors: Mingang Yuan, Wenzhe Zhang, Gaofei Huang

    Abstract: This paper investigates a wireless-powered hybrid reflecting intelligent surface (hybrid RIS)-assisted multiple access system, where the RIS can harvest energy from energy station (ES) transmitted radio frequency signal (RF), and each reflecting element can flexibly switch between active mode, passive mode, and idle mode. The objective is to minimize the maximum energy consumption of the users by… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  41. arXiv:2402.11903  [pdf, other

    cs.CL cs.AI

    DiLA: Enhancing LLM Tool Learning with Differential Logic Layer

    Authors: Yu Zhang, Hui-Ling Zhen, Zehua Pei, Yingzhao Lian, Lihao Yin, Mingxuan Yuan, Bei Yu

    Abstract: Considering the challenges faced by large language models (LLMs) in logical reasoning and planning, prior efforts have sought to augment LLMs with access to external solvers. While progress has been made on simple reasoning problems, solving classical constraint satisfaction problems, such as the Boolean Satisfiability Problem (SAT) and Graph Coloring Problem (GCP), remains difficult for off-the-s… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.12295 by other authors

  42. arXiv:2402.08683  [pdf

    stat.AP math.OC

    Order picking efficiency: A scattered storage and clustered allocation strategy in automated drug dispensing systems

    Authors: Mengge Yuan, Ning Zhao, Kan Wu, Lulu Cheng

    Abstract: In the smart hospital, optimizing prescription order fulfilment processes in outpatient pharmacies is crucial. A promising device, automated drug dispensing systems (ADDSs), has emerged to streamline these processes. These systems involve human order pickers who are assisted by ADDSs. The ADDS's robotic arm transports bins from storage locations to the input/output (I/O) points, while the pharmaci… ▽ More

    Submitted 18 December, 2023; originally announced February 2024.

  43. arXiv:2402.07049  [pdf

    cs.AI

    A Factor Graph Model of Trust for a Collaborative Multi-Agent System

    Authors: Behzad Akbari, Mingfeng Yuan, Hao Wang, Haibin Zhu, **jun Shan

    Abstract: In the field of Multi-Agent Systems (MAS), known for their openness, dynamism, and cooperative nature, the ability to trust the resources and services of other agents is crucial. Trust, in this setting, is the reliance and confidence an agent has in the information, behaviors, intentions, truthfulness, and capabilities of others within the system. Our paper introduces a new graphical approach that… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  44. arXiv:2402.05789  [pdf, ps, other

    econ.EM math.ST stat.ME

    High Dimensional Factor Analysis with Weak Factors

    Authors: Jungjun Choi, Ming Yuan

    Abstract: This paper studies the principal components (PC) estimator for high dimensional approximate factor models with weak factors in that the factor loading ($\boldsymbolΛ^0$) scales sublinearly in the number $N$ of cross-section units, i.e., $\boldsymbolΛ^{0\top} \boldsymbolΛ^0 / N^α$ is positive definite in the limit for some $α\in (0,1)$. While the consistency and asymptotic normality of these estima… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  45. arXiv:2402.03375  [pdf, other

    cs.AI cs.PL

    BetterV: Controlled Verilog Generation with Discriminative Guidance

    Authors: Zehua Pei, Hui-Ling Zhen, Mingxuan Yuan, Yu Huang, Bei Yu

    Abstract: Due to the growing complexity of modern Integrated Circuits (ICs), there is a need for automated circuit design methods. Recent years have seen rising research in hardware design language generation to facilitate the design process. In this work, we propose a Verilog generation framework, BetterV, which fine-tunes the large language models (LLMs) on processed domain-specific datasets and incorpora… ▽ More

    Submitted 2 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  46. arXiv:2402.01296  [pdf, other

    cs.LG cs.CR cs.CV

    Bi-CryptoNets: Leveraging Different-Level Privacy for Encrypted Inference

    Authors: Man-Jie Yuan, Zheng Zou, Wei Gao

    Abstract: Privacy-preserving neural networks have attracted increasing attention in recent years, and various algorithms have been developed to keep the balance between accuracy, computational complexity and information security from the cryptographic view. This work takes a different view from the input data and structure of neural networks. We decompose the input data (e.g., some images) into sensitive an… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  47. arXiv:2401.12224  [pdf, other

    cs.AR cs.AI

    LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation

    Authors: Ruizhe Zhong, Xingbo Du, Shixiong Kai, Zhentao Tang, Siyuan Xu, Hui-Ling Zhen, Jianye Hao, Qiang Xu, Mingxuan Yuan, Junchi Yan

    Abstract: Driven by Moore's Law, the complexity and scale of modern chip design are increasing rapidly. Electronic Design Automation (EDA) has been widely applied to address the challenges encountered in the full chip design process. However, the evolution of very large-scale integrated circuits has made chip design time-consuming and resource-intensive, requiring substantial prior expert knowledge. Additio… ▽ More

    Submitted 28 December, 2023; originally announced January 2024.

    Comments: 15 pages, 4 figures

  48. arXiv:2401.11491  [pdf

    cs.RO

    BA-LINS: A Frame-to-Frame Bundle Adjustment for LiDAR-Inertial Navigation

    Authors: Hailiang Tang, Tisheng Zhang, Liqiang Wang, Man Yuan, Xiaoji Niu

    Abstract: Bundle Adjustment (BA) has been proven to improve the accuracy of the LiDAR map**. However, the BA method has not yet been properly employed in a dead-reckoning navigation system. In this paper, we present a frame-to-frame (F2F) BA for LiDAR-inertial navigation, named BA-LINS. Based on the direct F2F point-cloud association, the same-plane points are associated among the LiDAR keyframes. Hence,… ▽ More

    Submitted 10 February, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 14 pages, 14 figures

  49. arXiv:2401.10731  [pdf, other

    cs.CV

    Removal and Selection: Improving RGB-Infrared Object Detection via Coarse-to-Fine Fusion

    Authors: Tianyi Zhao, Maoxun Yuan, Feng Jiang, Nan Wang, Xingxing Wei

    Abstract: Object detection in visible (RGB) and infrared (IR) images has been widely applied in recent years. Leveraging the complementary characteristics of RGB and IR images, the object detector provides reliable and robust object localization from day to night. Most existing fusion strategies directly input RGB and IR images into deep neural networks, leading to inferior detection performance. However, t… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 11pages, 11figures

  50. arXiv:2401.05960  [pdf, other

    cs.AI

    Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

    Authors: Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao , et al. (1 additional authors not shown)

    Abstract: In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional opt… ▽ More

    Submitted 17 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.