Skip to main content

Showing 1–50 of 215 results for author: Lai, L

.
  1. arXiv:2406.08102  [pdf, other

    cs.CV

    Adversarial Patch for 3D Local Feature Extractor

    Authors: Yu Wen Pao, Li Chang Lai, Hong-Yi Lin

    Abstract: Local feature extractors are the cornerstone of many computer vision tasks. However, their vulnerability to adversarial attacks can significantly compromise their effectiveness. This paper discusses approaches to attack sophisticated local feature extraction algorithms and models to achieve two distinct goals: (1) forcing a match between originally non-matching image regions, and (2) preventing a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.13453  [pdf, other

    cs.LG cs.CR

    A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy

    Authors: Puning Zhao, Lifeng Lai, Li Shen, Qingming Li, Jiafei Wu, Zhe Liu

    Abstract: Privacy protection of users' entire contribution of samples is important in distributed systems. The most effective approach is the two-stage scheme, which finds a small interval first and then gets a refined estimate by clip** samples into the interval. However, the clip** operation induces bias, which is serious if the sample distribution is heavy-tailed. Besides, users with large local samp… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.01736  [pdf, other

    cs.AR

    PipeOrgan: Efficient Inter-operation Pipelining with Flexible Spatial Organization and Interconnects

    Authors: Raveesh Garg, Hyoukjun Kwon, Eric Qin, Yu-Hsin Chen, Tushar Krishna, Liangzhen Lai

    Abstract: Because of the recent trends in Deep Neural Networks (DNN) models being memory-bound, inter-operator pipelining for DNN accelerators is emerging as a promising optimization. Inter-operator pipelining reduces costly on-chip global memory and off-chip memory accesses by forwarding the output of a layer as the input of the next layer within the compute array, which is proven to be an effective optimi… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2405.01718  [pdf, other

    cs.LG math.OC stat.ML

    Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

    Authors: Xinyi Ni, Lifeng Lai

    Abstract: Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs) that often assume fixed transition probabilities. RMDPs address this by optimizing for the worst-case scenarios within ambiguity sets. While earlier studies on RMDPs have largely centered on risk-neutral reinforcement learning (RL), with the goa… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  5. arXiv:2405.00128  [pdf, other

    q-bio.BM

    Target-Specific De Novo Peptide Binder Design with DiffPepBuilder

    Authors: Fanhao Wang, Yuzhe Wang, Laiyi Feng, Changsheng Zhang, Luhua Lai

    Abstract: Despite the exciting progress in target-specific de novo protein binder design, peptide binder design remains challenging due to the flexibility of peptide structures and the scarcity of protein-peptide complex structure data. In this study, we curated a large synthetic dataset, referred to as PepPC-F, from the abundant protein-protein interface data and developed DiffPepBuilder, a de novo target-… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  6. arXiv:2404.16710  [pdf, other

    cs.CL cs.AI cs.LG

    LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

    Authors: Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Anas Mahmoud, Bilge Acun, Saurabh Agarwal, Ahmed Roman, Ahmed A Aly, Beidi Chen, Carole-Jean Wu

    Abstract: We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for earlier layers and higher dropout rates for later layers, and an early exit loss where all transformer layers share the same exit. Second, during inference, we show that this training recipe increases the accuracy of early exi… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Code open sourcing is in progress

  7. arXiv:2404.06037  [pdf, other

    cs.DC

    A Survey of Distributed Graph Algorithms on Massive Graphs

    Authors: Lingkai Meng, Yu Shao, Long Yuan, Longbin Lai, Peng Cheng, Xue Li, Wenyuan Yu, Wenjie Zhang, Xuemin Lin, **gren Zhou

    Abstract: Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been devoted to analyzing these, with most analyzing them based on programming models, less research focuses on understanding their challenges in distributed environ… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  8. arXiv:2403.15982  [pdf, other

    quant-ph math-ph

    Generally covariant geometric momentum and geometric potential for a Dirac fermion on a two-dimensional hypersurface

    Authors: Z. Li, L. Q. Lai

    Abstract: Geometric momentum is the proper momentum for a moving particle constrained on a curved surface, which depends on the outer curvature and has observable effects. In the context of multi-component quantum states, geometric momentum should be rewritten as generally covariant geometric momentum. For a Dirac fermion constrained on a two-dimensional hypersurface, we give the generally covariant geometr… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 7 pages, 2 figures

  9. arXiv:2403.14165  [pdf, other

    astro-ph.CO

    Improving SDSS Cosmological Constraints through $β$-Skeleton Weighted Correlation Functions

    Authors: Fenfen Yin, Jiacheng Ding, Limin Lai, Wei Zhang, Liang Xiao, Zihan Wang, Jaime Forero-Romero, Le Zhang, Xiao-Dong Li

    Abstract: The $β$-skeleton approach can be conveniently utilized to construct the cosmic web based on the spatial geometry distribution of galaxies, particularly in sparse samples. This method plays a key role in establishing the three-dimensional structure of the Universe and serves as a tool for quantitatively characterizing the nature of the cosmic web. This study is the first application of $β$-skeleton… ▽ More

    Submitted 25 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 14 pages,10 figues

  10. The PSF Smoothing Effect on Concentration-Related Parameters of High Redshift Galaxies in HST and JWST

    Authors: Jia-Hui Wang, Zhao-Yu Li, Ming-Yang Zhuang, Luis C. Ho, Li-Min Lai

    Abstract: We perform a comprehensive investigation of the PSF smoothing effect on the measurement of concentration-related parameters ($C$, Gini, $M_{20}$) of high redshift galaxies in the HST and JWST surveys. Our sample contains massive galaxies from the CANDELS/EGS survey (0 < z < 2), and the CEERS survey (1 < z < 3). The non-parametric concentration-related parameters ($C$, Gini, $M_{20}$) and the model… ▽ More

    Submitted 14 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by A&A, 21 pages, 20 figures. Comments are welcome

    Journal ref: A&A 686, A100 (2024)

  11. arXiv:2403.00473  [pdf, other

    cs.GR cs.RO eess.SY

    Computer-Controlled 3D Freeform Surface Weaving

    Authors: Xiangjia Chen, Lip M. Lai, Zishun Liu, Chengkai Dai, Isaac C. W. Leung, Charlie C. L. Wang, Yeung Yam

    Abstract: In this paper, we present a new computer-controlled weaving technology that enables the fabrication of woven structures in the shape of given 3D surfaces by using threads in non-traditional materials with high bending-stiffness, allowing for multiple applications with the resultant woven fabrics. A new weaving machine and a new manufacturing process are developed to realize the function of 3D surf… ▽ More

    Submitted 8 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  12. arXiv:2402.14905  [pdf, other

    cs.LG cs.AI cs.CL

    MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

    Authors: Zechun Liu, Changsheng Zhao, Forrest Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra

    Abstract: This paper addresses the growing need for efficient large language models (LLMs) on mobile devices, driven by increasing cloud costs and latency concerns. We focus on designing top-quality LLMs with fewer than a billion parameters, a practical choice for mobile deployment. Contrary to prevailing belief emphasizing the pivotal role of data and parameter quantity in determining model quality, our in… ▽ More

    Submitted 26 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ICML 2024. Code is available at https://github.com/facebookresearch/MobileLLM

  13. arXiv:2402.13076  [pdf, other

    cs.SD cs.LG eess.AS

    Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition

    Authors: Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra

    Abstract: Power consumption plays an important role in on-device streaming speech recognition, as it has a direct impact on the user experience. This study delves into how weight parameters in speech recognition models influence the overall power consumption of these models. We discovered that the impact of weight parameters on power consumption varies, influenced by factors including how often they are inv… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  14. arXiv:2401.17786  [pdf, other

    cs.DB cs.PF

    A Graph-Native Query Optimization Framework

    Authors: Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, **gren Zhou

    Abstract: Graph queries that combine pattern matching with relational operations, referred as PatRelQuery, are widely used in many real-world applications. It allows users to identify arbitrary patterns in a graph and further perform in-depth relational analysis on the results. To effectively support PatRelQuery, two key challenges need to be addressed: (1) how to optimize PatRelQuery in a unified framework… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  15. arXiv:2401.17405  [pdf, other

    cs.MA

    Camouflage Adversarial Attacks on Multiple Agent Systems

    Authors: Ziqing Lu, Guanlin Liu, Lifeng Lai, Weiyu Xu

    Abstract: The multi-agent reinforcement learning systems (MARL) based on the Markov decision process (MDP) have emerged in many critical applications. To improve the robustness/defense of MARL systems against adversarial attacks, the study of various adversarial attacks on reinforcement learning systems is very important. Previous works on adversarial attacks considered some possible features to attack in M… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.00859

  16. arXiv:2401.10806  [pdf, ps, other

    q-bio.BM

    DeepRLI: A Multi-objective Framework for Universal Protein--Ligand Interaction Prediction

    Authors: Haoyu Lin, Shiwei Wang, **tao Zhu, Yibo Li, Jianfeng Pei, Luhua Lai

    Abstract: Protein (receptor)--ligand interaction prediction is a critical component in computer-aided drug design, significantly influencing molecular docking and virtual screening processes. Despite the development of numerous scoring functions in recent years, particularly those employing machine learning, accurately and efficiently predicting binding affinities for protein--ligand complexes remains a for… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  17. arXiv:2401.05119  [pdf, other

    cond-mat.quant-gas

    Interference-induced suppression of particle emission from a Bose-Einstein condensate in lattice with time-periodic modulations

    Authors: L. Q. Lai, Z. Li

    Abstract: Collective emission of particles from a parametrically driven condensate has attracted significant experimental and theoretical attention due to the appealing visual effects and potential metrological applications. In this paper, we investigate the particle emission from a Bose-Einstein condensate confined in a one-dimensional lattice with periodically modulated interparticle interactions. We give… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 6 pages, 6 figures

  18. arXiv:2401.01059  [pdf, other

    q-bio.QM

    Accelerating Discovery of Novel and Bioactive Ligands With Pharmacophore-Informed Generative Models

    Authors: Weixin Xie, Jianhang Zhang, Qin Xie, Chaojun Gong, Youjun Xu, Luhua Lai, Jianfeng Pei

    Abstract: Deep generative models have gained significant advancements to accelerate drug discovery by generating bioactive chemicals against desired targets. Nevertheless, most generated compounds that have been validated for potent bioactivity often exhibit structural novelty levels that fall short of satisfaction, thereby providing limited inspiration to human medicinal chemists. The challenge faced by ge… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  19. arXiv:2312.12107  [pdf, other

    cs.DC cs.DB

    GraphScope Flex: LEGO-like Graph Computing Stack

    Authors: Tao He, Shuxian Hu, Longbin Lai, Dongze Li, Neng Li, Xue Li, Lexiao Liu, Xiaojian Luo, Binqing Lyu, Ke Meng, Sijie Shen, Li Su, Lei Wang, **gbo Xu, Wenyuan Yu, Weibin Zeng, Lei Zhang, Siyuan Zhang, **gren Zhou, Xiaoli Zhou, Diwen Zhu

    Abstract: Graph computing has become increasingly crucial in processing large-scale graph data, with numerous systems developed for this purpose. Two years ago, we introduced GraphScope as a system addressing a wide array of graph computing needs, including graph traversal, analytics, and learning in one system. Since its inception, GraphScope has achieved significant technological advancements and gained w… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  20. Parameterized steering criteria via correlation matrices

    Authors: Qing-Hua Zhang, Lemin Lai, Shao-Ming Fei

    Abstract: We study the steerability for arbitrary dimensional bipartite systems based on the correlation matrices given by local special unitary groups. We present families of steering criteria for bipartite quantum states in terms of parameterized correlation matrices. We show that these steering criteria may detect more steerable states than the existing steering criteria. The results are illustrated by d… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 5 pages

    Journal ref: Results in Physics 56 (2024) 107253

  21. arXiv:2312.03244  [pdf, other

    astro-ph.CO

    Improving Constraint on $Ω_{m}$ from SDSS Using Marked Correlation Functions

    Authors: L. M. Lai, J. C. Ding, X. L. Luo, Y. Z. Yang, Z. H. Wang, K. S. Liu, G. F. Liu, X. Wang, Y. Zheng, Z. Y. Li, L. Zhang, X. D. Li

    Abstract: Large-scale structure (LSS) surveys will increasingly provide stringent constraints on our cosmological models. Recently, the density-marked correlation function (MCF) has been introduced, offering an easily computable density-correlation statistic. Simulations have demonstrated that MCFs offer additional, independent constraints on cosmological models beyond the standard two-point correlation (2P… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages, 10 figures

  22. arXiv:2311.15201  [pdf, other

    q-bio.BM

    DiffBindFR: An SE(3) Equivariant Network for Flexible Protein-Ligand Docking

    Authors: **tao Zhu, Zhonghui Gu, Jianfeng Pei, Luhua Lai

    Abstract: Molecular docking, a key technique in structure-based drug design, plays pivotal roles in protein-ligand interaction modeling, hit identification and optimization, in which accurate prediction of protein-ligand binding mode is essential. Conventional docking approaches perform well in redocking tasks with known protein binding pocket conformation in the complex state. However, in real-world dockin… ▽ More

    Submitted 19 December, 2023; v1 submitted 26 November, 2023; originally announced November 2023.

  23. arXiv:2311.00859  [pdf, other

    cs.LG cs.AI cs.CR cs.MA

    Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems

    Authors: Ziqing Lu, Guanlin Liu, Lifeng Lai, Weiyu Xu

    Abstract: Finding optimal adversarial attack strategies is an important topic in reinforcement learning and the Markov decision process. Previous studies usually assume one all-knowing coordinator (attacker) for whom attacking different recipient (victim) agents incurs uniform costs. However, in reality, instead of using one limitless central attacker, the attacks often need to be performed by distributed a… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Submitted to ICCASP2024

  24. arXiv:2310.12829  [pdf

    physics.bio-ph q-bio.CB

    A Microwell-Based Microfluidic Device for Single-Cell Trap** and Magnetic Field Gradient Stimulation

    Authors: Richard Lee Lai

    Abstract: We develop a microfluidic platform for the long-term cultivation and observation of both THP-1 cells under different physiological conditions. First, we determine optimal seeding conditions and microwell geometry. Next, we observe changes in cell size and circularity. Results show that gradient magnetic forces on the order of 102 T/m results in stunted growth and irregular cell shapes. Finally, we… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  25. arXiv:2309.16772  [pdf, other

    cs.CV cs.AI cs.RO

    XVO: Generalized Visual Odometry via Cross-Modal Self-Training

    Authors: Lei Lai, Zhongkai Shangguan, Jimuyang Zhang, Eshed Ohn-Bar

    Abstract: We propose XVO, a semi-supervised learning method for training generalized monocular Visual Odometry (VO) models with robust off-the-self operation across diverse datasets and settings. In contrast to standard monocular VO approaches which often study a known calibration within a single dataset, XVO efficiently learns to recover relative pose with real-world scale from visual scene semantics, i.e.… ▽ More

    Submitted 8 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: ICCV 2023, Paris https://genxvo.github.io/

  26. arXiv:2309.16276  [pdf

    cond-mat.mtrl-sci

    Hidden phase uncovered by ultrafast carrier dynamics in thin Bi2O2Se

    Authors: Hao Li, Adeela Nairan, Xiaoran Niu, Yuxiang Chen, Huarui Sun, Linqing Lai, **gkai Qin, Leyang Dang, Guigen Wang, Usman Khan, Feng He

    Abstract: Bi2O2Se has attracted intensive attention due to its potential in electronics, optoelectronics, as well as ferroelectric applications. Despite that, there have only been a handful of experimental studies based on ultrafast spectroscopy to elucidate the carrier dynamics in Bi2O2Se thin films, Different groups have reported various ultrafast timescales and associated mechanisms across films of diffe… ▽ More

    Submitted 23 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  27. arXiv:2309.07988  [pdf, other

    cs.LG cs.AR cs.SD eess.AS

    Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

    Authors: Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra

    Abstract: Transformer-based models excel in speech recognition. Existing efforts to optimize Transformer inference, typically for long-context applications, center on simplifying attention score calculations. However, streaming speech recognition models usually process a limited number of tokens each time, making attention score calculation less of a bottleneck. Instead, the bottleneck lies in the linear pr… ▽ More

    Submitted 18 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  28. arXiv:2308.14783  [pdf, other

    cs.LG cs.DC cs.IT

    Distributed Dual Coordinate Ascent with Imbalanced Data on a General Tree Network

    Authors: Myung Cho, Lifeng Lai, Weiyu Xu

    Abstract: In this paper, we investigate the impact of imbalanced data on the convergence of distributed dual coordinate ascent in a tree network for solving an empirical loss minimization problem in distributed machine learning. To address this issue, we propose a method called delayed generalized distributed dual coordinate ascent that takes into account the information of the imbalanced data, and provide… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: To be published in IEEE 2023 Workshop on Machine Learning for Signal Processing (MLSP)

  29. arXiv:2308.01490  [pdf, ps, other

    cs.LG stat.ML

    Minimax Optimal Q Learning with Nearest Neighbors

    Authors: Puning Zhao, Lifeng Lai

    Abstract: Analyzing the Markov decision process (MDP) with continuous state spaces is generally challenging. A recent interesting work \cite{shah2018q} solves MDP with bounded continuous state space by a nearest neighbor $Q$ learning approach, which has a sample complexity of $\tilde{O}(\frac{1}{ε^{d+3}(1-γ)^{d+7}})$ for $ε$-accurate $Q$ function estimation with discount factor $γ$. In this paper, we propos… ▽ More

    Submitted 17 June, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  30. arXiv:2308.00899  [pdf, other

    math.OC

    Global stability of first-order methods for coercive tame functions

    Authors: Cédric Josz, Lexiao Lai

    Abstract: We consider first-order methods with constant step size for minimizing locally Lipschitz coercive functions that are tame in an o-minimal structure on the real field. We prove that if the method is approximated by subgradient trajectories, then the iterates eventually remain in a neighborhood of a connected component of the set of critical points. Under suitable method-dependent regularity assumpt… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 30 pages, 1 figure

  31. arXiv:2307.15374  [pdf

    eess.SY

    Leveraging Optical Communication Fiber and AI for Distributed Water Pipe Leak Detection

    Authors: Huan Wu, Huan-Feng Duan, Wallace W. L. Lai, Kun Zhu, Xin Cheng, Hao Yin, Bin Zhou, Chun-Cheung Lai, Chao Lu, Xiaoli Ding

    Abstract: Detecting leaks in water networks is a costly challenge. This article introduces a practical solution: the integration of optical network with water networks for efficient leak detection. Our approach uses a fiber-optic cable to measure vibrations, enabling accurate leak identification and localization by an intelligent algorithm. We also propose a method to access leak severity for prioritized re… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted

    Journal ref: IEEE Communications Magazine, 2023

  32. arXiv:2307.07748  [pdf, other

    eess.AS

    Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations

    Authors: Richard Lee Lai, Jen-Cheng Hou, Mandar Gogate, Kia Dashtipour, Amir Hussain, Yu Tsao

    Abstract: Individuals with hearing impairments face challenges in their ability to comprehend speech, particularly in noisy environments. The aim of this study is to explore the effectiveness of audio-visual speech enhancement (AVSE) in enhancing the intelligibility of vocoded speech in cochlear implant (CI) simulations. Notably, the study focuses on a challenged scenario where there is limited availability… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  33. arXiv:2307.07670  [pdf, other

    cs.LG cs.AI cs.CR math.OC

    Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning

    Authors: Guanlin Liu, Lifeng Lai

    Abstract: Due to the broad range of applications of multi-agent reinforcement learning (MARL), understanding the effects of adversarial attacks against MARL model is essential for the safe applications of this model. Motivated by this, we investigate the impact of adversarial attacks on MARL. In the considered setup, there is an exogenous attacker who is able to modify the rewards before the agents receive… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  34. arXiv:2307.07666  [pdf, other

    cs.LG cs.AI math.OC

    Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty

    Authors: Guanlin Liu, Zhihan Zhou, Han Liu, Lifeng Lai

    Abstract: Robust reinforcement learning (RL) aims to find a policy that optimizes the worst-case performance in the face of uncertainties. In this paper, we focus on action robust RL with the probabilistic policy execution uncertainty, in which, instead of always carrying out the action specified by the policy, the agent will take the action specified by the policy with probability $1-ρ$ and an alternative… ▽ More

    Submitted 20 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

  35. arXiv:2307.07239  [pdf, other

    hep-ph

    Probing new physics with polarized $τ$ and $Λ_c$ in quasielastic $ν_τ\!+\!n\!\to\! τ^-\!+\!Λ_c$ scattering process

    Authors: Ya-Ru Kong, Li-Fen Lai, Xin-Qiang Li, Xin-Shuai Yan, Ya-Dong Yang, Dong-Hui Zheng

    Abstract: The absence of semitauonic decays of charmed hadrons makes the decay processes mediated by the quark-level $c\to d τ^+ ν_τ$ transition inadequate for probing a generic new physics (NP) with all kinds of Dirac structures. To fill in this gap, we consider in this paper the quasielastic neutrino scattering process $ν_τ+n\to τ^-+Λ_c$, and propose searching for NP through the polarizations of the $τ$ l… ▽ More

    Submitted 14 November, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 31 pages, 17 figures, and 3 tables. Comments are welcome

  36. arXiv:2307.03331  [pdf, ps, other

    math.OC

    Convergence of the momentum method for semialgebraic functions with locally Lipschitz gradients

    Authors: Cédric Josz, Lexiao Lai, Xiaopeng Li

    Abstract: We propose a new length formula that governs the iterates of the momentum method when minimizing differentiable semialgebraic functions with locally Lipschitz gradients. It enables us to establish local convergence, global convergence, and convergence to local minimizers without assuming global Lipschitz continuity of the gradient, coercivity, and a global growth condition, as is done in the liter… ▽ More

    Submitted 7 January, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 33 pages. Accepted for publication at SIAM Journal on Optimization

  37. arXiv:2306.10393  [pdf, ps, other

    math.NT

    Many $p$-adic odd zeta values are irrational

    Authors: Li Lai, Johannes Sprang

    Abstract: For any prime $p$ and $\varepsilon>0$ we prove that for any sufficiently large positive odd integer $s$ at least $(c_p-\varepsilon) \sqrt{\frac{s}{\log s}}$ of the $p$-adic zeta values $ζ_p(3),ζ_p(5),\dots,ζ_p(s)$ are irrational. The constant $c_p$ is positive and does only depend on $p$. This result establishes a $p$-adic version of the elimination technique used by Fischler--Sprang--Zudilin and… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: 35 pages

    MSC Class: 11J72 (Primary) 11F85; 11M06 (Secondary)

  38. arXiv:2306.00838  [pdf, other

    q-bio.OT eess.IV

    The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI

    Authors: Ahmed W. Moawad, Anastasia Janas, Ujjwal Baid, Divya Ramakrishnan, Rachit Saluja, Nader Ashraf, Leon Jekel, Raisa Amiruddin, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Sanjay Aneja, Syed Muhammad Anwar, Timothy Bergquist, Evan Calabrese, Veronica Chiang, Verena Chung, Gian Marco Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang , et al. (206 additional authors not shown)

    Abstract: The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara… ▽ More

    Submitted 17 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  39. arXiv:2305.12420  [pdf, other

    cs.IR

    Multi-factor Sequential Re-ranking with Perception-Aware Diversification

    Authors: Yue Xu, Hao Chen, Zefan Wang, Jianwen Yin, Qijie Shen, Dimin Wang, Feiran Huang, Lixiang Lai, Tao Zhuang, Junfeng Ge, Xia Hu

    Abstract: Feed recommendation systems, which recommend a sequence of items for users to browse and interact with, have gained significant popularity in practical applications. In feed products, users tend to browse a large number of items in succession, so the previously viewed items have a significant impact on users' behavior towards the following items. Therefore, traditional methods that mainly focus on… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Journal ref: KDD 2023

  40. arXiv:2305.12319  [pdf, other

    cs.IR

    Multi-channel Integrated Recommendation with Exposure Constraints

    Authors: Yue Xu, Qijie Shen, Jianwen Yin, Zengde Deng, Dimin Wang, Hao Chen, Lixiang Lai, Tao Zhuang, Junfeng Ge

    Abstract: Integrated recommendation, which aims at jointly recommending heterogeneous items from different channels in a main feed, has been widely applied to various online platforms. Though attractive, integrated recommendation requires the ranking methods to migrate from conventional user-item models to the new user-channel-item paradigm in order to better capture users' preferences on both item and chan… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Journal ref: KDD 2023

  41. arXiv:2305.11243  [pdf

    cs.CL cs.AI

    Comparing Machines and Children: Using Developmental Psychology Experiments to Assess the Strengths and Weaknesses of LaMDA Responses

    Authors: Eliza Kosoy, Emily Rose Reagan, Leslie Lai, Alison Gopnik, Danielle Krettek Cobb

    Abstract: Developmental psychologists have spent decades devising experiments to test the intelligence and knowledge of infants and children, tracing the origin of crucial concepts and capacities. Moreover, experimental techniques in developmental psychology have been carefully designed to discriminate the cognitive capacities that underlie particular behaviors. We propose that using classical experiments f… ▽ More

    Submitted 7 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 9 pages, 7 figures

  42. arXiv:2305.04145  [pdf

    cs.GT

    A Novel Reward Sha** Function for Single-Player Mahjong

    Authors: Kai Jun Chen, Lok Him Lai, Zi Iun Lai

    Abstract: Mahjong is a complex game with an intractably large state space with extremely sparse rewards, which poses challenges to develop an agent to play Mahjong. To overcome this, the ShangTing function was adopted as a reward sha** function. This was combined with a forward-search algorithm to create an agent capable of completing a winning hand in Single-player Mahjong (an average of 35 actions over… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  43. arXiv:2304.00816  [pdf, ps, other

    math.NT

    On the irrationality of certain $2$-adic zeta values

    Authors: Li Lai

    Abstract: Let $ζ_2(\cdot)$ be the Kubota-Leopoldt $2$-adic zeta function. We prove that, for every nonnegative integer $s$, there exists an odd integer $j$ in the interval $[s+3,3s+5]$ such that $ζ_2(j)$ is irrational. In particular, at least one of $ζ_2(7),ζ_2(9),ζ_2(11),ζ_2(13)$ is irrational. Our approach is inspired by the recent work of Sprang. We construct explicit rational functions. The Volkenborn… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 21 pages

    MSC Class: 11J72 (Primary) 11F85; 11M06 (Secondary)

  44. arXiv:2301.10904  [pdf, other

    cs.CR cs.DC cs.LG

    GPU-based Private Information Retrieval for On-Device Machine Learning Inference

    Authors: Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Ilias Leontiadis, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, G. Edward Suh

    Abstract: On-device machine learning (ML) inference can enable the use of private user data on user devices without revealing them to remote servers. However, a pure on-device solution to private ML inference is impractical for many applications that rely on embedding tables that are too large to be stored on-device. In particular, recommendation models typically use multiple embedding tables each on the or… ▽ More

    Submitted 25 September, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  45. arXiv:2301.00167  [pdf, other

    q-bio.QM

    Synthesis-driven design of 3D molecules for structure-based drug discovery using geometric transformers

    Authors: Yibo Li, Jianfeng Pei, Luhua Lai

    Abstract: Finding drug-like compounds with high bioactivity is essential for drug discovery, but the task is complicated by the high cost of chemical synthesis and validation. With their outstanding performance in de novo drug design, deep generative models represent promising tools for tackling this challenge. In recently years, 3D molecule generative models have gained increasing attention due to their ab… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

  46. arXiv:2212.03414  [pdf, other

    cs.DC cs.LG

    DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

    Authors: Seah Kim, Hyoukjun Kwon, **ook Song, Jihyuck Jo, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra

    Abstract: Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not completely predictable, unlike traditional ML workloads. In addition, RTMM workloads require real-time… ▽ More

    Submitted 20 September, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 14 pages

  47. arXiv:2211.14852  [pdf, other

    math.OC

    Sufficient conditions for instability of the subgradient method with constant step size

    Authors: Cédric Josz, Lexiao Lai

    Abstract: We provide sufficient conditions for instability of the subgradient method with constant step size around a local minimum of a locally Lipschitz semi-algebraic function. They are satisfied by several spurious local minima arising in robust principal component analysis and neural networks.

    Submitted 29 June, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: 18 pages, 5 figures

  48. Lyapunov stability of the subgradient method with constant step size

    Authors: Cédric Josz, Lexiao Lai

    Abstract: We consider the subgradient method with constant step size for minimizing locally Lipschitz semi-algebraic functions. In order to analyze the behavior of its iterates in the vicinity of a local minimum, we introduce a notion of discrete Lyapunov stability and propose necessary and sufficient conditions for stability.

    Submitted 6 March, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: 11 pages, 2 figures

    MSC Class: 65K05 90C30

    Journal ref: Mathematical Programming 2023

  49. arXiv:2211.14848  [pdf, other

    math.OC

    Nonsmooth rank-one matrix factorization landscape

    Authors: Cédric Josz, Lexiao Lai

    Abstract: We provide the first positive result on the nonsmooth optimization landscape of robust principal component analysis, to the best of our knowledge. It is the object of several conjectures and remains mostly uncharted territory. We identify a necessary and sufficient condition for the absence of spurious local minima in the rank-one case. Our proof exploits the subdifferential regularity of the obje… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: 23 pages, 5 figures

  50. arXiv:2211.08675  [pdf, other

    cs.LG cs.ET

    XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse

    Authors: Hyoukjun Kwon, Krishnakumar Nair, Jamin Seo, Jason Yik, Debabrata Mohapatra, Dongyuan Zhan, **ook Song, Peter Capak, Peizhao Zhang, Peter Vajda, Colby Banbury, Mark Mazumder, Liangzhen Lai, Ashish Sirasao, Tushar Krishna, Harshit Khaitan, Vikas Chandra, Vijay Janapa Reddi

    Abstract: Real-time multi-task multi-model (MTMM) workloads, a new form of deep learning inference workloads, are emerging for applications areas like extended reality (XR) to support metaverse use cases. These workloads combine user interactivity with computationally complex machine learning (ML) activities. Compared to standard ML applications, these ML workloads present unique difficulties and constraint… ▽ More

    Submitted 19 May, 2023; v1 submitted 16 November, 2022; originally announced November 2022.