Skip to main content

Showing 1–50 of 255 results for author: Laan, T

.
  1. arXiv:2407.02031  [pdf, other

    cs.DC cs.AI cs.LG

    SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules

    Authors: Suyi Li, Lingyun Yang, Xiaoxiao Jiang, Hanfeng Lu, Zhipeng Di, Weiyi Lu, Jiawei Chen, Kan Liu, Yinghao Yu, Tao Lan, Guodong Yang, Lin Qu, Li** Zhang, Wei Wang

    Abstract: This paper documents our characterization study and practices for serving text-to-image requests with stable diffusion models in production. We first comprehensively analyze inference request traces for commercial text-to-image applications. It commences with our observation that add-on modules, i.e., ControlNets and LoRAs, that augment the base stable diffusion models, are ubiquitous in generatin… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.18518  [pdf, other

    cs.CL cs.AI cs.LG cs.SE

    APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

    Authors: Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong

    Abstract: The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets. This paper presents APIGen, an automated data generation pipeline designed to synthesize verifiable high-quality datasets for function-calling applications. We leverage APIGen and collect 3,673 executable APIs across 21 different categories to generate diverse function-calling datasets in a scal… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2405.19878  [pdf, other

    cs.LG cs.GT

    Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models

    Authors: Zeyu Fang, Tian Lan

    Abstract: Generative models such as diffusion have been employed as world models in offline reinforcement learning to generate synthetic data for more effective learning. Existing work either generates diffusion models one-time prior to training or requires additional interaction data to update it. In this paper, we propose a novel approach for offline reinforcement learning with closed-loop policy evaluati… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2405.16657  [pdf, other

    astro-ph.CO

    ELG Spectroscopic Systematics Analysis of the DESI Data Release 1

    Authors: Jiaxi Yu, Ashley J. Ross, Antoine Rocher, Otávio Alves, Arnaud de Mattia, Daniel Forero-Sánchez, Jean-Paul Kneib, Alex Krolewski, TingWen Lan, Michael Rashkovetskyi, Jessica Nicole Aguilar, Steven Ahlen, Stephen Bailey, David Brooks, Edmond Chaussidon, Todd Claybaugh, Axel de la Macorra, Arjun Dey, Biprateep Dey, Peter Doel, Kevin Fanning, Jaime E. Forero-Romero, Enrique Gaztañaga, Satya Gontcho A Gontcho, Klaus Honscheid , et al. (36 additional authors not shown)

    Abstract: Dark Energy Spectroscopic Instrument (DESI) uses more than 2.4 million Emission Line Galaxies (ELGs) for 3D large-scale structure (LSS) analyses in its Data Release 1 (DR1). Such large statistics enable thorough research on systematic uncertainties. In this study, we focus on spectroscopic systematics of ELGs. The redshift success rate ($f_{\rm goodz}$) is the relative fraction of secure redshifts… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  5. arXiv:2405.16386  [pdf, other

    cs.LG cs.AI

    Variational Offline Multi-agent Skill Discovery

    Authors: Jiayu Chen, Bhargav Ganguly, Tian Lan, Vaneet Aggarwal

    Abstract: Skills are effective temporal abstractions established for sequential decision making tasks, which enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability. Despite extensive research, research gaps remain in multi-agent scenarios, particularly for automatically extracting subgroup coordination patterns in a multi-agent task. In… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.14122  [pdf, other

    cs.GT

    Modeling Other Players with Bayesian Beliefs for Games with Incomplete Information

    Authors: Zuyuan Zhang, Mahdi Imani, Tian Lan

    Abstract: Bayesian games model interactive decision-making where players have incomplete information -- e.g., regarding payoffs and private data on players' strategies and preferences -- and must actively reason and update their belief models (with regard to such information) using observation and interaction history. Existing work on counterfactual regret minimization have shown great success for games wit… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2105.08440 by other authors

  7. arXiv:2405.13748  [pdf, other

    cs.CV

    Monocular Gaussian SLAM with Language Extended Loop Closure

    Authors: Tian Lan, Qinwei Lin, Haoqian Wang

    Abstract: Recently,3DGaussianSplattinghasshowngreatpotentialin visual Simultaneous Localization And Map** (SLAM). Existing methods have achieved encouraging results on RGB-D SLAM, but studies of the monocular case are still scarce. Moreover, they also fail to correct drift errors due to the lack of loop closure and global optimization. In this paper, we present MG-SLAM, a monocular Gaussian SLAM with a la… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  8. arXiv:2405.08314  [pdf, other

    astro-ph.GA

    Probing the impact of radio-mode feedback on the properties of the cool circumgalactic medium

    Authors: Yu-Ling Chang, Ting-Wen Lan, J. Xavier Prochaska, Lucas Napolitano, Abhijeet Anand, J. Aguilar, S. Ahlen, D. Brooks, T. Claybaugh, A. de la Macorra, Arjun Dey, P. Doel, S. Gontcho A Gontcho, J. Guy, S. Juneau, T. Kisner, A. Lambert, M. Landriau, L. Le Guillou, M. Manera, P. Martini, A. Meisner, R. Miquel, J. Moustakas, A. D. Myers , et al. (11 additional authors not shown)

    Abstract: We explore the influence of radio-mode feedback on the properties of the cool circumgalactic medium (CGM). To this end, we assemble a statistical sample of approximately 30,000 radio galaxies with background quasars by combining optical spectroscopic measurements of luminous red galaxies (LRGs) and quasars from the year 1 dataset of Dark Energy Spectroscopic Instrument (DESI) and radio sources fro… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 20 pages, 12 figures

  9. arXiv:2405.03967  [pdf, other

    cs.LG cs.AI cs.AR

    SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

    Authors: Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani

    Abstract: Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  10. arXiv:2404.13836  [pdf, other

    stat.ME

    MultiFun-DAG: Multivariate Functional Directed Acyclic Graph

    Authors: Tian Lan, Ziyue Li, Junpeng Lin, Zhishuai Li, Lei Bai, Man Li, Fugee Tsung, Rui Zhao, Chen Zhang

    Abstract: Directed Acyclic Graphical (DAG) models efficiently formulate causal relationships in complex systems. Traditional DAGs assume nodes to be scalar variables, characterizing complex systems under a facile and oversimplified form. This paper considers that nodes can be multivariate functional data and thus proposes a multivariate functional DAG (MultiFun-DAG). It constructs a hidden bilinear multivar… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  11. arXiv:2404.03002  [pdf, other

    astro-ph.CO

    DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, B. Bahr-Kalus, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, A. Bera, F. Beutler, D. Bianchi, C. Blake, R. Blum , et al. (178 additional authors not shown)

    Abstract: We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the s… ▽ More

    Submitted 24 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers). Typos corrected and a new figure and discussion added to Appendix A

  12. arXiv:2404.03001  [pdf, other

    astro-ph.CO

    DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Bautista, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden , et al. (174 additional authors not shown)

    Abstract: We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  13. arXiv:2404.03000  [pdf, other

    astro-ph.CO

    DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (171 additional authors not shown)

    Abstract: We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 qu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  14. arXiv:2403.15341  [pdf, other

    cs.AI cs.MA

    Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

    Authors: Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

    Abstract: With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand. However, existing approaches for training collaborative agents often require defined and known reward signals and cannot address the problem of teaming with unknown agents that often have latent objectives/re… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  15. arXiv:2403.01954  [pdf, other

    cs.CL cs.AI cs.LO

    DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation

    Authors: Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu

    Abstract: Constrained decoding approaches aim to control the meaning or style of text generated by a Pre-trained Language Model (PLM) using specific target words during inference. However, these methods often guide plausible continuations by greedily selecting targets, which, while completing the task, may disrupt the natural patterns of human language generation. In this work, we propose a novel decoding f… ▽ More

    Submitted 7 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE TKDE (Major Revision), 13 pages, 6 figures

  16. arXiv:2403.01890  [pdf, other

    cs.RO

    Aerial Tensile Perching and Disentangling Mechanism for Long-Term Environmental Monitoring

    Authors: Tian Lan, Luca Romanello, Mirko Kovac, Sophie F. Armanini, Basaran Bahadir Kocer

    Abstract: Aerial robots show significant potential for forest canopy research and environmental monitoring by providing data collection capabilities at high spatial and temporal resolutions. However, limited flight endurance hinders their application. Inspired by natural perching behaviours, we propose a multi-modal aerial robot system that integrates tensile perching for energy conservation and a suspended… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures, Accepted in IEEE International Conference on Robotics and Automation (ICRA) 2024

  17. arXiv:2403.01642  [pdf

    cs.LG cs.CE eess.SY

    Blue and Green-Mode Energy-Efficient Chemiresistive Sensor Array Realized by Rapid Ensemble Learning

    Authors: Zeheng Wang, James Cooper, Muhammad Usman, Timothy van der Laan

    Abstract: The rapid advancement of Internet of Things (IoT) necessitates the development of optimized Chemiresistive Sensor (CRS) arrays that are both energy-efficient and capable. This study introduces a novel optimization strategy that employs a rapid ensemble learning-based model committee approach to achieve these goals. Utilizing machine learning models such as Elastic Net Regression, Random Forests, a… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: First version before submission

  18. arXiv:2403.01577  [pdf, ps, other

    cond-mat.str-el math-ph

    Torus algebra and logical operators at low energy

    Authors: Ying Chan, Tian Lan, Linqian Wu

    Abstract: Given a modular tensor category $\mathscr{C}$, we construct an associative algebra $\mathrm{Tor({\mathscr{C}}})$, which we call the torus algebra. We prove that the torus algebra is semisimple by explicitly constructing all the simple modules. Suppose that a topological ordered phase described by $\mathscr{C}$ is put on a torus. Physically, each simple module over $\mathrm{Tor({\mathscr{C}}})$ con… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 22 pages, 1 figure

  19. arXiv:2402.19253  [pdf, ps, other

    cond-mat.str-el hep-th

    Condensation Completion and Defects in 2+1D Topological Orders

    Authors: Gen Yue, Longye Wang, Tian Lan

    Abstract: We review the condensation completion of a modular tensor category, which yields a fusion 2-category of codimension-1 and higher defects in a $2+1$D topological order. We apply the condensation completion to $2+1$D toric code model and a $\mathbbm Z_4$ chiral topological order. In both cases, we explicitly enumerate the $1$d and $0$d defects present in these topological orders, along with their fu… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  20. arXiv:2402.15538  [pdf, other

    cs.MA cs.AI

    AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

    Authors: Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese

    Abstract: The booming success of LLMs initiates rapid development in LLM agents. Though the foundation of an LLM agent is the generative model, it is critical to devise the optimal reasoning strategies and agent architectures. Accordingly, LLM agent research advances from the simple chain-of-thought prompting to more complex ReAct and Reflection reasoning strategy; agent architecture also evolves from singl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: preprint. Library is available at https://github.com/SalesforceAIResearch/AgentLite

  21. arXiv:2402.15506  [pdf, other

    cs.AI cs.CL cs.LG

    AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

    Authors: Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong

    Abstract: Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce \textbf{AgentOhana} as a comprehensive solution to address these challenges. \… ▽ More

    Submitted 20 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Add GitHub repo link at \url{https://github.com/SalesforceAIResearch/xLAM} and HuggingFace model link at \url{https://huggingface.co/Salesforce/xLAM-v0.1-r}

  22. arXiv:2402.13777  [pdf, other

    cs.LG cs.AI

    Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

    Authors: Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

    Abstract: Deep generative models (DGMs) have demonstrated great success across various domains, particularly in generating texts, images, and videos using models trained from offline data. Similarly, data-driven decision-making and robotic control also necessitate learning a generator function from the offline data to serve as the strategy or policy. In this case, applying deep generative models in offline… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: We restructured the paper and added more discussion

  23. arXiv:2402.13764  [pdf, other

    cs.CL cs.AI

    CriticBench: Evaluating Large Language Models as Critic

    Authors: Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xian-ling Mao

    Abstract: Critique ability are crucial in the scalable oversight and self-improvement of Large Language Models (LLMs). While many recent studies explore the critique ability of LLMs to judge and refine flaws in generations, how to comprehensively and reliably measure the critique abilities of LLMs is under-explored. This paper introduces CriticBench, a novel benchmark designed to comprehensively and reliabl… ▽ More

    Submitted 22 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  24. arXiv:2402.12417  [pdf

    cs.LG cs.AI

    Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach

    Authors: Kailai Sun, Tianxiang Lan, Say Hong Kam, Yang Miang Goh, Yueng-Hsiang Huang

    Abstract: There is a rising interest in using artificial intelligence (AI)-powered safety analytics to predict accidents in the trucking industry. Companies may face the practical challenge, however, of not having enough data to develop good safety analytics models. Although pretrained models may offer a solution for such companies, existing safety research using transfer learning has mostly focused on comp… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: submitted to journal: accident analysis and prevention

  25. arXiv:2402.10941  [pdf, other

    cs.CL cs.AI cs.LG

    Text2Data: Low-Resource Data Generation with Textual Control

    Authors: Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: Natural language serves as a common and straightforward control signal for humans to interact seamlessly with machines. Recognizing the importance of this interface, the machine learning community is investing considerable effort in generating data that is semantically coherent with textual instructions. While strides have been made in text-to-data generation spanning image editing, audio synthesi… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: We propose a method that can achieve text-to-data generation under low-resource situation

  26. Effects of Magnetic Helicity on 3D Equilibria and Self-Organized States in KTX Reversed Field Pinch

    Authors: Ke Liu, Guodong Yu, Yuhua Huang, Wenzhe Mao, Yidong Xie, Xianyi Nie, Hong Li, Tao Lan, **lin Xie, Weixing Ding, Wandong Liu, Ge Zhuang, Caoxiang Zhu

    Abstract: The RFP is a toroidal magnetic configuration in which plasmas can spontaneously transform into different self-organized states. Among various states, the QSH state has a dominant component for the magnetic field and significantly improves confinement. Many theoretical and experimental efforts have investigated the transitions among different states. This paper employs the MRxMHD model to study the… ▽ More

    Submitted 6 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  27. arXiv:2401.14544  [pdf, other

    cs.LG math.FA math.PR

    Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data

    Authors: Yongsheng Mei, Mahdi Imani, Tian Lan

    Abstract: Bayesian optimization (BO) has established itself as a leading strategy for efficiently optimizing expensive-to-evaluate functions. Existing BO methods mostly rely on Gaussian process (GP) surrogate models and are not applicable to (doubly-stochastic) Gaussian Cox processes, where the observation process is modulated by a latent intensity function modeled as a GP. In this paper, we propose a novel… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 2024 International Conference on Learning Representations (ICLR)

  28. arXiv:2312.15958  [pdf, other

    cond-mat.str-el hep-th math-ph

    Category of SET orders

    Authors: Tian Lan, Gen Yue, Longye Wang

    Abstract: We propose the representation principle to study physical systems with a given symmetry. In the context of symmetry enriched topological orders, we give the appropriate representation category, the category of SET orders. For fusion n-category symmetries, we show that the category of SET orders encodes almost all information about the interplay between symmetry and topological orders, in a natural… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 21 pages, 8 figures, 1 table. Major revision: The perspective of representation principle is proposed, and the generalized gauging arises as a natural substructure of the category of SET orders

  29. arXiv:2312.15947  [pdf, ps, other

    hep-th math-ph

    On a class of fusion 2-category symmetry: condensation completion of braided fusion category

    Authors: Wenjie Xi, Tian Lan, Longye Wang, Chenjie Wang, Wei-Qiang Chen

    Abstract: Recently, many studies are focused on generalized global symmetry, a mixture of both invertible and non-invertible symmetries in various space-time dimensions. The complete structure of generalized global symmetry is described by higher fusion category theory. In this paper, We first review the construction of fusion 2-category symmetry $Σ\cal B$ where $\cal B$ is a a braided fusion category. In p… ▽ More

    Submitted 10 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 42 pages, 3 figures, All the 10j-symbols of $Σ\mathrm{sVec}$ and the complete computer program has been uploaded on github: https://github.com/WJXI/2sVec.git

  30. arXiv:2312.15555  [pdf, other

    cs.MA

    ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

    Authors: Huiqun Li, Hanhan Zhou, Yifei Zou, Dongxiao Yu, Tian Lan

    Abstract: Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum property, existing works often focus on value factorization using monotonic functions, which are known to result in restricted representation expressiveness. In this p… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

    Journal ref: AAAI 2024

  31. arXiv:2312.11742  [pdf, other

    cs.DC cs.AR cs.LG cs.NI

    ACCL+: an FPGA-Based Collective Engine for Distributed Applications

    Authors: Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso

    Abstract: FPGAs are increasingly prevalent in cloud deployments, serving as Smart NICs or network-attached accelerators. Despite their potential, develo** distributed FPGA-accelerated applications remains cumbersome due to the lack of appropriate infrastructure and communication abstractions. To facilitate the development of distributed applications with FPGAs, in this paper we propose ACCL+, an open-sour… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  32. arXiv:2312.07696  [pdf, ps, other

    cs.CR cs.AI

    Real-time Network Intrusion Detection via Decision Transformers

    Authors: **gdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

    Abstract: Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as a sequence modeling problem, e.g., network intrusion detection from a sequence of arriving packets. Existing approaches like reinforcement learning may not be suitable for such cybersecurity decision problems, since the Markovian property may not necessarily hold and the underlyin… ▽ More

    Submitted 16 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  33. arXiv:2312.07060  [pdf, other

    cs.DC

    Layered Randomized Quantization for Communication-Efficient and Privacy-Preserving Distributed Learning

    Authors: Guangfeng Yan, Tan Li, Tian Lan, Kui Wu, Linqi Song

    Abstract: Next-generation wireless networks, such as edge intelligence and wireless distributed learning, face two critical challenges: communication efficiency and privacy protection. In this work, our focus is on addressing these issues in a distributed learning framework. We consider a new approach that simultaneously achieves communication efficiency and privacy protection by exploiting the privacy adva… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  34. arXiv:2312.02515  [pdf, other

    cs.LG cs.AI

    ASPEN: High-Throughput LoRA Fine-Tuning of Large Language Models with a Single GPU

    Authors: Zhengmao Ye, Dengchun Li, **gqi Tian, Tingfeng Lan, Jie Zuo, Lei Duan, Hui Lu, Yexi Jiang, Jian Sha, Ke Zhang, Mingjie Tang

    Abstract: Transformer-based large language models (LLMs) have demonstrated outstanding performance across diverse domains, particularly when fine-turned for specific domains. Recent studies suggest that the resources required for fine-tuning LLMs can be economized through parameter-efficient methods such as Low-Rank Adaptation (LoRA). While LoRA effectively reduces computational burdens and resource demands… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 14 figures

  35. arXiv:2311.17630  [pdf, other

    cs.NI eess.SP

    Optimization in Mobile Augmented Reality Systems for the Metaverse over Wireless Communications

    Authors: Tianming Lan, Jun Zhao

    Abstract: As the essential technical support for Metaverse, Mobile Augmented Reality (MAR) has attracted the attention of many researchers. MAR applications rely on real-time processing of visual and audio data, and thus those heavy workloads can quickly drain the battery of a mobile device. To address such problem, edge-based solutions have appeared for handling some tasks that require more computing power… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: This paper appears in IEEE Global Communications Conference (GLOBECOM) 2023

  36. arXiv:2311.16018  [pdf, other

    cs.CR cs.AI

    RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture

    Authors: **gdi Chen, Lei Zhang, Joseph Riem, Gina Adam, Nathaniel D. Bastian, Tian Lan

    Abstract: Deep Learning (DL) based methods have shown great promise in network intrusion detection by identifying malicious network traffic behavior patterns with high accuracy, but their applications to real-time, packet-level detections in high-speed communication networks are challenging due to the high computation time and resource requirements of Deep Neural Networks (DNNs), as well as lack of explaina… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  37. arXiv:2311.13235  [pdf

    physics.app-ph physics.chem-ph

    Strong Light-Matter Coupling Facilitated Charge Carrier Transport in Cavity Organic Solar Cells

    Authors: Yahui Tang, Alexandra Stuart, Timothy van der Laan, Girish Lakhwani

    Abstract: Strong light-matter coupling has shown great potential for modifying the electro-optical properties of semiconducting materials in recent years. In the strong coupling regime, excitons and cavity photons form new states named exciton-polaritons, with their properties a hybrid of each constituent. Herein, we report strong coupling observed in solution-processed donor:acceptor bulk-heterojunction or… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  38. arXiv:2310.19841  [pdf

    cs.LG

    An interpretable clustering approach to safety climate analysis: examining driver group distinction in safety climate perceptions

    Authors: Kailai Sun, Tianxiang Lan, Yang Miang Goh, Sufiana Safiena, Yueng-Hsiang Huang, Bailey Lytle, Yimin He

    Abstract: The transportation industry, particularly the trucking sector, is prone to workplace accidents and fatalities. Accidents involving large trucks accounted for a considerable percentage of overall traffic fatalities. Recognizing the crucial role of safety climate in accident prevention, researchers have sought to understand its factors and measure its impact within organizations. While existing data… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Submitted to Journal:Accident Analysis and Prevention

  39. arXiv:2310.10226  [pdf, other

    cs.CL

    Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective

    Authors: Huayang Li, Tian Lan, Zihao Fu, Deng Cai, Lemao Liu, Nigel Collier, Taro Watanabe, Yixuan Su

    Abstract: There are a number of diverging hypotheses about the neural text degeneration problem, i.e., generating repetitive and dull loops, which makes this problem both interesting and confusing. In this work, we aim to advance our understanding by presenting a straightforward and fundamental explanation from the data perspective. Our preliminary investigation reveals a strong correlation between the dege… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  40. arXiv:2310.08670  [pdf, other

    cs.LG cs.DC

    Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

    Authors: Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

    Abstract: Cross-device Federated Learning (FL) faces significant challenges where low-end clients that could potentially make unique contributions are excluded from training large models due to their resource bottlenecks. Recent research efforts have focused on model-heterogeneous FL, by extracting reduced-size models from the global model and applying them to local clients accordingly. Despite the empirica… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  41. arXiv:2309.12606  [pdf, other

    math.NA

    Stable Reconstruction of Anisotropic Objects from Near-Field Electromagnetic Data

    Authors: Tran H. Lan, Dinh-Liem Nguyen

    Abstract: This paper addresses the electromagnetic inverse scattering problem of determining the location and shape of anisotropic objects from near-field data. We investigate both cases involving the Helmholtz equation and Maxwell's equations for this inverse problem. Our study focuses on develo** efficient imaging functionals that enable a fast and stable recovery of the anisotropic object. The implemen… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 22 pages

  42. arXiv:2309.04707  [pdf, other

    cs.AI cs.LG

    Advantage Actor-Critic with Reasoner: Explaining the Agent's Behavior from an Exploratory Perspective

    Authors: Muzhe Guo, Feixu Yu, Tian Lan, Fang **

    Abstract: Reinforcement learning (RL) is a powerful tool for solving complex decision-making problems, but its lack of transparency and interpretability has been a major challenge in domains where decisions have significant real-world consequences. In this paper, we propose a novel Advantage Actor-Critic with Reasoner (A2CR), which can be easily applied to Actor-Critic-based RL models and make them interpre… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  43. arXiv:2308.14897  [pdf, other

    cs.LG cs.AI cs.DC

    Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

    Authors: Hanhan Zhou, Tian Lan, Vaneet Aggarwal

    Abstract: Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment. Recent work has shown that offline reinforcement learning can be formulated as a sequence modeling problem and solved via supervised learning with approaches such as decision transformer. While these sequence-based methods a… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  44. arXiv:2308.03358  [pdf, other

    cs.AI

    RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement Learning

    Authors: **gdi Chen, Tian Lan, Carlee Joe-Wong

    Abstract: Communication is crucial for solving cooperative Multi-Agent Reinforcement Learning tasks in partially observable Markov Decision Processes. Existing works often rely on black-box methods to encode local information/features into messages shared with other agents, leading to the generation of continuous messages with high communication overhead and poor interpretability. Prior attempts at discrete… ▽ More

    Submitted 18 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  45. arXiv:2308.00258  [pdf, other

    cs.LG cs.DC

    AQUILA: Communication Efficient Federated Learning with Adaptive Quantization in Device Selection Strategy

    Authors: Zihao Zhao, Yuzhu Mao, Zhenpeng Shi, Yang Liu, Tian Lan, Wenbo Ding, Xiao-** Zhang

    Abstract: The widespread adoption of Federated Learning (FL), a privacy-preserving distributed learning methodology, has been impeded by the challenge of high communication overheads, typically arising from the transmission of large-scale models. Existing adaptive quantization methods, designed to mitigate these overheads, operate under the impractical assumption of uniform device participation in every tra… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

  46. arXiv:2307.11629  [pdf, other

    cs.LG cs.MA

    Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs

    Authors: Jiayu Chen, **gdi Chen, Tian Lan, Vaneet Aggarwal

    Abstract: Covering skill (a.k.a., option) discovery has been developed to improve the exploration of RL in single-agent scenarios with sparse reward signals, through connecting the most distant states in the embedding space provided by the Fiedler vector of the state transition graph. Given that joint state space grows exponentially with the number of agents in multi-agent systems, existing researches still… ▽ More

    Submitted 20 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2022. arXiv admin note: substantial text overlap with arXiv:2201.08227

  47. arXiv:2307.06962  [pdf, other

    cs.CL cs.AI

    Copy Is All You Need

    Authors: Tian Lan, Deng Cai, Yan Wang, Heyan Huang, Xian-Ling Mao

    Abstract: The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary. In this paper, we formulate text generation as progressively copying text segments (e.g., words or phrases) from an existing text collection. We compute the contextualized representations of meaningful text segments and index them using efficient vector search toolkits. The task of text… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Journal ref: The Eleventh International Conference on Learning Representations (ICLR 2023)

  48. arXiv:2307.02099  [pdf, other

    math.NA

    The Predictability of Stock Price: Empirical Study onTick Data in Chinese Stock Market

    Authors: Yueshan Chen, Xingyu Xu, Tian Lan, Sihai Zhang

    Abstract: Whether or not stocks are predictable has been a topic of concern for decades.The efficient market hypothesis (EMH) says that it is difficult for investors to make extra profits by predicting stock prices, but this may not be true, especially for the Chinese stock market. Therefore, we explore the predictability of the Chinese stock market based on tick data, a widely studied high-frequency data.… ▽ More

    Submitted 5 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  49. arXiv:2306.17054  [pdf, other

    cs.NI

    Two-tiered Online Optimization of Region-wide Datacenter Resource Allocation via Deep Reinforcement Learning

    Authors: Chang-Lin Chen, Hanhan Zhou, Jiayu Chen, Mohammad Pedramfar, Vaneet Aggarwal, Tian Lan, Zheqing Zhu, Chi Zhou, Tim Gasser, Pol Mauri Ruiz, Vijay Menon, Neeraj Kumar, Hongbo Dong

    Abstract: This paper addresses the important need for advanced techniques in continuously allocating workloads on shared infrastructures in data centers, a problem arising due to the growing popularity and scale of cloud computing. It particularly emphasizes the scarcity of research ensuring guaranteed capacity in capacity reservations during large-scale failures. To tackle these issues, the paper presents… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  50. arXiv:2306.14616  [pdf

    physics.app-ph

    A Cu3BHT-Graphene van der Waals Heterostructure with Strong Interlayer Coupling

    Authors: Zhiyong Wang, Shuai Fu, Wenjie Zhang, Baokun Liang, Tsai Jung Liu, Mike Hambsch, Jonas F. Pöhls, Yufeng Wu, Jianjun Zhang, Tianshu Lan, Xiaodong Li, Haoyuan Qi, Miroslav Polozij, Stefan C. B. Mannsfeld, Ute Kaiser, Mischa Bonn, R. Thomas Weitz, Thomas Heine, Stuart S. P. Parkin, Hai I Wang, Renhao Dong, Xinliang Feng

    Abstract: Two dimensional van der Waals heterostructures (2D are of significant interest due to their intriguing physical properties that are critically defined by the constituent monolayers and their interlayer coupling . However, typical inorganic 2 D vdWhs fall into the weakly coupled region, limiting efficient interfacial charge flow crucial for develo** high performance quantum opto electronics. Here… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.