Skip to main content

Showing 1–50 of 318 results for author: Cao, T

.
  1. arXiv:2407.02598  [pdf, other

    cs.CV cs.AI

    AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction

    Authors: Mustafa Khan, Hamidreza Fazlali, Dhruv Sharma, Tongtong Cao, Dongfeng Bai, Yuan Ren, Bingbing Liu

    Abstract: Realistic scene reconstruction and view synthesis are essential for advancing autonomous driving systems by simulating safety-critical scenarios. 3D Gaussian Splatting excels in real-time rendering and static scene reconstructions but struggles with modeling driving scenarios due to complex backgrounds, dynamic objects, and sparse views. We propose AutoSplat, a framework employing Gaussian splatti… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.00088  [pdf, other

    cs.DC cs.AI

    T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge

    Authors: Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang

    Abstract: The deployment of Large Language Models (LLMs) on edge devices is increasingly important to enhance on-device intelligence. Weight quantization is crucial for reducing the memory footprint of LLMs on devices. However, low-bit LLMs necessitate mixed precision matrix multiplication (mpGEMM) of low precision weights and high precision activations during inference. Existing systems, lacking native sup… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  3. arXiv:2406.15539  [pdf, other

    hep-ex nucl-ex

    First Measurement of Deeply Virtual Compton Scattering on the Neutron with Detection of the Active Neutron

    Authors: CLAS Collaboration, A. Hobart, S. Niccolai, M. Čuić, K. Kumerički, P. Achenbach, J. S. Alvarado, W. R. Armstrong, H. Atac, H. Avakian, L. Baashen, N. A. Baltzell, L. Barion, M. Bashkanov, M. Battaglieri, B. Benkel, F. Benmokhtar, A. Bianconi, A. S. Biselli, S. Boiarinov, M. Bondi, W. A. Booth, F. Bossù, K. -Th. Brinkmann, W. J. Briscoe , et al. (124 additional authors not shown)

    Abstract: Measuring Deeply Virtual Compton Scattering on the neutron is one of the necessary steps to understand the structure of the nucleon in terms of Generalized Parton Distributions (GPDs). Neutron targets play a complementary role to transversely polarized proton targets in the determination of the GPD $E$. This poorly known and poorly constrained GPD is essential to obtain the contribution of the qua… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

    Report number: JLAB-PHY-24-4089

  4. arXiv:2406.09591  [pdf

    cond-mat.mes-hall cond-mat.str-el

    Ferromagnetism and Topology of the Higher Flat Band in a Fractional Chern Insulator

    Authors: Heonjoon Park, Jiaqi Cai, Eric Anderson, Xiao-Wei Zhang, Xiaoyu Liu, William Holtzmann, Weijie Li, Chong Wang, Chaowei Hu, Yuzhou Zhao, Takashi Taniguchi, Kenji Watanabe, Jihui Yang, David Cobden, Jiun-Haw Chu, Nicolas Regnault, B. Andrei Bernevig, Liang Fu, Ting Cao, Di Xiao, Xiaodong Xu

    Abstract: The recent observation of the fractional quantum anomalous Hall effect in moiré fractional Chern insulators (FCI) provides opportunities for investigating zero magnetic field anyons. So far, both experimental and theoretical results suggest that filling > 1/3 FCI states in the first Chern band share features with those of the lowest Landau level (LL). To create the possibility of realizing non-Abe… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 24 pages, 4 figures

  5. arXiv:2406.00276  [pdf

    cs.LG cs.AI cs.CE physics.data-an

    Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning

    Authors: Shengyu Tao, Mengtian Zhang, Zixi Zhao, Haoyang Li, Ruifei Ma, Yunhong Che, Xin Sun, Lin Su, Xiangyu Chen, Zihao Zhou, Heng Chang, Tingwei Cao, Xiao Xiao, Yaojun Liu, Wenjun Yu, Zhongling Xu, Yang Li, Han Hao, Xuan Zhang, Xiaosong Hu, Guangmin ZHou

    Abstract: Manufacturing complexities and uncertainties have impeded the transition from material prototypes to commercial batteries, making prototype verification critical to quality assessment. A fundamental challenge involves deciphering intertwined chemical processes to characterize degradation patterns and their quantitative relationship with battery performance. Here we show that a physics-informed mac… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    ACM Class: J.2; G.3

  6. arXiv:2405.19853  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Correlated Electronic Structure and Density-Wave Gap in Trilayer Nickelate La4Ni3O10

    Authors: X. Du, Y. D. Li, Y. T. Cao, C. Y. Pei, M. X. Zhang, W. X. Zhao, K. Y. Zhai, R. Z. Xu, Z. K. Liu, Z. W. Li, J. K. Zhao, G. Li, Y. L. Chen, Y. P. Qi, H. J. Guo, L. X. Yang

    Abstract: The discovery of pressurized superconductivity at 80 K in La3Ni2O7 officially brings nickelates into the family of high-temperature superconductors, which gives rise to not only new insights but also mysteries in the strongly correlated superconductivity. More recently, the sibling compound La4Ni3O10 was also shown to be superconducting below about 25 K under pressure, further boosting the popular… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  7. arXiv:2405.19308  [pdf, other

    cond-mat.mes-hall

    Visualizing the microscopic origins of topology in twisted molybdenum ditelluride

    Authors: Ellis Thompson, Keng Tou Chu, Florie Mesple, Xiao-Wei Zhang, Chaowei Hu, Yuzhou Zhao, Heonjoon Park, Jiaqi Cai, Eric Anderson, Kenji Watanabe, Takashi Taniguchi, Jihui Yang, Jiun-Haw Chu, Xiaodong Xu, Ting Cao, Di Xiao, Matthew Yankowitz

    Abstract: In moiré materials with flat electronic bands and suitable quantum geometry, strong correlations can give rise to novel topological states of matter. The nontrivial band topology of twisted molybdenum ditelluride (tMoTe$_2$) -- responsible for its fractional quantum anomalous Hall (FQAH) states -- is predicted to arise from a layer-pseudospin skyrmion lattice. Tracing the layer polarization of wav… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures, Extended Data, 9 figures, Supplementary Information, 8 pages, 5 figures

  8. arXiv:2405.10318  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Gauge theory of giant phonon magnetic moment in doped Dirac semimetals

    Authors: Wenqin Chen, Xiao-Wei Zhang, Ying Su, Ting Cao, Di Xiao, Shi-Zeng Lin

    Abstract: We present a quantum theory of phonon magnetic moment in doped Dirac semimetals. Our theory is based on an emergent gauge field approach to the electron-phonon coupling, applicable to both gapless and gapped systems. We find that the magnetic moment is directly proportional to the electrical Hall conductivity through the phonon Hall viscosity. Our theory is combined with the first-principles calcu… ▽ More

    Submitted 20 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, supplemental materials included

  9. arXiv:2405.01394  [pdf, other

    cs.AI

    Analysis of a Modular Autonomous Driving Architecture: The Top Submission to CARLA Leaderboard 2.0 Challenge

    Authors: Weize Zhang, Mohammed Elmahgiubi, Kasra Rezaee, Behzad Khamidehi, Hamidreza Mirkhani, Fazel Arasteh, Chunlin Li, Muhammad Ahsan Kaleem, Eduardo R. Corral-Soto, Dhruv Sharma, Tongtong Cao

    Abstract: In this paper we present the architecture of the Kyber-E2E submission to the map track of CARLA Leaderboard 2.0 Autonomous Driving (AD) challenge 2023, which achieved first place. We employed a modular architecture for our solution consists of five main components: sensing, localization, perception, tracking/prediction, and planning/control. Our solution leverages state-of-the-art language-assiste… ▽ More

    Submitted 21 March, 2024; originally announced May 2024.

  10. arXiv:2404.10584  [pdf, other

    cs.CV

    ReWiTe: Realistic Wide-angle and Telephoto Dual Camera Fusion Dataset via Beam Splitter Camera Rig

    Authors: Chunli Peng, Xuan Dong, Tiantian Cao, Zhengqing Li, Kun Dong, Weixin Li

    Abstract: The fusion of images from dual camera systems featuring a wide-angle and a telephoto camera has become a hotspot problem recently. By integrating simultaneously captured wide-angle and telephoto images from these systems, the resulting fused image achieves a wide field of view (FOV) coupled with high-definition quality. Existing approaches are mostly deep learning methods, and predominantly rely o… ▽ More

    Submitted 29 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  11. arXiv:2404.06162  [pdf, other

    cs.CL cs.AI cs.LG

    Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

    Authors: Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

    Abstract: As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study… ▽ More

    Submitted 8 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  12. arXiv:2404.05697  [pdf, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Higher Landau-Level Analogues and Signatures of Non-Abelian States in Twisted Bilayer MoTe$_2$

    Authors: Chong Wang, Xiao-Wei Zhang, Xiaoyu Liu, Jie Wang, Ting Cao, Di Xiao

    Abstract: Recent experimental discovery of fractional Chern insulators at zero magnetic field in moiré superlattices has sparked intense interests in bringing Landau level physics to flat Chern bands. In twisted MoTe$_2$ bilayers (tMoTe$_2$), recent theoretical and experimental studies have found three consecutive flat Chern bands at twist angle $\sim 2^\circ$. In this work, we investigate whether higher La… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  13. arXiv:2404.05208  [pdf, other

    cond-mat.mes-hall quant-ph

    Proximity-Induced Exchange Interaction: a New Pathway for Quantum Sensing using Spin Centers in Hexagonal Boron Nitride

    Authors: Lingnan Shen, Di Xiao, Ting Cao

    Abstract: Defects in hexagonal boron nitride (hBN), a two-dimensional van der Waals material, have raised wide range interest for its potential in various quantum applications. Due to hBN's 2D nature, spin center in hBN can be engineered in close proximity to target material, providing advantages over their 3D counterparts, such as nitrogen-vacancy (NV) center in diamond. Here we propose a novel quantum sen… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  14. arXiv:2403.15385  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

    Authors: Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng

    Abstract: Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficiency, enabling fast text-to-3D synthesis. However, they cannot capture high-frequency geometry and texture details and struggle to scale to large prompt sets, so t… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: See the project website at https://research.nvidia.com/labs/toronto-ai/LATTE3D/

    MSC Class: 68T45 ACM Class: I.2.6; I.2.7; I.3.6; I.3.7

  15. arXiv:2403.05012  [pdf

    cond-mat.supr-con cond-mat.str-el

    Ultrafast Dynamics of Bilayer and Trilayer Nickelate Superconductors

    Authors: Y. D. Li, Y. T. Cao, L. Y. Liu, P. Peng, H. Lin, C. Y. Pei, M. X. Zhang, H. Wu, X. Du, W. X. Zhao, K. Y. Zhai, J. K. Zhao, M. -L. Lin, P. H. Tan, Y. P. Qi, G. Li, H. J. Guo, Luyi Yang, L. X. Yang

    Abstract: In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors Lan+1NinO3n+1 (n = 2 and 3) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ult… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  16. arXiv:2403.04997  [pdf, other

    cs.CL cs.CV

    DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

    Authors: Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen **

    Abstract: We present DiffChat, a novel method to align Large Language Models (LLMs) to "chat" with prompt-as-input Text-to-Image Synthesis (TIS) models (e.g., Stable Diffusion) for interactive image creation. Given a raw prompt/image and a user-specified instruction, DiffChat can effectively make appropriate modifications and generate the target prompt, which can be leveraged to create the target image of h… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  17. arXiv:2403.03431  [pdf, other

    cs.CV

    Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing

    Authors: Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huang

    Abstract: Deep Text-to-Image Synthesis (TIS) models such as Stable Diffusion have recently gained significant popularity for creative Text-to-image generation. Yet, for domain-specific scenarios, tuning-free Text-guided Image Editing (TIE) is of greater importance for application developers, which modify objects or object properties in images by manipulating feature components in attention layers during the… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  18. arXiv:2403.02253  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

    Authors: Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Hoon Wei Lim, Bryan Hooi

    Abstract: Phishing attacks have inflicted substantial losses on individuals and businesses alike, necessitating the development of robust and efficient automated phishing detection approaches. Reference-based phishing detectors (RBPDs), which compare the logos on a target webpage to a known set of logos, have emerged as the state-of-the-art approach. However, a major limitation of existing RBPDs is that the… ▽ More

    Submitted 15 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by USENIX Security 2024

  19. arXiv:2403.01417  [pdf, other

    cs.LG cs.DC

    Asyn2F: An Asynchronous Federated Learning Framework with Bidirectional Model Aggregation

    Authors: Tien-Dung Cao, Nguyen T. Vuong, Thai Q. Le, Hoang V. N. Dao, Tram Truong-Huu

    Abstract: In federated learning, the models can be trained synchronously or asynchronously. Many research works have focused on develo** an aggregation method for the server to aggregate multiple local models into the global model with improved performance. They ignore the heterogeneity of the training workers, which causes the delay in the training of the local models, leading to the obsolete information… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  20. arXiv:2402.13822  [pdf, other

    cs.CV

    MSTAR: Multi-Scale Backbone Architecture Search for Timeseries Classification

    Authors: Tue M. Cao, Nhat H. Tran, Hieu H. Pham, Hung T. Nguyen, Le P. Nguyen

    Abstract: Most of the previous approaches to Time Series Classification (TSC) highlight the significance of receptive fields and frequencies while overlooking the time resolution. Hence, unavoidably suffered from scalability issues as they integrated an extensive range of receptive fields into classification models. Other methods, while having a better adaptation for large datasets, require manual design an… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  21. arXiv:2402.12603  [pdf

    cond-mat.mtrl-sci

    Interlayer ferroelectric polarization modulated anomalous Hall effects in four-layer MnBi2Te4 antiferromagnets

    Authors: Ziyu Niu, Xiang-Long Yu, Dingfu Shao, Xixiang **g, Defeng Hou, Xuhong Li, **g Sun, Junqin Shi, Xiaoli Fan, Tengfei Cao

    Abstract: Van der Waals (vdW) assembly could efficiently modulate the symmetry of two-dimensional (2D) materials that ultimately governs their physical properties. Of particular interest is the ferroelectric polarization being introduced by proper vdW assembly that enables the realization of novel electronic, magnetic and transport properties of 2D materials. Four-layer antiferromagnetic MnBi2Te4 (F-MBT) of… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  22. arXiv:2402.10631  [pdf, other

    cs.CL

    BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

    Authors: Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu

    Abstract: The upscaling of Large Language Models (LLMs) has yielded impressive advances in natural language processing, yet it also poses significant deployment challenges. Weight quantization has emerged as a widely embraced solution to reduce memory and computational demands. This paper introduces BitDistiller, a framework that synergizes Quantization-Aware Training (QAT) with Knowledge Distillation (KD)… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  23. arXiv:2402.05981  [pdf, other

    cs.LG cs.PF

    Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

    Authors: Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu

    Abstract: Deep Learning (DL) is increasingly being integrated into Web applications through a method known as "in-browser inference", where the DL processes occur directly within Web browsers. However, the actual performance of this method and its effect on user experience quality (QoE) is not well-understood. This gap in knowledge necessitates new forms of QoE measurement, going beyond traditional metrics… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  24. High-Speed Serial Optical Link Test Bench Using FPGA with Embedded Transceivers

    Authors: Annie C. Xiang, Tingting Cao, Datao Gong, Suen Hou, Chonghan Liu, Tiankuan Liu, Da-Shung Su, **-Kun Teng, **gbo Ye

    Abstract: We develop a custom Bit Error Rate test bench based on Altera's Stratix II GX transceiver signal integrity development kit, demonstrate it on point-to-point serial optical link with data rate up to 5 Gbps, and compare it with commercial stand alone tester. The 8B/10B protocol is implemented and its effects studied. A variable optical attenuator is inserted in the fibre loop to induce transmission… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 5 pages, 8 figures, Proceedings of the Topical Workshop on Electronics for Particle Physics 2009

  25. arXiv:2401.12216  [pdf, other

    stat.ML cs.LG math.OC

    Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning

    Authors: Philip Amortila, Tongyi Cao, Akshay Krishnamurthy

    Abstract: A pervasive phenomenon in machine learning applications is distribution shift, where training and deployment conditions for a machine learning model differ. As distribution shift typically results in a degradation in performance, much attention has been devoted to algorithmic interventions that mitigate these detrimental effects. In this paper, we study the effect of distribution shift in the pres… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  26. arXiv:2401.03416  [pdf, other

    astro-ph.GA

    Active Galactic Nuclei in a Mid-Infrared Selected Galaxy Sample at z>0.13: [Ne V]3426 Line Emission as a Benchmark

    Authors: Zi-Jian Li, Y. Sophia Dai, Jia-Sheng Huang, Stijn Wuyts, Tian-Wen Cao

    Abstract: We present a 24 um-selected spectroscopic sample z > 0.13 (median z = 0.41) in the Lockman Hole field, consisting of 4035 spectra. Our aim is to identify AGNs and determine their fraction in this mid-infrared selected sample. In this work, we use the [Ne V]3426 emission line to spectroscopically identify AGNs. Combined with broad-line Type I AGNs selected in our previous study, our sample consists… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 16 pages, 14 figures. Accepted for publication in ApJ

  27. arXiv:2312.16199  [pdf, other

    cs.IR cs.LG

    Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns

    Authors: Xin Liu, Zheng Li, Yifan Gao, **gfeng Yang, Tianyu Cao, Zhengyang Wang, Bing Yin, Yangqiu Song

    Abstract: The goal of session-based recommendation in E-commerce is to predict the next item that an anonymous user will purchase based on the browsing and purchase history. However, constructing global or local transition graphs to supplement session data can lead to noisy correlations and user intent vanishing. In this work, we propose the Frequent Attribute Pattern Augmented Transformer (FAPAT) that char… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted by NeurIPS 2023

  28. arXiv:2312.11184  [pdf, other

    cs.CV

    View Transition based Dual Camera Image Fusion

    Authors: Tiantian Cao, Xuan Dong, Chunli Peng, Zhengqing Li, Xinyu Guo, Weixin Li

    Abstract: The dual camera system of wide-angle ($\bf{W}$) and telephoto ($\bf{T}$) cameras has been widely adopted by popular phones. In the overlap region, fusing the $\bf{W}$ and $\bf{T}$ images can generate a higher quality image. Related works perform pixel-level motion alignment or high-dimensional feature alignment of the $\bf{T}$ image to the view of the $\bf{W}$ image and then perform image/feature… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  29. arXiv:2312.09445  [pdf, other

    eess.SP cs.CV cs.LG

    IncepSE: Leveraging InceptionTime's performance with Squeeze and Excitation mechanism in ECG analysis

    Authors: Tue Minh Cao, Nhat Hong Tran, Le Phi Nguyen, Hieu Huy Pham, Hung Thanh Nguyen

    Abstract: Our study focuses on the potential for modifications of Inception-like architecture within the electrocardiogram (ECG) domain. To this end, we introduce IncepSE, a novel network characterized by strategic architectural incorporation that leverages the strengths of both InceptionTime and channel attention mechanisms. Furthermore, we propose a training setup that employs stabilization techniques tha… ▽ More

    Submitted 16 November, 2023; originally announced December 2023.

  30. arXiv:2312.07141  [pdf, other

    cs.CL

    Multilingual large language models leak human stereotypes across language boundaries

    Authors: Yang Trista Cao, Anna Sotnikova, Jieyu Zhao, Linda X. Zou, Rachel Rudinger, Hal Daume III

    Abstract: Multilingual large language models have been increasingly popular for their proficiency in processing and generating text across various languages. Previous research has shown that the presence of stereotypes and biases in monolingual large language models can be attributed to the nature of their training data, which is collected from humans and reflects societal biases. Multilingual language mode… ▽ More

    Submitted 8 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  31. arXiv:2311.12776  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Polarization-driven band topology evolution in twisted MoTe$_2$ and WSe$_2$

    Authors: Xiao-Wei Zhang, Chong Wang, Xiaoyu Liu, Yueyao Fan, Ting Cao, Di Xiao

    Abstract: Motivated by recent experimental observations of opposite Chern numbers in $R$-type twisted MoTe$_2$ and WSe$_2$ homobilayers, we perform large-scale density-functional-theory (DFT) calculations with machine learning force fields to investigate moiré band topology from large to small twist angles in both materials. We find that the Chern numbers of the moiré frontier bands change sign as a functio… ▽ More

    Submitted 27 March, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures

    Journal ref: Nature Communications 15, 4223 (2024)

  32. arXiv:2311.09708  [pdf, other

    cs.CL

    A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection

    Authors: Thi-Nhung Nguyen, Hoang Ngo, Kiem-Hieu Nguyen, Tuan-Dung Cao

    Abstract: Our work addresses the problem of unsupervised Aspect Category Detection using a small set of seed words. Recent works have focused on learning embedding spaces for seed words and sentences to establish similarities between sentences and aspects. However, aspect representations are limited by the quality of initial seed words, and model performances are compromised by noise. To mitigate this limit… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023

  33. arXiv:2311.08100  [pdf, other

    cs.CV cs.RO

    PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving

    Authors: Zhili Chen, Maosheng Ye, Shuangjie Xu, Tongyi Cao, Qifeng Chen

    Abstract: We present a new interaction mechanism of prediction and planning for end-to-end autonomous driving, called PPAD (Iterative Interaction of Prediction and Planning Autonomous Driving), which considers the timestep-wise interaction to better integrate prediction and planning. An ego vehicle performs motion planning at each timestep based on the trajectory prediction of surrounding agents (e.g., vehi… ▽ More

    Submitted 27 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  34. arXiv:2311.07879  [pdf, other

    cs.CL cs.AI

    Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators

    Authors: Yang Trista Cao, Lovely-Frances Domingo, Sarah Ann Gilbert, Michelle Mazurek, Katie Shilton, Hal Daumé III

    Abstract: Extensive efforts in automated approaches for content moderation have been focused on develo** models to identify toxic, offensive, and hateful content with the aim of lightening the load for moderators. Yet, it remains uncertain whether improvements on those tasks have truly addressed moderators' needs in accomplishing their work. In this paper, we surface gaps between past research efforts tha… ▽ More

    Submitted 16 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  35. arXiv:2311.06758  [pdf, other

    cs.CL

    Sharing, Teaching and Aligning: Knowledgeable Transfer Learning for Cross-Lingual Machine Reading Comprehension

    Authors: Tingfeng Cao, Chengyu Wang, Chuanqi Tan, Jun Huang, **hui Zhu

    Abstract: In cross-lingual language understanding, machine translation is often utilized to enhance the transferability of models across languages, either by translating the training data from the source language to the target, or from the target to the source to aid inference. However, in cross-lingual machine reading comprehension (MRC), it is difficult to perform a deep level of assistance to enhance cro… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: emnlp 2023

  36. arXiv:2311.06752  [pdf, other

    cs.CL

    BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis

    Authors: Tingfeng Cao, Chengyu Wang, Bingyan Liu, Ziheng Wu, **hui Zhu, Jun Huang

    Abstract: Recently, diffusion-based deep generative models (e.g., Stable Diffusion) have shown impressive results in text-to-image synthesis. However, current text-to-image models often require multiple passes of prompt engineering by humans in order to produce satisfactory results for real-world applications. We propose BeautifulPrompt, a deep generative model to produce high-quality prompts from very simp… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: emnlp 2023

  37. arXiv:2311.01792  [pdf, other

    cs.CL cs.AI

    AFPQ: Asymmetric Floating Point Quantization for LLMs

    Authors: Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu

    Abstract: Large language models (LLMs) show great performance in various tasks, but face deployment challenges from limited memory capacity and bandwidth. Low-bit weight quantization can save memory and accelerate inference. Although floating-point (FP) formats show good performance in LLM quantization, they tend to perform poorly with small group sizes or sub-4 bits. We find the reason is that the absence… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  38. arXiv:2310.13772  [pdf, other

    cs.CV cs.LG

    TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models

    Authors: Tianshi Cao, Karsten Kreis, Sanja Fidler, Nicholas Sharp, Kangxue Yin

    Abstract: We present TexFusion (Texture Diffusion), a new method to synthesize textures for given 3D geometries, using large-scale text-guided image diffusion models. In contrast to recent works that leverage 2D text-to-image diffusion models to distill 3D objects using a slow and fragile optimization process, TexFusion introduces a new 3D-consistent generation technique specifically designed for texture sy… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Videos and more results on https://research.nvidia.com/labs/toronto-ai/texfusion/

    ACM Class: I.3.3

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (2023) 4169-4181

  39. arXiv:2310.12551  [pdf, other

    cs.RO eess.IV

    Iterative PnP and its application in 3D-2D vascular image registration for robot navigation

    Authors: **gwei Song, Keke Yang, Zheng Zhang, Meng Li, Tuoyu Cao, Maani Ghaffari

    Abstract: This paper reports on a new real-time robot-centered 3D-2D vascular image alignment algorithm, which is robust to outliers and can align nonrigid shapes. Few works have managed to achieve both real-time and accurate performance for vascular intervention robots. This work bridges high-accuracy 3D-2D registration techniques and computational efficiency requirements in intervention robot applications… ▽ More

    Submitted 11 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Submitted to ICRA 2024 Errors in Eq. 4 and Eq. 6 have been corrected. Updates include some minor improvements in Section II

  40. arXiv:2310.00057  [pdf, other

    cs.CE

    A multi-fidelity deep operator network (DeepONet) for fusing simulation and monitoring data: Application to real-time settlement prediction during tunnel construction

    Authors: Chen Xu, Ba Trung Cao, Yong Yuan, Günther Meschke

    Abstract: Ground settlement prediction during the process of mechanized tunneling is of paramount importance and remains a challenging research topic. Typically, two paradigms are existing: a physics-driven approach utilizing process-oriented computational simulation models for the tunnel-soil interaction and the settlement prediction, and a data-driven approach employing machine learning techniques to esta… ▽ More

    Submitted 12 November, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

  41. arXiv:2309.16110  [pdf, other

    cs.CV

    Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge

    Authors: Zheyuan Yang, Yibo Liu, Guile Wu, Tongtong Cao, Yuan Ren, Yang Liu, Bingbing Liu

    Abstract: In this technical report, we present a solution for 3D object generation of ICCV 2023 OmniObject3D Challenge. In recent years, 3D object generation has made great process and achieved promising results, but it remains a challenging task due to the difficulty of generating complex, textured and high-fidelity results. To resolve this problem, we study learning effective NeRFs and SDFs representation… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  42. arXiv:2309.08978  [pdf, other

    cs.AI

    Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations

    Authors: Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang

    Abstract: Web applications are increasingly becoming the primary platform for AI service delivery, making in-browser deep learning (DL) inference more prominent. However, current in-browser inference systems fail to effectively utilize advanced web programming techniques and customize kernels for various client devices, leading to suboptimal performance. To address the issues, this paper presents the firs… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

  43. arXiv:2309.04865  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Observation of flat and weakly dispersing bands in a van der Waals semiconductor Nb3Br8 with breathing kagome lattice

    Authors: Sabin Regmi, Anup Pradhan Sakhya, Tharindu Fernando, Yuzhou Zhao, Dylan Jeff, Milo Sprague, Favian Gonzalez, Iftakhar Bin Elius, Mazharul Islam Mondal, Nathan Valadez, Damani Jarrett, Alexis Agosto, Jihui Yang, Jiun-Haw Chu, Saiful I. Khondaker, Xiaodong Xu, Ting Cao, Madhab Neupane

    Abstract: Niobium halides, Nb3X8 (X = Cl,Br,I), which are predicted two-dimensional magnets, have recently gotten attention due to their breathing kagome geometry. Here, we have studied the electronic structure of Nb3Br8 by using angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations. ARPES results depict the presence of multiple flat and weakly dispersing bands. These bands are… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: 24 pages, 12 figures, Supplemental Material included

    Journal ref: Phys. Rev. B 108, L121404 (2023)

  44. Effect of initial-state geometric configurations on the nuclear liquid-gas phase transition

    Authors: Y. T. Cao, X. G. Deng, Y. G. Ma

    Abstract: Within the framework of an extended quantum molecular dynamics model, we simulated $^{40}$Ca + $^{16}$O collisions at beam energies ranging from 60 to 150 MeV/nucleon for $^{16}$O with different $α$-cluster configurations. Results imply that different $α$-cluster configurations lead to different yields of deuteron, triton, $^3$He and $^4$He, but not for proton and neutron. We discuss the effect of… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 10 pages, 8 figures

    Journal ref: Physical Review C 108, 024610 (2023)

  45. arXiv:2308.16451  [pdf, other

    cs.RO

    Optical flow-based vascular respiratory motion compensation

    Authors: Keke Yang, Zheng Zhang, Meng Li, Tuoyu Cao, Maani Ghaffari, **gwei Song

    Abstract: This paper develops a new vascular respiratory motion compensation algorithm, Motion-Related Compensation (MRC), to conduct vascular respiratory motion compensation by extrapolating the correlation between invisible vascular and visible non-vascular. Robot-assisted vascular intervention can significantly reduce the radiation exposure of surgeons. In robot-assisted image-guided intervention, blood… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: This manuscript has been accepted by IEEE Robotics and Automation Letters

  46. arXiv:2308.13323  [pdf, other

    cs.CV cs.RO

    SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation

    Authors: Xuechao Chen, Shuangjie Xu, Xiaoyi Zou, Tongyi Cao, Dit-Yan Yeung, Lu Fang

    Abstract: LiDAR-based semantic perception tasks are critical yet challenging for autonomous driving. Due to the motion of objects and static/dynamic occlusion, temporal information plays an essential role in reinforcing perception by enhancing and completing single-frame knowledge. Previous approaches either directly stack historical frames to the current frame or build a 4D spatio-temporal neighborhood usi… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Received by ICCV2023

  47. arXiv:2308.12066  [pdf, other

    cs.LG cs.AI cs.AR

    Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference

    Authors: Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang

    Abstract: Large language models (LLMs) based on transformers have made significant strides in recent years, the success of which is driven by scaling up their model size. Despite their high algorithmic performance, the computational and memory requirements of LLMs present unprecedented challenges. To tackle the high compute requirements of LLMs, the Mixture-of-Experts (MoE) architecture was introduced which… ▽ More

    Submitted 27 April, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

  48. arXiv:2308.08140  [pdf, other

    cs.CV

    GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds

    Authors: Ziyu Li, **gming Guo, Tongtong Cao, Liu Bingbing, Wankou Yang

    Abstract: LiDAR-based 3D detection has made great progress in recent years. However, the performance of 3D detectors is considerably limited when deployed in unseen environments, owing to the severe domain gap problem. Existing domain adaptive 3D detection methods do not adequately consider the problem of the distributional discrepancy in feature space, thereby hindering generalization of detectors across d… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  49. Gate-tunable antiferromagnetic Chern insulator in twisted bilayer transition metal dichalcogenides

    Authors: Xiaoyu Liu, Chong Wang, Xiao-Wei Zhang, Ting Cao, Di Xiao

    Abstract: A series of recent experimental works on twisted MoTe$_2$ homobilayers have unveiled an abundance of exotic states in this system. Valley-polarized quantum anomalous Hall states have been identified at hole do** of $ν= -1$, and the fractional quantum anomalous Hall effect is observed at $ν= -2/3$ and $ν= -3/5$. In this work, we investigate the electronic properties of AA-stacked twisted bilayer… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  50. arXiv:2308.02657  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Observation of Fractionally Quantized Anomalous Hall Effect

    Authors: Heonjoon Park, Jiaqi Cai, Eric Anderson, Yinong Zhang, Jiayi Zhu, Xiaoyu Liu, Chong Wang, William Holtzmann, Chaowei Hu, Zhaoyu Liu, Takashi Taniguchi, Kenji Watanabe, Jiun-haw Chu, Ting Cao, Liang Fu, Wang Yao, Cui-Zu Chang, David Cobden, Di Xiao, Xiaodong Xu

    Abstract: The integer quantum anomalous Hall (QAH) effect is a lattice analog of the quantum Hall effect at zero magnetic field. This striking transport phenomenon occurs in electronic systems with topologically nontrivial bands and spontaneous time-reversal symmetry breaking. Discovery of its putative fractional counterpart in the presence of strong electron correlations, i.e., the fractional quantum anoma… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 15 pages, 4 figures for main text. 8 extended data figures

    Journal ref: Nature (2023)