Skip to main content

Showing 1–50 of 120 results for author: Xia, A

.
  1. arXiv:2406.16112  [pdf, ps, other

    math.NA

    Greedy randomized Bregman-Kaczmarz method for constrained nonlinear systems of equations

    Authors: Aqin Xiao, Junfeng Yin

    Abstract: A greedy randomized nonlinear Bregman-Kaczmarz method by sampling the working index with residual information is developed for the solution of the constrained nonlinear system of equations. Theoretical analyses prove the convergence of the greedy randomized nonlinear Bregman-Kaczmarz method and its relaxed version. Numerical experiments verify the effectiveness of the proposed method,which converg… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2406.09813  [pdf, other

    astro-ph.IM astro-ph.HE

    Diffuse X-ray Explorer: a high-resolution X-ray spectroscopic sky surveyor on the China Space Station

    Authors: Hai **, Junjie Mao, Liubiao Chen, Naihui Chen, Wei Cui, Bo Gao, **** Li, Xinfeng Li, Jiejia Liu, Jia Quan, Chunyang Jiang, Guole Wang, Le Wang, Qian Wang, Sifan Wang, Aimin Xiao, Shuo Zhang

    Abstract: DIffuse X-ray Explorer (DIXE) is a proposed high-resolution X-ray spectroscopic sky surveyor on the China Space Station (CSS). DIXE will focus on studying hot baryons in the Milky Way. Galactic hot baryons like the X-ray emitting Milky Way halo and eROSITA bubbles are best observed in the sky survey mode with a large field of view. DIXE will take advantage of the orbital motion of the CSS to scan… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, the full version is published by Journal of Low Temperature Physics

  3. arXiv:2405.02794  [pdf, other

    cs.RO

    Octopi: Object Property Reasoning with Large Tactile-Language Models

    Authors: Samson Yu, Kelvin Lin, Anxing Xiao, Jiafei Duan, Harold Soh

    Abstract: Physical reasoning is important for effective robot manipulation. Recent work has investigated both vision and language modalities for physical reasoning; vision can reveal information about objects in the environment and language serves as an abstraction and communication medium for additional context. Although these works have demonstrated success on a variety of physical reasoning tasks, they a… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at Robotics: Science and Systems (R:SS 2024)

  4. arXiv:2404.14953  [pdf, other

    cs.LG

    Dynamic pricing with Bayesian updates from online reviews

    Authors: José Correa, Mathieu Mari, Andrew Xia

    Abstract: When launching new products, firms face uncertainty about market reception. Online reviews provide valuable information not only to consumers but also to firms, allowing firms to adjust the product characteristics, including its selling price. In this paper, we consider a pricing model with online reviews in which the quality of the product is uncertain, and both the seller and the buyers Bayesian… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  5. arXiv:2402.03631  [pdf, other

    cs.CV

    Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: The recent Segment Anything Model (SAM) has demonstrated remarkable zero-shot capability and flexible geometric prompting in general image segmentation. However, SAM often struggles when handling various unconventional images, such as aerial, medical, and non-RGB images. This paper presents CAT-SAM, a ConditionAl Tuning network that adapts SAM toward various unconventional target tasks with just f… ▽ More

    Submitted 21 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Project page: https://xiaoaoran.github.io/projects/CAT-SAM

  6. arXiv:2401.08407  [pdf, other

    cs.CV

    Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

    Authors: Jiahao Nie, Yun Xing, Gongjie Zhang, Pei Yan, Aoran Xiao, Yap-Peng Tan, Alex C. Kot, Shijian Lu

    Abstract: Cross-Domain Few-Shot Segmentation (CD-FSS) poses the challenge of segmenting novel categories from a distinct domain using only limited exemplars. In this paper, we undertake a comprehensive study of CD-FSS and uncover two crucial insights: (i) the necessity of a fine-tuning stage to effectively transfer the learned meta-knowledge across domains, and (ii) the overfitting risk during the naïve fin… ▽ More

    Submitted 13 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR 2024

  7. arXiv:2401.08344  [pdf, other

    math.PR

    Large-population asymptotics for the maximum of diffusive particles with mean-field interaction in the noises

    Authors: Nikolaos Kolliopoulos, David Sanchez, Amy Xiao

    Abstract: We study the $N \to \infty$ limit of the normalized largest component in some systems of $N$ diffusive particles with mean-field interaction. By applying a universal time change, the interaction in noises is transferred to the drift terms, and the asymptotic behavior of the maximum becomes well-understood due to existing results in the literature. We expect that the normalized maximum in the origi… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 12 pages

    MSC Class: 60K35; 60H10; 60F05; 60G70

  8. arXiv:2311.17406  [pdf, other

    cs.RO cs.AI

    LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model

    Authors: Siwei Chen, Anxing Xiao, David Hsu

    Abstract: This work addresses the problem of long-horizon task planning with the Large Language Model (LLM) in an open-world household environment. Existing works fail to explicitly track key objects and attributes, leading to erroneous decisions in long-horizon tasks, or rely on highly engineered state features and feedback, which is not generalizable. We propose an open state representation that provides… ▽ More

    Submitted 22 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  9. arXiv:2311.06711  [pdf, ps, other

    math.NA

    Optimal $L^\infty(L^2)$ and $L^1(L^2)$ a posteriori error estimates for the fully discrete approximations of time fractional parabolic differential equations

    Authors: Jiliang Cao, Wansheng Wang, Aiguo Xiao

    Abstract: We derive optimal order a posteriori error estimates in the $L^\infty(L^2)$ and $L^1(L^2)$-norms for the fully discrete approximations of time fractional parabolic differential equations. For the discretization in time, we use the $L1$ methods, while for the spatial discretization, we use standard conforming finite element methods. The linear and quadratic space-time reconstructions are introduced… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 22 pages

  10. Parking Spot Classification based on surround view camera system

    Authors: Andy Xiao, Deep Doshi, Lihao Wang, Harsha Gorantla, Thomas Heitzmann, Peter Groth

    Abstract: Surround-view fisheye cameras are commonly used for near-field sensing in automated driving scenarios, including urban driving and auto valet parking. Four fisheye cameras, one on each side, are sufficient to cover 360° around the vehicle capturing the entire near-field region. Based on surround view cameras, there has been much research on parking slot detection with main focus on the occupancy s… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: SPIE Optical Engineering + Applications, 2023, San Diego, California, United States. Proc. SPIE 12675, Applications of Machine Learning 2023

  11. arXiv:2310.12141  [pdf, other

    math.PR

    A phase transition and critical phenomenon for the two-dimensional random field Ising model

    Authors: Jian Ding, Fenglin Huang, Aoteng Xia

    Abstract: We study the random field Ising model in a two-dimensional box with side length $N$ where the external field is given by independent normal variables with mean $0$ and variance $ε^2$. Our primary result is the following phase transition at $T = T_c$: for $ε\ll N^{-7/8}$ the boundary influence (i.e., the difference between the spin averages at the center of the box with the plus and the minus bound… ▽ More

    Submitted 4 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 65 pages; minor revision throughout over previous version

    MSC Class: 60K35; 82B44

  12. arXiv:2310.09078  [pdf, other

    cs.NI eess.SP

    DNFS-VNE: Deep Neuro Fuzzy System Driven Virtual Network Embedding

    Authors: Ailing Xiao, Ning Chen, Sheng Wu, Peiying Zhang, Linling Kuang, Chunxiao Jiang

    Abstract: By decoupling substrate resources, network virtualization (NV) is a promising solution for meeting diverse demands and ensuring differentiated quality of service (QoS). In particular, virtual network embedding (VNE) is a critical enabling technology that enhances the flexibility and scalability of network deployment by addressing the coupling of Internet processes and services. However, in the exi… ▽ More

    Submitted 3 July, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  13. arXiv:2309.13505  [pdf, other

    cs.CV

    Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

    Authors: Yun Xing, Jian Kang, Aoran Xiao, Jiahao Nie, Ling Shao, Shijian Lu

    Abstract: Vision-Language Pre-training has demonstrated its remarkable zero-shot recognition ability and potential to learn generalizable visual representations from language supervision. Taking a step ahead, language-supervised semantic segmentation enables spatial localization of textual inputs by learning pixel grou** solely from image-text pairs. Nevertheless, the state-of-the-art suffers from clear s… ▽ More

    Submitted 4 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023. Code is available at https://github.com/xing0047/rewrite

  14. arXiv:2309.06041  [pdf, other

    cs.RO

    GVD-Exploration: An Efficient Autonomous Robot Exploration Framework Based on Fast Generalized Voronoi Diagram Extraction

    Authors: Dingfeng Chen, Anxing Xiao, Meiyuan Zou, Wenzheng Chi, Jiankun Wang, Lining Sun

    Abstract: Rapidly-exploring Random Trees (RRTs) are a popular technique for autonomous exploration of mobile robots. However, the random sampling used by RRTs can result in inefficient and inaccurate frontiers extraction, which affects the exploration performance. To address the issues of slow path planning and high path cost, we propose a framework that uses a generalized Voronoi diagram (GVD) based multi-… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 11 pages, 10 figures

  15. arXiv:2309.03005  [pdf, ps, other

    math.NA

    On multi-step extended maximum residual Kaczmarz method for solving large inconsistent linear systems

    Authors: Aqin Xiao, Junfeng Yin, Ning Zheng

    Abstract: A multi-step extended maximum residual Kaczmarz method is presented for the solution of the large inconsistent linear system of equations by using the multi-step iterations technique. Theoretical analysis proves the proposed method is convergent and gives an upper bound on its convergence rate. Numerical experiments show that the proposed method is effective and outperforms the existing extended K… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  16. arXiv:2309.02780  [pdf, other

    cs.CL cs.SD eess.AS

    GRASS: Unified Generation Model for Speech-to-Semantic Tasks

    Authors: Aobo Xia, Shuyu Lei, Yushu Yang, Xiang Guo, Hua Chai

    Abstract: This paper explores the instruction fine-tuning technique for speech-to-semantic tasks by introducing a unified end-to-end (E2E) framework that generates target text conditioned on a task-related prompt for audio data. We pre-train the model using large and diverse data, where instruction-speech pairs are constructed via a text-to-speech (TTS) system. Extensive experiments demonstrate that our pro… ▽ More

    Submitted 11 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  17. arXiv:2307.15283  [pdf, ps, other

    math.NA

    On averaging block Kaczmarz methods for solving nonlinear systems of equations

    Authors: Aqin Xiao, Junfeng Yin

    Abstract: A class of averaging block nonlinear Kaczmarz methods is developed for the solution of the nonlinear system of equations. The convergence theory of the proposed method is established under suitable assumptions and the upper bounds of the convergence rate for the proposed method with both constant stepsize and adaptive stepsize are derived. Numerical experiments are presented to verify the efficien… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  18. arXiv:2305.19812  [pdf, other

    cs.CV

    A Survey of Label-Efficient Deep Learning for 3D Point Clouds

    Authors: Aoran Xiao, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: In the past decade, deep neural networks have achieved significant progress in point cloud learning. However, collecting large-scale precisely-annotated training data is extremely laborious and expensive, which hinders the scalability of existing point cloud datasets and poses a bottleneck for efficient exploration of point cloud data in various tasks and applications. Label-efficient learning off… ▽ More

    Submitted 17 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  19. arXiv:2304.00690  [pdf, other

    cs.CV

    3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing

    Abstract: Robust point cloud parsing under all-weather conditions is crucial to level-5 autonomy in autonomous driving. However, how to learn a universal 3D semantic segmentation (3DSS) model is largely neglected as most existing benchmarks are dominated by point clouds captured under normal weather. We introduce SemanticSTF, an adverse-weather point cloud dataset that provides dense point-level annotations… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: CVPR2023

  20. Designing the pressure-dependent shear modulus using tessellated granular metamaterials

    Authors: Jerry Zhang, Dong Wang, Weiwei **, Annie Xia, Nidhi Pashine, Rebecca Kramer-Bottiglio, Mark D. Shattuck, Corey S. O'Hern

    Abstract: Jammed packings of granular materials display complex mechanical response. For example, the ensemble-averaged shear modulus $\left\langle G \right\rangle$ increases as a power-law in pressure $p$ for static packings of soft spherical particles that can rearrange during compression. We seek to design granular materials with shear moduli that can either increase {\it or} decrease with pressure witho… ▽ More

    Submitted 10 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Journal ref: Phys. Rev. E 108, 034901 (2023)

  21. arXiv:2303.06624  [pdf, other

    cs.RO

    Collaborative Trolley Transportation System with Autonomous Nonholonomic Robots

    Authors: Bingyi Xia, Hao Luan, Ziqi Zhao, Xuheng Gao, Peijia Xie, Anxing Xiao, Jiankun Wang, Max Q. -H. Meng

    Abstract: Cooperative object transportation using multiple robots has been intensively studied in the control and robotics literature, but most approaches are either only applicable to omnidirectional robots or lack a complete navigation and decision-making framework that operates in real time. This paper presents an autonomous nonholonomic multi-robot system and an end-to-end hierarchical autonomy framewor… ▽ More

    Submitted 21 July, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

  22. arXiv:2303.05223  [pdf, other

    stat.ME

    LEAP: The latent exchangeability prior for borrowing information from historical data

    Authors: Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, Joseph G. Ibrahim

    Abstract: It is becoming increasingly popular to elicit informative priors on the basis of historical data. Popular existing priors, including the power prior, commensurate prior, and robust meta-analytic prior provide blanket discounting. Thus, if only a subset of participants in the historical data are exchangeable with the current data, these priors may not be appropriate. In order to combat this issue,… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  23. arXiv:2302.10654  [pdf, ps, other

    math.PR

    On the rate of normal approximation for Poisson continuum percolation

    Authors: Tiffany Y. Y. Lo, Aihua Xia

    Abstract: It is known that the number of points in the largest cluster of a percolating Poisson process restricted to a large finite box is asymptotically normal. In this note, we establish a rate of convergence for the statement. As each point in the largest cluster is determined by points as far as the diameter of the box, known results in the literature of normal approximation for Poisson functionals can… ▽ More

    Submitted 7 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 10 pages. This version contains a correction to an error in Lemma 2.2 in the previous versions

    MSC Class: primary 60K35; 60F05; secondary 60D05; 60G57; 82B43; 62E20

  24. The Digital Foundation Platform -- A Multi-layered SOA Architecture for Intelligent Connected Vehicle Operating System

    Authors: David Yu, Andy Xiao

    Abstract: Legacy AD/ADAS development from OEMs centers around develo** functions on ECUs using services provided by AUTOSAR Classic Platform (CP) to meet automotive-grade and mass-production requirements. The AUTOSAR CP couples hardware and software components statically and encounters challenges to provide sufficient capacities for the processing of high-level intelligent driving functions, whereas the n… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: WCX SAE World Congress Experience 2022

  25. arXiv:2210.05128  [pdf, ps, other

    math.NA

    On fast greedy block Kaczmarz methods for solving large consistent linear systems

    Authors: Aqin Xiao, Junfeng Yin, Ning Zheng

    Abstract: A class of fast greedy block Kaczmarz methods combined with general greedy strategy and average technique are proposed for solving large consistent linear systems. Theoretical analysis of the convergence of the proposed method is given in detail. Numerical experiments show that the proposed methods are efficient and faster than the existing methods.

    Submitted 16 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: 11 pages, 1 figure

  26. arXiv:2209.13998  [pdf, other

    math.PR

    Long range order for three-dimensional random field Ising model throughout the entire low temperature regime

    Authors: Jian Ding, Yu Liu, Aoteng Xia

    Abstract: For $d\geq 3$, we study the Ising model on $\mathbb Z^d$ with random field given by $\{εh_v: v\in \mathbb Z^d\}$ where $h_v$'s are independent normal variables with mean 0 and variance 1. We show that for any $T < T_c$ (here $T_c$ is the critical temperature without disorder), long range order exists as long as $ε$ is sufficiently small depending on $T$. Our work extends previous results of Imbrie… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 36 pages

    MSC Class: 60K35; 82B44

  27. arXiv:2208.00223  [pdf, other

    cs.CV cs.AI cs.LG

    PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Dayan Guan, Kaiwen Cui, Shijian Lu, Ling Shao

    Abstract: LiDAR point clouds, which are usually scanned by rotating LiDAR sensors continuously, capture precise geometry of the surrounding environment and are crucial to many autonomous detection and navigation tasks. Though many 3D deep architectures have been developed, efficient collection and annotation of large amounts of point clouds remain one major challenge in the analytic and understanding of poi… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

  28. arXiv:2205.13211  [pdf, ps, other

    math.PR

    Convergence rate for geometric statistics of point processes with fast decay dependence

    Authors: Tianshu Cong, Aihua Xia

    Abstract: [Błaszczyszyn, Yogeshwaran and Yukich (2019)] established central limit theorems for geometric statistics of point processes having fast decay dependence. As limit theorems are of limited use unless we understand their errors involved in the approximation, in this paper, we consider the rates of a normal approximation in terms of the Wasserstein distance for statistics of point processes on… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 42 pages

    MSC Class: primary 60F05; secondary 60D05; 60G55; 62E20; 05C80

  29. arXiv:2205.03967  [pdf, other

    stat.ME math.ST

    The saturated pairwise interaction Gibbs point process as a joint species distribution model

    Authors: Ian Flint, Nick Golding, Peter Vesk, Yan Wang, Aihua Xia

    Abstract: In an effort to effectively model observed patterns in the spatial configuration of individuals of multiple species in nature, we introduce the saturated pairwise interaction Gibbs point process. Its main strength lies in its ability to model both attraction and repulsion within and between species, over different scales. As such, it is particularly well-suited to the study of associations in… ▽ More

    Submitted 20 August, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: 36 pages, 14 figures

    Journal ref: Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), 2022, pages 1721-1752

  30. arXiv:2204.06456  [pdf, other

    cond-mat.quant-gas hep-ph quant-ph

    Non-equilibrium dynamics of fluctuations in an ultra-cold atomic mixture

    Authors: Apoorva Hegde, Robert Ott, Andy Xia, Valentin Kasper, Jürgen Berges, Fred Jendrzejewski

    Abstract: We investigate an ultra-cold mixture of Bose gases interacting via spin-changing collisions by studying the dynamics of spin fluctuations. The experimental implementation employs $^{23}$Na and $^{7}$Li atoms, which are prepared out of equilibrium across a wide range of initial conditions. We identify three regimes in the dynamics of the system for different initial states: a long-lived metastable… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: 9 pages, 5 figures

  31. arXiv:2204.03875  [pdf, other

    cs.DS cs.CG

    Deterministic, Near-Linear $\varepsilon$-Approximation Algorithm for Geometric Bipartite Matching

    Authors: Pankaj K. Agarwal, Hsien-Chih Chang, Sharath Raghvendra, Allen Xiao

    Abstract: Given point sets $A$ and $B$ in $\mathbb{R}^d$ where $A$ and $B$ have equal size $n$ for some constant dimension $d$ and a parameter $\varepsilon>0$, we present the first deterministic algorithm that computes, in $n\cdot(\varepsilon^{-1} \log n)^{O(d)}$ time, a perfect matching between $A$ and $B$ whose cost is within a $(1+\varepsilon)$ factor of the optimal under any $\smash{\ell_p}$-norm. Altho… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: The conference version of the paper is accepted to STOC 2022

  32. arXiv:2203.10026  [pdf, other

    cs.CV

    Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation

    Authors: Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu

    Abstract: Semi-supervised semantic segmentation learns from small amounts of labelled images and large amounts of unlabelled images, which has witnessed impressive progress with the recent advance of deep neural networks. However, it often suffers from severe class-bias problem while exploring the unlabelled images, largely due to the clear pixel-wise class imbalance in the labelled images. This paper prese… ▽ More

    Submitted 26 March, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022. Code is available at https://github.com/Dayan-Guan/USRN

  33. arXiv:2203.04541  [pdf, other

    cs.RO

    PUTN: A Plane-fitting based Uneven Terrain Navigation Framework

    Authors: Zhuozhu Jian, Zihong Lu, Xiao Zhou, Bin Lan, Anxing Xiao, Xueqian Wang, Bin Liang

    Abstract: Autonomous navigation of ground robots has been widely used in indoor structured 2D environments, but there are still many challenges in outdoor 3D unstructured environments, especially in rough, uneven terrains. This paper proposed a plane-fitting based uneven terrain navigation framework (PUTN) to solve this problem. The implementation of PUTN is divided into three steps. First, based on Rapidly… ▽ More

    Submitted 27 September, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  34. arXiv:2203.03927  [pdf, other

    cs.RO eess.SY

    Quadruped Guidance Robot for the Visually Impaired: A Comfort-Based Approach

    Authors: Yanbo Chen, Zhengzhe Xu, Zhuozhu Jian, Gengpan Tang, Yunong Yangli, Anxing Xiao, Xueqian Wang, Bin Liang

    Abstract: Guidance robots that can guide people and avoid various obstacles, could potentially be owned by more visually impaired people at a fairly low cost. Most of the previous guidance robots for the visually impaired ignored the human response behavior and comfort, treating the human as an appendage dragged by the robot, which can lead to imprecise guidance of the human and sudden changes in the tracti… ▽ More

    Submitted 23 June, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2023

  35. Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey

    Authors: Aoran Xiao, Jiaxing Huang, Dayan Guan, Xiaoqin Zhang, Shijian Lu, Ling Shao

    Abstract: Point cloud data have been widely explored due to its superior accuracy and robustness under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved very impressive success in various applications such as surveillance and autonomous driving. The convergence of point cloud and DNNs has led to many deep point cloud models, largely trained under the supervision of large-scale… ▽ More

    Submitted 26 March, 2023; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  36. arXiv:2111.09983  [pdf, other

    eess.AS cs.SD

    Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

    Authors: Chunxi Liu, Michael Picheny, Leda Sarı, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf

    Abstract: It is well known that many machine learning systems demonstrate bias towards specific groups of individuals. This problem has been studied extensively in the Facial Recognition area, but much less so in Automatic Speech Recognition (ASR). This paper presents initial Speech Recognition results on "Casual Conversations" -- a publicly released 846 hour corpus designed to help researchers evaluate the… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: Submitted to ICASSP 2022. Our dataset will be publicly available at (https://ai.facebook.com/datasets/casual-conversations-downloads) for general use. We also would like to note that considering the limitations of our dataset, we limit the use of it for only evaluation purposes (see license agreement)

  37. arXiv:2111.05948  [pdf, other

    cs.CL cs.SD eess.AS

    Scaling ASR Improves Zero and Few Shot Learning

    Authors: Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed

    Abstract: With 4.5 million hours of English speech from 10 different sources across 120 countries and models of up to 10 billion parameters, we explore the frontiers of scale for automatic speech recognition. We propose data selection techniques to efficiently scale training data to find the most valuable samples in massive datasets. To efficiently scale model sizes, we leverage various optimizations such a… ▽ More

    Submitted 29 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  38. arXiv:2110.06648  [pdf, other

    cs.RO eess.SY

    Robotic Autonomous Trolley Collection with Progressive Perception and Nonlinear Model Predictive Control

    Authors: Anxing Xiao, Hao Luan, Ziqi Zhao, Yue Hong, Jieting Zhao, Weinan Chen, Jiankun Wang, Max Q. -H. Meng

    Abstract: Autonomous mobile manipulation robots that can collect trolleys are widely used to liberate human resources and fight epidemics. Most prior robotic trolley collection solutions only detect trolleys with 2D poses or are merely based on specific marks and lack the formal design of planning algorithms. In this paper, we present a novel mobile manipulation system with applications in luggage trolley c… ▽ More

    Submitted 1 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted to the 2022 International Conference on Robotics and Automation (ICRA 2022)

  39. arXiv:2110.05241  [pdf, other

    eess.AS cs.CL cs.LG

    Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

    Authors: Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer

    Abstract: This paper improves the streaming transformer transducer for speech recognition by using non-causal convolution. Many works apply the causal convolution to improve streaming transformer ignoring the lookahead context. We propose to use non-causal convolution to process the center block and lookahead context separately. This method leverages the lookahead context in convolution and maintains simila… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 5 pages, 3 figures, submit to ICASSP 2022

  40. arXiv:2110.03374  [pdf, other

    cs.CV

    Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data

    Authors: Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu

    Abstract: Unsupervised domain adaptation aims to align a labeled source domain and an unlabeled target domain, but it requires to access the source data which often raises concerns in data privacy, data portability and data transmission efficiency. We study unsupervised model adaptation (UMA), or called Unsupervised Domain Adaptation without Source Data, an alternative setting that aims to adapt source-trai… ▽ More

    Submitted 4 June, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted to Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  41. arXiv:2110.03174  [pdf, other

    cs.SD cs.AI eess.AS

    Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study

    Authors: Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer

    Abstract: Detection of common events and scenes from audio is useful for extracting and understanding human contexts in daily life. Prior studies have shown that leveraging knowledge from a relevant domain is beneficial for a target acoustic event detection (AED) process. Inspired by the observation that many human-centered acoustic events in daily life involve voice elements, this paper investigates the po… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  42. arXiv:2108.00177  [pdf, other

    cs.CV cs.AI

    Greedy Network Enlarging

    Authors: Chuanjian Liu, Kai Han, An Xiao, Yi** Deng, Wei Zhang, Chun**g Xu, Yunhe Wang

    Abstract: Recent studies on deep convolutional neural networks present a simple paradigm of architecture design, i.e., models with more MACs typically achieve better accuracy, such as EfficientNet and RegNet. These works try to enlarge all the stages in the model with one unified rule by sampling and statistical methods. However, we observe that some network architectures have similar MACs and accuracies, b… ▽ More

    Submitted 25 November, 2021; v1 submitted 31 July, 2021; originally announced August 2021.

  43. arXiv:2107.11004  [pdf, other

    cs.CV

    Domain Adaptive Video Segmentation via Temporal Consistency Regularization

    Authors: Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu

    Abstract: Video semantic segmentation is an essential task for the analysis and understanding of videos. Recent efforts largely focus on supervised video segmentation by learning from fully annotated data, but the learnt models often experience clear performance drop while applied to videos of a different domain. This paper presents DA-VSN, a domain adaptive video segmentation network that addresses domain… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted to ICCV 2021. Code is available at https://github.com/Dayan-Guan/DA-VSN

  44. arXiv:2107.05399  [pdf, other

    cs.CV

    Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation

    Authors: Aoran Xiao, Jiaxing Huang, Dayan Guan, Fangneng Zhan, Shijian Lu

    Abstract: Knowledge transfer from synthetic to real data has been widely studied to mitigate data annotation constraints in various computer vision tasks such as semantic segmentation. However, the study focused on 2D images and its counterpart in 3D point clouds segmentation lags far behind due to the lack of large-scale synthetic datasets and effective transfer methods. We address this issue by collecting… ▽ More

    Submitted 1 December, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted by AAAI 2022

  45. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  46. arXiv:2107.03021  [pdf, other

    cs.CV

    Bi-level Feature Alignment for Versatile Image Translation and Manipulation

    Authors: Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao

    Abstract: Generative adversarial networks (GANs) have achieved great success in image translation and manipulation. However, high-fidelity image generation with faithful style control remains a grand challenge in computer vision. This paper presents a versatile image translation and manipulation framework that achieves accurate semantic and style guidance in image generation by explicitly building a corresp… ▽ More

    Submitted 21 July, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted to ECCV 2022

  47. arXiv:2107.00773  [pdf, other

    cs.RO cs.AI eess.SY

    Autonomous Navigation for Quadrupedal Robots with Optimized Jum** through Constrained Obstacles

    Authors: Scott Gilroy, Derek Lau, Lizhi Yang, Ed Izaguirre, Kristen Biermayer, Anxing Xiao, Mengti Sun, Ayush Agrawal, Jun Zeng, Zhongyu Li, Koushil Sreenath

    Abstract: Quadrupeds are strong candidates for navigating challenging environments because of their agile and dynamic designs. This paper presents a methodology that extends the range of exploration for quadrupedal robots by creating an end-to-end navigation framework that exploits walking and jum** modes. To obtain a dynamic jum** maneuver while avoiding obstacles, dynamically-feasible trajectories are… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Accepted to 2021 IEEE 17th International Conference on Automation Science and Engineering (CASE 2021)

  48. arXiv:2106.15941  [pdf, other

    cs.CV cs.LG

    Augmented Shortcuts for Vision Transformers

    Authors: Yehui Tang, Kai Han, Chang Xu, An Xiao, Yi** Deng, Chao Xu, Yunhe Wang

    Abstract: Transformer models have achieved great progress on computer vision tasks recently. The rapid development of vision transformers is mainly contributed by their high representation ability for extracting informative features from input images. However, the mainstream transformer models are designed with deep architectures, and the feature diversity will be continuously reduced as the depth increases… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

  49. arXiv:2106.03014  [pdf, ps, other

    math.PR

    Geometric sums, size biasing and zero biasing

    Authors: Qingwei Liu, Aihua Xia

    Abstract: The geometric sum plays a significant role in risk theory and reliability theory \cite{Kala97} and a prototypical example of the geometric sum is Rényi's theorem~\cite{Renyi56} saying a sequence of suitably parameterised geometric sums converges to the exponential distribution. There is extensive study of the accuracy of exponential distribution approximation to the geometric sum \cite{Sugakova95,… ▽ More

    Submitted 16 October, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

  50. arXiv:2106.02885  [pdf, other

    cs.CV

    Category Contrast for Unsupervised Domain Adaptation in Visual Tasks

    Authors: Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu, Ling Shao

    Abstract: Instance contrast for unsupervised representation learning has achieved great success in recent years. In this work, we explore the idea of instance contrastive learning in unsupervised domain adaptation (UDA) and propose a novel Category Contrast technique (CaCo) that introduces semantic priors on top of instance discrimination for visual UDA tasks. By considering instance contrastive learning as… ▽ More

    Submitted 17 March, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: CVPR2022 version