Skip to main content

Showing 151–200 of 579 results for author: Xia, F

.
  1. arXiv:2210.15149  [pdf

    eess.IV cs.CV

    Fully Automated Deep Learning-enabled Detection for Hepatic Steatosis on Computed Tomography: A Multicenter International Validation Study

    Authors: Zhongyi Zhang, Guixia Li, Ziqiang Wang, Feng Xia, Ning Zhao, Huibin Nie, Zezhong Ye, Joshua Lin, Yiyi Hui, Xiangchun Liu

    Abstract: Despite high global prevalence of hepatic steatosis, no automated diagnostics demonstrated generalizability in detecting steatosis on multiple international datasets. Traditionally, hepatic steatosis detection relies on clinicians selecting the region of interest (ROI) on computed tomography (CT) to measure liver attenuation. ROI selection demands time and expertise, and therefore is not routinely… ▽ More

    Submitted 6 November, 2022; v1 submitted 26 October, 2022; originally announced October 2022.

  2. arXiv:2210.10865  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Robotic Table Wi** via Reinforcement Learning and Whole-body Trajectory Optimization

    Authors: Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

    Abstract: We propose a framework to enable multipurpose assistive mobile robots to autonomously wipe tables to clean spills and crumbs. This problem is challenging, as it requires planning wi** actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations. Simultaneously, we must guarantee constraints satisfaction to enable safe deployment in… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  3. arXiv:2210.07372  [pdf, other

    cs.CV

    SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds

    Authors: Pei Sun, Mingxing Tan, Weiyue Wang, Chenxi Liu, Fei Xia, Zhaoqi Leng, Dragomir Anguelov

    Abstract: 3D object detection in point clouds is a core component for modern robotics and autonomous driving systems. A key challenge in 3D object detection comes from the inherent sparse nature of point occupancy within the 3D scene. In this paper, we propose Sparse Window Transformer (SWFormer ), a scalable and accurate model for 3D object detection, which can take full advantage of the sparsity of point… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Journal ref: ECCV 2022

  4. arXiv:2210.06210  [pdf, other

    cs.CL cs.LG

    Pruning Pre-trained Language Models Without Fine-Tuning

    Authors: Ting Jiang, Deqing Wang, Fuzhen Zhuang, Ruobing Xie, Feng Xia

    Abstract: To overcome the overparameterized problem in Pre-trained Language Models (PLMs), pruning is widely used as a simple and straightforward compression method by directly removing unimportant weights. Previous first-order methods successfully compress PLMs to extremely high sparsity with little performance drop. These methods, such as movement pruning, use first-order information to prune PLMs while f… ▽ More

    Submitted 16 May, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to ACL 2023; Code and models are available at https://github.com/kongds/SMP

  5. arXiv:2210.04616  [pdf, other

    math.AP

    Hilbert expansion of the Boltzmann equation in the incompressible Euler level in a channel

    Authors: Feimin Huang, Weiqiang Wang, Yong Wang, Feng Xiao

    Abstract: The study of hydrodynamic limit of the Boltzmann equation with physical boundary is a challenging problem due to appearance of the viscous and Knudsen boundary layers. In this paper, the hydrodynamic limit from the Boltzmann equation with specular reflection boundary condition to the incompressible Euler in a channel is investigated. Based on the multiscaled Hilbert expansion, the equations with b… ▽ More

    Submitted 8 September, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 53 pages.Accepted for publication in SCIENCE CHINA Mathematics

  6. arXiv:2210.00515  [pdf, other

    eess.IV cs.CV

    Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images

    Authors: Junlin Hou, Fan Xiao, Jilan Xu, Yuejie Zhang, Haidong Zou, Rui Feng

    Abstract: The ultra-wide optical coherence tomography angiography (OCTA) has become an important imaging modality in diabetic retinopathy (DR) diagnosis. However, there are few researches focusing on automatic DR analysis using ultra-wide OCTA. In this paper, we present novel and practical deep-learning solutions based on ultra-wide OCTA for the Diabetic Retinopathy Analysis Challenge (DRAC). In the segment… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  7. arXiv:2209.13880  [pdf, ps, other

    math.OC

    A machine learning based column-and-row generation approach for integrated air cargo recovery problem

    Authors: Lei Huang, Fan Xiao, Zhe Liang

    Abstract: Freighter airlines need to recover both aircraft and cargo schedules when disruptions happen. This process is usually divided into three sequential decisions to recovery flights, aircraft, and cargoes. This study focuses on the integrated recovery problem that makes aircraft and cargo recovery decisions simultaneously. We formulate two integrated models based on the flight connection network, one… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  8. arXiv:2209.10780  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

    Authors: Xuesu Xiao, Tingnan Zhang, Krzysztof Choromanski, Edward Lee, Anthony Francis, Jake Varley, Stephen Tu, Sumeet Singh, Peng Xu, Fei Xia, Sven Mikael Persson, Dmitry Kalashnikov, Leila Takayama, Roy Frostig, Jie Tan, Carolina Parada, Vikas Sindhwani

    Abstract: Despite decades of research, existing navigation systems still face real-world challenges when deployed in the wild, e.g., in cluttered home environments or in human-occupied public spaces. To address this, we present a new class of implicit control policies combining the benefits of imitation learning with the robust handling of system constraints from Model Predictive Control (MPC). Our approach… ▽ More

    Submitted 23 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  9. arXiv:2209.09874  [pdf, other

    cs.RO cs.AI cs.CV

    Open-vocabulary Queryable Scene Representations for Real World Planning

    Authors: Boyuan Chen, Fei Xia, Brian Ichter, Kanishka Rao, Keerthana Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler

    Abstract: Large language models (LLMs) have unlocked new capabilities of task planning from human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are limited by the lack of grounding in the surrounding scene. In this paper, we develop NLMap, an open-vocabulary and queryable scene representation to address this problem. NLMap serves as a framework to gather and integrate conte… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: v2, added references to concurrent work and acknowledgments

  10. arXiv:2209.09000  [pdf, other

    cs.IR

    Reweighting Clicks with Dwell Time in Recommendation

    Authors: Ruobing Xie, Lin Ma, Shaoliang Zhang, Feng Xia, Leyu Lin

    Abstract: The click behavior is the most widely-used user positive feedback in recommendation. However, simply considering each click equally in training may suffer from clickbaits and title-content mismatching, and thus fail to precisely capture users' real satisfaction on items. Dwell time could be viewed as a high-quality quantitative indicator of user preferences on each click, while existing recommenda… ▽ More

    Submitted 27 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: 5 pages, accepted by WWW-2023 Companion

    Journal ref: WWW-2023 Companion

  11. arXiv:2209.08774  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    Playing Technique Detection by Fusing Note Onset Information in Guzheng Performance

    Authors: Dichucheng Li, Yulun Wu, Qinyu Li, Jiahao Zhao, Yi Yu, Fan Xia, Wei Li

    Abstract: The Guzheng is a kind of traditional Chinese instruments with diverse playing techniques. Instrument playing techniques (IPT) play an important role in musical performance. However, most of the existing works for IPT detection show low efficiency for variable-length audio and provide no assurance in the generalization as they rely on a single sound bank for training and testing. In this study, we… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted to ISMIR 2022

  12. arXiv:2209.07753  [pdf, other

    cs.RO

    Code as Policies: Language Model Programs for Embodied Control

    Authors: Jacky Liang, Wenlong Huang, Fei Xia, Peng Xu, Karol Hausman, Brian Ichter, Pete Florence, Andy Zeng

    Abstract: Large language models (LLMs) trained on code completion have been shown to be capable of synthesizing simple Python programs from docstrings [1]. We find that these code-writing LLMs can be re-purposed to write robot policy code, given natural language commands. Specifically, policy code can express functions or feedback loops that process perception outputs (e.g.,from object detectors [2], [3]) a… ▽ More

    Submitted 24 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

  13. arXiv:2208.12613  [pdf, other

    cs.CV cs.LG

    Image augmentation improves few-shot classification performance in plant disease recognition

    Authors: Frank Xiao

    Abstract: With the world population projected to near 10 billion by 2050, minimizing crop damage and guaranteeing food security has never been more important. Machine learning has been proposed as a solution to quickly and efficiently identify diseases in crops. Convolutional Neural Networks typically require large datasets of annotated data which are not available on demand. Collecting this data is a long… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 11 pages, 3 figures, 3 tables

  14. arXiv:2208.10029  [pdf, other

    astro-ph.SR physics.plasm-ph

    Plasma heating and nanoflare caused by slow-mode wave in a coronal loop

    Authors: Fanxiaoyu Xia, Tongjiang Wang, Yang Su, Jie Zhao, Qingmin Zhang, Astrid M. Veronig, Weiqun Gan

    Abstract: We present a detailed analysis of a reflecting intensity perturbation in a large coronal loop that appeared as sloshing oscillation and lasted for at least one and a half periods. The perturbation is initiated by a microflare at one footpoint of the loop, propagates along the loop and is eventually reflected at the remote footpoint where significant brightenings are observed in all the AIA extreme… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: 13pages, 5 figures; Accepted by ApJL

  15. Tunable Strong Magnetic Anisotropy in Two-Dimensional van der Waals Antiferromagnets

    Authors: Qingjun Tong

    Abstract: We show that anisotropic energy of a 2D antiferromagnet is greatly enhanced via stacking on a magnetic substrate layer, arising from the sublattice-dependent interlayer magnetic interaction that defines an effective anisotropic energy. Interestingly, this effective energy couples strongly with the interlayer stacking order and the magnetic order of the substrate layer, providing unique mechanical… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: 8 pages, 4 figures

    Journal ref: Nano Lett. 22, 3946 (2022)

  16. arXiv:2207.14521  [pdf, other

    cs.RO math.OC

    Self-organized Polygon Formation Control based on Distributed Estimation

    Authors: Qingkai Yang, Fan Xiao, **gshuo Lyu, Bo Zhou, Hao Fang

    Abstract: This paper studies the problem of controlling a multi-robot system to achieve a polygon formation in a self-organized manner. Different from the typical formation control strategies where robots are steered to satisfy the predefined control variables, such as pairwise distances, relative positions and bearings, the foremost idea of this paper is to achieve polygon formations by injecting control i… ▽ More

    Submitted 2 April, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

  17. arXiv:2207.13073  [pdf

    cond-mat.mtrl-sci

    Three-dimensional operando optical imaging of single particle and electrolyte heterogeneities inside Li-ion batteries

    Authors: Raj Pandya, Lorenzo Valzania, Florian Dorchies, Fei Xia, Jeffrey Mc Hugh, Angus Mathieson, Jien Hwee Tan, Thomas G. Parton, Michael De Volder, Jean-Marie Tarascon, Sylvain Gigan, Hilton B. de Aguiar, Alexis Grimaud

    Abstract: Understanding (de)lithiation heterogeneities in battery materials is key to ensuring optimal electrochemical performance and develo** better energy storage devices. However, this remains challenging due to the complex three dimensional morphology of microscopic electrode particles, the involvement of both solid and liquid phase reactants, and range of relevant timescales (seconds to hours). Here… ▽ More

    Submitted 27 June, 2022; originally announced July 2022.

    Comments: 29 pages, 6 figures

  18. DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect Estimation

    Authors: Kailiang Zhong, Fengtong Xiao, Yan Ren, Yaorong Liang, Wenqing Yao, Xiaofeng Yang, Ling Cen

    Abstract: Causal Inference has wide applications in various areas such as E-commerce and precision medicine, and its performance heavily relies on the accurate estimation of the Individual Treatment Effect (ITE). Conventionally, ITE is predicted by modeling the treated and control response functions separately in their individual sample spaces. However, such an approach usually encounters two issues in prac… ▽ More

    Submitted 19 October, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Accepted by SIGKDD 2022 Applied Data Science Track

    ACM Class: I.2.m

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  19. arXiv:2207.06333  [pdf, other

    cs.CV

    6D Camera Relocalization in Visually Ambiguous Extreme Environments

    Authors: Yang Zheng, Tolga Birdal, Fei Xia, Yanchao Yang, Yueqi Duan, Leonidas J. Guibas

    Abstract: We propose a novel method to reliably estimate the pose of a camera given a sequence of images acquired in extreme environments such as deep seas or extraterrestrial terrains. Data acquired under these challenging conditions are corrupted by textureless surfaces, image degradation, and presence of repetitive and highly ambiguous structures. When naively deployed, the state-of-the-art methods can f… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

  20. arXiv:2207.06030  [pdf, other

    cs.LG cs.AI stat.ML

    Cost-Effective Online Contextual Model Selection

    Authors: Xuefeng Liu, Fangfang Xia, Rick L. Stevens, Yuxin Chen

    Abstract: How can we collect the most useful labels to learn a model selection policy, when presented with arbitrary heterogeneous data streams? In this paper, we formulate this task as an online contextual active model selection problem, where at each round the learner receives an unlabeled data point along with a context. The goal is to output the best model for any given context without obtaining an exce… ▽ More

    Submitted 17 February, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

  21. arXiv:2207.05608  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Inner Monologue: Embodied Reasoning through Planning with Language Models

    Authors: Wenlong Huang, Fei Xia, Ted Xiao, Harris Chan, Jacky Liang, Pete Florence, Andy Zeng, Jonathan Tompson, Igor Mordatch, Yevgen Chebotar, Pierre Sermanet, Noah Brown, Tomas Jackson, Linda Luu, Sergey Levine, Karol Hausman, Brian Ichter

    Abstract: Recent works have shown how the reasoning capabilities of Large Language Models (LLMs) can be applied to domains beyond natural language processing, such as planning and interaction for robots. These embodied problems require an agent to understand many semantic aspects of the world: the repertoire of skills available, how these skills influence the world, and how changes to the world map back to… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Project website: https://innermonologue.github.io

  22. arXiv:2207.02409  [pdf

    physics.optics physics.app-ph physics.bio-ph

    Sub-monolayer Biolasers: Lower Gain, Higher Sensitivity

    Authors: C. Gong, X. Yang, S. J. Tang, Q. Q. Zhang, Y. Wang, Y. L. Liu, Y. C. Chen, G. D. Peng, X. Fan, Y. F. Xiao, Y. J. Rao, Y. Gong

    Abstract: Biomarker detection is the key to identifying health risks. However, designing sensitive biosensors in a single-use mode for disease diagnosis remains a major challenge. Here, we report sub-monolayer biolasers with remarkable repeatability for ultrasensitive and disposable biomarker detection. The biolaser sensors are designed by employing the telecom optical fibers as distributed optical microcav… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 27 pages, 15 figures

    MSC Class: 78A70

  23. A clinically relevant online patient QA solution with daily CT scans and EPID-based in vivo dosimetry: A feasible study on rectal cancer

    Authors: Liyuan Chen, Zhiyuan Zhang, Lei Yu, Jiyou Peng, Bin Feng, Jun Zhao, Yanfang Liu, Fan Xia, Zhen Zhang, Weigang Hu, Jiazhou Wang

    Abstract: Adaptive radiation therapy (ART) could protect organs at risk (OARs) while maintain high dose coverage to targets. However, there still lack efficient online patient QA methods. We aim to develop a clinically relevant online patient quality assurance (QA) solution for ART using daily CT scans and electronic portal imaging device (EPID)-based in vivo dosimetry. Ten patients with rectal cancer at ou… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  24. arXiv:2206.13090  [pdf, ps, other

    math.OC

    Variance Reduced Random Relaxed Projection Method for Constrained Finite-sum Minimization Problems

    Authors: Zhichun Yang, Fu-quan Xia, Kai Tu, Man-Chung Yue

    Abstract: For many applications in signal processing and machine learning, we are tasked with minimizing a large sum of convex functions subject to a large number of convex constraints. In this paper, we devise a new random projection method (RPM) to efficiently solve this problem. Compared with existing RPMs, our proposed algorithm features two useful algorithmic ideas. First, at each iteration, instead of… ▽ More

    Submitted 5 April, 2024; v1 submitted 27 June, 2022; originally announced June 2022.

  25. arXiv:2206.06489  [pdf, other

    cs.AI cs.CV cs.RO

    BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents

    Authors: Ziang Liu, Roberto Martín-Martín, Fei Xia, Jiajun Wu, Li Fei-Fei

    Abstract: Robots excel in performing repetitive and precision-sensitive tasks in controlled environments such as warehouses and factories, but have not been yet extended to embodied AI agents providing assistance in household tasks. Inspired by the catalyzing effect that benchmarks have played in the AI fields such as computer vision and natural language processing, the community is looking for new benchmar… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  26. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  27. arXiv:2206.00487  [pdf, other

    physics.optics eess.IV

    Physics-based neural network for non-invasive control of coherent light in scattering media

    Authors: Alexandra d'Arco, Fei Xia, Antoine Boniface, Jonathan Dong, Sylvain Gigan

    Abstract: Optical imaging through complex media, such as biological tissues or fog, is challenging due to light scattering. In the multiple scattering regime, wavefront sha** provides an effective method to retrieve information; it relies on measuring how the propagation of different optical wavefronts are impacted by scattering. Based on this principle, several wavefront sha** techniques were successfu… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 15 pages, 11 figures

  28. arXiv:2205.11710  [pdf, other

    cs.CV

    SCVRL: Shuffled Contrastive Video Representation Learning

    Authors: Michael Dorkenwald, Fanyi Xiao, Biagio Brattoli, Joseph Tighe, Davide Modolo

    Abstract: We propose SCVRL, a novel contrastive-based framework for self-supervised learning for videos. Differently from previous contrast learning based methods that mostly focus on learning visual semantics (e.g., CVRL), SCVRL is capable of learning both semantic and motion patterns. For that, we reformulate the popular shuffling pretext task within a modern contrastive learning paradigm. We show that ou… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: CVPR 2022 - L3DIVU workshop

  29. arXiv:2205.11274  [pdf, other

    q-bio.MN stat.ME

    Single-cell gene regulatory network analysis for mixed cell populations with applications to COVID-19 single cell data

    Authors: Junjie Tang, Changhu Wang, Feiyi Xiao, Ruibin Xi

    Abstract: Gene regulatory network (GRN) refers to the complex network formed by regulatory interactions between genes in living cells. In this paper, we consider inferring GRNs in single cells based on single cell RNA sequencing (scRNA-seq) data. In scRNA-seq, single cells are often profiled from mixed populations and their cell identities are unknown. A common practice for single cell GRN analysis is to fi… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 95 pages,28 figures

  30. Arecibo and FAST Timing Follow-up of twelve Millisecond Pulsars Discovered in Commensal Radio Astronomy FAST Survey

    Authors: C. C. Miao, W. W. Zhu, D. Li, P. C. C. Freire, J. R. Niu, P. Wang, J. P. Yuan, M. Y. Xue, A. D. Cameron, D. J. Champion, M. Cruces, Y. T. Chen, M. M. Chi, X. F. Cheng, S. J. Dang, M. F. Ding, Y. Feng, Z. Y. Gan, G. Hobbs, M. Kramer, Z. J. Liu, Y. X. Li, Z. K. Luo, X. L. Miao, L. Q. Meng , et al. (24 additional authors not shown)

    Abstract: We report the phase-connected timing ephemeris, polarization pulse profiles, Faraday rotation measurements, and Rotating-Vector-Model (RVM) fitting results of twelve millisecond pulsars (MSPs) discovered with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) in the Commensal radio Astronomy FAST survey (CRAFTS). The timing campaigns were carried out with FAST and Arecibo over three… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 11 pages, 5 figures, MNRAS accepted

  31. Chaotic dynamics of string around the conformal black hole

    Authors: Da-Zhu Ma, Fang Xia, Dan Zhang, Guoyang Fu, Jian-Pin Wu

    Abstract: In this paper, we make a systematical and in-depth study on the chaotic dynamics of the string around the conformal black hole. Depending on the characteristic parameter of the conformal black hole and the initial position of the string, there are three kinds of dynamical behaviors: ordered, chaotic and being captured, chaotic but not being captured. A particular interesting observation is that th… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

    Comments: 20 pages, 7 figure

    Journal ref: Eur. Phys. J. C (2022) 82:372

  32. arXiv:2204.10773  [pdf

    eess.IV cs.CV cs.LG physics.med-ph

    Denoising of Three-Dimensional Fast Spin Echo Magnetic Resonance Images of Knee Joints using Spatial-Variant Noise-Relevant Residual Learning of Convolution Neural Network

    Authors: Shutian Zhao, Donal G. Cahill, Siyue Li, Fan Xiao, Thierry Blu, James F Griffith, Weitian Chen

    Abstract: Two-dimensional (2D) fast spin echo (FSE) techniques play a central role in the clinical magnetic resonance imaging (MRI) of knee joints. Moreover, three-dimensional (3D) FSE provides high-isotropic-resolution magnetic resonance (MR) images of knee joints, but it has a reduced signal-to-noise ratio compared to 2D FSE. Deep-learning denoising methods are a promising approach for denoising MR images… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 6 figures, abstract accepted by Joint Annual Meeting ISMRM-ESMRMB & ISMRT 31st Annual Meeting

    Journal ref: Computers in Biology and Medicine, Volume 151, Part A, 2022, 106295, ISSN 0010-4825

  33. Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

    Authors: Qun Li, Ziyi Zhang, Fu Xiao, Feng Zhang, Bir Bhanu

    Abstract: A high-resolution network exhibits remarkable capability in extracting multi-scale features for human pose estimation, but fails to capture long-range interactions between joints and has high computational complexity. To address these problems, we present a Dynamic lightweight High-Resolution Network (Dite-HRNet), which can efficiently extract multi-scale contextual information and model long-rang… ▽ More

    Submitted 24 May, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted by IJCAI-ECAI 2022

  34. arXiv:2204.09220  [pdf, other

    cs.CL cs.AI

    LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs

    Authors: Fei Xia, Bin Li, Yixuan Weng, Shizhu He, Kang Liu, Bin Sun, Shutao Li, Jun Zhao

    Abstract: The medical conversational system can relieve the burden of doctors and improve the efficiency of healthcare, especially during the pandemic. This paper presents a medical conversational question answering (CQA) system based on the multi-modal knowledge graph, namely "LingYi", which is designed as a pipeline framework to maintain high flexibility. Our system utilizes automated medical procedures i… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 9 pages, 4 figures, 5 tables

  35. CholecTriplet2021: A benchmark challenge for surgical action triplet recognition

    Authors: Chinedu Innocent Nwoye, Deepak Alapatt, Tong Yu, Armine Vardazaryan, Fangfang Xia, Zixuan Zhao, Tong Xia, Fucang Jia, Yuxuan Yang, Hao Wang, Derong Yu, Guoyan Zheng, Xiaotian Duan, Neil Getty, Ricardo Sanchez-Matilla, Maria Robu, Li Zhang, Huabin Chen, Jiacheng Wang, Liansheng Wang, Bokai Zhang, Beerend Gerats, Sista Raviteja, Rachana Sathish, Rong Tao , et al. (37 additional authors not shown)

    Abstract: Context-aware decision support in the operating room can foster surgical safety and efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing works recognize surgical activities at a coarse-grained level, such as phases, steps or events, leaving out fine-grained interaction details about the surgical activity; yet those are needed for more helpful AI assistance in… ▽ More

    Submitted 29 December, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: CholecTriplet2021 challenge report. Paper accepted at Elsevier journal of Medical Image Analysis. 22 pages, 8 figures, 11 tables. Challenge website: https://cholectriplet2021.grand-challenge.org

    Journal ref: Medical Image Analysis 86 (2023) 102803

  36. arXiv:2204.04344  [pdf, other

    cs.CL

    Towards Better Chinese-centric Neural Machine Translation for Low-resource Languages

    Authors: Bin Li, Yixuan Weng, Fei Xia, Hanjun Deng

    Abstract: The last decade has witnessed enormous improvements in science and technology, stimulating the growing demand for economic and cultural exchanges in various countries. Building a neural machine translation (NMT) system has become an urgent trend, especially in the low-resource setting. However, recent work tends to study NMT systems for low-resource languages centered on English, while few works f… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 7pages, 4 figures, 4 tables

  37. arXiv:2204.03101  [pdf, other

    cs.CV

    Hierarchical Self-supervised Representation Learning for Movie Understanding

    Authors: Fanyi Xiao, Kaustav Kundu, Joseph Tighe, Davide Modolo

    Abstract: Most self-supervised video representation learning approaches focus on action recognition. In contrast, in this paper we focus on self-supervised video learning for movie understanding and propose a novel hierarchical self-supervised pretraining strategy that separately pretrains each level of our hierarchical movie understanding model (based on [37]). Specifically, we propose to pretrain the low-… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: CVPR 2022

  38. arXiv:2204.02676  [pdf, other

    cs.SI

    Detecting Outlier Patterns with Query-based Artificially Generated Searching Conditions

    Authors: Shuo Yu, Feng Xia, Yuchen Sun, Tao Tang, Xiaoran Yan, Ivan Lee

    Abstract: In the age of social computing, finding interesting network patterns or motifs is significant and critical for various areas such as decision intelligence, intrusion detection, medical diagnosis, social network analysis, fake news identification, national security, etc. However, sub-graph matching remains a computationally challenging problem, let alone identifying special motifs among them. This… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  39. arXiv:2204.02667  [pdf, other

    cs.SI

    Familiarity-based Collaborative Team Recognition in Academic Social Networks

    Authors: Shuo Yu, Feng Xia, Chen Zhang, Kathleen Keogh, Honglong Chen

    Abstract: Collaborative teamwork is key to major scientific discoveries. However, the prevalence of collaboration among researchers makes team recognition increasingly challenging. Previous studies have demonstrated that people are more likely to collaborate with individuals they are familiar with. In this work, we employ the definition of familiarity and then propose MOTO (faMiliarity-based cOllaborative T… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  40. arXiv:2204.02656  [pdf, other

    cs.SI cs.LG

    CHIEF: Clustering with Higher-order Motifs in Big Networks

    Authors: Feng Xia, Shuo Yu, Chengfei Liu, Ivan Lee

    Abstract: Clustering a group of vertices in networks facilitates applications across different domains, such as social computing and Internet of Things. However, challenges arises for clustering networks with increased scale. This paper proposes a solution which consists of two motif clustering techniques: standard acceleration CHIEF-ST and approximate acceleration CHIEF-AP. Both algorithms first find the m… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  41. arXiv:2204.01691  [pdf, other

    cs.RO cs.CL cs.LG

    Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

    Authors: Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee , et al. (20 additional authors not shown)

    Abstract: Large language models can encode a wealth of semantic knowledge about the world. Such knowledge could be extremely useful to robots aiming to act upon high-level, temporally extended instructions expressed in natural language. However, a significant weakness of language models is that they lack real-world experience, which makes it difficult to leverage them for decision making within a given embo… ▽ More

    Submitted 16 August, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: See website at https://say-can.github.io/ V1. Initial Upload. V2. Added PaLM results. Added study about new capabilities (drawer manipulation, chain of thought prompting, multilingual instructions). Added an ablation study of language model size. Added an open-source version of \algname on a simulated tabletop environment. Improved readability

  42. arXiv:2203.16319  [pdf, other

    cs.CV cs.RO

    Multi-Robot Active Map** via Neural Bipartite Graph Matching

    Authors: Kai Ye, Siyan Dong, Qingnan Fan, He Wang, Li Yi, Fei Xia, Jue Wang, Baoquan Chen

    Abstract: We study the problem of multi-robot active map**, which aims for complete scene map construction in minimum time steps. The key to this problem lies in the goal position estimation to enable more efficient robot movements. Previous approaches either choose the frontier as the goal position via a myopic solution that hinders the time efficiency, or maximize the long-term value via reinforcement l… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  43. arXiv:2203.13498  [pdf

    cond-mat.mes-hall

    Strain-dependent structural and electronic reconstructions in long-wavelength WS$_{2}$ moiré superlattices

    Authors: Kai-Hui Li, Fei-** Xiao, Wen Guan, Yu-Long Xiao, Chang Xu, **-Ding Zhang, Chen-Fang Lin, Dong Li, Qing-Jun Tong, Si-Yu Li, An-Lian Pan

    Abstract: In long-wavelength moiré superlattices of stacked transition metal dichalcogenides (TMDs), structural reconstruction ubiquitously occurs, which has reported to impact significantly their electronic properties. However, complete microscopic understandings of the interplay between the lattice reconstruction and alteration of electronic properties, and their further response to external perturbations… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

  44. arXiv:2203.09020  [pdf, other

    cs.LG cs.AI

    Graph Augmentation Learning

    Authors: Shuo Yu, Huafei Huang, Minh N. Dao, Feng Xia

    Abstract: Graph Augmentation Learning (GAL) provides outstanding solutions for graph learning in handling incomplete data, noise data, etc. Numerous GAL methods have been proposed for graph-based applications such as social network analysis and traffic flow forecasting. However, the underlying reasons for the effectiveness of these GAL methods are still unclear. As a consequence, how to choose optimal graph… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: 14 pages, 4 figures, Accepted in The First International Workshop on Graph Learning in IW3C2

  45. arXiv:2203.07999  [pdf, other

    cs.NI cs.LG

    MSCET: A Multi-Scenario Offloading Schedule for Biomedical Data Processing and Analysis in Cloud-Edge-Terminal Collaborative Vehicular Networks

    Authors: Zhichen Ni, Honglong Chen, Zhe Li, Xiaomeng Wang, Na Yan, Weifeng Liu, Feng Xia

    Abstract: With the rapid development of Artificial Intelligence (AI) and Internet of Things (IoTs), an increasing number of computation intensive or delay sensitive biomedical data processing and analysis tasks are produced in vehicles, bringing more and more challenges to the biometric monitoring of drivers. Edge computing is a new paradigm to solve these challenges by offloading tasks from the resource-li… ▽ More

    Submitted 16 February, 2022; originally announced March 2022.

  46. arXiv:2202.13368  [pdf, other

    physics.ao-ph physics.comp-ph

    A nonhydrostatic atmospheric dynamical core on cubed sphere using hybrid multi-moment finite-volume/finite difference methods: formulations and preliminary tests

    Authors: Chungang Chen, Xingliang Li, Feng Xiao, Xueshun Shen

    Abstract: A nonhydrostatic dynamical core has been developed by using the multi-moment finite volume method that ensures the rigorous numerical conservation. To represent the spherical geometry free of polar problems, the cubed-sphere grid is adopted. A fourth-order multi-moment discretization formulation is applied to solve the governing equations cast in the local curvilinear coordinates on each patch of… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: 37 pages, 5 figures. arXiv admin note: text overlap with arXiv:2004.06290

  47. arXiv:2202.12151  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Magnon corner states in twisted bilayer honeycomb magnets

    Authors: Chun-Bo Hua, Zheng-Rong Liu, **-Hua Sun, **-Hua Gao, Chui-Zhen Chen, Qingjun Tong, Bin Zhou, Dong-Hui Xu

    Abstract: Search for higher-order topological insulators, characterized by topologically protected gapless boundary states of codimension higher than one, in bosonic systems has attracted growing interest. Here, we establish twisted bilayer honeycomb magnets as a new platform for hosting second-order topological magnon insulators (SOTMIs) without fine-tuning. We employ a simple, minimal Heisenberg spin mode… ▽ More

    Submitted 3 January, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted for publication as a Letter in Phys. Rev. B

    Journal ref: Phys. Rev. B 107, L020404 (2023)

  48. arXiv:2202.12009  [pdf

    physics.med-ph

    First implementation of full-workflow automation in radiotherapy: the All-in-One solution on rectal cancer

    Authors: Lei Yu, Jun Zhao, Fan Xia, Zhiyuan Zhang, Yanfang Liu, Wei Zhang, **gjie Zhou, Jiazhou Wang, Weigang Hu, Zhen Zhang

    Abstract: The aim of this work is to describe the technical characteristics of an AI-powered radiotherapy workflow that enables full-process automation (All-in-One), evaluate its performance implemented for on-couch initial treatment of rectal cancer, and provide insight into the behavior of full-workflow automation in the specialty of radiotherapy. The All-in-One workflow was developed based on a CT-integr… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  49. arXiv:2202.11608  [pdf, other

    cs.SI

    How to optimize an academic team when the outlier member is leaving?

    Authors: Shuo Yu, Jiaying Liu, Feng Xia, Haoran Wei, Hanghang Tong

    Abstract: An academic team is a highly-cohesive collaboration group of scholars, which has been recognized as an effective way to improve scientific output in terms of both quality and quantity. However, the high staff turnover brings about a series of problems that may have negative influence on team performance. To address this challenge, we first detect the tendency of the member who may potentially leav… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  50. arXiv:2202.11435  [pdf, other

    cs.SI cs.CY

    Data-Driven Decision Making in COVID-19 Response: A Survey

    Authors: Shuo Yu, Qing Qing, Chen Zhang, Ahsan Shehzad, Giles Oatley, Feng Xia

    Abstract: COVID-19 has spread all over the world, having an enormous effect on our daily life and work. In response to the epidemic, a lot of important decisions need to be taken to save communities and economies worldwide. Data clearly plays a vital role in effective decision making. Data-driven decision making uses data related evidence and insights to guide the decision making process and to verify the p… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.