Showing 1–50 of 57 results for author: Shan, M

  1. arXiv:2405.18483  [pdf, other]

    cs.CV

    Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

    Authors: Mengyi Shan, Lu Dong, Yutao Han, Yuan Yao, Tao Liu, Ifeoma Nwogu, Guo-Jun Qi, Mitch Hill

    Abstract: This work aims to generate natural and diverse group motions of multiple humans from textual descriptions. While single-person text-to-motion generation is extensively studied, it remains challenging to synthesize motions for more than one or two subjects from in-the-wild prompts, mainly due to the lack of available datasets. In this work, we curate human pose and motion datasets by estimating pos…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Project page: https://shanmy.github.io/Multi-Motion/

  2. arXiv:2404.06256  [pdf, other]

    cs.CV cs.RO

    Label-Efficient 3D Object Detection For Road-Side Units

    Authors: Minh-Quan Dao, Holger Caesar, Julie Stephany Berrio, Mao Shan, Stewart Worrall, Vincent Frémont, Ezio Malis

    Abstract: Occlusion presents a significant challenge for safety-critical applications such as autonomous driving. Collaborative perception has recently attracted a large research interest thanks to the ability to enhance the perception of autonomous vehicles via deep information fusion with intelligent roadside units (RSU), thus minimizing the impact of occlusion. While significant advancement has been made…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: IV 2024

  3. arXiv:2403.17960  [pdf, ps, other]

    math.GR

    On generalizations of Iwasawa's theorem

    Authors: Jiangtao Shi, Fanjie Xu, Mengjiao Shan

    Abstract: Iwasawa's theorem indicates that a finite group $G$ is supersolvable if and only if all maximal chains of the identity in $G$ have the same length. As generalizations of Iwasawa's theorem, we provide some characterizations of the structure of a finite group $G$ in which all maximal chains of every minimal subgroup have the same length. Moreover, let $δ(G)$ be the number of subgroups of $G$ all of…

    Submitted 13 March, 2024; originally announced March 2024.

    MSC Class: 20D10

  4. arXiv:2403.01644  [pdf, other]

    cs.CV cs.RO

    OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction

    Authors: Zhenxing Ming, Julie Stephany Berrio, Mao Shan, Stewart Worrall

    Abstract: A comprehensive understanding of 3D scenes is crucial in autonomous vehicles (AVs), and recent models for 3D semantic occupancy prediction have successfully addressed the challenge of describing real-world objects with varied shapes and classes. However, existing methods for 3D occupancy prediction heavily rely on surround-view camera images, making them susceptible to changes in lighting and weat…

    Submitted 9 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  5. arXiv:2401.12422  [pdf, other]

    cs.CV cs.RO

    InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction

    Authors: Zhenxing Ming, Julie Stephany Berrio, Mao Shan, Stewart Worrall

    Abstract: This paper introduces InverseMatrixVT3D, an efficient method for transforming multi-view image features into 3D feature volumes for 3D semantic occupancy prediction. Existing methods for constructing 3D volumes often rely on depth estimation, device-specific operators, or transformer queries, which hinders the widespread adoption of 3D occupancy models. In contrast, our approach leverages two proj…

    Submitted 29 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  6. arXiv:2311.18303  [pdf, other]

    cs.CV

    OmniMotionGPT: Animal Motion Generation with Limited Data

    Authors: Zhangsihao Yang, Mingyuan Zhou, Mengyi Shan, Bingbing Wen, Ziwei Xuan, Mitch Hill, Junjie Bai, Guo-Jun Qi, Yalin Wang

    Abstract: Our paper aims to generate diverse and realistic animal motion sequences from textual descriptions, without a large-scale animal text-motion dataset. While the task of text-driven human motion synthesis is already extensively studied and benchmarked, it remains challenging to transfer this success to other skeleton structures with limited data. In this work, we design a model architecture that imi…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: The project page is at https://zshyang.github.io/omgpt-website/

  7. arXiv:2310.17858  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Emergent spin-glass state in the doped Hund's metal CsFe2As2

    Authors: S. J. Li, D. Zhao, S. Wang, S. T. Cui, N. Z. Wang, J. Li, D. W. Song, B. L. Kang, L. X. Zheng, L. P. Nie, Z. M. Wu, Y. B. Zhou, M. Shan, Z. Sun, T. Wu, X. H. Chen

    Abstract: Hund's metal is one kind of correlated metal, in which the electronic correlation is strongly influenced by the Hund's interaction. At high temperatures, while the charge and orbital degrees of freedom are quenched, the spin degrees of freedom can persist in terms of frozen moments. As temperature decreases, a coherent electronic state with characteristic orbital differentiation always emerges at…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 21 pages, 7 figures

    Journal ref: Phys. Rev. B 107, 115144 (2023)

  8. arXiv:2310.16311  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci

    Magnetic-field-induced electronic instability of Weyl-like fermions in compressed black phosphorus

    Authors: Lixuan Zheng, Kaifa Luo, Zeliang Sun, Dan Zhao, Jian Li, Dianwu Song, Shunjiao Li, Baolei Kang, Linpeng Nie, Min Shan, Zhimian Wu, Yanbing Zhou, Xi Dai, Hongming Weng, Rui Yu, Tao Wu, Xianhui Chen

    Abstract: Revealing the role of Coulomb interaction in topological semimetals with Dirac/Weyl-like band dispersion shapes a new frontier in condensed matter physics. Topological node-line semimetals (TNLSMs), anticipated as a fertile ground for exploring electronic correlation effects due to the anisotropy associated with their node-line structure, have recently attracted considerable attention. In this stu…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures

    Journal ref: Sci. China-Phys. Mech. Astron. 66, 117011 (2023)

  9. arXiv:2310.11608  [pdf, other]

    cs.RO cs.CV cs.HC

    Classification of Safety Driver Attention During Autonomous Vehicle Operation

    Authors: Santiago Gerling Konrad, Julie Stephany Berrio, Mao Shan, Favio Masson, Stewart Worrall

    Abstract: Despite the continual advances in Advanced Driver Assistance Systems (ADAS) and the development of high-level autonomous vehicles (AV), there is a general consensus that for the short to medium term, there is a requirement for a human supervisor to handle the edge cases that inevitably arise. Given this requirement, it is essential that the state of the vehicle operator is monitored to ensure they…

    Submitted 17 October, 2023; originally announced October 2023.

  10. Animating Street View

    Authors: Mengyi Shan, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz

    Abstract: We present a system that automatically brings street view imagery to life by populating it with naturally behaving, animated pedestrians and vehicles. Our approach is to remove existing people and vehicles from the input image, insert moving objects with proper scale, angle, motion, and appearance, plan paths and traffic behavior, as well as render the scene with plausible occlusion and shadowing…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: SIGGRAPH Asia 2023 Conference Track

  11. arXiv:2309.07273  [pdf]

    stat.ME stat.AP

    Real Effect or Bias? Best Practices for Evaluating the Robustness of Real-World Evidence through Quantitative Sensitivity Analysis for Unmeasured Confounding

    Authors: Douglas Faries, Chenyin Gao, Xiang Zhang, Chad Hazlett, James Stamey, Shu Yang, Peng Ding, Mingyang Shan, Kristin Sheffield, Nancy Dreyer

    Abstract: The assumption of no unmeasured confounders is a critical but unverifiable assumption required for causal inference yet quantitative sensitivity analyses to assess robustness of real-world evidence remains underutilized. The lack of use is likely in part due to complexity of implementation and often specific and restrictive data requirements required for application of each method. With the advent…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 16 pages which includes 5 figures

    MSC Class: Primary 62

  12. arXiv:2308.05988  [pdf, other]

    cs.CV

    MS3D++: Ensemble of Experts for Multi-Source Unsupervised Domain Adaptation in 3D Object Detection

    Authors: Darren Tsai, Julie Stephany Berrio, Mao Shan, Eduardo Nebot, Stewart Worrall

    Abstract: Deploying 3D detectors in unfamiliar domains has been demonstrated to result in a significant 70-90% drop in detection rate due to variations in lidar, geography, or weather from their training dataset. This domain gap leads to missing detections for densely observed objects, misaligned confidence scores, and increased high-confidence false positives, rendering the detector highly unreliable. To a…

    Submitted 4 September, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  13. arXiv:2308.05614  [pdf, other]

    stat.ME stat.AP

    Bayesian Record Linkage with Variables in One File

    Authors: Gauri Kamat, Mingyang Shan, Roee Gutman

    Abstract: In many healthcare and social science applications, information about units is dispersed across multiple data files. Linking records across files is necessary to estimate the associations of interest. Common record linkage algorithms only rely on similarities between linking variables that appear in all the files. Moreover, analysis of linked files often ignores errors that may arise from incorrec…

    Submitted 30 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Journal ref: Statistics in Medicine, 2023

  14. arXiv:2308.03542  [pdf]

    cs.LG

    A Transfer Learning Framework for Proactive Ramp Metering Performance Assessment

    Authors: Xiaobo Ma, Adrian Cottam, Mohammad Razaur Rahman Shaon, Yao-Jan Wu

    Abstract: Transportation agencies need to assess ramp metering performance when deploying or expanding a ramp metering system. The evaluation of a ramp metering strategy is primarily centered around examining its impact on freeway traffic mobility. One way these effects can be explored is by comparing traffic states, such as the speed before and after the ramp metering strategy has been altered. Predicting…

    Submitted 7 August, 2023; originally announced August 2023.

  15. arXiv:2307.07442  [pdf]

    stat.ME

    Sensitivity Analysis for Unmeasured Confounding in Medical Product Development and Evaluation Using Real World Evidence

    Authors: Peng Ding, Yixin Fang, Doug Faries, Susan Gruber, Hana Lee, Joo-Yeon Lee, Pallavi Mishra-Kalyani, Mingyang Shan, Mark van der Laan, Shu Yang, Xiang Zhang

    Abstract: The American Statistical Association Biopharmaceutical Section (ASA BIOP) working group on real-world evidence (RWE) has been making continuous, extended effort towards a goal of supporting and advancing regulatory science with respect to non-interventional, clinical studies intended to use real-world data for evidence generation for the purpose of medical product development and evaluation (i.e.,…

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 17 pages, 2 figures

  16. arXiv:2307.07196  [pdf, other]

    cs.CV cs.RO

    LightFormer: An End-to-End Model for Intersection Right-of-Way Recognition Using Traffic Light Signals and an Attention Mechanism

    Authors: Zhenxing Ming, Julie Stephany Berrio, Mao Shan, Eduardo Nebot, Stewart Worrall

    Abstract: For smart vehicles driving through signalised intersections, it is crucial to determine whether the vehicle has right of way given the state of the traffic lights. To address this issue, camera based sensors can be used to determine whether the vehicle has permission to proceed straight, turn left or turn right. This paper proposes a novel end to end intersection right of way recognition model cal…

    Submitted 14 July, 2023; originally announced July 2023.

  17. arXiv:2307.01462  [pdf, other]

    cs.RO cs.CV

    Practical Collaborative Perception: A Framework for Asynchronous and Multi-Agent 3D Object Detection

    Authors: Minh-Quan Dao, Julie Stephany Berrio, Vincent Frémont, Mao Shan, Elwan Héry, Stewart Worrall

    Abstract: Occlusion is a major challenge for LiDAR-based object detection methods. This challenge becomes safety-critical in urban traffic where the ego vehicle must have reliable object detection to avoid collision while its field of view is severely reduced due to the obstruction posed by a large number of road users. Collaborative perception via Vehicle-to-Everything (V2X) communication, which leverages…

    Submitted 19 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: The code is available at https://github.com/quan-dao/practical-collab-perception

  18. arXiv:2306.16642  [pdf, other]

    stat.ME stat.AP

    Integrating Randomized Placebo-Controlled Trial Data with External Controls: A Semiparametric Approach with Selective Borrowing

    Authors: Chenyin Gao, Shu Yang, Mingyang Shan, Wenyu Ye, Ilya Lipkovich, Douglas Faries

    Abstract: In recent years, real-world external controls (ECs) have grown in popularity as a tool to empower randomized placebo-controlled trials (RPCTs), particularly in rare diseases or cases where balanced randomization is unethical or impractical. However, as ECs are not always comparable to the RPCTs, direct borrowing ECs without scrutiny may heavily bias the treatment effect estimator. Our paper propos…

    Submitted 28 June, 2023; originally announced June 2023.

  19. arXiv:2304.02431  [pdf, other]

    cs.CV

    MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object Detection

    Authors: Darren Tsai, Julie Stephany Berrio, Mao Shan, Eduardo Nebot, Stewart Worrall

    Abstract: We introduce Multi-Source 3D (MS3D), a new self-training pipeline for unsupervised domain adaptation in 3D object detection. Despite the remarkable accuracy of 3D detectors, they often overfit to specific domain biases, leading to suboptimal performance in various sensor setups and environments. Existing methods typically focus on adapting a single detector to the target domain, overlooking the fa…

    Submitted 8 May, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Our code is available at https://github.com/darrenjkt/MS3D

  20. arXiv:2210.04114  [pdf, other]

    cs.LG

    Towards Real-Time Temporal Graph Learning

    Authors: Deniz Gurevin, Mohsin Shan, Tong Geng, Weiwen Jiang, Caiwen Ding, Omer Khan

    Abstract: In recent years, graph representation learning has gained significant popularity, which aims to generate node embeddings that capture features of graphs. One of the methods to achieve this is employing a technique called random walks that captures node sequences in a graph and then learns embeddings for each node using a natural language processing technique called Word2Vec. These embeddings are t…

    Submitted 11 October, 2022; v1 submitted 8 October, 2022; originally announced October 2022.

  21. arXiv:2209.12165  [pdf, other]

    nucl-ex nucl-th

    An Indirect Measurement of $^6$Li(n,$γ$) Cross Sections

    Authors: Midhun C. V, M. M Musthafa, S. V Suryanarayana, Gokuldas H, Shaima A, Hajara. K, Antony Joseph, T. Santhosh, A. Baishya, A Pal, P. C Rout, S Santra, P. T. M Shan, Satheesh B, B. V. John, K. C Jagadeesan, S. Ganesan

    Abstract: The $^6$Li(n,$γ$)$^7$Li cross sections in the neutron energy range of 0.6 to 4 MeV have been measured by the experimental implementation of the direct capture formalism. This was done by measuring the $γ$ transition probability experimentally and accounting for the spin factor by theoretical calculation. The electromagnetic transition probabilities from $^7$Li$^*$ analogous to the initial neutron…

    Submitted 25 September, 2022; originally announced September 2022.

  22. arXiv:2209.07340  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Emergent charge order and unconventional superconductivity in pressurized kagome superconductor CsV3Sb5

    Authors: Lixuan Zheng, Zhimian Wu, Ye Yang, Linpeng Nie, Min Shan, Kuanglv Sun, Dianwu Song, Fanghang Yu, Jian Li, Dan Zhao, Shunjiao Li, Baolei Kang, Yanbing Zhou, Kai Liu, Ziji Xiang, Jianjun Ying, Zhenyu Wang, Tao Wu, Xianhui Chen

    Abstract: The discovery of multiple electronic orders in kagome superconductors AV3Sb5 (A = K, Rb, Cs) provides a promising platform for exploring unprecedented emergent physics. Under moderate pressure (< 2.2 GPa), the triple-Q charge density wave (CDW) order is monotonically suppressed by pressure, while the superconductivity displays a two-dome-like behavior, suggesting an unusual interplay between super…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 33 pages, 14 figures, Supplementary information available on request, Accepted for publication in Nature

  23. arXiv:2209.06407  [pdf, other]

    cs.CV

    Viewer-Centred Surface Completion for Unsupervised Domain Adaptation in 3D Object Detection

    Authors: Darren Tsai, Julie Stephany Berrio, Mao Shan, Eduardo Nebot, Stewart Worrall

    Abstract: Every autonomous driving dataset has a different configuration of sensors, originating from distinct geographic regions and covering various scenarios. As a result, 3D detectors tend to overfit the datasets they are trained on. This causes a drastic decrease in accuracy when the detectors are trained on one dataset and tested on another. We observe that lidar scan pattern differences form a large…

    Submitted 14 September, 2022; originally announced September 2022.

  24. arXiv:2203.16964  [pdf, other]

    cs.RO

    A Novel Probabilistic V2X Data Fusion Framework for Cooperative Perception

    Authors: Mao Shan, Karan Narula, Stewart Worrall, Yung Fei Wong, Julie Stephany Berrio Perez, Paul Gray, Eduardo Nebot

    Abstract: The paper addresses the vehicle-to-X (V2X) data fusion for cooperative or collective perception (CP). This emerging and promising intelligent transportation systems (ITS) technology has enormous potential for improving efficiency and safety of road transportation. Recent advances in V2X communication primarily address the definition of V2X messages and data dissemination amongst ITS stations (ITS-…

    Submitted 31 March, 2022; originally announced March 2022.

  25. arXiv:2112.11427  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation

    Authors: Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman

    Abstract: We introduce a high resolution, 3D-consistent image and shape generation technique which we call StyleSDF. Our method is trained on single-view RGB data only, and stands on the shoulders of StyleGAN2 for image generation, while solving two main challenges in 3D-aware GANs: 1) high-resolution, view-consistent generation of the RGB images, and 2) detailed 3D shape. We achieve this by merging a SDF-b…

    Submitted 29 March, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: Camera-Ready version. Paper was accepted as oral to CVPR 2022. Added discussions and figures from the rebuttal to the supplementary material (sections C & F). Project Webpage: https://stylesdf.github.io/

  26. Using principal stratification in analysis of clinical trials

    Authors: Ilya Lipkovich, Bohdana Ratitch, Yongming Qu, Xiang Zhang, Mingyang Shan, Craig Mallinckrodt

    Abstract: The ICH E9(R1) addendum (2019) proposed principal stratification (PS) as one of five strategies for dealing with intercurrent events. Therefore, understanding the strengths, limitations, and assumptions of PS is important for the broad community of clinical trialists. Many approaches have been developed under the general framework of PS in different areas of research, including experimental and ob…

    Submitted 22 October, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Journal ref: Statistics in Medicine. 41(19) 3837-3877 (2022)

  27. See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain Adaptation

    Authors: Darren Tsai, Julie Stephany Berrio, Mao Shan, Stewart Worrall, Eduardo Nebot

    Abstract: Sampling discrepancies between different manufacturers and models of lidar sensors result in inconsistent representations of objects. This leads to performance degradation when 3D detectors trained for one lidar are tested on other types of lidars. Remarkable progress in lidar manufacturing has brought about advances in mechanical, solid-state, and recently, adjustable scan pattern lidars. For the…

    Submitted 10 April, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Published in RAL and presented in IROS 2022. Code is available at https://github.com/darrenjkt/SEE-MTDA

    Journal ref: IEEE Robotics and Automation Letters (2022)

  28. arXiv:2108.01382  [pdf, other]

    nucl-ex nucl-th

    Impact of $^7$Be breakup on $^7$Li(p,n) Neutron Spectrum

    Authors: Midhun C. V, M. M Musthafa, S. V Suryanarayana, T. Santhosh, A. Baishya, P. Patil, A Pal, P. C Rout, S Santra, R. Kujur, Antony Joseph, Shaima A, Hajara. K, P. T. M Shan, Satheesh B, Y. Sawant, B. V. John, E. T Mirgule, K. C Jagadeesan, S. Ganesan

    Abstract: The formation of continuum neutron distribution in $^7$Li(p,n) has been identified as due to the coupling of the $^7$Be breakup levels to the final state of the reaction. The continuum neutron spectra produced by $^7$Li(p,n) reaction has been estimated by measuring the double differential cross sections for continuum and resonant breakup of $^7$Be, through $^7$Li(p,n)$^7$Be$^*$ reaction at 21 MeV…

    Submitted 3 August, 2021; originally announced August 2021.

  29. ELLIPSDF: Joint Object Pose and Shape Optimization with a Bi-level Ellipsoid and Signed Distance Function Description

    Authors: Mo Shan, Qiaojun Feng, You-Yi Jau, Nikolay Atanasov

    Abstract: Autonomous systems need to understand the semantics and geometry of their surroundings in order to comprehend and safely execute object-level task specifications. This paper proposes an expressive yet compact model for joint object pose and shape optimization, and an associated optimization algorithm to infer an object-level map from multi-view RGB-D camera observations. The model is expressive be…

    Submitted 31 July, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021

    Journal ref: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, pp. 5926-5935

  30. arXiv:2104.11187  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Intrinsic Spin Susceptibility and Pseudogap-like Behavior in Infinite-Layer LaNiO2

    Authors: D. Zhao, Y. B. Zhou, Y. Fu, L. Wang, X. F. Zhou, H. Cheng, J. Li, D. W. Song, S. J. Li, B. L. Kang, L. X. Zheng, L. P. Nie, Z. M. Wu, M. Shan, F. H. Yu, J. J. Ying, S. M. Wang, J. W. Mei, T. Wu, X. H. Chen

    Abstract: The recent discovery of superconductivity in doped infinite-layer nickelates has stimulated intensive interest, especially for similarities and differences compared to that in cuprate superconductors. In contrast to cuprates, although earlier magnetization measurement reveals a Curie-Weiss-like behavior in undoped infinite-layer nickelates, there is no magnetic ordering observed by elastic neutron…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 14 pages, 4 figures, accepted by Physical Review Letters

  31. arXiv:2104.09173  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Orbital ordering and fluctuations in a kagome superconductor CsV3Sb5

    Authors: D. W. Song, L. X. Zheng, F. H. Yu, J. Li, L. P. Nie, M. Shan, D. Zhao, S. J. Li, B. L. Kang, Z. M. Wu, Y. B. Zhou, K. L. Sun, K. Liu, X. G. Luo, Z. Y. Wang, J. J. Ying, X. G. Wan, T. Wu, X. H. Chen

    Abstract: Recently, competing electronic instabilities, including superconductivity and density-wave-like order, have been discovered in vanadium-based kagome metals AV3Sb5 (A = K, Rb, Cs) with a nontrivial band topology. This finding stimulates wide interests to study the interplay of these competing electronic orders and possible exotic excitations in the superconducting state. Here, in order to further c…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: 13 pages, 4 figures, supplementary information available upon request

  32. arXiv:2104.06147  [pdf, other]

    cs.RO

    What is the appropriate speed for an autonomous vehicle? Designing a Pedestrian Aware Contextual Speed Controller

    Authors: Daniel Jiang, Stewart Worrall, Mao Shan

    Abstract: Social acceptance is a major hurdle for autonomous vehicle technology, central to which is ensuring both passengers and nearby pedestrians feel safe. This idea of `feeling safe' and perceived safety is highly subjective and rooted in human intuition. As such, traditional analytical approaches to autonomous navigation often fail to cater for the social expectations of individuals. Therefore, this p…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 8 pages

  33. arXiv:2103.12287  [pdf, other]

    cs.CV cs.RO

    Optimising the selection of samples for robust lidar camera calibration

    Authors: Darren Tsai, Stewart Worrall, Mao Shan, Anton Lohr, Eduardo Nebot

    Abstract: We propose a robust calibration pipeline that optimises the selection of calibration samples for the estimation of calibration parameters that fit the entire scene. We minimise user error by automating the data selection process according to a metric, called Variability of Quality (VOQ) that gives a score to each calibration set of samples. We show that this VOQ score is correlated with the estima…

    Submitted 22 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: ITSC2021

    MSC Class: 68U10 ACM Class: I.4.8

  34. Localization and Mapping using Instance-specific Mesh Models

    Authors: Qiaojun Feng, Yue Meng, Mo Shan, Nikolay Atanasov

    Abstract: This paper focuses on building semantic maps, containing object poses and shapes, using a monocular camera. This is an important problem because robots need rich understanding of geometry and context if they are to shape the future of transportation, construction, and agriculture. Our contribution is an instance-specific mesh model of object shape that can be optimized online based on semantic inf…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 8 pages, 9 figures

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019, pp. 4985-4991

  35. arXiv:2011.11191  [pdf, other]

    cs.RO

    Socially Aware Crowd Navigation with Multimodal Pedestrian Trajectory Prediction for Autonomous Vehicles

    Authors: Kunming Li, Mao Shan, Karan Narula, Stewart Worrall, Eduardo Nebot

    Abstract: Seamlessly operating an autonomous vehicle in a crowded pedestrian environment is a very challenging task. This is because human movement and interactions are very hard to predict in such environments. Recent work has demonstrated that reinforcement learning-based methods have the ability to learn to drive in crowds. However, these methods can have very poor performance due to inaccurate predictio…

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: 8 pages, 6 figures, submitted to IEEE International Conference on Intelligent Transportation Systems 2020

  36. arXiv:2011.11190  [pdf, other]

    cs.CV cs.RO

    Attentional-GCNN: Adaptive Pedestrian Trajectory Prediction towards Generic Autonomous Vehicle Use Cases

    Authors: Kunming Li, Stuart Eiffert, Mao Shan, Francisco Gomez-Donoso, Stewart Worrall, Eduardo Nebot

    Abstract: Autonomous vehicle navigation in shared pedestrian environments requires the ability to predict future crowd motion both accurately and with minimal delay. Understanding the uncertainty of the prediction is also crucial. Most existing approaches however can only estimate uncertainty through repeated sampling of generative models. Additionally, most current predictive models are trained on datasets…

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: 8 pages, 5 figures, submitted to ICRA 2021

    MSC Class: 68T40

  37. arXiv:2011.08581  [pdf, other]

    cs.RO

    Demonstrations of Cooperative Perception: Safety and Robustness in Connected and Automated Vehicle Operations

    Authors: Mao Shan, Karan Narula, Yung Fei Wong, Stewart Worrall, Malik Khan, Paul Alexander, Eduardo Nebot

    Abstract: Cooperative perception, or collective perception (CP) is an emerging and promising technology for intelligent transportation systems (ITS). It enables an ITS station (ITS-S) to share its local perception information with others by means of vehicle-to-X (V2X) communication, thereby achieving improved efficiency and safety in road transportation. In this paper, we present our recent progress on the…

    Submitted 13 January, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

  38. arXiv:2010.12173  [pdf, other]

    eess.AS cs.SD

    A Cross-Verification Approach for Protecting World Leaders from Fake and Tampered Audio

    Authors: Mengyi Shan, TJ Tsai

    Abstract: This paper tackles the problem of verifying the authenticity of speech recordings from world leaders. Whereas previous work on detecting deep fake or tampered audio focus on scrutinizing an audio recording in isolation, we instead reframe the problem and focus on cross-verifying a questionable recording against trusted references. We present a method for cross-verifying a speech recording against…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 5 pages, 4 figures, 1 table

  39. arXiv:2008.12449  [pdf, other]

    cs.RO eess.SP

    Long-term map maintenance pipeline for autonomous vehicles

    Authors: Julie Stephany Berrio, Stewart Worrall, Mao Shan, Eduardo Nebot

    Abstract: For autonomous vehicles to operate persistently in a typical urban environment, it is essential to have high accuracy position information. This requires a mapping and localisation system that can adapt to changes over time. A localisation approach based on a single-survey map will not be suitable for long-term operation as it does not incorporate variations in the environment. In this paper, we p…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: Paper submitted to IEE ITS Transactions

    MSC Class: 00-02 ACM Class: I.4

  40. arXiv:2008.03744  [pdf, ps, other]

    math.AP

    Harnack's inequality for quasilinear elliptic equations with generalized Orlicz growth

    Authors: M. A. Shan, I. I. Skrypnik, M. V. Voitovych

    Abstract: We prove Harnack's inequality for bounded weak solutions to quasilinear second order elliptic equations with generalized Orlicz growth conditions. Our approach covers new cases of variable exponent and (p,q) growth conditions.

    Submitted 9 August, 2020; originally announced August 2020.

  41. arXiv:2007.15107  [pdf, other

    cs.RO cs.CV

    OrcVIO: Object residual constrained Visual-Inertial Odometry

    Authors: Mo Shan, Vikas Dhiman, Qiaojun Feng, Jinzhao Li, Nikolay Atanasov

    Abstract: Introducing object-level semantic information into simultaneous localization and mapping (SLAM) systems is critical. It not only improves the performance but also enables tasks specified in terms of meaningful objects. This work presents OrcVIO, for visual-inertial odometry tightly coupled with tracking and optimization over structured object models. OrcVIO differentiates through semantic feature a…

    Submitted 29 May, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Submitted to T-RO

  42. arXiv:2007.14580  [pdf, other

    cs.MM cs.SD eess.AS eess.IV

    Improved Handling of Repeats and Jumps in Audio-Sheet Image Synchronization

    Authors: Mengyi Shan, TJ Tsai

    Abstract: This paper studies the problem of automatically generating piano score following videos given an audio recording and raw sheet music images. Whereas previous works focus on synthetic sheet music where the data has been cleaned and preprocessed, we instead focus on developing a system that can cope with the messiness of raw, unprocessed sheet music PDFs from IMSLP. We investigate how well existing…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 8 pages, 5 figures. Accepted paper at the International Society for Music Information Retrieval Conference (ISMIR) 2020

  43. arXiv:2007.05490  [pdf, other

    cs.CV cs.RO

    Camera-Lidar Integration: Probabilistic sensor fusion for semantic mapping

    Authors: Julie Stephany Berrio, Mao Shan, Stewart Worrall, Eduardo Nebot

    Abstract: An automated vehicle operating in an urban environment must be able to perceive and recognise objects/obstacles in a three-dimensional world while navigating in a constantly changing environment. In order to plan and execute accurate sophisticated driving maneuvers, a high-level contextual understanding of the surroundings is essential. Due to the recent progress in image processing, it is now poss…

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: 15 pages. arXiv admin note: text overlap with arXiv:2003.01871

  44. Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction using a Graph Vehicle-Pedestrian Attention Network

    Authors: Stuart Eiffert, Kunming Li, Mao Shan, Stewart Worrall, Salah Sukkarieh, Eduardo Nebot

    Abstract: Understanding and predicting the intention of pedestrians is essential to enable autonomous vehicles and mobile robots to navigate crowds. This problem becomes increasingly complex when we consider the uncertainty and multimodality of pedestrian motion, as well as the implicit interactions between members of a crowd, including any response to a vehicle. Our approach, Probabilistic Crowd GAN, exten…

    Submitted 12 July, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters (RA-L). Copyright may be transferred without notice, after which this version may no longer be accessible

  45. arXiv:2005.08549  [pdf, other

    stat.ME

    A Bayesian Multi-Layered Record Linkage Procedure to Analyze Functional Status of Medicare Patients with Traumatic Brain Injury

    Authors: Mingyang Shan, Kali Thomas, Roee Gutman

    Abstract: Understanding the association between injury severity and patients' potential for recovery is crucial to providing better care for patients with traumatic brain injury (TBI). Estimation of this relationship requires clinical information on injury severity, patient demographics, and healthcare utilization, which are often obtained from separate data sources. Because of privacy and confidentiality r…

    Submitted 18 May, 2020; originally announced May 2020.

  46. arXiv:2004.11724  [pdf, other

    cs.MM cs.SD eess.AS eess.IV

    Using Cell Phone Pictures of Sheet Music To Retrieve MIDI Passages

    Authors: TJ Tsai, Daniel Yang, Mengyi Shan, Thitaree Tanprasert, Teerapat Jenrungrot

    Abstract: This article investigates a cross-modal retrieval problem in which a user would like to retrieve a passage of music from a MIDI file by taking a cell phone picture of several lines of sheet music. This problem is challenging for two reasons: it has a significant runtime constraint since it is a user-facing application, and there is very little relevant training data containing cell phone images of…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: 13 pages, 8 figures, 3 tables. Accepted article in IEEE Transactions on Multimedia. arXiv admin note: text overlap with arXiv:2004.10347

  47. arXiv:2004.10347  [pdf, other

    cs.MM cs.SD eess.AS eess.IV

    MIDI Passage Retrieval Using Cell Phone Pictures of Sheet Music

    Authors: Daniel Yang, Thitaree Tanprasert, Teerapat Jenrungrot, Mengyi Shan, TJ Tsai

    Abstract: This paper investigates a cross-modal retrieval problem in which a user would like to retrieve a passage of music from a MIDI file by taking a cell phone picture of a physical page of sheet music. While audio-sheet music retrieval has been explored by a number of works, this scenario is novel in that the query is a cell phone picture rather than a digital scan. To solve this problem, we introduce…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: 8 pages, 8 figures, 1 table. Accepted paper at the International Society for Music Information Retrieval Conference (ISMIR) 2019

  48. arXiv:2003.07515  [pdf, ps, other

    math.AP

    Resonant Decompositions and Global Well-posedness for 2D Zakharov-Kuznetsov Equation in Sobolev spaces of Negative Indices

    Authors: Minjie Shan, Baoxiang Wang, Liqun Zhang

    Abstract: The Cauchy problem for the Zakharov-Kuznetsov equation on $\mathbb{R}^2$ is shown to be globally well-posed for initial data in $H^{s}$ provided $s>-\frac{1}{13}$. As conservation laws are invalid in Sobolev spaces below $L^2$, we construct an almost conserved quantity using a multilinear correction term following the $I$-method introduced by Colliander, Keel, Staffilani, Takaoka and Tao. In contrast…

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 50 pages

    MSC Class: 35Q55; 35A01

  49. arXiv:2003.03954  [pdf, other

    cs.RO

    Probabilistic Egocentric Motion Correction of Lidar Point Cloud and Projection to Camera Images for Moving Platforms

    Authors: Mao Shan, Julie Stephany Berrio, Stewart Worrall, Eduardo Nebot

    Abstract: The fusion of sensor data from heterogeneous sensors is crucial for robust perception in various robotics applications that involve moving platforms, for instance, autonomous vehicle navigation. In particular, combining camera and lidar sensors enables the projection of precise range information of the surrounding environment onto visual images. It also makes it possible to label each lidar point…

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: 8 pages, 9 figures, submitted to ITSC 2020 for review

    ACM Class: I.4.0

  50. arXiv:2003.01871  [pdf, other

    cs.RO cs.CV

    Semantic sensor fusion: from camera to sparse lidar information

    Authors: Julie Stephany Berrio, Mao Shan, Stewart Worrall, James Ward, Eduardo Nebot

    Abstract: To navigate through urban roads, an automated vehicle must be able to perceive and recognize objects in a three-dimensional environment. A high-level contextual understanding of the surroundings is necessary to plan and execute accurate driving maneuvers. This paper presents an approach to fuse different sensory information, Light Detection and Ranging (lidar) scans and camera images. The output o…

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 8 pages, this paper was submitted to ITSC 2020

    MSC Class: 00-02 ACM Class: I.4