Showing 1–50 of 110 results for author: Qiu, Q

Searching in archive cs.
  1. Multi-agent Cooperative Games Using Belief Map Assisted Training

    Authors: Qinwei Huang, Chen Luo, Alex B. Wu, Simon Khan, Hai Li, Qinru Qiu

    Abstract: In a multi-agent system, agents share their local observations to gain global situational awareness for decision making and collaboration using a message passing system. When to send a message, how to encode a message, and how to leverage the received messages directly affect the effectiveness of the collaboration among agents. When training a multi-agent cooperative game using reinforcement learn…

    Submitted 27 June, 2024; originally announced June 2024.

    Journal ref: ECAI 2023. IOS Press, 2023: 1617-1624

  2. arXiv:2406.18569  [pdf, other]

    cs.CV cs.AI

    FLOW: Fusing and Shuffling Global and Local Views for Cross-User Human Activity Recognition with IMUs

    Authors: Qi Qiu, Tao Zhu, Furong Duan, Kevin I-Kai Wang, Liming Chen, Mingxing Nie

    Abstract: Inertial Measurement Unit (IMU) sensors are widely employed for Human Activity Recognition (HAR) due to their portability, energy efficiency, and growing research interest. However, a significant challenge for IMU-HAR models is achieving robust generalization performance across diverse users. This limitation stems from substantial variations in data distribution among individual users. One primary…

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2406.13897  [pdf, other]

    cs.CV

    CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

    Authors: Longwen Zhang, Ziyu Wang, Qixuan Zhang, Qiwei Qiu, Anqi Pang, Haoran Jiang, Wei Yang, Lan Xu, Jingyi Yu

    Abstract: In the realm of digital creativity, our potential to craft intricate 3D worlds from imagination is often hampered by the limitations of existing digital tools, which demand extensive expertise and efforts. To narrow this disparity, we introduce CLAY, a 3D geometry and material generator designed to effortlessly transform human imagination into intricate 3D digital structures. CLAY supports classic…

    Submitted 30 May, 2024; originally announced June 2024.

    Comments: Project page: https://sites.google.com/view/clay-3dlm Video: https://youtu.be/YcKFp4U2Voo

  4. arXiv:2406.13568  [pdf, other]

    cs.AI

    Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks

    Authors: Yuhao Pan, Xiucheng Wang, Nan Cheng, Qi Qiu

    Abstract: With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environment. Spiking Neural Network (SNN), with their low energy consumption characteristics and performance…

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.02075  [pdf, other]

    cs.LG cs.NE

    ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU

    Authors: Qi Qiu, Tao Zhu, Helin Gong, Liming Chen, Huansheng Ning

    Abstract: Limited by the complexity of basis function (B-spline) calculations, Kolmogorov-Arnold Networks (KAN) suffer from restricted parallel computing capability on GPUs. This paper proposes a novel ReLU-KAN implementation that inherits the core idea of KAN. By adopting ReLU (Rectified Linear Unit) and point-wise multiplication, we simplify the design of KAN's basis function and optimize the computation…

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2404.00879  [pdf, other]

    cs.CV

    Model-Agnostic Human Preference Inversion in Diffusion Models

    Authors: Jeeyung Kim, Ze Wang, Qiang Qiu

    Abstract: Efficient text-to-image generation remains a challenging task due to the high computational costs associated with the multi-step sampling in diffusion models. Although distillation of pre-trained diffusion models has been successful in reducing sampling steps, low-step image generation often falls short in terms of quality. In this study, we propose a novel sampling design to achieve high-quality…

    Submitted 31 March, 2024; originally announced April 2024.

  7. arXiv:2403.19066  [pdf, other]

    cs.CV cs.AI

    Generative Quanta Color Imaging

    Authors: Vishal Purohit, Junjie Luo, Yiheng Chi, Qi Guo, Stanley H. Chan, Qiang Qiu

    Abstract: The astonishing development of single-photon cameras has created an unprecedented opportunity for scientific and industrial imaging. However, the high data throughput generated by these 1-bit sensors creates a significant bottleneck for low-power applications. In this paper, we explore the possibility of generating a color image from a single binary frame of a single-photon camera. We evidently fi…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  8. arXiv:2403.00269  [pdf, other]

    cs.CV cs.LG

    Large Convolutional Model Tuning via Filter Subspace

    Authors: Wei Chen, Zichen Miao, Qiang Qiu

    Abstract: Efficient fine-tuning methods are critical to address the high computational and parameter complexity while adapting large pre-trained models to downstream tasks. Our study is inspired by prior research that represents each convolution filter as a linear combination of a small set of filter subspace elements, referred to as filter atoms. In this paper, we propose to fine-tune pre-trained models by…

    Submitted 11 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

  9. arXiv:2402.11025  [pdf, other]

    cs.LG stat.ML

    Training Bayesian Neural Networks with Sparse Subspace Variational Inference

    Authors: Junbo Li, Zichen Miao, Qiang Qiu, Ruqi Zhang

    Abstract: Bayesian neural networks (BNNs) offer uncertainty quantification but come with the downside of substantially increased training and inference costs. Sparse BNNs have been investigated for efficient inference, typically by either slowly introducing sparsity throughout the training or by post-training compression of dense BNNs. The dilemma of how to cut down massive training costs remains, particula…

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: Published at International Conference on Learning Representations (ICLR) 2024

  10. arXiv:2402.08936  [pdf, other]

    cs.CV

    Predictive Temporal Attention on Event-based Video Stream for Energy-efficient Situation Awareness

    Authors: Yiming Bu, Jiayang Liu, Qinru Qiu

    Abstract: The Dynamic Vision Sensor (DVS) is an innovative technology that efficiently captures and encodes visual information in an event-driven manner. By combining it with event-driven neuromorphic processing, the sparsity in DVS camera output can result in high energy efficiency. However, similar to many embedded systems, the off-chip communication between the camera and processor presents a bottleneck…

    Submitted 13 February, 2024; originally announced February 2024.

  11. arXiv:2401.13076  [pdf, other]

    cs.RO cs.CV

    SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization

    Authors: Mingyang Li, Yue Ma, Qinru Qiu

    Abstract: Current techniques in Visual Simultaneous Localization and Mapping (VSLAM) estimate camera displacement by comparing image features of consecutive scenes. These algorithms depend on scene continuity, hence requires frequent camera inputs. However, processing images frequently can lead to significant memory usage and computation overhead. In this study, we introduce SemanticSLAM, an end-to-end visu…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 2023 IEEE Symposium Series on Computational Intelligence (SSCI) 6 pages

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence (SSCI)

  12. arXiv:2401.09258  [pdf, other]

    cs.RO cs.CV

    An Efficient Generalizable Framework for Visuomotor Policies via Control-aware Augmentation and Privilege-guided Distillation

    Authors: Yinuo Zhao, Kun Wu, Tianjiao Yi, Zhiyuan Xu, Xiaozhu Ju, Zhengping Che, Qinru Qiu, Chi Harold Liu, Jian Tang

    Abstract: Visuomotor policies, which learn control mechanisms directly from high-dimensional visual observations, confront challenges in adapting to new environments with intricate visual variations. Data augmentation emerges as a promising method for bridging these generalization gaps by enriching data variety. However, straightforwardly augmenting the entire observation shall impose excessive burdens on p…

    Submitted 17 January, 2024; originally announced January 2024.

  13. arXiv:2401.08957  [pdf, other]

    cs.RO cs.AI

    SWBT: Similarity Weighted Behavior Transformer with the Imperfect Demonstration for Robotic Manipulation

    Authors: Kun Wu, Ning Liu, Zhen Zhao, Di Qiu, Jinming Li, Zhengping Che, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Imitation learning (IL), aiming to learn optimal control policies from expert demonstrations, has been an effective method for robot manipulation tasks. However, previous IL methods either only use expensive expert demonstrations and omit imperfect demonstrations or rely on interacting with the environment and learning from online experiences. In the context of robotic manipulation, we aim to conq…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 8 pages, 5 figures

    ACM Class: I.2.9

  14. arXiv:2312.12721  [pdf, other]

    cs.CV

    Cross-Modal Reasoning with Event Correlation for Video Question Answering

    Authors: Chengxiang Yin, Zhengping Che, Kun Wu, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Video Question Answering (VideoQA) is a very attractive and challenging research direction aiming to understand complex semantics of heterogeneous data from two domains, i.e., the spatio-temporal video content and the word sequence in question. Although various attention mechanisms have been utilized to manage contextualized representations by modeling intra- and inter-modal relationships of the t…

    Submitted 19 December, 2023; originally announced December 2023.

  15. arXiv:2312.11933  [pdf, other]

    cs.LG cs.AI

    Dynamic Frequency Domain Graph Convolutional Network for Traffic Forecasting

    Authors: Yujie Li, Zezhi Shao, Yongjun Xu, Qiang Qiu, Zhaogang Cao, Fei Wang

    Abstract: Complex spatial dependencies in transportation networks make traffic prediction extremely challenging. Much existing work is devoted to learning dynamic graph structures among sensors, and the strategy of mining spatial dependencies from traffic data, known as data-driven, tends to be an intuitive and effective approach. However, Time-Shift of traffic patterns and noise induced by random factors h…

    Submitted 19 December, 2023; originally announced December 2023.

  16. arXiv:2311.00240  [pdf, other]

    cs.CR

    Intell-dragonfly: A Cybersecurity Attack Surface Generation Engine Based On Artificial Intelligence-generated Content Technology

    Authors: Xingchen Wu, Qin Qiu, Jiaqi Li, Yang Zhao

    Abstract: With the rapid development of the Internet, cyber security issues have become increasingly prominent. Traditional cyber security defense methods are limited in the face of ever-changing threats, so it is critical to seek innovative attack surface generation methods. This study proposes Intell-dragonfly, a cyber security attack surface generation engine based on artificial intelligence generation t…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: 17 pages, 7 figures, 2 tables

  17. arXiv:2310.13295  [pdf, other]

    cs.RO cs.AI

    PathRL: An End-to-End Path Generation Method for Collision Avoidance via Deep Reinforcement Learning

    Authors: Wenhao Yu, Jie Peng, Quecheng Qiu, Hanyu Wang, Lu Zhang, Jianmin Ji

    Abstract: Robot navigation using deep reinforcement learning (DRL) has shown great potential in improving the performance of mobile robots. Nevertheless, most existing DRL-based navigation methods primarily focus on training a policy that directly commands the robot with low-level controls, like linear and angular velocities, which leads to unstable speeds and unsmooth trajectories of the robot during the l…

    Submitted 20 October, 2023; originally announced October 2023.

  18. arXiv:2310.08370  [pdf, other]

    cs.CV

    UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

    Authors: Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang

    Abstract: In the context of autonomous driving, the significance of effective feature learning is widely acknowledged. While conventional 3D self-supervised pre-training methods have shown widespread success, most methods follow the ideas originally designed for 2D images. In this paper, we present UniPAD, a novel self-supervised learning paradigm applying 3D volumetric differentiable rendering. UniPAD impl…

    Submitted 7 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: CVPR2024

  19. arXiv:2309.13235  [pdf, other]

    cs.CV

    M$^3$CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders

    Authors: Qibo Qiu, Honghui Yang, Wenxiao Wang, Shun Zhang, Haiming Gao, Haochao Ying, Wei Hua, Xiaofei He

    Abstract: Masked point modeling has become a promising scheme of self-supervised pre-training for point clouds. Existing methods reconstruct either the original points or related features as the objective of pre-training. However, considering the diversity of downstream tasks, it is necessary for the model to have both low- and high-level representation modeling capabilities to capture geometric details and…

    Submitted 22 September, 2023; originally announced September 2023.

  20. arXiv:2308.04118  [pdf, other]

    cs.CV cs.MM

    Multimodal Color Recommendation in Vector Graphic Documents

    Authors: Qianru Qiu, Xueting Wang, Mayu Otani

    Abstract: Color selection plays a critical role in graphic document design and requires sufficient consideration of various contexts. However, recommending appropriate colors which harmonize with the other colors and textual contexts in documents is a challenging task, even for experienced designers. In this study, we propose a multimodal masked color model that integrates both color and textual contexts to…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023

  21. arXiv:2308.00287  [pdf, other]

    cs.CV cs.LG

    A Study of Unsupervised Evaluation Metrics for Practical and Automatic Domain Adaptation

    Authors: Minghao Chen, Zepeng Gao, Shuai Zhao, Qibo Qiu, Wenxiao Wang, Binbin Lin, Xiaofei He

    Abstract: Unsupervised domain adaptation (UDA) methods facilitate the transfer of models to target domains without labels. However, these methods necessitate a labeled target validation set for hyper-parameter tuning and model selection. In this paper, we aim to find an evaluation metric capable of assessing the quality of a transferred model without access to target validation labels. We begin with the met…

    Submitted 18 September, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  22. arXiv:2307.11314  [pdf]

    cs.NE cs.LG

    Neuromorphic Online Learning for Spatiotemporal Patterns with a Forward-only Timeline

    Authors: Zhenhang Zhang, Jingang Jin, Haowen Fang, Qinru Qiu

    Abstract: Spiking neural networks (SNNs) are bio-plausible computing models with high energy efficiency. The temporal dynamics of neurons and synapses enable them to detect temporal patterns and generate sequences. While Backpropagation Through Time (BPTT) is traditionally used to train SNNs, it is not suitable for online learning of embedded applications due to its high computation and memory cost as well…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 9 pages,8 figures

  23. arXiv:2306.01205  [pdf, other]

    cs.CV

    SelFLoc: Selective Feature Fusion for Large-scale Point Cloud-based Place Recognition

    Authors: Qibo Qiu, Haiming Gao, Wenxiao Wang, Zhiyi Su, Tian Xie, Wei Hua, Xiaofei He

    Abstract: Point cloud-based place recognition is crucial for mobile robots and autonomous vehicles, especially when the global positioning sensor is not accessible. LiDAR points are scattered on the surface of objects and buildings, which have strong shape priors along different axes. To enhance message passing along particular axes, Stacked Asymmetric Convolution Block (SACB) is designed, which is one of t…

    Submitted 5 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  24. arXiv:2304.04820  [pdf, other]

    cs.CV

    Binary Latent Diffusion

    Authors: Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu

    Abstract: In this paper, we show that a binary latent space can be explored for compact yet expressive image representations. We model the bi-directional mappings between an image and the corresponding latent binary representation by training an auto-encoder with a Bernoulli encoding distribution. On the one hand, the binary latent space provides a compact discrete image representation of which the distribu…

    Submitted 10 April, 2023; originally announced April 2023.

  25. arXiv:2304.03117  [pdf, other]

    cs.GR

    DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance

    Authors: Longwen Zhang, Qiwei Qiu, Hongyang Lin, Qixuan Zhang, Cheng Shi, Wei Yang, Ye Shi, Sibei Yang, Lan Xu, Jingyi Yu

    Abstract: Emerging Metaverse applications demand accessible, accurate, and easy-to-use tools for 3D digital human creations in order to depict different cultures and societies as if in the physical world. Recent large-scale vision-language advances pave the way to for novices to conveniently customize 3D content. However, the generated CG-friendly assets still cannot represent the desired facial traits for…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Go to DreamFace project page https://sites.google.com/view/dreamface watch our video at https://youtu.be/yCuvzgGMvPM and experience DreamFace online at https://hyperhuman.top

  26. arXiv:2303.12354  [pdf, other]

    cs.RO cs.LG

    Deep Reinforcement Learning for Localizability-Enhanced Navigation in Dynamic Human Environments

    Authors: Yuan Chen, Quecheng Qiu, Xiangyu Liu, Guangda Chen, Shunyi Yao, Jie Peng, Jianmin Ji, Yanyong Zhang

    Abstract: Reliable localization is crucial for autonomous robots to navigate efficiently and safely. Some navigation methods can plan paths with high localizability (which describes the capability of acquiring reliable localization). By following these paths, the robot can access the sensor streams that facilitate more accurate location estimation results by the localization algorithms. However, most of the…

    Submitted 22 March, 2023; originally announced March 2023.

  27. arXiv:2303.06908  [pdf, other]

    cs.CV

    CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention

    Authors: Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu

    Abstract: While features of different scales are perceptually important to visual inputs, existing vision transformers do not yet take advantage of them explicitly. To this end, we first propose a cross-scale vision transformer, CrossFormer. It introduces a cross-scale embedding layer (CEL) and a long-short distance attention (LSDA). On the one hand, CEL blends each token with multiple patches of different…

    Submitted 30 November, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 16 pages, 7 figures

  28. arXiv:2302.14290  [pdf, other]

    cs.LG cs.CV

    Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation

    Authors: Gaurav Patel, Konda Reddy Mopuri, Qiang Qiu

    Abstract: Data-free Knowledge Distillation (DFKD) has gained popularity recently, with the fundamental idea of carrying out knowledge transfer from a Teacher neural network to a Student neural network in the absence of training data. However, in the Adversarial DFKD framework, the student network's accuracy, suffers due to the non-stationary distribution of the pseudo-samples under multiple generator update…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted at CVPR 2023

  29. arXiv:2302.01384  [pdf, other]

    cs.CV

    Energy-Inspired Self-Supervised Pretraining for Vision Models

    Authors: Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu

    Abstract: Motivated by the fact that forward and backward passes of a deep network naturally form symmetric mappings between input and output representations, we introduce a simple yet effective self-supervised vision model pretraining framework inspired by energy-based models (EBMs). In the proposed framework, we model energy estimation and data restoration as the forward and backward passes of a single ne…

    Submitted 2 February, 2023; originally announced February 2023.

    Journal ref: ICLR 2023

  30. arXiv:2209.10820  [pdf, other]

    cs.CV

    Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation

    Authors: Qianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki

    Abstract: Vector graphic documents present multiple visual elements, such as images, shapes, and texts. Choosing appropriate colors for multiple visual elements is a difficult but crucial task for both amateurs and professional designers. Instead of creating a single color palette for all elements, we extract multiple color palettes from each visual element in a graphic document, and then combine them into…

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted to WACV 2023

  31. arXiv:2209.06994  [pdf, ps, other]

    cs.CV

    PriorLane: A Prior Knowledge Enhanced Lane Detection Approach Based on Transformer

    Authors: Qibo Qiu, Haiming Gao, Wei Hua, Gang Huang, Xiaofei He

    Abstract: Lane detection is one of the fundamental modules in self-driving. In this paper we employ a transformer-only method for lane detection, thus it could benefit from the blooming development of fully vision transformer and achieve the state-of-the-art (SOTA) performance on both CULane and TuSimple benchmarks, by fine-tuning the weight fully pre-trained on large datasets. More importantly, this paper…

    Submitted 7 February, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted by ICRA 2023

  32. arXiv:2209.00641  [pdf, other]

    cs.CV

    Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition

    Authors: Gaurav Patel, Jan Allebach, Qiang Qiu

    Abstract: This paper looks at semi-supervised learning (SSL) for image-based text recognition. One of the most popular SSL approaches is pseudo-labeling (PL). PL approaches assign labels to unlabeled data before re-training the model with a combination of labeled and pseudo-labeled data. However, PL methods are severely degraded by noise and are prone to over-fitting to noisy labels, due to the inclusion of…

    Submitted 6 October, 2022; v1 submitted 30 August, 2022; originally announced September 2022.

    Comments: Accepted at WACV 2023

  33. arXiv:2206.06635  [pdf, ps, other]

    cs.RO

    CVR-LSE: Compact Vectorization Representation of Local Static Environments for Unmanned Ground Vehicles

    Authors: Haiming Gao, Qibo Qiu, Wei Hua, Xuebo Zhang, Zhengyong Han, Shun Zhang

    Abstract: According to the requirement of general static obstacle detection, this paper proposes a compact vectorization representation approach of local static environments for unmanned ground vehicles. At first, by fusing the data of LiDAR and IMU, high-frequency pose information is obtained. Then, through the two-dimensional (2D) obstacle points generation, the process of grid map maintenance with a fixe…

    Submitted 14 June, 2022; originally announced June 2022.

  34. GraphAD: A Graph Neural Network for Entity-Wise Multivariate Time-Series Anomaly Detection

    Authors: Xu Chen, Qiu Qiu, Changshan Li, Kunqing Xie

    Abstract: In recent years, the emergence and development of third-party platforms have greatly facilitated the growth of the Online to Offline (O2O) business. However, the large amount of transaction data raises new challenges for retailers, especially anomaly detection in operating conditions. Thus, platforms begin to develop intelligent business assistants with embedded anomaly detection methods to reduce…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: SIGIR'22 Short Paper

  35. arXiv:2203.16154  [pdf, other]

    cs.RO

    Learning to Socially Navigate in Pedestrian-rich Environments with Interaction Capacity

    Authors: Quecheng Qiu, Shunyi Yao, Jing Wang, Jun Ma, Guangda Chen, Jianmin Ji

    Abstract: Existing navigation policies for autonomous robots tend to focus on collision avoidance while ignoring human-robot interactions in social life. For instance, robots can pass along the corridor safer and easier if pedestrians notice them. Sounds have been considered as an efficient way to attract the attention of pedestrians, which can alleviate the freezing robot problem. In this work, we present…

    Submitted 30 March, 2022; originally announced March 2022.

  36. Artemis: Articulated Neural Pets with Appearance and Motion synthesis

    Authors: Haimin Luo, Teng Xu, Yuheng Jiang, Chenglin Zhou, Qiwei Qiu, Yingliang Zhang, Wei Yang, Lan Xu, Jingyi Yu

    Abstract: We, humans, are entering into a virtual era and indeed want to bring animals to the virtual world as well for companion. Yet, computer-generated (CGI) furry animals are limited by tedious off-line rendering, let alone interactive motion control. In this paper, we present ARTEMIS, a novel neural modeling and rendering pipeline for generating ARTiculated neural pets with appEarance and Motion synthe…

    Submitted 17 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: Accepted to ACM SIGGRAPH 2022 (Journal track)

  37. arXiv:2202.00280  [pdf, other]

    cs.LG stat.ML

    Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?

    Authors: Sheikh Shams Azam, Seyyedali Hosseinalipour, Qiang Qiu, Christopher Brinton

    Abstract: In this paper, we question the rationale behind propagating large numbers of parameters through a distributed system during federated learning. We start by examining the rank characteristics of the subspace spanned by gradients across epochs (i.e., the gradient-space) in centralized model training, and observe that this gradient-space often consists of a few leading principal components accounting…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: In Proceedings of the 10th International Conference on Learning Representations (ICLR) 2022

  38. arXiv:2111.03628  [pdf, other]

    cs.AI stat.ML

    Exploiting a Zoo of Checkpoints for Unseen Tasks

    Authors: Jiaji Huang, Qiang Qiu, Kenneth Church

    Abstract: There are so many models in the literature that it is difficult for practitioners to decide which combinations are likely to be effective for a new task. This paper attempts to address this question by capturing relationships among checkpoints published on the web. We model the space of tasks as a Gaussian process. The covariance can be estimated from checkpoints and unlabeled probing data. With t…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: Accepted in Neurips 2021

  39. arXiv:2110.10108  [pdf, other]

    cs.LG

    TESSERACT: Gradient Flip Score to Secure Federated Learning Against Model Poisoning Attacks

    Authors: Atul Sharma, Wei Chen, Joshua Zhao, Qiang Qiu, Somali Chaterji, Saurabh Bagchi

    Abstract: Federated learning---multi-party, distributed learning in a decentralized environment---is vulnerable to model poisoning attacks, even more so than centralized learning approaches. This is because malicious clients can collude and send in carefully tailored model updates to make the global model inaccurate. This motivated the development of Byzantine-resilient federated learning algorithms, such a…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 12 pages

  40. arXiv:2109.02541  [pdf, other]

    cs.RO

    Crowd-Aware Robot Navigation for Pedestrians with Multiple Collision Avoidance Strategies via Map-based Deep Reinforcement Learning

    Authors: Shunyi Yao, Guangda Chen, Quecheng Qiu, Jun Ma, Xiaoping Chen, Jianmin Ji

    Abstract: It is challenging for a mobile robot to navigate through human crowds. Existing approaches usually assume that pedestrians follow a predefined collision avoidance strategy, like social force model (SFM) or optimal reciprocal collision avoidance (ORCA). However, their performances commonly need to be further improved for practical applications, where pedestrians follow multiple different collision…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 7 page

  41. arXiv:2108.07895  [pdf, other]

    cs.CV

    Adaptive Convolutions with Per-pixel Dynamic Filter Atom

    Authors: Ze Wang, Zichen Miao, Jun Hu, Qiang Qiu

    Abstract: Applying feature dependent network weights have been proved to be effective in many fields. However, in practice, restricted by the enormous size of model parameters and memory footprints, scalable and versatile dynamic convolutions with per-pixel adapted filters are yet to be fully explored. In this paper, we address this challenge by decomposing filters, adapted to each spatial position, over dy…

    Submitted 17 August, 2021; originally announced August 2021.

  42. arXiv:2105.03649  [pdf, other]

    cs.NE cs.DC cs.ET

    In-Hardware Learning of Multilayer Spiking Neural Networks on a Neuromorphic Processor

    Authors: Amar Shrestha, Haowen Fang, Daniel Patrick Rider, Zaidao Mei, Qinru Qiu

    Abstract: Although widely used in machine learning, backpropagation cannot directly be applied to SNN training and is not feasible on a neuromorphic processor that emulates biological neuron and synapses. This work presents a spike-based backpropagation algorithm with biological plausible local update rules and adapts it to fit the constraint in a neuromorphic hardware. The algorithm is implemented on Intel…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: 6 pages, 5 figures, accepted for Design Automation Conference (DAC) 2021

  43. arXiv:2104.10712  [pdf, other]

    cs.AR cs.NE

    Neuromorphic Algorithm-hardware Codesign for Temporal Pattern Learning

    Authors: Haowen Fang, Brady Taylor, Ziru Li, Zaidao Mei, Hai Li, Qinru Qiu

    Abstract: Neuromorphic computing and spiking neural networks (SNN) mimic the behavior of biological systems and have drawn interest for their potential to perform cognitive tasks with high energy efficiency. However, some factors such as temporal dynamics and spike timings prove critical for information processing but are often ignored by existing works, limiting the performance and applications of neuromor…

    Submitted 6 May, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  44. arXiv:2012.02938  [pdf, other]

    cs.CV

    Cirrus: A Long-range Bi-pattern LiDAR Dataset

    Authors: Ze Wang, Sihao Ding, Ying Li, Jonas Fenn, Sohini Roychowdhury, Andreas Wallin, Lane Martin, Scott Ryvola, Guillermo Sapiro, Qiang Qiu

    Abstract: In this paper, we introduce Cirrus, a new long-range bi-pattern LiDAR public dataset for autonomous driving tasks such as 3D object detection, critical to highway driving and timely decision making. Our platform is equipped with a high-resolution video camera and a pair of LiDAR sensors with a 250-meter effective range, which is significantly longer than existing public datasets. We record paired…

    Submitted 4 December, 2020; originally announced December 2020.

  45. arXiv:2011.09928  [pdf, other]

    cs.LG cs.AI cs.CV

    Using Text to Teach Image Retrieval

    Authors: Haoyu Dong, Ze Wang, Qiang Qiu, Guillermo Sapiro

    Abstract: Image retrieval relies heavily on the quality of the data modeling and the distance measurement in the feature space. Building on the concept of image manifold, we first propose to represent the feature space of images, learned via neural networks, as a graph. Neighborhoods in the feature space are now defined by the geodesic distance between images, represented as graph vertices or manifold sampl…

    Submitted 19 November, 2020; originally announced November 2020.

  46. arXiv:2011.01408  [pdf, other]

    cs.RO eess.SY

    Hybrid Visual Servoing Tracking Control of Uncalibrated Robotic Systems for Dynamic Dwarf Culture Orchards Harvest

    Authors: Tao Li, Quan Qiu, Chunjiang Zhao

    Abstract: The paper is concerned with the dynamic tracking problem of SNAP orchards harvesting robots in the presence of multiple uncalibrated model parameters in the application of dwarf culture orchards harvest. A new hybrid visual servoing adaptive tracking controller and three adaptive laws are proposed to guarantee harvesting robots to finish the dynamic harvesting task and the adaption to unknown para…

    Submitted 10 June, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: 6 pages, 15 figures, accepted by IEEE ICDL2021

    MSC Class: 70B15; 70E60; 70Q05; 93C85

  47. arXiv:2011.01022   

    cs.RO cs.CV

    Depth Ranging Performance Evaluation and Improvement for RGB-D Cameras on Field-Based High-Throughput Phenotyping Robots

    Authors: Zhengqiang Fan, Na Sun, Quan Qiu, Chunjiang Zhao

    Abstract: RGB-D cameras have been successfully used for indoor High-ThroughpuT Phenotyping (HTTP). However, their capability and feasibility for in-field HTTP still need to be evaluated, due to the noise and disturbances generated by unstable illumination, specular reflection, and diffuse reflection, etc. To solve these problems, we evaluated the depth-ranging performances of two consumer-level RGB-D camera…

    Submitted 27 April, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: We want to improve the work of this paper before publishing it publicly

  48. arXiv:2010.10341  [pdf, other]

    cs.LG

    Learning to Learn Variational Semantic Memory

    Authors: Xiantong Zhen, Yingjun Du, Huan Xiong, Qiang Qiu, Cees G. M. Snoek, Ling Shao

    Abstract: In this paper, we introduce variational semantic memory into meta-learning to acquire long-term knowledge for few-shot learning. The variational semantic memory accrues and stores semantic information for the probabilistic inference of class prototypes in a hierarchical Bayesian framework. The semantic memory is grown from scratch and gradually consolidated by absorbing information from tasks it e…

    Submitted 14 July, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: accepted to NeurIPS 2020; code is available at https://github.com/YDU-uva/VSM

  49. arXiv:2009.02386  [pdf, other]

    cs.CV cs.LG

    ACDC: Weight Sharing in Atom-Coefficient Decomposed Convolution

    Authors: Ze Wang, Xiuyuan Cheng, Guillermo Sapiro, Qiang Qiu

    Abstract: Convolutional Neural Networks (CNNs) are known to be significantly over-parametrized, and difficult to interpret, train and adapt. In this paper, we introduce a structural regularization across convolutional kernels in a CNN. In our approach, each convolution kernel is first decomposed as 2D dictionary atoms linearly combined by coefficients. The widely observed correlation and redundancy in a CNN…

    Submitted 4 September, 2020; originally announced September 2020.

  50. arXiv:2008.01818  [pdf, other]

    stat.ML cs.CV cs.LG

    Graph Convolution with Low-rank Learnable Local Filters

    Authors: Xiuyuan Cheng, Zichen Miao, Qiang Qiu

    Abstract: Geometric variations like rotation, scaling, and viewpoint changes pose a significant challenge to visual understanding. One common solution is to directly model certain intrinsic structures, e.g., using landmarks. However, it then becomes non-trivial to build effective deep models, especially when the underlying non-Euclidean grid is irregular and coarse. Recent deep models using graph convolutio…

    Submitted 11 October, 2020; v1 submitted 4 August, 2020; originally announced August 2020.