Skip to main content

Showing 1–50 of 88 results for author: Fang, B

Searching in archive cs. Search in all archives.
.
  1. When Vision Meets Touch: A Contemporary Review for Visuotactile Sensors from the Signal Processing Perspective

    Authors: Shoujie Li, Zihan Wang, Changsheng Wu, Xiang Li, Shan Luo, Bin Fang, Fuchun Sun, Xiao-** Zhang, Wenbo Ding

    Abstract: Tactile sensors, which provide information about the physical properties of objects, are an essential component of robotic systems. The visuotactile sensing technology with the merits of high resolution and low cost has facilitated the development of robotics from environment exploration to dexterous operation. Over the years, several reviews on visuotactile sensors for robots have been presented,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Journal of Selected Topics in Signal Processing

  2. arXiv:2406.03813  [pdf, other

    cs.RO

    Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation

    Authors: Ning Cheng, Changhao Guan, **g Gao, Weihao Wang, You Li, Fandong Meng, Jie Zhou, Bin Fang, **an Xu, Wenjuan Han

    Abstract: Touch holds a pivotal position in enhancing the perceptual and interactive capabilities of both humans and robots. Despite its significance, current tactile research mainly focuses on visual and tactile modalities, overlooking the language domain. Inspired by this, we construct Touch100k, a paired touch-language-vision dataset at the scale of 100k, featuring tactile sensation descriptions in multi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.12779  [pdf

    cs.LG cs.AI

    Transformer in Touch: A Survey

    Authors: **g Gao, Ning Cheng, Bin Fang, Wenjuan Han

    Abstract: The Transformer model, initially achieving significant success in the field of natural language processing, has recently shown great potential in the application of tactile perception. This review aims to comprehensively outline the application and development of Transformers in tactile technology. We first introduce the two fundamental concepts behind the success of the Transformer: the self-atte… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 27 pages, 2 tables, 5 figures, accepted by ICIC 2024

  4. arXiv:2405.10063  [pdf, other

    cs.NI

    Low-latency Symbol-Synchronous Communication for Multi-hop Sensor Networks

    Authors: Xinlei Liu, Andrey Belogaev, Jonathan Oostvogels, Bingwu Fang, Danny Hughes, Maarten Weyn, Jeroen Famaey

    Abstract: Wireless sensor networks (WSNs) have received great interest due to their scalability, energy efficiency, and low-cost deployment. By utilizing multi-hop communication, WSNs can cover a wide area using low transmission power without the need for any communication infrastructure. Traditionally, WSNs rely on store-and-forward routing protocols and Time Division Multiple Access (TDMA)-based schedules… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: EuCNC 2024

  5. arXiv:2405.07237  [pdf, other

    cs.RO

    Soft Contact Simulation and Manipulation Learning of Deformable Objects with Vision-based Tactile Sensor

    Authors: Jianhua Shan, Yuhao Sun, Shixin Zhang, Fuchun Sun, Zixi Chen, Zirong Shen, Cesare Stefanini, Yiyong Yang, Shan Luo, Bin Fang

    Abstract: Deformable object manipulation is a classical and challenging research area in robotics. Compared with rigid object manipulation, this problem is more complex due to the deformation properties including elastic, plastic, and elastoplastic deformation. In this paper, we describe a new deformable object manipulation method including soft contact simulation, manipulation learning, and sim-to-real tra… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  6. arXiv:2405.02914  [pdf, other

    cs.RO

    Simulation of Optical Tactile Sensors Supporting Slip and Rotation using Path Tracing and IMPM

    Authors: Zirong Shen, Yuhao Sun, Shixin Zhang, Zixi Chen, Heyi Sun, Fuchun Sun, Bin Fang

    Abstract: Optical tactile sensors are extensively utilized in intelligent robot manipulation due to their ability to acquire high-resolution tactile information at a lower cost. However, achieving adequate reality and versatility in simulating optical tactile sensors is challenging. In this paper, we propose a simulation method and validate its effectiveness through experiments. We utilize path tracing for… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  7. arXiv:2404.18201  [pdf, other

    cs.RO

    What Foundation Models can Bring for Robot Learning in Manipulation : A Survey

    Authors: Dingzhe Li, Yixiang **, Yong A, Hongze Yu, Jun Shi, Xiaoshuai Hao, Peng Hao, Hua** Liu, Fuchun Sun, Bin Fang

    Abstract: The realization of universal robots is an ultimate goal of researchers. However, a key hurdle in achieving this goal lies in the robots' ability to manipulate objects in their unstructured surrounding environments according to different tasks. The learning-based approach is considered an effective way to address generalization. The impressive performance of foundation models in the fields of compu… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  8. arXiv:2404.11458  [pdf, other

    cs.AI

    Learn to Tour: Operator Design For Solution Feasibility Map** in Pickup-and-delivery Traveling Salesman Problem

    Authors: Bowen Fang, Xu Chen, Xuan Di

    Abstract: This paper aims to develop a learning method for a special class of traveling salesman problems (TSP), namely, the pickup-and-delivery TSP (PDTSP), which finds the shortest tour along a sequence of one-to-one pickup-and-delivery nodes. One-to-one here means that the transported people or goods are associated with designated pairs of pickup and delivery nodes, in contrast to that indistinguishable… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  9. arXiv:2403.09813  [pdf, other

    cs.CV cs.RO

    Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset

    Authors: Ning Cheng, You Li, **g Gao, Bin Fang, **an Xu, Wenjuan Han

    Abstract: Tactility provides crucial support and enhancement for the perception and interaction capabilities of both humans and robots. Nevertheless, the multimodal research related to touch primarily focuses on visual and tactile modalities, with limited exploration in the domain of language. Beyond vocabulary, sentence-level descriptions contain richer semantics. Based on this, we construct a touch-langua… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by ICIC 2024

  10. arXiv:2403.09284  [pdf, other

    cs.LG cs.DC

    DA-PFL: Dynamic Affinity Aggregation for Personalized Federated Learning

    Authors: Xu Yang, Jiyuan Feng, Songyue Guo, Ye Wang, Ye Ding, Binxing Fang, Qing Liao

    Abstract: Personalized federated learning becomes a hot research topic that can learn a personalized learning model for each client. Existing personalized federated learning models prefer to aggregate similar clients with similar data distribution to improve the performance of learning models. However, similaritybased personalized federated learning methods may exacerbate the class imbalanced problem. In th… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  11. arXiv:2403.00232  [pdf, other

    cs.AR

    FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators

    Authors: Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan

    Abstract: NVIDIA Tensor Cores and AMD Matrix Cores (together called Matrix Accelerators) are of growing interest in high-performance computing and machine learning owing to their high performance. Unfortunately, their numerical behaviors are not publicly documented, including the number of extra precision bits maintained, the accumulation order of addition, and predictable subnormal number handling during c… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  12. arXiv:2402.12043  [pdf, other

    cs.CV

    A Lightweight Parallel Framework for Blind Image Quality Assessment

    Authors: Qunyue Huang, Bin Fang

    Abstract: Existing blind image quality assessment (BIQA) methods focus on designing complicated networks based on convolutional neural networks (CNNs) or transformer. In addition, some BIQA methods enhance the performance of the model in a two-stage training manner. Despite the significant advancements, these methods remarkably raise the parameter count of the model, thus requiring more training time and co… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  13. arXiv:2312.02697  [pdf, other

    cs.RO

    Hierarchical Visual Policy Learning for Long-Horizon Robot Manipulation in Densely Cluttered Scenes

    Authors: Hecheng Wang, Lizhe Qi, Bin Fang, Yunquan Sun

    Abstract: In this work, we focus on addressing the long-horizon manipulation tasks in densely cluttered scenes. Such tasks require policies to effectively manage severe occlusions among objects and continually produce actions based on visual observations. We propose a vision-based Hierarchical policy for Cluttered-scene Long-horizon Manipulation (HCLM). It employs a high-level policy and three options to se… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  14. arXiv:2312.01421  [pdf, other

    cs.RO

    RobotGPT: Robot Manipulation Learning from ChatGPT

    Authors: Yixiang **, Dingzhe Li, Yong A, Jun Shi, Peng Hao, Fuchun Sun, Jianwei Zhang, Bin Fang

    Abstract: We present RobotGPT, an innovative decision framework for robotic manipulation that prioritizes stability and safety. The execution code generated by ChatGPT cannot guarantee the stability and safety of the system. ChatGPT may provide different answers for the same task, leading to unpredictability. This instability prevents the direct integration of ChatGPT into the robot manipulation loop. Altho… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  15. arXiv:2311.05782  [pdf, other

    cs.DC

    MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications

    Authors: Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant Nair, Kevin Barker, Ang Li

    Abstract: Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet such demand, one of the critical features of machine-learning-specific accelerators such as NVIDIA Tensor Cores, AMD Matrix Cores, and Google TPUs is the support of mixed-precision enabled GEMM. For DNN models, lower-precision FP data formats and computation offer acceptable correctness but significan… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  16. arXiv:2310.12290  [pdf, other

    cs.AI

    Fact-based Agent modeling for Multi-Agent Reinforcement Learning

    Authors: Baofu Fang, Caiming Zheng, Hao Wang

    Abstract: In multi-agent systems, agents need to interact and collaborate with other agents in environments. Agent modeling is crucial to facilitate agent interactions and make adaptive cooperation strategies. However, it is challenging for agents to model the beliefs, behaviors, and intentions of other agents in non-stationary environment where all agent policies are learned simultaneously. In addition, th… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  17. arXiv:2309.16979  [pdf, other

    quant-ph cs.ET

    MEMQSim: Highly Memory-Efficient and Modularized Quantum State-Vector Simulation

    Authors: Boyuan Zhang, Bo Fang, Qiang Guan, Ang Li, Dingwen Tao

    Abstract: In this extended abstract, we have introduced a highly memory-efficient state vector simulation of quantum circuits premised on data compression, harnessing the capabilities of both CPUs and GPUs. We have elucidated the inherent challenges in architecting this system, while concurrently proposing our tailored solutions. Moreover, we have delineated our preliminary implementation and deliberated up… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  18. arXiv:2309.12994  [pdf, other

    cs.SE

    Smart Fuzzing of 5G Wireless Software Implementation

    Authors: Huan Wu, Brian Fang, Fei Xie

    Abstract: In this paper, we introduce a comprehensive approach to bolstering the security, reliability, and comprehensibility of OpenAirInterface5G (OAI5G), an open-source software framework for the exploration, development, and testing of 5G wireless communication systems. Firstly, we employ AFL++, a powerful fuzzing tool, to fuzzy-test OAI5G with respect to its configuration files rigorously. This extensi… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  19. arXiv:2309.07284  [pdf

    cs.CR

    Toward Lossless Homomorphic Encryption for Scientific Computation

    Authors: Muhammad Jahanzeb Khan, Bo Fang, Dongfang Zhao

    Abstract: This paper presents a comprehensive investigation into encrypted computations using the CKKS (Cheon-Kim-Kim-Song) scheme, with a focus on multi-dimensional vector operations and real-world applications. Through two meticulously designed experiments, the study explores the potential of the CKKS scheme in Super Computing and its implications for data privacy and computational efficiency. The first e… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  20. arXiv:2308.14034  [pdf, other

    cs.AI cs.CL

    Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

    Authors: Shen Gao, Zhengliang Shi, Minghang Zhu, Bowen Fang, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

    Abstract: Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extending the capability of LLMs. Although some works employ open-source LLMs for the tool learning task, most of them are trained in a controlled environment in which LLMs only learn to execute the human-provided tools. However, selecting proper tools from the large toolset is also a crucial ability… ▽ More

    Submitted 21 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI 2024

  21. arXiv:2307.09609  [pdf, other

    cs.DC

    AMRIC: A Novel In Situ Lossy Compression Framework for Efficient I/O in Adaptive Mesh Refinement Applications

    Authors: Daoce Wang, Jesus Pulido, Pascal Grosset, Jiannan Tian, Sian **, Houjun Tang, Jean Sexton, Sheng Di, Zarija Lukić, Kai Zhao, Bo Fang, Franck Cappello, James Ahrens, Dingwen Tao

    Abstract: As supercomputers advance towards exascale capabilities, computational intensity increases significantly, and the volume of data requiring storage and transmission experiences exponential growth. Adaptive Mesh Refinement (AMR) has emerged as an effective solution to address these two challenges. Concurrently, error-bounded lossy compression is recognized as one of the most efficient approaches to… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 12 pages, 18 figures, 3 tables, accepted by ACM/IEEE SC '23

  22. arXiv:2306.01123  [pdf, other

    cs.LG math.PR

    A Neural RDE-based model for solving path-dependent PDEs

    Authors: Bowen Fang, Hao Ni, Yue Wu

    Abstract: The concept of the path-dependent partial differential equation (PPDE) was first introduced in the context of path-dependent derivatives in financial markets. Its semilinear form was later identified as a non-Markovian backward stochastic differential equation (BSDE). Compared to the classical PDE, the solution of a PPDE involves an infinite-dimensional spatial variable, making it challenging to a… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    MSC Class: 68T07; 60L90; 60H30

  23. arXiv:2305.11191  [pdf, other

    cs.CR cs.CV

    Towards Generalizable Data Protection With Transferable Unlearnable Examples

    Authors: Bin Fang, Bo Li, Shuang Wu, Tianyi Zheng, Shouhong Ding, Ran Yi, Lizhuang Ma

    Abstract: Artificial Intelligence (AI) is making a profound impact in almost every domain. One of the crucial factors contributing to this success has been the access to an abundance of high-quality data for constructing machine learning models. Lately, as the role of data in artificial intelligence has been significantly magnified, concerns have arisen regarding the secure utilization of data, particularly… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.10691

  24. arXiv:2305.10691  [pdf, other

    cs.CR cs.CV

    Re-thinking Data Availablity Attacks Against Deep Neural Networks

    Authors: Bin Fang, Bo Li, Shuang Wu, Ran Yi, Shouhong Ding, Lizhuang Ma

    Abstract: The unauthorized use of personal data for commercial purposes and the clandestine acquisition of private data for training machine learning models continue to raise concerns. In response to these issues, researchers have proposed availability attacks that aim to render data unexploitable. However, many current attack methods are rendered ineffective by adversarial training. In this paper, we re-ex… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  25. arXiv:2303.09100  [pdf, other

    cs.CV cs.CL cs.LG

    Patch-Prompt Aligned Bayesian Prompt Tuning for Vision-Language Models

    Authors: Xinyang Liu, Dongsheng Wang, Bowei Fang, Miaoge Li, Zhibin Duan, Yishi Xu, Bo Chen, Mingyuan Zhou

    Abstract: For downstream applications of vision-language pre-trained models, there has been significant interest in constructing effective prompts. Existing works on prompt engineering, which either require laborious manual designs or optimize the prompt tuning as a point estimation problem, may fail to describe diverse characteristics of categories and limit their applications. We introduce a Bayesian prob… ▽ More

    Submitted 1 July, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted by UAI 2024

  26. Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment

    Authors: Yuhong Deng, Xiaofeng Guo, Yixuan Wei, Kai Lu, Bin Fang, Di Guo, Hua** Liu, Fuchun Sun

    Abstract: In this paper, a novel robotic gras** system is established to automatically pick up objects in cluttered scenes. A composite robotic hand composed of a suction cup and a gripper is designed for gras** the object stably. The suction cup is used for lifting the object from the clutter first and the gripper for gras** the object accordingly. We utilize the affordance map to provide pixel-wise… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: has been accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems 2019

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems 2019 (IROS 2019)

  27. Tacchi: A Pluggable and Low Computational Cost Elastomer Deformation Simulator for Optical Tactile Sensors

    Authors: Zixi Chen, Shixin Zhang, Shan Luo, Fuchun Sun, Bin Fang

    Abstract: Simulation is widely applied in robotics research to save time and resources. There have been several works to simulate optical tactile sensors that leverage either a smoothing method or Finite Element Method (FEM). However, elastomer deformation physics is not considered in the former method, whereas the latter requires a massive amount of computational resources like a computer cluster. In this… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures, accepted by IEEE Robotics and Automation Letters

  28. arXiv:2301.06309  [pdf, other

    cs.CV

    UATVR: Uncertainty-Adaptive Text-Video Retrieval

    Authors: Bo Fang, Wenhao Wu, Chang Liu, Yu Zhou, Yuxin Song, Wei** Wang, Xiangbo Shu, Xiangyang Ji, **gdong Wang

    Abstract: With the explosive growth of web videos and emerging large-scale vision-language pre-training models, e.g., CLIP, retrieving videos of interest with text instructions has attracted increasing attention. A common practice is to transfer text-video pairs to the same embedding space and craft cross-modal interactions with certain entities in specific granularities for semantic correspondence. Unfortu… ▽ More

    Submitted 18 August, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: To appear at ICCV2023

  29. arXiv:2301.00184  [pdf, other

    cs.CV

    Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

    Authors: Wenhao Wu, Haipeng Luo, Bo Fang, **gdong Wang, Wanli Ouyang

    Abstract: Most existing text-video retrieval methods focus on cross-modal matching between the visual content of videos and textual query sentences. However, in real-world scenarios, online videos are often accompanied by relevant text information such as titles, tags, and even subtitles, which can be utilized to match textual queries. This insight has motivated us to propose a novel approach to text-video… ▽ More

    Submitted 28 March, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: Accepted by CVPR 2023. Selected as a Highlight (Top 2.5% of ALL submissions)

  30. arXiv:2212.14162  [pdf, other

    cs.CV

    OrthoGAN:High-Precision Image Generation for Teeth Orthodontic Visualization

    Authors: Feihong Shen, JIng**g Liu, Haizhen Li, Bing Fang, Chenglong Ma, ** Hao, Yang Feng, Youyi Zheng

    Abstract: Patients take care of what their teeth will be like after the orthodontics. Orthodontists usually describe the expectation movement based on the original smile images, which is unconvincing. The growth of deep-learning generative models change this situation. It can visualize the outcome of orthodontic treatment and help patients foresee their future teeth and facial appearance. While previous stu… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  31. arXiv:2211.04454  [pdf, other

    cs.CL cs.LG

    SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

    Authors: Apurva Gandhi, Ryan Serrao, Biyi Fang, Gilbert Antonius, Jenna Hong, Tra My Nguyen, Sheng Yi, Ehi Nosakhare, Irene Shaffer, Soundararajan Srinivasan, Vivek Gupta

    Abstract: We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence s… ▽ More

    Submitted 17 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 as an Industry Track paper

  32. arXiv:2210.08288  [pdf, other

    cs.CV

    Transformer-based dimensionality reduction

    Authors: Ruisheng Ran, Tianyu Gao, Bin Fang

    Abstract: Recently, Transformer is much popular and plays an important role in the fields of Machine Learning (ML), Natural Language Processing (NLP), and Computer Vision (CV), etc. In this paper, based on the Vision Transformer (ViT) model, a new dimensionality reduction (DR) model is proposed, named Transformer-DR. From data visualization, image reconstruction and face recognition, the representation abil… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

  33. ImmFusion: Robust mmWave-RGB Fusion for 3D Human Body Reconstruction in All Weather Conditions

    Authors: Anjun Chen, Xiangyu Wang, Kun Shi, Shaohao Zhu, Bin Fang, Yingfeng Chen, Jiming Chen, Yuchi Huo, Qi Ye

    Abstract: 3D human reconstruction from RGB images achieves decent results in good weather conditions but degrades dramatically in rough weather. Complementary, mmWave radars have been employed to reconstruct 3D human joints and meshes in rough weather. However, combining RGB and mmWave signals for robust all-weather 3D human reconstruction is still an open challenge, given the sparse nature of mmWave and th… ▽ More

    Submitted 20 September, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA2023, Project Page: https://chen3110.github.io/ImmFusion/index.html

  34. arXiv:2205.06973  [pdf, other

    quant-ph cs.DC

    Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning

    Authors: Bo Fang, M. Yusuf Özkaya, Ang Li, Ümit V. Çatalyürek, Sriram Krishnamoorthy

    Abstract: Early but promising results in quantum computing have been enabled by the concurrent development of quantum algorithms, devices, and materials. Classical simulation of quantum programs has enabled the design and analysis of algorithms and implementation strategies targeting current and anticipated quantum device architectures. In this paper, we present a graph-based approach to achieve efficient q… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

  35. arXiv:2205.05413  [pdf, other

    cs.CR

    Compact and Efficient KEMs over NTRU Lattices

    Authors: Zhichuang Liang, Boyue Fang, Jieyu Zheng, Yunlei Zhao

    Abstract: The NTRU lattice is a promising candidate to construct practical cryptosystems, in particular key encapsulation mechanism (KEM), resistant to quantum computing attacks. Nevertheless, there are still some inherent obstacles to NTRU-based KEM schemes in having integrated performance, taking security, bandwidth, error probability, and computational efficiency \emph{as a whole}, that is as good as and… ▽ More

    Submitted 9 November, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  36. arXiv:2203.05784  [pdf

    eess.IV cs.AI cs.CV

    AI-enabled Automatic Multimodal Fusion of Cone-Beam CT and Intraoral Scans for Intelligent 3D Tooth-Bone Reconstruction and Clinical Applications

    Authors: ** Hao, Jiaxiang Liu, ** Li, Wei Pan, Ruizhe Chen, Huimin Xiong, Kaiwei Sun, Hangzheng Lin, Wanlu Liu, Wanghui Ding, Jianfei Yang, Haoji Hu, Yueling Zhang, Yang Feng, Zeyu Zhao, Huikai Wu, Youyi Zheng, Bing Fang, Zuozhu Liu, Zhihe Zhao

    Abstract: A critical step in virtual dental treatment planning is to accurately delineate all tooth-bone structures from CBCT with high fidelity and accurate anatomical information. Previous studies have established several methods for CBCT segmentation using deep learning. However, the inherent resolution discrepancy of CBCT and the loss of occlusal and dentition information largely limited its clinical ap… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 30 pages, 6 figures, 3 tables

  37. arXiv:2203.00762  [pdf, other

    cs.LG cs.CL cs.IR

    Topic Analysis for Text with Side Data

    Authors: Biyi Fang, Kripa Rajshekhar, Diego Klabjan

    Abstract: Although latent factor models (e.g., matrix factorization) obtain good performance in predictions, they suffer from several problems including cold-start, non-transparency, and suboptimal recommendations. In this paper, we employ text with side data to tackle these limitations. We introduce a hybrid generative probabilistic model that combines a neural network with a latent topic model, which is a… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  38. arXiv:2203.00761  [pdf, other

    cs.LG cs.CV

    Tricks and Plugins to GBM on Images and Sequences

    Authors: Biyi Fang, Jean Utke, Diego Klabjan

    Abstract: Convolutional neural networks (CNNs) and transformers, which are composed of multiple processing layers and blocks to learn the representations of data with multiple abstract levels, are the most successful machine learning models in recent years. However, millions of parameters and many blocks make them difficult to be trained, and sometimes several days or weeks are required to find an ideal arc… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  39. Hybrid Robotic Gras** with a Soft Multimodal Gripper and a Deep Multistage Learning Scheme

    Authors: Fukang Liu, Fuchun Sun, Bin Fang, Xiang Li, Songyu Sun, Hua** Liu

    Abstract: Gras** has long been considered an important and practical task in robotic manipulation. Yet achieving robust and efficient grasps of diverse objects is challenging, since it involves gripper design, perception, control and learning, etc. Recent learning-based approaches have shown excellent performance in gras** a variety of novel objects. However, these methods either are typically limited t… ▽ More

    Submitted 5 March, 2023; v1 submitted 25 February, 2022; originally announced February 2022.

  40. Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image Classification

    Authors: Xizhe Xue, Haokui Zhang, Bei Fang, Zongwen Bai, Ying Li

    Abstract: Hyperspectral image (HSI) classification has been a hot topic for decides, as hyperspectral images have rich spatial and spectral information and provide strong basis for distinguishing different land-cover objects. Benefiting from the development of deep learning technologies, deep learning based HSI classification methods have achieved promising performance. Recently, several neural architecture… ▽ More

    Submitted 6 April, 2023; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 15 pages, 10 figures

  41. arXiv:2108.05013  [pdf, other

    cs.RO cs.CV

    Elastic Tactile Simulation Towards Tactile-Visual Perception

    Authors: Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li

    Abstract: Tactile sensing plays an important role in robotic perception and manipulation tasks. To overcome the real-world limitations of data collection, simulating tactile response in a virtual environment comes as a desirable direction of robotic research. In this paper, we propose Elastic Interaction of Particles (EIP) for tactile simulation. Most existing works model the tactile sensor as a rigid multi… ▽ More

    Submitted 12 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: ACMMM 2021 (Oral). Code available at https://github.com/yikaiw/EIP. arXiv admin note: substantial text overlap with arXiv:2011.11528

  42. arXiv:2107.07467  [pdf, other

    cs.LG

    Only Train Once: A One-Shot Neural Network Training And Pruning Framework

    Authors: Tianyi Chen, Bo Ji, Tianyu Ding, Biyi Fang, Guanyi Wang, Zhihui Zhu, Luming Liang, Yixin Shi, Sheng Yi, Xiao Tu

    Abstract: Structured pruning is a commonly used technique in deploying deep neural networks (DNNs) onto resource-constrained devices. However, the existing pruning methods are usually heuristic, task-specified, and require an extra fine-tuning procedure. To overcome these limitations, we propose a framework that compresses DNNs into slimmer architectures with competitive performances and significant FLOPs r… ▽ More

    Submitted 11 November, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted by NeurIPS 2021

  43. arXiv:2107.03578  [pdf, other

    cs.CV

    Video 3D Sampling for Self-supervised Representation Learning

    Authors: Wei Li, Dezhao Luo, Bo Fang, Yu Zhou, Wei** Wang

    Abstract: Most of the existing video self-supervised methods mainly leverage temporal signals of videos, ignoring that the semantics of moving objects and environmental information are all critical for video-related tasks. In this paper, we propose a novel self-supervised method for video representation learning, referred to as Video 3D Sampling (V3S). In order to sufficiently utilize the information (spati… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 9 pages, 5 figures, 6 tables

  44. arXiv:2106.13033  [pdf

    cs.CV cs.CL

    A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021

    Authors: Ke-Han Lu, Bo-Han Fang, Kuan-Yu Chen

    Abstract: In this paper, inspired by the successes of visionlanguage pre-trained models and the benefits from training with adversarial attacks, we present a novel transformerbased cross-modal fusion modeling by incorporating the both notions for VQA challenge 2021. Specifically, the proposed model is on top of the architecture of VinVL model [19], and the adversarial training strategy [4] is applied to mak… ▽ More

    Submitted 26 October, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: CVPR 2021 Workshop: Visual Question Answering (VQA) Challenge

  45. arXiv:2105.12929  [pdf, other

    cs.DC

    Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

    Authors: Bo Fang, Daoce Wang, Sian **, Quincey Koziol, Zhao Zhang, Qiang Guan, Suren Byna, Sriram Krishnamoorthy, Dingwen Tao

    Abstract: In recent years, the increasing complexity in scientific simulations and emerging demands for training heavy artificial intelligence models require massive and fast data accesses, which urges high-performance computing (HPC) platforms to equip with more advanced storage infrastructures such as solid-state disks (SSDs). While SSDs offer high-performance I/O, the reliability challenges faced by the… ▽ More

    Submitted 2 August, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: 12 pages, 9 figures, 4 tables, accepted by IEEE Cluster'21

  46. arXiv:2104.10553  [pdf

    eess.IV cs.CV

    Rethinking Annotation Granularity for Overcoming Shortcuts in Deep Learning-based Radiograph Diagnosis: A Multicenter Study

    Authors: Luyang Luo, Hao Chen, Yongjie Xiao, Yanning Zhou, Xi Wang, Varut Vardhanabhuti, Mingxiang Wu, Chu Han, Zaiyi Liu, Xin Hao Benjamin Fang, Efstratios Tsougenis, Huang**g Lin, Pheng-Ann Heng

    Abstract: Two DL models were developed using radiograph-level annotations (yes or no disease) and fine-grained lesion-level annotations (lesion bounding boxes), respectively named CheXNet and CheXDet. The models' internal classification performance and lesion localization performance were compared on a testing set (n=2,922), external classification performance was compared on NIH-Google (n=4,376) and PadChe… ▽ More

    Submitted 8 November, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Radiology: Artificial Intelligence

  47. arXiv:2102.11482  [pdf, other

    cs.SE cs.AI cs.LO

    Structural Similarity of Boundary Conditions and an Efficient Local Search Algorithm for Goal Conflict Identification

    Authors: Hongzhen Zhong, Hai Wan, Weilin Luo, Zhanhao Xiao, Jia Li, Biqing Fang

    Abstract: In goal-oriented requirements engineering, goal conflict identification is of fundamental importance for requirements analysis. The task aims to find the feasible situations which make the goals diverge within the domain, called boundary conditions (BCs). However, the existing approaches for goal conflict identification fail to find sufficient BCs and general BCs which cover more combinations of c… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  48. arXiv:2011.11528  [pdf, other

    cs.RO cs.CV

    Elastic Interaction of Particles for Robotic Tactile Simulation

    Authors: Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun

    Abstract: Tactile sensing plays an important role in robotic perception and manipulation. To overcome the real-world limitations of data collection, simulating tactile response in virtual environment comes as a desire direction of robotic research. Most existing works model the tactile sensor as a rigid multi-body, which is incapable of reflecting the elastic property of the tactile sensor as well as charac… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

  49. arXiv:2011.04868  [pdf, other

    cs.LG math.OC stat.ML

    Neural Network Compression Via Sparse Optimization

    Authors: Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao Tu

    Abstract: The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yi… ▽ More

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

  50. arXiv:2010.08861  [pdf, other

    cs.LG

    Deep Learning in the Era of Edge Computing: Challenges and Opportunities

    Authors: Mi Zhang, Faen Zhang, Nicholas D. Lane, Yuanchao Shu, Xiao Zeng, Biyi Fang, Shen Yan, Hui Xu

    Abstract: The era of edge computing has arrived. Although the Internet is the backbone of edge computing, its true value lies at the intersection of gathering data from sensors and extracting meaningful information from the sensor data. We envision that in the near future, majority of edge devices will be equipped with machine intelligence powered by deep learning. However, deep learning-based approaches re… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.