Showing 1–50 of 96 results for author: Lu, B

Searching in archive cs.
  1. arXiv:2406.15719  [pdf, other]

    cs.CV

    How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification

    Authors: Ali Jamali, Swalpa Kumar Roy, Danfeng Hong, Bing Lu, Pedram Ghamisi

    Abstract: Convolutional Neural Networks (CNNs) and vision transformers (ViTs) have shown excellent capability in complex hyperspectral image (HSI) classification. However, these models require a significant number of training data and are computational resources. On the other hand, modern Multi-Layer Perceptrons (MLPs) have demonstrated great classification capability. These modern MLP-based models require…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.03511  [pdf, other]

    cs.LG cs.AI

    MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data

    Authors: Jianping Zhou, Bin Lu, Zhanyu Liu, Siyu Pan, Xuejun Feng, Hua Wei, Guanjie Zheng, Xinbing Wang, Chenghu Zhou

    Abstract: Due to detector malfunctions and communication failures, missing data is ubiquitous during the collection of traffic data. Therefore, it is of vital importance to impute the missing values to facilitate data analysis and decision-making for Intelligent Transportation System (ITS). However, existing imputation methods generally perform zero pre-filling techniques to initialize missing values, intro…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages, 7 figures

  3. arXiv:2405.18765  [pdf, other]

    cs.LG

    Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

    Authors: Wei-Bang Jiang, Li-Ming Zhao, Bao-Liang Lu

    Abstract: The current electroencephalogram (EEG) based deep learning models are typically designed for specific datasets and applications in brain-computer interaction (BCI), limiting the scale of the models and thus diminishing their perceptual capabilities and generalizability. Recently, Large Language Models (LLMs) have achieved unprecedented success in text processing, prompting us to explore the capabi…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: The Twelfth International Conference on Learning Representations

    Journal ref: The Twelfth International Conference on Learning Representations, 2024

  4. arXiv:2405.14502  [pdf, other]

    cs.DB cs.DC

    DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]

    Authors: Baotong Lu, Kaisong Huang, Chieh-Jan Mike Liang, Tianzheng Wang, Eric Lo

    Abstract: Memory disaggregation can potentially allow memory-optimized range indexes such as B+-trees to scale beyond one machine while attaining high hardware utilization and low cost. Designing scalable indexes on disaggregated memory, however, is challenging due to rudimentary caching, unprincipled offloading and excessive inconsistency among servers. This paper proposes DEX, a new scalable B+-tree for…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 16 pages; To appear at VLDB 2024

  5. arXiv:2405.07233  [pdf, other]

    cs.LG cs.AI physics.ao-ph

    OXYGENERATOR: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

    Authors: Bin Lu, Ze Zhao, Luyu Han, Xiaoying Gan, Yuntao Zhou, Lei Zhou, Luoyi Fu, Xinbing Wang, Chenghu Zhou, Jing Zhang

    Abstract: Accurately reconstructing the global ocean deoxygenation over a century is crucial for assessing and protecting marine ecosystem. Existing expert-dominated numerical simulations fail to catch up with the dynamic variation caused by global warming and human activities. Besides, due to the high-cost data collection, the historical observations are severely sparse, leading to big challenge for precis…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  6. arXiv:2405.05925  [pdf, other]

    cs.LG cs.AI physics.ao-ph

    FuXi-ENS: A machine learning model for medium-range ensemble weather forecasting

    Authors: Xiaohui Zhong, Lei Chen, Hao Li, Jie Feng, Bo Lu

    Abstract: Ensemble weather forecasting is essential for weather predictions and mitigating the impacts of extreme weather events. Constructing an ensemble prediction system (EPS) based on conventional numerical weather prediction (NWP) models is highly computationally expensive. Machine learning (ML) models have emerged as valuable tools for deterministic weather forecasts, providing forecasts with signific…

    Submitted 9 May, 2024; originally announced May 2024.

  7. arXiv:2404.15311  [pdf, other]

    eess.SP cs.AI cs.LG

    Fusing Pretrained ViTs with TCNet for Enhanced EEG Regression

    Authors: Eric Modesitt, Haicheng Yin, Williams Huang Wang, Brian Lu

    Abstract: The task of Electroencephalogram (EEG) analysis is paramount to the development of Brain-Computer Interfaces (BCIs). However, to reach the goal of developing robust, useful BCIs depends heavily on the speed and the accuracy at which BCIs can understand neural dynamics. In response to that goal, this paper details the integration of pre-trained Vision Transformers (ViTs) with Temporal Convolutional…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted HCI International 2024

  8. arXiv:2404.04969  [pdf, other]

    cs.LG cs.AI

    Temporal Generalization Estimation in Evolving Graphs

    Authors: Bin Lu, Tingyan Ma, Xiaoying Gan, Xinbing Wang, Yunqiang Zhu, Chenghu Zhou, Shiyu Liang

    Abstract: Graph Neural Networks (GNNs) are widely deployed in vast fields, but they often struggle to maintain accurate representations as graphs evolve. We theoretically establish a lower bound, proving that under mild conditions, representation distortion inevitably occurs over time. To estimate the temporal distortion without human annotation after deployment, one naive approach is to pre-train a recurre…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published as a conference paper at ICLR 2024

  9. arXiv:2403.13112  [pdf, other]

    cs.CL

    Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

    Authors: Bo-Ru Lu, Nikita Haduong, Chien-Yu Lin, Hao Cheng, Noah A. Smith, Mari Ostendorf

    Abstract: Transformer-based NLP models are powerful but have high computational costs that limit deployment. Finetuned encoder-decoder models are popular in specialized domains and can outperform larger more generalized decoder-only models, such as GPT-4. We introduce a new configuration for encoder-decoder models that improves efficiency on structured output and decomposable tasks where multiple outputs ar…

    Submitted 23 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures. https://github.com/boru-roylu/encode-once-and-decode-in-parallel

  10. arXiv:2403.12024  [pdf, other]

    cs.CL

    Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems

    Authors: Bo-Han Lu, Yi-Hsuan Lin, En-Shiun Annie Lee, Richard Tzong-Han Tsai

    Abstract: Machine translation focuses mainly on high-resource languages (HRLs), while low-resource languages (LRLs) like Taiwanese Hokkien are relatively under-explored. The study aims to address this gap by developing a dual translation model between Taiwanese Hokkien and both Traditional Mandarin Chinese and English. We employ a pre-trained LLaMA 2-7B model specialized in Traditional Mandarin Chinese to l…

    Submitted 14 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024 as a long oral paper

  11. arXiv:2403.05796  [pdf, other]

    cs.CV

    Weakly Supervised Change Detection via Knowledge Distillation and Multiscale Sigmoid Inference

    Authors: Binghao Lu, Caiwen Ding, Jinbo Bi, Dongjin Song

    Abstract: Change detection, which aims to detect spatial changes from a pair of multi-temporal images due to natural or man-made causes, has been widely applied in remote sensing, disaster management, urban management, etc. Most existing change detection approaches, however, are fully supervised and require labor-intensive pixel-level labels. To address this, we develop a novel weakly supervised change dete…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: code is available: https://github.com/BinghaoLu/KD-MSI

  12. arXiv:2403.02576  [pdf, other]

    cs.DL cs.LG cs.SI

    AceMap: Knowledge Discovery through Academic Graph

    Authors: Xinbing Wang, Luoyi Fu, Xiaoying Gan, Ying Wen, Guanjie Zheng, Jiaxin Ding, Liyao Xiang, Nanyang Ye, Meng Jin, Shiyu Liang, Bin Lu, Haiwen Wang, Yi Xu, Cheng Deng, Shao Zhang, Huquan Kang, Xingli Wang, Qi Li, Zhixin Guo, Jiexing Qi, Pan Liu, Yuyang Ren, Lyuwen Wu, Jungang Yang, Jianping Zhou, et al. (1 additional author not shown)

    Abstract: The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publicatio…

    Submitted 14 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Technical Report for AceMap (https://www.acemap.info)

  13. arXiv:2401.07188  [pdf, other]

    cs.CV

    Left-right Discrepancy for Adversarial Attack on Stereo Networks

    Authors: Pengfei Wang, Xiaofei Hui, Beijia Lu, Nimrod Lilith, Jun Liu, Sameer Alam

    Abstract: Stereo matching neural networks often involve a Siamese structure to extract intermediate features from left and right images. The similarity between these intermediate left-right features significantly impacts the accuracy of disparity estimation. In this paper, we introduce a novel adversarial attack approach that generates perturbation noise specifically designed to maximize the discrepancy bet…

    Submitted 13 January, 2024; originally announced January 2024.

  14. arXiv:2312.15109  [pdf]

    cs.RO cs.AI

    UAS-based Automated Structural Inspection Path Planning via Visual Data Analytics and Optimization

    Authors: Yuxiang Zhao, Benhao Lu, Mohamad Alipour

    Abstract: Unmanned Aerial Systems (UAS) have gained significant traction for their application in infrastructure inspections. However, considering the enormous scale and complex nature of infrastructure, automation is essential for improving the efficiency and quality of inspection operations. One of the core problems in this regard is electing an optimal automated flight path that can achieve the mission o…

    Submitted 22 December, 2023; originally announced December 2023.

  15. arXiv:2312.14792  [pdf, ps, other]

    cs.LG cs.AI cs.CV cs.IT math.PR

    The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs

    Authors: Junli Fang, João F. C. Mota, Baoshan Lu, Weicheng Zhang, Xuemin Hong

    Abstract: The joint source-channel coding (JSCC) framework leverages deep learning to learn from data the best codes for source and channel coding. When the output signal, rather than being binary, is directly mapped onto the IQ domain (complex-valued), we call the resulting framework joint source coding and modulation (JSCM). We consider a JSCM scenario and show the existence of a strict tradeoff between c…

    Submitted 6 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Paper accepted in IEEE Transactions on Signal Processing

  16. arXiv:2312.09926  [pdf, other]

    physics.ao-ph cs.AI cs.LG

    FuXi-S2S: An accurate machine learning model for global subseasonal forecasts

    Authors: Lei Chen, Xiaohui Zhong, Jie Wu, Deliang Chen, Qingchen Chao, Chensen Lin, Zixin Hu, Bo Lu, Hao Li, Yuan Qi

    Abstract: Skillful subseasonal forecasts beyond 2 weeks are crucial for a wide range of applications across various sectors of society. Recently, state-of-the-art machine learning based weather forecasting models have made significant advancements, outperforming the high-resolution forecast (HRES) from the European Centre for Medium-Range Weather Forecasts (ECMWF). However, the full potential of machine lea…

    Submitted 15 December, 2023; originally announced December 2023.

  17. arXiv:2312.02015  [pdf, other]

    cs.CV

    ColonNeRF: High-Fidelity Neural Reconstruction of Long Colonoscopy

    Authors: Yufei Shi, Beijia Lu, Jia-Wei Liu, Ming Li, Mike Zheng Shou

    Abstract: Colonoscopy reconstruction is pivotal for diagnosing colorectal cancer. However, accurate long-sequence colonoscopy reconstruction faces three major challenges: (1) dissimilarity among segments of the colon due to its meandering and convoluted shape; (2) co-existence of simple and intricately folded geometry structures; (3) sparse viewpoints due to constrained camera trajectories. To tackle these…

    Submitted 21 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: for Project Page, see https://showlab.github.io/ColonNeRF/

  18. Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook

    Authors: Huang Zhang, Changshuo Wang, Shengwei Tian, Baoli Lu, Liping Zhang, Xin Ning, Xiao Bai

    Abstract: In recent years, point cloud representation has become one of the research hotspots in the field of computer vision, and has been widely used in many fields, such as autonomous driving, virtual reality, robotics, etc. Although deep learning techniques have achieved great success in processing regular structured 2D grid image data, there are still great challenges in processing irregular, unstructu…

    Submitted 5 November, 2023; originally announced November 2023.

    Journal ref: Displays 102456 (2023)

  19. arXiv:2308.08344  [pdf, other]

    cs.LG cs.AI cs.SI

    Graph Out-of-Distribution Generalization with Controllable Data Augmentation

    Authors: Bin Lu, Xiaoying Gan, Ze Zhao, Shiyu Liang, Luoyi Fu, Xinbing Wang, Chenghu Zhou

    Abstract: Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe \emph{hybrid structure distribution shif…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: Under review

  20. arXiv:2308.02510  [pdf, other]

    eess.IV cs.AI cs.CV cs.MM q-bio.NC

    Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals

    Authors: Yu-Ting Lan, Kan Ren, Yansen Wang, Wei-Long Zheng, Dongsheng Li, Bao-Liang Lu, Lili Qiu

    Abstract: Seeing is believing, however, the underlying mechanism of how human visual perceptions are intertwined with our cognitions is still a mystery. Thanks to the recent advances in both neuroscience and artificial intelligence, we have been able to record the visually evoked brain activities and mimic the visual perception ability through computational approaches. In this paper, we pay attention to vis…

    Submitted 16 August, 2023; v1 submitted 27 July, 2023; originally announced August 2023.

    Comments: A preprint version of an ongoing work

  21. arXiv:2307.07047  [pdf, other]

    cs.CL

    Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

    Authors: Bo-Ru Lu, Nikita Haduong, Chia-Hsuan Lee, Zeqiu Wu, Hao Cheng, Paul Koester, Jean Utke, Tao Yu, Noah A. Smith, Mari Ostendorf

    Abstract: The capabilities of pretrained language models have opened opportunities to explore new application areas, but applications involving human-human interaction are limited by the fact that most data is protected from public release for privacy reasons. Problem-solving human dialogues in real applications can be much more complex than existing Wizard-of-Oz collections, preventing successful domain tr…

    Submitted 20 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  22. arXiv:2306.08103  [pdf, other]

    cs.CV

    Generating Images with 3D Annotations Using Diffusion Models

    Authors: Wufei Ma, Qihao Liu, Jiahao Wang, Angtian Wang, Xiaoding Yuan, Yi Zhang, Zihao Xiao, Guofeng Zhang, Beijia Lu, Ruxiao Duan, Yongrui Qi, Adam Kortylewski, Yaoyao Liu, Alan Yuille

    Abstract: Diffusion models have emerged as a powerful generative method, capable of producing stunning photo-realistic images from natural language descriptions. However, these models lack explicit control over the 3D structure in the generated images. Consequently, this hinders our ability to obtain detailed 3D annotations for the generated images or to craft instances with specific poses and distances. In…

    Submitted 3 April, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 Spotlight. Code: https://ccvl.jhu.edu/3D-DST/

  23. arXiv:2303.02219  [pdf, other]

    cs.LG

    NSGA-PINN: A Multi-Objective Optimization Method for Physics-Informed Neural Network Training

    Authors: Binghang Lu, Christian B. Moya, Guang Lin

    Abstract: This paper presents NSGA-PINN, a multi-objective optimization framework for effective training of Physics-Informed Neural Networks (PINNs). The proposed framework uses the Non-dominated Sorting Genetic Algorithm (NSGA-II) to enable traditional stochastic gradient optimization algorithms (e.g., ADAM) to escape local minima effectively. Additionally, the NSGA-II algorithm enables satisfying the init…

    Submitted 6 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 13 pages, 35 figures

  24. arXiv:2302.14267  [pdf, other]

    cs.CV

    Adversarial Attack with Raindrops

    Authors: Jiyuan Liu, Bingyi Lu, Mingkang Xiong, Tao Zhang, Huilin Xiong

    Abstract: Deep neural networks (DNNs) are known to be vulnerable to adversarial examples, which are usually designed artificially to fool DNNs, but rarely exist in real-world scenarios. In this paper, we study the adversarial examples caused by raindrops, to demonstrate that there exist plenty of natural phenomena being able to work as adversarial attackers to DNNs. Moreover, we present a new approach to ge…

    Submitted 16 July, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 7 figures, This manuscript was submitted to CVPR 2023

    MSC Class: I.2.6

  25. arXiv:2301.11499  [pdf]

    cs.CV cs.AI

    Dual-View Selective Instance Segmentation Network for Unstained Live Adherent Cells in Differential Interference Contrast Images

    Authors: Fei Pan, Yutong Wu, Kangning Cui, Shuxun Chen, Yanfang Li, Yaofang Liu, Adnan Shakoor, Han Zhao, Beijia Lu, Shaohua Zhi, Raymond Chan, Dong Sun

    Abstract: Despite recent advances in data-independent and deep-learning algorithms, unstained live adherent cell instance segmentation remains a long-standing challenge in cell image processing. Adherent cells' inherent visual characteristics, such as low contrast structures, fading edges, and irregular morphology, have made it difficult to distinguish from one another, even by human experts, let alone comp…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 13 pages, 5 figures, 3 tables

  26. arXiv:2301.08937  [pdf, other]

    cs.CL cs.AI

    Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien

    Authors: Sin-En Lu, Bo-Han Lu, Chao-Yi Lu, Richard Tzong-Han Tsai

    Abstract: In natural language processing (NLP), code-mixing (CM) is a challenging task, especially when the mixed languages include dialects. In Southeast Asian countries such as Singapore, Indonesia, and Malaysia, Hokkien-Mandarin is the most widespread code-mixed language pair among Chinese immigrants, and it is also common in Taiwan. However, dialects such as Hokkien often have a scarcity of resources an…

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: The paper was accepted by EMNLP 2022 findings

  27. arXiv:2209.08864  [pdf, other]

    cs.RO

    CFVS: Coarse-to-Fine Visual Servoing for 6-DoF Object-Agnostic Peg-In-Hole Assembly

    Authors: Bo-Siang Lu, Tung-I Chen, Hsin-Ying Lee, Winston H. Hsu

    Abstract: Robotic peg-in-hole assembly remains a challenging task due to its high accuracy demand. Previous work tends to simplify the problem by restricting the degree of freedom of the end-effector, or limiting the distance between the target and the initial pose position, which prevents them from being deployed in real-world manufacturing. Thus, we present a Coarse-to-Fine Visual Servoing (CFVS) peg-in-h…

    Submitted 19 January, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted by ICRA 2023

  28. arXiv:2209.05095  [pdf, other]

    cs.RO

    FBG-Based Online Learning and 3-D Shape Control of Unmodeled Continuum and Soft Robots in Unstructured Environments

    Authors: Yiang Lu, Wei Chen, Bo Lu, Jianshu Zhou, Zhi Chen, Qi Dou, Yun-Hui Liu

    Abstract: In this paper, we present a novel and generic data-driven method to servo-control the 3-D shape of continuum and soft robots embedded with fiber Bragg grating (FBG) sensors. Developments of 3-D shape perception and control technologies are crucial for continuum robots to perform the tasks autonomously in surgical interventions. However, owing to the nonlinear properties of continuum robots, one ma…

    Submitted 19 November, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

  29. Combinatorial optimization solving by coherent Ising machines based on spiking neural networks

    Authors: Bo Lu, Yong-Pan Gao, Kai Wen, Chuan Wang

    Abstract: Spiking neural network is a kind of neuromorphic computing that is believed to improve the level of intelligence and provide advantages for quantum computing. In this work, we address this issue by designing an optical spiking neural network and find that it can be used to accelerate the speed of computation, especially on combinatorial optimization problems. Here the spiking neural network is con…

    Submitted 20 October, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: 10 pages, 5 figures, accepted by Quantum

    Journal ref: Quantum 7, 1151 (2023)

  30. arXiv:2208.02049  [pdf, other]

    cs.CV

    AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy

    Authors: Ziyi Wang, Bo Lu, Yonghao Long, Fangxun Zhong, Tak-Hong Cheung, Qi Dou, Yunhui Liu

    Abstract: Computer-assisted minimally invasive surgery has great potential in benefiting modern operating theatres. The video data streamed from the endoscope provides rich information to support context-awareness for next-generation intelligent surgical systems. To achieve accurate perception and automatic manipulation during the procedure, learning based technique is a promising way, which enables advance…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at MICCAI 2022

  31. Are Updatable Learned Indexes Ready?

    Authors: Chaichon Wongkham, Baotong Lu, Chris Liu, Zhicong Zhong, Eric Lo, Tianzheng Wang

    Abstract: Recently, numerous promising results have shown that updatable learned indexes can perform better than traditional indexes with much lower memory space consumption. But it is unknown how these learned indexes compare against each other and against the traditional ones under realistic workloads with changing data distributions and concurrency levels. This makes practitioners still wary about how th…

    Submitted 4 September, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Journal ref: PVLDB, 15(11): 3004 - 3017, 2022

  32. arXiv:2207.01249  [pdf, other]

    cs.RO

    Model-Free 3D Shape Control of Deformable Objects Using Novel Features Based on Modal Analysis

    Authors: Bohan Yang, Bo Lu, Wei Chen, Fangxun Zhong, Yun-Hui Liu

    Abstract: Shape control of deformable objects is a challenging and important robotic problem. This paper proposes a model-free controller using novel 3D global deformation features based on modal analysis. Unlike most existing controllers using geometric features, our controller employs a physically-based deformation feature by decoupling 3D global deformation into low-frequency mode shapes. Although modal…

    Submitted 18 April, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted by the IEEE Transactions on Robotics. The paper will appear in the IEEE Transactions on Robotics. IEEE copyright

  33. Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation

    Authors: Bin Lu, Xiaoying Gan, Lina Yang, Weinan Zhang, Luoyi Fu, Xinbing Wang

    Abstract: With the tremendous expansion of graphs data, node classification shows its great importance in many real-world applications. Existing graph neural network based methods mainly focus on classifying unlabeled nodes within fixed classes with abundant labeling. However, in many practical scenarios, graph evolves with emergence of new nodes and edges. Novel classes appear incrementally along with few…

    Submitted 3 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to KDD2022

  34. Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge Transfer

    Authors: Bin Lu, Xiaoying Gan, Weinan Zhang, Huaxiu Yao, Luoyi Fu, Xinbing Wang

    Abstract: Spatio-temporal graph learning is a key method for urban computing tasks, such as traffic flow, taxi demand and air quality forecasting. Due to the high cost of data collection, some developing cities have few available data, which makes it infeasible to train a well-performed model. To address this challenge, cross-city knowledge transfer has shown its promise, where the model learned from data-s…

    Submitted 3 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to KDD2022

  35. arXiv:2205.12244  [pdf, other]

    cs.CL

    Unsupervised Learning of Hierarchical Conversation Structure

    Authors: Bo-Ru Lu, Yushi Hu, Hao Cheng, Noah A. Smith, Mari Ostendorf

    Abstract: Human conversations can evolve in many different ways, creating challenges for automatic understanding and summarization. Goal-oriented conversations often have meaningful sub-dialogue structure, but it can be highly domain-dependent. This work introduces an unsupervised approach to learning hierarchical conversation structure, including turn and sub-dialogue segment labels, corresponding roughly…

    Submitted 17 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2022 Findings)

  36. arXiv:2205.09106  [pdf, ps, other]

    cs.IT eess.SY

    Reinforcement Learning Based Robust Policy Design for Relay and Power Optimization in DF Relaying Networks

    Authors: Yuanzhe Geng, Erwu Liu, Rui Wang, Pengcheng Sun, Binyu Lu

    Abstract: In this paper, we study the outage minimization problem in a decode-and-forward cooperative network with relay uncertainty. To reduce the outage probability and improve the quality of service, existing researches usually rely on the assumption of both exact instantaneous channel state information (CSI) and environmental uncertainty. However, it is difficult to obtain perfect instantaneous CSI imme…

    Submitted 8 May, 2022; originally announced May 2022.

  37. arXiv:2204.05045  [pdf, other]

    cs.CV

    SAL-CNN: Estimate the Remaining Useful Life of Bearings Using Time-frequency Information

    Authors: Bingguo Liu, Zhuo Gao, Binghui Lu, Hangcheng Dong, Zeru An

    Abstract: In modern industrial production, the prediction ability of the remaining useful life (RUL) of bearings directly affects the safety and stability of the system. Traditional methods require rigorous physical modeling and perform poorly for complex systems. In this paper, an end-to-end RUL prediction method is proposed, which uses short-time Fourier transform (STFT) as preprocessing. Considering the…

    Submitted 11 April, 2022; originally announced April 2022.

  38. arXiv:2204.03195  [pdf, other]

    cs.RO

    3D Perception based Imitation Learning under Limited Demonstration for Laparoscope Control in Robotic Surgery

    Authors: Bin Li, Ruofeng Wei, Jiaqi Xu, Bo Lu, Chi-Hang Yee, Chi-Fai Ng, Pheng-Ann Heng, Qi Dou, Yun-Hui Liu

    Abstract: Automatic laparoscope motion control is fundamentally important for surgeons to efficiently perform operations. However, its traditional control methods based on tool tracking without considering information hidden in surgical scenes are not intelligent enough, while the latest supervised imitation learning (IL)-based methods require expensive sensor data and suffer from distribution mismatch issu…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 7 pages, 7 figures, 2022 IEEE International Conference on Robotics and Automation (ICRA)

  39. arXiv:2203.13921  [pdf, other]

    cs.AR cs.AI

    A Semi-Decoupled Approach to Fast and Optimal Hardware-Software Co-Design of Neural Accelerators

    Authors: Bingqian Lu, Zheyu Yan, Yiyu Shi, Shaolei Ren

    Abstract: In view of the performance limitations of fully-decoupled designs for neural architectures and accelerators, hardware-software co-design has been emerging to fully reap the benefits of flexible design spaces and optimize neural network performance. Nonetheless, such co-design also enlarges the total search space to practically infinity and presents substantial challenges. While the prior studies h…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted by and presented at the TinyML Research Symposium 2022

  40. arXiv:2201.12811  [pdf, other]

    cs.DS

    A DFS Algorithm for Maximum Matchings in General Graphs

    Authors: Tony T. Lee, Bojun Lu, Hanli Chu

    Abstract: In this paper, we propose a depth-first search (DFS) algorithm for searching maximum matchings in general graphs. Unlike blossom shrinking algorithms, which store all possible alternative alternating paths in the super-vertices shrunk from blossoms, the newly proposed algorithm does not involve blossom shrinking. The basic idea is to deflect the alternating path when facing blossoms. The algorithm…

    Submitted 19 April, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: 17 pages, 9 figures, 2 tables

    MSC Class: 05C30 (Primary) 68R10; 68R05 (Secondary) ACM Class: G.2.1; G.2.2; F.2.2

  41. arXiv:2201.05707  [pdf, other]

    math.NA cs.CE physics.comp-ph

    Efficient Generation of Membrane and Solvent Tetrahedral Meshes for Ion Channel Finite Element Calculation

    Authors: Zhen Chao, Sheng Gui, Benzhuo Lu, Dexuan Xie

    Abstract: A finite element solution of an ion channel dielectric continuum model such as Poisson-Boltzmann equation (PBE) and a system of Poisson-Nernst-Planck equations (PNP) requires tetrahedral meshes for an ion channel protein region, a membrane region, and an ionic solvent region as well as an interface fitted irregular tetrahedral mesh of a simulation box domain. However, generating these meshes is ve…

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: 20 pages, 14 figures

    MSC Class: 65M50; 65M60; 92-08; 68N01

  42. arXiv:2111.01203  [pdf]

    cs.LG cs.AI

    One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

    Authors: Bingqian Lu, Jianyi Yang, Weiwen Jiang, Yiyu Shi, Shaolei Ren

    Abstract: Convolutional neural networks (CNNs) are used in numerous real-world applications such as vision-based autonomous driving and video content analysis. To run CNN inference on various target devices, hardware-aware neural architecture search (NAS) is crucial. A key requirement of efficient hardware-aware NAS is the fast evaluation of inference latencies in order to rank different architectures. Whil…

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted by ACM SIGMETRICS 2022. Published in the Proceedings of the ACM on Measurement and Analysis of Computing Systems, vol. 5, no. 3, Article 34, December 2021. GitHub: https://github.com/Ren-Research/OneProxy

    Journal ref: Proc. ACM Meas. Anal. Comput. Syst., vol. 5, no. 3, Article 34, December 2021

  43. arXiv:2111.00775  [pdf, other]

    cs.CV

    PP-ShiTu: A Practical Lightweight Image Recognition System

    Authors: Shengyu Wei, Ruoyu Guo, Cheng Cui, Bin Lu, Shuilong Dong, Tingquan Gao, Yuning Du, Ying Zhou, Xueying Lyu, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

    Abstract: In recent years, image recognition applications have developed rapidly. A large number of studies and techniques have emerged in different fields, such as face recognition, pedestrian and vehicle re-identification, landmark retrieval, and product recognition. In this paper, we propose a practical lightweight image recognition system, named PP-ShiTu, consisting of the following 3 modules, mainbody…

    Submitted 21 January, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 9 pages, 5 figures, 9 tables. arXiv admin note: text overlap with arXiv:2109.03144

  44. arXiv:2110.15114  [pdf, other]

    cs.IR

    UltraGCN: Ultra Simplification of Graph Convolutional Networks for Recommendation

    Authors: Kelong Mao, Jieming Zhu, Xi Xiao, Biao Lu, Zhaowei Wang, Xiuqiang He

    Abstract: With the recent success of graph convolutional networks (GCNs), they have been widely applied for recommendation, and achieved impressive performance gains. The core of GCNs lies in their message passing mechanism to aggregate neighborhood information. However, we observed that message passing largely slows down the convergence of GCNs during training, especially for large-scale recommender systems,…

    Submitted 29 November, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted by CIKM 2021. Code available at: https://reczoo.github.io/UltraGCN

  45. arXiv:2110.03912  [pdf, other]

    cs.CV cs.AI eess.IV

    Stereo Dense Scene Reconstruction and Accurate Localization for Learning-Based Navigation of Laparoscope in Minimally Invasive Surgery

    Authors: Ruofeng Wei, Bin Li, Hangjie Mo, Bo Lu, Yonghao Long, Bohan Yang, Qi Dou, Yunhui Liu, Dong Sun

    Abstract: Objective: The computation of anatomical information and laparoscope position is a fundamental block of surgical navigation in Minimally Invasive Surgery (MIS). Recovering a dense 3D structure of surgical scene using visual cues remains a challenge, and the online laparoscopic tracking primarily relies on external sensors, which increases system complexity. Methods: Here, we propose a learning-dri…

    Submitted 27 November, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Journal ref: IEEE Transactions on Biomedical Engineering 2022

  46. arXiv:2109.15099  [pdf, other]

    cs.CV

    PP-LCNet: A Lightweight CPU Convolutional Neural Network

    Authors: Cheng Cui, Tingquan Gao, Shengyu Wei, Yuning Du, Ruoyu Guo, Shuilong Dong, Bin Lu, Ying Zhou, Xueying Lv, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

    Abstract: We propose a lightweight CPU network based on the MKLDNN acceleration strategy, named PP-LCNet, which improves the performance of lightweight models on multiple tasks. This paper lists technologies which can improve network accuracy while the latency is almost constant. With these improvements, the accuracy of PP-LCNet can greatly surpass the previous network structure with the same inference time…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: 8 pages, 2 figures, 9 tables

  47. PlaTe: Visually-Grounded Planning with Transformers in Procedural Tasks

    Authors: Jiankai Sun, De-An Huang, Bo Lu, Yun-Hui Liu, Bolei Zhou, Animesh Garg

    Abstract: In this work, we study the problem of how to leverage instructional videos to facilitate the understanding of human decision-making processes, focusing on training a model with the ability to plan a goal-directed procedure from real-world videos. Learning structured and plannable state and action spaces directly from unstructured videos is the key technical challenge of our task. There are two pro…

    Submitted 2 March, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Journal ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 2, April 2022)

  48. arXiv:2109.04673  [pdf, other]

    cs.CL

    DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization

    Authors: Zeqiu Wu, Bo-Ru Lu, Hannaneh Hajishirzi, Mari Ostendorf

    Abstract: Identifying relevant knowledge to be used in conversational systems that are grounded in long documents is critical to effective response generation. We introduce a knowledge identification model that leverages the document structure to provide dialogue-contextualized passage encodings and better locate knowledge relevant to the conversation. An auxiliary loss captures the history of dialogue-docu…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera-ready

  49. arXiv:2109.03144  [pdf, other]

    cs.CV

    PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

    Authors: Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

    Abstract: Optical Character Recognition (OCR) systems have been widely used in a variety of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR sys…

    Submitted 12 October, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: 8 pages, 9 figures, 5 tables

  50. arXiv:2108.13035  [pdf, other]

    cs.RO cs.AI eess.SY

    SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

    Authors: Jiaqi Xu, Bin Li, Bo Lu, Yun-Hui Liu, Qi Dou, Pheng-Ann Heng

    Abstract: Autonomous surgical execution relieves tedious routines and surgeon's fatigue. Recent learning-based methods, especially reinforcement learning (RL) based methods, achieve promising performance for dexterous manipulation, which usually requires the simulation to collect data efficiently and reduce the hardware cost. The existing learning-based simulation platforms for medical robots suffer from li…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: 8 pages, 8 figures, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)