Skip to main content

Showing 1–50 of 120 results for author: Lou, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02095  [pdf, other

    cs.SE

    TIGER: A Generating-Then-Ranking Framework for Practical Python Type Inference

    Authors: Chong Wang, Jian Zhang, Yiling Lou, Mingwei Liu, Weisong Sun, Yang Liu, Xin Peng

    Abstract: Python's dynamic ty** system offers flexibility and expressiveness but can lead to type-related errors, prompting the need for automated type inference to enhance type hinting. While existing learning-based approaches show promising inference accuracy, they struggle with practical challenges in comprehensively handling various types, including complex generic types and (unseen) user-defined type… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.15806  [pdf, other

    cs.RO

    Robust Dynamic Control Barrier Function Based Trajectory Planning for Mobile Manipulator

    Authors: Lihao Xu, Xiaogang Xiong, Bai Yang, Yunjiang Lou

    Abstract: High-dimensional robot dynamic trajectory planning poses many challenges for traditional planning algorithms. Existing planning methods suffer from issues such as long computation times, limited capacity to address intricate obstacle models, and lack of consideration for external disturbances and measurement inaccuracies in these high-dimensional systems. To tackle these challenges, this paper pro… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  3. arXiv:2406.11707  [pdf, other

    cs.CR cs.CV cs.LG

    A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

    Authors: Yang Lou, Yi Zhu, Qun Song, Rui Tan, Chunming Qiao, Wei-Bin Lee, Jian** Wang

    Abstract: Trajectory prediction forecasts nearby agents' moves based on their historical trajectories. Accurate trajectory prediction is crucial for autonomous vehicles. Existing attacks compromise the prediction model of a victim AV by directly manipulating the historical trajectory of an attacker AV, which has limited real-world applicability. This paper, for the first time, explores an indirect attack ap… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 33rd USENIX Security Symposium 2024

  4. arXiv:2406.11147  [pdf, other

    cs.SE cs.AI

    Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG

    Authors: Xueying Du, Geng Zheng, Kaixin Wang, Jiayi Feng, Wentai Deng, Mingwei Liu, Bihuan Chen, Xin Peng, Tao Ma, Yiling Lou

    Abstract: Vulnerability detection is essential for software quality assurance. In recent years, deep learning models (especially large language models) have shown promise in vulnerability detection. In this work, we propose a novel LLM-based vulnerability detection technique Vul-RAG, which leverages knowledge-level retrieval-augmented generation (RAG) framework to detect vulnerability for the given code in… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.10018  [pdf, other

    cs.SE

    STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis

    Authors: Junwei Liu, Yixuan Chen, Mingwei Liu, Xin Peng, Yiling Lou

    Abstract: Repository-level code completion is challenging as it involves complicated contexts from multiple files in the repository. To date, researchers have proposed two technical categories to enhance LLM-based repository-level code completion, i.e., retrieval-augmented generation (RAG) and static analysis integration. This work performs the first study on the static analysis integration in LLM-based rep… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  6. arXiv:2406.03803  [pdf, ps, other

    cs.IT

    Determining the Weight Spectrum of the Reed--Muller Codes RM(m-6,m)

    Authors: Yueying Lou, Qichun Wang

    Abstract: The weight spectra of the Reed-Muller codes $RM(r,m)$ were unknown for $r=3,...,m-5$. In IEEE Trans. Inform. Theory 2024, Carlet determined the weight spectrum of $RM(m-5,m)$ for $m\ge10$ using the Maiorana-McFarland construction, where the result was tried to be extended to $RM(m-6,m)$, but many problems occurred and much work needed to be done. In this paper, we propose a novel way of constructi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2404.14294  [pdf, other

    cs.CL cs.AI

    A Survey on Efficient Inference for Large Language Models

    Authors: Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-** Zhang, Yuhan Dong, Yu Wang

    Abstract: Large Language Models (LLMs) have attracted extensive attention due to their remarkable performance across various tasks. However, the substantial computational and memory requirements of LLM inference pose challenges for deployment in resource-constrained scenarios. Efforts within the field have been directed towards develo** techniques aimed at enhancing the efficiency of LLM inference. This p… ▽ More

    Submitted 8 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.11978  [pdf, other

    cs.CL

    EVIT: Event-Oriented Instruction Tuning for Event Reasoning

    Authors: Zhengwei Tao, Xiancai Chen, Zhi **, Xiaoying Bai, Haiyan Zhao, Yiwei Lou

    Abstract: Events refer to specific occurrences, incidents, or happenings that take place under a particular background. Event reasoning aims to infer events according to certain relations and predict future events. The cutting-edge techniques for event reasoning play a crucial role in various natural language processing applications. Large language models (LLMs) have made significant advancements in event r… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  9. arXiv:2404.05952  [pdf, other

    cs.RO

    Robot Safe Planning In Dynamic Environments Based On Model Predictive Control Using Control Barrier Function

    Authors: Zetao Lu, Kaijun Feng, Jun Xu, Haoyao Chen, Yunjiang Lou

    Abstract: Implementing obstacle avoidance in dynamic environments is a challenging problem for robots. Model predictive control (MPC) is a popular strategy for dealing with this type of problem, and recent work mainly uses control barrier function (CBF) as hard constraints to ensure that the system state remains in the safe set. However, in crowded scenarios, effective solutions may not be obtained due to i… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  10. arXiv:2403.16362  [pdf, other

    cs.SE

    AgentFL: Scaling LLM-based Fault Localization to Project-Level Context

    Authors: Yihao Qin, Shangwen Wang, Yiling Lou, **hao Dong, Kaixin Wang, Xiaoling Li, Xiaoguang Mao

    Abstract: Fault Localization (FL) is an essential step during the debugging process. With the strong capabilities of code comprehension, the recent Large Language Models (LLMs) have demonstrated promising performance in diagnosing bugs in the code. Nevertheless, due to LLMs' limited performance in handling long contexts, existing LLM-based fault localization remains on localizing bugs within a small code sc… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  11. arXiv:2402.03610  [pdf, other

    cs.LG cs.AI cs.CL

    RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

    Authors: Tomoyuki Kagaya, Thong **g Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You

    Abstract: Owing to recent advancements, Large Language Models (LLMs) can now be deployed as agents for increasingly complex decision-making applications in areas including robotics, gaming, and API integration. However, reflecting past experiences in current decision-making processes, an innate human behavior, continues to pose significant challenges. Addressing this, we propose Retrieval-Augmented Planning… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  12. arXiv:2401.17786  [pdf, other

    cs.DB cs.PF

    A Graph-Native Query Optimization Framework

    Authors: Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, **gren Zhou

    Abstract: Graph queries that combine pattern matching with relational operations, referred as PatRelQuery, are widely used in many real-world applications. It allows users to identify arbitrary patterns in a graph and further perform in-depth relational analysis on the results. To effectively support PatRelQuery, two key challenges need to be addressed: (1) how to optimize PatRelQuery in a unified framework… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  13. Integrated Sensing and Communication with Massive MIMO: A Unified Tensor Approach for Channel and Target Parameter Estimation

    Authors: Ruoyu Zhang, Lei Cheng, Shuai Wang, Yi Lou, Yulong Gao, Wen Wu, Derrick Wing Kwan Ng

    Abstract: Benefitting from the vast spatial degrees of freedom, the amalgamation of integrated sensing and communication (ISAC) and massive multiple-input multiple-output (MIMO) is expected to simultaneously improve spectral and energy efficiencies as well as the sensing capability. However, a large number of antennas deployed in massive MIMO-ISAC raises critical challenges in acquiring both accurate channe… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Journal ref: IEEE Transactions on Wireless Communications, 2024

  14. arXiv:2312.10448  [pdf, other

    cs.SE cs.AI cs.CL

    Resolving Crash Bugs via Large Language Models: An Empirical Study

    Authors: Xueying Du, Mingwei Liu, Juntao Li, Hanlin Wang, Xin Peng, Yiling Lou

    Abstract: Crash bugs cause unexpected program behaviors or even termination, requiring high-priority resolution. However, manually resolving crash bugs is challenging and labor-intensive, and researchers have proposed various techniques for their automated localization and repair. ChatGPT, a recent large language model (LLM), has garnered significant attention due to its exceptional performance across vario… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  15. arXiv:2312.08367  [pdf, other

    cs.CV

    ViLA: Efficient Video-Language Alignment for Video Question Answering

    Authors: Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming Lin, Shan Yang

    Abstract: In this work, we propose an efficient Video-Language Alignment (ViLA) network. Our ViLA model addresses both efficient frame sampling and effective cross-modal alignment in a unified way. In our ViLA network, we design a new learnable text-guided Frame-Prompter together with a new cross-modal distillation (QFormer-Distiller) module. Pre-trained large image-language models have shown promising resu… ▽ More

    Submitted 29 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  16. arXiv:2311.05795  [pdf, other

    cs.LG stat.ML

    Improvements on Uncertainty Quantification for Node Classification via Distance-Based Regularization

    Authors: Russell Alan Hart, Linlin Yu, Yifei Lou, Feng Chen

    Abstract: Deep neural networks have achieved significant success in the last decades, but they are not well-calibrated and often produce unreliable predictions. A large number of literature relies on uncertainty quantification to evaluate the reliability of a learning model, which is particularly important for applications of out-of-distribution (OOD) detection and misclassification detection. We are intere… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Neurips 2023

  17. arXiv:2311.04726  [pdf, other

    cs.CV

    Social Motion Prediction with Cognitive Hierarchies

    Authors: Wentao Zhu, Jason Qin, Yuke Lou, Hang Ye, Xiaoxuan Ma, Hai Ci, Yizhou Wang

    Abstract: Humans exhibit a remarkable capacity for anticipating the actions of others and planning their own actions accordingly. In this study, we strive to replicate this ability by addressing the social motion prediction problem. We introduce a new benchmark, a novel formulation, and a cognition-inspired framework. We present Wusi, a 3D multi-person motion dataset under the context of team sports, which… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  18. arXiv:2311.04448  [pdf, other

    cs.SE

    Inferring Resource-Oriented Intentions using LLMs for Static Resource Leak Detection

    Authors: Chong Wang, Jianan Liu, Xin Peng, Yang Liu, Yiling Lou

    Abstract: Resource leaks, caused by resources not being released after acquisition, often lead to performance issues and system crashes. Existing static detection techniques rely on mechanical matching of predefined resource acquisition/release APIs and null-checking conditions to find unreleased resources, suffering from both (1) false negatives caused by the incompleteness of predefined resource acquisiti… ▽ More

    Submitted 2 July, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

  19. On Finding Bi-objective Pareto-optimal Fraud Prevention Rule Sets for Fintech Applications

    Authors: Chengyao Wen, Yin Lou

    Abstract: Rules are widely used in Fintech institutions to make fraud prevention decisions, since rules are highly interpretable thanks to their intuitive if-then structure. In practice, a two-stage framework of fraud prevention decision rule set mining is usually employed in large Fintech institutions; Stage 1 generates a potentially large pool of rules and Stage 2 aims to produce a refined rule subset acc… ▽ More

    Submitted 27 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  20. arXiv:2310.04551  [pdf, other

    cs.CV

    MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation

    Authors: Muhammad Osama Khan, Junbang Liang, Chun-Kai Wang, Shan Yang, Yu Lou

    Abstract: Pre-training has been an important ingredient in develo** strong monocular depth estimation models in recent years. For instance, self-supervised learning (SSL) is particularly effective by alleviating the need for large datasets with dense ground-truth depth maps. However, despite these improvements, our study reveals that the later layers of the SOTA SSL method are actually suboptimal. By exam… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  21. arXiv:2309.01372  [pdf, other

    cs.CV

    DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion

    Authors: Yunhong Lou, Linchao Zhu, Yaxiong Wang, Xiaohan Wang, Yi Yang

    Abstract: We present DiverseMotion, a new approach for synthesizing high-quality human motions conditioned on textual descriptions while preserving motion diversity.Despite the recent significant process in text-based human motion generation,existing methods often prioritize fitting training motions at the expense of action diversity. Consequently, striking a balance between motion quality and diversity rem… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 12 pages, 7 figures

  22. arXiv:2308.13561  [pdf, other

    cs.HC cs.CV

    Project Aria: A New Tool for Egocentric Multi-Modal AI Research

    Authors: Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira , et al. (49 additional authors not shown)

    Abstract: Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul… ▽ More

    Submitted 1 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  23. Four years of multi-modal odometry and map** on the rail vehicles

    Authors: Yusheng Wang, Weiwei Song, Yi Zhang, Fei Huang, Zhiyong Tu, Ruoying Li, Shimin Zhang, Yidong Lou

    Abstract: Precise, seamless, and efficient train localization as well as long-term railway environment monitoring is the essential property towards reliability, availability, maintainability, and safety (RAMS) engineering for railroad systems. Simultaneous localization and map** (SLAM) is right at the core of solving the two problems concurrently. In this end, we propose a high-performance and versatile m… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  24. arXiv:2308.11492  [pdf

    cs.RO

    A LiDAR-Inertial SLAM Tightly-Coupled with Dropout-Tolerant GNSS Fusion for Autonomous Mine Service Vehicles

    Authors: Yusheng Wang, Yidong Lou, Weiwei Song, Bing Zhan, Feihuang Xia, Qigeng Duan

    Abstract: Multi-modal sensor integration has become a crucial prerequisite for the real-world navigation systems. Recent studies have reported successful deployment of such system in many fields. However, it is still challenging for navigation tasks in mine scenes due to satellite signal dropouts, degraded perception, and observation degeneracy. To solve this problem, we propose a LiDAR-inertial odometry me… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  25. arXiv:2308.11422  [pdf, other

    cs.SE

    Recommending Analogical APIs via Knowledge Graph Embedding

    Authors: Mingwei Liu, Yanjun Yang, Yiling Lou, Xin Peng, Zhong Zhou, Xueying Du, Tianyong Yang

    Abstract: Library migration, which re-implements the same software behavior by using a different library instead of using the current one, has been widely observed in software evolution. One essential part of library migration is to find an analogical API that could provide the same functionality as current ones. However, given the large number of libraries/APIs, manually finding an analogical API could be… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted by FSE 2023

  26. arXiv:2308.09119  [pdf, other

    cs.CV

    ICAR: Image-based Complementary Auto Reasoning

    Authors: Xijun Wang, Anqi Liang, Junbang Liang, Ming Lin, Yu Lou, Shan Yang

    Abstract: Scene-aware Complementary Item Retrieval (CIR) is a challenging task which requires to generate a set of compatible items across domains. Due to the subjectivity, it is difficult to set up a rigorous standard for both data collection and learning objectives. To address this challenging task, we propose a visual compatibility concept, composed of similarity (resembling in color, geometry, texture,… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  27. arXiv:2308.01861  [pdf, other

    cs.CL cs.AI

    ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation

    Authors: Xueying Du, Mingwei Liu, Kaixin Wang, Hanlin Wang, Junwei Liu, Yixuan Chen, Jiayi Feng, Chaofeng Sha, Xin Peng, Yiling Lou

    Abstract: In this work, we make the first attempt to evaluate LLMs in a more challenging code generation scenario, i.e. class-level code generation. We first manually construct the first class-level code generation benchmark ClassEval of 100 class-level Python code generation tasks with approximately 500 person-hours. Based on it, we then perform the first study of 11 state-of-the-art LLMs on class-level co… ▽ More

    Submitted 14 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

  28. arXiv:2308.01240  [pdf, other

    cs.CL cs.AI

    Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation

    Authors: Zhiqiang Yuan, Junwei Liu, Qiancheng Zi, Mingwei Liu, Xin Peng, Yiling Lou

    Abstract: In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension and generation tasks. We have the following main findings. First, for the zero-shot setting, instructed LLMs are very competitive on code comprehension and generation tasks and sometimes even better than small SOTA models specifically fine-tuned on each downstream task. We also find that larger instr… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  29. Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving

    Authors: Yang Lou, Qun Song, Qian Xu, Rui Tan, Jian** Wang

    Abstract: Multi-modal fusion has shown initial promising results for object detection of autonomous driving perception. However, many existing fusion schemes do not consider the quality of each fusion input and may suffer from adverse conditions on one or more sensors. While predictive uncertainty has been applied to characterize single-modal object detection performance at run time, incorporating uncertain… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: In proceedings of the 26th European Conference on Artificial Intelligence ECAI 2023. 8 pages + 2 appendix pages

  30. Dynamic Object Tracking for Quadruped Manipulator with Spherical Image-Based Approach

    Authors: Tianlin Zhang, Sikai Guo, Xiaogang Xiong, Wanlei Li, Zezheng Qi, Yunjiang Lou

    Abstract: Exactly estimating and tracking the motion of surrounding dynamic objects is one of important tasks for the autonomy of a quadruped manipulator. However, with only an onboard RGB camera, it is still a challenging work for a quadruped manipulator to track the motion of a dynamic object moving with unknown and changing velocities. To address this problem, this manuscript proposes a novel image-based… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Journal ref: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 2023, pp. 727-734

  31. arXiv:2307.00439  [pdf, other

    eess.IV cs.CV math.NA

    Weighted Anisotropic-Isotropic Total Variation for Poisson Denoising

    Authors: Kevin Bui, Yifei Lou, Fredrick Park, Jack Xin

    Abstract: Poisson noise commonly occurs in images captured by photon-limited imaging systems such as in astronomy and medicine. As the distribution of Poisson noise depends on the pixel intensity value, noise levels vary from pixels to pixels. Hence, denoising a Poisson-corrupted image while preserving important details can be challenging. In this paper, we propose a Poisson denoising model by incorporating… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: accepted to ICIP 2023

  32. Computational Design of Passive Grippers

    Authors: Milin Kodnongbua, Ian Good Yu Lou, Jeffrey Lipton, Adriana Schulz

    Abstract: This work proposes a novel generative design tool for passive grippers -- robot end effectors that have no additional actuation and instead leverage the existing degrees of freedom in a robotic arm to perform gras** tasks. Passive grippers are used because they offer interesting trade-offs between cost and capabilities. However, existing designs are limited in the types of shapes that can be gra… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Journal ref: ACM Transactions on Graphics, Volume 41, Issue 4, July 2022, Article No.: 149, pp 2-12

  33. SPP-CNN: An Efficient Framework for Network Robustness Prediction

    Authors: Chengpei Wu, Yang Lou, Lin Wang, Junli Li, Xiang Li, Guanrong Chen

    Abstract: This paper addresses the robustness of a network to sustain its connectivity and controllability against malicious attacks. This kind of network robustness is typically measured by the time-consuming attack simulation, which returns a sequence of values that record the remaining connectivity and controllability after a sequence of node- or edge-removal attacks. For improvement, this paper develops… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: 10 pages, 7 figures, 14 pages Supplementary Information

    Journal ref: IEEE Transactions on Circuits and Systems I: Regular Papers. 2023, 70 (10), 4067-4079

  34. arXiv:2305.04207  [pdf, other

    cs.SE

    No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

    Authors: Zhiqiang Yuan, Yiling Lou, Mingwei Liu, Shiji Ding, Kaixin Wang, Yixuan Chen, Xin Peng

    Abstract: Unit testing is essential in detecting bugs in functionally-discrete program units. Manually writing high-quality unit tests is time-consuming and laborious. Although traditional techniques can generate tests with reasonable coverage, they exhibit low readability and cannot be directly adopted by developers. Recent work has shown the large potential of large language models (LLMs) in unit test gen… ▽ More

    Submitted 19 May, 2024; v1 submitted 7 May, 2023; originally announced May 2023.

  35. arXiv:2305.00366  [pdf, other

    cs.CL cs.IR cs.LG

    S2abEL: A Dataset for Entity Linking from Scientific Tables

    Authors: Yuze Lou, Bailey Kuehl, Erin Bransom, Sergey Feldman, Aakanksha Naik, Doug Downey

    Abstract: Entity linking (EL) is the task of linking a textual mention to its corresponding entry in a knowledge base, and is critical for many knowledge-intensive NLP applications. When applied to tables in scientific papers, EL is a step toward large-scale scientific knowledge bases that could enable advanced scientific question answering and analytics. We present the first dataset for EL in scientific ta… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  36. arXiv:2304.06548  [pdf, other

    eess.SY cs.LG eess.SP

    Multi-kernel Correntropy-based Orientation Estimation of IMUs: Gradient Descent Methods

    Authors: Shilei Li, Li**g Li, Dawei Shi, Yunjiang Lou, Ling Shi

    Abstract: This paper presents two computationally efficient algorithms for the orientation estimation of inertial measurement units (IMUs): the correntropy-based gradient descent (CGD) and the correntropy-based decoupled orientation estimation (CDOE). Traditional methods, such as gradient descent (GD) and decoupled orientation estimation (DOE), rely on the mean squared error (MSE) criterion, making them vul… ▽ More

    Submitted 11 October, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 16 pages

  37. arXiv:2304.03285  [pdf, other

    cs.CV

    $\text{DC}^2$: Dual-Camera Defocus Control by Learning to Refocus

    Authors: Hadi Alzayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

    Abstract: Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements. However, fixed aperture remains a key limitation, preventing users from controlling the depth of field (DoF) of captured images. At the same time, many smartphones now have multiple cameras with different fixed apertures -- specifica… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: CVPR 2023. See the project page at https://defocus-control.github.io

  38. Non-convex approaches for low-rank tensor completion under tubal sampling

    Authors: Zheng Tan, Longxiu Huang, HanQin Cai, Yifei Lou

    Abstract: Tensor completion is an important problem in modern data analysis. In this work, we investigate a specific sampling strategy, referred to as tubal sampling. We propose two novel non-convex tensor completion frameworks that are easy to implement, named tensor $L_1$-$L_2$ (TL12) and tensor completion via CUR (TCCUR). We test the efficiency of both methods on synthetic data and a color image inpainti… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5. IEEE, 2023

  39. arXiv:2303.03101  [pdf, other

    cs.CV

    CRIN: Rotation-Invariant Point Cloud Analysis and Rotation Estimation via Centrifugal Reference Frame

    Authors: Yu**g Lou, Zelin Ye, Yang You, Nianjuan Jiang, Jiangbo Lu, Weiming Wang, Lizhuang Ma, Cewu Lu

    Abstract: Various recent methods attempt to implement rotation-invariant 3D deep learning by replacing the input coordinates of points with relative distances and angles. Due to the incompleteness of these low-level features, they have to undertake the expense of losing global information. In this paper, we propose the CRIN, namely Centrifugal Rotation-Invariant Network. CRIN directly takes the coordinates… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: AAAI 2023

  40. Structural Robustness of Complex Networks: A Survey of A Posteriori Measures

    Authors: Yang Lou, Lin Wang, Guanrong Chen

    Abstract: Network robustness is critical for various industrial and social networks against malicious attacks, which has various meanings in different research contexts and here it refers to the ability of a network to sustain its functionality when a fraction of the network fail to work due to attacks. The rapid development of complex networks research indicates special interest and great concern about the… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Journal ref: IEEE Circuits and Systems Magazine, Volume 23, Issue 1, 2023

  41. arXiv:2302.01857  [pdf, other

    cs.SE cs.AI

    KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair

    Authors: Nan Jiang, Thibaud Lutellier, Yiling Lou, Lin Tan, Dan Goldwasser, Xiangyu Zhang

    Abstract: Automated Program Repair (APR) improves software reliability by generating patches for a buggy program automatically. Recent APR techniques leverage deep learning (DL) to build models to learn to generate patches from existing patches and code corpora. While promising, DL-based APR techniques suffer from the abundant syntactically or semantically incorrect patches in the patch space. These patches… ▽ More

    Submitted 16 April, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: This paper is accepted by 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

  42. arXiv:2301.03393  [pdf, other

    cs.CV

    Difference of Anisotropic and Isotropic TV for Segmentation under Blur and Poisson Noise

    Authors: Kevin Bui, Yifei Lou, Fredrick Park, Jack Xin

    Abstract: In this paper, we aim to segment an image degraded by blur and Poisson noise. We adopt a smoothing-and-thresholding (SaT) segmentation framework that finds a piecewise-smooth solution, followed by $k$-means clustering to segment the image. Specifically for the image smoothing step, we replace the least-squares fidelity for Gaussian noise in the Mumford-Shah model with a maximum posterior (MAP) ter… ▽ More

    Submitted 16 June, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: Accepted to Frontiers in Computer Science: https://www.frontiersin.org/articles/10.3389/fcomp.2023.1131317/abstract; Arxiv version has clearer images best for zooming in

  43. arXiv:2212.04145  [pdf, other

    cs.CV

    Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation

    Authors: Yulu Gan, Yan Bai, Yihang Lou, Xianzheng Ma, Renrui Zhang, Nian Shi, Lin Luo

    Abstract: Continual Test-Time Adaptation (CTTA) aims to adapt the source model to continually changing unlabeled target domains without access to the source data. Existing methods mainly focus on model-based adaptation in a self-training manner, such as predicting pseudo labels for new domain datasets. Since pseudo labels are noisy and unreliable, these methods suffer from catastrophic forgetting and error… ▽ More

    Submitted 11 February, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: AAAI2023 Outstanding Student Paper Award

  44. arXiv:2211.15386  [pdf, other

    cs.NE

    PC-SNN: Supervised Learning with Local Hebbian Synaptic Plasticity based on Predictive Coding in Spiking Neural Networks

    Authors: Mengting Lan, Xiaogang Xiong, Zixuan Jiang, Yunjiang Lou

    Abstract: Deemed as the third generation of neural networks, the event-driven Spiking Neural Networks(SNNs) combined with bio-plausible local learning rules make it promising to build low-power, neuromorphic hardware for SNNs. However, because of the non-linearity and discrete property of spiking neural networks, the training of SNN remains difficult and is still under discussion. Originating from gradient… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 15 pages, 11figs

    ACM Class: I.2.3; I.2.10

  45. Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling

    Authors: Zhihao Wang, Haoyao Chen, Shiwu Zhang, Yunjiang Lou

    Abstract: The visual simultaneous localization and map**(vSLAM) is widely used in GPS-denied and open field environments for ground and surface robots. However, due to the frequent perception failures derived from lacking visual texture or the {swing} of robot view direction on rough terrains, the accuracy and robustness of vSLAM are still to be enhanced. The study develops a novel view planning approach… ▽ More

    Submitted 22 May, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 11 pages, 14 figures, in IEEE/ASME Transactions on Mechatronics

  46. arXiv:2210.01448  [pdf, other

    cs.SD cs.AI cs.CV cs.GR eess.AS

    Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings

    Authors: Tenglong Ao, Qingzhe Gao, Yuke Lou, Baoquan Chen, Libin Liu

    Abstract: Automatic synthesis of realistic co-speech gestures is an increasingly important yet challenging task in artificial embodied agent creation. Previous systems mainly focus on generating gestures in an end-to-end manner, which leads to difficulties in mining the clear rhythm and semantics due to the complex yet subtle harmony between speech and gestures. We present a novel co-speech gesture synthesi… ▽ More

    Submitted 4 May, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: SIGGRAPH Asia 2022 (Journal Track); Project Page: https://pku-mocca.github.io/Rhythmic-Gesticulator-Page/

  47. arXiv:2209.02834  [pdf, other

    cs.CV

    Unsupervised Scene Sketch to Photo Synthesis

    Authors: Jiayun Wang, Sangryul Jeon, Stella X. Yu, Xi Zhang, Himanshu Arora, Yu Lou

    Abstract: Sketches make an intuitive and powerful visual expression as they are fast executed freehand drawings. We present a method for synthesizing realistic photos from scene sketches. Without the need for sketch and photo pairs, our framework directly learns from readily available large-scale photo datasets in an unsupervised manner. To this end, we introduce a standardization module that provides pseud… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Journal ref: ECCVW 2022

  48. CNN-based Prediction of Network Robustness With Missing Edges

    Authors: Chengpei Wu, Yang Lou, Ruizi Wu, Wenwen Liu, Junli Li

    Abstract: Connectivity and controllability of a complex network are two important issues that guarantee a networked system to function. Robustness of connectivity and controllability guarantees the system to function properly and stably under various malicious attacks. Evaluating network robustness using attack simulations is time consuming, while the convolutional neural network (CNN)-based prediction appr… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: In Proceedings of the IEEE 2022 International Joint Conference on Neural Networks (IJCNN)

  49. arXiv:2208.02068  [pdf, other

    cs.LG

    HybridGNN: Learning Hybrid Representation in Multiplex Heterogeneous Networks

    Authors: Tiankai Gu, Chaokun Wang, Cheng Wu, **gcao Xu, Yunkai Lou, Chang** Wang, Kai Xu, Can Ye, Yang Song

    Abstract: Recently, graph neural networks have shown the superiority of modeling the complex topological structures in heterogeneous network-based recommender systems. Due to the diverse interactions among nodes and abundant semantics emerging from diverse types of nodes and edges, there is a bursting research interest in learning expressive node representations in multiplex heterogeneous networks. One of t… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: ICDE 2022

  50. arXiv:2207.13137  [pdf, other

    cs.CV

    Bayesian Evidential Learning for Few-Shot Classification

    Authors: Xiongkun Linghu, Yan Bai, Yihang Lou, Shengsen Wu, **ze Li, Jianzhong He, Tao Bai

    Abstract: Few-Shot Classification(FSC) aims to generalize from base classes to novel classes given very limited labeled samples, which is an important step on the path toward human-like machine learning. State-of-the-art solutions involve learning to find a good metric and representation space to compute the distance between samples. Despite the promising accuracy performance, how to model uncertainty for m… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.10161 by other authors