Showing 1–50 of 82 results for author: Luo, K

Searching in archive cs.
  1. arXiv:2406.11666  [pdf, other]

    math.ST cs.LG stat.ML

    ROTI-GCV: Generalized Cross-Validation for right-ROTationally Invariant Data

    Authors: Kevin Luo, Yufan Li, Pragya Sur

    Abstract: Two key tasks in high-dimensional regularized regression are tuning the regularization strength for good predictions and estimating the out-of-sample risk. It is known that the standard approach -- $k$-fold cross-validation -- is inconsistent in modern high-dimensional settings. While leave-one-out and generalized cross-validation remain consistent in some high-dimensional cases, they become incon…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures

  2. arXiv:2406.11238  [pdf, other]

    cs.CL

    What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling

    Authors: Yutong Hu, Quzhe Huang, Kangcheng Luo, Yansong Feng

    Abstract: As the context length that large language models can handle continues to increase, these models demonstrate an enhanced ability to utilize distant information for tasks such as language modeling. This capability contrasts with human reading and writing habits, where it is uncommon to remember and use particularly distant information, except in cases of foreshadowing. In this paper, we aim to explo…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.17777  [pdf, other]

    cs.IR

    RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

    Authors: Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao

    Abstract: Known for efficient computation and easy storage, hashing has been extensively explored in cross-modal retrieval. The majority of current hashing models are predicated on the premise of a direct one-to-one mapping between data points. However, in real practice, data correspondence across modalities may be partially provided. In this research, we introduce an innovative unsupervised hashing techniq…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted by the 20th International Conference on Intelligent Computing (ICIC 2024)

  4. arXiv:2405.16034  [pdf, other]

    cs.CV

    DiffuBox: Refining 3D Object Detection with Point Diffusion

    Authors: Xiangyu Chen, Zhenzhen Liu, Katie Z Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-…

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.01258  [pdf, other]

    cs.CV cs.RO eess.IV

    Towards Consistent Object Detection via LiDAR-Camera Synergy

    Authors: Kai Luo, Hao Wu, Kefu Yi, Kailun Yang, Wei Hao, Rongdong Hu

    Abstract: As human-machine interaction continues to evolve, the capacity for environmental perception is becoming increasingly crucial. Integrating the two most common types of sensory data, images, and point clouds, can enhance detection accuracy. However, currently, no model exists that can simultaneously detect an object's position in both point clouds and images and ascertain their corresponding relatio…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: The source code will be made publicly available at https://github.com/xifen523/COD

  6. arXiv:2404.14073  [pdf, other]

    cs.LG cs.AI

    Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning

    Authors: Kang Luo, Yuanshao Zhu, Wei Chen, Kun Wang, Zhengyang Zhou, Sijie Ruan, Yuxuan Liang

    Abstract: Trajectory modeling refers to characterizing human movement behavior, serving as a pivotal step in understanding mobility patterns. Nevertheless, existing studies typically ignore the confounding effects of geospatial context, leading to the acquisition of spurious correlations and limited generalization capabilities. To bridge this gap, we initially formulate a Structural Causal Model (SCM) to de…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The paper has been accepted by IJCAI 2024

  7. RMAFF-PSN: A Residual Multi-Scale Attention Feature Fusion Photometric Stereo Network

    Authors: Kai Luo, Yakun Ju, Lin Qi, Kaixuan Wang, Junyu Dong

    Abstract: Predicting accurate normal maps of objects from two-dimensional images in regions of complex structure and spatial material variations is challenging using photometric stereo methods due to the influence of surface reflection properties caused by variations in object geometry and surface materials. To address this issue, we propose a photometric stereo network called a RMAFF-PSN that uses residual…

    Submitted 14 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 17 pages, 12 figures

    Journal ref: Photonics 2023, 10(5), 548

  8. arXiv:2404.05139  [pdf, other]

    cs.CV cs.RO

    Better Monocular 3D Detectors with LiDAR from the Past

    Authors: Yurong You, Cheng Perng Phoo, Carlos Andres Diaz-Ruiz, Katie Z Luo, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q Weinberger

    Abstract: Accurate 3D object detection is crucial to autonomous driving. Though LiDAR-based detectors have achieved impressive performance, the high cost of LiDAR sensors precludes their widespread adoption in affordable vehicles. Camera-based detectors are cheaper alternatives but often suffer inferior performance compared to their LiDAR-based counterparts due to inherent depth ambiguities in images. In th…

    Submitted 9 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by ICRA 2024. The code can be found at https://github.com/YurongYou/AsyncDepth

  9. arXiv:2404.02788  [pdf, other]

    cs.CV

    GenN2N: Generative NeRF2NeRF Translation

    Authors: Xiangyue Liu, Han Xue, Kunming Luo, Ping Tan, Li Yi

    Abstract: We present GenN2N, a unified NeRF-to-NeRF translation framework for various NeRF translation tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc. Unlike previous methods designed for individual translation tasks with task-specific schemes, GenN2N achieves all these NeRF editing tasks by employing a plug-and-play image-to-image translator to perform editing in th…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project page: https://xiangyueliu.github.io/GenN2N/

  10. arXiv:2404.00732  [pdf, other]

    cs.GT cs.CY

    An Abundance of Katherines: The Game Theory of Baby Naming

    Authors: Katy Blumer, Kate Donahue, Katie Fritz, Kate Ivanovich, Katherine Lee, Katie Luo, Cathy Meng, Katie Van Koevering

    Abstract: In this paper, we study the highly competitive arena of baby naming. Through making several Extremely Reasonable Assumptions (namely, that parents are myopic, perfectly knowledgeable agents who pick a name based solely on its uniqueness), we create a model which is not only tractable and clean, but also perfectly captures the real world. We then extend our investigation with numerical experiments,…

    Submitted 1 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted at SIGBOVIK 2024

  11. arXiv:2403.17158  [pdf, other]

    cs.CL

    Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels

    Authors: Kexin Luo, Yue Mao, Bei Zhang, Sophie Hao

    Abstract: Inspired by the concept of the male gaze (Mulvey, 1975) in literature and media studies, this paper proposes a framework for analyzing gender bias in terms of female objectification: the extent to which a text portrays female individuals as objects of visual pleasure. Our framework measures female objectification along two axes. First, we compute an agency bias score that indicates whether male en…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: To appear in LREC-COLING 2024

  12. arXiv:2403.14151  [pdf, other]

    cs.LG cs.AI cs.CY cs.DB

    Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond

    Authors: Wei Chen, Yuxuan Liang, Yuanshao Zhu, Yanchuan Chang, Kang Luo, Haomin Wen, Lei Li, Yanwei Yu, Qingsong Wen, Chao Chen, Kai Zheng, Yunjun Gao, Xiaofang Zhou, Yu Zheng

    Abstract: Trajectory computing is a pivotal domain encompassing trajectory data management and mining, garnering widespread attention due to its crucial role in various practical applications such as location services, urban traffic, and public safety. Traditional methods, focusing on simplistic spatio-temporal features, face challenges of complex calculations, limited scalability, and inadequate adaptabili…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 25 pages, 12 figures, 5 tables

  13. arXiv:2402.16907  [pdf, other]

    eess.IV cs.CV cs.LG

    Diffusion Posterior Proximal Sampling for Image Restoration

    Authors: Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Ji-Zhe Zhou, Hu Chen, Jiancheng Lv

    Abstract: Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements inherited from the unconditional generation paradigm. These strategies initiate the denoising process with pure white noise and incorporate random noise at each…

    Submitted 24 February, 2024; originally announced February 2024.

  14. arXiv:2402.14704  [pdf, other]

    cs.CL

    An LLM-Enhanced Adversarial Editing System for Lexical Simplification

    Authors: Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan, Jinlong Shu

    Abstract: Lexical Simplification (LS) aims to simplify text at the lexical level. Existing methods rely heavily on annotated data, making it challenging to apply in low-resource scenarios. In this paper, we propose a novel LS method without parallel corpora. This method employs an Adversarial Editing System with guidance from a confusion loss and an invariance loss to predict lexical edits in the original s…

    Submitted 22 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by COLING 2024 main conference

  15. arXiv:2402.11573  [pdf, other]

    cs.CL

    BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

    Authors: Kun Luo, Zheng Liu, Shitao Xiao, Kang Liu

    Abstract: Large language models (LLMs) call for extension of context to handle many critical applications. However, the existing approaches are prone to expensive costs and inferior quality of context extension. In this work, we propose Extensible Embedding, which realizes high-quality extension of LLM's context with strong flexibility and cost-effectiveness. Extensible embedding stands as an enhancement of t…

    Submitted 18 February, 2024; originally announced February 2024.

  16. arXiv:2402.07095  [pdf, other]

    cs.RO cs.HC

    Does ChatGPT and Whisper Make Humanoid Robots More Relatable?

    Authors: Xiaohui Chen, Katherine Luo, Trevor Gee, Mahla Nejati

    Abstract: Humanoid robots are designed to be relatable to humans for applications such as customer support and helpdesk services. However, many such systems, including Softbank's Pepper, fall short because they fail to communicate effectively with humans. The advent of Large Language Models (LLMs) shows the potential to solve the communication barrier for humanoid robotics. This paper outlines the compariso…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: Published in Australasian Conference on Robotics and Automation (ACRA 2023)

  17. arXiv:2402.04671  [pdf, other]

    cs.CV

    V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication

    Authors: Yuanfang Zhang, Junxuan Li, Kaiqing Luo, Yiying Yang, Jiayi Han, Nian Liu, Denghui Qin, Peng Han, Chengpei Xu

    Abstract: Semantic scene completion (SSC) has recently gained popularity because it can provide both semantic and geometric information that can be used directly for autonomous vehicle navigation. However, there are still challenges to overcome. SSC is often hampered by occlusion and short-range perception due to sensor limitations, which can pose safety risks. This paper proposes a fundamental solution to…

    Submitted 7 February, 2024; originally announced February 2024.

  18. arXiv:2402.03216  [pdf, other]

    cs.CL cs.AI cs.LG

    BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

    Authors: Jianlv Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu

    Abstract: In this paper, we present a new embedding model, called M3-Embedding, which is distinguished for its versatility in Multi-Linguality, Multi-Functionality, and Multi-Granularity. It can support more than 100 working languages, leading to new state-of-the-art performances on multi-lingual and cross-lingual retrieval tasks. It can simultaneously perform the three common retrieval functionalities of e…

    Submitted 28 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  19. arXiv:2402.01089  [pdf, other]

    stat.ML cs.LG

    No Free Prune: Information-Theoretic Barriers to Pruning at Initialization

    Authors: Tanishq Kumar, Kevin Luo, Mark Sellke

    Abstract: The existence of "lottery tickets" arXiv:1803.03635 at or near initialization raises the tantalizing question of whether large models are necessary in deep learning, or whether sparse networks can be quickly identified and trained without ever training the dense models that contain them. However, efforts to find these sparse subnetworks without training the dense model ("pruning at initialization"…

    Submitted 1 February, 2024; originally announced February 2024.

  20. arXiv:2401.16433  [pdf, other]

    cs.IR cs.LG

    Within-basket Recommendation via Neural Pattern Associator

    Authors: Kai Luo, Tianshu Shen, Lan Yao, Ga Wu, Aaron Liblong, Istvan Fehervari, Ruijian An, Jawad Ahmed, Harshit Mishra, Charu Pujari

    Abstract: Within-basket recommendation (WBR) refers to the task of recommending items to the end of completing a non-empty shopping basket during a shopping session. While the latest innovations in this space demonstrate remarkable performance improvement on benchmark datasets, they often overlook the complexity of user behaviors in practice, such as 1) co-existence of multiple shopping intentions, 2) multi…

    Submitted 14 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures

  21. arXiv:2401.02957  [pdf, other]

    cs.CV

    Denoising Vision Transformers

    Authors: Jiawei Yang, Katie Z Luo, Jiefeng Li, Kilian Q Weinberger, Yonglong Tian, Yue Wang

    Abstract: We delve into a nuanced but significant challenge inherent to Vision Transformers (ViTs): feature maps of these models exhibit grid-like artifacts, which detrimentally hurt the performance of ViTs in downstream tasks. Our investigations trace this fundamental issue down to the positional embeddings at the input stage. To address this, we propose a novel noise model, which is universally applicable…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Project website: https://jiawei-yang.github.io/DenoisingViT/

  22. arXiv:2312.08952  [pdf, other]

    cs.CV

    UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation

    Authors: Kefu Yi, Kai Luo, Xiaolei Luo, Jiangui Huang, Hao Wu, Rongdong Hu, Wei Hao

    Abstract: Multi-object tracking (MOT) in video sequences remains a challenging task, especially in scenarios with significant camera movements. This is because targets can drift considerably on the image plane, leading to erroneous tracking outcomes. Addressing such challenges typically requires supplementary appearance cues or Camera Motion Compensation (CMC). While these strategies are effective, they als…

    Submitted 11 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  23. arXiv:2311.04079  [pdf, other]

    cs.CV

    Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

    Authors: Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

    Abstract: Autonomous driving has traditionally relied heavily on costly and labor-intensive High Definition (HD) maps, hindering scalability. In contrast, Standard Definition (SD) maps are more affordable and have worldwide coverage, offering a scalable alternative. In this work, we systematically explore the effect of SD maps for real-time lane-topology understanding. We propose a novel framework to integr…

    Submitted 7 November, 2023; originally announced November 2023.

  24. arXiv:2310.19080  [pdf, other]

    cs.CV

    Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery

    Authors: Katie Z Luo, Zhenzhen Liu, Xiangyu Chen, Yurong You, Sagie Benaim, Cheng Perng Phoo, Mark Campbell, Wen Sun, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Recent advances in machine learning have shown that Reinforcement Learning from Human Feedback (RLHF) can improve machine learning models and align them with human preferences. Although very successful for Large Language Models (LLMs), these advancements have not had a comparable impact in research for autonomous vehicles -- where alignment with human expectations can be imperative. In this paper,…

    Submitted 5 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  25. arXiv:2310.14592  [pdf, other]

    cs.CV cs.LG

    Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

    Authors: Tai-Yu Pan, Chenyang Ma, Tianle Chen, Cheng Perng Phoo, Katie Z Luo, Yurong You, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

    Abstract: Accurate 3D object detection and understanding for self-driving cars heavily relies on LiDAR point clouds, necessitating large amounts of labeled data to train. In this work, we introduce an innovative pre-training approach, Grounded Point Colorization (GPC), to bridge the gap between data and labels by teaching the model to colorize LiDAR point clouds, equipping it with valuable semantic cues. To…

    Submitted 25 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  26. arXiv:2310.03602  [pdf, other]

    cs.CV

    Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints

    Authors: Chuan Fang, Yuan Dong, Kunming Luo, Xiaotao Hu, Rakesh Shrestha, Ping Tan

    Abstract: Text-driven 3D indoor scene generation is useful for gaming, the film industry, and AR/VR applications. However, existing methods cannot faithfully capture the room layout, nor do they allow flexible editing of individual objects in the room. To address these problems, we present Ctrl-Room, which can generate convincing 3D rooms with designer-style layouts and high-fidelity textures from just a te…

    Submitted 1 July, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  27. arXiv:2309.13546  [pdf, other]

    cs.CV cs.LG

    DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning

    Authors: Kangyang Luo, Shuai Wang, Yexuan Fu, Xiang Li, Yunshi Lan, Ming Gao

    Abstract: Federated Learning (FL) is a privacy-constrained decentralized machine learning paradigm in which clients enable collaborative training without compromising private data. However, how to learn a robust global model in the data-heterogeneous and model-heterogeneous FL scenarios is challenging. To address it, we resort to data-free knowledge distillation to propose a new FL method (namely DFRD). DFR…

    Submitted 7 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  28. arXiv:2309.12140  [pdf, other]

    cs.CV cs.AI cs.LG

    Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features

    Authors: Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: The rapid development of 3D object detection systems for self-driving cars has significantly improved accuracy. However, these systems struggle to generalize across diverse driving environments, which can lead to safety-critical failures in detecting traffic participants. To address this, we propose a method that utilizes unlabeled repeated traversals of multiple locations to adapt object detector…

    Submitted 21 September, 2023; originally announced September 2023.

  29. arXiv:2309.08839  [pdf, other]

    cs.SD cs.MM eess.AS

    Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval

    Authors: Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao

    Abstract: Cross-modal retrieval (CMR) has been extensively applied in various domains, such as multimedia search engines and recommendation systems. Most existing CMR methods focus on image-to-text retrieval, whereas audio-to-text retrieval, a less explored domain, has posed a great challenge due to the difficulty to uncover discriminative features from audio clips and texts. Existing studies are restricted…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted by The 35th IEEE International Conference on Tools with Artificial Intelligence. (ICTAI 2023)

  30. arXiv:2308.16517  [pdf, other]

    cs.DC cs.NI cs.RO

    BeeFlow: Behavior Tree-based Serverless Workflow Modeling and Scheduling for Resource-Constrained Edge Clusters

    Authors: Ke Luo, Tao Ouyang, Zhi Zhou, Xu Chen

    Abstract: Serverless computing has gained popularity in edge computing due to its flexible features, including the pay-per-use pricing model, auto-scaling capabilities, and multi-tenancy support. Complex Serverless-based applications typically rely on Serverless workflows (also known as Serverless function orchestration) to express task execution logic, and numerous application- and system-level optimizatio…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by Journal of Systems Architecture

  31. arXiv:2308.13133  [pdf, other]

    cs.CV

    AccFlow: Backward Accumulation for Long-Range Optical Flow

    Authors: Guangyang Wu, Xiaohong Liu, Kunming Luo, Xi Liu, Qingqing Zheng, Shuaicheng Liu, Xinyang Jiang, Guangtao Zhai, Wenyi Wang

    Abstract: Recent deep learning-based optical flow estimators have exhibited impressive performance in generating local flows between consecutive frames. However, the estimation of long-range flows between distant frames, particularly under complex object deformation and large motion occlusion, remains a challenging task. One promising solution is to accumulate local flows explicitly or implicitly to obtain…

    Submitted 24 August, 2023; originally announced August 2023.

  32. arXiv:2308.10088  [pdf, other]

    cs.CL cs.SE

    PACE: Improving Prompt with Actor-Critic Editing for Large Language Model

    Authors: Yihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li

    Abstract: Large language models (LLMs) have showcased remarkable potential across various tasks by conditioning on prompts. However, the quality of different human-written prompts leads to substantial discrepancies in LLMs' performance, and improving prompts usually necessitates considerable human effort and expertise. To this end, this paper proposes Prompt with Actor-Critic Editing (PACE) for LLMs to enab…

    Submitted 16 May, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted to ACL

  33. arXiv:2307.12070  [pdf, other]

    cs.CV

    Fast and Stable Diffusion Inverse Solver with History Gradient Update

    Authors: Linchao He, Hongyu Yan, Mengting Luo, Hongjie Wu, Kunming Luo, Wang Wang, Wenchao Du, Hu Chen, Hongyu Yang, Yi Zhang, Jiancheng Lv

    Abstract: Diffusion models have recently been recognised as efficient inverse problem solvers due to their ability to produce high-quality reconstruction results without relying on pairwise data training. Existing diffusion-based solvers utilize a Gradient Descent strategy to get an optimal sample solution. However, these solvers only calculate the current gradient and have not utilized any history information…

    Submitted 11 March, 2024; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: 17 pages, 7 figures. Provision of theoretical proofs to demonstrate the convergence of the methods

  34. arXiv:2307.08299  [pdf, other]

    cs.DC

    Decentralized Local Updates with Dual-Slow Estimation and Momentum-based Variance-Reduction for Non-Convex Optimization

    Authors: Kangyang Luo, Kunkun Zhang, Shengbo Zhang, Xiang Li, Ming Gao

    Abstract: Decentralized learning (DL) has recently employed local updates to reduce the communication cost for general non-convex optimization problems. Specifically, local updates require each node to perform multiple update steps on the parameters of the local model before communicating with others. However, most existing methods could be highly sensitive to data heterogeneity (i.e., non-iid data distribu…

    Submitted 17 July, 2023; originally announced July 2023.

  35. arXiv:2307.01684  [pdf, other]

    cs.DC cs.AI cs.LG cs.NI

    Serving Graph Neural Networks With Distributed Fog Servers For Smart IoT Services

    Authors: Liekang Zeng, Xu Chen, Peng Huang, Ke Luo, Xiaoxi Zhang, Zhi Zhou

    Abstract: Graph Neural Networks (GNNs) have gained growing interest in miscellaneous applications owing to their outstanding ability in extracting latent representation on graph structures. To render GNN-based service for IoT-driven smart applications, traditional model serving paradigms usually resort to the cloud by fully uploading geo-distributed input data to remote datacenters. However, our empirical m…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE/ACM Transactions on Networking

  36. arXiv:2304.06018  [pdf, other]

    cs.CV

    Adaptive Human Matting for Dynamic Videos

    Authors: Chung-Ching Lin, Jiang Wang, Kun Luo, Kevin Lin, Linjie Li, Lijuan Wang, Zicheng Liu

    Abstract: The most recent efforts in video matting have focused on eliminating trimap dependency since trimap annotations are expensive and trimap-based methods are less adaptable for real-time applications. Despite the latest trimap-free methods showing promising results, their performance often degrades when dealing with highly diverse and unstructured videos. We address this limitation by introducing Ad…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  37. arXiv:2303.15286  [pdf, other]

    cs.CV cs.LG

    Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

    Authors: Yurong You, Cheng Perng Phoo, Katie Z Luo, Travis Zhang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts. One potential solution is to leverage unlabeled data (e.g., unlabeled LiDAR point clouds) collected from the end-users' environments (i.e. target domain) to adapt the system to the difference between training and testing environments. Whi…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by NeurIPS 2022. Code is available at https://github.com/YurongYou/Rote-DA

  38. arXiv:2303.11011  [pdf, other]

    cs.CV

    Learning Optical Flow from Event Camera with Rendered Dataset

    Authors: Xinglong Luo, Kunming Luo, Ao Luo, Zhengning Wang, Ping Tan, Shuaicheng Liu

    Abstract: We study the problem of estimating optical flow from event cameras. One important issue is how to build a high-quality event-flow dataset with accurate event values and flow labels. Previous datasets are created by either capturing real scenes by event cameras or synthesizing from images with pasted foreground objects. The former case can produce real event values but with calculated flow labels,…

    Submitted 20 March, 2023; originally announced March 2023.

  39. arXiv:2302.14307  [pdf, other]

    cs.CV cs.LG

    GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic Forgetting

    Authors: Kangyang Luo, Xiang Li, Yunshi Lan, Ming Gao

    Abstract: Federated Learning (FL) has emerged as a de facto machine learning area and received rapid increasing research interests from the community. However, catastrophic forgetting caused by data heterogeneity and partial participation poses distinctive challenges for FL, which are detrimental to the performance. To tackle the problems, we propose a new FL approach (namely GradMA), which takes inspiratio…

    Submitted 15 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  40. arXiv:2302.10570  [pdf, other]

    cs.CL cs.AI cs.SC

    Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge

    Authors: Fan Chen, Yan Huang, Xinfang Zhang, Kang Luo, Jinxuan Zhu, Ruixian He

    Abstract: Semantic consistency recognition aims to detect and judge whether the semantics of two text sentences are consistent with each other. However, the existing methods usually encounter the challenges of synonyms, polysemy and difficulty to understand long text. To solve the above problems, this paper proposes a co-driven semantic consistency recognition method based on the fusion of Transformer and H…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 17 pages, 5 figures

  41. arXiv:2301.10018  [pdf, other]

    cs.CV

    GyroFlow+: Gyroscope-Guided Unsupervised Deep Homography and Optical Flow Learning

    Authors: Haipeng Li, Kunming Luo, Bing Zeng, Shuaicheng Liu

    Abstract: Existing homography and optical flow methods are erroneous in challenging scenes, such as fog, rain, night, and snow because the basic assumptions such as brightness and gradient constancy are broken. To address this issue, we present an unsupervised learning approach that fuses gyroscope into homography and optical flow learning. Specifically, we first convert gyroscope readings into motion field…

    Submitted 29 May, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 12 pages. arXiv admin note: substantial text overlap with arXiv:2103.13725

  42. arXiv:2301.08406  [pdf, other]

    cs.NI cs.AI cs.DC

    Real-Time High-Resolution Pedestrian Detection in Crowded Scenes via Parallel Edge Offloading

    Authors: Hao Wang, Hao Bao, Liekang Zeng, Ke Luo, Xu Chen

    Abstract: To identify dense and small-size pedestrians in surveillance systems, high-resolution cameras are widely deployed, where high-resolution images are captured and delivered to off-the-shelf pedestrian detection models. However, given the highly computation-intensive workload brought by the high resolution, the resource-constrained cameras fail to afford accurate inference in real time. To address th…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE ICC 2023

  43. arXiv:2211.02176  [pdf, other]

    cs.DS

    Connected k-Center and k-Diameter Clustering

    Authors: Lukas Drexler, Jan Eube, Kelin Luo, Dorian Reineccius, Heiko Röglin, Melanie Schmidt, Julian Wargalla

    Abstract: Motivated by an application from geodesy, we introduce a novel clustering problem which is a $k$-center (or $k$-diameter) problem with a side constraint. For the side constraint, we are given an undirected connectivity graph $G$ on the input points, and a clustering is now only feasible if every cluster induces a connected subgraph in $G$. We call the resulting problems the connected $k$-center prob…

    Submitted 18 October, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  44. A Hierarchical Grouping Algorithm for the Multi-Vehicle Dial-a-Ride Problem

    Authors: Kelin Luo, Alexandre M. Florio, Syamantak Das, Xiangyu Guo

    Abstract: Ride-sharing is an essential aspect of modern urban mobility. In this paper, we consider a classical problem in ride-sharing - the Multi-Vehicle Dial-a-Ride Problem (Multi-Vehicle DaRP). Given a fleet of vehicles with a fixed capacity stationed at various locations and a set of ride requests specified by origins and destinations, the goal is to serve all requests such that no vehicle is assigned m…

    Submitted 10 October, 2022; originally announced October 2022.

  45. arXiv:2209.12314  [pdf, other]

    cs.DS

    Package Delivery Using Drones with Restricted Movement Areas

    Authors: Thomas Erlebach, Kelin Luo, Frits C. R. Spieksma

    Abstract: For the problem of delivering a package from a source node to a destination node in a graph using a set of drones, we study the setting where the movements of each drone are restricted to a certain subgraph of the given graph. We consider the objectives of minimizing the delivery time (problem DDT) and of minimizing the total energy consumption (problem DDC). For general graphs, we show a strong i…

    Submitted 25 September, 2022; originally announced September 2022.

  46. arXiv:2208.01166  [pdf, other]

    cs.CV

    Ithaca365: Dataset and Driving Perception under Repeated and Challenging Weather Conditions

    Authors: Carlos A. Diaz-Ruiz, Youya Xia, Yurong You, Jose Nino, Junan Chen, Josephine Monica, Xiangyu Chen, Katie Luo, Yan Wang, Marc Emond, Wei-Lun Chao, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

    Abstract: Advances in perception for self-driving cars have accelerated in recent years due to the availability of large-scale datasets, typically collected at specific locations and under nice weather conditions. Yet, to achieve the high safety requirement, these perceptual systems must operate robustly under a wide variety of weather conditions including snow and rain. In this paper, we present a new data…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted by CVPR 2022

  47. Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval

    Authors: Kaiyi Luo, Chao Zhang, Huaxiong Li, Xiuyi Jia, Chunlin Chen

    Abstract: In recent years, Cross-Modal Hashing (CMH) has aroused much attention due to its fast query speed and efficient storage. Previous literatures have achieved promising results for Cross-Modal Retrieval (CMR) by discovering discriminative hash codes and modality-specific hash functions. Nonetheless, most existing CMR works are subjected to some restrictions: 1) It is assumed that data of different mo…

    Submitted 24 July, 2022; originally announced July 2022.

  48. arXiv:2207.11075  [pdf, other]

    cs.CV

    RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos

    Authors: Yunhui Han, Kunming Luo, Ao Luo, Jiangyu Liu, Haoqiang Fan, Guiming Luo, Shuaicheng Liu

    Abstract: Obtaining the ground truth labels from a video is challenging since the manual annotation of pixel-wise flow labels is prohibitively expensive and laborious. Besides, existing approaches try to adapt the trained model on synthetic datasets to authentic videos, which inevitably suffers from domain discrepancy and hinders the performance for real-world applications. To solve these problems, we propo…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 Oral

  49. arXiv:2203.15882  [pdf, other]

    cs.CV

    Learning to Detect Mobile Objects from LiDAR Scans Without Labels

    Authors: Yurong You, Katie Z Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Current 3D object detectors for autonomous driving are almost entirely trained on human-annotated data. Although of high quality, the generation of such data is laborious and costly, restricting them to a few specific locations and object types. This paper proposes an alternative approach entirely based on unlabeled data, which can be collected cheaply and in abundance almost everywhere on earth.…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022. Code is available at https://github.com/YurongYou/MODEST

  50. arXiv:2203.11405  [pdf, other]

    cs.CV

    Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception

    Authors: Yurong You, Katie Z Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Self-driving cars must detect vehicles, pedestrians, and other traffic participants accurately to operate safely. Small, far-away, or highly occluded objects are particularly challenging because there is limited information in the LiDAR point clouds for detecting them. To address this challenge, we leverage valuable information from the past: in particular, data collected in past traversals of the…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ICLR 2022. Code is available at https://github.com/YurongYou/Hindsight