Skip to main content

Showing 1–50 of 76 results for author: Chi, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19464  [pdf, other

    cs.RO cs.AI cs.CV cs.SD eess.AS

    ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data

    Authors: Zeyi Liu, Cheng Chi, Eric Cousineau, Naveen Kuppuswamy, Benjamin Burchfiel, Shuran Song

    Abstract: Audio signals provide rich information for the robot interaction and object properties through contact. These information can surprisingly ease the learning of contact-rich robot manipulation skills, especially when the visual information alone is ambiguous or incomplete. However, the usage of audio data in robot manipulation has been constrained to teleoperated demonstrations collected by either… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.12229  [pdf, other

    cs.AI cs.LG

    Spatially Resolved Gene Expression Prediction from Histology via Multi-view Graph Contrastive Learning with HSIC-bottleneck Regularization

    Authors: Changxi Chi, Hang Shi, Qi Zhu, Daoqiang Zhang, Wei Shao

    Abstract: The rapid development of spatial transcriptomics(ST) enables the measurement of gene expression at spatial resolution, making it possible to simultaneously profile the gene expression, spatial locations of spots, and the matched histopathological images. However, the cost for collecting ST data is much higher than acquiring histopathological images, and thus several studies attempt to predict the… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2404.10147  [pdf, other

    cs.CV

    Eyes on the Streets: Leveraging Street-Level Imaging to Model Urban Crime Dynamics

    Authors: Zhixuan Qi, Huaiying Luo, Chen Chi

    Abstract: This study addresses the challenge of urban safety in New York City by examining the relationship between the built environment and crime rates using machine learning and a comprehensive dataset of street view images. We aim to identify how urban landscapes correlate with crime statistics, focusing on the characteristics of street views and their association with crime rates. The findings offer in… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.00611  [pdf, ps, other

    cs.CV

    Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining

    Authors: **gyu Wang, Niantai **g, Ziyao Liu, Jie Nie, Yuxin Qi, Chi-Hung Chi, Kwok-Yan Lam

    Abstract: In copy-move tampering operations, perpetrators often employ techniques, such as blurring, to conceal tampering traces, posing significant challenges to the detection of object-level targets with intact structures. Focus on these challenges, this paper proposes an Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining (IMNet). To obtain complete object-level targets, we custo… ▽ More

    Submitted 3 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 4 pages, 2 figures, Accepted to WWW 2024

  5. arXiv:2403.16446  [pdf, other

    cs.CL

    Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm

    Authors: Lei Liu, Xiaoyan Yang, Fangzhou Li, Chenfei Chi, Yue Shen, Shiwei Lyu Ming Zhang, Xiaowei Ma, Xiangguo Lyu, Liya Ma, Zhiqiang Zhang, Wei Xue, Yiran Huang, **jie Gu

    Abstract: Large language models (LLMs) are gaining increasing interests to improve clinical efficiency for medical diagnosis, owing to their unprecedented performance in modelling natural language. Ensuring the safe and reliable clinical applications, the evaluation of LLMs indeed becomes critical for better mitigating the potential risks, e.g., hallucinations. However, current evaluation methods heavily re… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  6. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  7. arXiv:2403.09566  [pdf, other

    cs.RO

    PaperBot: Learning to Design Real-World Tools Using Paper

    Authors: Ruoshi Liu, Junbang Liang, Sruthi Sudhakar, Huy Ha, Cheng Chi, Shuran Song, Carl Vondrick

    Abstract: Paper is a cheap, recyclable, and clean material that is often used to make practical tools. Traditional tool design either relies on simulation or physical analysis, which is often inaccurate and time-consuming. In this paper, we propose PaperBot, an approach that directly learns to design and use a tool in the real world using paper without human intervention. We demonstrated the effectiveness a… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Project Website: https://paperbot.cs.columbia.edu/

  8. arXiv:2403.09096  [pdf, other

    eess.IV cs.CV

    Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction

    Authors: Yuan Fang, Yipeng Liu, Jie Chen, Zhen Long, Ao Li, Chong-Yung Chi, Ce Zhu

    Abstract: In recent years, the fusion of high spatial resolution multispectral image (HR-MSI) and low spatial resolution hyperspectral image (LR-HSI) has been recognized as an effective method for HSI super-resolution (HSI-SR). However, both HSI and MSI may be acquired under extreme conditions such as night or poorly illuminating scenarios, which may cause different exposure levels, thereby seriously downgr… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  9. arXiv:2403.02814  [pdf, other

    cs.LG cs.AI

    InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

    Authors: Ce Chi, Xing Wang, Kexin Yang, Zhiyan Song, Di **, Lin Zhu, Chao Deng, Junlan Feng

    Abstract: Transformer has become one of the most popular architectures for multivariate time series (MTS) forecasting. Recent Transformer-based MTS models generally prefer channel-independent structures with the observation that channel independence can alleviate noise and distribution drift issues, leading to more robustness. Nevertheless, it is essential to note that channel dependency remains an inherent… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  10. arXiv:2402.14840  [pdf, other

    cs.CL cs.AI stat.AP

    RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning

    Authors: Congyun **, Ming Zhang, Xiaowei Ma, Li Yujiao, Yingbo Wang, Yabo Jia, Yuliang Du, Tao Sun, Haowen Wang, Cong Fan, **jie Gu, Chenfei Chi, Xiangguo Lv, Fangzhou Li, Wei Xue, Yiran Huang

    Abstract: Recent advancements in Large Language Models (LLMs) and Large Multi-modal Models (LMMs) have shown potential in various medical applications, such as Intelligent Medical Diagnosis. Although impressive results have been achieved, we find that existing benchmarks do not reflect the complexity of real medical reports and specialized in-depth reasoning capabilities. In this work, we introduced RJUA-Me… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 15 pages, 13 figures

  11. arXiv:2402.10329  [pdf, other

    cs.RO

    Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

    Authors: Cheng Chi, Zhenjia Xu, Chuer Pan, Eric Cousineau, Benjamin Burchfiel, Siyuan Feng, Russ Tedrake, Shuran Song

    Abstract: We present Universal Manipulation Interface (UMI) -- a data collection and policy learning framework that allows direct skill transfer from in-the-wild human demonstrations to deployable robot policies. UMI employs hand-held grippers coupled with careful interface design to enable portable, low-cost, and information-rich data collection for challenging bimanual and dynamic manipulation demonstrati… ▽ More

    Submitted 5 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Project website: https://umi-gripper.github.io

  12. arXiv:2402.04064  [pdf, other

    cs.CV cs.AI

    Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

    Authors: Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

    Abstract: Road pavement detection and segmentation are critical for develo** autonomous road repair systems. However, develo** an instance segmentation method that simultaneously performs multi-class defect detection and segmentation is challenging due to the textural simplicity of road pavement image, the diversity of defect geometries, and the morphological ambiguity between classes. We propose a nove… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to the ICRA 2024

  13. arXiv:2401.01836  [pdf, other

    cs.AI

    Neural Control: Concurrent System Identification and Control Learning with Neural ODE

    Authors: Cheng Chi

    Abstract: Controlling continuous-time dynamical systems is generally a two step process: first, identify or model the system dynamics with differential equations, then, minimize the control objectives to achieve optimal control function and optimal state trajectories. However, any inaccuracy in dynamics modeling will lead to sub-optimality in the resulting control function. To address this, we propose a neu… ▽ More

    Submitted 22 April, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 9 pages, code open sourced in format of Google Colab notebooks; Resubmitted for adding missed references in the last submission

  14. arXiv:2312.09785  [pdf, other

    cs.CL

    RJUA-QA: A Comprehensive QA Dataset for Urology

    Authors: Shiwei Lyu, Chenfei Chi, Hongbo Cai, Lei Shi, Xiaoyan Yang, Lei Liu, Xiang Chen, Deng Zhao, Zhiqiang Zhang, Xianguo Lyu, Ming Zhang, Fangzhou Li, Xiaowei Ma, Yue Shen, **jie Gu, Wei Xue, Yiran Huang

    Abstract: We introduce RJUA-QA, a novel medical dataset for question answering (QA) and reasoning with clinical evidence, contributing to bridge the gap between general large language models (LLMs) and medical-specific LLM applications. RJUA-QA is derived from realistic clinical scenarios and aims to facilitate LLMs in generating reliable diagnostic and advice. The dataset contains 2,132 curated Question-Co… ▽ More

    Submitted 7 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: An initial version

  15. Privacy-preserving Federated Primal-dual Learning for Non-convex and Non-smooth Problems with Model Sparsification

    Authors: Yiwei Li, Chien-Wei Huang, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek

    Abstract: Federated learning (FL) has been recognized as a rapidly growing research area, where the model is trained over massively distributed clients under the orchestration of a parameter server (PS) without sharing clients' data. This paper delves into a class of federated problems characterized by non-convex and non-smooth loss functions, that are prevalent in FL applications but challenging to handle… ▽ More

    Submitted 3 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 33 pages, 8 figures, 1 table. Accepted by IEEE Internet of Things Journal

  16. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  17. arXiv:2309.13733  [pdf, other

    stat.ML cs.LG stat.CO

    Towards Tuning-Free Minimum-Volume Nonnegative Matrix Factorization

    Authors: Duc Toan Nguyen, Eric C. Chi

    Abstract: Nonnegative Matrix Factorization (NMF) is a versatile and powerful tool for discovering latent structures in data matrices, with many variations proposed in the literature. Recently, Leplat et al.\@ (2019) introduced a minimum-volume NMF for the identifiable recovery of rank-deficient matrices in the presence of noise. The performance of their formulation, however, requires the selection of a tuni… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  18. arXiv:2307.16259  [pdf, ps, other

    cs.IT cs.NI eess.SP

    Communication-Sensing Region for Cell-Free Massive MIMO ISAC Systems

    Authors: Weihao Mao, Yang Lu, Chong-Yung Chi, Bo Ai, Zhangdui Zhong, Zhiguo Ding

    Abstract: This paper investigates the system model and the transmit beamforming design for the Cell-Free massive multi-input multi-output (MIMO) integrated sensing and communication (ISAC) system. The impact of the uncertainty of the target locations on the propagation of wireless signals is considered during both uplink and downlink phases, and especially, the main statistics of the MIMO channel estimation… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  19. arXiv:2307.09955  [pdf, other

    cs.RO cs.AI cs.LG

    XSkill: Cross Embodiment Skill Discovery

    Authors: Mengda Xu, Zhenjia Xu, Cheng Chi, Manuela Veloso, Shuran Song

    Abstract: Human demonstration videos are a widely available data source for robot learning and an intuitive user interface for expressing desired behavior. However, directly extracting reusable robot manipulation skills from unstructured human videos is challenging due to the big embodiment difference and unobserved action parameters. To bridge this embodiment gap, this paper introduces XSkill, an imitation… ▽ More

    Submitted 28 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  20. GICI-LIB: A GNSS/INS/Camera Integrated Navigation Library

    Authors: Cheng Chi, Xin Zhang, Jiahui Liu, Yulong Sun, Zihao Zhang, Xingqun Zhan

    Abstract: Accurate navigation is essential for autonomous robots and vehicles. In recent years, the integration of the Global Navigation Satellite System (GNSS), Inertial Navigation System (INS), and camera has garnered considerable attention due to its robustness and high accuracy in diverse environments. However, leveraging the full capacity of GNSS is cumbersome because of the diverse choices of formulat… ▽ More

    Submitted 12 November, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Open-source: https://github.com/chichengcn/gici-open. Preprint version on Robotics and Automation Letters (RAL)

  21. arXiv:2306.00275  [pdf, other

    cs.DC

    A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms

    Authors: Changhao Wu, Yuanchun Li, Mengwei Xu, Chongbin Guo, Zengshan Yin, Weiwei Gao, Chuanxiu Chi

    Abstract: The number of satellites, especially those operating in low-earth orbit (LEO), is exploding in recent years. Additionally, the use of COTS hardware into those satellites enables a new paradigm of computing: orbital edge computing (OEC). OEC entails more technically advanced steps compared to single-satellite computing. This feature allows for vast design spaces with multiple parameters, rendering… ▽ More

    Submitted 1 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 18 pages, 9 figures and 5 tables

    MSC Class: 68M14 ACM Class: C.2.4

  22. arXiv:2304.13940  [pdf, other

    stat.ML cs.LG

    A Majorization-Minimization Gauss-Newton Method for 1-Bit Matrix Completion

    Authors: Xiaoqian Liu, Xu Han, Eric C. Chi, Boaz Nadler

    Abstract: In 1-bit matrix completion, the aim is to estimate an underlying low-rank matrix from a partial set of binary observations. We propose a novel method for 1-bit matrix completion called MMGN. Our method is based on the majorization-minimization (MM) principle, which converts the original optimization problem into a sequence of standard low-rank matrix completion problems. We solve each of these sub… ▽ More

    Submitted 22 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 28 pages, 7 figures

  23. arXiv:2304.03292  [pdf, other

    cs.LG

    SE-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

    Authors: Borui Cai, Guangyan Huang, Shuiqiao Yang, Yong Xiang, Chi-Hung Chi

    Abstract: Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Re… ▽ More

    Submitted 14 November, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  24. arXiv:2303.09858  [pdf, other

    eess.IV cs.CR cs.CV cs.MM

    Preventing Unauthorized AI Over-Analysis by Medical Image Adversarial Watermarking

    Authors: Xingxing Wei, Bangzheng Pu, Shiji Zhao, Chen Chi, Huazhu Fu

    Abstract: The advancement of deep learning has facilitated the integration of Artificial Intelligence (AI) into clinical practices, particularly in computer-aided diagnosis. Given the pivotal role of medical images in various diagnostic procedures, it becomes imperative to ensure the responsible and secure utilization of AI techniques. However, the unauthorized utilization of AI for image analysis raises si… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  25. arXiv:2303.04137  [pdf, other

    cs.RO

    Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

    Authors: Cheng Chi, Zhenjia Xu, Siyuan Feng, Eric Cousineau, Yilun Du, Benjamin Burchfiel, Russ Tedrake, Shuran Song

    Abstract: This paper introduces Diffusion Policy, a new way of generating robot behavior by representing a robot's visuomotor policy as a conditional denoising diffusion process. We benchmark Diffusion Policy across 12 different tasks from 4 different robot manipulation benchmarks and find that it consistently outperforms existing state-of-the-art robot learning methods with an average improvement of 46.9%.… ▽ More

    Submitted 14 March, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: An extended journal version of the original RSS2023 paper

  26. arXiv:2303.02454  [pdf, other

    cs.CV

    Exploiting Implicit Rigidity Constraints via Weight-Sharing Aggregation for Scene Flow Estimation from Point Clouds

    Authors: Yun Wang, Cheng Chi, Xin Yang

    Abstract: Scene flow estimation, which predicts the 3D motion of scene points from point clouds, is a core task in autonomous driving and many other 3D vision applications. Existing methods either suffer from structure distortion due to ignorance of rigid motion consistency or require explicit pose estimation and 3D object segmentation. Errors of estimated poses and segmented objects would yield inaccurate… ▽ More

    Submitted 1 April, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  27. arXiv:2302.11553  [pdf, other

    cs.RO

    RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects

    Authors: Zhenjia Xu, Zhou Xian, Xingyu Lin, Cheng Chi, Zhiao Huang, Chuang Gan, Shuran Song

    Abstract: We introduce RoboNinja, a learning-based cutting system for multi-material objects (i.e., soft objects with rigid cores such as avocados or mangos). In contrast to prior works using open-loop cutting actions to cut through single-material objects (e.g., slicing a cucumber), RoboNinja aims to remove the soft part of an object while preserving the rigid core, thereby maximizing the yield. To achieve… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  28. Robust Extrinsic Self-Calibration of Camera and Solid State LiDAR

    Authors: Jiahui Liu, Xingqun Zhan, Cheng Chi, Xin Zhang, Chuanrun Zhai

    Abstract: This letter proposes an extrinsic calibration approach for a pair of monocular camera and prism-spinning solid-state LiDAR. The unique characteristics of the point cloud measured resulting from the flower-like scanning pattern is first disclosed as the vacant points, a type of outlier between foreground target and background objects. Unlike existing method using only depth continuous measurements,… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Journal ref: Journal of Intelligent & Robotic Systems. 109 (2023) 81

  29. Differentially Private Federated Clustering over Non-IID Data

    Authors: Yiwei Li, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek

    Abstract: In this paper, we investigate federated clustering (FedC) problem, that aims to accurately partition unlabeled data samples distributed over massive clients into finite clusters under the orchestration of a parameter server, meanwhile considering data privacy. Though it is an NP-hard optimization problem involving real variables denoting cluster centroids and binary variables denoting the cluster… ▽ More

    Submitted 30 October, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

    Comments: 34 pages, 4 figures, 1 table

  30. arXiv:2210.09347  [pdf, other

    cs.RO

    Cloth Funnels: Canonicalized-Alignment for Multi-Purpose Garment Manipulation

    Authors: Alper Canberk, Cheng Chi, Huy Ha, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran Song

    Abstract: Automating garment manipulation is challenging due to extremely high variability in object configurations. To reduce this intrinsic variation, we introduce the task of "canonicalized-alignment" that simplifies downstream applications by reducing the possible garment configurations. This task can be considered as "cloth state funnel" that manipulates arbitrarily configured clothing items into a pre… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 8 pages, 8 figures, website at https://clothfunnels.cs.columbia.edu/

    ACM Class: I.2.9

  31. arXiv:2207.02891  [pdf, other

    cs.LG cs.AI

    Don't overfit the history -- Recursive time series data augmentation

    Authors: Amine Mohamed Aboussalah, Min-Jae Kwon, Raj G Patel, Cheng Chi, Chi-Guhn Lee

    Abstract: Time series observations can be seen as realizations of an underlying dynamical system governed by rules that we typically do not know. For time series learning tasks, we need to understand that we fit our model on available data, which is a unique realized history. Training on a single realization often induces severe overfitting lacking generalization. To address this issue, we introduce a gener… ▽ More

    Submitted 28 January, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted to ICLR 2023 Resubmitted here due to major change in proofs following conference submission

  32. arXiv:2207.01678  [pdf, other

    stat.ML cs.LG math.ST

    FACT: High-Dimensional Random Forests Inference

    Authors: Chien-Ming Chi, Yingying Fan, **chi Lv

    Abstract: Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis t… ▽ More

    Submitted 12 November, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 42 pages, 3 figures

  33. arXiv:2206.02743  [pdf, other

    cs.IR

    A Neural Corpus Indexer for Document Retrieval

    Authors: Yu**g Wang, Yingyan Hou, Haonan Wang, Ziming Miao, Shibin Wu, Hao Sun, Qi Chen, Yuqing Xia, Chengmin Chi, Guoshuai Zhao, Zheng Liu, Xing Xie, Hao Allen Sun, Weiwei Deng, Qi Zhang, Mao Yang

    Abstract: Current state-of-the-art document retrieval solutions mainly follow an index-retrieve paradigm, where the index is hard to be directly optimized for the final retrieval target. In this paper, we aim to show that an end-to-end deep neural network unifying training and indexing stages can significantly improve the recall performance of traditional methods. To this end, we propose Neural Corpus Index… ▽ More

    Submitted 12 February, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 19 pages, 6 figures, accepted by NeurIPS 2022

  34. arXiv:2206.02568  [pdf, other

    math.OC cs.AI cs.DM cs.LG

    A Deep Reinforcement Learning Framework For Column Generation

    Authors: Cheng Chi, Amine Mohamed Aboussalah, Elias B. Khalil, Juyoung Wang, Zoha Sherkat-Masoumi

    Abstract: Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Wind… ▽ More

    Submitted 12 January, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), 2022

  35. arXiv:2204.12284  [pdf, other

    cs.LG cs.CR

    Federated Stochastic Primal-dual Learning with Differential Privacy

    Authors: Yiwei Li, Shuai Wang, Tsung-Hui Chang, Chong-Yung Chi

    Abstract: Federated learning (FL) is a new paradigm that enables many clients to jointly train a machine learning (ML) model under the orchestration of a parameter server while kee** the local data not being exposed to any third party. However, the training of FL is an interactive process between local clients and the parameter server. Such process would cause privacy leakage since adversaries may retriev… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 18 pages, 6 figures

  36. arXiv:2203.12837  [pdf

    cs.CR cs.DC

    Secure Multi-Party Delegated Authorisation For Access and Sharing of Electronic Health Records

    Authors: Kheng-Leong Tan, Chi-Hung Chi, Kwok-Yan Lam

    Abstract: Timely sharing of electronic health records (EHR) across providers is essential and significance in facilitating medical researches and prompt patients' care. With sharing, it is crucial that patients can control who can access their data and when, and guarantee the security and privacy of their data. In current literature, various system models, cryptographic techniques and access control mechani… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  37. arXiv:2203.01197  [pdf, other

    cs.RO

    DextAIRity: Deformable Manipulation Can be a Breeze

    Authors: Zhenjia Xu, Cheng Chi, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran Song

    Abstract: This paper introduces DextAIRity, an approach to manipulate deformable objects using active airflow. In contrast to conventional contact-based quasi-static manipulations, DextAIRity allows the system to apply dense forces on out-of-contact surfaces, expands the system's reach range, and provides safe high-speed interactions. These properties are particularly advantageous when manipulating under-ac… ▽ More

    Submitted 24 April, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: RSS 2022. Project page: https://dextairity.cs.columbia.edu/

  38. arXiv:2203.00663  [pdf, other

    cs.RO

    Iterative Residual Policy: for Goal-Conditioned Dynamic Manipulation of Deformable Objects

    Authors: Cheng Chi, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran Song

    Abstract: This paper tackles the task of goal-conditioned dynamic manipulation of deformable objects. This task is highly challenging due to its complex dynamics (introduced by object deformation and high-speed action) and strict task requirements (defined by a precise goal specification). To address these challenges, we present Iterative Residual Policy (IRP), a general learning framework applicable to rep… ▽ More

    Submitted 21 April, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

  39. arXiv:2202.10069  [pdf

    cs.CR cs.DC cs.SE

    Analysis of Digital Sovereignty and Identity: From Digitization to Digitalization

    Authors: Kheng-Leong Tan, Chi-Hung Chi, Kwok-Yan Lam

    Abstract: Advances in emerging technologies have accelerated digital transformation with the pervasive digitalization of the economy and society, driving innovations such as smart cities, industry 4.0 and FinTech. Unlike digitization, digitalization is a transformation to improve processes by leveraging digital technologies and digitized data. The cyberspace has evolved from a hardware internetworking infra… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  40. arXiv:2104.05177  [pdf, other

    cs.CV cs.LG cs.RO

    GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

    Authors: Cheng Chi, Shuran Song

    Abstract: This paper tackles the task of category-level pose estimation for garments. With a near infinite degree of freedom, a garment's full configuration (i.e., poses) is often described by the per-vertex 3D locations of its entire 3D surface. However, garments are also commonly subject to extreme cases of self-occlusion, especially when folded or crumpled, making it challenging to perceive their full 3D… ▽ More

    Submitted 13 August, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    ACM Class: I.2.10; I.2.9

  41. arXiv:2102.12726  [pdf, other

    cs.RO cs.CY eess.SY

    Design and Control of a Highly Redundant Rigid-Flexible Coupling Robot to Assist the COVID-19 Oropharyngeal-Swab Sampling

    Authors: Yingbai Hu, Jian Li, Yongquan Chen, Qiwen Wang, Chuliang Chi, Heng Zhang, Qing Gao, Yuanmin Lan, Zheng Li, Zonggao Mu, Zhenglong Sun, Alois Knoll

    Abstract: The outbreak of novel coronavirus pneumonia (COVID-19) has caused mortality and morbidity worldwide. Oropharyngeal-swab (OP-swab) sampling is widely used for the diagnosis of COVID-19 in the world. To avoid the clinical staff from being affected by the virus, we developed a 9-degree-of-freedom (DOF) rigid-flexible coupling (RFC) robot to assist the COVID-19 OP-swab sampling. This robot is composed… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 8 pages, 11 figures

  42. Occlusion-robust Deformable Object Tracking without Physics Simulation

    Authors: Cheng Chi, Dmitry Berenson

    Abstract: Estimating the state of a deformable object is crucial for robotic manipulation, yet accurate tracking is challenging when the object is partially-occluded. To address this problem, we propose an occlusion-robust RGBD sequence tracking framework based on Coherent Point Drift (CPD). To mitigate the effects of occlusion, our method 1) Uses a combination of locally linear embedding and constrained op… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

  43. arXiv:2010.15831  [pdf, other

    cs.CV

    RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder

    Authors: Cheng Chi, Fangyun Wei, Han Hu

    Abstract: Existing object detection frameworks are usually built on a single format of object/part representation, i.e., anchor/proposal rectangle boxes in RetinaNet and Faster R-CNN, center points in FCOS and RepPoints, and corner points in CornerNet. While these different representations usually drive the frameworks to perform well in different aspects, e.g., better classification or finer localization, i… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: NeurIPS2020 Spotlight

  44. arXiv:2007.06542  [pdf, other

    cs.CV

    Loss Function Search for Face Recognition

    Authors: Xiaobo Wang, Shuo Wang, Cheng Chi, Shifeng Zhang, Tao Mei

    Abstract: In face recognition, designing margin-based (e.g., angular, additive, additive angular margins) softmax loss functions plays an important role in learning discriminative features. However, these hand-crafted heuristic methods are sub-optimal because they require much effort to explore the large design space. Recently, an AutoML for loss function search method AM-LFS has been derived, which leverag… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted by ICML2020. arXiv admin note: substantial text overlap with arXiv:1912.00833; text overlap with arXiv:1905.07375 by other authors

  45. arXiv:2007.00041  [pdf, other

    eess.SP cs.LG stat.ML

    Multi-way Graph Signal Processing on Tensors: Integrative analysis of irregular geometries

    Authors: Jay S. Stanley III, Eric C. Chi, Gal Mishne

    Abstract: Graph signal processing (GSP) is an important methodology for studying data residing on irregular structures. As acquired data is increasingly taking the form of multi-way tensors, new signal processing tools are needed to maximally utilize the multi-way structure within the data. In this paper, we review modern signal processing frameworks generalizing GSP to multi-way data, starting from graph s… ▽ More

    Submitted 27 July, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

    Comments: In review for IEEE Signal Processing Magazine

  46. arXiv:2002.01862  [pdf

    cs.HC cs.AI cs.CL

    If I Hear You Correctly: Building and Evaluating Interview Chatbots with Active Listening Skills

    Authors: Ziang Xiao, Michelle X. Zhou, Wenxi Chen, Huahai Yang, Changyan Chi

    Abstract: Interview chatbots engage users in a text-based conversation to draw out their views and opinions. It is, however, challenging to build effective interview chatbots that can handle user free-text responses to open-ended questions and deliver engaging user experience. As the first step, we are investigating the feasibility and effectiveness of using publicly available, practical AI technologies to… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: Working draft. To appear in the ACM CHI Conference on Human Factors in Computing Systems (CHI 2020)

  47. arXiv:1912.02424  [pdf, other

    cs.CV

    Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

    Authors: Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li

    Abstract: Object detection has been dominated by anchor-based detectors for several years. Recently, anchor-free detectors have become popular due to the proposal of FPN and Focal Loss. In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them… ▽ More

    Submitted 20 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Accepted by CVPR 2020 as Oral; Best Paper Nomination

  48. arXiv:1909.10674  [pdf, other

    cs.CV

    Relational Learning for Joint Head and Human Detection

    Authors: Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

    Abstract: Head and human detection have been rapidly improved with the development of deep convolutional neural networks. However, these two tasks are often studied separately without considering their inherent correlation, leading to that 1) head detection is often trapped in more false positives, and 2) the performance of human detector frequently drops dramatically in crowd scenes. To handle these two is… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

  49. arXiv:1909.06826  [pdf, other

    cs.CV

    PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

    Authors: Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

    Abstract: Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians. In this paper, we propose an effective and efficient detection network to hunt pedestrians in crowd scenes. The proposed method, namely PedHunter, introduces strong occlusion handling ability to existing region-based detection networks without bringing extra computatio… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

  50. arXiv:1909.04376  [pdf, other

    cs.CV

    RefineFace: Refinement Neural Network for High Performance Face Detection

    Authors: Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li

    Abstract: Face detection has achieved significant progress in recent years. However, high performance face detection still remains a very challenging problem, especially when there exists many tiny faces. In this paper, we present a single-shot refinement face detector namely RefineFace to achieve high performance. Specifically, it consists of five modules: Selective Two-step Regression (STR), Selective Two… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Journal extension of our previous conference paper: arXiv:1809.02693. arXiv admin note: text overlap with arXiv:1901.02350 by other authors