Skip to main content

Showing 1–50 of 138 results for author: Qian, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18008  [pdf, other

    cs.IT

    Rate-Distortion-Perception Tradeoff for Gaussian Vector Sources

    Authors: **g**g Qian, Sadaf Salehkalaibar, Jun Chen, Ashish Khisti, Wei Yu, Wuxian Shi, Yiqun Ge, Wen Tong

    Abstract: This paper studies the rate-distortion-perception (RDP) tradeoff for a Gaussian vector source coding problem where the goal is to compress the multi-component source subject to distortion and perception constraints. The purpose of imposing a perception constraint is to ensure visually pleasing reconstructions. This paper studies this RDP setting with either the Kullback-Leibler (KL) divergence or… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.13664  [pdf, other

    cs.AI

    Root-KGD: A Novel Framework for Root Cause Diagnosis Based on Knowledge Graph and Industrial Data

    Authors: Jiyu Chen, **chuan Qian, Xinmin Zhang, Zhihuan Song

    Abstract: With the development of intelligent manufacturing and the increasing complexity of industrial production, root cause diagnosis has gradually become an important research direction in the field of industrial fault diagnosis. However, existing research methods struggle to effectively combine domain knowledge and industrial data, failing to provide accurate, online, and reliable root cause diagnosis… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.08754  [pdf, other

    cs.CL cs.CR

    StructuralSleight: Automated Jailbreak Attacks on Large Language Models Utilizing Uncommon Text-Encoded Structure

    Authors: Bangxin Li, Hengrui Xing, Chao Huang, ** Qian, Huangqing Xiao, Linfeng Feng, Cong Tian

    Abstract: Large Language Models (LLMs) are widely used in natural language processing but face the risk of jailbreak attacks that maliciously induce them to generate harmful content. Existing jailbreak attacks, including character-level and context-level attacks, mainly focus on the prompt of the plain text without specifically exploring the significant influence of its structure. In this paper, we focus on… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  4. arXiv:2405.20612  [pdf, other

    cs.CL cs.AI

    UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation

    Authors: Hanzhang Zhou, Zijian Feng, Zixiao Zhu, Junlang Qian, Kezhi Mao

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in various tasks using the in-context learning (ICL) paradigm. However, their effectiveness is often compromised by inherent bias, leading to prompt brittleness, i.e., sensitivity to design settings such as example selection, order, and prompt formatting. Previous studies have addressed LLM bias through external adjustment of m… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.17796  [pdf, ps, other

    cs.LG stat.ML

    Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff

    Authors: Jian Qian, Haichen Hu, David Simchi-Levi

    Abstract: Motivated by the recent discovery of a statistical and computational reduction from contextual bandits to offline regression (Simchi-Levi and Xu, 2021), we address the general (stochastic) Contextual Markov Decision Process (CMDP) problem with horizon H (as known as CMDP with H layers). In this paper, we introduce a reduction from CMDPs to offline density estimation under the realizability assumpt… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.16105  [pdf, other

    cs.CV cs.AI

    MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space

    Authors: Jiangwei Weng, Zhiqiang Yan, Ying Tai, Jianjun Qian, Jian Yang, Jun Li

    Abstract: Recent advances in low light image enhancement have been dominated by Retinex-based learning framework, leveraging convolutional neural networks (CNNs) and Transformers. However, the vanilla Retinex theory primarily addresses global illumination degradation and neglects local issues such as noise and blur in dark conditions. Moreover, CNNs and Transformers struggle to capture global degradation du… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  7. arXiv:2405.15916  [pdf, other

    cs.CV cs.RO

    Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies

    Authors: Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman

    Abstract: Generic re-usable pre-trained image representation encoders have become a standard component of methods for many computer vision tasks. As visual representations for robots however, their utility has been limited, leading to a recent wave of efforts to pre-train robotics-specific image encoders that are better suited to robotic tasks than their generic counterparts. We propose Scene Objects From T… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to International Conference on Robotics and Automation(ICRA) 2024

  8. arXiv:2405.15370  [pdf, other

    cs.CL

    Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection

    Authors: Jun Liu, Chaoyun Zhang, Jiaxu Qian, Minghua Ma, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Time series anomaly detection (TSAD) plays a crucial role in various industries by identifying atypical patterns that deviate from standard trends, thereby maintaining system integrity and enabling prompt response measures. Traditional TSAD models, which often rely on deep learning, require extensive training data and operate as black boxes, lacking interpretability for detected anomalies. To addr… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  9. arXiv:2405.11891  [pdf, ps, other

    cs.CL cs.AI

    Unveiling and Manipulating Prompt Influence in Large Language Models

    Authors: Zijian Feng, Hanzhang Zhou, Zixiao Zhu, Junlang Qian, Kezhi Mao

    Abstract: Prompts play a crucial role in guiding the responses of Large Language Models (LLMs). However, the intricate role of individual tokens in prompts, known as input saliency, in sha** the responses remains largely underexplored. Existing saliency methods either misalign with LLM generation objectives or rely heavily on linearity assumptions, leading to potential inaccuracies. To address this, we pr… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICLR 2024

  10. arXiv:2405.09996  [pdf, other

    cs.CV

    Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

    Authors: Junkai Fan, Jiangwei Weng, Kun Wang, Yijun Yang, Jianjun Qian, Jun Li, Jian Yang

    Abstract: Real driving-video dehazing poses a significant challenge due to the inherent difficulty in acquiring precisely aligned hazy/clear video pairs for effective model training, especially in dynamic driving scenarios with unpredictable weather conditions. In this paper, we propose a pioneering approach that addresses this challenge through a nonaligned regularization strategy. Our core concept involve… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  11. arXiv:2405.03516  [pdf, other

    cs.LG

    GI-SMN: Gradient Inversion Attack against Federated Learning without Prior Knowledge

    Authors: ** Qian, Kaimin Wei, Yongdong Wu, Jilian Zhang, Jipeng Chen, Huan Bao

    Abstract: Federated learning (FL) has emerged as a privacy-preserving machine learning approach where multiple parties share gradient information rather than original user data. Recent work has demonstrated that gradient inversion attacks can exploit the gradients of FL to recreate the original user data, posing significant privacy risks. However, these attacks make strong assumptions about the attacker, su… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 18 pages, 10 figures, conference

  12. arXiv:2404.14546  [pdf, other

    cs.RO

    Closing the Perception-Action Loop for Semantically Safe Navigation in Semi-Static Environments

    Authors: **gxing Qian, Siqi Zhou, Nicholas Jianrui Ren, Veronica Chatrath, Angela P. Schoellig

    Abstract: Autonomous robots navigating in changing environments demand adaptive navigation strategies for safe long-term operation. While many modern control paradigms offer theoretical guarantees, they often assume known extrinsic safety constraints, overlooking challenges when deployed in real-world environments where objects can appear, disappear, and shift over time. In this paper, we present a closed-l… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Manuscript accepted to ICRA 2024

  13. arXiv:2404.13860  [pdf, other

    cs.LG cs.CR

    Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning

    Authors: Huan Bao, Kaimin Wei, Yongdong Wu, ** Qian, Robert H. Deng

    Abstract: A Model Inversion (MI) attack based on Generative Adversarial Networks (GAN) aims to recover the private training data from complex deep learning models by searching codes in the latent space. However, they merely search a deterministic latent space such that the found latent code is usually suboptimal. In addition, the existing distributional MI schemes assume that an attacker can access the stru… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  14. arXiv:2404.13474  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models

    Authors: Junyao Shi, Jianing Qian, Yecheng Jason Ma, Dinesh Jayaraman

    Abstract: There have recently been large advances both in pre-training visual representations for robotic control and segmenting unknown category objects in general images. To leverage these for improved robot learning, we propose $\textbf{POCR}$, a new framework for building pre-trained object-centric representations for robotic control. Building on theories of "what-where" representations in psychology an… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: ICRA 2024. Project website: https://sites.google.com/view/pocr

  15. arXiv:2404.10122  [pdf, other

    stat.ML cs.LG math.ST

    Online Estimation via Offline Estimation: An Information-Theoretic Framework

    Authors: Dylan J. Foster, Yanjun Han, Jian Qian, Alexander Rakhlin

    Abstract: $… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  16. arXiv:2403.19306  [pdf, other

    cs.CV

    Sparse Generation: Making Pseudo Labels Sparse for weakly supervision with points

    Authors: Tian Ma, Chuyang Shang, Wanzhu Ren, Yuancheng Li, Jiiayi Yang, Jiali Qian

    Abstract: In recent years, research on point weakly supervised object detection (PWSOD) methods in the field of computer vision has attracted people's attention. However, existing pseudo labels generation methods perform poorly in a small amount of supervised annotation data and dense object detection tasks. We consider the generation of weakly supervised pseudo labels as the result of model's sparse output… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  17. arXiv:2403.07608  [pdf, other

    cs.DB cs.AI cs.LG

    Couler: Unified Machine Learning Workflow Optimization in Cloud

    Authors: Xiaoda Wang, Yuan Tang, Tengda Guo, Bo Sang, **gji Wu, Jian Sha, Ke Zhang, Jiang Qian, Mingjie Tang

    Abstract: Machine Learning (ML) has become ubiquitous, fueling data-driven applications across various organizations. Contrary to the traditional perception of ML in research, ML workflows can be complex, resource-intensive, and time-consuming. Expanding an ML workflow to encompass a wider range of data infrastructure and data types may lead to larger workloads and increased deployment costs. Currently, num… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  18. arXiv:2403.01169  [pdf, other

    cs.CV

    Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

    Authors: Chenchen Tao, Chong Wang, Yuexian Zou, Xiaohao Peng, Jiafei Wu, Jiangbo Qian

    Abstract: Most models for weakly supervised video anomaly detection (WS-VAD) rely on multiple instance learning, aiming to distinguish normal and abnormal snippets without specifying the type of anomaly. The ambiguous nature of anomaly definitions across contexts introduces bias in detecting abnormal and normal snippets within the abnormal bag. Taking the first step to show the model why it is anomalous, a… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  19. arXiv:2403.00381  [pdf, other

    cs.RO cs.LG eess.SY

    Structured Deep Neural Networks-Based Backstep** Trajectory Tracking Control for Lagrangian Systems

    Authors: Jiajun Qian, Liang Xu, Xiaoqiang Ren, Xiaofan Wang

    Abstract: Deep neural networks (DNN) are increasingly being used to learn controllers due to their excellent approximation capabilities. However, their black-box nature poses significant challenges to closed-loop stability guarantees and performance analysis. In this paper, we introduce a structured DNN-based controller for the trajectory tracking control of Lagrangian systems using backing techniques. By p… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  20. ARTiST: Automated Text Simplification for Task Guidance in Augmented Reality

    Authors: Guande Wu, **g Qian, Sonia Castelo, Shaoyu Chen, Joao Rulff, Claudio Silva

    Abstract: Text presented in augmented reality provides in-situ, real-time information for users. However, this content can be challenging to apprehend quickly when engaging in cognitively demanding AR tasks, especially when it is presented on a head-mounted display. We propose ARTiST, an automatic text simplification system that uses a few-shot prompt and GPT-3 models to specifically optimize the text lengt… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Conditionally accepted by CHI '24

    ACM Class: H.1.2; I.2.7

  21. arXiv:2312.11845  [pdf, ps, other

    cs.CR

    A Summary of Privacy-Preserving Data Publishing in the Local Setting

    Authors: Wenjun Lin, Jiahao Qian, Wenwen Liu, Lang Wu

    Abstract: The exponential growth of collected, processed, and shared data has given rise to concerns about individuals' privacy. Consequently, various laws and regulations have been established to oversee how organizations handle and safeguard data. One such method is Statistical Disclosure Control, which aims to minimize the risk of exposing confidential information by de-identifying it. This de-identifica… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  22. arXiv:2311.06555  [pdf, other

    cs.CL cs.AI

    Heuristic-Driven Link-of-Analogy Prompting: Enhancing Large Language Models for Document-Level Event Argument Extraction

    Authors: Hanzhang Zhou, Junlang Qian, Zijian Feng, Hui Lu, Zixiao Zhu, Kezhi Mao

    Abstract: In this study, we investigate in-context learning (ICL) in document-level event argument extraction (EAE) to alleviate the dependency on large-scale labeled data for this task. We introduce the Heuristic-Driven Link-of-Analogy (HD-LoA) prompting to address the challenge of example selection and to develop a prompting strategy tailored for EAE. Specifically, we hypothesize and validate that LLMs le… ▽ More

    Submitted 19 February, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

  23. arXiv:2311.05316  [pdf, other

    cs.LG cs.AI

    ABIGX: A Unified Framework for eXplainable Fault Detection and Classification

    Authors: Yue Zhuo, **chuan Qian, Zhihuan Song, Zhiqiang Ge

    Abstract: For explainable fault detection and classification (FDC), this paper proposes a unified framework, ABIGX (Adversarial fault reconstruction-Based Integrated Gradient eXplanation). ABIGX is derived from the essentials of previous successful fault diagnosis methods, contribution plots (CP) and reconstruction-based contribution (RBC). It is the first explanation framework that provides variable contri… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  24. arXiv:2310.17817  [pdf, other

    stat.ML cs.AI cs.LG math.PR

    Bayesian imaging inverse problem with SA-Roundtrip prior via HMC-pCN sampler

    Authors: Jiayu Qian, Yuanyuan Liu, **gya Yang, Qing** Zhou

    Abstract: Bayesian inference with deep generative prior has received considerable interest for solving imaging inverse problems in many scientific and engineering fields. The selection of the prior distribution is learned from, and therefore an important representation learning of, available prior measurements. The SA-Roundtrip, a novel deep generative prior, is introduced to enable controlled sampling gene… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  25. arXiv:2309.09118  [pdf, other

    cs.CV cs.AI cs.RO

    Uncertainty-aware 3D Object-Level Map** with Deep Shape Priors

    Authors: Ziwei Liao, Jun Yang, **gxing Qian, Angela P. Schoellig, Steven L. Waslander

    Abstract: 3D object-level map** is a fundamental problem in robotics, which is especially challenging when object CAD models are unavailable during inference. In this work, we propose a framework that can reconstruct high-quality object-level maps for unknown objects. Our approach takes multiple RGB-D images as input and outputs dense 3D shapes and 9-DoF poses (including 3 scale parameters) for detected o… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: Manuscript submitted to ICRA 2024

  26. arXiv:2308.13707  [pdf

    cs.SE

    Human-in-the-loop online just-in-time software defect prediction

    Authors: Xutong Liu, Yufei Zhou, Yutian Tang, Junyan Qian, Yuming Zhou

    Abstract: Online Just-In-Time Software Defect Prediction (O-JIT-SDP) uses an online model to predict whether a new software change will introduce a bug or not. However, existing studies neglect the interaction of Software Quality Assurance (SQA) staff with the model, which may miss the opportunity to improve the prediction accuracy through the feedback from SQA staff. To tackle this problem, we propose Huma… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 16 pages, 10 figures

  27. arXiv:2308.13072  [pdf

    eess.IV cs.CV

    Full-dose Whole-body PET Synthesis from Low-dose PET Using High-efficiency Denoising Diffusion Probabilistic Model: PET Consistency Model

    Authors: Shaoyan Pan, Elham Abouei, Junbo Peng, Joshua Qian, Jacob F Wynne, Tonghe Wang, Chih-Wei Chang, Justin Roper, Jonathon A Nye, Hui Mao, Xiaofeng Yang

    Abstract: Objective: Positron Emission Tomography (PET) has been a commonly used imaging modality in broad clinical applications. One of the most important tradeoffs in PET imaging is between image quality and radiation dose: high image quality comes with high radiation exposure. Improving image quality is desirable for all clinical applications while minimizing radiation exposure is needed to reduce risk t… ▽ More

    Submitted 16 April, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  28. arXiv:2308.08792  [pdf, other

    eess.SY cs.LG cs.MA

    Federated Reinforcement Learning for Electric Vehicles Charging Control on Distribution Networks

    Authors: Junkai Qian, Yuning Jiang, Xin Liu, Qing Wang, Ting Wang, Yuanming Shi, Wei Chen

    Abstract: With the growing popularity of electric vehicles (EVs), maintaining power grid stability has become a significant challenge. To address this issue, EV charging control strategies have been developed to manage the switch between vehicle-to-grid (V2G) and grid-to-vehicle (G2V) modes for EVs. In this context, multi-agent deep reinforcement learning (MADRL) has proven its effectiveness in EV charging… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  29. arXiv:2308.08072  [pdf, other

    cs.IR cs.AI cs.CR cs.LG

    Decentralized Graph Neural Network for Privacy-Preserving Recommendation

    Authors: Xiaolin Zheng, Zhongyu Wang, Chaochao Chen, Jiashu Qian, Yao Yang

    Abstract: Building a graph neural network (GNN)-based recommender system without violating user privacy proves challenging. Existing methods can be divided into federated GNNs and decentralized GNNs. But both methods have undesirable effects, i.e., low communication efficiency and privacy leakage. This paper proposes DGREC, a novel decentralized GNN for privacy-preserving recommendations, where users can ch… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  30. arXiv:2308.08071  [pdf, ps, other

    cs.LG cs.AI

    Freshness or Accuracy, Why Not Both? Addressing Delayed Feedback via Dynamic Graph Neural Networks

    Authors: Xiaolin Zheng, Zhongyu Wang, Chaochao Chen, Feng Zhu, Jiashu Qian

    Abstract: The delayed feedback problem is one of the most pressing challenges in predicting the conversion rate since users' conversions are always delayed in online commercial systems. Although new data are beneficial for continuous training, without complete feedback information, i.e., conversion labels, training algorithms may suffer from overwhelming fake negatives. Existing methods tend to use multitas… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  31. Large Language Models to Identify Social Determinants of Health in Electronic Health Records

    Authors: Marco Guevara, Shan Chen, Spencer Thomas, Tafadzwa L. Chaunzwa, Idalid Franco, Benjamin Kann, Shalini Moningi, Jack Qian, Madeleine Goldstein, Susan Harper, Hugo JWL Aerts, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman

    Abstract: Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely collected from the electronic health records (EHR). This study researched the ability of large language models to extract SDoH from free text in EHRs, where they are most commonly documented, and explored the role of synthetic clinical text for improving the extraction of these scarcely documente… ▽ More

    Submitted 5 March, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Peer-reviewed version published at NPJ Digital Medicine: https://www.nature.com/articles/s41746-023-00970-0

    Journal ref: NPJ Digit Med. 2024 Jan 11;7(1):6

  32. arXiv:2308.06246  [pdf, other

    cs.HC

    ARGUS: Visualization of AI-Assisted Task Guidance in AR

    Authors: Sonia Castelo, Joao Rulff, Erin McGowan, Bea Steers, Guande Wu, Shaoyu Chen, Iran Roman, Roque Lopez, Ethan Brewer, Chen Zhao, **g Qian, Kyunghyun Cho, He He, Qi Sun, Huy Vo, Juan Bello, Michael Krone, Claudio Silva

    Abstract: The concept of augmented reality (AR) assistants has captured the human imagination for decades, becoming a staple of modern science fiction. To pursue this goal, it is necessary to develop artificial intelligence (AI)-based methods that simultaneously perceive the 3D environment, reason about physical tasks, and model the performer, all in real-time. Within this framework, a wide variety of senso… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 11 pages, 8 figures. This is the author's version of the article of the article that has been accepted for publication in IEEE Transactions on Visualization and Computer Graphics

  33. arXiv:2307.16120  [pdf, other

    cs.LG

    Deep Unrolling Networks with Recurrent Momentum Acceleration for Nonlinear Inverse Problems

    Authors: Qing** Zhou, Jiayu Qian, Junqi Tang, **glai Li

    Abstract: Combining the strengths of model-based iterative algorithms and data-driven deep learning solutions, deep unrolling networks (DuNets) have become a popular tool to solve inverse imaging problems. While DuNets have been successfully applied to many linear inverse problems, nonlinear problems tend to impair the performance of the method. Inspired by momentum acceleration techniques that are often us… ▽ More

    Submitted 31 March, 2024; v1 submitted 29 July, 2023; originally announced July 2023.

    MSC Class: 68U10; 94A08; 68T99

  34. arXiv:2307.00488  [pdf, other

    cs.RO

    POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments

    Authors: **gxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. Schoellig

    Abstract: Simultaneous localization and map** (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objec… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Published in Robotics: Science and Systems (RSS) 2023

  35. arXiv:2306.16197  [pdf, other

    cs.CV eess.IV

    Multi-IMU with Online Self-Consistency for Freehand 3D Ultrasound Reconstruction

    Authors: Mingyuan Luo, Xin Yang, Zhongnuo Yan, Junyu Li, Yuanji Zhang, Jiongquan Chen, Xindi Hu, Jikuan Qian, Jun Cheng, Dong Ni

    Abstract: Ultrasound (US) imaging is a popular tool in clinical diagnosis, offering safety, repeatability, and real-time capabilities. Freehand 3D US is a technique that provides a deeper understanding of scanned regions without increasing complexity. However, estimating elevation displacement and accumulation error remains challenging, making it difficult to infer the relative position using images alone.… ▽ More

    Submitted 18 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted by MICCAI-2023

  36. arXiv:2306.15700  [pdf, other

    cs.RO cs.LG

    Imitation with Spatial-Temporal Heatmap: 2nd Place Solution for NuPlan Challenge

    Authors: Yihan Hu, Kun Li, **yuan Liang, **gyu Qian, Zhening Yang, Haichao Zhang, Wenxin Shao, Zhuangzhuang Ding, Wei Xu, Qiang Liu

    Abstract: This paper presents our 2nd place solution for the NuPlan Challenge 2023. Autonomous driving in real-world scenarios is highly complex and uncertain. Achieving safe planning in the complex multimodal scenarios is a highly challenging task. Our approach, Imitation with Spatial-Temporal Heatmap, adopts the learning form of behavior cloning, innovatively predicts the future multimodal states with a h… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  37. arXiv:2306.13532  [pdf, other

    cs.LG cs.SI

    PathMLP: Smooth Path Towards High-order Homophily

    Authors: Chenxuan Xie, Jiajun Zhou, Shengbo Gong, Jiacheng Wan, Jiaxu Qian, Shanqing Yu, Qi Xuan, Xiaoniu Yang

    Abstract: Real-world graphs exhibit increasing heterophily, where nodes no longer tend to be connected to nodes with the same label, challenging the homophily assumption of classical graph neural networks (GNNs) and impeding their performance. Intriguingly, we observe that certain high-order information on heterophilous data exhibits high homophily, which motivates us to involve high-order information in no… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  38. arXiv:2306.11332  [pdf, ps, other

    cs.IT eess.SP

    Minimum Eigenvalue Based Covariance Matrix Estimation with Limited Samples

    Authors: **g Qian, Juening **, Hao Wang

    Abstract: In this paper, we consider the interference rejection combining (IRC) receiver, which improves the cell-edge user throughput via suppressing inter-cell interference and requires estimating the covariance matrix including the inter-cell interference with high accuracy. In order to solve the problem of sample covariance matrix estimation with limited samples, a regularization parameter optimization… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  39. arXiv:2306.01264  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convex and Non-convex Optimization Under Generalized Smoothness

    Authors: Haochuan Li, Jian Qian, Yi Tian, Alexander Rakhlin, Ali Jadbabaie

    Abstract: Classical analysis of convex and non-convex optimization methods often requires the Lipshitzness of the gradient, which limits the analysis to functions bounded by quadratics. Recent work relaxed this requirement to a non-uniform smoothness condition with the Hessian norm bounded by an affine function of the gradient norm, and proved convergence in the non-convex setting via gradient clip**, ass… ▽ More

    Submitted 3 November, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 37 pages

  40. arXiv:2304.13302  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    HiQ -- A Declarative, Non-intrusive, Dynamic and Transparent Observability and Optimization System

    Authors: Fuheng Wu, Ivan Davchev, Jun Qian

    Abstract: This paper proposes a non-intrusive, declarative, dynamic and transparent system called `HiQ` to track Python program runtime information without compromising on the run-time system performance and losing insight. HiQ can be used for monolithic and distributed systems, offline and online applications. HiQ is developed when we optimize our large deep neural network (DNN) models which are written in… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 7 pages, 12 figures, opensource

  41. arXiv:2303.09027  [pdf, other

    cs.LG

    Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning

    Authors: Junqi Qian, Paul Weng, Chenmien Tan

    Abstract: When applying reinforcement learning (RL) to a new problem, reward engineering is a necessary, but often difficult and error-prone task a system designer has to face. To avoid this step, we propose LR4GPM, a novel (deep) RL method that can optimize a global performance metric, which is supposed to be available as part of the problem description. LR4GPM alternates between two phases: (1) learning a… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  42. arXiv:2303.04940  [pdf, other

    cs.CV

    Non-aligned supervision for Real Image Dehazing

    Authors: Junkai Fan, Fei Guo, Jianjun Qian, Xiang Li, Jun Li, Jian Yang

    Abstract: Removing haze from real-world images is challenging due to unpredictable weather conditions, resulting in the misalignment of hazy and clear image pairs. In this paper, we propose an innovative dehazing framework that operates under non-aligned supervision. This framework is grounded in the atmospheric scattering model, and consists of three interconnected networks: dehazing, airlight, and transmi… ▽ More

    Submitted 5 January, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

  43. arXiv:2301.13419  [pdf, other

    cs.CV

    Recurrent Structure Attention Guidance for Depth Super-Resolution

    Authors: Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

    Abstract: Image guidance is an effective strategy for depth super-resolution. Generally, most existing methods employ hand-crafted operators to decompose the high-frequency (HF) and low-frequency (LF) ingredients from low-resolution depth maps and guide the HF ingredients by directly concatenating them with image features. However, the hand-designed operators usually cause inferior HF maps (e.g., distorted… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted by AAAI-2023

  44. arXiv:2301.13416  [pdf, other

    cs.CV

    Structure Flow-Guided Network for Real Depth Super-Resolution

    Authors: Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

    Abstract: Real depth super-resolution (DSR), unlike synthetic settings, is a challenging task due to the structural distortion and the edge noise caused by the natural degradation in real-world low-resolution (LR) depth maps. These defeats result in significant structure inconsistency between the depth map and the RGB guidance, which potentially confuses the RGB-structure guidance and thereby degrades the D… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted by AAAI-2023

  45. arXiv:2301.10368  [pdf, other

    cs.CL cs.LG

    Language Model Detoxification in Dialogue with Contextualized Stance Control

    Authors: **g Qian, Xifeng Yan

    Abstract: To reduce the toxic degeneration in a pretrained Language Model (LM), previous work on Language Model detoxification has focused on reducing the toxicity of the generation itself (self-toxicity) without consideration of the context. As a result, a type of implicit offensive language where the generations support the offensive language in the context is ignored. Different from the LM controlling ta… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: Findings of EMNLP 2022

  46. arXiv:2211.14250  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Model-Free Reinforcement Learning with the Decision-Estimation Coefficient

    Authors: Dylan J. Foster, Noah Golowich, Jian Qian, Alexander Rakhlin, Ayush Sekhari

    Abstract: We consider the problem of interactive decision making, encompassing structured bandits and reinforcement learning with general function approximation. Recently, Foster et al. (2021) introduced the Decision-Estimation Coefficient, a measure of statistical complexity that lower bounds the optimal regret for interactive decision making, as well as a meta-algorithm, Estimation-to-Decisions, which ach… ▽ More

    Submitted 12 August, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: V2 changes: Improved writing and added more examples

  47. arXiv:2211.12746  [pdf

    cs.CV

    Completing point cloud from few points by Wasserstein GAN and Transformers

    Authors: Xianfeng Wu, **hui Qian, Qing Wei, Xianzu Wu, Xinyi Liu, Luxin Hu, Yanli Gong, Zhongyuan Lai, Libing Wu

    Abstract: In many vision and robotics applications, it is common that the captured objects are represented by very few points. Most of the existing completion methods are designed for partial point clouds with many points, and they perform poorly or even fail completely in the case of few points. However, due to the lack of detail information, completing objects from few points faces a huge challenge. Inspi… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  48. arXiv:2211.09469  [pdf, other

    cs.CV

    Visual Commonsense-aware Representation Network for Video Captioning

    Authors: Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, ** Qian, Heng Tao Shen

    Abstract: Generating consecutive descriptions for videos, i.e., Video Captioning, requires taking full advantage of visual representation along with the generation process. Existing video captioning methods focus on making an exploration of spatial-temporal representations and their relationships to produce inferences. However, such methods only exploit the superficial association contained in the video its… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  49. arXiv:2210.06726  [pdf, other

    cs.CL

    Explanations from Large Language Models Make Small Reasoners Better

    Authors: Shiyang Li, Jianshu Chen, Yelong Shen, Zhiyu Chen, Xinlu Zhang, Zekun Li, Hong Wang, **g Qian, Baolin Peng, Yi Mao, Wenhu Chen, Xifeng Yan

    Abstract: Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the explanations generated by LLM to improve the training of small reasoners, which are more favorable in real-production deployment due to their low cost. We systematically… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  50. arXiv:2210.06676  [pdf, other

    cs.CR

    A Tagging Solution to Discover IoT Devices in Apartments

    Authors: Berkay Kaplan, **gyu Qian, Israel J Lopez-Toledo, Carl A. Gunter

    Abstract: The number of IoT devices in smart homes is increasing. This broad adoption facilitates users' lives, but it also brings problems. One such issue is that some IoT devices may invade users' privacy. Some reasons for this invasion can stem from obscure data collection practices or hidden devices. Specific IoT devices can exist out of sight and still collect user data to send to third parties via the… ▽ More

    Submitted 20 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.