Skip to main content

Showing 201–250 of 1,041 results for author: Zha, D

.
  1. arXiv:2308.07234  [pdf, other

    cs.CV cs.RO

    UniWorld: Autonomous Driving Pre-training via World Models

    Authors: Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

    Abstract: In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots. We imbue the robot with a spatial-temporal world model, termed UniWorld, to perceive its surroundings and predict the future behavior of other participants. UniWorld involves initially predicting 4D geometric occupancy as the World Models… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2305.18829

  2. arXiv:2308.06933  [pdf, other

    cs.CV

    Radiomics-Informed Deep Learning for Classification of Atrial Fibrillation Sub-Types from Left-Atrium CT Volumes

    Authors: Weihang Dai, Xiaomeng Li, Taihui Yu, Di Zhao, Jun Shen, Kwang-Ting Cheng

    Abstract: Atrial Fibrillation (AF) is characterized by rapid, irregular heartbeats, and can lead to fatal complications such as heart failure. The disease is divided into two sub-types based on severity, which can be automatically classified through CT volumes for disease screening of severe cases. However, existing classification approaches rely on generic radiomic features that may not be optimal for the… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI23

  3. arXiv:2308.06891  [pdf

    cs.RO eess.SY

    Viia-hand: a Reach-and-grasp Restoration System Integrating Voice interaction, Computer vision and Auditory feedback for Blind Amputees

    Authors: Chunhao Peng, Dapeng Yang, Ming Cheng, **ghui Dai, Deyu Zhao, Li Jiang

    Abstract: Visual feedback plays a crucial role in the process of amputation patients completing gras** in the field of prosthesis control. However, for blind and visually impaired (BVI) amputees, the loss of both visual and gras** abilities makes the "easy" reach-and-grasp task a feasible challenge. In this paper, we propose a novel multi-sensory prosthesis system hel** BVI amputees with sensing, navi… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  4. arXiv:2308.04648  [pdf, ps, other

    cs.CR cs.DS

    Communication-Efficient Search under Fully Homomorphic Encryption for Federated Machine Learning

    Authors: Dongfang Zhao

    Abstract: Homomorphic encryption (HE) has found extensive utilization in federated learning (FL) systems, capitalizing on its dual advantages: (i) ensuring the confidentiality of shared models contributed by participating entities, and (ii) enabling algebraic operations directly on ciphertexts representing encrypted models. Particularly, the approximate fully homomorphic encryption (FHE) scheme, known as CK… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  5. arXiv:2308.00918  [pdf, other

    cs.CV

    A Novel Cross-Perturbation for Single Domain Generalization

    Authors: Dongjia Zhao, Lei Qi, Xiao Shi, Yinghuan Shi, Xin Geng

    Abstract: Single domain generalization aims to enhance the ability of the model to generalize to unknown domains when trained on a single source domain. However, the limited diversity in the training data hampers the learning of domain-invariant features, resulting in compromised generalization performance. To address this, data perturbation (augmentation) has emerged as a crucial method to increase data di… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE TCSVT

  6. arXiv:2307.15061  [pdf, other

    cs.CV cs.RO

    The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation

    Authors: Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Ding Zhao, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan **g, Pengwei Li, Xiaohua Qi, Cheng **, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng , et al. (18 additional authors not shown)

    Abstract: Accurate depth estimation under out-of-distribution (OoD) scenarios, such as adverse weather conditions, sensor failure, and noise contamination, is desirable for safety-critical applications. Existing depth estimation systems, however, suffer inevitably from real-world corruptions and perturbations and are struggled to provide reliable depth predictions under such cases. In this paper, we summari… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Technical Report; 65 pages, 34 figures, 24 tables; Code at https://github.com/ldkong1205/RoboDepth

  7. arXiv:2307.15049  [pdf, other

    cs.CV

    Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models

    Authors: Kecheng Zheng, Wei Wu, Ruili Feng, Kai Zhu, Jiawei Liu, Deli Zhao, Zheng-Jun Zha, Wei Chen, Yujun Shen

    Abstract: Prompt tuning and adapter tuning have shown great potential in transferring pre-trained vision-language models (VLMs) to various downstream tasks. In this work, we design a new type of tuning method, termed as regularized mask tuning, which masks the network parameters through a learnable selection. Inspired by neural pathways, we argue that the knowledge required by a downstream task already exis… ▽ More

    Submitted 6 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV 2023

  8. arXiv:2307.14778  [pdf, other

    cs.LG eess.SP

    MATNilm: Multi-appliance-task Non-intrusive Load Monitoring with Limited Labeled Data

    Authors: **g Xiong, Tianqi Hong, Dongbo Zhao, Yu Zhang

    Abstract: Non-intrusive load monitoring (NILM) identifies the status and power consumption of various household appliances by disaggregating the total power usage signal of an entire house. Efficient and accurate load monitoring facilitates user profile establishment, intelligent household energy management, and peak load shifting. This is beneficial for both the end-users and utilities by improving the ove… ▽ More

    Submitted 29 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  9. arXiv:2307.12533  [pdf, ps, other

    cs.CR

    PUMA: Secure Inference of LLaMA-7B in Five Minutes

    Authors: Ye Dong, Wen-jie Lu, Yancheng Zheng, Haoqi Wu, Derun Zhao, ** Tan, Zhicong Huang, Cheng Hong, Tao Wei, Wenguang Chen

    Abstract: With ChatGPT as a representative, tons of companies have began to provide services based on large Transformers models. However, using such a service inevitably leak users' prompts to the model provider. Previous studies have studied secure inference for Transformer models using secure multiparty computation (MPC), where model parameters and clients' prompts are kept secret. Despite this, these fra… ▽ More

    Submitted 26 September, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  10. Toward a Physical Understanding of Galaxy-Halo Alignment

    Authors: Kun Xu, Y. P. **g, Donghai Zhao

    Abstract: We investigate the alignment of galaxy and halo orientations using the TNG300-1 hydrodynamical simulation. Our analysis reveals that the distribution of the 2D misalignment angle $θ_{\rm{2D}}$ can be well described by a truncated shifted exponential (TSE) distribution with only {\textit{one}} free parameter across different redshifts and galaxy/halo properties. We demonstrate that the galaxy-ellip… ▽ More

    Submitted 5 November, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: 19 pages, 12 figures, Published in ApJ

    Journal ref: The Astrophysical Journal, Volume 957, 2023, Number 1

  11. arXiv:2307.10485  [pdf, other

    cs.CL cs.LG q-fin.GN

    FinGPT: Democratizing Internet-scale Data for Financial Large Language Models

    Authors: Xiao-Yang Liu, Guoxuan Wang, Hongyang Yang, Daochen Zha

    Abstract: Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating human-like texts, which may potentially revolutionize the finance industry. However, existing LLMs often fall short in the financial field, which is mainly attributed to the disparities between general text data and financial text data. Unfortunately, there is only a limited number of financial te… ▽ More

    Submitted 14 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 43 pages, 8 tables, and 2 figures

  12. arXiv:2307.09823  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-modal Learning based Prediction for Disease

    Authors: Yaran Chen, Xueyu Chen, Yu Han, Haoran Li, Dongbin Zhao, **gzhong Li, Xu Wang

    Abstract: Non alcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease, which can be predicted accurately to prevent advanced fibrosis and cirrhosis. While, a liver biopsy, the gold standard for NAFLD diagnosis, is invasive, expensive, and prone to sampling errors. Therefore, non-invasive studies are extremely promising, yet they are still in their infancy due to the lack of c… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  13. arXiv:2307.09481  [pdf, other

    cs.CV

    AnyDoor: Zero-shot Object-level Image Customization

    Authors: Xi Chen, Lianghua Huang, Yu Liu, Yujun Shen, Deli Zhao, Hengshuang Zhao

    Abstract: This work presents AnyDoor, a diffusion-based image generator with the power to teleport target objects to new scenes at user-specified locations in a harmonious way. Instead of tuning parameters for each object, our model is trained only once and effortlessly generalizes to diverse object-scene combinations at the inference stage. Such a challenging zero-shot setting requires an adequate characte… ▽ More

    Submitted 7 May, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: CVPR2024

  14. arXiv:2307.08918  [pdf, other

    physics.flu-dyn

    Measuring Scale-dependent Shape Anisotropy by Coarse-Graining: Application to Inhomogeneous Rayleigh-Taylor Turbulence

    Authors: Dongxiao Zhao, Hussein Aluie

    Abstract: We generalize the `filtering spectrum' [1] to probe scales along different directions by spatial coarse-graining. This multi-dimensional filtering spectrum quantifies the spectral content of flows that are not necessarily homogeneous. From multi-dimensional spectral information, we propose a simple metric for shape anisotropy at various scales. The method is applied to simulations of 2D and 3D Ray… ▽ More

    Submitted 18 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  15. arXiv:2307.07907  [pdf, other

    cs.LG

    Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation

    Authors: Wenhao Ding, Laixi Shi, Yuejie Chi, Ding Zhao

    Abstract: Robustness has been extensively studied in reinforcement learning (RL) to handle various forms of uncertainty such as random perturbations, rare events, and malicious attacks. In this work, we consider one critical type of robustness against spurious correlation, where different portions of the state do not have correlations induced by unobserved confounders. These spurious correlations are ubiqui… ▽ More

    Submitted 25 October, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023

  16. arXiv:2307.05689  [pdf, other

    astro-ph.HE

    Magnetar emergence in a peculiar gamma-ray burst from a compact star merger

    Authors: H. Sun, C. -W. Wang, J. Yang, B. -B. Zhang, S. -L. Xiong, Y. -H. I. Yin, Y. Liu, Y. Li, W. -C. Xue, Z. Yan, C. Zhang, W. -J. Tan, H. -W. Pan, J. -C. Liu, H. -Q. Cheng, Y. -Q. Zhang, J. -W. Hu, C. Zheng, Z. -H. An, C. Cai, L. Hu, C. **, D. -Y. Li, X. -Q. Li, H. -Y. Liu , et al. (19 additional authors not shown)

    Abstract: The central engine that powers gamma-ray bursts (GRBs), the most powerful explosions in the universe, is still not identified. Besides hyper-accreting black holes, rapidly spinning and highly magnetized neutron stars, known as millisecond magnetars, have been suggested to power both long and short GRBs. The presence of a magnetar engine following compact star mergers is of particular interest as i… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 44 pages, 10 figures, 5 tables

  17. arXiv:2307.02869  [pdf, other

    cs.CV

    MomentDiff: Generative Video Moment Retrieval from Random to Real

    Authors: Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang

    Abstract: Video moment retrieval pursues an efficient and generalized solution to identify the specific temporal segments within an untrimmed video that correspond to a given language description. To achieve this goal, we provide a generative diffusion-based framework called MomentDiff, which simulates a typical human retrieval process from random browsing to gradual localization. Specifically, we first dif… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 19 pages, 6 figures

  18. arXiv:2307.02127  [pdf, other

    cs.CL

    Leveraging Denoised Abstract Meaning Representation for Grammatical Error Correction

    Authors: He**g Cao, Dongyan Zhao

    Abstract: Grammatical Error Correction (GEC) is the task of correcting errorful sentences into grammatically correct, semantically consistent, and coherent sentences. Popular GEC models either use large-scale synthetic corpora or use a large number of human-designed rules. The former is costly to train, while the latter requires quite a lot of human expertise. In recent years, AMR, a semantic representation… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures, Accepted by ACL findings 2023

  19. Traversability Analysis for Autonomous Driving in Complex Environment: A LiDAR-based Terrain Modeling Approach

    Authors: Hanzhang Xue, Hao Fu, Liang Xiao, Yiming Fan, Dawei Zhao, Bin Dai

    Abstract: For autonomous driving, traversability analysis is one of the most basic and essential tasks. In this paper, we propose a novel LiDAR-based terrain modeling approach, which could output stable, complete and accurate terrain models and traversability analysis results. As terrain is an inherent property of the environment that does not change with different view angles, our approach adopts a multi-f… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: accepted to Journal of Field Robotics

    Journal ref: Journal of Field Robotics, 2023, 1-25

  20. Reciprocating Magnetic Fields in the Pulsar Wind Observed from the Black Widow Pulsar J1720-0534

    Authors: Chen-Chen Miao, Victoria Blackmon, Wei-Wei Zhu, Dong-Zi Li, Mingyu Ge, Xiao-Peng You, Maura McLaughlin, Di Li, Na Wang, Pei Wang, Jia-Rui Niu, M. Cruces, Jian-** Yuan, Jun-Tao Bai, D. J. Champion, Yu-Tong Chen, Ming-Min Chi, P. C. C. Freire, Yi Feng, Zhen-Ye Gan, M. Kramer, Fei-Fei Kou, Yu-Xi Li, Xue-Li Miao, Ling-Qi Meng , et al. (19 additional authors not shown)

    Abstract: We report the radio observations of the eclipsing black widow pulsar J1720-0534, a 3.26 ms pulsar in orbit with a low mass companion of mass 0.029 to 0.034 M$_{\odot}$. We obtain the phase-connected timing ephemeris and polarization profile of this millisecond pulsar (MSP) using the Five-hundred-meter Aperture Spherical Radio Telescope (FAST), the Green Bank Telescope (GBT), and the Parkes Telesco… ▽ More

    Submitted 28 August, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: 15 pages, 8 figures, 1 table, accepted by RAA

  21. arXiv:2306.15864  [pdf, other

    cs.RO

    What Went Wrong? Closing the Sim-to-Real Gap via Differentiable Causal Discovery

    Authors: Peide Huang, Xilun Zhang, Ziang Cao, Shiqi Liu, Mengdi Xu, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Training control policies in simulation is more appealing than on real robots directly, as it allows for exploring diverse states in an efficient manner. Yet, robot simulators inevitably exhibit disparities from the real-world \rebut{dynamics}, yielding inaccuracies that manifest as the dynamical simulation-to-reality (sim-to-real) gap. Existing literature has proposed to close this gap by activel… ▽ More

    Submitted 19 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  22. arXiv:2306.14436  [pdf, other

    cs.CR

    Silca: Singular Caching of Homomorphic Encryption for Outsourced Databases in Cloud Computing

    Authors: Dongfang Zhao

    Abstract: Ensuring the confidentiality and privacy of sensitive information in cloud computing and outsourced databases is crucial. Homomorphic encryption (HE) offers a solution by enabling computations on encrypted data without decryption, allowing secure outsourcing while maintaining data confidentiality. However, HE faces performance challenges in query-intensive databases. To address this, we propose tw… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  23. arXiv:2306.12619  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Class-Incremental Learning based on Label Generation

    Authors: Yijia Shao, Yiduo Guo, Dongyan Zhao, Bing Liu

    Abstract: Despite the great success of pre-trained language models, it is still a challenge to use these models for continual learning, especially for the class-incremental learning (CIL) setting due to catastrophic forgetting (CF). This paper reports our finding that if we formulate CIL as a continual label generation problem, CF is drastically reduced and the generalizable representations of pre-trained m… ▽ More

    Submitted 20 July, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 12 pages, ACL 2023 Main Conference

  24. arXiv:2306.11546  [pdf, other

    cs.CV

    Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition

    Authors: Yiting Dong, Yang Li, Dongcheng Zhao, Guobin Shen, Yi Zeng

    Abstract: The prevalence of violence in daily life poses significant threats to individuals' physical and mental well-being. Using surveillance cameras in public spaces has proven effective in proactively deterring and preventing such incidents. However, concerns regarding privacy invasion have emerged due to their widespread deployment. To address the problem, we leverage Dynamic Vision Sensors (DVS) camer… ▽ More

    Submitted 23 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  25. arXiv:2306.11251  [pdf, other

    cs.CV

    Eliminating Lipschitz Singularities in Diffusion Models

    Authors: Zhantao Yang, Ruili Feng, Han Zhang, Yujun Shen, Kai Zhu, Lianghua Huang, Yifei Zhang, Yu Liu, Deli Zhao, **gren Zhou, Fan Cheng

    Abstract: Diffusion models, which employ stochastic differential equations to sample images through integrals, have emerged as a dominant class of generative models. However, the rationality of the diffusion process itself receives limited attention, leaving the question of whether the problem is well-posed and well-conditioned. In this paper, we uncover a vexing propensity of diffusion models: they frequen… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  26. arXiv:2306.10280  [pdf, other

    cs.LG cs.SI

    OpenGSL: A Comprehensive Benchmark for Graph Structure Learning

    Authors: Zhiyao Zhou, Sheng Zhou, Bochao Mao, Xuanyi Zhou, Jiawei Chen, Qiaoyu Tan, Daochen Zha, Yan Feng, Chun Chen, Can Wang

    Abstract: Graph Neural Networks (GNNs) have emerged as the de facto standard for representation learning on graphs, owing to their ability to effectively integrate graph topology and node attributes. However, the inherent suboptimal nature of node connections, resulting from the complex and contingent formation process of graphs, presents significant challenges in modeling them effectively. To tackle this i… ▽ More

    Submitted 23 December, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: 25 pages, 21 figures. Camera-ready version for NeurIPS Datasets and Benchmarks Track 2023

  27. arXiv:2306.09303  [pdf, other

    cs.LG cs.AI cs.RO

    Datasets and Benchmarks for Offline Safe Reinforcement Learning

    Authors: Zuxin Liu, Zijian Guo, Haohong Lin, Yihang Yao, Jiacheng Zhu, Zhepeng Cen, Hanjiang Hu, Wenhao Yu, Tingnan Zhang, Jie Tan, Ding Zhao

    Abstract: This paper presents a comprehensive benchmarking suite tailored to offline safe reinforcement learning (RL) challenges, aiming to foster progress in the development and evaluation of safe learning algorithms in both the training and deployment phases. Our benchmark suite contains three packages: 1) expertly crafted safe policies, 2) D4RL-styled datasets along with environment wrappers, and 3) high… ▽ More

    Submitted 16 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 22 pages.13 figures, 7 tables

  28. arXiv:2306.09273  [pdf, other

    cs.RO cs.CR cs.CV cs.LG

    Your Room is not Private: Gradient Inversion Attack on Reinforcement Learning

    Authors: Miao Li, Wenhao Ding, Ding Zhao

    Abstract: The prominence of embodied Artificial Intelligence (AI), which empowers robots to navigate, perceive, and engage within virtual environments, has attracted significant attention, owing to the remarkable advancements in computer vision and large language models. Privacy emerges as a pivotal concern within the realm of embodied AI, as the robot accesses substantial personal information. However, the… ▽ More

    Submitted 17 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures, 2 tables

  29. arXiv:2306.07707  [pdf, other

    cs.GT

    Incentive-Compatible Selection for One or Two Influentials

    Authors: Yuxin Zhao, Yao Zhang, Dengji Zhao

    Abstract: Selecting influentials in networks against strategic manipulations has attracted many researchers' attention and it also has many practical applications. Here, we aim to select one or two influentials in terms of progeny (the influential power) and prevent agents from manipulating their edges (incentive compatibility). The existing studies mostly focused on selecting a single influential for this… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: To Appear on IJCAI 2023

  30. arXiv:2306.07239  [pdf, ps, other

    stat.ME

    Nonparametric empirical Bayes biomarker imputation and estimation

    Authors: Alton Barbehenn, Sihai Dave Zhao

    Abstract: Biomarkers are often measured in bulk to diagnose patients, monitor patient conditions, and research novel drug pathways. The measurement of these biomarkers often suffers from detection limits that result in missing and untrustworthy measurements. Frequently, missing biomarkers are imputed so that down-stream analysis can be conducted with modern statistical methods that cannot normally handle da… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  31. arXiv:2306.06317  [pdf, other

    astro-ph.GA astro-ph.CO

    The DESI One-Percent survey: constructing galaxy-halo connections for ELGs and LRGs using auto and cross correlations

    Authors: Hongyu Gao, Y. P. **g, Shanquan Gui, Kun Xu, Yun Zheng, Donghai Zhao, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Todd Claybaugh, Kyle Dawson, Axel de la Macorra, Peter Doel, Kevin Fanning, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, Julien Guy, Klaus Honscheid, Robert Kehoe, Martin Landriau, Marc Manera, Aaron Meisner, Ramon Miquel, John Moustakas, Jeffrey A. Newman , et al. (9 additional authors not shown)

    Abstract: In the current Dark Energy Spectroscopic Instrument (DESI) survey, emission line galaxies (ELGs) and luminous red galaxies (LRGs) are essential for map** the dark matter distribution at $z \sim 1$. We measure the auto and cross correlation functions of ELGs and LRGs at $0.8<z\leq 1.0$ from the DESI One-Percent survey. Following Gao et al. (2022), we construct the galaxy-halo connections for ELGs… ▽ More

    Submitted 18 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 27 pages, 16 figures, accepted by ApJ

  32. arXiv:2306.05696  [pdf, other

    cs.RO

    Embodied Executable Policy Learning with Language-based Scene Summarization

    Authors: Jielin Qiu, Mengdi Xu, William Han, Seungwhan Moon, Ding Zhao

    Abstract: Large Language models (LLMs) have shown remarkable success in assisting robot learning tasks, i.e., complex household planning. However, the performance of pretrained LLMs heavily relies on domain-specific templated text data, which may be infeasible in real-world robot learning tasks with image-based observations. Moreover, existing LLMs with text inputs lack the capability to evolve with non-exp… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 15 pages. arXiv admin note: text overlap with arXiv:2107.06912 by other authors

  33. arXiv:2306.04227  [pdf, other

    cs.CR cs.DB

    High-Performance Caching of Homomorphic Encryption for Cloud Databases

    Authors: Dongfang Zhao

    Abstract: While homomorphic encryption (HE) has garnered significant research interest in cloud-based outsourced databases due to its algebraic properties over ciphertexts, the computational overhead associated with HE has hindered its widespread adoption in production database systems. Recently, a caching technique called Radix-based additive caching of homomorphic encryption (Rache) was proposed in SIGMOD… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  34. arXiv:2306.04216  [pdf, other

    cs.CV cs.MM

    MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

    Authors: Jielin Qiu, Jiacheng Zhu, William Han, Aditesh Kumar, Karthik Mittal, Claire **, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Ding Zhao, Bo Li, Lijuan Wang

    Abstract: Multimodal summarization with multimodal output (MSMO) has emerged as a promising research direction. Nonetheless, numerous limitations exist within existing public MSMO datasets, including insufficient maintenance, data inaccessibility, limited size, and the absence of proper categorization, which pose significant challenges. To address these challenges and provide a comprehensive dataset for thi… ▽ More

    Submitted 19 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Project website: https://mmsum-dataset.github.io/

  35. arXiv:2306.04170  [pdf, other

    cs.CL

    From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation

    Authors: Zhibin Chen, Yansong Feng, Dongyan Zhao

    Abstract: Entailment Graphs (EGs) have been constructed based on extracted corpora as a strong and explainable form to indicate context-independent entailment relations in natural languages. However, EGs built by previous methods often suffer from the severe sparsity issues, due to limited corpora available and the long-tail phenomenon of predicate distributions. In this paper, we propose a multi-stage meth… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages, 3 figures, accepted to ACL 2023

  36. arXiv:2306.02252  [pdf, other

    cs.CV

    MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning

    Authors: Jianghui Wang, Yuxuan Wang, Dongyan Zhao, Zilong Zheng

    Abstract: We introduce MoviePuzzle, a novel challenge that targets visual narrative reasoning and holistic movie understanding. Despite the notable progress that has been witnessed in the realm of video understanding, most prior works fail to present tasks and models to address holistic video understanding and the innate visual narrative structures existing in long-form videos. To tackle this quandary, we p… ▽ More

    Submitted 14 June, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  37. arXiv:2306.02070  [pdf, ps, other

    eess.SY

    Adaptive Approximation-Based Control for Nonlinear Systems: A Unified Solution with Accurate and Inaccurate Measurements

    Authors: Dong Zhao

    Abstract: A unified solution to adaptive approximation-based control for nonlinear systems with accurate and inaccurate state measurement is synthesized in this study. Starting from the standard adaptive approximation-based controller with accurate state measurement, its corresponding physical interpretation, stability conclusion, and learning ability are rigorously addressed when facing additive measuremen… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  38. arXiv:2306.02020  [pdf, ps, other

    eess.SY

    Replay Attack Detection Based on Parity Space Method for Cyber-Physical Systems

    Authors: Dong Zhao, Yang Shi, Steven X. Ding, Yueyang Li, Fangzhou Fu

    Abstract: The replay attack detection problem is studied from a new perspective based on parity space method in this paper. The proposed detection methods have the ability to distinguish system fault and replay attack, handle both input and output data replay, maintain certain control performance, and can be implemented conveniently and efficiently. First, the replay attack effect on the residual is derived… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  39. arXiv:2306.02018  [pdf, other

    cs.CV

    VideoComposer: Compositional Video Synthesis with Motion Controllability

    Authors: Xiang Wang, Hangjie Yuan, Shiwei Zhang, Dayou Chen, Jiuniu Wang, Yingya Zhang, Yujun Shen, Deli Zhao, **gren Zhou

    Abstract: The pursuit of controllability as a higher standard of visual content creation has yielded remarkable progress in customizable image synthesis. However, achieving controllable video synthesis remains challenging due to the large variation of temporal dynamics and the requirement of cross-frame temporal consistency. Based on the paradigm of compositional generation, this work presents VideoComposer… ▽ More

    Submitted 5 June, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: The first four authors contributed equally. Project page: https://videocomposer.github.io

  40. arXiv:2306.02016  [pdf, ps, other

    eess.SY math.OC

    Converse negative imaginary theorems

    Authors: Sei Zhen Khong, Di Zhao, Alexander Lanzon

    Abstract: Converse negative imaginary theorems for linear time-invariant systems are derived. In particular, we provide necessary and sufficient conditions for a feedback system to be robustly stable against various types of negative imaginary (NI) uncertainty. Uncertainty classes of marginally stable NI systems and stable strictly NI systems with restrictions on their static or instantaneous gains are cons… ▽ More

    Submitted 20 November, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: This paper has been submitted for possible publication at Automatica

  41. arXiv:2306.00435  [pdf, other

    cs.CL

    How Many Answers Should I Give? An Empirical Study of Multi-Answer Reading Comprehension

    Authors: Chen Zhang, Jiuheng Lin, Xiao Liu, Yuxuan Lai, Yansong Feng, Dongyan Zhao

    Abstract: The multi-answer phenomenon, where a question may have multiple answers scattered in the document, can be well handled by humans but is challenging enough for machine reading comprehension (MRC) systems. Despite recent progress in multi-answer MRC, there lacks a systematic analysis of how this phenomenon arises and how to better address it. In this work, we design a taxonomy to categorize commonly… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023

  42. arXiv:2306.00350  [pdf, other

    cs.GT

    Score-Based Equilibrium Learning in Multi-Player Finite Games with Imperfect Information

    Authors: Runyu Lu, Yuanheng Zhu, Dongbin Zhao

    Abstract: Real-world games, which concern imperfect information, multiple players, and simultaneous moves, are less frequently discussed in the existing literature of game theory. While reinforcement learning (RL) provides a general framework to extend the game theoretical algorithms, the assumptions that guarantee their convergence towards Nash equilibria may no longer hold in real-world games. Starting fr… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  43. arXiv:2306.00342  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks

    Authors: Dan Zhao

    Abstract: Works on implicit regularization have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In deep linear networks, it has been shown that gradient descent implicitly regularizes toward low-rank solutions on matrix completion/factorization tasks. Adding depth not only improves performance on these tasks but also ac… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 3024--3038

  44. arXiv:2306.00014  [pdf, other

    cs.CL cs.LG

    PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models

    Authors: Zhuocheng Gong, Jiahao Liu, Qifan Wang, Yang Yang, **gang Wang, Wei Wu, Yunsen Xian, Dongyan Zhao, Rui Yan

    Abstract: While transformer-based pre-trained language models (PLMs) have dominated a number of NLP applications, these models are heavy to deploy and expensive to use. Therefore, effectively compressing large-scale PLMs becomes an increasingly important problem. Quantization, which represents high-precision tensors with low-bit fix-point format, is a viable solution. However, most existing quantization met… ▽ More

    Submitted 30 May, 2023; originally announced June 2023.

    Comments: Findings of ACL2023

  45. arXiv:2305.19327  [pdf, other

    cs.CV

    Cones 2: Customizable Image Synthesis with Multiple Subjects

    Authors: Zhiheng Liu, Yifei Zhang, Yujun Shen, Kecheng Zheng, Kai Zhu, Ruili Feng, Yu Liu, Deli Zhao, **gren Zhou, Yang Cao

    Abstract: Synthesizing images with user-specified subjects has received growing attention due to its practical applications. Despite the recent success in single subject customization, existing algorithms suffer from high training cost and low success rate along with increased number of subjects. Towards controllable image synthesis with multiple subjects as the constraints, this work studies how to efficie… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  46. arXiv:2305.19213  [pdf, other

    cs.CL

    The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code

    Authors: Xiao Liu, Da Yin, Chen Zhang, Yansong Feng, Dongyan Zhao

    Abstract: Causal reasoning, the ability to identify cause-and-effect relationship, is crucial in human thinking. Although large language models (LLMs) succeed in many NLP tasks, it is still challenging for them to conduct complex causal reasoning like abductive reasoning and counterfactual reasoning. Given the fact that programming code may express causal relations more often and explicitly with conditional… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023. Code and data are available at https://github.com/xxxiaol/magic-if

  47. arXiv:2305.18829  [pdf, other

    cs.CV cs.MM cs.RO

    UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving

    Authors: Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

    Abstract: Multi-camera 3D perception has emerged as a prominent research field in autonomous driving, offering a viable and cost-effective alternative to LiDAR-based solutions. The existing multi-camera algorithms primarily rely on monocular 2D pre-training. However, the monocular 2D pre-training overlooks the spatial and temporal correlations among the multi-camera system. To address this limitation, we pr… ▽ More

    Submitted 27 April, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted by RAL2024

  48. arXiv:2305.18760  [pdf, other

    cs.CL

    Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training

    Authors: Yuxuan Wang, Jianghui Wang, Dongyan Zhao, Zilong Zheng

    Abstract: We introduce CDBERT, a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters. We name the two core modules of CDBERT as Shuowen and Jiezi, where Shuowen refers to the process of retrieving the most appropriate meaning from Chinese dictionaries and Jiezi refers to the process of enhancing characters'… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023 Findings

  49. arXiv:2305.18756  [pdf, other

    cs.CV cs.CL

    VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions

    Authors: Yuxuan Wang, Zilong Zheng, Xueliang Zhao, **peng Li, Yueqian Wang, Dongyan Zhao

    Abstract: Video-grounded dialogue understanding is a challenging problem that requires machine to perceive, parse and reason over situated semantics extracted from weakly aligned video and dialogues. Most existing benchmarks treat both modalities the same as a frame-independent visual understanding task, while neglecting the intrinsic attributes in multimodal dialogues, such as scene and topic transitions.… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023

  50. arXiv:2305.17607  [pdf, other

    cs.CL

    More than Classification: A Unified Framework for Event Temporal Relation Extraction

    Authors: Quzhe Huang, Yutong Hu, Shengqi Zhu, Yansong Feng, Chang Liu, Dongyan Zhao

    Abstract: Event temporal relation extraction~(ETRE) is usually formulated as a multi-label classification task, where each type of relation is simply treated as a one-hot label. This formulation ignores the meaning of relations and wipes out their intrinsic dependency. After examining the relation definitions in various ETRE tasks, we observe that all relations can be interpreted using the start and end tim… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Journal ref: ACL 2023 Main Conference