Skip to main content

Showing 151–200 of 1,070 results for author: Zhao, T

.
  1. arXiv:2309.13438  [pdf, other

    cs.CV cs.AI

    Rethinking Superpixel Segmentation from Biologically Inspired Mechanisms

    Authors: Tingyu Zhao, Bo Peng, Yuan Sun, Daipeng Yang, Zhenguang Zhang, Xi Wu

    Abstract: Recently, advancements in deep learning-based superpixel segmentation methods have brought about improvements in both the efficiency and the performance of segmentation. However, a significant challenge remains in generating superpixels that strictly adhere to object boundaries while conveying rich visual significance, especially when cross-surface color correlations may interfere with objects. Dr… ▽ More

    Submitted 11 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

  2. arXiv:2309.09831  [pdf, other

    math.ST stat.ML

    Pivotal Estimation of Linear Discriminant Analysis in High Dimensions

    Authors: Ethan X. Fang, Yajun Mei, Yuyang Shi, Qunzhi Xu, Tuo Zhao

    Abstract: We consider the linear discriminant analysis problem in the high-dimensional settings. In this work, we propose PANDA(PivotAl liNear Discriminant Analysis), a tuning-insensitive method in the sense that it requires very little effort to tune the parameters. Moreover, we prove that PANDA achieves the optimal convergence rate in terms of both the estimation error and misclassification rate. Our theo… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  3. arXiv:2309.06149  [pdf, ps, other

    math.CO

    Two involutions on binary trees and generalizations

    Authors: Yang Li, Zhicong Lin, Tongyuan Zhao

    Abstract: This paper investigates two involutions on binary trees. One is the mirror symmetry of binary trees which combined with the classical bijection $\varphi$ between binary trees and plane trees answers an open problem posed by Bai and Chen. This involution can be generalized to weakly increasing trees, which admits to merge two recent equidistributions found by Bai--Chen and Chen--Fu, respectively. T… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 24 pages, 16 figures

  4. arXiv:2309.02632  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning from Hierarchical Preference Design

    Authors: Alexander Bukharin, Yixiao Li, Pengcheng He, Tuo Zhao

    Abstract: Reward design is a fundamental, yet challenging aspect of reinforcement learning (RL). Researchers typically utilize feedback signals from the environment to handcraft a reward function, but this process is not always effective due to the varying scale and intricate dependencies of the feedback signals. This paper shows by exploiting certain structures, one can ease the reward design process. Spec… ▽ More

    Submitted 10 June, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 28 Pages, 14 figures

  5. arXiv:2309.02610  [pdf, other

    cs.LG cs.DS

    T-SaS: Toward Shift-aware Dynamic Adaptation for Streaming Data

    Authors: Weijieying Ren, Tianxiang Zhao, Wei Qin, Kunpeng Liu

    Abstract: In many real-world scenarios, distribution shifts exist in the streaming data across time steps. Many complex sequential data can be effectively divided into distinct regimes that exhibit persistent dynamics. Discovering the shifted behaviors and the evolving patterns underlying the streaming data are important to understand the dynamic system. Existing methods typically train one robust model to… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: CIKM 2023

  6. arXiv:2309.00738  [pdf, other

    cs.LG

    Rethinking the Power of Graph Canonization in Graph Representation Learning with Stability

    Authors: Zehao Dong, Muhan Zhang, Philip R. O. Payne, Michael A Province, Carlos Cruchaga, Tianyu Zhao, Fuhai Li, Yixin Chen

    Abstract: The expressivity of Graph Neural Networks (GNNs) has been studied broadly in recent years to reveal the design principles for more powerful GNNs. Graph canonization is known as a typical approach to distinguish non-isomorphic graphs, yet rarely adopted when develo** expressive GNNs. This paper proposes to maximize the expressivity of GNNs by graph canonization, then the power of such GNNs is stu… ▽ More

    Submitted 9 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

  7. arXiv:2308.16422  [pdf, other

    astro-ph.IM cs.LG gr-qc

    Dilated convolutional neural network for detecting extreme-mass-ratio inspirals

    Authors: Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren

    Abstract: The detection of Extreme Mass Ratio Inspirals (EMRIs) is intricate due to their complex waveforms, extended duration, and low signal-to-noise ratio (SNR), making them more challenging to be identified compared to compact binary coalescences. While matched filtering-based techniques are known for their computational demands, existing deep learning-based methods primarily handle time-domain data and… ▽ More

    Submitted 14 May, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 11 pages, 5 figures, and 2 tables

    Journal ref: Phys. Rev. D 109, 084054 (2024)

  8. arXiv:2308.13917  [pdf, other

    cs.CV cond-mat.mtrl-sci

    Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid Algorithm with Transformer and CNN Encoders

    Authors: Khaled Alrfou, Tian Zhao, Amir Kordijazi

    Abstract: Transfer learning improves the performance of deep learning models by initializing them with parameters pre-trained on larger datasets. Intuitively, transfer learning is more effective when pre-training is on the in-domain datasets. A recent study by NASA has demonstrated that the microstructure segmentation with encoder-decoder algorithms benefits more from CNN encoders pre-trained on microscopy… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 21 pages, 8 figures, 11 tables

  9. arXiv:2308.13513  [pdf, other

    cs.LG cs.CR cs.SI

    Unveiling the Role of Message Passing in Dual-Privacy Preservation on GNNs

    Authors: Tianyi Zhao, Hui Hu, Lu Cheng

    Abstract: Graph Neural Networks (GNNs) are powerful tools for learning representations on graphs, such as social networks. However, their vulnerability to privacy inference attacks restricts their practicality, especially in high-stake domains. To address this issue, privacy-preserving GNNs have been proposed, focusing on preserving node and/or link privacy. This work takes a step back and investigates how… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: CIKM 2023

  10. arXiv:2308.13177  [pdf, other

    cs.CV cs.CL

    How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

    Authors: Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang

    Abstract: Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and… ▽ More

    Submitted 18 December, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Long paper accepted at AAAI 2024

  11. arXiv:2308.13159  [pdf, ps, other

    math.AP

    Almost sure scattering for defocusing energy critical Hartree equation on $\R^5$

    Authors: Liying Tao, Tengfei Zhao

    Abstract: We consider the defocusing energy-critical Hartree equation $i\pa_tu+Δu=(|\cdot|^{-4}\ast|u|^2)u$ in spatial dimension $d=5$ and prove almost sure scattering with initial data $u_0\in H^s_x(\R^5)$ for any $s\in\R$. The proof relies on the modified interaction Morawetz estimate, the stability theories, the ``Narrowed'' Wiener randomization. We are inspired to consider this problem by the work of Sh… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  12. arXiv:2308.12952  [pdf, other

    cs.RO cs.LG

    BridgeData V2: A Dataset for Robot Learning at Scale

    Authors: Homer Walke, Kevin Black, Abraham Lee, Moo ** Kim, Max Du, Chongyi Zheng, Tony Zhao, Philippe Hansen-Estruch, Quan Vuong, Andre He, Vivek Myers, Kuan Fang, Chelsea Finn, Sergey Levine

    Abstract: We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research on scalable robot learning. BridgeData V2 contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot. BridgeData V2 provides extensive task and environment variability, leading to skills that can generalize across environments, domains,… ▽ More

    Submitted 17 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 9 pages

  13. arXiv:2308.11627  [pdf, other

    eess.SP cs.AI cs.CV eess.IV eess.SY

    Non-Intrusive Electric Load Monitoring Approach Based on Current Feature Visualization for Smart Energy Management

    Authors: Yiwen Xu, Dengfeng Liu, Liangtao Huang, Zhiquan Lin, Tiesong Zhao, Sam Kwong

    Abstract: The state-of-the-art smart city has been calling for an economic but efficient energy management over large-scale network, especially for the electric power system. It is a critical issue to monitor, analyze and control electric loads of all users in system. In this paper, we employ the popular computer vision techniques of AI to design a non-invasive load monitoring method for smart electric ener… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  14. arXiv:2308.11257  [pdf, other

    cs.CL

    HopPG: Self-Iterative Program Generation for Multi-Hop Question Answering over Heterogeneous Knowledge

    Authors: Yingyao Wang, Yongwei Zhou, Chaoqun Duan, Junwei Bao, Tiejun Zhao

    Abstract: The semantic parsing-based method is an important research branch for knowledge-based question answering. It usually generates executable programs lean upon the question and then conduct them to reason answers over a knowledge base. Benefit from this inherent mechanism, it has advantages in the performance and the interpretability. However, traditional semantic parsing methods usually generate a c… ▽ More

    Submitted 10 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  15. arXiv:2308.02530  [pdf, other

    cs.CV cs.AI

    Gated Driver Attention Predictor

    Authors: Tianci Zhao, Xue Bai, Jianwu Fang, Jianru Xue

    Abstract: Driver attention prediction implies the intention understanding of where the driver intends to go and what object the driver concerned about, which commonly provides a driving task-guided traffic scene understanding. Some recent works explore driver attention prediction in critical or accident scenarios and find a positive role in hel** accident prediction, while the promotion ability is constra… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: Accepted by ITSC2023

  16. arXiv:2307.16457  [pdf, other

    cs.CL

    A Benchmark for Understanding Dialogue Safety in Mental Health Support

    Authors: Huachuan Qiu, Tong Zhao, Anqi Li, Shuai Zhang, Hongliang He, Zhenzhong Lan

    Abstract: Dialogue safety remains a pervasive challenge in open-domain human-machine interaction. Existing approaches propose distinctive dialogue safety taxonomies and datasets for detecting explicitly harmful responses. However, these taxonomies may not be suitable for analyzing response safety in mental health support. In real-world interactions, a model response deemed acceptable in casual conversations… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: accepted to The 12th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC2023)

  17. arXiv:2307.16416  [pdf, other

    cs.CV

    MRA-GNN: Minutiae Relation-Aware Model over Graph Neural Network for Fingerprint Embedding

    Authors: Yapeng Su, Tong Zhao, Zicheng Zhang

    Abstract: Deep learning has achieved remarkable results in fingerprint embedding, which plays a critical role in modern Automated Fingerprint Identification Systems. However, previous works including CNN-based and Transformer-based approaches fail to exploit the nonstructural data, such as topology and correlation in fingerprints, which is essential to facilitate the identifiability and robustness of embedd… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 10 pages, 6 figures, accepted by IJCB 2023

  18. arXiv:2307.14326  [pdf, other

    cs.RO cs.AI cs.LG

    Waypoint-Based Imitation Learning for Robotic Manipulation

    Authors: Lucy Xiaoyang Shi, Archit Sharma, Tony Z. Zhao, Chelsea Finn

    Abstract: While imitation learning methods have seen a resurgent interest for robotic manipulation, the well-known problem of compounding errors continues to afflict behavioral cloning (BC). Waypoints can help address this problem by reducing the horizon of the learning problem for BC, and thus, the errors compounded over time. However, waypoint labeling is underspecified, and requires additional human supe… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: The first two authors contributed equally

  19. arXiv:2307.12975  [pdf, ps, other

    cs.LG math.ST stat.ML

    Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems

    Authors: Xiang Ji, Huazheng Wang, Minshuo Chen, Tuo Zhao, Mengdi Wang

    Abstract: For a real-world decision-making problem, the reward function often needs to be engineered or learned. A popular approach is to utilize human feedback to learn a reward function for training. The most straightforward way to do so is to ask humans to provide ratings for state-action pairs on an absolute scale and take these ratings as reward samples directly. Another popular way is to ask humans to… ▽ More

    Submitted 28 October, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  20. arXiv:2307.11699  [pdf, other

    cs.HC

    Co-Design with Myself: A Brain-Computer Interface Design Tool that Predicts Live Emotion to Enhance Metacognitive Monitoring of Designers

    Authors: Qi Yang, Shuo Feng, Tianlin Zhao, Saleh Kalantari

    Abstract: Intuition, metacognition, and subjective uncertainty interact in complex ways to shape the creative design process. Design intuition, a designer's innate ability to generate creative ideas and solutions based on implicit knowledge and experience, is often evaluated and refined through metacognitive monitoring. This self-awareness and management of cognitive processes can be triggered by subjective… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  21. Learning and Evaluating Human Preferences for Conversational Head Generation

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

    Abstract: A reliable and comprehensive evaluation metric that aligns with manual preference assessments is crucial for conversational head video synthesis methods development. Existing quantitative evaluations often fail to capture the full complexity of human preference, as they only consider limited evaluation dimensions. Qualitative evaluations and user studies offer a solution but are time-consuming and… ▽ More

    Submitted 2 August, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted by ACM Multimedia 2023

  22. arXiv:2307.08209  [pdf, other

    cs.CV

    Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection

    Authors: Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Voxel-based methods have achieved state-of-the-art performance for 3D object detection in autonomous driving. However, their significant computational and memory costs pose a challenge for their application to resource-constrained vehicles. One reason for this high resource consumption is the presence of a large number of redundant background points in Lidar point clouds, resulting in spatial redu… ▽ More

    Submitted 8 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV2023

  23. arXiv:2307.06736  [pdf, other

    cs.LG cs.AI

    MPR-Net:Multi-Scale Pattern Reproduction Guided Universality Time Series Interpretable Forecasting

    Authors: Tianlong Zhao, Xiang Ma, Xuemei Li, Caiming Zhang

    Abstract: Time series forecasting has received wide interest from existing research due to its broad applications and inherent challenging. The research challenge lies in identifying effective patterns in historical series and applying them to future forecasting. Advanced models based on point-wise connected MLP and Transformer architectures have strong fitting power, but their secondary computational compl… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  24. arXiv:2307.05857  [pdf, other

    cs.LG cs.AI cs.CY

    FAIRO: Fairness-aware Adaptation in Sequential-Decision Making for Human-in-the-Loop Systems

    Authors: Tianyu Zhao, Mojtaba Taherisadr, Salma Elmalaki

    Abstract: Achieving fairness in sequential-decision making systems within Human-in-the-Loop (HITL) environments is a critical concern, especially when multiple humans with different behavior and expectations are affected by the same adaptation decisions in the system. This human variability factor adds more complexity since policies deemed fair at one point in time may become discriminatory over time due to… ▽ More

    Submitted 6 November, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  25. arXiv:2307.02090  [pdf, other

    cs.CV

    Interactive Conversational Head Generation

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao

    Abstract: We introduce a new conversation head generation benchmark for synthesizing behaviors of a single interlocutor in a face-to-face conversation. The capability to automatically synthesize interlocutors which can participate in long and multi-turn conversations is vital and offer benefits for various applications, including digital humans, virtual agents, and social robots. While existing research pri… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2112.13548

  26. arXiv:2307.02049  [pdf

    eess.SY cs.LG

    Graph Neural Network-based Power Flow Model

    Authors: Mingjian Tuo, Xingpeng Li, Tianxia Zhao

    Abstract: Power flow analysis plays a crucial role in examining the electricity flow within a power system network. By performing power flow calculations, the system's steady-state variables, including voltage magnitude, phase angle at each bus, active/reactive power flow across branches, can be determined. While the widely used DC power flow model offers speed and robustness, it may yield inaccurate line f… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2112.08418

  27. arXiv:2307.01649  [pdf, other

    cs.LG

    Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks

    Authors: Kaiqi Zhang, Zixuan Zhang, Minshuo Chen, Yuma Takeda, Mengdi Wang, Tuo Zhao, Yu-Xiang Wang

    Abstract: Convolutional residual neural networks (ConvResNets), though overparameterized, can achieve remarkable prediction performance in practice, which cannot be well explained by conventional wisdom. To bridge this gap, we study the performance of ConvResNeXts, which cover ConvResNets as a special case, trained with weight decay from the perspective of nonparametric classification. Our analysis allows f… ▽ More

    Submitted 17 February, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 20 pages, 1 figure

  28. arXiv:2307.00863  [pdf, ps, other

    cs.LG cs.CR

    Thompson Sampling under Bernoulli Rewards with Local Differential Privacy

    Authors: Bo Jiang, Tianchi Zhao, Ming Li

    Abstract: This paper investigates the problem of regret minimization for multi-armed bandit (MAB) problems with local differential privacy (LDP) guarantee. Given a fixed privacy budget $ε$, we consider three privatizing mechanisms under Bernoulli scenario: linear, quadratic and exponential mechanisms. Under each mechanism, we derive stochastic regret bound for Thompson Sampling algorithm. Finally, we simula… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted by ICML 22 workshop

  29. arXiv:2306.16564  [pdf, other

    cs.CL stat.ML

    Pareto Optimal Learning for Estimating Large Language Model Errors

    Authors: Theodore Zhao, Mu Wei, J. Samuel Preston, Hoifung Poon

    Abstract: Large Language Models (LLMs) have shown impressive abilities in many applications. When a concrete and precise answer is desired, it is important to have a quantitative estimation of the potential error rate. However, this can be challenging due to the text-in-text-out nature of generative models. We present a method based on Pareto optimization that generates a risk score to estimate the probabil… ▽ More

    Submitted 22 May, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  30. arXiv:2306.16181  [pdf, other

    cs.CV cs.MM eess.IV

    Learning to Pan-sharpening with Memories of Spatial Details

    Authors: Maoxun Yuan, Tianyi Zhao, Bo Li, Xingxing Wei

    Abstract: Pan-sharpening, as one of the most commonly used techniques in remote sensing systems, aims to inject spatial details from panchromatic images into multispectral images (MS) to obtain high-resolution multispectral images. Since deep learning has received widespread attention because of its powerful fitting ability and efficient feature extraction, a variety of pan-sharpening methods have been prop… ▽ More

    Submitted 8 August, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  31. arXiv:2306.14859  [pdf, other

    cs.LG stat.ML

    Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories

    Authors: Zixuan Zhang, Minshuo Chen, Mengdi Wang, Wen**g Liao, Tuo Zhao

    Abstract: Existing theories on deep nonparametric regression have shown that when the input data lie on a low-dimensional manifold, deep neural networks can adapt to the intrinsic data structures. In real world applications, such an assumption of data lying exactly on a low dimensional manifold is stringent. This paper introduces a relaxed assumption that the input data are concentrated around a subset of… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  32. arXiv:2306.12859   

    cs.LG

    Reinforcement Federated Learning Method Based on Adaptive OPTICS Clustering

    Authors: Tianyu Zhao, Jun** Du, Yingxia Shao, Zeli Guan

    Abstract: Federated learning is a distributed machine learning technology, which realizes the balance between data privacy protection and data sharing computing. To protect data privacy, feder-ated learning learns shared models by locally executing distributed training on participating devices and aggregating local models into global models. There is a problem in federated learning, that is, the negative im… ▽ More

    Submitted 22 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: There is a declarative error in the method part, we want to withdraw the revision first, so as not to mislead others

  33. Visual-Aware Text-to-Speech

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

    Abstract: Dynamically synthesizing talking speech that actively responds to a listening head is critical during the face-to-face interaction. For example, the speaker could take advantage of the listener's facial expression to adjust the tones, stressed syllables, or pauses. In this work, we present a new visual-aware text-to-speech (VA-TTS) task to synthesize speech conditioned on both textual inputs and s… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: accepted as oral and top 3% paper by ICASSP 2023

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023, 1-5

  34. arXiv:2306.11868  [pdf, other

    cs.CV

    Multiverse Transformer: 1st Place Solution for Waymo Open Sim Agents Challenge 2023

    Authors: Yu Wang, Tiebiao Zhao, Fan Yi

    Abstract: This technical report presents our 1st place solution for the Waymo Open Sim Agents Challenge (WOSAC) 2023. Our proposed MultiVerse Transformer for Agent simulation (MVTA) effectively leverages transformer-based motion prediction approaches, and is tailored for closed-loop simulation of agents. In order to produce simulations with a high degree of realism, we design novel training and sampling met… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Technical report for the 1st place solution of Waymo Open Sim Agents Challenge 2023. Project page: https://multiverse-transformer.github.io/sim-agents/. CVPR 2023 workshop on Autonomous Driving: https://cvpr2023.wad.vision/

  35. arXiv:2306.11300  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

    Authors: Zilun Zhang, Tiancheng Zhao, Yulong Guo, Jianwei Yin

    Abstract: Pre-trained Vision-Language Models (VLMs) utilizing extensive image-text paired data have demonstrated unprecedented image-text association capabilities, achieving remarkable results across various downstream tasks. A critical challenge is how to make use of existing large-scale pre-trained VLMs, which are trained on common objects, to perform the domain-specific transfer for accomplishing domain-… ▽ More

    Submitted 2 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: RS5M dataset v5

  36. arXiv:2306.11222  [pdf, other

    cs.LG cs.CL

    LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

    Authors: Yixiao Li, Yifan Yu, Qingru Zhang, Chen Liang, Pengcheng He, Weizhu Chen, Tuo Zhao

    Abstract: Transformer models have achieved remarkable results in various natural language tasks, but they are often prohibitively large, requiring massive memories and computational resources. To reduce the size and complexity of these models, we propose LoSparse (Low-Rank and Sparse approximation), a novel model compression technique that approximates a weight matrix by the sum of a low-rank matrix and a s… ▽ More

    Submitted 26 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  37. arXiv:2306.09841  [pdf, other

    cs.CL cs.AI

    Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond

    Authors: Fangzhi Xu, Qika Lin, Jiawei Han, Tianzhe Zhao, Jun Liu, Erik Cambria

    Abstract: Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge engineering and artificial intelligence. Recently, Large Language Models (LLMs) have emerged as a noteworthy innovation in natural language processing (NLP), exhibiting impressive achievements across various classic NLP tasks. However, the question of whether LLMs can effectively address the task of… ▽ More

    Submitted 8 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 14 pages, 11 figures

  38. arXiv:2306.09832  [pdf, ps, other

    math.RA

    On the generalizations of global dimensions and singularity categories

    Authors: Xiaolei Zhang, Tiwei Zhao, Dingguo Wang

    Abstract: For each $n\in\mathbb{N}\cup\{\infty\}$, we introduce the notion of $n$-singularity category $\mathbf{D}_{n{\rm-}sg}(R)$ of a given ring $R$, which can be seen as a generalization of the classical singularity category. Moreover, the $n$-global dimension $n$-gldim$(R)$ of $R$ is investigated. We show that $\mathbf{D}_{n{\rm-}sg}(R)=0$ if and only if $n$-gldim$(R)$ is finite. Furthermore, we charact… ▽ More

    Submitted 14 July, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.09140

  39. arXiv:2306.09140  [pdf, ps, other

    math.RA

    Little finitistic dimensions and generalized derived categories

    Authors: Xiaolei Zhang, Tiwei Zhao, Dingguo Wang

    Abstract: In this paper, we introduced a generalization of the derived category, which is called the $n$-derived category and denoted by $\D_{n}(R)$, of a given ring $R$ for each $n\in\mathbb{N}\cup\{\infty\}$. The $n$-derived category of a ring is proved to be very closely connected with its left little finitistic dimension. We also introduce and investigate the notions of $n$-exact sequences, $n$-projecti… ▽ More

    Submitted 14 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  40. Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations

    Authors: Tianxiang Zhao, Wenchao Yu, Suhang Wang, Lu Wang, Xiang Zhang, Yuncong Chen, Yanchi Liu, Wei Cheng, Haifeng Chen

    Abstract: Imitation learning has achieved great success in many sequential decision-making tasks, in which a neural agent is learned by imitating collected human demonstrations. However, existing algorithms typically require a large number of high-quality demonstrations that are difficult and expensive to collect. Usually, a trade-off needs to be made between demonstration quality and quantity in practice.… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23), August 6--10, 2023, Long Beach, CA, USA

  41. CARL-G: Clustering-Accelerated Representation Learning on Graphs

    Authors: William Shiao, Uday Singh Saini, Yozen Liu, Tong Zhao, Neil Shah, Evangelos E. Papalexakis

    Abstract: Self-supervised learning on graphs has made large strides in achieving great performance in various downstream tasks. However, many state-of-the-art methods suffer from a number of impediments, which prevent them from realizing their full potential. For instance, contrastive methods typically require negative sampling, which is often computationally costly. While non-contrastive methods avoid this… ▽ More

    Submitted 31 July, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: 14 pages. Accepted at KDD 2023

  42. arXiv:2306.06912  [pdf, other

    physics.optics nlin.CD quant-ph

    Chaos with Gaussian invariant distribution by quantum-noise random phase feedback

    Authors: Yanqiang Guo, Haifeng Li, Yingqi Wang, Xiangyu Meng, Tong Zhao, Xiaomin Guo

    Abstract: We experimentally present a random phase feedback based on quantum noise to generate a chaotic laser with Gaussian invariant distribution. The quantum noise from vacuum fluctuations is acquired by balanced homodyne detection and injected into a phase modulator to form a random phase feedback. An optical switch using high-speed intensity modulator is employed to reset the chaotic states repeatedly… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 11 pages, 10 figures

  43. arXiv:2306.03109  [pdf, other

    q-bio.QM cs.LG physics.chem-ph

    Machine Learning Force Fields with Data Cost Aware Training

    Authors: Alexander Bukharin, Tianyi Liu, Shengjie Wang, Simiao Zuo, Weihao Gao, Wen Yan, Tuo Zhao

    Abstract: Machine learning force fields (MLFF) have been proposed to accelerate molecular dynamics (MD) simulation, which finds widespread applications in chemistry and biomedical research. Even for the most data-efficient MLFFs, reaching chemical accuracy can require hundreds of frames of force and energy labels generated by expensive quantum mechanical algorithms, which may scale as $O(n^3)$ to $O(n^7)$,… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  44. arXiv:2306.01323  [pdf, other

    cs.LG cs.AI

    Demystifying Structural Disparity in Graph Neural Networks: Can One Size Fit All?

    Authors: Haitao Mao, Zhikai Chen, Wei **, Haoyu Han, Yao Ma, Tong Zhao, Neil Shah, Jiliang Tang

    Abstract: Recent studies on Graph Neural Networks(GNNs) provide both empirical and theoretical evidence supporting their effectiveness in capturing structural patterns on both homophilic and certain heterophilic graphs. Notably, most real-world homophilic and heterophilic graphs are comprised of a mixture of nodes in both homophilic and heterophilic structural patterns, exhibiting a structural disparity. Ho… ▽ More

    Submitted 14 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 55 pages, 24 figures. arXiv admin note: text overlap with arXiv:2106.15535 by other authors

  45. arXiv:2306.00369  [pdf, other

    cs.CL

    Focused Prefix Tuning for Controllable Text Generation

    Authors: Congda Ma, Tianyu Zhao, Makoto Shing, Kei Sawada, Manabu Okumura

    Abstract: In a controllable text generation dataset, there exist unannotated attributes that could provide irrelevant learning signals to models that use it for training and thus degrade their performance. We propose focused prefix tuning(FPT) to mitigate the problem and to enable the control to focus on the desired attribute. Experimental results show that FPT can achieve better control accuracy and text f… ▽ More

    Submitted 10 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to the ACL 2023

  46. arXiv:2305.18713  [pdf, other

    cond-mat.mes-hall

    How Thermal Effect Regulates Cyclic Voltammetry of Supercapacitors

    Authors: Teng Zhao, Shuangliang Zhao, Shenggao Zhou, Zhenli Xu

    Abstract: Cyclic voltammetry (CV) is a powerful technique for characterizing electrochemical properties of electrochemical devices. During charging-discharging cycles, thermal effect has profound impact on its performance, but existing theoretical models cannot clarify such intrinsic mechanism and often give poor prediction. Herein, we propose an interfacial model for the electro-thermal coupling, based on… ▽ More

    Submitted 2 July, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  47. arXiv:2305.18703  [pdf, other

    cs.CL cs.AI

    Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

    Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of dom… ▽ More

    Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  48. arXiv:2305.14913  [pdf, other

    cs.CL

    CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition

    Authors: Tingting Ma, Qianhui Wu, Huiqiang Jiang, Börje F. Karlsson, Tiejun Zhao, Chin-Yew Lin

    Abstract: Cross-lingual named entity recognition (NER) aims to train an NER system that generalizes well to a target language by leveraging labeled data in a given source language. Previous work alleviates the data scarcity problem by translating source-language labeled data or performing knowledge distillation on target-language unlabeled data. However, these methods may suffer from label noise due to the… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: ACL 2023. Our code is available at https://github.com/microsoft/vert-papers/tree/master/papers/CoLaDa

  49. arXiv:2305.12087  [pdf, other

    cs.LG

    Semi-Supervised Graph Imbalanced Regression

    Authors: Gang Liu, Tong Zhao, Eric Inae, Tengfei Luo, Meng Jiang

    Abstract: Data imbalance is easily found in annotated data when the observations of certain continuous label values are difficult to collect for regression tasks. When they come to molecule and polymer property predictions, the annotated graph datasets are often small because labeling them requires expensive equipment and effort. To address the lack of examples of rare label values in graph regression tasks… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted by KDD 2023. 17 pages, 5 figures, 10 tables

  50. Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

    Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe , et al. (977 additional authors not shown)

    Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71