Search | arXiv e-print repository

HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis… ▽ More Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervised HSI restoration framework with pre-trained diffusion model (HIR-Diff), which restores the clean HSIs from the product of two low-rank components, i.e., the reduced image and the coefficient matrix. Specifically, the reduced image, which has a low spectral dimension, lies in the image field and can be inferred from our improved diffusion model where a new guidance function with total variation (TV) prior is designed to ensure that the reduced image can be well sampled. The coefficient matrix can be effectively pre-estimated based on singular value decomposition (SVD) and rank-revealing QR (RRQR) factorization. Furthermore, a novel exponential noise schedule is proposed to accelerate the restoration process (about 5$\times$ acceleration for denoising) with little performance decrease. Extensive experimental results validate the superiority of our method in both performance and speed on a variety of HSI restoration tasks, including HSI denoising, noisy HSI super-resolution, and noisy HSI inpainting. The code is available at https://github.com/LiPang/HIRDiff. △ Less

Submitted 24 February, 2024; originally announced February 2024.

arXiv:2311.08217 [pdf, other]

Peer is Your Pillar: A Data-unbalanced Conditional GANs for Few-shot Image Generation

Authors: Ziqiang Li, Chaoyue Wang, Xue Rui, Chao Xue, Jiaxu Leng, Bin Li

Abstract: Few-shot image generation aims to train generative models using a small number of training images. When there are few images available for training (e.g. 10 images), Learning From Scratch (LFS) methods often generate images that closely resemble the training data while Transfer Learning (TL) methods try to improve performance by leveraging prior knowledge from GANs pre-trained on large-scale datas… ▽ More Few-shot image generation aims to train generative models using a small number of training images. When there are few images available for training (e.g. 10 images), Learning From Scratch (LFS) methods often generate images that closely resemble the training data while Transfer Learning (TL) methods try to improve performance by leveraging prior knowledge from GANs pre-trained on large-scale datasets. However, current TL methods may not allow for sufficient control over the degree of knowledge preservation from the source model, making them unsuitable for setups where the source and target domains are not closely related. To address this, we propose a novel pipeline called Peer is your Pillar (PIP), which combines a target few-shot dataset with a peer dataset to create a data-unbalanced conditional generation. Our approach includes a class embedding method that separates the class space from the latent space, and we use a direction loss based on pre-trained CLIP to improve image diversity. Experiments on various few-shot datasets demonstrate the advancement of the proposed PIP, especially reduces the training requirements of few-shot image generation. △ Less

Submitted 14 November, 2023; originally announced November 2023.

Comments: Under Review

arXiv:2310.15432 [pdf, other]

A Review of Economic Incentives for Efficient Operation of Flexible Transmission

Authors: Xinyang Rui, Omid Mirzapour, Brittany Pruneau, Mostafa Sahraei-Ardakani

Abstract: The growing penetration of renewable energy requires upgrades to the transmission network to ensure the deliverability of renewable generation. As an efficient alternative to transmission expansion, flexible transmission technologies, whose benefits have been widely studied, can alleviate transmission system congestion and enhance renewable energy integration. However, under the current market str… ▽ More The growing penetration of renewable energy requires upgrades to the transmission network to ensure the deliverability of renewable generation. As an efficient alternative to transmission expansion, flexible transmission technologies, whose benefits have been widely studied, can alleviate transmission system congestion and enhance renewable energy integration. However, under the current market structure, investments for these technologies only receive a regulated rate of return, providing little to no incentive for efficient operation. Additionally, a regulated rate of return creates an incentive for building more transmission lines rather than efficient utilization of the existing system. Therefore, investments in flexible transmission technologies remain rather limited. To facilitate the deployment of flexible transmission, improve system efficiency, and accommodate renewable energy integration, a proper incentive structure for flexible transmission technologies, compatible with the current market design, is vital. This paper reviews the current market-based mechanisms for various flexible transmission technologies, including impedance control, dynamic line rating, and transmission switching. This review pinpoints current challenges of the market-based operation of flexible transmission and provides insights for future endeavors in designing efficient price signals for flexible transmission operation. △ Less

Submitted 23 October, 2023; originally announced October 2023.

Comments: 2023 55th North American Power Symposium (NAPS)

arXiv:2310.08691 [pdf, other]

Flexible Transmission: A Comprehensive Review of Concepts, Technologies, and Market

Authors: Omid Mirzapour, Xinyang Rui, Brittany Pruneau, Mostafa Sahraei-Ardakani

Abstract: As global concerns regarding climate change are increasing worldwide, the transition towards clean energy sources has accelerated. Accounting for a large share of energy consumption, the electricity sector is experiencing a significant shift towards renewable energy sources. To accommodate this rapid shift, the transmission system requires major upgrades. Although enhancing grid capacity through t… ▽ More As global concerns regarding climate change are increasing worldwide, the transition towards clean energy sources has accelerated. Accounting for a large share of energy consumption, the electricity sector is experiencing a significant shift towards renewable energy sources. To accommodate this rapid shift, the transmission system requires major upgrades. Although enhancing grid capacity through transmission system expansion is always a solution, this solution is very costly and requires a protracted permitting process. The concept of flexible transmission encompasses a broad range of technologies and market tools that enable effective reconfiguration and manipulation of the power grid for leveraged dispatch of renewable energy resources. The proliferation of such technologies allows for enhanced transfer capability over the current transmission network, thus reducing the need for grid expansion projects. This paper comprehensively reviews flexible transmission technologies and their role in achieving a net-zero carbon emission grid vision. Flexible transmission definitions from different viewpoints are discussed, and mathematical measures to quantify grid flexibility are reviewed. An extensive range of technologies enhancing flexibility across the grid is introduced and explored in detail. The environmental impacts of flexible transmission, including renewable energy utilization and carbon emission reduction, are presented. Finally, market models required for creating proper incentives for the deployment of flexible transmission and regulatory barriers and challenges are discussed. △ Less

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2309.01958 [pdf, other]

Empowering Low-Light Image Enhancer through Customized Learnable Priors

Authors: Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

Abstract: Deep neural networks have achieved remarkable progress in enhancing low-light images by improving their brightness and eliminating noise. However, most existing methods construct end-to-end map** networks heuristically, neglecting the intrinsic prior of image enhancement task and lacking transparency and interpretability. Although some unfolding solutions have been proposed to relieve these issu… ▽ More Deep neural networks have achieved remarkable progress in enhancing low-light images by improving their brightness and eliminating noise. However, most existing methods construct end-to-end map** networks heuristically, neglecting the intrinsic prior of image enhancement task and lacking transparency and interpretability. Although some unfolding solutions have been proposed to relieve these issues, they rely on proximal operator networks that deliver ambiguous and implicit priors. In this work, we propose a paradigm for low-light image enhancement that explores the potential of customized learnable priors to improve the transparency of the deep unfolding paradigm. Motivated by the powerful feature representation capability of Masked Autoencoder (MAE), we customize MAE-based illumination and noise priors and redevelop them from two perspectives: 1) \textbf{structure flow}: we train the MAE from a normal-light image to its illumination properties and then embed it into the proximal operator design of the unfolding architecture; and m2) \textbf{optimization flow}: we train MAE from a normal-light image to its gradient representation and then employ it as a regularization term to constrain noise in the model output. These designs improve the interpretability and representation capability of the model.Extensive experiments on multiple low-light image enhancement datasets demonstrate the superiority of our proposed paradigm over state-of-the-art methods. Code is available at https://github.com/zheng980629/CUE. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: Accepted by ICCV 2023

arXiv:2308.10576 [pdf, other]

Incorprating Prompt tuning for Commit classification with prior Knowledge

Authors: Jiajun Tong, Xiaobin Rui

Abstract: Commit Classification(CC) is an important task in software maintenance since it helps software developers classify code changes into different types according to their nature and purpose. This allows them to better understand how their development efforts are progressing, identify areas where they need improvement. However, existing methods are all discriminative models, usually with complex archi… ▽ More Commit Classification(CC) is an important task in software maintenance since it helps software developers classify code changes into different types according to their nature and purpose. This allows them to better understand how their development efforts are progressing, identify areas where they need improvement. However, existing methods are all discriminative models, usually with complex architectures that require additional output layers to produce class label probabilities. Moreover, they require a large amount of labeled data for fine-tuning, and it is difficult to learn effective classification boundaries in the case of limited labeled data. To solve above problems, we propose a generative framework that Incorporating prompt-tuning for commit classification with prior knowledge (IPCK) https://github.com/AppleMax1992/IPCK, which simplifies the model structure and learns features across different tasks. It can still reach the SOTA performance with only limited samples. Firstly, we proposed a generative framework based on T5. This encoder-decoder construction method unifies different CC task into a text2text problem, which simplifies the structure of the model by not requiring an extra output layer. Second, instead of fine-tuning, we design an prompt-tuning solution which can be adopted in few-shot scenarios with only limit samples. Furthermore, we incorporate prior knowledge via an external knowledge graph to map the probabilities of words into the final labels in the speech machine step to improve performance in few-shot scenarios. Extensive experiments on two open available datasets show that our framework can solve the CC problem simply but effectively in few-shot and zeroshot scenarios, while improving the adaptability of the model without requiring a large amount of training samples for fine-tuning. △ Less

Submitted 26 October, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

arXiv:2308.08263 [pdf, other]

Boosting Commit Classification with Contrastive Learning

Authors: Jiajun Tong, Zhixiao Wang, Xiaobin Rui

Abstract: Commit Classification (CC) is an important task in software maintenance, which helps software developers classify code changes into different types according to their nature and purpose. It allows developers to understand better how their development efforts are progressing, identify areas where they need improvement, and make informed decisions about when and how to release new software versions.… ▽ More Commit Classification (CC) is an important task in software maintenance, which helps software developers classify code changes into different types according to their nature and purpose. It allows developers to understand better how their development efforts are progressing, identify areas where they need improvement, and make informed decisions about when and how to release new software versions. However, existing models need lots of manually labeled data for fine-tuning processes, and ignore sentence-level semantic information, which is often essential for discovering the difference between diverse commits. Therefore, it is still challenging to solve CC in fewshot scenario. To solve the above problems, we propose a contrastive learning-based commit classification framework. Firstly, we generate $K$ sentences and pseudo-labels according to the labels of the dataset, which aims to enhance the dataset. Secondly, we randomly group the augmented data $N$ times to compare their similarity with the positive $T_p^{|C|}$ and negative $T_n^{|C|}$ samples. We utilize individual pretrained sentence transformers (ST)s to efficiently obtain the sentence-level embeddings from different features respectively. Finally, we adopt the cosine similarity function to limit the distribution of vectors, similar vectors are more adjacent. The light fine-tuned model is then applied to the label prediction of incoming commits. Extensive experiments on two open available datasets demonstrate that our framework can solve the CC problem simply but effectively in fewshot scenarios, while achieving state-of-the-art(SOTA) performance and improving the adaptability of the model without requiring a large number of training samples for fine-tuning. The code, data, and trained models are available at https://github.com/AppleMax1992/CommitFit. △ Less

Submitted 16 August, 2023; originally announced August 2023.

arXiv:2307.08991 [pdf, other]

EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps

Authors: Yuzhe He, Shuang Liang, Xiaofei Rui, Chengying Cai, Guowei Wan

Abstract: Accurate and reliable ego-localization is critical for autonomous driving. In this paper, we present EgoVM, an end-to-end localization network that achieves comparable localization accuracy to prior state-of-the-art methods, but uses lightweight vectorized maps instead of heavy point-based maps. To begin with, we extract BEV features from online multi-view images and LiDAR point cloud. Then, we em… ▽ More Accurate and reliable ego-localization is critical for autonomous driving. In this paper, we present EgoVM, an end-to-end localization network that achieves comparable localization accuracy to prior state-of-the-art methods, but uses lightweight vectorized maps instead of heavy point-based maps. To begin with, we extract BEV features from online multi-view images and LiDAR point cloud. Then, we employ a set of learnable semantic embeddings to encode the semantic types of map elements and supervise them with semantic segmentation, to make their feature representation consistent with BEV features. After that, we feed map queries, composed of learnable semantic embeddings and coordinates of map elements, into a transformer decoder to perform cross-modality matching with BEV features. Finally, we adopt a robust histogram-based pose solver to estimate the optimal pose by searching exhaustively over candidate poses. We comprehensively validate the effectiveness of our method using both the nuScenes dataset and a newly collected dataset. The experimental results show that our method achieves centimeter-level localization accuracy, and outperforms existing methods using vectorized maps by a large margin. Furthermore, our model has been extensively tested in a large fleet of autonomous vehicles under various challenging urban scenes. △ Less

Submitted 18 July, 2023; originally announced July 2023.

Comments: 8 pages

arXiv:2306.17797 [pdf, other]

HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising

Authors: Li Pang, Weizhen Gu, Xiangyong Cao, Xiangyu Rui, Jiangjun Peng, Shuang Xu, Gang Yang, Deyu Meng

Abstract: Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, current deep learning-based approaches ignore this fact and restore the clean image with deterministic map** (i.e., the network receives a noisy HSI and outputs a clean HSI). To alleviate this issue, this paper proposes a flow-based HSI denoising network (HIDFlowNet)… ▽ More Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, current deep learning-based approaches ignore this fact and restore the clean image with deterministic map** (i.e., the network receives a noisy HSI and outputs a clean HSI). To alleviate this issue, this paper proposes a flow-based HSI denoising network (HIDFlowNet) to directly learn the conditional distribution of the clean HSI given the noisy HSI and thus diverse clean HSIs can be sampled from the conditional distribution. Overall, our HIDFlowNet is induced from the flow methodology and contains an invertible decoder and a conditional encoder, which can fully decouple the learning of low-frequency and high-frequency information of HSI. Specifically, the invertible decoder is built by staking a succession of invertible conditional blocks (ICBs) to capture the local high-frequency details since the invertible network is information-lossless. The conditional encoder utilizes down-sampling operations to obtain low-resolution images and uses transformers to capture correlations over a long distance so that global low-frequency information can be effectively extracted. Extensive experimental results on simulated and real HSI datasets verify the superiority of our proposed HIDFlowNet compared with other state-of-the-art methods both quantitatively and visually. △ Less

Submitted 20 June, 2023; originally announced June 2023.

Comments: 10 pages, 8 figures

arXiv:2306.08313 [pdf, other]

A Proxy Attack-Free Strategy for Practically Improving the Poisoning Efficiency in Backdoor Attacks

Authors: Ziqiang Li, Hong Sun, Pengfei Xia, Beihao Xia, Xue Rui, Wei Zhang, Qinglang Guo, Bin Li

Abstract: Poisoning efficiency plays a critical role in poisoning-based backdoor attacks. To evade detection, attackers aim to use the fewest poisoning samples while achieving the desired attack strength. Although efficient triggers have significantly improved poisoning efficiency, there is still room for further enhancement. Recently, selecting efficient samples has shown promise, but it often requires a p… ▽ More Poisoning efficiency plays a critical role in poisoning-based backdoor attacks. To evade detection, attackers aim to use the fewest poisoning samples while achieving the desired attack strength. Although efficient triggers have significantly improved poisoning efficiency, there is still room for further enhancement. Recently, selecting efficient samples has shown promise, but it often requires a proxy backdoor injection task to identify an efficient poisoning sample set. However, the proxy attack-based approach can lead to performance degradation if the proxy attack settings differ from those used by the actual victims due to the shortcut of backdoor learning. This paper presents a Proxy attack-Free Strategy (PFS) designed to identify efficient poisoning samples based on individual similarity and ensemble diversity, effectively addressing the mentioned concern. The proposed PFS is motivated by the observation that selecting the to-be-poisoned samples with high similarity between clean samples and their corresponding poisoning samples results in significantly higher attack success rates compared to using samples with low similarity. Furthermore, theoretical analyses for this phenomenon are provided based on the theory of active learning and neural tangent kernel. We comprehensively evaluate the proposed strategy across various datasets, triggers, poisoning rates, architectures, and training hyperparameters. Our experimental results demonstrate that PFS enhances backdoor attack efficiency, while also exhibiting a remarkable speed advantage over prior proxy-dependent selection methodologies. △ Less

Submitted 25 April, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: Under review

arXiv:2306.06820 [pdf, ps, other]

Scalable Fair Influence Maximization

Authors: Xiaobin Rui, Zhixiao Wang, Jiayu Zhao, Lichao Sun, Wei Chen

Abstract: Given a graph $G$, a community structure $\mathcal{C}$, and a budget $k$, the fair influence maximization problem aims to select a seed set $S$ ($|S|\leq k$) that maximizes the influence spread while narrowing the influence gap between different communities. While various fairness notions exist, the welfare fairness notion, which balances fairness level and influence spread, has shown promising ef… ▽ More Given a graph $G$, a community structure $\mathcal{C}$, and a budget $k$, the fair influence maximization problem aims to select a seed set $S$ ($|S|\leq k$) that maximizes the influence spread while narrowing the influence gap between different communities. While various fairness notions exist, the welfare fairness notion, which balances fairness level and influence spread, has shown promising effectiveness. However, the lack of efficient algorithms for optimizing the welfare fairness objective function restricts its application to small-scale networks with only a few hundred nodes. In this paper, we adopt the objective function of welfare fairness to maximize the exponentially weighted summation over the influenced fraction of all communities. We first introduce an unbiased estimator for the fractional power of the arithmetic mean. Then, by adapting the reverse influence sampling (RIS) approach, we convert the optimization problem to a weighted maximum coverage problem. We also analyze the number of reverse reachable sets needed to approximate the fair influence at a high probability. Further, we present an efficient algorithm that guarantees $1-1/e - \varepsilon$ approximation. △ Less

Submitted 21 November, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

arXiv:2305.10925 [pdf, other]

Unsupervised Hyperspectral Pansharpening via Low-rank Diffusion Model

Authors: Xiangyu Rui, Xiangyong Cao, Li Pang, Zeyu Zhu, Zongsheng Yue, Deyu Meng

Abstract: Hyperspectral pansharpening is a process of merging a high-resolution panchromatic (PAN) image and a low-resolution hyperspectral (LRHS) image to create a single high-resolution hyperspectral (HRHS) image. Existing Bayesian-based HS pansharpening methods require designing handcraft image prior to characterize the image features, and deep learning-based HS pansharpening methods usually require a la… ▽ More Hyperspectral pansharpening is a process of merging a high-resolution panchromatic (PAN) image and a low-resolution hyperspectral (LRHS) image to create a single high-resolution hyperspectral (HRHS) image. Existing Bayesian-based HS pansharpening methods require designing handcraft image prior to characterize the image features, and deep learning-based HS pansharpening methods usually require a large number of paired training data and suffer from poor generalization ability. To address these issues, in this work, we propose a low-rank diffusion model for hyperspectral pansharpening by simultaneously leveraging the power of the pre-trained deep diffusion model and better generalization ability of Bayesian methods. Specifically, we assume that the HRHS image can be recovered from the product of two low-rank tensors, i.e., the base tensor and the coefficient matrix. The base tensor lies on the image field and has a low spectral dimension. Thus, we can conveniently utilize a pre-trained remote sensing diffusion model to capture its image structures. Additionally, we derive a simple yet quite effective way to pre-estimate the coefficient matrix from the observed LRHS image, which preserves the spectral information of the HRHS. Experimental results demonstrate that the proposed method performs better than some popular traditional approaches and gains better generalization ability than some DL-based methods. The code is released in https://github.com/xyrui/PLRDiff. △ Less

Submitted 19 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

arXiv:2303.16438 [pdf, other]

Random Weights Networks Work as Loss Prior Constraint for Image Restoration

Authors: Man Zhou, Naishan Zheng, Jie Huang, Xiangyu Rui, Chunle Guo, Deyu Meng, Chongyi Li, **wei Gu

Abstract: In this paper, orthogonal to the existing data and model studies, we instead resort our efforts to investigate the potential of loss function in a new perspective and present our belief ``Random Weights Networks can Be Acted as Loss Prior Constraint for Image Restoration''. Inspired by Functional theory, we provide several alternative solutions to implement our belief in the strict mathematical ma… ▽ More In this paper, orthogonal to the existing data and model studies, we instead resort our efforts to investigate the potential of loss function in a new perspective and present our belief ``Random Weights Networks can Be Acted as Loss Prior Constraint for Image Restoration''. Inspired by Functional theory, we provide several alternative solutions to implement our belief in the strict mathematical manifolds including Taylor's Unfolding Network, Invertible Neural Network, Central Difference Convolution and Zero-order Filtering as ``random weights network prototype'' with respect of the following four levels: 1) the different random weights strategies; 2) the different network architectures, \emph{eg,} pure convolution layer or transformer; 3) the different network architecture depths; 4) the different numbers of random weights network combination. Furthermore, to enlarge the capability of the randomly initialized manifolds, we devise the manner of random weights in the following two variants: 1) the weights are randomly initialized only once during the whole training procedure; 2) the weights are randomly initialized at each training iteration epoch. Our propose belief can be directly inserted into existing networks without any training and testing computational cost. Extensive experiments across multiple image restoration tasks, including image de-noising, low-light image enhancement, guided image super-resolution demonstrate the consistent performance gains obtained by introducing our belief. To emphasize, our main focus is to spark the realms of loss function and save their current neglected status. Code will be publicly available. △ Less

Submitted 28 March, 2023; originally announced March 2023.

arXiv:2301.06081 [pdf, other]

A Hyper-weight Network for Hyperspectral Image Denoising

Authors: Xiangyu Rui, Xiangyong Cao, Jun Shu, Qian Zhao, Deyu Meng

Abstract: In the hyperspectral image (HSI) denoising task, the real noise embedded in the HSI is always complex and diverse so that many model-based HSI denoising methods only perform well on some specific noisy HSIs. To enhance the noise adaptation capability of current methods, we first resort to the weighted HSI denoising model since its weight is capable of characterizing the noise in different position… ▽ More In the hyperspectral image (HSI) denoising task, the real noise embedded in the HSI is always complex and diverse so that many model-based HSI denoising methods only perform well on some specific noisy HSIs. To enhance the noise adaptation capability of current methods, we first resort to the weighted HSI denoising model since its weight is capable of characterizing the noise in different positions of the image. However, the weight in these weighted models is always determined by an empirical updating formula, which does not fully utilize the noise information contained in noisy images and thus limits their performance improvement. In this work, we propose an automatic weighting scheme to alleviate this issue. Specifically, the weight in the weighted model is predicted by a hyper-weight network (i.e., HWnet), which can be learned in a bi-level optimization framework based on the data-driven methodology. The learned HWnet can be explicitly plugged into other weighted denoising models, and help adjust weights for different noisy HSIs and different weighted models. Extensive experiments verify that the proposed HWnet can help improve the generalization ability of a weighted model to adapt to more complex noise, and can also strengthen the weighted model by transferring the knowledge from another weighted model. Additionally, to explain the experimental results, we also theoretically prove the training error and generalization error upper bound of the proposed HWnet, which should be the first generalization error analysis in the low-level vision field as far as we know. △ Less

Submitted 8 December, 2022; originally announced January 2023.

Comments: 16 pages

arXiv:2212.03538 [pdf]

Hysteretic Electronic Phase Transitions in Correlated Charge-Density-Wave State of 1T-TaS2

Authors: Geng Yanyan, Lei Le, Dong Haoyu, Guo Jianfeng, Mi Shuo, Li Yan, Huang Li, Pang Fei, Xu Rui, Zhou Weichang, Liu Zheng, Ji Wei, Cheng Zhihai

Abstract: Recently, many exotic electronic states, such as quantum spin liquid (QSL) and superconductivity (SC), have been extensively discovered and introduced in layered transition metal dichalcogenides 1T-TaS2 by controlling their complex correlated charge-density-wave (CDW) states. However, few studies have focused on its hysteretic electronic phase transitions based on the in-depth discussion of the de… ▽ More Recently, many exotic electronic states, such as quantum spin liquid (QSL) and superconductivity (SC), have been extensively discovered and introduced in layered transition metal dichalcogenides 1T-TaS2 by controlling their complex correlated charge-density-wave (CDW) states. However, few studies have focused on its hysteretic electronic phase transitions based on the in-depth discussion of the delicate interplay among temperature-dependent electronic interactions. Here, we reported a sequence of spatial electronic phase transitions in the hysteresis temperature range of 1T-TaS2 via variable-temperature scanning tunneling microscopy (VT-STM). The emergence, evolution, coexistence, and separation of diverse novel electronic states within the commensurate CDW/triclinic CDW (CCDW/TCDW) phase are investigated in detail through the warming/cooling process. These novel emergent electronic states can be attributed to the delicate temperature-dependent competition and/or cooperation of interlayer interactions, intralayer electron-electron correlation, and electron-phonon (e-ph) coupling of 1T-TaS2. Our results not only provide a novel insight to understand the hysteretic electronic phase transitions of correlated CDW state, but also pave a way to realize more exotic quantum states by accurately and effectively controlling various interactions in correlated materials. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: 20 pages, 5 figures,

arXiv:2211.01825 [pdf, other]

doi 10.1109/TGRS.2022.3229012

Fast Noise Removal in Hyperspectral Images via Representative Coefficient Total Variation

Authors: Jiangjun Peng, Hailin Wang, Xiangyong Cao, Xinlin Liu, Xiangyu Rui, Deyu Meng

Abstract: Mining structural priors in data is a widely recognized technique for hyperspectral image (HSI) denoising tasks, whose typical ways include model-based methods and data-based methods. The model-based methods have good generalization ability, while the runtime cannot meet the fast processing requirements of the practical situations due to the large size of an HSI data… ▽ More Mining structural priors in data is a widely recognized technique for hyperspectral image (HSI) denoising tasks, whose typical ways include model-based methods and data-based methods. The model-based methods have good generalization ability, while the runtime cannot meet the fast processing requirements of the practical situations due to the large size of an HSI data $ \mathbf{X} \in \mathbb{R}^{MN\times B}$. For the data-based methods, they perform very fast on new test data once they have been trained. However, their generalization ability is always insufficient. In this paper, we propose a fast model-based HSI denoising approach. Specifically, we propose a novel regularizer named Representative Coefficient Total Variation (RCTV) to simultaneously characterize the low rank and local smooth properties. The RCTV regularizer is proposed based on the observation that the representative coefficient matrix $\mathbf{U}\in\mathbb{R}^{MN\times R} (R\ll B)$ obtained by orthogonally transforming the original HSI $\mathbf{X}$ can inherit the strong local-smooth prior of $\mathbf{X}$. Since $R/B$ is very small, the HSI denoising model based on the RCTV regularizer has lower time complexity. Additionally, we find that the representative coefficient matrix $\mathbf{U}$ is robust to noise, and thus the RCTV regularizer can somewhat promote the robustness of the HSI denoising model. Extensive experiments on mixed noise removal demonstrate the superiority of the proposed method both in denoising performance and denoising speed compared with other state-of-the-art methods. Remarkably, the denoising speed of our proposed method outperforms all the model-based techniques and is comparable with the deep learning-based approaches. △ Less

Submitted 3 November, 2022; originally announced November 2022.

Comments: 16 pages, 18 figures, 5 tables, 1 theorem

arXiv:2209.00892 [pdf, other]

doi 10.1145/3539597.3570416

Scalable Adversarial Attack Algorithms on Influence Maximization

Authors: Lichao Sun, Xiaobin Rui, Wei Chen

Abstract: In this paper, we study the adversarial attacks on influence maximization under dynamic influence propagation models in social networks. In particular, given a known seed set S, the problem is to minimize the influence spread from S by deleting a limited number of nodes and edges. This problem reflects many application scenarios, such as blocking virus (e.g. COVID-19) propagation in social network… ▽ More In this paper, we study the adversarial attacks on influence maximization under dynamic influence propagation models in social networks. In particular, given a known seed set S, the problem is to minimize the influence spread from S by deleting a limited number of nodes and edges. This problem reflects many application scenarios, such as blocking virus (e.g. COVID-19) propagation in social networks by quarantine and vaccination, blocking rumor spread by freezing fake accounts, or attacking competitor's influence by incentivizing some users to ignore the information from the competitor. In this paper, under the linear threshold model, we adapt the reverse influence sampling approach and provide efficient algorithms of sampling valid reverse reachable paths to solve the problem. We present three different design choices on reverse sampling, which all guarantee $1/2 - \varepsilon$ approximation (for any small $\varepsilon >0$) and an efficient running time. △ Less

Submitted 17 December, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

Comments: 11 pages, 2 figures

arXiv:2208.14601 [pdf]

A topic-aware graph neural network model for knowledge base updating

Authors: Jiajun Tong, Zhixiao Wang, Xiaobin Rui

Abstract: The open domain knowledge base is very important. It is usually extracted from encyclopedia websites and is widely used in knowledge retrieval systems, question answering systems, or recommendation systems. In practice, the key challenge is to maintain an up-to-date knowledge base. Different from Unwieldy fetching all of the data from the encyclopedia dumps, to enlarge the freshness of the knowled… ▽ More The open domain knowledge base is very important. It is usually extracted from encyclopedia websites and is widely used in knowledge retrieval systems, question answering systems, or recommendation systems. In practice, the key challenge is to maintain an up-to-date knowledge base. Different from Unwieldy fetching all of the data from the encyclopedia dumps, to enlarge the freshness of the knowledge base as big as possible while avoiding invalid fetching, the current knowledge base updating methods usually determine whether entities need to be updated by building a prediction model. However, these methods can only be defined in some specific fields and the result turns out to be obvious bias, due to the problem of data source and data structure. The users' query intentions are often diverse as to the open domain knowledge, so we construct a topic-aware graph network for knowledge updating based on the user query log. Our methods can be summarized as follow: 1. Extract entities through the user's log and select them as seeds 2. Scrape the attributes of seed entities in the encyclopedia website, and self-supervised construct the entity attribute graph for each entity. 3. Use the entity attribute graph to train the GNN entity update model to determine whether the entity needs to be synchronized. 4.Use the encyclopedia knowledge to match and update the filtered entity with the entity in the knowledge base according to the minimum edit times algorithm. △ Less

Submitted 1 September, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

arXiv:2206.12027 [pdf]

A multi-model-based deep learning framework for short text multiclass classification with the imbalanced and extremely small data set

Authors: Jiajun Tong, Zhixiao Wang, Xiaobin Rui

Abstract: Text classification plays an important role in many practical applications. In the real world, there are extremely small datasets. Most existing methods adopt pre-trained neural network models to handle this kind of dataset. However, these methods are either difficult to deploy on mobile devices because of their large output size or cannot fully extract the deep semantic information between phrase… ▽ More Text classification plays an important role in many practical applications. In the real world, there are extremely small datasets. Most existing methods adopt pre-trained neural network models to handle this kind of dataset. However, these methods are either difficult to deploy on mobile devices because of their large output size or cannot fully extract the deep semantic information between phrases and clauses. This paper proposes a multimodel-based deep learning framework for short-text multiclass classification with an imbalanced and extremely small data set. Our framework mainly includes five layers: The encoder layer uses DISTILBERT to obtain context-sensitive dynamic word vectors that are difficult to represent in traditional feature engineering methods. Since the transformer part of this layer is distilled, our framework is compressed. Then, we use the next two layers to extract deep semantic information. The output of the encoder layer is sent to a bidirectional LSTM network, and the feature matrix is extracted hierarchically through the LSTM at the word and sentence level to obtain the fine-grained semantic representation. After that, the max-pooling layer converts the feature matrix into a lower-dimensional matrix, preserving only the obvious features. Finally, the feature matrix is taken as the input of a fully connected softmax layer, which contains a function that can convert the predicted linear vector into the output value as the probability of the text in each classification. Extensive experiments on two public benchmarks demonstrate the effectiveness of our proposed approach on an extremely small data set. It retains the state-of-the-art baseline performance in terms of precision, recall, accuracy, and F1 score, and through the model size, training time, and convergence epoch, we can conclude that our method can be deployed faster and lighter on mobile devices. △ Less

Submitted 23 June, 2022; originally announced June 2022.

arXiv:2112.10728 [pdf, other]

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Authors: Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander Schwing, Heng Ji

Abstract: Recently, there has been an increasing interest in building question answering (QA) models that reason across multiple modalities, such as text and images. However, QA using images is often limited to just picking the answer from a pre-defined set of options. In addition, images in the real world, especially in news, have objects that are co-referential to the text, with complementary information… ▽ More Recently, there has been an increasing interest in building question answering (QA) models that reason across multiple modalities, such as text and images. However, QA using images is often limited to just picking the answer from a pre-defined set of options. In addition, images in the real world, especially in news, have objects that are co-referential to the text, with complementary information from both modalities. In this paper, we present a new QA evaluation benchmark with 1,384 questions over news articles that require cross-media grounding of objects in images onto text. Specifically, the task involves multi-hop questions that require reasoning over image-caption pairs to identify the grounded visual object being referred to and then predicting a span from the news body text to answer the question. In addition, we introduce a novel multimedia data augmentation framework, based on cross-media knowledge extraction and synthetic question-answer generation, to automatically augment data that can provide weak supervision for this task. We evaluate both pipeline-based and end-to-end pretraining-based multimedia QA models on our benchmark, and show that they achieve promising performance, while considerably lagging behind human performance hence leaving large room for future work on this challenging new task. △ Less

Submitted 4 May, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

Comments: Accepted at AAAI 2022

arXiv:2103.11093 [pdf, other]

Exploring The Effect of High-frequency Components in GANs Training

Authors: Ziqiang Li, Pengfei Xia, Xue Rui, Bin Li

Abstract: Generative Adversarial Networks (GANs) have the ability to generate images that are visually indistinguishable from real images. However, recent studies have revealed that generated and real images share significant differences in the frequency domain. In this paper, we explore the effect of high-frequency components in GANs training. According to our observation, during the training of most GANs,… ▽ More Generative Adversarial Networks (GANs) have the ability to generate images that are visually indistinguishable from real images. However, recent studies have revealed that generated and real images share significant differences in the frequency domain. In this paper, we explore the effect of high-frequency components in GANs training. According to our observation, during the training of most GANs, severe high-frequency differences make the discriminator focus on high-frequency components excessively, which hinders the generator from fitting the low-frequency components that are important for learning images' content. Then, we propose two simple yet effective frequency operations for eliminating the side effects caused by high-frequency differences in GANs training: High-Frequency Confusion (HFC) and High-Frequency Filter (HFF). The proposed operations are general and can be applied to most existing GANs with a fraction of the cost. The advanced performance of the proposed operations is verified on multiple loss functions, network architectures, and datasets. Specifically, the proposed HFF achieves significant improvements of $42.5\%$ FID on CelebA (128*128) unconditional generation based on SNGAN, $30.2\%$ FID on CelebA unconditional generation based on SSGAN, and $69.3\%$ FID on CelebA unconditional generation based on InfoMAXGAN. △ Less

Submitted 29 November, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

arXiv:2003.03026 [pdf, other]

DA4AD: End-to-End Deep Attention-based Visual Localization for Autonomous Driving

Authors: Yao Zhou, Guowei Wan, Shenhua Hou, Li Yu, Gang Wang, Xiaofei Rui, Shiyu Song

Abstract: We present a visual localization framework based on novel deep attention aware features for autonomous driving that achieves centimeter level localization accuracy. Conventional approaches to the visual localization problem rely on handcrafted features or human-made objects on the road. They are known to be either prone to unstable matching caused by severe appearance or lighting changes, or too s… ▽ More We present a visual localization framework based on novel deep attention aware features for autonomous driving that achieves centimeter level localization accuracy. Conventional approaches to the visual localization problem rely on handcrafted features or human-made objects on the road. They are known to be either prone to unstable matching caused by severe appearance or lighting changes, or too scarce to deliver constant and robust localization results in challenging scenarios. In this work, we seek to exploit the deep attention mechanism to search for salient, distinctive and stable features that are good for long-term matching in the scene through a novel end-to-end deep neural network. Furthermore, our learned feature descriptors are demonstrated to be competent to establish robust matches and therefore successfully estimate the optimal camera poses with high precision. We comprehensively validate the effectiveness of our method using a freshly collected dataset with high-quality ground truth trajectories and hardware synchronization between sensors. Results demonstrate that our method achieves a competitive localization accuracy when compared to the LiDAR-based localization solutions under various challenging circumstances, leading to a potential low-cost localization solution for autonomous driving. △ Less

Submitted 13 July, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

Comments: 19 pages, 4 figures, Accepted by ECCV 2020

arXiv:1905.05177 [pdf, other]

doi 10.1109/TNNLS.2014.2377211

A Distributed Approach towards Discriminative Distance Metric Learning

Authors: Jun Li, Xun Lin, Xiaoguang Rui, Yong Rui, Dacheng Tao

Abstract: Distance metric learning is successful in discovering intrinsic relations in data. However, most algorithms are computationally demanding when the problem size becomes large. In this paper, we propose a discriminative metric learning algorithm, and develop a distributed scheme learning metrics on moderate-sized subsets of data, and aggregating the results into a global solution. The technique leve… ▽ More Distance metric learning is successful in discovering intrinsic relations in data. However, most algorithms are computationally demanding when the problem size becomes large. In this paper, we propose a discriminative metric learning algorithm, and develop a distributed scheme learning metrics on moderate-sized subsets of data, and aggregating the results into a global solution. The technique leverages the power of parallel computation. The algorithm of the aggregated distance metric learning (ADML) scales well with the data size and can be controlled by the partition. We theoretically analyse and provide bounds for the error induced by the distributed treatment. We have conducted experimental evaluation of ADML, both on specially designed tests and on practical image annotation tasks. Those tests have shown that ADML achieves the state-of-the-art performance at only a fraction of the cost incurred by most existing methods. △ Less

Submitted 11 May, 2019; originally announced May 2019.

arXiv:1811.09769 [pdf]

doi 10.1017/S1431927618000910

Atomic-resolution study of oxygen vacancy ordering in $La_{0.5}$$Sr_{0.5}$Co$O_{3-δ}$ thin films on SrTi$O_{3}$ during in-situ cooling experiments

Authors: Xue Rui, Robert Klie

Abstract: The presence of oxygen vacancy, as well as ordering of vacancies plays an important role in determining the electronic, ionic and thermal transport properties of many transition metal oxide materials. Controlling the concentration of oxygen vacancies as well as the structures or domains of ordered oxygen vacancies has been the subject of many experimental and theoretical studies. In epitaxial thin… ▽ More The presence of oxygen vacancy, as well as ordering of vacancies plays an important role in determining the electronic, ionic and thermal transport properties of many transition metal oxide materials. Controlling the concentration of oxygen vacancies as well as the structures or domains of ordered oxygen vacancies has been the subject of many experimental and theoretical studies. In epitaxial thin films, the concentration of oxygen vacancies as well as the type of ordering depends on the structure of the support as well as the lattice mismatch between the thin films and the support. The role of temperature induced structural phase transitions on the oxygen vacancy ordering has remained largely unexplored. Here, we use aberration-corrected scanning transmission electron microscopy (STEM) combined with an in-situ cooling experiments to characterize the atomic/electronic structures of oxygen-deficient $La_{0.5}$$Sr_{0.5}$Co$O_{3-δ}$ thin films grown on SrTi$O_{3}$ across the anti-ferrodistortive phase transition of SrTi$O_{3}$ at 105 K. We demonstrate that atomic-resolution imaging and electron energy-loss spectroscopy (EELS) can be used to examine variations in the local density of states as a function of sample temperature and thus of the structure of the support. △ Less

Submitted 24 November, 2018; originally announced November 2018.

Comments: Oxygen vacancy orderings, Cyro-Temperature, cooling experiment, Scanning Transmission Electron Microscopy, Electron Energy Loss Spectroscopy

arXiv:1612.05610 [pdf, other]

doi 10.1103/PhysRevB.95.205131

Experimental verification of orbital engineering at the atomic scale: charge transfer and symmetry breaking in nickelate heterostructures

Authors: Patrick J. Phillips, Paolo Longo, Alexandru B. Georgescu, Eiji Okunishi, Xue Rui, Ankit S. Disa, Fred Walker, Sohrab Ismail-Beigi, Charles H. Ahn, Robert F. Klie

Abstract: Epitaxial strain, layer confinement and inversion symmetry breaking have emerged as powerful new approaches to control the electronic and atomic-scale structural properties in complex metal oxides. Nickelate heterostructures, based on RENiO$_3$, where RE is a trivalent rare-earth cation, have been shown to be relevant model systems since the orbital occupancy, degeneracy, and, consequently, the el… ▽ More Epitaxial strain, layer confinement and inversion symmetry breaking have emerged as powerful new approaches to control the electronic and atomic-scale structural properties in complex metal oxides. Nickelate heterostructures, based on RENiO$_3$, where RE is a trivalent rare-earth cation, have been shown to be relevant model systems since the orbital occupancy, degeneracy, and, consequently, the electronic/magnetic properties can be altered as a function of epitaxial strain, layer thickness and superlattice structure. One such recent example is the tri-component LaTiO$_3$-LaNiO$_3$-LaAlO$_3$ superlattice, which exhibits charge transfer and orbital polarization as the result of its interfacial dipole electric field. A crucial step towards control of these parameters for future electronic and magnetic device applications is to develop an understanding of both the magnitude and range of the octahedral network's response towards interfacial strain and electric fields. An approach that provides atomic-scale resolution and sensitivity towards the local octahedral distortions and orbital occupancy is therefore required. Here, we employ atomic-resolution imaging coupled with electron spectroscopies and first principles theory to examine the role of interfacial charge transfer and symmetry breaking in a tricomponent nickelate superlattice system. We find that nearly complete charge transfer occurs between the LaTiO$_3$ and LaNiO$_3$ layers, resulting in a Ni$^{2+}$ valence state. We further demonstrate that this charge transfer is highly localized with a range of about 1 unit cell, within the LaNiO$_3$ layers. The results presented here provide important feedback to synthesis efforts aimed at stabilizing new electronic phases that are not accessible by conventional bulk or epitaxial film approaches. △ Less

Submitted 16 December, 2016; originally announced December 2016.

Journal ref: Phys. Rev. B 95, 205131 (2017)

arXiv:1108.4794 [pdf]

Dielectric layer dependent surface plasmon effect of metallic nanoparticles on silicon substrate

Authors: Xu Rui, Wang Xiao-Dong, Liu Wen, Xu Xiao-Na, Li Yue-Qiang, Ji An, Yang Fu-Hua, Li **-Min

Abstract: The electromagnetic interaction between Ag nanoparticles on the top of the Si substrate and the incident light has been studied by numerical simulations. It is found that the presence of a dielectric layer with different thickness leads to varied resonance wavelength and scattering cross section, and consequently shifted photocurrent response over all wavelengths. These different behaviors are det… ▽ More The electromagnetic interaction between Ag nanoparticles on the top of the Si substrate and the incident light has been studied by numerical simulations. It is found that the presence of a dielectric layer with different thickness leads to varied resonance wavelength and scattering cross section, and consequently shifted photocurrent response over all wavelengths. These different behaviors are determined by whether the dielectric layer is beyond the domain where the near field of nanoparticles takes effect, and geometrical optics effects must be taken into account. It is revealed that for particle of a certain size, an appropriate dielectric layer thickness is desirable to achieve the best absorption performance. For a certain thickness of dielectric layer, an appropriate granular size is also desirable. These observations have substantial applications for the optimization of surface plasmon enhanced silicon solar cells. △ Less

Submitted 20 September, 2011; v1 submitted 24 August, 2011; originally announced August 2011.

arXiv:cond-mat/0302365 [pdf]

doi 10.1063/1.1606884

Improved Irreversibility Behaviour and Critical Current Density in MgB2-Diamond Nanocomposites

Authors: Y. Zhao, X. F. Rui, C. H. Cheng, H. Zhang, P. Munroe, H. M. Zeng, N. Koshizuka, M. Murakami

Abstract: MgB2-diamond nanocomposite superconductors have been synthesized by addition of nano-diamond powder. Microstructural analysis shows that the nanocomposite superconductor consists of tightly-packed MgB2 nano-grains (~50-100 nm) with highly-dispersed and uniformly-distributed diamond nanoparticles (~10-20 nm) inside the grains. The Jc-H and Hiir-T characteristics have been significantly improved i… ▽ More MgB2-diamond nanocomposite superconductors have been synthesized by addition of nano-diamond powder. Microstructural analysis shows that the nanocomposite superconductor consists of tightly-packed MgB2 nano-grains (~50-100 nm) with highly-dispersed and uniformly-distributed diamond nanoparticles (~10-20 nm) inside the grains. The Jc-H and Hiir-T characteristics have been significantly improved in this MgB2-diamond nanocomposite, compared to MgB2 bulk materials prepared by other techniques. Also, the Jc value of 1x104 A/cm2 at 20 K and 4 T and the Hirr value of 6.4 T at 20 K have been achieved △ Less

Submitted 18 February, 2003; originally announced February 2003.

Comments: 9 pages, 5 figures

arXiv:cond-mat/0302202 [pdf]

doi 10.1088/0953-2048/16/10/310

Do** Effect of Nano-Diamond on Superconductivity and Flux Pinning in MgB2

Authors: C. H. Cheng, H. Zhang, Y. Zhao, Y. Feng, X. F. Rui, P. Munroe, H. M. Zeng, N. Koshizuka, M. Murakami

Abstract: Do** effect of diamond nanoparticles on the superconducting properties of MgB2 bulk material has been studied. It is found that the superconducting transition temperature Tc of MgB2 is suppressed by the diamond-do**, however, the irreversibility field Hirr and the critical current density Jc are systematically enhanced. Microstructural analysis shows that the diamond-doped MgB2 superconducto… ▽ More Do** effect of diamond nanoparticles on the superconducting properties of MgB2 bulk material has been studied. It is found that the superconducting transition temperature Tc of MgB2 is suppressed by the diamond-do**, however, the irreversibility field Hirr and the critical current density Jc are systematically enhanced. Microstructural analysis shows that the diamond-doped MgB2 superconductor consists of tightly-packed MgB2 nano-grains (~50-100 nm) with highly-dispersed and uniformly-distributed diamond nanoparticles (~10-20 nm) inside the grains. High density of dislocations and diamond nanoparticles may take the responsibility for the enhanced flux pinning in the diamond-doped MgB2. △ Less

Submitted 11 February, 2003; originally announced February 2003.

Comments: 16 pages, 6 figures

Showing 1–28 of 28 results for author: Rui, X