Search | arXiv e-print repository

Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in The Dark

Authors: Huan Zheng, Zhao Zhang, Jicong Fan, Richang Hong, Yi Yang, Shuicheng Yan

Abstract: Low-light stereo image enhancement (LLSIE) is a relatively new task to enhance the quality of visually unpleasant stereo images captured in dark condition. However, current methods achieve inferior performance on detail recovery and illumination adjustment. We find it is because: 1) the insufficient single-scale inter-view interaction makes the cross-view cues unable to be fully exploited; 2) lack… ▽ More Low-light stereo image enhancement (LLSIE) is a relatively new task to enhance the quality of visually unpleasant stereo images captured in dark condition. However, current methods achieve inferior performance on detail recovery and illumination adjustment. We find it is because: 1) the insufficient single-scale inter-view interaction makes the cross-view cues unable to be fully exploited; 2) lacking long-range dependency leads to the inability to deal with the spatial long-range effects caused by illumination degradation. To alleviate such limitations, we propose a LLSIE model termed Decoupled Cross-scale Cross-view Interaction Network (DCI-Net). Specifically, we present a decoupled interaction module (DIM) that aims for sufficient dual-view information interaction. DIM decouples the dual-view information exchange into discovering multi-scale cross-view correlations and further exploring cross-scale information flow. Besides, we present a spatial-channel information mining block (SIMB) for intra-view feature extraction, and the benefits are twofold. One is the long-range dependency capture to build spatial long-range relationship, and the other is expanded channel information refinement that enhances information flow in channel dimension. Extensive experiments on Flickr1024, KITTI 2012, KITTI 2015 and Middlebury datasets show that our method obtains better illumination adjustment and detail recovery, and achieves SOTA performance compared to other related methods. Our codes, datasets and models will be publicly available. △ Less

Submitted 12 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

arXiv:2210.12487 [pdf, other]

MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure

Authors: Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu

Abstract: In this paper, we propose a comprehensive benchmark to investigate models' logical reasoning capabilities in complex real-life scenarios. Current explanation datasets often employ synthetic data with simple reasoning structures. Therefore, it cannot express more complex reasoning processes, such as the rebuttal to a reasoning step and the degree of certainty of the evidence. To this end, we propos… ▽ More In this paper, we propose a comprehensive benchmark to investigate models' logical reasoning capabilities in complex real-life scenarios. Current explanation datasets often employ synthetic data with simple reasoning structures. Therefore, it cannot express more complex reasoning processes, such as the rebuttal to a reasoning step and the degree of certainty of the evidence. To this end, we propose a comprehensive logical reasoning explanation form. Based on the multi-hop chain of reasoning, the explanation form includes three main components: (1) The condition of rebuttal that the reasoning node can be challenged; (2) Logical formulae that uncover the internal texture of reasoning nodes; (3) Reasoning strength indicated by degrees of certainty. The fine-grained structure conforms to the real logical reasoning scenario, better fitting the human cognitive process but, simultaneously, is more challenging for the current models. We evaluate the current best models' performance on this new explanation form. The experimental results show that generating reasoning graphs remains a challenging task for current models, even with the help of giant pre-trained language models. △ Less

Submitted 22 October, 2022; originally announced October 2022.

Comments: To appear at the main conference of EMNLP 2022

Journal ref: EMNLP 2022

arXiv:2210.07443 [pdf, other]

doi 10.1145/3544106

MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation

Authors: Kang Liu, Feng Xue, Dan Guo, Le Wu, Shujie Li, Richang Hong

Abstract: In most E-commerce platforms, whether the displayed items trigger the user's interest largely depends on their most eye-catching multimodal content. Consequently, increasing efforts focus on modeling multimodal user preference, and the pressing paradigm is to incorporate complete multimodal deep features of the items into the recommendation module. However, the existing studies ignore the mismatch… ▽ More In most E-commerce platforms, whether the displayed items trigger the user's interest largely depends on their most eye-catching multimodal content. Consequently, increasing efforts focus on modeling multimodal user preference, and the pressing paradigm is to incorporate complete multimodal deep features of the items into the recommendation module. However, the existing studies ignore the mismatch problem between multimodal feature extraction (MFE) and user interest modeling (UIM). That is, MFE and UIM have different emphases. Specifically, MFE is migrated from and adapted to upstream tasks such as image classification. In addition, it is mainly a content-oriented and non-personalized process, while UIM, with its greater focus on understanding user interaction, is essentially a user-oriented and personalized process. Therefore, the direct incorporation of MFE into UIM for purely user-oriented tasks, tends to introduce a large number of preference-independent multimodal noise and contaminate the embedding representations in UIM. This paper aims at solving the mismatch problem between MFE and UIM, so as to generate high-quality embedding representations and better model multimodal user preferences. Towards this end, we develop a novel model, MEGCF. The UIM of the proposed model captures the semantic correlation between interactions and the features obtained from MFE, thus making a better match between MFE and UIM. More precisely, semantic-rich entities are first extracted from the multimodal data, since they are more relevant to user preferences than other multimodal information. These entities are then integrated into the user-item interaction graph. Afterwards, a symmetric linear Graph Convolution Network (GCN) module is constructed to perform message propagation over the graph, in order to capture both high-order semantic correlation and collaborative filtering signals. △ Less

Submitted 13 October, 2022; originally announced October 2022.

arXiv:2210.04614 [pdf, other]

doi 10.1109/tcss.2022.3151822

Joint Multi-grained Popularity-aware Graph Convolution Collaborative Filtering for Recommendation

Authors: Kang Liu, Feng Xue, Xiangnan He, Dan Guo, Richang Hong

Abstract: Graph Convolution Networks (GCNs), with their efficient ability to capture high-order connectivity in graphs, have been widely applied in recommender systems. Stacking multiple neighbor aggregation is the major operation in GCNs. It implicitly captures popularity features because the number of neighbor nodes reflects the popularity of a node. However, existing GCN-based methods ignore a universal… ▽ More Graph Convolution Networks (GCNs), with their efficient ability to capture high-order connectivity in graphs, have been widely applied in recommender systems. Stacking multiple neighbor aggregation is the major operation in GCNs. It implicitly captures popularity features because the number of neighbor nodes reflects the popularity of a node. However, existing GCN-based methods ignore a universal problem: users' sensitivity to item popularity is differentiated, but the neighbor aggregations in GCNs actually fix this sensitivity through Graph Laplacian Normalization, leading to suboptimal personalization. In this work, we propose to model multi-grained popularity features and jointly learn them together with high-order connectivity, to match the differentiation of user preferences exhibited in popularity features. Specifically, we develop a Joint Multi-grained Popularity-aware Graph Convolution Collaborative Filtering model, short for JMP-GCF, which uses a popularity-aware embedding generation to construct multi-grained popularity features, and uses the idea of joint learning to capture the signals within and between different granularities of popularity features that are relevant for modeling user preferences. Additionally, we propose a multistage stacked training strategy to speed up model convergence. We conduct extensive experiments on three public datasets to show the state-of-the-art performance of JMP-GCF. △ Less

Submitted 10 October, 2022; originally announced October 2022.

arXiv:2210.00545 [pdf, other]

Seeing Through the Noisy Dark: Towards Real-world Low-Light Image Enhancement and Denoising

Authors: Jiahuan Ren, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang, Shuicheng Yan

Abstract: Low-light image enhancement (LLIE) aims at improving the illumination and visibility of dark images with lighting noise. To handle the real-world low-light images often with heavy and complex noise, some efforts have been made for joint LLIE and denoising, which however only achieve inferior restoration performance. We attribute it to two challenges: 1) in real-world low-light images, noise is som… ▽ More Low-light image enhancement (LLIE) aims at improving the illumination and visibility of dark images with lighting noise. To handle the real-world low-light images often with heavy and complex noise, some efforts have been made for joint LLIE and denoising, which however only achieve inferior restoration performance. We attribute it to two challenges: 1) in real-world low-light images, noise is somewhat covered by low-lighting and the left noise after denoising would be inevitably amplified during enhancement; 2) conversion of raw data to sRGB would cause information loss and also more noise, and hence prior LLIE methods trained on raw data are unsuitable for more common sRGB images. In this work, we propose a novel Low-light Enhancement & Denoising Network for real-world low-light images (RLED-Net) in the sRGB color space. In RLED-Net, we apply a plug-and-play differentiable Latent Subspace Reconstruction Block (LSRB) to embed the real-world images into low-rank subspaces to suppress the noise and rectify the errors, such that the impact of noise during enhancement can be effectively shrunk. We then present an efficient Crossed-channel & Shift-window Transformer (CST) layer with two branches to calculate the window and channel attentions to resist the degradation (e.g., speckle noise and blur) caused by the noise in input images. Based on the CST layers, we further present a U-structure network CSTNet as backbone for deep feature recovery, and construct a feature refine block to refine the final features. Extensive experiments on both real noisy images and public image databases well verify the effectiveness of the proposed RLED-Net for RLLIE and denoising simultaneously. △ Less

Submitted 15 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

arXiv:2209.15303 [pdf, other]

doi 10.1063/5.0101805

Validating and optimising mismatch tolerance of Doppler backscattering measurements with the beam model

Authors: Valerian H. Hall-Chen, Julius Damba, Felix I. Parra, Quinn T. Pratt, Clive A. Michael, Shi Peng, Terry L. Rhodes, Neal A. Crocker, Jon C. Hillesheim, Rongjie Hong, Shikang Ni, William A. Peebles, Ching Eng Png, Juan Ruiz Ruiz

Abstract: We use the beam model of Doppler backscattering (DBS), which was previously derived from beam tracing and the reciprocity theorem, to shed light on mismatch attenuation. This attenuation of the backscattered signal occurs when the wavevector of the probe beam's electric field is not in the plane perpendicular to the magnetic field. Correcting for this effect is important for determining the amplit… ▽ More We use the beam model of Doppler backscattering (DBS), which was previously derived from beam tracing and the reciprocity theorem, to shed light on mismatch attenuation. This attenuation of the backscattered signal occurs when the wavevector of the probe beam's electric field is not in the plane perpendicular to the magnetic field. Correcting for this effect is important for determining the amplitude of the actual density fluctuations. Previous preliminary comparisons between the model and Mega-Ampere Spherical Tokamak (MAST) plasmas were promising. In this work, we quantitatively account for this effect on DIII-D, a conventional tokamak. We compare the predicted and measured mismatch attenuation in various DIII-D, MAST, and MAST-U plasmas, showing that the beam model is applicable in a wide variety of situations. Finally, we performed a preliminary parameter sweep and found that the mismatch tolerance can be improved by optimising the probe beam's width and curvature at launch. This is potentially a design consideration for new DBS systems. △ Less

Submitted 30 September, 2022; originally announced September 2022.

arXiv:2209.04996 [pdf, other]

Switchable Online Knowledge Distillation

Authors: Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang

Abstract: Online Knowledge Distillation (OKD) improves the involved models by reciprocally exploiting the difference between teacher and student. Several crucial bottlenecks over the gap between them -- e.g., Why and when does a large gap harm the performance, especially for student? How to quantify the gap between teacher and student? -- have received limited formal study. In this paper, we propose Switcha… ▽ More Online Knowledge Distillation (OKD) improves the involved models by reciprocally exploiting the difference between teacher and student. Several crucial bottlenecks over the gap between them -- e.g., Why and when does a large gap harm the performance, especially for student? How to quantify the gap between teacher and student? -- have received limited formal study. In this paper, we propose Switchable Online Knowledge Distillation (SwitOKD), to answer these questions. Instead of focusing on the accuracy gap at test phase by the existing arts, the core idea of SwitOKD is to adaptively calibrate the gap at training phase, namely distillation gap, via a switching strategy between two modes -- expert mode (pause the teacher while keep the student learning) and learning mode (restart the teacher). To possess an appropriate distillation gap, we further devise an adaptive switching threshold, which provides a formal criterion as to when to switch to learning mode or expert mode, and thus improves the student's performance. Meanwhile, the teacher benefits from our adaptive switching threshold and keeps basically on a par with other online arts. We further extend SwitOKD to multiple networks with two basis topologies. Finally, extensive experiments and analysis validate the merits of SwitOKD for classification over the state-of-the-arts. Our code is available at https://github.com/hfutqian/SwitOKD. △ Less

Submitted 11 September, 2022; originally announced September 2022.

Comments: 16 pages, 7 figures, accepted by ECCV 2022

arXiv:2208.01351 [pdf]

doi 10.1021/acsaelm.2c00609

Fermi Level Depinning in Two-Dimensional Materials Using a Fluorinated Bilayer Graphene Barrier

Authors: Cunzhi Sun, Cheng Xiang, Rongdun Hong, Feng Zhang, Timothy J. Booth, Peter Bøggild, Manh-Ha Doan

Abstract: Strong Fermi level pinning (FLP) - often attributed to metal-induced gap states at the interfacial contacts - severely reduces the tunability of the Schottky barrier height of the junction and limits applications of the 2D materials in electronics and optoelectronics. Here, we show that fluorinated bilayer graphene (FBLG) can be used as a barrier to effectively prevent FLP at metal/2D materials in… ▽ More Strong Fermi level pinning (FLP) - often attributed to metal-induced gap states at the interfacial contacts - severely reduces the tunability of the Schottky barrier height of the junction and limits applications of the 2D materials in electronics and optoelectronics. Here, we show that fluorinated bilayer graphene (FBLG) can be used as a barrier to effectively prevent FLP at metal/2D materials interfaces. FLBG can be produced via short exposure (1-3 min) to SF6 plasma that fluorinates only the top layer of a bilayer graphene with covalent C-F bonding, while the bottom layer remains intrinsic, resulting in a band gap opening of about 75 meV. Inserting FBLG between the metallic contacts and a layer of MoS2 reduces the Schottky barrier height dramatically for the low-work function metals (313 and 260 meV for Ti and Cr, respectively) while it increases for the high-work function one ( 160 meV for Pd), corresponding to an improved pinning factor. Our results provide a straightforward method to generate atomically thin dielectrics with applications not only for depinning the Fermi level at metal/transition metal dichalcogenide (TMD) interfaces but also for solving many other problems in electronics and optoelectronics △ Less

Submitted 2 August, 2022; originally announced August 2022.

arXiv:2207.11081 [pdf, other]

Emotion Separation and Recognition from a Facial Expression by Generating the Poker Face with Vision Transformers

Authors: Jia Li, Jiantao Nie, Dan Guo, Richang Hong, Meng Wang

Abstract: Representation learning and feature disentanglement have recently attracted much research interests in facial expression recognition. The ubiquitous ambiguity of emotion labels is detrimental to those methods based on conventional supervised representation learning. Meanwhile, directly learning the map** from a facial expression image to an emotion label lacks explicit supervision signals of fac… ▽ More Representation learning and feature disentanglement have recently attracted much research interests in facial expression recognition. The ubiquitous ambiguity of emotion labels is detrimental to those methods based on conventional supervised representation learning. Meanwhile, directly learning the map** from a facial expression image to an emotion label lacks explicit supervision signals of facial details. In this paper, we propose a novel FER model, called Poker Face Vision Transformer or PF-ViT, to separate and recognize the disturbance-agnostic emotion from a static facial image via generating its corresponding poker face without the need for paired images. Here, we regard an expressive face as the comprehensive result of a set of facial muscle movements on one's poker face (i.e., emotionless face), inspired by Facial Action Coding System. The proposed PF-ViT leverages vanilla Vision Transformers, and are firstly pre-trained as Masked Autoencoders on a large facial expression dataset without emotion labels, obtaining excellent representations. It mainly consists of five components: 1) an encoder map** the facial expression to a complete representation, 2) a separator decomposing the representation into an emotional component and an orthogonal residue, 3) a generator that can reconstruct the expressive face and synthesize the poker face, 4) a discriminator distinguishing the fake face produced by the generator, trained adversarially with the encoder and generator, 5) a classification head recognizing the emotion. Quantitative and qualitative results demonstrate the effectiveness of our method, which trumps the state-of-the-art methods on four popular FER testing sets. △ Less

Submitted 9 June, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

Comments: 11 pages, 7 figures

arXiv:2207.08591 [pdf, other]

The Brain-Inspired Decoder for Natural Visual Image Reconstruction

Authors: Wenyi Li, Shengjie Zheng, Yufan Liao, Rongqi Hong, Weiliang Chen, Chenggnag He, Xiaojian Li

Abstract: Decoding images from brain activity has been a challenge. Owing to the development of deep learning, there are available tools to solve this problem. The decoded image, which aims to map neural spike trains to low-level visual features and high-level semantic information space. Recently, there are a few studies of decoding from spike trains, however, these studies pay less attention to the foundat… ▽ More Decoding images from brain activity has been a challenge. Owing to the development of deep learning, there are available tools to solve this problem. The decoded image, which aims to map neural spike trains to low-level visual features and high-level semantic information space. Recently, there are a few studies of decoding from spike trains, however, these studies pay less attention to the foundations of neuroscience and there are few studies that merged receptive field into visual image reconstruction. In this paper, we propose a deep learning neural network architecture with biological properties to reconstruct visual image from spike trains. As far as we know, we implemented a method that integrated receptive field property matrix into loss function at the first time. Our model is an end-to-end decoder from neural spike trains to images. We not only merged Gabor filter into auto-encoder which used to generate images but also proposed a loss function with receptive field properties. We evaluated our decoder on two datasets which contain macaque primary visual cortex neural spikes and salamander retina ganglion cells (RGCs) spikes. Our results show that our method can effectively combine receptive field features to reconstruct images, providing a new approach to visual reconstruction based on neural information. △ Less

Submitted 18 July, 2022; originally announced July 2022.

arXiv:2206.13413 [pdf, other]

doi 10.1145/3534678.3539419

RES: A Robust Framework for Guiding Visual Explanation

Authors: Yuyang Gao, Tong Steven Sun, Guangji Bai, Siyi Gu, Sungsoo Ray Hong, Liang Zhao

Abstract: Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs) where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when ex… ▽ More Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs) where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when explanations are inaccurate") are still relatively under-explored. To guide the model toward better explanations, techniques in explanation supervision - which add supervision signals on the model explanation - have started to show promising effects on improving both the generalizability as and intrinsic interpretability of Deep Neural Networks. However, the research on supervising explanations, especially in vision-based applications represented through saliency maps, is in its early stage due to several inherent challenges: 1) inaccuracy of the human explanation annotation boundary, 2) incompleteness of the human explanation annotation region, and 3) inconsistency of the data distribution between human annotation and model explanation maps. To address the challenges, we propose a generic RES framework for guiding visual explanation by develo** a novel objective that handles inaccurate boundary, incomplete region, and inconsistent distribution of human annotations, with a theoretical justification on model generalizability. Extensive experiments on two real-world image datasets demonstrate the effectiveness of the proposed framework on enhancing both the reasonability of the explanation and the performance of the backbone DNNs model. △ Less

Submitted 27 June, 2022; originally announced June 2022.

Comments: Published in KDD 2022

Journal ref: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

arXiv:2206.13256 [pdf, other]

A Topic-Attentive Transformer-based Model For Multimodal Depression Detection

Authors: Yanrong Guo, Chenyang Zhu, Shijie Hao, Richang Hong

Abstract: Depression is one of the most common mental disorders, which imposes heavy negative impacts on one's daily life. Diagnosing depression based on the interview is usually in the form of questions and answers. In this process, the audio signals and their text transcripts of a subject are correlated to depression cues and easily recorded. Therefore, it is feasible to build an Automatic Depression Dete… ▽ More Depression is one of the most common mental disorders, which imposes heavy negative impacts on one's daily life. Diagnosing depression based on the interview is usually in the form of questions and answers. In this process, the audio signals and their text transcripts of a subject are correlated to depression cues and easily recorded. Therefore, it is feasible to build an Automatic Depression Detection (ADD) model based on the data of these modalities in practice. However, there are two major challenges that should be addressed for constructing an effective ADD model. The first challenge is the organization of the textual and audio data, which can be of various contents and lengths for different subjects. The second challenge is the lack of training samples due to the privacy concern. Targeting to these two challenges, we propose the TOpic ATtentive transformer-based ADD model, abbreviated as TOAT. To address the first challenge, in the TOAT model, topic is taken as the basic unit of the textual and audio data according to the question-answer form in a typical interviewing process. Based on that, a topic attention module is designed to learn the importance of of each topic, which helps the model better retrieve the depressed samples. To solve the issue of data scarcity, we introduce large pre-trained models, and the fine-tuning strategy is adopted based on the small-scale ADD training data. We also design a two-branch architecture with a late-fusion strategy for building the TOAT model, in which the textual and audio data are encoded independently. We evaluate our model on the multimodal DAIC-WOZ dataset specifically designed for the ADD task. Experimental results show the superiority of our method. More importantly, the ablation studies demonstrate the effectiveness of the key elements in the TOAT model. △ Less

Submitted 27 June, 2022; originally announced June 2022.

arXiv:2206.00742 [pdf, other]

doi 10.1103/PhysRevLett.129.182502

$β$-nuclear-recoil correlation from $^6$He decay in a laser trap

Authors: P. Müller, Y. Bagdasarova, R. Hong, A. Leredde, K. G. Bailey, X. Fléchard, A. García, B. Graner, A. Knecht, O. Naviliat-Cuncic, T. P. O'Connor, M. G. Sternberg, D. W. Storm, H. E. Swanson, F. Wauters, D. W. Zumwalt

Abstract: We report the first precise measurement of a $β$-recoil correlation from a radioactive noble gas ($^6{\rm He}$) confined via a magneto-optical trap. The measurement is motivated by the search for exotic tensor-type contributions to the charged weak current. Interpreted as tensor currents with right-handed neutrinos, the measurements yield: $|C_T/C_A|^2\le 0.022$ (90\% C.L.). On the other hand, for… ▽ More We report the first precise measurement of a $β$-recoil correlation from a radioactive noble gas ($^6{\rm He}$) confined via a magneto-optical trap. The measurement is motivated by the search for exotic tensor-type contributions to the charged weak current. Interpreted as tensor currents with right-handed neutrinos, the measurements yield: $|C_T/C_A|^2\le 0.022$ (90\% C.L.). On the other hand, for left-handed neutrinos the limits are $0.007< C_T/C_A <0.111$~(90\%\ C.L.). The sensitivity of the present measurement is mainly limited by experimental uncertainties in determining the time response properties and the distance between the atom cloud and the micro-channel plate used for recoil ion detection. △ Less

Submitted 1 June, 2022; originally announced June 2022.

arXiv:2205.14693 [pdf, other]

doi 10.1016/j.patcog.2022.108540

VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution

Authors: Xintong Yu, Hongming Zhang, Ruixin Hong, Yangqiu Song, Changshui Zhang

Abstract: The visual dialog task requires an AI agent to interact with humans in multi-round dialogs based on a visual environment. As a common linguistic phenomenon, pronouns are often used in dialogs to improve the communication efficiency. As a result, resolving pronouns (i.e., grounding pronouns to the noun phrases they refer to) is an essential step towards understanding dialogs. In this paper, we prop… ▽ More The visual dialog task requires an AI agent to interact with humans in multi-round dialogs based on a visual environment. As a common linguistic phenomenon, pronouns are often used in dialogs to improve the communication efficiency. As a result, resolving pronouns (i.e., grounding pronouns to the noun phrases they refer to) is an essential step towards understanding dialogs. In this paper, we propose VD-PCR, a novel framework to improve Visual Dialog understanding with Pronoun Coreference Resolution in both implicit and explicit ways. First, to implicitly help models understand pronouns, we design novel methods to perform the joint training of the pronoun coreference resolution and visual dialog tasks. Second, after observing that the coreference relationship of pronouns and their referents indicates the relevance between dialog rounds, we propose to explicitly prune the irrelevant history rounds in visual dialog models' input. With pruned input, the models can focus on relevant dialog history and ignore the distraction in the irrelevant one. With the proposed implicit and explicit methods, VD-PCR achieves state-of-the-art experimental results on the VisDial dataset. △ Less

Submitted 29 May, 2022; originally announced May 2022.

Comments: The manuscript version of the paper. The published version is available at https://doi.org/10.1016/j.patcog.2022.108540 . The data, code and models are available at: https://github.com/HKUST- KnowComp/VD-PCR

Journal ref: Pattern Recognition, 125, 108540 (2022)

arXiv:2205.02593 [pdf, other]

METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation

Authors: Ruixin Hong, Hongming Zhang, Xintong Yu, Changshui Zhang

Abstract: Knowing the reasoning chains from knowledge to the predicted answers can help construct an explainable question answering (QA) system. Advances on QA explanation propose to explain the answers with entailment trees composed of multiple entailment steps. While current work proposes to generate entailment trees with end-to-end generative models, the steps in the generated trees are not constrained a… ▽ More Knowing the reasoning chains from knowledge to the predicted answers can help construct an explainable question answering (QA) system. Advances on QA explanation propose to explain the answers with entailment trees composed of multiple entailment steps. While current work proposes to generate entailment trees with end-to-end generative models, the steps in the generated trees are not constrained and could be unreliable. In this paper, we propose METGEN, a Module-based Entailment Tree GENeration framework that has multiple modules and a reasoning controller. Given a question and several supporting knowledge, METGEN can iteratively generate the entailment tree by conducting single-step entailment with separate modules and selecting the reasoning flow with the controller. As each module is guided to perform a specific type of entailment reasoning, the steps generated by METGEN are more reliable and valid. Experiment results on the standard benchmark show that METGEN can outperform previous state-of-the-art models with only 9% of the parameters. △ Less

Submitted 5 May, 2022; originally announced May 2022.

Comments: NAACL 2022 Findings

arXiv:2205.00179 [pdf, other]

Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization

Authors: Yangcheng Gao, Zhao Zhang, Richang Hong, Haijun Zhang, Jicong Fan, Shuicheng Yan

Abstract: To obtain lower inference latency and less memory footprint of deep neural networks, model quantization has been widely employed in deep model deployment, by converting the floating points to low-precision integers. However, previous methods (such as quantization aware training and post training quantization) require original data for the fine-tuning or calibration of quantized model, which makes… ▽ More To obtain lower inference latency and less memory footprint of deep neural networks, model quantization has been widely employed in deep model deployment, by converting the floating points to low-precision integers. However, previous methods (such as quantization aware training and post training quantization) require original data for the fine-tuning or calibration of quantized model, which makes them inapplicable to the cases that original data are not accessed due to privacy or security. This gives birth to the data-free quantization method with synthetic data generation. While current data-free quantization methods still suffer from severe performance degradation when quantizing a model into lower bit, caused by the low inter-class separability of semantic features. To this end, we propose a new and effective data-free quantization method termed ClusterQ, which utilizes the feature distribution alignment for synthetic data generation. To obtain high inter-class separability of semantic features, we cluster and align the feature distribution statistics to imitate the distribution of real data, so that the performance degradation is alleviated. Moreover, we incorporate the diversity enhancement to solve class-wise mode collapse. We also employ the exponential moving average to update the centroid of each cluster for further feature distribution improvement. Extensive experiments based on different deep models (e.g., ResNet-18 and MobileNet-V2) over the ImageNet dataset demonstrate that our proposed ClusterQ model obtains state-of-the-art performance. △ Less

Submitted 19 December, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

Comments: Please cite this work as: Yangcheng Gao, Zhao Zhang, Richang Hong, Haijun Zhang, Jicong Fan and Shuicheng Yan, "Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization," In: Proceedings of the 22nd IEEE International Conference on Data Mining (ICDM), Orlando, FL, USA, pp.1-10, Aug 2022

arXiv:2204.12063 [pdf, other]

doi 10.1145/3477495.3531927

A Review-aware Graph Contrastive Learning Framework for Recommendation

Authors: Jie Shuai, Kun Zhang, Le Wu, Peijie Sun, Richang Hong, Meng Wang, Yong Li

Abstract: Most modern recommender systems predict users preferences with two components: user and item embedding learning, followed by the user-item interaction modeling. By utilizing the auxiliary review information accompanied with user ratings, many of the existing review-based recommendation models enriched user/item embedding learning ability with historical reviews or better modeled user-item interact… ▽ More Most modern recommender systems predict users preferences with two components: user and item embedding learning, followed by the user-item interaction modeling. By utilizing the auxiliary review information accompanied with user ratings, many of the existing review-based recommendation models enriched user/item embedding learning ability with historical reviews or better modeled user-item interactions with the help of available user-item target reviews. Though significant progress has been made, we argue that current solutions for review-based recommendation suffer from two drawbacks. First, as review-based recommendation can be naturally formed as a user-item bipartite graph with edge features from corresponding user-item reviews, how to better exploit this unique graph structure for recommendation? Second, while most current models suffer from limited user behaviors, can we exploit the unique self-supervised signals in the review-aware graph to guide two recommendation components better? To this end, in this paper, we propose a novel Review-aware Graph Contrastive Learning (RGCL) framework for review-based recommendation. Specifically, we first construct a review-aware user-item graph with feature-enhanced edges from reviews, where each edge feature is composed of both the user-item rating and the corresponding review semantics. This graph with feature-enhanced edges can help attentively learn each neighbor node weight for user and item representation learning. After that, we design two additional contrastive learning tasks (i.e., Node Discrimination and Edge Discrimination) to provide self-supervised signals for the two components in recommendation process. Finally, extensive experiments over five benchmark datasets demonstrate the superiority of our proposed RGCL compared to the state-of-the-art baselines. △ Less

Submitted 16 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

Comments: Accepted by SIGIR 2022

arXiv:2204.07820 [pdf, other]

FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring

Authors: Suiyi Zhao, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang, Meng Wang

Abstract: Blind image deblurring (BID) remains a challenging and significant task. Benefiting from the strong fitting ability of deep learning, paired data-driven supervised BID method has obtained great progress. However, paired data are usually synthesized by hand, and the realistic blurs are more complex than synthetic ones, which makes the supervised methods inept at modeling realistic blurs and hinders… ▽ More Blind image deblurring (BID) remains a challenging and significant task. Benefiting from the strong fitting ability of deep learning, paired data-driven supervised BID method has obtained great progress. However, paired data are usually synthesized by hand, and the realistic blurs are more complex than synthetic ones, which makes the supervised methods inept at modeling realistic blurs and hinders their real-world applications. As such, unsupervised deep BID method without paired data offers certain advantages, but current methods still suffer from some drawbacks, e.g., bulky model size, long inference time, and strict image resolution and domain requirements. In this paper, we propose a lightweight and real-time unsupervised BID baseline, termed Frequency-domain Contrastive Loss Constrained Lightweight CycleGAN (shortly, FCL-GAN), with attractive properties, i.e., no image domain limitation, no image resolution limitation, 25x lighter than SOTA, and 5x faster than SOTA. To guarantee the lightweight property and performance superiority, two new collaboration units called lightweight domain conversion unit(LDCU) and parameter-free frequency-domain contrastive unit(PFCU) are designed. LDCU mainly implements inter-domain conversion in lightweight manner. PFCU further explores the similarity measure, external difference and internal connection between the blurred domain and sharp domain images in frequency domain, without involving extra parameters. Extensive experiments on several image datasets demonstrate the effectiveness of our FCL-GAN in terms of performance, model size and reference time. △ Less

Submitted 23 July, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

Comments: Please cite this work as: Suiyi Zhao, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang and Meng Wang, "FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring," In: Proceedings of the 30th ACM International Conference on Multimedia (ACM MM), Lisbon, Portugal, June 2022

arXiv:2203.00304 [pdf, other]

Automatic Depression Detection via Learning and Fusing Features from Visual Cues

Authors: Yanrong Guo, Chenyang Zhu, Shijie Hao, Richang Hong

Abstract: Depression is one of the most prevalent mental disorders, which seriously affects one's life. Traditional depression diagnostics commonly depends on rating with scales, which can be labor-intensive and subjective. In this context, Automatic Depression Detection (ADD) has been attracting more attention for its low cost and objectivity. ADD systems are able to detect depression automatically from so… ▽ More Depression is one of the most prevalent mental disorders, which seriously affects one's life. Traditional depression diagnostics commonly depends on rating with scales, which can be labor-intensive and subjective. In this context, Automatic Depression Detection (ADD) has been attracting more attention for its low cost and objectivity. ADD systems are able to detect depression automatically from some medical records, like video sequences. However, it remains challenging to effectively extract depression-specific information from long sequences, thereby hindering a satisfying accuracy. In this paper, we propose a novel ADD method via learning and fusing features from visual cues. Specifically, we firstly construct Temporal Dilated Convolutional Network (TDCN), in which multiple Dilated Convolution Blocks (DCB) are designed and stacked, to learn the long-range temporal information from sequences. Then, the Feature-Wise Attention (FWA) module is adopted to fuse different features extracted from TDCNs. The module learns to assign weights for the feature channels, aiming to better incorporate different kinds of visual features and further enhance the detection accuracy. Our method achieves the state-of-the-art performance on the DAIC_WOZ dataset compared to other visual-feature-based methods, showing its effectiveness. △ Less

Submitted 1 March, 2022; originally announced March 2022.

arXiv:2202.09881 [pdf, other]

doi 10.1088/1741-4326/acedc2

Observation of Long-Radial-Range-Correlation in Turbulence in High-Collisionality High-Confinement Fusion Plasmas

Authors: R. Hong, T. L. Rhodes, P. H. Diamond, Y. Ren, L. Zeng, X. Jian, K. Barada, G. Wang, W. A. Peebles

Abstract: We report on the observation of spatially asymmetric turbulent structures with a long radial correlation length in the core of high-collisionality H-mode plasmas on DIII-D tokamak. These turbulent structures develop from shorter wavelength turbulence and have a radially elongated structure. The envelope of turbulence spans a broad radial range in the mid-radius region, leading to streamer-like tra… ▽ More We report on the observation of spatially asymmetric turbulent structures with a long radial correlation length in the core of high-collisionality H-mode plasmas on DIII-D tokamak. These turbulent structures develop from shorter wavelength turbulence and have a radially elongated structure. The envelope of turbulence spans a broad radial range in the mid-radius region, leading to streamer-like transport events. The underlying turbulence is featured by intermittency, long-term memory effect, and the characteristic spectrum of self-organized criticality. The amplitude and the radial scale increase substantially when the shearing rate of the mean flow is reduced below the turbulent scattering rate. The enhanced LRRC transport events are accompanied by apparent degradation of normalized energy confinement time. These findings constitute the first experimental observation of long-radial-range turbulent transport events in high-collisionality H-mode plasmas, and demonstrate the role of mean shear flows in the formation and propagation of turbulence with long-radial-range correlation. △ Less

Submitted 15 February, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

arXiv:2201.12823 [pdf, other]

doi 10.1103/PhysRevB.105.165116

Functional Tensor Network Solving Many-body Schrödinger Equation

Authors: Rui Hong, Ya-Xuan Xiao, Jie Hu, An-Chun Ji, Shi-Ju Ran

Abstract: Schrödinger equation belongs to the most fundamental differential equations in quantum physics. However, the exact solutions are extremely rare, and many analytical methods are applicable only to the cases with small perturbations or weak correlations. Solving the many-body Schrödinger equation in the continuous spaces with the presence of strong correlations is an extremely important and challeng… ▽ More Schrödinger equation belongs to the most fundamental differential equations in quantum physics. However, the exact solutions are extremely rare, and many analytical methods are applicable only to the cases with small perturbations or weak correlations. Solving the many-body Schrödinger equation in the continuous spaces with the presence of strong correlations is an extremely important and challenging issue. In this work, we propose the functional tensor network (FTN) approach to solve the many-body Schrödinger equation. Provided the orthonormal functional bases, we represent the coefficients of the many-body wave-function as tensor network. The observables, such as energy, can be calculated simply by tensor contractions. Simulating the ground state becomes solving a minimization problem defined by the tensor network. An efficient gradient-decent algorithm based on the automatically differentiable tensors is proposed. We here take matrix product state (MPS) as an example, whose complexity scales only linearly with the system size. We apply our approach to solve the ground state of coupled harmonic oscillators, and achieve high accuracy by comparing with the exact solutions. Reliable results are also given with the presence of three-body interactions, where the system cannot be decoupled to isolated oscillators. Our approach is simple and with well-controlled error, superior to the highly-nonlinear neural-network solvers. Our work extends the applications of tensor network from quantum lattice models to the systems in the continuous space. FTN can be used as a general solver of the differential equations with many variables. The MPS exemplified here can be generalized to, e.g., the fermionic tensor networks, to solve the electronic Schrödinger equation. △ Less

Submitted 30 January, 2022; originally announced January 2022.

Comments: 9 pages, 6 figures

Journal ref: Physical Review B 105, 165116 (2022)

arXiv:2109.08992 [pdf, other]

doi 10.1088/1748-0221/16/12/P12041

High-Accuracy Absolute Magnetometry with Application to the Fermilab Muon $g-2$ Experiment

Authors: D. Flay, D. Kawall, T. Chupp, S. Corrodi, M. Farooq, M. Fertl, J. George, J. Grange, R. Hong, R. Osofsky, S. Ramachandran, E. Swanson, P. Winter

Abstract: We present details of a high-accuracy absolute scalar magnetometer based on pulsed proton NMR. The $B$-field magnitude is determined from the precession frequency of proton spins in a cylindrical sample of water after accounting for field perturbations from probe materials, sample shape, and other corrections. Features of the design, testing procedures, and corrections necessary for qualification… ▽ More We present details of a high-accuracy absolute scalar magnetometer based on pulsed proton NMR. The $B$-field magnitude is determined from the precession frequency of proton spins in a cylindrical sample of water after accounting for field perturbations from probe materials, sample shape, and other corrections. Features of the design, testing procedures, and corrections necessary for qualification as an absolute scalar magnetometer are described. The device was tested at $B = 1.45$\,T but can be modified for a range exceeding 1--3\,T. The magnetometer was used to calibrate other NMR magnetometers and measure absolute magnetic field magnitudes to an accuracy of 19 parts per billion as part of a measurement of the muon magnetic moment anomaly at Fermilab. △ Less

Submitted 22 December, 2021; v1 submitted 18 September, 2021; originally announced September 2021.

Journal ref: D. Flay et al 2021 JINST 16 P12041

arXiv:2107.05583 [pdf, other]

Few-shot Learning with Global Relatedness Decoupled-Distillation

Authors: Yuan Zhou, Yanrong Guo, Shijie Hao, Richang Hong, Zhengjun Zha, Meng Wang

Abstract: Despite the success that metric learning based approaches have achieved in few-shot learning, recent works reveal the ineffectiveness of their episodic training mode. In this paper, we point out two potential reasons for this problem: 1) the random episodic labels can only provide limited supervision information, while the relatedness information between the query and support samples is not fully… ▽ More Despite the success that metric learning based approaches have achieved in few-shot learning, recent works reveal the ineffectiveness of their episodic training mode. In this paper, we point out two potential reasons for this problem: 1) the random episodic labels can only provide limited supervision information, while the relatedness information between the query and support samples is not fully exploited; 2) the meta-learner is usually constrained by the limited contextual information of the local episode. To overcome these problems, we propose a new Global Relatedness Decoupled-Distillation (GRDD) method using the global category knowledge and the Relatedness Decoupled-Distillation (RDD) strategy. Our GRDD learns new visual concepts quickly by imitating the habit of humans, i.e. learning from the deep knowledge distilled from the teacher. More specifically, we first train a global learner on the entire base subset using category labels as supervision to leverage the global context information of the categories. Then, the well-trained global learner is used to simulate the query-support relatedness in global dependencies. Finally, the distilled global query-support relatedness is explicitly used to train the meta-learner using the RDD strategy, with the goal of making the meta-learner more discriminative. The RDD strategy aims to decouple the dense query-support relatedness into the groups of sparse decoupled relatedness. Moreover, only the relatedness of a single support sample with other query samples is considered in each group. By distilling the sparse decoupled relatedness group by group, sharper relatedness can be effectively distilled to the meta-learner, thereby facilitating the learning of a discriminative meta-learner. We conduct extensive experiments on the miniImagenet and CIFAR-FS datasets, which show the state-of-the-art performance of our GRDD method. △ Less

Submitted 13 December, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

Comments: 11 pages

arXiv:2106.03126 [pdf, other]

doi 10.21468/SciPostPhysCore.4.3.022

Predicting Quantum Potentials by Deep Neural Network and Metropolis Sampling

Authors: Rui Hong, Peng-Fei Zhou, Bin Xi, Jie Hu, An-Chun Ji, Shi-Ju Ran

Abstract: The hybridizations of machine learning and quantum physics have caused essential impacts to the methodology in both fields. Inspired by quantum potential neural network, we here propose to solve the potential in the Schrodinger equation provided the eigenstate, by combining Metropolis sampling with deep neural network, which we dub as Metropolis potential neural network (MPNN). A loss function is… ▽ More The hybridizations of machine learning and quantum physics have caused essential impacts to the methodology in both fields. Inspired by quantum potential neural network, we here propose to solve the potential in the Schrodinger equation provided the eigenstate, by combining Metropolis sampling with deep neural network, which we dub as Metropolis potential neural network (MPNN). A loss function is proposed to explicitly involve the energy in the optimization for its accurate evaluation. Benchmarking on the harmonic oscillator and hydrogen atom, MPNN shows excellent accuracy and stability on predicting not just the potential to satisfy the Schrodinger equation, but also the eigen-energy. Our proposal could be potentially applied to the ab-initio simulations, and to inversely solving other partial differential equations in physics and beyond. △ Less

Submitted 8 August, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

Journal ref: SciPost Phys. Core 4, 022 (2021)

arXiv:2105.14975 [pdf, other]

Privileged Graph Distillation for Cold Start Recommendation

Authors: Shuai Wang, Kun Zhang, Le Wu, Hai** Ma, Richang Hong, Meng Wang

Abstract: The cold start problem in recommender systems is a long-standing challenge, which requires recommending to new users (items) based on attributes without any historical interaction records. In these recommendation systems, warm users (items) have privileged collaborative signals of interaction records compared to cold start users (items), and these Collaborative Filtering (CF) signals are shown to… ▽ More The cold start problem in recommender systems is a long-standing challenge, which requires recommending to new users (items) based on attributes without any historical interaction records. In these recommendation systems, warm users (items) have privileged collaborative signals of interaction records compared to cold start users (items), and these Collaborative Filtering (CF) signals are shown to have competing performance for recommendation. Many researchers proposed to learn the correlation between collaborative signal embedding space and the attribute embedding space to improve the cold start recommendation, in which user and item categorical attributes are available in many online platforms. However, the cold start recommendation is still limited by two embedding spaces modeling and simple assumptions of space transformation. As user-item interaction behaviors and user (item) attributes naturally form a heterogeneous graph structure, in this paper, we propose a privileged graph distillation model~(PGD). The teacher model is composed of a heterogeneous graph structure for warm users and items with privileged CF links. The student model is composed of an entity-attribute graph without CF links. Specifically, the teacher model can learn better embeddings of each entity by injecting complex higher-order relationships from the constructed heterogeneous graph. The student model can learn the distilled output with privileged CF embeddings from the teacher embeddings. Our proposed model is generally applicable to different cold start scenarios with new user, new item, or new user-new item. Finally, extensive experimental results on the real-world datasets clearly show the effectiveness of our proposed model on different types of cold start problems, with average $6.6\%, 5.6\%, $ and $17.1\%$ improvement over state-of-the-art baselines on three datasets, respectively. △ Less

Submitted 31 May, 2021; originally announced May 2021.

Comments: 10 pages,5 figures

arXiv:2105.09818 [pdf, other]

doi 10.1063/5.0043600

Ray-tracing Analysis for Cross-polarization Scattering Diagnostic on MAST-Upgrade Spherical Tokamak

Authors: R. Hong, T. L. Rhodes, G. Wang, W. A. Peebles

Abstract: A combined Doppler backscattering/cross-polarization scattering (DBS/CPS) system is being deployed on MAST-U, for simultaneous measurements of local density turbulence, turbulence flows, and magnetic turbulence. In this design, CPS shares the probing beam with the DBS and uses a separate parallel-viewing receiver system. In this study, we utilize a modified GENRAY 3D ray-tracing code, to simulate… ▽ More A combined Doppler backscattering/cross-polarization scattering (DBS/CPS) system is being deployed on MAST-U, for simultaneous measurements of local density turbulence, turbulence flows, and magnetic turbulence. In this design, CPS shares the probing beam with the DBS and uses a separate parallel-viewing receiver system. In this study, we utilize a modified GENRAY 3D ray-tracing code, to simulate the propagation of the probing and scattered beams. The contributions of different scattering locations along the entire beam trajectories are considered, and the corresponding local $\tilde{\mathbf{B}}$ wavenumbers are estimated using the wave-vector matching criterion. The wavenumber ranges of the local $\tilde{\mathbf{B}}$ that is detectable to the CPS system are explored for simulated L- and H-mode plasmas. △ Less

Submitted 20 May, 2021; originally announced May 2021.

Journal ref: Rev Sci Instrum 92, 063505 (2021)

arXiv:2105.07377 [pdf, other]

Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation

Authors: Lei Chen, Le Wu, Kun Zhang, Richang Hong, Meng Wang

Abstract: As users often express their preferences with binary behavior data~(implicit feedback), such as clicking items or buying products, implicit feedback based Collaborative Filtering~(CF) models predict the top ranked items a user might like by leveraging implicit user-item interaction data. For each user, the implicit feedback is divided into two sets: an observed item set with limited observed behav… ▽ More As users often express their preferences with binary behavior data~(implicit feedback), such as clicking items or buying products, implicit feedback based Collaborative Filtering~(CF) models predict the top ranked items a user might like by leveraging implicit user-item interaction data. For each user, the implicit feedback is divided into two sets: an observed item set with limited observed behaviors, and a large unobserved item set that is mixed with negative item behaviors and unknown behaviors. Given any user preference prediction model, researchers either designed ranking based optimization goals or relied on negative item mining techniques for better optimization. Despite the performance gain of these implicit feedback based models, the recommendation results are still far from satisfactory due to the sparsity of the observed item set for each user. To this end, in this paper, we explore the unique characteristics of the implicit feedback and propose Set2setRank framework for recommendation. The optimization criteria of Set2setRank are two folds: First, we design an item to an item set comparison that encourages each observed item from the sampled observed set is ranked higher than any unobserved item from the sampled unobserved set. Second, we model set level comparison that encourages a margin between the distance summarized from the observed item set and the most "hard" unobserved item from the sampled negative set. Further, an adaptive sampling technique is designed to implement these two goals. We have to note that our proposed framework is model-agnostic and can be easily applied to most recommendation prediction approaches, and is time efficient in practice. Finally, extensive experiments on three real-world datasets demonstrate the superiority of our proposed approach. △ Less

Submitted 26 May, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

Comments: The paper is accepted by SIGIR 2021

arXiv:2105.02046 [pdf, other]

Few-shot Partial Multi-view Learning

Authors: Yuan Zhou, Yanrong Guo, Shijie Hao, Richang Hong, Jiebo Luo

Abstract: It is often the case that data are with multiple views in real-world applications. Fully exploring the information of each view is significant for making data more representative. However, due to various limitations and failures in data collection and pre-processing, it is inevitable for real data to suffer from view missing and data scarcity. The coexistence of these two issues makes it more chal… ▽ More It is often the case that data are with multiple views in real-world applications. Fully exploring the information of each view is significant for making data more representative. However, due to various limitations and failures in data collection and pre-processing, it is inevitable for real data to suffer from view missing and data scarcity. The coexistence of these two issues makes it more challenging to achieve the pattern classification task. Currently, to our best knowledge, few appropriate methods can well-handle these two issues simultaneously. Aiming to draw more attention from the community to this challenge, we propose a new task in this paper, called few-shot partial multi-view learning, which focuses on overcoming the negative impact of the view-missing issue in the low-data regime. The challenges of this task are twofold: (i) it is difficult to overcome the impact of data scarcity under the interference of missing views; (ii) the limited number of data exacerbates information scarcity, thus making it harder to address the view-missing issue in turn. To address these challenges, we propose a new unified Gaussian dense-anchoring method. The unified dense anchors are learned for the limited partial multi-view data, thereby anchoring them into a unified dense representation space where the influence of data scarcity and view missing can be alleviated. We conduct extensive experiments to evaluate our method. The results on Cub-googlenet-doc2vec, Handwritten, Caltech102, Scene15, Animal, ORL, tieredImagenet, and Birds-200-2011 datasets validate its effectiveness. △ Less

Submitted 18 May, 2023; v1 submitted 5 May, 2021; originally announced May 2021.

Comments: 18 pages

arXiv:2104.14949 [pdf, other]

doi 10.1103/PhysRevA.104.042601

Automatically Differentiable Quantum Circuit for Many-qubit State Preparation

Authors: Peng-Fei Zhou, Rui Hong, Shi-Ju Ran

Abstract: Constructing quantum circuits for efficient state preparation belongs to the central topics in the field of quantum information and computation. As the number of qubits grows fast, methods to derive large-scale quantum circuits are strongly desired. In this work, we propose the automatically differentiable quantum circuit (ADQC) approach to efficiently prepare arbitrary quantum many-qubit states.… ▽ More Constructing quantum circuits for efficient state preparation belongs to the central topics in the field of quantum information and computation. As the number of qubits grows fast, methods to derive large-scale quantum circuits are strongly desired. In this work, we propose the automatically differentiable quantum circuit (ADQC) approach to efficiently prepare arbitrary quantum many-qubit states. A key ingredient is to introduce the latent gates whose decompositions give the unitary gates that form the quantum circuit. The circuit is optimized by updating the latent gates using back propagation to minimize the distance between the evolved and target states. Taking the ground states of quantum lattice models and random matrix product states as examples, with the number of qubits where processing the full coefficients is unlikely, ADQC obtains high fidelities with small numbers of layers $N_L \sim O(1)$. Superior accuracy is reached compared with the existing state-preparation approach based on the matrix product disentangler. The parameter complexity of MPS can be significantly reduced by ADQC with the compression ratio $r \sim O(10^{-3})$. Our work sheds light on the "intelligent construction" of quantum circuits for many-qubit systems by combining with the machine learning methods. △ Less

Submitted 30 April, 2021; originally announced April 2021.

Comments: 5 pages, 5 figures

Journal ref: Phys. Rev. A 104, 042601 (2021)

arXiv:2104.07375 [pdf]

doi 10.1088/1361-648X/ac0f2c

Effect of Interfacial Dipole on Heterogeneous Ice Nucleation

Authors: Hao Lu, Quanming Xu, Jianyang Wu, Rongdun Hong, Zhisen Zhang

Abstract: In this letter, we performed molecular dynamics simulations of ice nucleation on a rigid surface model of cubic zinc blende structure with different surface dipole strength and orientation. It follows that, despite the excellent lattice match between cubic ice and substrates, the ice nucleation happened only when the interfacial water molecules (IWs) have the same or similar orientations as that o… ▽ More In this letter, we performed molecular dynamics simulations of ice nucleation on a rigid surface model of cubic zinc blende structure with different surface dipole strength and orientation. It follows that, despite the excellent lattice match between cubic ice and substrates, the ice nucleation happened only when the interfacial water molecules (IWs) have the same or similar orientations as that of the water molecules in cubic ice. The free energy landscapes revealed that, for substrates with improper dipole strength/orientation, large free energy barriers arose to prevent the dipole of IWs rotating to the right orientation to trigger ice formation. Our results suggest that the traditional concept of lattice match, the similarity of lattice length between a substrate and the new-formed crystalline, should be extended to a broader match include the similarity between the molecular orientations of the interfacial component and the component in the specific new-formed crystalline face. △ Less

Submitted 15 April, 2021; originally announced April 2021.

arXiv:2104.03281 [pdf, other]

doi 10.1103/PhysRevLett.126.141801

Measurement of the Positive Muon Anomalous Magnetic Moment to 0.46 ppm

Authors: B. Abi, T. Albahri, S. Al-Kilani, D. Allspach, L. P. Alonzi, A. Anastasi, A. Anisenkov, F. Azfar, K. Badgley, S. Baeßler, I. Bailey, V. A. Baranov, E. Barlas-Yucel, T. Barrett, E. Barzi, A. Basti, F. Bedeschi, A. Behnke, M. Berz, M. Bhattacharya, H. P. Binney, R. Bjorkquist, P. Bloom, J. Bono, E. Bottalico , et al. (212 additional authors not shown)

Abstract: We present the first results of the Fermilab Muon g-2 Experiment for the positive muon magnetic anomaly $a_μ\equiv (g_μ-2)/2$. The anomaly is determined from the precision measurements of two angular frequencies. Intensity variation of high-energy positrons from muon decays directly encodes the difference frequency $ω_a$ between the spin-precession and cyclotron frequencies for polarized muons in… ▽ More We present the first results of the Fermilab Muon g-2 Experiment for the positive muon magnetic anomaly $a_μ\equiv (g_μ-2)/2$. The anomaly is determined from the precision measurements of two angular frequencies. Intensity variation of high-energy positrons from muon decays directly encodes the difference frequency $ω_a$ between the spin-precession and cyclotron frequencies for polarized muons in a magnetic storage ring. The storage ring magnetic field is measured using nuclear magnetic resonance probes calibrated in terms of the equivalent proton spin precession frequency ${\tildeω'^{}_p}$ in a spherical water sample at 34.7$^{\circ}$C. The ratio $ω_a / {\tildeω'^{}_p}$, together with known fundamental constants, determines $a_μ({\rm FNAL}) = 116\,592\,040(54)\times 10^{-11}$ (0.46\,ppm). The result is 3.3 standard deviations greater than the standard model prediction and is in excellent agreement with the previous Brookhaven National Laboratory (BNL) E821 measurement. After combination with previous measurements of both $μ^+$ and $μ^-$, the new experimental average of $a_μ({\rm Exp}) = 116\,592\,061(41)\times 10^{-11}$ (0.35\,ppm) increases the tension between experiment and theory to 4.2 standard deviations △ Less

Submitted 7 April, 2021; originally announced April 2021.

Comments: 10 pages; 4 figures

Report number: FERMILAB-PUB-21-132-E

Journal ref: Phys. Rev. Lett. 126, 141801 (2021)

arXiv:2104.03247 [pdf, other]

doi 10.1103/PhysRevD.103.072002

Measurement of the anomalous precession frequency of the muon in the Fermilab Muon g-2 experiment

Authors: T. Albahri, A. Anastasi, A. Anisenkov, K. Badgley, S. Baeßler, I. Bailey, V. A. Baranov, E. Barlas-Yucel, T. Barrett, A. Basti, F. Bedeschi, M. Berz, M. Bhattacharya, H. P. Binney, P. Bloom, J. Bono, E. Bottalico, T. Bowcock, G. Cantatore, R. M. Carey, B. C. K. Casey, D. Cauz, R. Chakraborty, S. P. Chang, A. Chapelain , et al. (153 additional authors not shown)

Abstract: The Muon g-2 Experiment at Fermi National Accelerator Laboratory (FNAL) has measured the muon anomalous precession frequency $ω_a$ to an uncertainty of 434 parts per billion (ppb), statistical, and 56 ppb, systematic, with data collected in four storage ring configurations during its first physics run in 2018. When combined with a precision measurement of the magnetic field of the experiment's muo… ▽ More The Muon g-2 Experiment at Fermi National Accelerator Laboratory (FNAL) has measured the muon anomalous precession frequency $ω_a$ to an uncertainty of 434 parts per billion (ppb), statistical, and 56 ppb, systematic, with data collected in four storage ring configurations during its first physics run in 2018. When combined with a precision measurement of the magnetic field of the experiment's muon storage ring, the precession frequency measurement determines a muon magnetic anomaly of $a_μ({\rm FNAL}) = 116\,592\,040(54) \times 10^{-11}$ (0.46 ppm). This article describes the multiple techniques employed in the reconstruction, analysis and fitting of the data to measure the precession frequency. It also presents the averaging of the results from the eleven separate determinations of ω_a, and the systematic uncertainties on the result. △ Less

Submitted 7 April, 2021; originally announced April 2021.

Comments: 29 pages, 19 figures. Published in Physical Review D

Report number: FERMILAB-PUB-21-183-E

Journal ref: Phys. Rev. D 103, 072002 (2021)

arXiv:2104.03240 [pdf, other]

doi 10.1103/PhysRevAccelBeams.24.044002

Beam dynamics corrections to the Run-1 measurement of the muon anomalous magnetic moment at Fermilab

Authors: T. Albahri, A. Anastasi, K. Badgley, S. Baeßler, I. Bailey, V. A. Baranov, E. Barlas-Yucel, T. Barrett, F. Bedeschi, M. Berz, M. Bhattacharya, H. P. Binney, P. Bloom, J. Bono, E. Bottalico, T. Bowcock, G. Cantatore, R. M. Carey, B. C. K. Casey, D. Cauz, R. Chakraborty, S. P. Chang, A. Chapelain, S. Charity, R. Chislett , et al. (152 additional authors not shown)

Abstract: This paper presents the beam dynamics systematic corrections and their uncertainties for the Run-1 data set of the Fermilab Muon g-2 Experiment. Two corrections to the measured muon precession frequency $ω_a^m$ are associated with well-known effects owing to the use of electrostatic quadrupole (ESQ) vertical focusing in the storage ring. An average vertically oriented motional magnetic field is fe… ▽ More This paper presents the beam dynamics systematic corrections and their uncertainties for the Run-1 data set of the Fermilab Muon g-2 Experiment. Two corrections to the measured muon precession frequency $ω_a^m$ are associated with well-known effects owing to the use of electrostatic quadrupole (ESQ) vertical focusing in the storage ring. An average vertically oriented motional magnetic field is felt by relativistic muons passing transversely through the radial electric field components created by the ESQ system. The correction depends on the stored momentum distribution and the tunes of the ring, which has relatively weak vertical focusing. Vertical betatron motions imply that the muons do not orbit the ring in a plane exactly orthogonal to the vertical magnetic field direction. A correction is necessary to account for an average pitch angle associated with their trajectories. A third small correction is necessary because muons that escape the ring during the storage time are slightly biased in initial spin phase compared to the parent distribution. Finally, because two high-voltage resistors in the ESQ network had longer than designed RC time constants, the vertical and horizontal centroids and envelopes of the stored muon beam drifted slightly, but coherently, during each storage ring fill. This led to the discovery of an important phase-acceptance relationship that requires a correction. The sum of the corrections to $ω_a^m$ is 0.50 $\pm$ 0.09 ppm; the uncertainty is small compared to the 0.43 ppm statistical precision of $ω_a^m$. △ Less

Submitted 23 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: 35 pages, 29 figures. Accepted by Phys. Rev. Accel. Beams

Report number: FERMILAB-PUB-21-133-E

Journal ref: Phys. Rev. Accel. Beams 24, 044002 (2021)

arXiv:2104.03201 [pdf, other]

doi 10.1103/PhysRevA.103.042208

Magnetic Field Measurement and Analysis for the Muon g-2 Experiment at Fermilab

Authors: T. Albahri, A. Anastasi, K. Badgley, S. Baeßler, I. Bailey, V. A. Baranov, E. Barlas-Yucel, T. Barrett, F. Bedeschi, M. Berz, M. Bhattacharya, H. P. Binney, P. Bloom, J. Bono, E. Bottalico, T. Bowcock, G. Cantatore, R. M. Carey, B. C. K. Casey, D. Cauz, R. Chakraborty, S. P. Chang, A. Chapelain, S. Charity, R. Chislett , et al. (148 additional authors not shown)

Abstract: The Fermi National Accelerator Laboratory has measured the anomalous precession frequency $a^{}_μ= (g^{}_μ-2)/2$ of the muon to a combined precision of 0.46 parts per million with data collected during its first physics run in 2018. This paper documents the measurement of the magnetic field in the muon storage ring. The magnetic field is monitored by nuclear magnetic resonance systems and calibrat… ▽ More The Fermi National Accelerator Laboratory has measured the anomalous precession frequency $a^{}_μ= (g^{}_μ-2)/2$ of the muon to a combined precision of 0.46 parts per million with data collected during its first physics run in 2018. This paper documents the measurement of the magnetic field in the muon storage ring. The magnetic field is monitored by nuclear magnetic resonance systems and calibrated in terms of the equivalent proton spin precession frequency in a spherical water sample at 34.7$^\circ$C. The measured field is weighted by the muon distribution resulting in $\tildeω'^{}_p$, the denominator in the ratio $ω^{}_a$/$\tildeω'^{}_p$ that together with known fundamental constants yields $a^{}_μ$. The reported uncertainty on $\tildeω'^{}_p$ for the Run-1 data set is 114 ppb consisting of uncertainty contributions from frequency extraction, calibration, map**, tracking, and averaging of 56 ppb, and contributions from fast transient fields of 99 ppb. △ Less

Submitted 17 June, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: Added one citation and corrected missing normalization in Eqs (35) and (36)

Report number: FERMILAB-PUB-21-109-E

Journal ref: Phys. Rev. A 103, 042208 (2021)

arXiv:2104.02429 [pdf, other]

doi 10.1109/TIP.2021.3115658

Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning

Authors: Jianfeng Dong, Zhe Ma, Xiaofeng Mao, Xun Yang, Yuan He, Richang Hong, Shouling Ji

Abstract: This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one should pay more attention to the similarity in terms of a specific design/attribute between fashion items. For example, whether the collar designs of the two clothes are similar. It has potential value in many fashion related applications, such as fashion copyright protection. To this end, we propose an… ▽ More This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one should pay more attention to the similarity in terms of a specific design/attribute between fashion items. For example, whether the collar designs of the two clothes are similar. It has potential value in many fashion related applications, such as fashion copyright protection. To this end, we propose an Attribute-Specific Embedding Network (ASEN) to jointly learn multiple attribute-specific embeddings, thus measure the fine-grained similarity in the corresponding space. The proposed ASEN is comprised of a global branch and a local branch. The global branch takes the whole image as input to extract features from a global perspective, while the local branch takes as input the zoomed-in region-of-interest (RoI) w.r.t. the specified attribute thus able to extract more fine-grained features. As the global branch and the local branch extract the features from different perspectives, they are complementary to each other. Additionally, in each branch, two attention modules, i.e., Attribute-aware Spatial Attention and Attribute-aware Channel Attention, are integrated to make ASEN be able to locate the related regions and capture the essential patterns under the guidance of the specified attribute, thus make the learned attribute-specific embeddings better reflect the fine-grained similarity. Extensive experiments on three fashion-related datasets, i.e., FashionAI, DARN, and DeepFashion, show the effectiveness of ASEN for fine-grained fashion similarity prediction and its potential for fashion reranking. Code and data are available at https://github.com/maryeon/asenpp . △ Less

Submitted 11 October, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

Comments: Conference paper: arXiv:2002.02814

Journal ref: IEEE Transactions on Image Processing, vol. 30, pp. 8410-8425, 2021

arXiv:2103.16009 [pdf, other]

Revisiting Local Descriptor for Improved Few-Shot Classification

Authors: Jun He, Richang Hong, Xueliang Liu, Mingliang Xu, Qianru Sun

Abstract: Few-shot classification studies the problem of quickly adapting a deep learner to understanding novel classes based on few support images. In this context, recent research efforts have been aimed at designing more and more complex classifiers that measure similarities between query and support images, but left the importance of feature embeddings seldom explored. We show that the reliance on sophi… ▽ More Few-shot classification studies the problem of quickly adapting a deep learner to understanding novel classes based on few support images. In this context, recent research efforts have been aimed at designing more and more complex classifiers that measure similarities between query and support images, but left the importance of feature embeddings seldom explored. We show that the reliance on sophisticated classifiers is not necessary, and a simple classifier applied directly to improved feature embeddings can instead outperform most of the leading methods in the literature. To this end, we present a new method named \textbf{DCAP} for few-shot classification, in which we investigate how one can improve the quality of embeddings by leveraging \textbf{D}ense \textbf{C}lassification and \textbf{A}ttentive \textbf{P}ooling. Specifically, we propose to train a learner on base classes with abundant samples to solve dense classification problem first and then meta-train the learner on a bunch of randomly sampled few-shot tasks to adapt it to few-shot scenario or the test time scenario. During meta-training, we suggest to pool feature maps by applying attentive pooling instead of the widely used global average pooling (GAP) to prepare embeddings for few-shot classification. Attentive pooling learns to reweight local descriptors, explaining what the learner is looking for as evidence for decision making. Experiments on two benchmark datasets show the proposed method to be superior in multiple few-shot settings while being simpler and more explainable. Code is available at: \url{https://github.com/Ukeyboard/dcap/}. △ Less

Submitted 18 February, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

Comments: 23 pages, 7 figures, 7 tables

arXiv:2102.09140 [pdf, other]

Learning Fair Representations for Recommendation: A Graph-based Perspective

Authors: Le Wu, Lei Chen, Pengyang Shao, Richang Hong, Xiting Wang, Meng Wang

Abstract: As a key application of artificial intelligence, recommender systems are among the most pervasive computer aided systems to help users find potential items of interests. Recently, researchers paid considerable attention to fairness issues for artificial intelligence applications. Most of these approaches assumed independence of instances, and designed sophisticated models to eliminate the sensitiv… ▽ More As a key application of artificial intelligence, recommender systems are among the most pervasive computer aided systems to help users find potential items of interests. Recently, researchers paid considerable attention to fairness issues for artificial intelligence applications. Most of these approaches assumed independence of instances, and designed sophisticated models to eliminate the sensitive information to facilitate fairness. However, recommender systems differ greatly from these approaches as users and items naturally form a user-item bipartite graph, and are collaboratively correlated in the graph structure. In this paper, we propose a novel graph based technique for ensuring fairness of any recommendation models. Here, the fairness requirements refer to not exposing sensitive feature set in the user modeling process. Specifically, given the original embeddings from any recommendation models, we learn a composition of filters that transform each user's and each item's original embeddings into a filtered embedding space based on the sensitive feature set. For each user, this transformation is achieved under the adversarial learning of a user-centric graph, in order to obfuscate each sensitive feature between both the filtered user embedding and the sub graph structures of this user. Finally, extensive experimental results clearly show the effectiveness of our proposed model for fair recommendation. We publish the source code at https://github.com/newlei/FairGo. △ Less

Submitted 23 April, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

Comments: The paper is accepted by WWW 2021

arXiv:2101.08412 [pdf, other]

doi 10.1016/j.jmr.2021.107020

Systematic and Statistical Uncertainties of the Hilbert-Transform Based High-precision FID Frequency Extraction Method

Authors: Ran Hong, Simon Corrodi, Saskia Charity, Stefan Baessler, Jason Bono, Timothy Chupp, Martin Fertl, David Flay, Alejandro Garcia, Jimin George, Kevin Louis Giovanetti, Timothy Gorringe, Joseph Grange, Kyun Woo Hong, David Kawall, Brendan Kiburg, Bingzhi Li, Rachel Osofsky, Dinko Pocanic, Suvarna Ramachandran, Matthias Smith, Herbert Erik Swanson, Alec Tewsley-Booth, Peter Winter, Tianyu Yang , et al. (1 additional authors not shown)

Abstract: Pulsed nuclear magnetic resonance (NMR) is widely used in high-precision magnetic field measurements. The absolute value of the magnetic field is determined from the precession frequency of nuclear magnetic moments. The Hilbert transform is widely used to extract the phase function from the observed free induction decay (FID) signal and then its frequency. In this paper, a detailed implementation… ▽ More Pulsed nuclear magnetic resonance (NMR) is widely used in high-precision magnetic field measurements. The absolute value of the magnetic field is determined from the precession frequency of nuclear magnetic moments. The Hilbert transform is widely used to extract the phase function from the observed free induction decay (FID) signal and then its frequency. In this paper, a detailed implementation of a Hilbert-transform based FID frequency extraction method is described. How artifacts and noise level in the FID signal affect the extracted phase function are derived analytically. A method of mitigating the artifacts in the extracted phase function of an FID is discussed. Correlations between noises of the phase function samples are studied for different noise spectra. We discovered that the error covariance matrix for the extracted phase function is nearly singular and improper for constructing the $χ^2$ used in the fitting routine. A down-sampling method for fixing the singular covariance matrix has been developed, so that the minimum $χ^2$-fit yields properly the statistical uncertainty of the extracted frequency. Other practical methods of obtaining the statistical uncertainty are also discussed. △ Less

Submitted 27 January, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

Report number: FERMILAB-PUB-21-017-E

arXiv:2012.12226 [pdf, ps, other]

doi 10.24425/aoa.2021.139640

Influence of geometric structure, convection, and eddy on sound propagation in acoustic metamaterial with turbulent flow

Authors: Myong Chol Pak, Kwang-Il Kim, Hak Chol Pak, Kwon Ryong Hong

Abstract: The problem of reducing noise in the transportation is an important research field to prevent accidents and to provide a civilized environment for people. A material that has recently attracted attention in research to reduce noise is acoustic metamaterial, and most of the research projects so far have been limited to the case of static media without flow. We have studied the sound transmission pr… ▽ More The problem of reducing noise in the transportation is an important research field to prevent accidents and to provide a civilized environment for people. A material that has recently attracted attention in research to reduce noise is acoustic metamaterial, and most of the research projects so far have been limited to the case of static media without flow. We have studied the sound transmission properties of acoustic metamaterial with turbulent flow to develop acoustic metamaterial that be used in transportation. In this paper, the effect of geometrical structure, the convective effect, and the eddy effect on sound propagation in acoustic metamaterial with turbulent flow are investigated, and the relationships between them are analyzed. The convective effect and the eddy effect both reduce the resonant strength of sound transmission loss resulting from the unique geometry of the acoustic crystal, but shift the resonant frequencies in opposite directions. In addition, when the convective effect and the eddy effect of the airflow, as well as the intrinsic interaction effect generated from the unique geometrical structure of the acoustic metamaterial cannot be ignored, they exhibit competition phenomena with each other, resulting in a widening of the resonance peak. As a result, these three effects cause the shift of the resonance frequency of the sound transmission loss and the widening of the resonance peak. The results of this study show that even in the case of turbulent flow, acoustic metamaterial can be used for transportation by properly controlling the geometric size and shape of the acoustic metamaterial. △ Less

Submitted 22 December, 2020; originally announced December 2020.

Comments: 19 pages, 17 figures

Report number: 20-15-5

Journal ref: Archives of Acoustics 46 (2021) 637-647

arXiv:2011.05253 [pdf, other]

doi 10.1088/1361-6587/abf85b

Observation of quasi-coherent density fluctuation in scrape-off layer enhancing boundary transport in high-betaN hybrid plasmas on DIII-D

Authors: R. Hong, T. L. Rhodes, Z. Y. Li, H. Wang, L. Zeng, K. Barada, G. Wang, J. G. Watkins, W. A. Peebles

Abstract: We report the observation of a quasi-coherent density fluctuation (QCF) by the Doppler backscattering system in the scrape-off layer (SOL) region of the DIII-D tokamak. This QCF is observed in high-power, high-performance hybrid plasmas with near double-null divertor (DND) shape during the electron cyclotron heating period. This mode is correlated with a steepened SOL density profile and leads to… ▽ More We report the observation of a quasi-coherent density fluctuation (QCF) by the Doppler backscattering system in the scrape-off layer (SOL) region of the DIII-D tokamak. This QCF is observed in high-power, high-performance hybrid plasmas with near double-null divertor (DND) shape during the electron cyclotron heating period. This mode is correlated with a steepened SOL density profile and leads to significantly elevated particle and heat fluxes between ELMs. The SOL QCF is a long-wavelength ion-scale fluctuation and propagates in the ion diamagnetic direction in the plasma frame. Its radial expanse is about 1.5-2 cm, well beyond the typical width of heat flux on DIII-D. Also, the SOL QCF does not show any clear dependence on the effective SOL collisionality and thus may raise issues on the control of plasma-material interactions in low collisionality plasmas in which the blob-induced transport is reduced. A linear simulation using BOUT++ with a 5-field reduced model is performed and compared with experimental observations. In simulation results, an interchange-like density perturbation can be driven by the SOL density gradient, and its peak location and the radial width of the density perturbation are in agreement with the experimental observations. △ Less

Submitted 19 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

arXiv:2011.01353 [pdf]

Recyclable Waste Identification Using CNN Image Recognition and Gaussian Clustering

Authors: Yuheng Wang, Wen Jie Zhao, Jiahui Xu, Raymond Hong

Abstract: Waste recycling is an important way of saving energy and materials in the production process. In general cases recyclable objects are mixed with unrecyclable objects, which raises a need for identification and classification. This paper proposes a convolutional neural network (CNN) model to complete both tasks. The model uses transfer learning from a pretrained Resnet-50 CNN to complete feature ex… ▽ More Waste recycling is an important way of saving energy and materials in the production process. In general cases recyclable objects are mixed with unrecyclable objects, which raises a need for identification and classification. This paper proposes a convolutional neural network (CNN) model to complete both tasks. The model uses transfer learning from a pretrained Resnet-50 CNN to complete feature extraction. A subsequent fully connected layer for classification was trained on the augmented TrashNet dataset [1]. In the application, sliding-window is used for image segmentation in the pre-classification stage. In the post-classification stage, the labelled sample points are integrated with Gaussian Clustering to locate the object. The resulting model has achieved an overall detection rate of 48.4% in simulation and final classification accuracy of 92.4%. △ Less

Submitted 2 November, 2020; originally announced November 2020.

arXiv:2009.06168 [pdf, other]

One-bit Supervision for Image Classification

Authors: Hengtong Hu, Lingxi Xie, Zewei Du, Richang Hong, Qi Tian

Abstract: This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our setting requires the model to query with a predicted label of each sample and learn from the answer whether the guess is correct. This provides one bit (yes or no) of information, and more… ▽ More This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our setting requires the model to query with a predicted label of each sample and learn from the answer whether the guess is correct. This provides one bit (yes or no) of information, and more importantly, annotating each sample becomes much easier than finding the accurate label from many candidate classes. There are two keys to training a model upon one-bit supervision: improving the guess accuracy and making use of incorrect guesses. For these purposes, we propose a multi-stage training paradigm which incorporates negative label suppression into an off-the-shelf semi-supervised learning algorithm. In three popular image classification benchmarks, our approach claims higher efficiency in utilizing the limited amount of annotations. △ Less

Submitted 11 May, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

Journal ref: NeurIPS 2020

arXiv:2009.00449 [pdf, other]

Towards Evaluating Exploratory Model Building Process with AutoML Systems

Authors: Sungsoo Ray Hong, Sonia Castelo, Vito D'Orazio, Christopher Benthune, Aecio Santos, Scott Langevin, David Jonker, Enrico Bertini, Juliana Freire

Abstract: The use of Automated Machine Learning (AutoML) systems are highly open-ended and exploratory. While rigorously evaluating how end-users interact with AutoML is crucial, establishing a robust evaluation methodology for such exploratory systems is challenging. First, AutoML is complex, including multiple sub-components that support a variety of sub-tasks for synthesizing ML pipelines, such as data p… ▽ More The use of Automated Machine Learning (AutoML) systems are highly open-ended and exploratory. While rigorously evaluating how end-users interact with AutoML is crucial, establishing a robust evaluation methodology for such exploratory systems is challenging. First, AutoML is complex, including multiple sub-components that support a variety of sub-tasks for synthesizing ML pipelines, such as data preparation, problem specification, and model generation, making it difficult to yield insights that tell us which components were successful or not. Second, because the usage pattern of AutoML is highly exploratory, it is not possible to rely solely on widely used task efficiency and effectiveness metrics as success metrics. To tackle the challenges in evaluation, we propose an evaluation methodology that (1) guides AutoML builders to divide their AutoML system into multiple sub-system components, and (2) helps them reason about each component through visualization of end-users' behavioral patterns and attitudinal data. We conducted a study to understand when, how, why, and applying our methodology can help builders to better understand their systems and end-users. We recruited 3 teams of professional AutoML builders. The teams prepared their own systems and let 41 end-users use the systems. Using our methodology, we visualized end-users' behavioral and attitudinal data and distributed the results to the teams. We analyzed the results in two directions: what types of novel insights the AutoML builders learned from end-users, and (2) how the evaluation methodology helped the builders to understand workflows and the effectiveness of their systems. Our findings suggest new insights explaining future design opportunities in the AutoML domain as well as how using our methodology helped the builders to determine insights and let them draw concrete directions for improving their systems. △ Less

Submitted 1 September, 2020; originally announced September 2020.

arXiv:2008.09968 [pdf]

The Effect of Dipole from $γ$-AgI Substrates on Heterogeneous Ice Nucleation

Authors: Hao Lu, Quanming Xu, Chaohong Wang, Jianyang Wu, Rongdun Hong, Xiang-Yang Liu, Zhisen Zhang

Abstract: Heterogeneous ice nucleation is one of the most common and important process in the physical environment. AgI has been proved to be an effective ice nucleating agent in the process of ice nucleation. However, the microscopic mechanism of AgI in heterogeneous ice nucleation has not been fully understood. Molecular dynamics simulations are applied to investigate the ability of which kinds of $γ$-AgI… ▽ More Heterogeneous ice nucleation is one of the most common and important process in the physical environment. AgI has been proved to be an effective ice nucleating agent in the process of ice nucleation. However, the microscopic mechanism of AgI in heterogeneous ice nucleation has not been fully understood. Molecular dynamics simulations are applied to investigate the ability of which kinds of $γ$-AgI substrate can promote ice nucleation by changing the dipole of $γ$-AgI on the substrate, we conclude that the dipole of $γ$-AgI on the substrate can affect the conformation of ice nucleation. The surface ions with positive charge on the substrate may promote ice nucleation, while there is no ice nucleation founded on the surface ions with negative charge. $γ$-AgI substrates affect ice nucleation through adjust the orientations of water molecules near the surfaces. △ Less

Submitted 23 August, 2020; originally announced August 2020.

arXiv:2007.03383 [pdf, other]

RGCF: Refined Graph Convolution Collaborative Filtering with concise and expressive embedding

Authors: Kang Liu, Feng Xue, Richang Hong

Abstract: Graph Convolution Network (GCN) has attracted significant attention and become the most popular method for learning graph representations. In recent years, many efforts have been focused on integrating GCN into the recommender tasks and have made remarkable progress. At its core is to explicitly capture high-order connectivities between the nodes in user-item bipartite graph. However, we theoretic… ▽ More Graph Convolution Network (GCN) has attracted significant attention and become the most popular method for learning graph representations. In recent years, many efforts have been focused on integrating GCN into the recommender tasks and have made remarkable progress. At its core is to explicitly capture high-order connectivities between the nodes in user-item bipartite graph. However, we theoretically and empirically find an inherent drawback existed in these GCN-based recommendation methods, where GCN is directly applied to aggregate neighboring nodes will introduce noise and information redundancy. Consequently, the these models' capability of capturing high-order connectivities among different nodes is limited, leading to suboptimal performance of the recommender tasks. The main reason is that the the nonlinear network layer inside GCN structure is not suitable for extracting non-sematic features(such as one-hot ID feature) in the collaborative filtering scenarios. In this work, we develop a new GCN-based Collaborative Filtering model, named Refined Graph convolution Collaborative Filtering(RGCF), where the construction of the embeddings of users (items) are delicately redesigned from several aspects during the aggregation on the graph. Compared to the state-of-the-art GCN-based recommendation, RGCF is more capable for capturing the implicit high-order connectivities inside the graph and the resultant vector representations are more expressive. We conduct extensive experiments on three public million-size datasets, demonstrating that our RGCF significantly outperforms state-of-the-art models. We release our code at https://github.com/hfutmars/RGCF. △ Less

Submitted 11 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

arXiv:2006.11478 [pdf, ps, other]

Representation via Representations: Domain Generalization via Adversarially Learned Invariant Representations

Authors: Zhun Deng, Frances Ding, Cynthia Dwork, Rachel Hong, Giovanni Parmigiani, Prasad Patil, Pragya Sur

Abstract: We investigate the power of censoring techniques, first developed for learning {\em fair representations}, to address domain generalization. We examine {\em adversarial} censoring techniques for learning invariant representations from multiple "studies" (or domains), where each study is drawn according to a distribution on domains. The map** is used at test time to classify instances from a new… ▽ More We investigate the power of censoring techniques, first developed for learning {\em fair representations}, to address domain generalization. We examine {\em adversarial} censoring techniques for learning invariant representations from multiple "studies" (or domains), where each study is drawn according to a distribution on domains. The map** is used at test time to classify instances from a new domain. In many contexts, such as medical forecasting, domain generalization from studies in populous areas (where data are plentiful), to geographically remote populations (for which no training data exist) provides fairness of a different flavor, not anticipated in previous work on algorithmic fairness. We study an adversarial loss function for $k$ domains and precisely characterize its limiting behavior as $k$ grows, formalizing and proving the intuition, backed by experiments, that observing data from a larger number of domains helps. The limiting results are accompanied by non-asymptotic learning-theoretic bounds. Furthermore, we obtain sufficient conditions for good worst-case prediction performance of our algorithm on previously unseen domains. Finally, we decompose our map**s into two components and provide a complete characterization of invariance in terms of this decomposition. To our knowledge, our results provide the first formal guarantees of these kinds for adversarial invariant domain generalization. △ Less

Submitted 19 June, 2020; originally announced June 2020.

arXiv:2005.12021 [pdf, other]

Joint Item Recommendation and Attribute Inference: An Adaptive Graph Convolutional Network Approach

Authors: Le Wu, Yonghui Yang, Kun Zhang, Richang Hong, Yanjie Fu, Meng Wang

Abstract: In many recommender systems, users and items are associated with attributes, and users show preferences to items. The attribute information describes users'(items') characteristics and has a wide range of applications, such as user profiling, item annotation, and feature-enhanced recommendation. As annotating user (item) attributes is a labor intensive task, the attribute values are often incomple… ▽ More In many recommender systems, users and items are associated with attributes, and users show preferences to items. The attribute information describes users'(items') characteristics and has a wide range of applications, such as user profiling, item annotation, and feature-enhanced recommendation. As annotating user (item) attributes is a labor intensive task, the attribute values are often incomplete with many missing attribute values. Therefore, item recommendation and attribute inference have become two main tasks in these platforms. Researchers have long converged that user (item) attributes and the preference behavior are highly correlated. Some researchers proposed to leverage one kind of data for the remaining task, and showed to improve performance. Nevertheless, these models either neglected the incompleteness of user (item) attributes or regarded the correlation of the two tasks with simple models, leading to suboptimal performance of these two tasks. To this end, in this paper, we define these two tasks in an attributed user-item bipartite graph, and propose an Adaptive Graph Convolutional Network (AGCN) approach for joint item recommendation and attribute inference. The key idea of AGCN is to iteratively perform two parts: 1) Learning graph embedding parameters with previously learned approximated attribute values to facilitate two tasks; 2) Sending the approximated updated attribute values back to the attributed graph for better graph embedding learning. Therefore, AGCN could adaptively adjust the graph embedding learning parameters by incorporating both the given attributes and the estimated attribute values, in order to provide weakly supervised information to refine the two tasks. Extensive experimental results on three real-world datasets clearly show the effectiveness of the proposed model. △ Less

Submitted 25 May, 2020; originally announced May 2020.

Comments: Accepted by SIGIR2020

arXiv:2005.11724 [pdf, other]

Learning to Transfer Graph Embeddings for Inductive Graph based Recommendation

Authors: Le Wu, Yonghui Yang, Lei Chen, Defu Lian, Richang Hong, Meng Wang

Abstract: With the increasing availability of videos, how to edit them and present the most interesting parts to users, i.e., video highlight, has become an urgent need with many broad applications. As users'visual preferences are subjective and vary from person to person, previous generalized video highlight extraction models fail to tailor to users' unique preferences. In this paper, we study the problem… ▽ More With the increasing availability of videos, how to edit them and present the most interesting parts to users, i.e., video highlight, has become an urgent need with many broad applications. As users'visual preferences are subjective and vary from person to person, previous generalized video highlight extraction models fail to tailor to users' unique preferences. In this paper, we study the problem of personalized video highlight recommendation with rich visual content. By dividing each video into non-overlap** segments, we formulate the problem as a personalized segment recommendation task with many new segments in the test stage. The key challenges of this problem lie in: the cold-start users with limited video highlight records in the training data and new segments without any user ratings at the test stage. In this paper, we propose an inductive Graph based Transfer learning framework for personalized video highlight Recommendation (TransGRec). TransGRec is composed of two parts: a graph neural network followed by an item embedding transfer network. Specifically, the graph neural network part exploits the higher-order proximity between users and segments to alleviate the user cold-start problem. The transfer network is designed to approximate the learned item embeddings from graph neural networks by taking each item's visual content as input, in order to tackle the new segment problem in the test phase. We design two detailed implementations of the transfer learning optimization function, and we show how the two parts of TransGRec can be efficiently optimized with different transfer learning optimization functions. Extensive experimental results on a real-world dataset clearly show the effectiveness of our proposed model. △ Less

Submitted 24 May, 2020; originally announced May 2020.

Comments: Accepted by SIGIR2020

arXiv:2005.11034 [pdf, other]

Real-time Semantic Segmentation via Spatial-detail Guided Context Propagation

Authors: Shijie Hao, Yuan Zhou, Yanrong Guo, Richang Hong, Jun Cheng, Meng Wang

Abstract: Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision computing tasks, e.g. semantic segmentation, are usually computationally expensive, posing a challenge to the computing systems that are resource-constrained but require fast response speed. Therefore, it is valuable to develop accurate and real-time vision processing models that… ▽ More Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision computing tasks, e.g. semantic segmentation, are usually computationally expensive, posing a challenge to the computing systems that are resource-constrained but require fast response speed. Therefore, it is valuable to develop accurate and real-time vision processing models that only require limited computational resources. To this end, we propose the Spatial-detail Guided Context Propagation Network (SGCPNet) for achieving real-time semantic segmentation. In SGCPNet, we propose the strategy of spatial-detail guided context propagation. It uses the spatial details of shallow layers to guide the propagation of the low-resolution global contexts, in which the lost spatial information can be effectively reconstructed. In this way, the need for maintaining high-resolution features along the network is freed, therefore largely improving the model efficiency. On the other hand, due to the effective reconstruction of spatial details, the segmentation accuracy can be still preserved. In the experiments, we validate the effectiveness and efficiency of the proposed SGCPNet model. On the Citysacpes dataset, for example, our SGCPNet achieves 69.5% mIoU segmentation accuracy, while its speed reaches 178.5 FPS on 768x1536 images on a GeForce GTX 1080 Ti GPU card. In addition, SGCPNet is very lightweight and only contains 0.61 M parameters. △ Less

Submitted 19 March, 2022; v1 submitted 22 May, 2020; originally announced May 2020.

Comments: The paper has been accepted in the IEEE Transactions on Neural Networks and Learning Systems

arXiv:2005.07580 [pdf, ps, other]

Efficient Network Function Backup by Update Piggybacking

Authors: Kate Ching-Ju Lin, Ruei-Yong Hong, Yu-Chee Tseng

Abstract: Network Function Virtualization (NFV) and Service Function Chaining (SFC) have been widely used to enable flexible and agile network management. To enhance reliability, some research has proposed to deploy backup function instances for prompt recovery when a primary instance fails. While most of the recent studies focus on speeding up recovery, less attention has been paid to the problem of minimi… ▽ More Network Function Virtualization (NFV) and Service Function Chaining (SFC) have been widely used to enable flexible and agile network management. To enhance reliability, some research has proposed to deploy backup function instances for prompt recovery when a primary instance fails. While most of the recent studies focus on speeding up recovery, less attention has been paid to the problem of minimizing the state update cost. In this work, we present PiggyBackup (Piggyback-based Backup), an efficient backup instance deployment and update protocol. Our key idea is to reuse the existing service chains traversing through servers in a network to help piggyback the update information. By doing this, we eliminate the header overhead and reduce the amount of update traffic significantly. To realize such a piggyback-based update more efficiently, we investigate the backup instance deployment and chain selection problems to enhance piggybacking opportunities and reduce the forwarding hop counts with explicit consideration of the distribution of service chains. Our simulation results show that PiggyBackup reduces the average overall update overhead by 47.65% and 39.56%, respectively, in a fat-tree topology as compared to random deployment and shortest path based deployment. △ Less

Submitted 15 May, 2020; originally announced May 2020.

Showing 51–100 of 156 results for author: Hong, R