-
Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in The Dark
Authors:
Huan Zheng,
Zhao Zhang,
Jicong Fan,
Richang Hong,
Yi Yang,
Shuicheng Yan
Abstract:
Low-light stereo image enhancement (LLSIE) is a relatively new task to enhance the quality of visually unpleasant stereo images captured in dark condition. However, current methods achieve inferior performance on detail recovery and illumination adjustment. We find it is because: 1) the insufficient single-scale inter-view interaction makes the cross-view cues unable to be fully exploited; 2) lack…
▽ More
Low-light stereo image enhancement (LLSIE) is a relatively new task to enhance the quality of visually unpleasant stereo images captured in dark condition. However, current methods achieve inferior performance on detail recovery and illumination adjustment. We find it is because: 1) the insufficient single-scale inter-view interaction makes the cross-view cues unable to be fully exploited; 2) lacking long-range dependency leads to the inability to deal with the spatial long-range effects caused by illumination degradation. To alleviate such limitations, we propose a LLSIE model termed Decoupled Cross-scale Cross-view Interaction Network (DCI-Net). Specifically, we present a decoupled interaction module (DIM) that aims for sufficient dual-view information interaction. DIM decouples the dual-view information exchange into discovering multi-scale cross-view correlations and further exploring cross-scale information flow. Besides, we present a spatial-channel information mining block (SIMB) for intra-view feature extraction, and the benefits are twofold. One is the long-range dependency capture to build spatial long-range relationship, and the other is expanded channel information refinement that enhances information flow in channel dimension. Extensive experiments on Flickr1024, KITTI 2012, KITTI 2015 and Middlebury datasets show that our method obtains better illumination adjustment and detail recovery, and achieves SOTA performance compared to other related methods. Our codes, datasets and models will be publicly available.
△ Less
Submitted 12 November, 2022; v1 submitted 2 November, 2022;
originally announced November 2022.
-
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
Authors:
Yinya Huang,
Hongming Zhang,
Ruixin Hong,
Xiaodan Liang,
Changshui Zhang,
Dong Yu
Abstract:
In this paper, we propose a comprehensive benchmark to investigate models' logical reasoning capabilities in complex real-life scenarios. Current explanation datasets often employ synthetic data with simple reasoning structures. Therefore, it cannot express more complex reasoning processes, such as the rebuttal to a reasoning step and the degree of certainty of the evidence. To this end, we propos…
▽ More
In this paper, we propose a comprehensive benchmark to investigate models' logical reasoning capabilities in complex real-life scenarios. Current explanation datasets often employ synthetic data with simple reasoning structures. Therefore, it cannot express more complex reasoning processes, such as the rebuttal to a reasoning step and the degree of certainty of the evidence. To this end, we propose a comprehensive logical reasoning explanation form. Based on the multi-hop chain of reasoning, the explanation form includes three main components: (1) The condition of rebuttal that the reasoning node can be challenged; (2) Logical formulae that uncover the internal texture of reasoning nodes; (3) Reasoning strength indicated by degrees of certainty. The fine-grained structure conforms to the real logical reasoning scenario, better fitting the human cognitive process but, simultaneously, is more challenging for the current models. We evaluate the current best models' performance on this new explanation form. The experimental results show that generating reasoning graphs remains a challenging task for current models, even with the help of giant pre-trained language models.
△ Less
Submitted 22 October, 2022;
originally announced October 2022.
-
MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation
Authors:
Kang Liu,
Feng Xue,
Dan Guo,
Le Wu,
Shujie Li,
Richang Hong
Abstract:
In most E-commerce platforms, whether the displayed items trigger the user's interest largely depends on their most eye-catching multimodal content. Consequently, increasing efforts focus on modeling multimodal user preference, and the pressing paradigm is to incorporate complete multimodal deep features of the items into the recommendation module. However, the existing studies ignore the mismatch…
▽ More
In most E-commerce platforms, whether the displayed items trigger the user's interest largely depends on their most eye-catching multimodal content. Consequently, increasing efforts focus on modeling multimodal user preference, and the pressing paradigm is to incorporate complete multimodal deep features of the items into the recommendation module. However, the existing studies ignore the mismatch problem between multimodal feature extraction (MFE) and user interest modeling (UIM). That is, MFE and UIM have different emphases. Specifically, MFE is migrated from and adapted to upstream tasks such as image classification. In addition, it is mainly a content-oriented and non-personalized process, while UIM, with its greater focus on understanding user interaction, is essentially a user-oriented and personalized process. Therefore, the direct incorporation of MFE into UIM for purely user-oriented tasks, tends to introduce a large number of preference-independent multimodal noise and contaminate the embedding representations in UIM.
This paper aims at solving the mismatch problem between MFE and UIM, so as to generate high-quality embedding representations and better model multimodal user preferences. Towards this end, we develop a novel model, MEGCF. The UIM of the proposed model captures the semantic correlation between interactions and the features obtained from MFE, thus making a better match between MFE and UIM. More precisely, semantic-rich entities are first extracted from the multimodal data, since they are more relevant to user preferences than other multimodal information. These entities are then integrated into the user-item interaction graph. Afterwards, a symmetric linear Graph Convolution Network (GCN) module is constructed to perform message propagation over the graph, in order to capture both high-order semantic correlation and collaborative filtering signals.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Joint Multi-grained Popularity-aware Graph Convolution Collaborative Filtering for Recommendation
Authors:
Kang Liu,
Feng Xue,
Xiangnan He,
Dan Guo,
Richang Hong
Abstract:
Graph Convolution Networks (GCNs), with their efficient ability to capture high-order connectivity in graphs, have been widely applied in recommender systems. Stacking multiple neighbor aggregation is the major operation in GCNs. It implicitly captures popularity features because the number of neighbor nodes reflects the popularity of a node. However, existing GCN-based methods ignore a universal…
▽ More
Graph Convolution Networks (GCNs), with their efficient ability to capture high-order connectivity in graphs, have been widely applied in recommender systems. Stacking multiple neighbor aggregation is the major operation in GCNs. It implicitly captures popularity features because the number of neighbor nodes reflects the popularity of a node. However, existing GCN-based methods ignore a universal problem: users' sensitivity to item popularity is differentiated, but the neighbor aggregations in GCNs actually fix this sensitivity through Graph Laplacian Normalization, leading to suboptimal personalization.
In this work, we propose to model multi-grained popularity features and jointly learn them together with high-order connectivity, to match the differentiation of user preferences exhibited in popularity features. Specifically, we develop a Joint Multi-grained Popularity-aware Graph Convolution Collaborative Filtering model, short for JMP-GCF, which uses a popularity-aware embedding generation to construct multi-grained popularity features, and uses the idea of joint learning to capture the signals within and between different granularities of popularity features that are relevant for modeling user preferences. Additionally, we propose a multistage stacked training strategy to speed up model convergence. We conduct extensive experiments on three public datasets to show the state-of-the-art performance of JMP-GCF.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Seeing Through the Noisy Dark: Towards Real-world Low-Light Image Enhancement and Denoising
Authors:
Jiahuan Ren,
Zhao Zhang,
Richang Hong,
Mingliang Xu,
Yi Yang,
Shuicheng Yan
Abstract:
Low-light image enhancement (LLIE) aims at improving the illumination and visibility of dark images with lighting noise. To handle the real-world low-light images often with heavy and complex noise, some efforts have been made for joint LLIE and denoising, which however only achieve inferior restoration performance. We attribute it to two challenges: 1) in real-world low-light images, noise is som…
▽ More
Low-light image enhancement (LLIE) aims at improving the illumination and visibility of dark images with lighting noise. To handle the real-world low-light images often with heavy and complex noise, some efforts have been made for joint LLIE and denoising, which however only achieve inferior restoration performance. We attribute it to two challenges: 1) in real-world low-light images, noise is somewhat covered by low-lighting and the left noise after denoising would be inevitably amplified during enhancement; 2) conversion of raw data to sRGB would cause information loss and also more noise, and hence prior LLIE methods trained on raw data are unsuitable for more common sRGB images. In this work, we propose a novel Low-light Enhancement & Denoising Network for real-world low-light images (RLED-Net) in the sRGB color space. In RLED-Net, we apply a plug-and-play differentiable Latent Subspace Reconstruction Block (LSRB) to embed the real-world images into low-rank subspaces to suppress the noise and rectify the errors, such that the impact of noise during enhancement can be effectively shrunk. We then present an efficient Crossed-channel & Shift-window Transformer (CST) layer with two branches to calculate the window and channel attentions to resist the degradation (e.g., speckle noise and blur) caused by the noise in input images. Based on the CST layers, we further present a U-structure network CSTNet as backbone for deep feature recovery, and construct a feature refine block to refine the final features. Extensive experiments on both real noisy images and public image databases well verify the effectiveness of the proposed RLED-Net for RLLIE and denoising simultaneously.
△ Less
Submitted 15 November, 2022; v1 submitted 2 October, 2022;
originally announced October 2022.
-
Validating and optimising mismatch tolerance of Doppler backscattering measurements with the beam model
Authors:
Valerian H. Hall-Chen,
Julius Damba,
Felix I. Parra,
Quinn T. Pratt,
Clive A. Michael,
Shi Peng,
Terry L. Rhodes,
Neal A. Crocker,
Jon C. Hillesheim,
Rongjie Hong,
Shikang Ni,
William A. Peebles,
Ching Eng Png,
Juan Ruiz Ruiz
Abstract:
We use the beam model of Doppler backscattering (DBS), which was previously derived from beam tracing and the reciprocity theorem, to shed light on mismatch attenuation. This attenuation of the backscattered signal occurs when the wavevector of the probe beam's electric field is not in the plane perpendicular to the magnetic field. Correcting for this effect is important for determining the amplit…
▽ More
We use the beam model of Doppler backscattering (DBS), which was previously derived from beam tracing and the reciprocity theorem, to shed light on mismatch attenuation. This attenuation of the backscattered signal occurs when the wavevector of the probe beam's electric field is not in the plane perpendicular to the magnetic field. Correcting for this effect is important for determining the amplitude of the actual density fluctuations. Previous preliminary comparisons between the model and Mega-Ampere Spherical Tokamak (MAST) plasmas were promising. In this work, we quantitatively account for this effect on DIII-D, a conventional tokamak. We compare the predicted and measured mismatch attenuation in various DIII-D, MAST, and MAST-U plasmas, showing that the beam model is applicable in a wide variety of situations. Finally, we performed a preliminary parameter sweep and found that the mismatch tolerance can be improved by optimising the probe beam's width and curvature at launch. This is potentially a design consideration for new DBS systems.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
Switchable Online Knowledge Distillation
Authors:
Biao Qian,
Yang Wang,
Hongzhi Yin,
Richang Hong,
Meng Wang
Abstract:
Online Knowledge Distillation (OKD) improves the involved models by reciprocally exploiting the difference between teacher and student. Several crucial bottlenecks over the gap between them -- e.g., Why and when does a large gap harm the performance, especially for student? How to quantify the gap between teacher and student? -- have received limited formal study. In this paper, we propose Switcha…
▽ More
Online Knowledge Distillation (OKD) improves the involved models by reciprocally exploiting the difference between teacher and student. Several crucial bottlenecks over the gap between them -- e.g., Why and when does a large gap harm the performance, especially for student? How to quantify the gap between teacher and student? -- have received limited formal study. In this paper, we propose Switchable Online Knowledge Distillation (SwitOKD), to answer these questions. Instead of focusing on the accuracy gap at test phase by the existing arts, the core idea of SwitOKD is to adaptively calibrate the gap at training phase, namely distillation gap, via a switching strategy between two modes -- expert mode (pause the teacher while keep the student learning) and learning mode (restart the teacher). To possess an appropriate distillation gap, we further devise an adaptive switching threshold, which provides a formal criterion as to when to switch to learning mode or expert mode, and thus improves the student's performance. Meanwhile, the teacher benefits from our adaptive switching threshold and keeps basically on a par with other online arts. We further extend SwitOKD to multiple networks with two basis topologies. Finally, extensive experiments and analysis validate the merits of SwitOKD for classification over the state-of-the-arts. Our code is available at https://github.com/hfutqian/SwitOKD.
△ Less
Submitted 11 September, 2022;
originally announced September 2022.
-
Fermi Level Depinning in Two-Dimensional Materials Using a Fluorinated Bilayer Graphene Barrier
Authors:
Cunzhi Sun,
Cheng Xiang,
Rongdun Hong,
Feng Zhang,
Timothy J. Booth,
Peter Bøggild,
Manh-Ha Doan
Abstract:
Strong Fermi level pinning (FLP) - often attributed to metal-induced gap states at the interfacial contacts - severely reduces the tunability of the Schottky barrier height of the junction and limits applications of the 2D materials in electronics and optoelectronics. Here, we show that fluorinated bilayer graphene (FBLG) can be used as a barrier to effectively prevent FLP at metal/2D materials in…
▽ More
Strong Fermi level pinning (FLP) - often attributed to metal-induced gap states at the interfacial contacts - severely reduces the tunability of the Schottky barrier height of the junction and limits applications of the 2D materials in electronics and optoelectronics. Here, we show that fluorinated bilayer graphene (FBLG) can be used as a barrier to effectively prevent FLP at metal/2D materials interfaces. FLBG can be produced via short exposure (1-3 min) to SF6 plasma that fluorinates only the top layer of a bilayer graphene with covalent C-F bonding, while the bottom layer remains intrinsic, resulting in a band gap opening of about 75 meV. Inserting FBLG between the metallic contacts and a layer of MoS2 reduces the Schottky barrier height dramatically for the low-work function metals (313 and 260 meV for Ti and Cr, respectively) while it increases for the high-work function one ( 160 meV for Pd), corresponding to an improved pinning factor. Our results provide a straightforward method to generate atomically thin dielectrics with applications not only for depinning the Fermi level at metal/transition metal dichalcogenide (TMD) interfaces but also for solving many other problems in electronics and optoelectronics
△ Less
Submitted 2 August, 2022;
originally announced August 2022.
-
Emotion Separation and Recognition from a Facial Expression by Generating the Poker Face with Vision Transformers
Authors:
Jia Li,
Jiantao Nie,
Dan Guo,
Richang Hong,
Meng Wang
Abstract:
Representation learning and feature disentanglement have recently attracted much research interests in facial expression recognition. The ubiquitous ambiguity of emotion labels is detrimental to those methods based on conventional supervised representation learning. Meanwhile, directly learning the map** from a facial expression image to an emotion label lacks explicit supervision signals of fac…
▽ More
Representation learning and feature disentanglement have recently attracted much research interests in facial expression recognition. The ubiquitous ambiguity of emotion labels is detrimental to those methods based on conventional supervised representation learning. Meanwhile, directly learning the map** from a facial expression image to an emotion label lacks explicit supervision signals of facial details. In this paper, we propose a novel FER model, called Poker Face Vision Transformer or PF-ViT, to separate and recognize the disturbance-agnostic emotion from a static facial image via generating its corresponding poker face without the need for paired images. Here, we regard an expressive face as the comprehensive result of a set of facial muscle movements on one's poker face (i.e., emotionless face), inspired by Facial Action Coding System. The proposed PF-ViT leverages vanilla Vision Transformers, and are firstly pre-trained as Masked Autoencoders on a large facial expression dataset without emotion labels, obtaining excellent representations. It mainly consists of five components: 1) an encoder map** the facial expression to a complete representation, 2) a separator decomposing the representation into an emotional component and an orthogonal residue, 3) a generator that can reconstruct the expressive face and synthesize the poker face, 4) a discriminator distinguishing the fake face produced by the generator, trained adversarially with the encoder and generator, 5) a classification head recognizing the emotion. Quantitative and qualitative results demonstrate the effectiveness of our method, which trumps the state-of-the-art methods on four popular FER testing sets.
△ Less
Submitted 9 June, 2023; v1 submitted 22 July, 2022;
originally announced July 2022.
-
The Brain-Inspired Decoder for Natural Visual Image Reconstruction
Authors:
Wenyi Li,
Shengjie Zheng,
Yufan Liao,
Rongqi Hong,
Weiliang Chen,
Chenggnag He,
Xiaojian Li
Abstract:
Decoding images from brain activity has been a challenge. Owing to the development of deep learning, there are available tools to solve this problem. The decoded image, which aims to map neural spike trains to low-level visual features and high-level semantic information space. Recently, there are a few studies of decoding from spike trains, however, these studies pay less attention to the foundat…
▽ More
Decoding images from brain activity has been a challenge. Owing to the development of deep learning, there are available tools to solve this problem. The decoded image, which aims to map neural spike trains to low-level visual features and high-level semantic information space. Recently, there are a few studies of decoding from spike trains, however, these studies pay less attention to the foundations of neuroscience and there are few studies that merged receptive field into visual image reconstruction. In this paper, we propose a deep learning neural network architecture with biological properties to reconstruct visual image from spike trains. As far as we know, we implemented a method that integrated receptive field property matrix into loss function at the first time. Our model is an end-to-end decoder from neural spike trains to images. We not only merged Gabor filter into auto-encoder which used to generate images but also proposed a loss function with receptive field properties. We evaluated our decoder on two datasets which contain macaque primary visual cortex neural spikes and salamander retina ganglion cells (RGCs) spikes. Our results show that our method can effectively combine receptive field features to reconstruct images, providing a new approach to visual reconstruction based on neural information.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
RES: A Robust Framework for Guiding Visual Explanation
Authors:
Yuyang Gao,
Tong Steven Sun,
Guangji Bai,
Siyi Gu,
Sungsoo Ray Hong,
Liang Zhao
Abstract:
Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs) where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when ex…
▽ More
Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs) where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when explanations are inaccurate") are still relatively under-explored. To guide the model toward better explanations, techniques in explanation supervision - which add supervision signals on the model explanation - have started to show promising effects on improving both the generalizability as and intrinsic interpretability of Deep Neural Networks. However, the research on supervising explanations, especially in vision-based applications represented through saliency maps, is in its early stage due to several inherent challenges: 1) inaccuracy of the human explanation annotation boundary, 2) incompleteness of the human explanation annotation region, and 3) inconsistency of the data distribution between human annotation and model explanation maps. To address the challenges, we propose a generic RES framework for guiding visual explanation by develo** a novel objective that handles inaccurate boundary, incomplete region, and inconsistent distribution of human annotations, with a theoretical justification on model generalizability. Extensive experiments on two real-world image datasets demonstrate the effectiveness of the proposed framework on enhancing both the reasonability of the explanation and the performance of the backbone DNNs model.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
A Topic-Attentive Transformer-based Model For Multimodal Depression Detection
Authors:
Yanrong Guo,
Chenyang Zhu,
Shijie Hao,
Richang Hong
Abstract:
Depression is one of the most common mental disorders, which imposes heavy negative impacts on one's daily life. Diagnosing depression based on the interview is usually in the form of questions and answers. In this process, the audio signals and their text transcripts of a subject are correlated to depression cues and easily recorded. Therefore, it is feasible to build an Automatic Depression Dete…
▽ More
Depression is one of the most common mental disorders, which imposes heavy negative impacts on one's daily life. Diagnosing depression based on the interview is usually in the form of questions and answers. In this process, the audio signals and their text transcripts of a subject are correlated to depression cues and easily recorded. Therefore, it is feasible to build an Automatic Depression Detection (ADD) model based on the data of these modalities in practice. However, there are two major challenges that should be addressed for constructing an effective ADD model. The first challenge is the organization of the textual and audio data, which can be of various contents and lengths for different subjects. The second challenge is the lack of training samples due to the privacy concern. Targeting to these two challenges, we propose the TOpic ATtentive transformer-based ADD model, abbreviated as TOAT. To address the first challenge, in the TOAT model, topic is taken as the basic unit of the textual and audio data according to the question-answer form in a typical interviewing process. Based on that, a topic attention module is designed to learn the importance of of each topic, which helps the model better retrieve the depressed samples. To solve the issue of data scarcity, we introduce large pre-trained models, and the fine-tuning strategy is adopted based on the small-scale ADD training data. We also design a two-branch architecture with a late-fusion strategy for building the TOAT model, in which the textual and audio data are encoded independently. We evaluate our model on the multimodal DAIC-WOZ dataset specifically designed for the ADD task. Experimental results show the superiority of our method. More importantly, the ablation studies demonstrate the effectiveness of the key elements in the TOAT model.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
$β$-nuclear-recoil correlation from $^6$He decay in a laser trap
Authors:
P. Müller,
Y. Bagdasarova,
R. Hong,
A. Leredde,
K. G. Bailey,
X. Fléchard,
A. García,
B. Graner,
A. Knecht,
O. Naviliat-Cuncic,
T. P. O'Connor,
M. G. Sternberg,
D. W. Storm,
H. E. Swanson,
F. Wauters,
D. W. Zumwalt
Abstract:
We report the first precise measurement of a $β$-recoil correlation from a radioactive noble gas ($^6{\rm He}$) confined via a magneto-optical trap. The measurement is motivated by the search for exotic tensor-type contributions to the charged weak current. Interpreted as tensor currents with right-handed neutrinos, the measurements yield: $|C_T/C_A|^2\le 0.022$ (90\% C.L.). On the other hand, for…
▽ More
We report the first precise measurement of a $β$-recoil correlation from a radioactive noble gas ($^6{\rm He}$) confined via a magneto-optical trap. The measurement is motivated by the search for exotic tensor-type contributions to the charged weak current. Interpreted as tensor currents with right-handed neutrinos, the measurements yield: $|C_T/C_A|^2\le 0.022$ (90\% C.L.). On the other hand, for left-handed neutrinos the limits are $0.007< C_T/C_A <0.111$~(90\%\ C.L.). The sensitivity of the present measurement is mainly limited by experimental uncertainties in determining the time response properties and the distance between the atom cloud and the micro-channel plate used for recoil ion detection.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution
Authors:
Xintong Yu,
Hongming Zhang,
Ruixin Hong,
Yangqiu Song,
Changshui Zhang
Abstract:
The visual dialog task requires an AI agent to interact with humans in multi-round dialogs based on a visual environment. As a common linguistic phenomenon, pronouns are often used in dialogs to improve the communication efficiency. As a result, resolving pronouns (i.e., grounding pronouns to the noun phrases they refer to) is an essential step towards understanding dialogs. In this paper, we prop…
▽ More
The visual dialog task requires an AI agent to interact with humans in multi-round dialogs based on a visual environment. As a common linguistic phenomenon, pronouns are often used in dialogs to improve the communication efficiency. As a result, resolving pronouns (i.e., grounding pronouns to the noun phrases they refer to) is an essential step towards understanding dialogs. In this paper, we propose VD-PCR, a novel framework to improve Visual Dialog understanding with Pronoun Coreference Resolution in both implicit and explicit ways. First, to implicitly help models understand pronouns, we design novel methods to perform the joint training of the pronoun coreference resolution and visual dialog tasks. Second, after observing that the coreference relationship of pronouns and their referents indicates the relevance between dialog rounds, we propose to explicitly prune the irrelevant history rounds in visual dialog models' input. With pruned input, the models can focus on relevant dialog history and ignore the distraction in the irrelevant one. With the proposed implicit and explicit methods, VD-PCR achieves state-of-the-art experimental results on the VisDial dataset.
△ Less
Submitted 29 May, 2022;
originally announced May 2022.
-
METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation
Authors:
Ruixin Hong,
Hongming Zhang,
Xintong Yu,
Changshui Zhang
Abstract:
Knowing the reasoning chains from knowledge to the predicted answers can help construct an explainable question answering (QA) system. Advances on QA explanation propose to explain the answers with entailment trees composed of multiple entailment steps. While current work proposes to generate entailment trees with end-to-end generative models, the steps in the generated trees are not constrained a…
▽ More
Knowing the reasoning chains from knowledge to the predicted answers can help construct an explainable question answering (QA) system. Advances on QA explanation propose to explain the answers with entailment trees composed of multiple entailment steps. While current work proposes to generate entailment trees with end-to-end generative models, the steps in the generated trees are not constrained and could be unreliable. In this paper, we propose METGEN, a Module-based Entailment Tree GENeration framework that has multiple modules and a reasoning controller. Given a question and several supporting knowledge, METGEN can iteratively generate the entailment tree by conducting single-step entailment with separate modules and selecting the reasoning flow with the controller. As each module is guided to perform a specific type of entailment reasoning, the steps generated by METGEN are more reliable and valid. Experiment results on the standard benchmark show that METGEN can outperform previous state-of-the-art models with only 9% of the parameters.
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization
Authors:
Yangcheng Gao,
Zhao Zhang,
Richang Hong,
Haijun Zhang,
Jicong Fan,
Shuicheng Yan
Abstract:
To obtain lower inference latency and less memory footprint of deep neural networks, model quantization has been widely employed in deep model deployment, by converting the floating points to low-precision integers. However, previous methods (such as quantization aware training and post training quantization) require original data for the fine-tuning or calibration of quantized model, which makes…
▽ More
To obtain lower inference latency and less memory footprint of deep neural networks, model quantization has been widely employed in deep model deployment, by converting the floating points to low-precision integers. However, previous methods (such as quantization aware training and post training quantization) require original data for the fine-tuning or calibration of quantized model, which makes them inapplicable to the cases that original data are not accessed due to privacy or security. This gives birth to the data-free quantization method with synthetic data generation. While current data-free quantization methods still suffer from severe performance degradation when quantizing a model into lower bit, caused by the low inter-class separability of semantic features. To this end, we propose a new and effective data-free quantization method termed ClusterQ, which utilizes the feature distribution alignment for synthetic data generation. To obtain high inter-class separability of semantic features, we cluster and align the feature distribution statistics to imitate the distribution of real data, so that the performance degradation is alleviated. Moreover, we incorporate the diversity enhancement to solve class-wise mode collapse. We also employ the exponential moving average to update the centroid of each cluster for further feature distribution improvement. Extensive experiments based on different deep models (e.g., ResNet-18 and MobileNet-V2) over the ImageNet dataset demonstrate that our proposed ClusterQ model obtains state-of-the-art performance.
△ Less
Submitted 19 December, 2022; v1 submitted 30 April, 2022;
originally announced May 2022.
-
A Review-aware Graph Contrastive Learning Framework for Recommendation
Authors:
Jie Shuai,
Kun Zhang,
Le Wu,
Peijie Sun,
Richang Hong,
Meng Wang,
Yong Li
Abstract:
Most modern recommender systems predict users preferences with two components: user and item embedding learning, followed by the user-item interaction modeling. By utilizing the auxiliary review information accompanied with user ratings, many of the existing review-based recommendation models enriched user/item embedding learning ability with historical reviews or better modeled user-item interact…
▽ More
Most modern recommender systems predict users preferences with two components: user and item embedding learning, followed by the user-item interaction modeling. By utilizing the auxiliary review information accompanied with user ratings, many of the existing review-based recommendation models enriched user/item embedding learning ability with historical reviews or better modeled user-item interactions with the help of available user-item target reviews. Though significant progress has been made, we argue that current solutions for review-based recommendation suffer from two drawbacks. First, as review-based recommendation can be naturally formed as a user-item bipartite graph with edge features from corresponding user-item reviews, how to better exploit this unique graph structure for recommendation? Second, while most current models suffer from limited user behaviors, can we exploit the unique self-supervised signals in the review-aware graph to guide two recommendation components better? To this end, in this paper, we propose a novel Review-aware Graph Contrastive Learning (RGCL) framework for review-based recommendation. Specifically, we first construct a review-aware user-item graph with feature-enhanced edges from reviews, where each edge feature is composed of both the user-item rating and the corresponding review semantics. This graph with feature-enhanced edges can help attentively learn each neighbor node weight for user and item representation learning. After that, we design two additional contrastive learning tasks (i.e., Node Discrimination and Edge Discrimination) to provide self-supervised signals for the two components in recommendation process. Finally, extensive experiments over five benchmark datasets demonstrate the superiority of our proposed RGCL compared to the state-of-the-art baselines.
△ Less
Submitted 16 May, 2022; v1 submitted 25 April, 2022;
originally announced April 2022.
-
FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring
Authors:
Suiyi Zhao,
Zhao Zhang,
Richang Hong,
Mingliang Xu,
Yi Yang,
Meng Wang
Abstract:
Blind image deblurring (BID) remains a challenging and significant task. Benefiting from the strong fitting ability of deep learning, paired data-driven supervised BID method has obtained great progress. However, paired data are usually synthesized by hand, and the realistic blurs are more complex than synthetic ones, which makes the supervised methods inept at modeling realistic blurs and hinders…
▽ More
Blind image deblurring (BID) remains a challenging and significant task. Benefiting from the strong fitting ability of deep learning, paired data-driven supervised BID method has obtained great progress. However, paired data are usually synthesized by hand, and the realistic blurs are more complex than synthetic ones, which makes the supervised methods inept at modeling realistic blurs and hinders their real-world applications. As such, unsupervised deep BID method without paired data offers certain advantages, but current methods still suffer from some drawbacks, e.g., bulky model size, long inference time, and strict image resolution and domain requirements. In this paper, we propose a lightweight and real-time unsupervised BID baseline, termed Frequency-domain Contrastive Loss Constrained Lightweight CycleGAN (shortly, FCL-GAN), with attractive properties, i.e., no image domain limitation, no image resolution limitation, 25x lighter than SOTA, and 5x faster than SOTA. To guarantee the lightweight property and performance superiority, two new collaboration units called lightweight domain conversion unit(LDCU) and parameter-free frequency-domain contrastive unit(PFCU) are designed. LDCU mainly implements inter-domain conversion in lightweight manner. PFCU further explores the similarity measure, external difference and internal connection between the blurred domain and sharp domain images in frequency domain, without involving extra parameters. Extensive experiments on several image datasets demonstrate the effectiveness of our FCL-GAN in terms of performance, model size and reference time.
△ Less
Submitted 23 July, 2022; v1 submitted 16 April, 2022;
originally announced April 2022.
-
Automatic Depression Detection via Learning and Fusing Features from Visual Cues
Authors:
Yanrong Guo,
Chenyang Zhu,
Shijie Hao,
Richang Hong
Abstract:
Depression is one of the most prevalent mental disorders, which seriously affects one's life. Traditional depression diagnostics commonly depends on rating with scales, which can be labor-intensive and subjective. In this context, Automatic Depression Detection (ADD) has been attracting more attention for its low cost and objectivity. ADD systems are able to detect depression automatically from so…
▽ More
Depression is one of the most prevalent mental disorders, which seriously affects one's life. Traditional depression diagnostics commonly depends on rating with scales, which can be labor-intensive and subjective. In this context, Automatic Depression Detection (ADD) has been attracting more attention for its low cost and objectivity. ADD systems are able to detect depression automatically from some medical records, like video sequences. However, it remains challenging to effectively extract depression-specific information from long sequences, thereby hindering a satisfying accuracy. In this paper, we propose a novel ADD method via learning and fusing features from visual cues. Specifically, we firstly construct Temporal Dilated Convolutional Network (TDCN), in which multiple Dilated Convolution Blocks (DCB) are designed and stacked, to learn the long-range temporal information from sequences. Then, the Feature-Wise Attention (FWA) module is adopted to fuse different features extracted from TDCNs. The module learns to assign weights for the feature channels, aiming to better incorporate different kinds of visual features and further enhance the detection accuracy. Our method achieves the state-of-the-art performance on the DAIC_WOZ dataset compared to other visual-feature-based methods, showing its effectiveness.
△ Less
Submitted 1 March, 2022;
originally announced March 2022.
-
Observation of Long-Radial-Range-Correlation in Turbulence in High-Collisionality High-Confinement Fusion Plasmas
Authors:
R. Hong,
T. L. Rhodes,
P. H. Diamond,
Y. Ren,
L. Zeng,
X. Jian,
K. Barada,
G. Wang,
W. A. Peebles
Abstract:
We report on the observation of spatially asymmetric turbulent structures with a long radial correlation length in the core of high-collisionality H-mode plasmas on DIII-D tokamak. These turbulent structures develop from shorter wavelength turbulence and have a radially elongated structure. The envelope of turbulence spans a broad radial range in the mid-radius region, leading to streamer-like tra…
▽ More
We report on the observation of spatially asymmetric turbulent structures with a long radial correlation length in the core of high-collisionality H-mode plasmas on DIII-D tokamak. These turbulent structures develop from shorter wavelength turbulence and have a radially elongated structure. The envelope of turbulence spans a broad radial range in the mid-radius region, leading to streamer-like transport events. The underlying turbulence is featured by intermittency, long-term memory effect, and the characteristic spectrum of self-organized criticality. The amplitude and the radial scale increase substantially when the shearing rate of the mean flow is reduced below the turbulent scattering rate. The enhanced LRRC transport events are accompanied by apparent degradation of normalized energy confinement time. These findings constitute the first experimental observation of long-radial-range turbulent transport events in high-collisionality H-mode plasmas, and demonstrate the role of mean shear flows in the formation and propagation of turbulence with long-radial-range correlation.
△ Less
Submitted 15 February, 2023; v1 submitted 20 February, 2022;
originally announced February 2022.
-
Functional Tensor Network Solving Many-body Schrödinger Equation
Authors:
Rui Hong,
Ya-Xuan Xiao,
Jie Hu,
An-Chun Ji,
Shi-Ju Ran
Abstract:
Schrödinger equation belongs to the most fundamental differential equations in quantum physics. However, the exact solutions are extremely rare, and many analytical methods are applicable only to the cases with small perturbations or weak correlations. Solving the many-body Schrödinger equation in the continuous spaces with the presence of strong correlations is an extremely important and challeng…
▽ More
Schrödinger equation belongs to the most fundamental differential equations in quantum physics. However, the exact solutions are extremely rare, and many analytical methods are applicable only to the cases with small perturbations or weak correlations. Solving the many-body Schrödinger equation in the continuous spaces with the presence of strong correlations is an extremely important and challenging issue. In this work, we propose the functional tensor network (FTN) approach to solve the many-body Schrödinger equation. Provided the orthonormal functional bases, we represent the coefficients of the many-body wave-function as tensor network. The observables, such as energy, can be calculated simply by tensor contractions. Simulating the ground state becomes solving a minimization problem defined by the tensor network. An efficient gradient-decent algorithm based on the automatically differentiable tensors is proposed. We here take matrix product state (MPS) as an example, whose complexity scales only linearly with the system size. We apply our approach to solve the ground state of coupled harmonic oscillators, and achieve high accuracy by comparing with the exact solutions. Reliable results are also given with the presence of three-body interactions, where the system cannot be decoupled to isolated oscillators. Our approach is simple and with well-controlled error, superior to the highly-nonlinear neural-network solvers. Our work extends the applications of tensor network from quantum lattice models to the systems in the continuous space. FTN can be used as a general solver of the differential equations with many variables. The MPS exemplified here can be generalized to, e.g., the fermionic tensor networks, to solve the electronic Schrödinger equation.
△ Less
Submitted 30 January, 2022;
originally announced January 2022.
-
High-Accuracy Absolute Magnetometry with Application to the Fermilab Muon $g-2$ Experiment
Authors:
D. Flay,
D. Kawall,
T. Chupp,
S. Corrodi,
M. Farooq,
M. Fertl,
J. George,
J. Grange,
R. Hong,
R. Osofsky,
S. Ramachandran,
E. Swanson,
P. Winter
Abstract:
We present details of a high-accuracy absolute scalar magnetometer based on pulsed proton NMR. The $B$-field magnitude is determined from the precession frequency of proton spins in a cylindrical sample of water after accounting for field perturbations from probe materials, sample shape, and other corrections. Features of the design, testing procedures, and corrections necessary for qualification…
▽ More
We present details of a high-accuracy absolute scalar magnetometer based on pulsed proton NMR. The $B$-field magnitude is determined from the precession frequency of proton spins in a cylindrical sample of water after accounting for field perturbations from probe materials, sample shape, and other corrections. Features of the design, testing procedures, and corrections necessary for qualification as an absolute scalar magnetometer are described. The device was tested at $B = 1.45$\,T but can be modified for a range exceeding 1--3\,T. The magnetometer was used to calibrate other NMR magnetometers and measure absolute magnetic field magnitudes to an accuracy of 19 parts per billion as part of a measurement of the muon magnetic moment anomaly at Fermilab.
△ Less
Submitted 22 December, 2021; v1 submitted 18 September, 2021;
originally announced September 2021.
-
Few-shot Learning with Global Relatedness Decoupled-Distillation
Authors:
Yuan Zhou,
Yanrong Guo,
Shijie Hao,
Richang Hong,
Zhengjun Zha,
Meng Wang
Abstract:
Despite the success that metric learning based approaches have achieved in few-shot learning, recent works reveal the ineffectiveness of their episodic training mode. In this paper, we point out two potential reasons for this problem: 1) the random episodic labels can only provide limited supervision information, while the relatedness information between the query and support samples is not fully…
▽ More
Despite the success that metric learning based approaches have achieved in few-shot learning, recent works reveal the ineffectiveness of their episodic training mode. In this paper, we point out two potential reasons for this problem: 1) the random episodic labels can only provide limited supervision information, while the relatedness information between the query and support samples is not fully exploited; 2) the meta-learner is usually constrained by the limited contextual information of the local episode. To overcome these problems, we propose a new Global Relatedness Decoupled-Distillation (GRDD) method using the global category knowledge and the Relatedness Decoupled-Distillation (RDD) strategy. Our GRDD learns new visual concepts quickly by imitating the habit of humans, i.e. learning from the deep knowledge distilled from the teacher. More specifically, we first train a global learner on the entire base subset using category labels as supervision to leverage the global context information of the categories. Then, the well-trained global learner is used to simulate the query-support relatedness in global dependencies. Finally, the distilled global query-support relatedness is explicitly used to train the meta-learner using the RDD strategy, with the goal of making the meta-learner more discriminative. The RDD strategy aims to decouple the dense query-support relatedness into the groups of sparse decoupled relatedness. Moreover, only the relatedness of a single support sample with other query samples is considered in each group. By distilling the sparse decoupled relatedness group by group, sharper relatedness can be effectively distilled to the meta-learner, thereby facilitating the learning of a discriminative meta-learner. We conduct extensive experiments on the miniImagenet and CIFAR-FS datasets, which show the state-of-the-art performance of our GRDD method.
△ Less
Submitted 13 December, 2022; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Predicting Quantum Potentials by Deep Neural Network and Metropolis Sampling
Authors:
Rui Hong,
Peng-Fei Zhou,
Bin Xi,
Jie Hu,
An-Chun Ji,
Shi-Ju Ran
Abstract:
The hybridizations of machine learning and quantum physics have caused essential impacts to the methodology in both fields. Inspired by quantum potential neural network, we here propose to solve the potential in the Schrodinger equation provided the eigenstate, by combining Metropolis sampling with deep neural network, which we dub as Metropolis potential neural network (MPNN). A loss function is…
▽ More
The hybridizations of machine learning and quantum physics have caused essential impacts to the methodology in both fields. Inspired by quantum potential neural network, we here propose to solve the potential in the Schrodinger equation provided the eigenstate, by combining Metropolis sampling with deep neural network, which we dub as Metropolis potential neural network (MPNN). A loss function is proposed to explicitly involve the energy in the optimization for its accurate evaluation. Benchmarking on the harmonic oscillator and hydrogen atom, MPNN shows excellent accuracy and stability on predicting not just the potential to satisfy the Schrodinger equation, but also the eigen-energy. Our proposal could be potentially applied to the ab-initio simulations, and to inversely solving other partial differential equations in physics and beyond.
△ Less
Submitted 8 August, 2021; v1 submitted 6 June, 2021;
originally announced June 2021.
-
Privileged Graph Distillation for Cold Start Recommendation
Authors:
Shuai Wang,
Kun Zhang,
Le Wu,
Hai** Ma,
Richang Hong,
Meng Wang
Abstract:
The cold start problem in recommender systems is a long-standing challenge, which requires recommending to new users (items) based on attributes without any historical interaction records. In these recommendation systems, warm users (items) have privileged collaborative signals of interaction records compared to cold start users (items), and these Collaborative Filtering (CF) signals are shown to…
▽ More
The cold start problem in recommender systems is a long-standing challenge, which requires recommending to new users (items) based on attributes without any historical interaction records. In these recommendation systems, warm users (items) have privileged collaborative signals of interaction records compared to cold start users (items), and these Collaborative Filtering (CF) signals are shown to have competing performance for recommendation. Many researchers proposed to learn the correlation between collaborative signal embedding space and the attribute embedding space to improve the cold start recommendation, in which user and item categorical attributes are available in many online platforms. However, the cold start recommendation is still limited by two embedding spaces modeling and simple assumptions of space transformation. As user-item interaction behaviors and user (item) attributes naturally form a heterogeneous graph structure, in this paper, we propose a privileged graph distillation model~(PGD). The teacher model is composed of a heterogeneous graph structure for warm users and items with privileged CF links. The student model is composed of an entity-attribute graph without CF links. Specifically, the teacher model can learn better embeddings of each entity by injecting complex higher-order relationships from the constructed heterogeneous graph. The student model can learn the distilled output with privileged CF embeddings from the teacher embeddings. Our proposed model is generally applicable to different cold start scenarios with new user, new item, or new user-new item. Finally, extensive experimental results on the real-world datasets clearly show the effectiveness of our proposed model on different types of cold start problems, with average $6.6\%, 5.6\%, $ and $17.1\%$ improvement over state-of-the-art baselines on three datasets, respectively.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Ray-tracing Analysis for Cross-polarization Scattering Diagnostic on MAST-Upgrade Spherical Tokamak
Authors:
R. Hong,
T. L. Rhodes,
G. Wang,
W. A. Peebles
Abstract:
A combined Doppler backscattering/cross-polarization scattering (DBS/CPS) system is being deployed on MAST-U, for simultaneous measurements of local density turbulence, turbulence flows, and magnetic turbulence. In this design, CPS shares the probing beam with the DBS and uses a separate parallel-viewing receiver system. In this study, we utilize a modified GENRAY 3D ray-tracing code, to simulate…
▽ More
A combined Doppler backscattering/cross-polarization scattering (DBS/CPS) system is being deployed on MAST-U, for simultaneous measurements of local density turbulence, turbulence flows, and magnetic turbulence. In this design, CPS shares the probing beam with the DBS and uses a separate parallel-viewing receiver system. In this study, we utilize a modified GENRAY 3D ray-tracing code, to simulate the propagation of the probing and scattered beams. The contributions of different scattering locations along the entire beam trajectories are considered, and the corresponding local $\tilde{\mathbf{B}}$ wavenumbers are estimated using the wave-vector matching criterion. The wavenumber ranges of the local $\tilde{\mathbf{B}}$ that is detectable to the CPS system are explored for simulated L- and H-mode plasmas.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation
Authors:
Lei Chen,
Le Wu,
Kun Zhang,
Richang Hong,
Meng Wang
Abstract:
As users often express their preferences with binary behavior data~(implicit feedback), such as clicking items or buying products, implicit feedback based Collaborative Filtering~(CF) models predict the top ranked items a user might like by leveraging implicit user-item interaction data. For each user, the implicit feedback is divided into two sets: an observed item set with limited observed behav…
▽ More
As users often express their preferences with binary behavior data~(implicit feedback), such as clicking items or buying products, implicit feedback based Collaborative Filtering~(CF) models predict the top ranked items a user might like by leveraging implicit user-item interaction data. For each user, the implicit feedback is divided into two sets: an observed item set with limited observed behaviors, and a large unobserved item set that is mixed with negative item behaviors and unknown behaviors. Given any user preference prediction model, researchers either designed ranking based optimization goals or relied on negative item mining techniques for better optimization. Despite the performance gain of these implicit feedback based models, the recommendation results are still far from satisfactory due to the sparsity of the observed item set for each user. To this end, in this paper, we explore the unique characteristics of the implicit feedback and propose Set2setRank framework for recommendation. The optimization criteria of Set2setRank are two folds: First, we design an item to an item set comparison that encourages each observed item from the sampled observed set is ranked higher than any unobserved item from the sampled unobserved set. Second, we model set level comparison that encourages a margin between the distance summarized from the observed item set and the most "hard" unobserved item from the sampled negative set. Further, an adaptive sampling technique is designed to implement these two goals. We have to note that our proposed framework is model-agnostic and can be easily applied to most recommendation prediction approaches, and is time efficient in practice. Finally, extensive experiments on three real-world datasets demonstrate the superiority of our proposed approach.
△ Less
Submitted 26 May, 2021; v1 submitted 16 May, 2021;
originally announced May 2021.
-
Few-shot Partial Multi-view Learning
Authors:
Yuan Zhou,
Yanrong Guo,
Shijie Hao,
Richang Hong,
Jiebo Luo
Abstract:
It is often the case that data are with multiple views in real-world applications. Fully exploring the information of each view is significant for making data more representative. However, due to various limitations and failures in data collection and pre-processing, it is inevitable for real data to suffer from view missing and data scarcity. The coexistence of these two issues makes it more chal…
▽ More
It is often the case that data are with multiple views in real-world applications. Fully exploring the information of each view is significant for making data more representative. However, due to various limitations and failures in data collection and pre-processing, it is inevitable for real data to suffer from view missing and data scarcity. The coexistence of these two issues makes it more challenging to achieve the pattern classification task. Currently, to our best knowledge, few appropriate methods can well-handle these two issues simultaneously. Aiming to draw more attention from the community to this challenge, we propose a new task in this paper, called few-shot partial multi-view learning, which focuses on overcoming the negative impact of the view-missing issue in the low-data regime. The challenges of this task are twofold: (i) it is difficult to overcome the impact of data scarcity under the interference of missing views; (ii) the limited number of data exacerbates information scarcity, thus making it harder to address the view-missing issue in turn. To address these challenges, we propose a new unified Gaussian dense-anchoring method. The unified dense anchors are learned for the limited partial multi-view data, thereby anchoring them into a unified dense representation space where the influence of data scarcity and view missing can be alleviated. We conduct extensive experiments to evaluate our method. The results on Cub-googlenet-doc2vec, Handwritten, Caltech102, Scene15, Animal, ORL, tieredImagenet, and Birds-200-2011 datasets validate its effectiveness.
△ Less
Submitted 18 May, 2023; v1 submitted 5 May, 2021;
originally announced May 2021.
-
Automatically Differentiable Quantum Circuit for Many-qubit State Preparation
Authors:
Peng-Fei Zhou,
Rui Hong,
Shi-Ju Ran
Abstract:
Constructing quantum circuits for efficient state preparation belongs to the central topics in the field of quantum information and computation. As the number of qubits grows fast, methods to derive large-scale quantum circuits are strongly desired. In this work, we propose the automatically differentiable quantum circuit (ADQC) approach to efficiently prepare arbitrary quantum many-qubit states.…
▽ More
Constructing quantum circuits for efficient state preparation belongs to the central topics in the field of quantum information and computation. As the number of qubits grows fast, methods to derive large-scale quantum circuits are strongly desired. In this work, we propose the automatically differentiable quantum circuit (ADQC) approach to efficiently prepare arbitrary quantum many-qubit states. A key ingredient is to introduce the latent gates whose decompositions give the unitary gates that form the quantum circuit. The circuit is optimized by updating the latent gates using back propagation to minimize the distance between the evolved and target states. Taking the ground states of quantum lattice models and random matrix product states as examples, with the number of qubits where processing the full coefficients is unlikely, ADQC obtains high fidelities with small numbers of layers $N_L \sim O(1)$. Superior accuracy is reached compared with the existing state-preparation approach based on the matrix product disentangler. The parameter complexity of MPS can be significantly reduced by ADQC with the compression ratio $r \sim O(10^{-3})$. Our work sheds light on the "intelligent construction" of quantum circuits for many-qubit systems by combining with the machine learning methods.
△ Less
Submitted 30 April, 2021;
originally announced April 2021.
-
Effect of Interfacial Dipole on Heterogeneous Ice Nucleation
Authors:
Hao Lu,
Quanming Xu,
Jianyang Wu,
Rongdun Hong,
Zhisen Zhang
Abstract:
In this letter, we performed molecular dynamics simulations of ice nucleation on a rigid surface model of cubic zinc blende structure with different surface dipole strength and orientation. It follows that, despite the excellent lattice match between cubic ice and substrates, the ice nucleation happened only when the interfacial water molecules (IWs) have the same or similar orientations as that o…
▽ More
In this letter, we performed molecular dynamics simulations of ice nucleation on a rigid surface model of cubic zinc blende structure with different surface dipole strength and orientation. It follows that, despite the excellent lattice match between cubic ice and substrates, the ice nucleation happened only when the interfacial water molecules (IWs) have the same or similar orientations as that of the water molecules in cubic ice. The free energy landscapes revealed that, for substrates with improper dipole strength/orientation, large free energy barriers arose to prevent the dipole of IWs rotating to the right orientation to trigger ice formation. Our results suggest that the traditional concept of lattice match, the similarity of lattice length between a substrate and the new-formed crystalline, should be extended to a broader match include the similarity between the molecular orientations of the interfacial component and the component in the specific new-formed crystalline face.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Measurement of the Positive Muon Anomalous Magnetic Moment to 0.46 ppm
Authors:
B. Abi,
T. Albahri,
S. Al-Kilani,
D. Allspach,
L. P. Alonzi,
A. Anastasi,
A. Anisenkov,
F. Azfar,
K. Badgley,
S. Baeßler,
I. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
A. Basti,
F. Bedeschi,
A. Behnke,
M. Berz,
M. Bhattacharya,
H. P. Binney,
R. Bjorkquist,
P. Bloom,
J. Bono,
E. Bottalico
, et al. (212 additional authors not shown)
Abstract:
We present the first results of the Fermilab Muon g-2 Experiment for the positive muon magnetic anomaly $a_μ\equiv (g_μ-2)/2$. The anomaly is determined from the precision measurements of two angular frequencies. Intensity variation of high-energy positrons from muon decays directly encodes the difference frequency $ω_a$ between the spin-precession and cyclotron frequencies for polarized muons in…
▽ More
We present the first results of the Fermilab Muon g-2 Experiment for the positive muon magnetic anomaly $a_μ\equiv (g_μ-2)/2$. The anomaly is determined from the precision measurements of two angular frequencies. Intensity variation of high-energy positrons from muon decays directly encodes the difference frequency $ω_a$ between the spin-precession and cyclotron frequencies for polarized muons in a magnetic storage ring. The storage ring magnetic field is measured using nuclear magnetic resonance probes calibrated in terms of the equivalent proton spin precession frequency ${\tildeω'^{}_p}$ in a spherical water sample at 34.7$^{\circ}$C. The ratio $ω_a / {\tildeω'^{}_p}$, together with known fundamental constants, determines $a_μ({\rm FNAL}) = 116\,592\,040(54)\times 10^{-11}$ (0.46\,ppm). The result is 3.3 standard deviations greater than the standard model prediction and is in excellent agreement with the previous Brookhaven National Laboratory (BNL) E821 measurement. After combination with previous measurements of both $μ^+$ and $μ^-$, the new experimental average of $a_μ({\rm Exp}) = 116\,592\,061(41)\times 10^{-11}$ (0.35\,ppm) increases the tension between experiment and theory to 4.2 standard deviations
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
Measurement of the anomalous precession frequency of the muon in the Fermilab Muon g-2 experiment
Authors:
T. Albahri,
A. Anastasi,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
A. Basti,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
G. Cantatore,
R. M. Carey,
B. C. K. Casey,
D. Cauz,
R. Chakraborty,
S. P. Chang,
A. Chapelain
, et al. (153 additional authors not shown)
Abstract:
The Muon g-2 Experiment at Fermi National Accelerator Laboratory (FNAL) has measured the muon anomalous precession frequency $ω_a$ to an uncertainty of 434 parts per billion (ppb), statistical, and 56 ppb, systematic, with data collected in four storage ring configurations during its first physics run in 2018. When combined with a precision measurement of the magnetic field of the experiment's muo…
▽ More
The Muon g-2 Experiment at Fermi National Accelerator Laboratory (FNAL) has measured the muon anomalous precession frequency $ω_a$ to an uncertainty of 434 parts per billion (ppb), statistical, and 56 ppb, systematic, with data collected in four storage ring configurations during its first physics run in 2018. When combined with a precision measurement of the magnetic field of the experiment's muon storage ring, the precession frequency measurement determines a muon magnetic anomaly of $a_μ({\rm FNAL}) = 116\,592\,040(54) \times 10^{-11}$ (0.46 ppm). This article describes the multiple techniques employed in the reconstruction, analysis and fitting of the data to measure the precession frequency. It also presents the averaging of the results from the eleven separate determinations of ω_a, and the systematic uncertainties on the result.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
Beam dynamics corrections to the Run-1 measurement of the muon anomalous magnetic moment at Fermilab
Authors:
T. Albahri,
A. Anastasi,
K. Badgley,
S. Baeßler,
I. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
G. Cantatore,
R. M. Carey,
B. C. K. Casey,
D. Cauz,
R. Chakraborty,
S. P. Chang,
A. Chapelain,
S. Charity,
R. Chislett
, et al. (152 additional authors not shown)
Abstract:
This paper presents the beam dynamics systematic corrections and their uncertainties for the Run-1 data set of the Fermilab Muon g-2 Experiment. Two corrections to the measured muon precession frequency $ω_a^m$ are associated with well-known effects owing to the use of electrostatic quadrupole (ESQ) vertical focusing in the storage ring. An average vertically oriented motional magnetic field is fe…
▽ More
This paper presents the beam dynamics systematic corrections and their uncertainties for the Run-1 data set of the Fermilab Muon g-2 Experiment. Two corrections to the measured muon precession frequency $ω_a^m$ are associated with well-known effects owing to the use of electrostatic quadrupole (ESQ) vertical focusing in the storage ring. An average vertically oriented motional magnetic field is felt by relativistic muons passing transversely through the radial electric field components created by the ESQ system. The correction depends on the stored momentum distribution and the tunes of the ring, which has relatively weak vertical focusing. Vertical betatron motions imply that the muons do not orbit the ring in a plane exactly orthogonal to the vertical magnetic field direction. A correction is necessary to account for an average pitch angle associated with their trajectories. A third small correction is necessary because muons that escape the ring during the storage time are slightly biased in initial spin phase compared to the parent distribution. Finally, because two high-voltage resistors in the ESQ network had longer than designed RC time constants, the vertical and horizontal centroids and envelopes of the stored muon beam drifted slightly, but coherently, during each storage ring fill. This led to the discovery of an important phase-acceptance relationship that requires a correction. The sum of the corrections to $ω_a^m$ is 0.50 $\pm$ 0.09 ppm; the uncertainty is small compared to the 0.43 ppm statistical precision of $ω_a^m$.
△ Less
Submitted 23 April, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Magnetic Field Measurement and Analysis for the Muon g-2 Experiment at Fermilab
Authors:
T. Albahri,
A. Anastasi,
K. Badgley,
S. Baeßler,
I. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
G. Cantatore,
R. M. Carey,
B. C. K. Casey,
D. Cauz,
R. Chakraborty,
S. P. Chang,
A. Chapelain,
S. Charity,
R. Chislett
, et al. (148 additional authors not shown)
Abstract:
The Fermi National Accelerator Laboratory has measured the anomalous precession frequency $a^{}_μ= (g^{}_μ-2)/2$ of the muon to a combined precision of 0.46 parts per million with data collected during its first physics run in 2018. This paper documents the measurement of the magnetic field in the muon storage ring. The magnetic field is monitored by nuclear magnetic resonance systems and calibrat…
▽ More
The Fermi National Accelerator Laboratory has measured the anomalous precession frequency $a^{}_μ= (g^{}_μ-2)/2$ of the muon to a combined precision of 0.46 parts per million with data collected during its first physics run in 2018. This paper documents the measurement of the magnetic field in the muon storage ring. The magnetic field is monitored by nuclear magnetic resonance systems and calibrated in terms of the equivalent proton spin precession frequency in a spherical water sample at 34.7$^\circ$C. The measured field is weighted by the muon distribution resulting in $\tildeω'^{}_p$, the denominator in the ratio $ω^{}_a$/$\tildeω'^{}_p$ that together with known fundamental constants yields $a^{}_μ$. The reported uncertainty on $\tildeω'^{}_p$ for the Run-1 data set is 114 ppb consisting of uncertainty contributions from frequency extraction, calibration, map**, tracking, and averaging of 56 ppb, and contributions from fast transient fields of 99 ppb.
△ Less
Submitted 17 June, 2022; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning
Authors:
Jianfeng Dong,
Zhe Ma,
Xiaofeng Mao,
Xun Yang,
Yuan He,
Richang Hong,
Shouling Ji
Abstract:
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one should pay more attention to the similarity in terms of a specific design/attribute between fashion items. For example, whether the collar designs of the two clothes are similar. It has potential value in many fashion related applications, such as fashion copyright protection. To this end, we propose an…
▽ More
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one should pay more attention to the similarity in terms of a specific design/attribute between fashion items. For example, whether the collar designs of the two clothes are similar. It has potential value in many fashion related applications, such as fashion copyright protection. To this end, we propose an Attribute-Specific Embedding Network (ASEN) to jointly learn multiple attribute-specific embeddings, thus measure the fine-grained similarity in the corresponding space. The proposed ASEN is comprised of a global branch and a local branch. The global branch takes the whole image as input to extract features from a global perspective, while the local branch takes as input the zoomed-in region-of-interest (RoI) w.r.t. the specified attribute thus able to extract more fine-grained features. As the global branch and the local branch extract the features from different perspectives, they are complementary to each other. Additionally, in each branch, two attention modules, i.e., Attribute-aware Spatial Attention and Attribute-aware Channel Attention, are integrated to make ASEN be able to locate the related regions and capture the essential patterns under the guidance of the specified attribute, thus make the learned attribute-specific embeddings better reflect the fine-grained similarity. Extensive experiments on three fashion-related datasets, i.e., FashionAI, DARN, and DeepFashion, show the effectiveness of ASEN for fine-grained fashion similarity prediction and its potential for fashion reranking. Code and data are available at https://github.com/maryeon/asenpp .
△ Less
Submitted 11 October, 2021; v1 submitted 6 April, 2021;
originally announced April 2021.
-
Revisiting Local Descriptor for Improved Few-Shot Classification
Authors:
Jun He,
Richang Hong,
Xueliang Liu,
Mingliang Xu,
Qianru Sun
Abstract:
Few-shot classification studies the problem of quickly adapting a deep learner to understanding novel classes based on few support images. In this context, recent research efforts have been aimed at designing more and more complex classifiers that measure similarities between query and support images, but left the importance of feature embeddings seldom explored. We show that the reliance on sophi…
▽ More
Few-shot classification studies the problem of quickly adapting a deep learner to understanding novel classes based on few support images. In this context, recent research efforts have been aimed at designing more and more complex classifiers that measure similarities between query and support images, but left the importance of feature embeddings seldom explored. We show that the reliance on sophisticated classifiers is not necessary, and a simple classifier applied directly to improved feature embeddings can instead outperform most of the leading methods in the literature. To this end, we present a new method named \textbf{DCAP} for few-shot classification, in which we investigate how one can improve the quality of embeddings by leveraging \textbf{D}ense \textbf{C}lassification and \textbf{A}ttentive \textbf{P}ooling. Specifically, we propose to train a learner on base classes with abundant samples to solve dense classification problem first and then meta-train the learner on a bunch of randomly sampled few-shot tasks to adapt it to few-shot scenario or the test time scenario. During meta-training, we suggest to pool feature maps by applying attentive pooling instead of the widely used global average pooling (GAP) to prepare embeddings for few-shot classification. Attentive pooling learns to reweight local descriptors, explaining what the learner is looking for as evidence for decision making. Experiments on two benchmark datasets show the proposed method to be superior in multiple few-shot settings while being simpler and more explainable. Code is available at: \url{https://github.com/Ukeyboard/dcap/}.
△ Less
Submitted 18 February, 2022; v1 submitted 29 March, 2021;
originally announced March 2021.
-
Learning Fair Representations for Recommendation: A Graph-based Perspective
Authors:
Le Wu,
Lei Chen,
Pengyang Shao,
Richang Hong,
Xiting Wang,
Meng Wang
Abstract:
As a key application of artificial intelligence, recommender systems are among the most pervasive computer aided systems to help users find potential items of interests. Recently, researchers paid considerable attention to fairness issues for artificial intelligence applications. Most of these approaches assumed independence of instances, and designed sophisticated models to eliminate the sensitiv…
▽ More
As a key application of artificial intelligence, recommender systems are among the most pervasive computer aided systems to help users find potential items of interests. Recently, researchers paid considerable attention to fairness issues for artificial intelligence applications. Most of these approaches assumed independence of instances, and designed sophisticated models to eliminate the sensitive information to facilitate fairness. However, recommender systems differ greatly from these approaches as users and items naturally form a user-item bipartite graph, and are collaboratively correlated in the graph structure. In this paper, we propose a novel graph based technique for ensuring fairness of any recommendation models. Here, the fairness requirements refer to not exposing sensitive feature set in the user modeling process. Specifically, given the original embeddings from any recommendation models, we learn a composition of filters that transform each user's and each item's original embeddings into a filtered embedding space based on the sensitive feature set. For each user, this transformation is achieved under the adversarial learning of a user-centric graph, in order to obfuscate each sensitive feature between both the filtered user embedding and the sub graph structures of this user. Finally, extensive experimental results clearly show the effectiveness of our proposed model for fair recommendation. We publish the source code at https://github.com/newlei/FairGo.
△ Less
Submitted 23 April, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
Systematic and Statistical Uncertainties of the Hilbert-Transform Based High-precision FID Frequency Extraction Method
Authors:
Ran Hong,
Simon Corrodi,
Saskia Charity,
Stefan Baessler,
Jason Bono,
Timothy Chupp,
Martin Fertl,
David Flay,
Alejandro Garcia,
Jimin George,
Kevin Louis Giovanetti,
Timothy Gorringe,
Joseph Grange,
Kyun Woo Hong,
David Kawall,
Brendan Kiburg,
Bingzhi Li,
Rachel Osofsky,
Dinko Pocanic,
Suvarna Ramachandran,
Matthias Smith,
Herbert Erik Swanson,
Alec Tewsley-Booth,
Peter Winter,
Tianyu Yang
, et al. (1 additional authors not shown)
Abstract:
Pulsed nuclear magnetic resonance (NMR) is widely used in high-precision magnetic field measurements. The absolute value of the magnetic field is determined from the precession frequency of nuclear magnetic moments. The Hilbert transform is widely used to extract the phase function from the observed free induction decay (FID) signal and then its frequency. In this paper, a detailed implementation…
▽ More
Pulsed nuclear magnetic resonance (NMR) is widely used in high-precision magnetic field measurements. The absolute value of the magnetic field is determined from the precession frequency of nuclear magnetic moments. The Hilbert transform is widely used to extract the phase function from the observed free induction decay (FID) signal and then its frequency. In this paper, a detailed implementation of a Hilbert-transform based FID frequency extraction method is described. How artifacts and noise level in the FID signal affect the extracted phase function are derived analytically. A method of mitigating the artifacts in the extracted phase function of an FID is discussed. Correlations between noises of the phase function samples are studied for different noise spectra. We discovered that the error covariance matrix for the extracted phase function is nearly singular and improper for constructing the $χ^2$ used in the fitting routine. A down-sampling method for fixing the singular covariance matrix has been developed, so that the minimum $χ^2$-fit yields properly the statistical uncertainty of the extracted frequency. Other practical methods of obtaining the statistical uncertainty are also discussed.
△ Less
Submitted 27 January, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Influence of geometric structure, convection, and eddy on sound propagation in acoustic metamaterial with turbulent flow
Authors:
Myong Chol Pak,
Kwang-Il Kim,
Hak Chol Pak,
Kwon Ryong Hong
Abstract:
The problem of reducing noise in the transportation is an important research field to prevent accidents and to provide a civilized environment for people. A material that has recently attracted attention in research to reduce noise is acoustic metamaterial, and most of the research projects so far have been limited to the case of static media without flow. We have studied the sound transmission pr…
▽ More
The problem of reducing noise in the transportation is an important research field to prevent accidents and to provide a civilized environment for people. A material that has recently attracted attention in research to reduce noise is acoustic metamaterial, and most of the research projects so far have been limited to the case of static media without flow. We have studied the sound transmission properties of acoustic metamaterial with turbulent flow to develop acoustic metamaterial that be used in transportation. In this paper, the effect of geometrical structure, the convective effect, and the eddy effect on sound propagation in acoustic metamaterial with turbulent flow are investigated, and the relationships between them are analyzed. The convective effect and the eddy effect both reduce the resonant strength of sound transmission loss resulting from the unique geometry of the acoustic crystal, but shift the resonant frequencies in opposite directions. In addition, when the convective effect and the eddy effect of the airflow, as well as the intrinsic interaction effect generated from the unique geometrical structure of the acoustic metamaterial cannot be ignored, they exhibit competition phenomena with each other, resulting in a widening of the resonance peak. As a result, these three effects cause the shift of the resonance frequency of the sound transmission loss and the widening of the resonance peak. The results of this study show that even in the case of turbulent flow, acoustic metamaterial can be used for transportation by properly controlling the geometric size and shape of the acoustic metamaterial.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
Observation of quasi-coherent density fluctuation in scrape-off layer enhancing boundary transport in high-betaN hybrid plasmas on DIII-D
Authors:
R. Hong,
T. L. Rhodes,
Z. Y. Li,
H. Wang,
L. Zeng,
K. Barada,
G. Wang,
J. G. Watkins,
W. A. Peebles
Abstract:
We report the observation of a quasi-coherent density fluctuation (QCF) by the Doppler backscattering system in the scrape-off layer (SOL) region of the DIII-D tokamak. This QCF is observed in high-power, high-performance hybrid plasmas with near double-null divertor (DND) shape during the electron cyclotron heating period. This mode is correlated with a steepened SOL density profile and leads to…
▽ More
We report the observation of a quasi-coherent density fluctuation (QCF) by the Doppler backscattering system in the scrape-off layer (SOL) region of the DIII-D tokamak. This QCF is observed in high-power, high-performance hybrid plasmas with near double-null divertor (DND) shape during the electron cyclotron heating period. This mode is correlated with a steepened SOL density profile and leads to significantly elevated particle and heat fluxes between ELMs. The SOL QCF is a long-wavelength ion-scale fluctuation and propagates in the ion diamagnetic direction in the plasma frame. Its radial expanse is about 1.5-2 cm, well beyond the typical width of heat flux on DIII-D. Also, the SOL QCF does not show any clear dependence on the effective SOL collisionality and thus may raise issues on the control of plasma-material interactions in low collisionality plasmas in which the blob-induced transport is reduced. A linear simulation using BOUT++ with a 5-field reduced model is performed and compared with experimental observations. In simulation results, an interchange-like density perturbation can be driven by the SOL density gradient, and its peak location and the radial width of the density perturbation are in agreement with the experimental observations.
△ Less
Submitted 19 March, 2021; v1 submitted 10 November, 2020;
originally announced November 2020.
-
Recyclable Waste Identification Using CNN Image Recognition and Gaussian Clustering
Authors:
Yuheng Wang,
Wen Jie Zhao,
Jiahui Xu,
Raymond Hong
Abstract:
Waste recycling is an important way of saving energy and materials in the production process. In general cases recyclable objects are mixed with unrecyclable objects, which raises a need for identification and classification. This paper proposes a convolutional neural network (CNN) model to complete both tasks. The model uses transfer learning from a pretrained Resnet-50 CNN to complete feature ex…
▽ More
Waste recycling is an important way of saving energy and materials in the production process. In general cases recyclable objects are mixed with unrecyclable objects, which raises a need for identification and classification. This paper proposes a convolutional neural network (CNN) model to complete both tasks. The model uses transfer learning from a pretrained Resnet-50 CNN to complete feature extraction. A subsequent fully connected layer for classification was trained on the augmented TrashNet dataset [1]. In the application, sliding-window is used for image segmentation in the pre-classification stage. In the post-classification stage, the labelled sample points are integrated with Gaussian Clustering to locate the object. The resulting model has achieved an overall detection rate of 48.4% in simulation and final classification accuracy of 92.4%.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
One-bit Supervision for Image Classification
Authors:
Hengtong Hu,
Lingxi Xie,
Zewei Du,
Richang Hong,
Qi Tian
Abstract:
This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our setting requires the model to query with a predicted label of each sample and learn from the answer whether the guess is correct. This provides one bit (yes or no) of information, and more…
▽ More
This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our setting requires the model to query with a predicted label of each sample and learn from the answer whether the guess is correct. This provides one bit (yes or no) of information, and more importantly, annotating each sample becomes much easier than finding the accurate label from many candidate classes. There are two keys to training a model upon one-bit supervision: improving the guess accuracy and making use of incorrect guesses. For these purposes, we propose a multi-stage training paradigm which incorporates negative label suppression into an off-the-shelf semi-supervised learning algorithm. In three popular image classification benchmarks, our approach claims higher efficiency in utilizing the limited amount of annotations.
△ Less
Submitted 11 May, 2021; v1 submitted 13 September, 2020;
originally announced September 2020.
-
Towards Evaluating Exploratory Model Building Process with AutoML Systems
Authors:
Sungsoo Ray Hong,
Sonia Castelo,
Vito D'Orazio,
Christopher Benthune,
Aecio Santos,
Scott Langevin,
David Jonker,
Enrico Bertini,
Juliana Freire
Abstract:
The use of Automated Machine Learning (AutoML) systems are highly open-ended and exploratory. While rigorously evaluating how end-users interact with AutoML is crucial, establishing a robust evaluation methodology for such exploratory systems is challenging. First, AutoML is complex, including multiple sub-components that support a variety of sub-tasks for synthesizing ML pipelines, such as data p…
▽ More
The use of Automated Machine Learning (AutoML) systems are highly open-ended and exploratory. While rigorously evaluating how end-users interact with AutoML is crucial, establishing a robust evaluation methodology for such exploratory systems is challenging. First, AutoML is complex, including multiple sub-components that support a variety of sub-tasks for synthesizing ML pipelines, such as data preparation, problem specification, and model generation, making it difficult to yield insights that tell us which components were successful or not. Second, because the usage pattern of AutoML is highly exploratory, it is not possible to rely solely on widely used task efficiency and effectiveness metrics as success metrics. To tackle the challenges in evaluation, we propose an evaluation methodology that (1) guides AutoML builders to divide their AutoML system into multiple sub-system components, and (2) helps them reason about each component through visualization of end-users' behavioral patterns and attitudinal data. We conducted a study to understand when, how, why, and applying our methodology can help builders to better understand their systems and end-users. We recruited 3 teams of professional AutoML builders. The teams prepared their own systems and let 41 end-users use the systems. Using our methodology, we visualized end-users' behavioral and attitudinal data and distributed the results to the teams. We analyzed the results in two directions: what types of novel insights the AutoML builders learned from end-users, and (2) how the evaluation methodology helped the builders to understand workflows and the effectiveness of their systems. Our findings suggest new insights explaining future design opportunities in the AutoML domain as well as how using our methodology helped the builders to determine insights and let them draw concrete directions for improving their systems.
△ Less
Submitted 1 September, 2020;
originally announced September 2020.
-
The Effect of Dipole from $γ$-AgI Substrates on Heterogeneous Ice Nucleation
Authors:
Hao Lu,
Quanming Xu,
Chaohong Wang,
Jianyang Wu,
Rongdun Hong,
Xiang-Yang Liu,
Zhisen Zhang
Abstract:
Heterogeneous ice nucleation is one of the most common and important process in the physical environment. AgI has been proved to be an effective ice nucleating agent in the process of ice nucleation. However, the microscopic mechanism of AgI in heterogeneous ice nucleation has not been fully understood. Molecular dynamics simulations are applied to investigate the ability of which kinds of $γ$-AgI…
▽ More
Heterogeneous ice nucleation is one of the most common and important process in the physical environment. AgI has been proved to be an effective ice nucleating agent in the process of ice nucleation. However, the microscopic mechanism of AgI in heterogeneous ice nucleation has not been fully understood. Molecular dynamics simulations are applied to investigate the ability of which kinds of $γ$-AgI substrate can promote ice nucleation by changing the dipole of $γ$-AgI on the substrate, we conclude that the dipole of $γ$-AgI on the substrate can affect the conformation of ice nucleation. The surface ions with positive charge on the substrate may promote ice nucleation, while there is no ice nucleation founded on the surface ions with negative charge. $γ$-AgI substrates affect ice nucleation through adjust the orientations of water molecules near the surfaces.
△ Less
Submitted 23 August, 2020;
originally announced August 2020.
-
RGCF: Refined Graph Convolution Collaborative Filtering with concise and expressive embedding
Authors:
Kang Liu,
Feng Xue,
Richang Hong
Abstract:
Graph Convolution Network (GCN) has attracted significant attention and become the most popular method for learning graph representations. In recent years, many efforts have been focused on integrating GCN into the recommender tasks and have made remarkable progress. At its core is to explicitly capture high-order connectivities between the nodes in user-item bipartite graph. However, we theoretic…
▽ More
Graph Convolution Network (GCN) has attracted significant attention and become the most popular method for learning graph representations. In recent years, many efforts have been focused on integrating GCN into the recommender tasks and have made remarkable progress. At its core is to explicitly capture high-order connectivities between the nodes in user-item bipartite graph. However, we theoretically and empirically find an inherent drawback existed in these GCN-based recommendation methods, where GCN is directly applied to aggregate neighboring nodes will introduce noise and information redundancy. Consequently, the these models' capability of capturing high-order connectivities among different nodes is limited, leading to suboptimal performance of the recommender tasks. The main reason is that the the nonlinear network layer inside GCN structure is not suitable for extracting non-sematic features(such as one-hot ID feature) in the collaborative filtering scenarios. In this work, we develop a new GCN-based Collaborative Filtering model, named Refined Graph convolution Collaborative Filtering(RGCF), where the construction of the embeddings of users (items) are delicately redesigned from several aspects during the aggregation on the graph. Compared to the state-of-the-art GCN-based recommendation, RGCF is more capable for capturing the implicit high-order connectivities inside the graph and the resultant vector representations are more expressive. We conduct extensive experiments on three public million-size datasets, demonstrating that our RGCF significantly outperforms state-of-the-art models. We release our code at https://github.com/hfutmars/RGCF.
△ Less
Submitted 11 July, 2020; v1 submitted 7 July, 2020;
originally announced July 2020.
-
Representation via Representations: Domain Generalization via Adversarially Learned Invariant Representations
Authors:
Zhun Deng,
Frances Ding,
Cynthia Dwork,
Rachel Hong,
Giovanni Parmigiani,
Prasad Patil,
Pragya Sur
Abstract:
We investigate the power of censoring techniques, first developed for learning {\em fair representations}, to address domain generalization. We examine {\em adversarial} censoring techniques for learning invariant representations from multiple "studies" (or domains), where each study is drawn according to a distribution on domains. The map** is used at test time to classify instances from a new…
▽ More
We investigate the power of censoring techniques, first developed for learning {\em fair representations}, to address domain generalization. We examine {\em adversarial} censoring techniques for learning invariant representations from multiple "studies" (or domains), where each study is drawn according to a distribution on domains. The map** is used at test time to classify instances from a new domain. In many contexts, such as medical forecasting, domain generalization from studies in populous areas (where data are plentiful), to geographically remote populations (for which no training data exist) provides fairness of a different flavor, not anticipated in previous work on algorithmic fairness.
We study an adversarial loss function for $k$ domains and precisely characterize its limiting behavior as $k$ grows, formalizing and proving the intuition, backed by experiments, that observing data from a larger number of domains helps. The limiting results are accompanied by non-asymptotic learning-theoretic bounds. Furthermore, we obtain sufficient conditions for good worst-case prediction performance of our algorithm on previously unseen domains. Finally, we decompose our map**s into two components and provide a complete characterization of invariance in terms of this decomposition. To our knowledge, our results provide the first formal guarantees of these kinds for adversarial invariant domain generalization.
△ Less
Submitted 19 June, 2020;
originally announced June 2020.
-
Joint Item Recommendation and Attribute Inference: An Adaptive Graph Convolutional Network Approach
Authors:
Le Wu,
Yonghui Yang,
Kun Zhang,
Richang Hong,
Yanjie Fu,
Meng Wang
Abstract:
In many recommender systems, users and items are associated with attributes, and users show preferences to items. The attribute information describes users'(items') characteristics and has a wide range of applications, such as user profiling, item annotation, and feature-enhanced recommendation. As annotating user (item) attributes is a labor intensive task, the attribute values are often incomple…
▽ More
In many recommender systems, users and items are associated with attributes, and users show preferences to items. The attribute information describes users'(items') characteristics and has a wide range of applications, such as user profiling, item annotation, and feature-enhanced recommendation. As annotating user (item) attributes is a labor intensive task, the attribute values are often incomplete with many missing attribute values. Therefore, item recommendation and attribute inference have become two main tasks in these platforms. Researchers have long converged that user (item) attributes and the preference behavior are highly correlated. Some researchers proposed to leverage one kind of data for the remaining task, and showed to improve performance. Nevertheless, these models either neglected the incompleteness of user (item) attributes or regarded the correlation of the two tasks with simple models, leading to suboptimal performance of these two tasks. To this end, in this paper, we define these two tasks in an attributed user-item bipartite graph, and propose an Adaptive Graph Convolutional Network (AGCN) approach for joint item recommendation and attribute inference. The key idea of AGCN is to iteratively perform two parts: 1) Learning graph embedding parameters with previously learned approximated attribute values to facilitate two tasks; 2) Sending the approximated updated attribute values back to the attributed graph for better graph embedding learning. Therefore, AGCN could adaptively adjust the graph embedding learning parameters by incorporating both the given attributes and the estimated attribute values, in order to provide weakly supervised information to refine the two tasks. Extensive experimental results on three real-world datasets clearly show the effectiveness of the proposed model.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Learning to Transfer Graph Embeddings for Inductive Graph based Recommendation
Authors:
Le Wu,
Yonghui Yang,
Lei Chen,
Defu Lian,
Richang Hong,
Meng Wang
Abstract:
With the increasing availability of videos, how to edit them and present the most interesting parts to users, i.e., video highlight, has become an urgent need with many broad applications. As users'visual preferences are subjective and vary from person to person, previous generalized video highlight extraction models fail to tailor to users' unique preferences. In this paper, we study the problem…
▽ More
With the increasing availability of videos, how to edit them and present the most interesting parts to users, i.e., video highlight, has become an urgent need with many broad applications. As users'visual preferences are subjective and vary from person to person, previous generalized video highlight extraction models fail to tailor to users' unique preferences. In this paper, we study the problem of personalized video highlight recommendation with rich visual content. By dividing each video into non-overlap** segments, we formulate the problem as a personalized segment recommendation task with many new segments in the test stage. The key challenges of this problem lie in: the cold-start users with limited video highlight records in the training data and new segments without any user ratings at the test stage. In this paper, we propose an inductive Graph based Transfer learning framework for personalized video highlight Recommendation (TransGRec). TransGRec is composed of two parts: a graph neural network followed by an item embedding transfer network. Specifically, the graph neural network part exploits the higher-order proximity between users and segments to alleviate the user cold-start problem. The transfer network is designed to approximate the learned item embeddings from graph neural networks by taking each item's visual content as input, in order to tackle the new segment problem in the test phase. We design two detailed implementations of the transfer learning optimization function, and we show how the two parts of TransGRec can be efficiently optimized with different transfer learning optimization functions. Extensive experimental results on a real-world dataset clearly show the effectiveness of our proposed model.
△ Less
Submitted 24 May, 2020;
originally announced May 2020.
-
Real-time Semantic Segmentation via Spatial-detail Guided Context Propagation
Authors:
Shijie Hao,
Yuan Zhou,
Yanrong Guo,
Richang Hong,
Jun Cheng,
Meng Wang
Abstract:
Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision computing tasks, e.g. semantic segmentation, are usually computationally expensive, posing a challenge to the computing systems that are resource-constrained but require fast response speed. Therefore, it is valuable to develop accurate and real-time vision processing models that…
▽ More
Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision computing tasks, e.g. semantic segmentation, are usually computationally expensive, posing a challenge to the computing systems that are resource-constrained but require fast response speed. Therefore, it is valuable to develop accurate and real-time vision processing models that only require limited computational resources. To this end, we propose the Spatial-detail Guided Context Propagation Network (SGCPNet) for achieving real-time semantic segmentation. In SGCPNet, we propose the strategy of spatial-detail guided context propagation. It uses the spatial details of shallow layers to guide the propagation of the low-resolution global contexts, in which the lost spatial information can be effectively reconstructed. In this way, the need for maintaining high-resolution features along the network is freed, therefore largely improving the model efficiency. On the other hand, due to the effective reconstruction of spatial details, the segmentation accuracy can be still preserved. In the experiments, we validate the effectiveness and efficiency of the proposed SGCPNet model. On the Citysacpes dataset, for example, our SGCPNet achieves 69.5% mIoU segmentation accuracy, while its speed reaches 178.5 FPS on 768x1536 images on a GeForce GTX 1080 Ti GPU card. In addition, SGCPNet is very lightweight and only contains 0.61 M parameters.
△ Less
Submitted 19 March, 2022; v1 submitted 22 May, 2020;
originally announced May 2020.
-
Efficient Network Function Backup by Update Piggybacking
Authors:
Kate Ching-Ju Lin,
Ruei-Yong Hong,
Yu-Chee Tseng
Abstract:
Network Function Virtualization (NFV) and Service Function Chaining (SFC) have been widely used to enable flexible and agile network management. To enhance reliability, some research has proposed to deploy backup function instances for prompt recovery when a primary instance fails. While most of the recent studies focus on speeding up recovery, less attention has been paid to the problem of minimi…
▽ More
Network Function Virtualization (NFV) and Service Function Chaining (SFC) have been widely used to enable flexible and agile network management. To enhance reliability, some research has proposed to deploy backup function instances for prompt recovery when a primary instance fails. While most of the recent studies focus on speeding up recovery, less attention has been paid to the problem of minimizing the state update cost. In this work, we present PiggyBackup (Piggyback-based Backup), an efficient backup instance deployment and update protocol. Our key idea is to reuse the existing service chains traversing through servers in a network to help piggyback the update information. By doing this, we eliminate the header overhead and reduce the amount of update traffic significantly. To realize such a piggyback-based update more efficiently, we investigate the backup instance deployment and chain selection problems to enhance piggybacking opportunities and reduce the forwarding hop counts with explicit consideration of the distribution of service chains. Our simulation results show that PiggyBackup reduces the average overall update overhead by 47.65% and 39.56%, respectively, in a fat-tree topology as compared to random deployment and shortest path based deployment.
△ Less
Submitted 15 May, 2020;
originally announced May 2020.