-
Nuclear spin relaxation rate of nonunitary Dirac and Weyl superconductors
Authors:
Koki Maeno,
Yuki Kawaguchi,
Yasuhiro Asano,
Shingo Kobayashi
Abstract:
Nonunitary superconductivity has attracted renewed interest as a novel gapless phase of matter. In this study, we investigate the superconducting gap structure of nonunitary odd-parity chiral pairing states in a superconductor involving strong spin-orbit interactions. By applying a group theoretical classification of chiral states in terms of discrete rotation symmetry, we categorized all possible…
▽ More
Nonunitary superconductivity has attracted renewed interest as a novel gapless phase of matter. In this study, we investigate the superconducting gap structure of nonunitary odd-parity chiral pairing states in a superconductor involving strong spin-orbit interactions. By applying a group theoretical classification of chiral states in terms of discrete rotation symmetry, we categorized all possible point-nodal gap structures in nonunitary chiral states into four types in terms of the topological number of nodes and node positions relative to the rotation axis. In addition to conventional Dirac and Weyl point nodes, we identify a novel type of Dirac point node unique to nonunitary chiral superconducting states. The node type can be identified experimentally based on the temperature dependence of the nuclear magnetic resonance longitudinal relaxation rate. The implication of our results for a nonunitary odd-parity superconductor in UTe$_2$ is also discussed.
△ Less
Submitted 6 November, 2022;
originally announced November 2022.
-
VTC: Improving Video-Text Retrieval with User Comments
Authors:
Laura Hanu,
James Thewlis,
Yuki M. Asano,
Christian Rupprecht
Abstract:
Multi-modal retrieval is an important problem for many applications, such as recommendation and search. Current benchmarks and even datasets are often manually constructed and consist of mostly clean samples where all modalities are well-correlated with the content. Thus, current video-text retrieval literature largely focuses on video titles or audio transcripts, while ignoring user comments, sin…
▽ More
Multi-modal retrieval is an important problem for many applications, such as recommendation and search. Current benchmarks and even datasets are often manually constructed and consist of mostly clean samples where all modalities are well-correlated with the content. Thus, current video-text retrieval literature largely focuses on video titles or audio transcripts, while ignoring user comments, since users often tend to discuss topics only vaguely related to the video. Despite the ubiquity of user comments online, there is currently no multi-modal representation learning datasets that includes comments. In this paper, we a) introduce a new dataset of videos, titles and comments; b) present an attention-based mechanism that allows the model to learn from sometimes irrelevant data such as comments; c) show that by using comments, our method is able to learn better, more contextualised, representations for image, video and audio representations. Project page: https://unitaryai.github.io/vtc-paper.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
Prompt Generation Networks for Input-based Adaptation of Frozen Vision Transformers
Authors:
Jochem Loedeman,
Maarten C. Stol,
Tengda Han,
Yuki M. Asano
Abstract:
With the introduction of the transformer architecture in computer vision, increasing model scale has been demonstrated as a clear path to achieving performance and robustness gains. However, with model parameter counts reaching the billions, classical finetuning approaches are becoming increasingly limiting and even unfeasible when models become hosted as inference APIs, as in NLP. To this end, vi…
▽ More
With the introduction of the transformer architecture in computer vision, increasing model scale has been demonstrated as a clear path to achieving performance and robustness gains. However, with model parameter counts reaching the billions, classical finetuning approaches are becoming increasingly limiting and even unfeasible when models become hosted as inference APIs, as in NLP. To this end, visual prompt learning, whereby a model is adapted by learning additional inputs, has emerged as a potential solution for adapting frozen and cloud-hosted models: During inference, this neither requires access to the internals of models' forward pass function, nor requires any post-processing. In this work, we propose the Prompt Generation Network (PGN) that generates high performing, input-dependent prompts by sampling from an end-to-end learned library of tokens. We further introduce the "prompt inversion" trick, with which PGNs can be efficiently trained in a latent space but deployed as strictly input-only prompts for inference. We show the PGN is effective in adapting pre-trained models to various new datasets: It surpasses previous methods by a large margin on 12/12 datasets and even outperforms full-finetuning on 5/12, while requiring 100x less parameters.
△ Less
Submitted 19 April, 2023; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Self-Guided Diffusion Models
Authors:
Vincent Tao Hu,
David W Zhang,
Yuki M. Asano,
Gertjan J. Burghouts,
Cees G. M. Snoek
Abstract:
Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is thus dependent on their availability, correctness and unbiasedness. In this paper, we eliminate the need for such annotation by instead leveraging the flexibili…
▽ More
Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process. However, guidance requires a large amount of image-annotation pairs for training and is thus dependent on their availability, correctness and unbiasedness. In this paper, we eliminate the need for such annotation by instead leveraging the flexibility of self-supervision signals to design a framework for self-guided diffusion models. By leveraging a feature extraction function and a self-annotation function, our method provides guidance signals at various image granularities: from the level of holistic images to object boxes and even segmentation masks. Our experiments on single-label and multi-label image datasets demonstrate that self-labeled guidance always outperforms diffusion models without guidance and may even surpass guidance based on ground-truth labels, especially on unbalanced data. When equipped with self-supervised box or mask proposals, our method further generates visually diverse yet semantically consistent images, without the need for any class, box, or segment label annotation. Self-guided diffusion is simple, flexible and expected to profit from deployment at scale. Source code will be at: https://taohu.me/sgdm/
△ Less
Submitted 27 November, 2023; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Comparison of Lexical Alignment with a Teachable Robot in Human-Robot and Human-Human-Robot Interactions
Authors:
Yuya Asano,
Diane Litman,
Mingzhi Yu,
Nikki Lobczowski,
Timothy Nokes-Malach,
Adriana Kovashka,
Erin Walker
Abstract:
Speakers build rapport in the process of aligning conversational behaviors with each other. Rapport engendered with a teachable agent while instructing domain material has been shown to promote learning. Past work on lexical alignment in the field of education suffers from limitations in both the measures used to quantify alignment and the types of interactions in which alignment with agents has b…
▽ More
Speakers build rapport in the process of aligning conversational behaviors with each other. Rapport engendered with a teachable agent while instructing domain material has been shown to promote learning. Past work on lexical alignment in the field of education suffers from limitations in both the measures used to quantify alignment and the types of interactions in which alignment with agents has been studied. In this paper, we apply alignment measures based on a data-driven notion of shared expressions (possibly composed of multiple words) and compare alignment in one-on-one human-robot (H-R) interactions with the H-R portions of collaborative human-human-robot (H-H-R) interactions. We find that students in the H-R setting align with a teachable robot more than in the H-H-R setting and that the relationship between lexical alignment and rapport is more complex than what is predicted by previous theoretical and empirical work.
△ Less
Submitted 23 September, 2022;
originally announced September 2022.
-
Measuring the Interpretability of Unsupervised Representations via Quantized Reverse Probing
Authors:
Iro Laina,
Yuki M. Asano,
Andrea Vedaldi
Abstract:
Self-supervised visual representation learning has recently attracted significant research interest. While a common way to evaluate self-supervised representations is through transfer to various downstream tasks, we instead investigate the problem of measuring their interpretability, i.e. understanding the semantics encoded in raw representations. We formulate the latter as estimating the mutual i…
▽ More
Self-supervised visual representation learning has recently attracted significant research interest. While a common way to evaluate self-supervised representations is through transfer to various downstream tasks, we instead investigate the problem of measuring their interpretability, i.e. understanding the semantics encoded in raw representations. We formulate the latter as estimating the mutual information between the representation and a space of manually labelled concepts. To quantify this we introduce a decoding bottleneck: information must be captured by simple predictors, map** concepts to clusters in representation space. This approach, which we call reverse linear probing, provides a single number sensitive to the semanticity of the representation. This measure is also able to detect when the representation contains combinations of concepts (e.g., "red apple") instead of just individual attributes ("red" and "apple" independently). Finally, we propose to use supervised classifiers to automatically label large datasets in order to enrich the space of concepts used for probing. We use our method to evaluate a large number of self-supervised representations, ranking them by interpretability, highlight the differences that emerge compared to the standard evaluation with linear probes and discuss several qualitative insights. Code at: {\scriptsize{\url{https://github.com/iro-cp/ssl-qrp}}}.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Spin Susceptibility of a J=3/2 Superconductor
Authors:
Dakyeong Kim,
Takumi Sato,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
We discuss the spin susceptibility of superconductors in which a Cooper pair consists of two electrons having the angular momentum J=3/2 due to strong spin-orbit interactions. The susceptibility is calculated analytically for pseudospin quintet states in a cubic superconductor within the linear response to a Zeeman field. The susceptibility for $A_{1g}$ symmetry states is isotropic in real space.…
▽ More
We discuss the spin susceptibility of superconductors in which a Cooper pair consists of two electrons having the angular momentum J=3/2 due to strong spin-orbit interactions. The susceptibility is calculated analytically for pseudospin quintet states in a cubic superconductor within the linear response to a Zeeman field. The susceptibility for $A_{1g}$ symmetry states is isotropic in real space. For $E_g$ and $T_{2g}$ symmetry cases, the results depend sensitively on choices of order parameter. The susceptibility is isotropic for a $T_{2g}$ symmetry state, whereas it becomes anisotropic for an $E_{g} $ symmetry state. We also find in a $T_{2g}$ state that the susceptibility tensor has off-diagonal elements.
△ Less
Submitted 6 October, 2022; v1 submitted 29 June, 2022;
originally announced June 2022.
-
Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems
Authors:
Phillip Lippe,
Sara Magliacane,
Sindy Löwe,
Yuki M. Asano,
Taco Cohen,
Efstratios Gavves
Abstract:
Causal representation learning is the task of identifying the underlying causal variables and their relations from high-dimensional observations, such as images. Recent work has shown that one can reconstruct the causal variables from temporal sequences of observations under the assumption that there are no instantaneous causal relations between them. In practical applications, however, our measur…
▽ More
Causal representation learning is the task of identifying the underlying causal variables and their relations from high-dimensional observations, such as images. Recent work has shown that one can reconstruct the causal variables from temporal sequences of observations under the assumption that there are no instantaneous causal relations between them. In practical applications, however, our measurement or frame rate might be slower than many of the causal effects. This effectively creates "instantaneous" effects and invalidates previous identifiability results. To address this issue, we propose iCITRIS, a causal representation learning method that allows for instantaneous effects in intervened temporal sequences when intervention targets can be observed, e.g., as actions of an agent. iCITRIS identifies the potentially multidimensional causal variables from temporal observations, while simultaneously using a differentiable causal discovery method to learn their causal graph. In experiments on three datasets of interactive systems, iCITRIS accurately identifies the causal variables and their causal graph.
△ Less
Submitted 7 March, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements
Authors:
Conrad Borchers,
Dalia Sara Gala,
Benjamin Gilburt,
Eduard Oravkin,
Wilfried Bounsi,
Yuki M. Asano,
Hannah Rose Kirk
Abstract:
The growing capability and availability of generative language models has enabled a wide range of new downstream tasks. Academic research has identified, quantified and mitigated biases present in language models but is rarely tailored to downstream tasks where wider impact on individuals and society can be felt. In this work, we leverage one popular generative language model, GPT-3, with the goal…
▽ More
The growing capability and availability of generative language models has enabled a wide range of new downstream tasks. Academic research has identified, quantified and mitigated biases present in language models but is rarely tailored to downstream tasks where wider impact on individuals and society can be felt. In this work, we leverage one popular generative language model, GPT-3, with the goal of writing unbiased and realistic job advertisements. We first assess the bias and realism of zero-shot generated advertisements and compare them to real-world advertisements. We then evaluate prompt-engineering and fine-tuning as debiasing methods. We find that prompt-engineering with diversity-encouraging prompts gives no significant improvement to bias, nor realism. Conversely, fine-tuning, especially on unbiased real advertisements, can improve realism and reduce bias.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Self-Supervised Learning of Object Parts for Semantic Segmentation
Authors:
Adrian Ziegler,
Yuki M. Asano
Abstract:
Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from this trend as they require spatially-diverse representations. However, learning dense representations is challenging, as in the unsupervised context it is not…
▽ More
Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from this trend as they require spatially-diverse representations. However, learning dense representations is challenging, as in the unsupervised context it is not clear how to guide the model to learn representations that correspond to various potential object categories. In this paper, we argue that self-supervised learning of object parts is a solution to this issue. Object parts are generalizable: they are a priori independent of an object definition, but can be grouped to form objects a posteriori. To this end, we leverage the recently proposed Vision Transformer's capability of attending to objects and combine it with a spatially dense clustering task for fine-tuning the spatial tokens. Our method surpasses the state-of-the-art on three semantic segmentation benchmarks by 17%-3%, showing that our representations are versatile under various object definitions. Finally, we extend this to fully unsupervised segmentation - which refrains completely from using label information even at test-time - and demonstrate that a simple method for automatically merging discovered object parts based on community detection yields substantial gains.
△ Less
Submitted 20 June, 2022; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Quasiparticle spectrum in mesoscopic superconducting junctions with weak magnetization
Authors:
Shu-Ichiro Suzuki,
Alexander A. Golubov,
Yasuhiro Asano,
Yukio Tanaka
Abstract:
We theoretically investigate the effects of the weak magnetization on the local density of states of mesoscopic proximity structures, where two superconducting terminals are attached to a side surface of the diffusive ferromagnet wire with a phase difference. When there is no phase difference, the local density of states is significantly modified by the magnetization in both spin-singlet $s$-wave…
▽ More
We theoretically investigate the effects of the weak magnetization on the local density of states of mesoscopic proximity structures, where two superconducting terminals are attached to a side surface of the diffusive ferromagnet wire with a phase difference. When there is no phase difference, the local density of states is significantly modified by the magnetization in both spin-singlet $s$-wave and spin-triplet $p$-wave cases. When the phase difference is $π$, the local density of stets is less modified by the magnetization compared with the in-phase case because of the destructive interference of Cooper pairs.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Less than Few: Self-Shot Video Instance Segmentation
Authors:
Pengwan Yang,
Yuki M. Asano,
Pascal Mettes,
Cees G. M. Snoek
Abstract:
The goal of this paper is to bypass the need for labelled examples in few-shot video understanding at run time. While proven effective, in many practical video settings even labelling a few examples appears unrealistic. This is especially true as the level of details in spatio-temporal video understanding and with it, the complexity of annotations continues to increase. Rather than performing few-…
▽ More
The goal of this paper is to bypass the need for labelled examples in few-shot video understanding at run time. While proven effective, in many practical video settings even labelling a few examples appears unrealistic. This is especially true as the level of details in spatio-temporal video understanding and with it, the complexity of annotations continues to increase. Rather than performing few-shot learning with a human oracle to provide a few densely labelled support videos, we propose to automatically learn to find appropriate support videos given a query. We call this self-shot learning and we outline a simple self-supervised learning method to generate an embedding space well-suited for unsupervised retrieval of relevant samples. To showcase this novel setting, we tackle, for the first time, video instance segmentation in a self-shot (and few-shot) setting, where the goal is to segment instances at the pixel-level across the spatial and temporal domains. We provide strong baseline performances that utilize a novel transformer-based model and show that self-shot learning can even surpass few-shot and can be positively combined for further performance gains. Experiments on new benchmarks show that our approach achieves strong performance, is competitive to oracle support in some settings, scales to large unlabelled video collections, and can be combined in a semi-supervised setting.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
An odd-frequency Cooper pair around a magnetic impurity
Authors:
Shu-Ichiro Suzuki,
Takumi Sato,
Yasuhiro Asano
Abstract:
The Yu-Shiba-Rusinov (YSR) state appears as a bound state of a quasiparticle at a magnetic atom embedded in a superconductor. We discuss why the YSR state has energy below the superconducting gap and why the pair potential changes the sign at the magnetic atom. Although a magnetic atom in a superconductor has been considered as a pair breaker since 1960s, we propose an alternative physical picture…
▽ More
The Yu-Shiba-Rusinov (YSR) state appears as a bound state of a quasiparticle at a magnetic atom embedded in a superconductor. We discuss why the YSR state has energy below the superconducting gap and why the pair potential changes the sign at the magnetic atom. Although a magnetic atom in a superconductor has been considered as a pair breaker since 1960s, we propose an alternative physical picture to explain these reasons. We show that a magnetic atom converts a spin-singlet s-wave Cooper pair into an odd-frequency pair rather than breaking it. The odd-frequency pairing correlations always coexist with the quasiparticle states below the gap. The YSR state is an example of such a subgap quasiparticle state. The paramagnetic property of an odd-frequency pair explains the sign change of the pair potential at a magnetic atom and the decrease of superconducting transition temperature in the presence of many magnetic impurities.
△ Less
Submitted 11 August, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
CITRIS: Causal Identifiability from Temporal Intervened Sequences
Authors:
Phillip Lippe,
Sara Magliacane,
Sindy Löwe,
Yuki M. Asano,
Taco Cohen,
Efstratios Gavves
Abstract:
Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recen…
▽ More
Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recent literature, CITRIS exploits temporality and observing intervention targets to identify scalar and multidimensional causal factors, such as 3D rotation angles. Furthermore, by introducing a normalizing flow, CITRIS can be easily extended to leverage and disentangle representations obtained by already pretrained autoencoders. Extending previous results on scalar causal factors, we prove identifiability in a more general setting, in which only some components of a causal factor are affected by interventions. In experiments on 3D rendered image sequences, CITRIS outperforms previous methods on recovering the underlying causal variables. Moreover, using pretrained autoencoders, CITRIS can even generalize to unseen instantiations of causal factors, opening future research areas in sim-to-real generalization for causal representation learning.
△ Less
Submitted 15 June, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Spherically symmetric solutions of higher-spin gravity in the IKKT matrix model
Authors:
Yuhma Asano,
Harold C. Steinacker
Abstract:
We present a systematic study of spherically symmetric vacuum solutions of the IKKT matrix model, within the framework of semi-classical covariant quantum geometries. All asymptotically flat solutions of the equations of motion of the frame are found explicitly. They reproduce the linearized Schwarzschild geometry for large $r$ but deviate from it at the non-linear level, and include contributions…
▽ More
We present a systematic study of spherically symmetric vacuum solutions of the IKKT matrix model, within the framework of semi-classical covariant quantum geometries. All asymptotically flat solutions of the equations of motion of the frame are found explicitly. They reproduce the linearized Schwarzschild geometry for large $r$ but deviate from it at the non-linear level, and include contributions from dilaton and axion. They are pertinent to the pre-gravity theory arising on classical brane solutions within the classical matrix model, before taking into account the Einstein-Hilbert term induced by quantum effects. We also address the problem of reconstructing matrix configurations corresponding to some given frame, and show that this problem can always be solved at the geometrical level of the underlying higher spin theory, ignoring possible higher spin modes.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
SPring-8 LEPS2 beamline: A facility to produce a multi-GeV photon beam via laser Compton scattering
Authors:
N. Muramatsu,
M. Yosoi,
T. Yorita,
Y. Ohashi,
J. K. Ahn,
S. Ajimura,
Y. Asano,
W. C. Chang,
J. Y. Chen,
S. Date,
T. Gogami,
H. Hamano,
T. Hashimoto,
T. Hiraiwa,
T. Hotta,
T. Ishikawa,
Y. Kasamatsu,
H. Katsuragawa,
R. Kobayakawa,
H. Kohri,
S. Masumoto,
Y. Matsumura,
M. Miyabe,
K. Mizutani,
Y. Morino
, et al. (26 additional authors not shown)
Abstract:
We have constructed a new laser-Compton-scattering facility, called the LEPS2 beamline, at the 8-GeV electron storage ring, SPring-8. This facility provides a linearly polarized photon beam in a tagged energy range of 1.3--2.4 GeV. Thanks to a small divergence of the low-emittance storage-ring electrons, the tagged photon beam has a size (sigma) suppressed to about 4 mm even after it travels about…
▽ More
We have constructed a new laser-Compton-scattering facility, called the LEPS2 beamline, at the 8-GeV electron storage ring, SPring-8. This facility provides a linearly polarized photon beam in a tagged energy range of 1.3--2.4 GeV. Thanks to a small divergence of the low-emittance storage-ring electrons, the tagged photon beam has a size (sigma) suppressed to about 4 mm even after it travels about 130 m to the experimental building that is independent of the storage ring building and contains large detector systems. This beamline is designed to achieve a photon beam intensity higher than that of the first laser-Compton-scattering beamline at SPring-8 by adopting the simultaneous injection of up to four high-power laser beams and increasing a transmittance for the long photon-beam path up to about 77%. The new beamline is under operation for hadron photoproduction experiments.
△ Less
Submitted 14 December, 2021;
originally announced December 2021.
-
Effects of gas-liquid phase transitions on soundwave propagation: A molecular dynamics study
Authors:
Yuta Asano,
Hiroshi Watanabe,
Hiroshi Noguchi
Abstract:
To understand ultrasonic cavitation, it is imperative to analyze the effects of the gas-liquid phase transitions on soundwave propagation. Since current methods based on fluid dynamics offer limited information, it is imperative to carry out further research on this phenomenon. In this study, we investigated the effects of cavitation and near-critical fluid on soundwaves using the molecular dynami…
▽ More
To understand ultrasonic cavitation, it is imperative to analyze the effects of the gas-liquid phase transitions on soundwave propagation. Since current methods based on fluid dynamics offer limited information, it is imperative to carry out further research on this phenomenon. In this study, we investigated the effects of cavitation and near-critical fluid on soundwaves using the molecular dynamics (MD) simulations of Lennard-Jones fluids. In the first-order liquid-to-gas transition region (far from the critical point), the waveform does not continuously change with the temperature and source oscillation amplitude owing to the discontinuous change in the density due to the phase transition. Meanwhile, in the continuous transition region (crossing near the critical point), the waveform continuously varies with temperature regardless of the amplitudes because phase separation is not involved in this region. The density fluctuations increase as the amplitude increases; however, it does not affect the waveform. Thus, we clarified that the first-order and continuous transitions have different impacts on soundwaves. Moreover, we determined the acoustic characteristics, such as attenuation and nonlinear parameters, by comparing the results of the numerical solution of Burgers' equation and MD simulation. Burgers' equation clearly describes the soundwave phenomenon until phase separation or bubble formation occurs. In the continuous transition region, the attenuation parameters tend to diverge, reflecting a critical anomaly trend. We observed the bubbles move forward with the oscillation of their radii owing to their interaction with the soundwaves. This is the first direct observation of the interaction using MD simulations.
△ Less
Submitted 13 December, 2021;
originally announced December 2021.
-
The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image
Authors:
Yuki M. Asano,
Aaqib Saeed
Abstract:
What can neural networks learn about the visual world when provided with only a single image as input? While any image obviously cannot contain the multitudes of all existing objects, scenes and lighting conditions - within the space of all 256^(3x224x224) possible 224-sized square images, it might still provide a strong prior for natural images. To analyze this `augmented image prior' hypothesis,…
▽ More
What can neural networks learn about the visual world when provided with only a single image as input? While any image obviously cannot contain the multitudes of all existing objects, scenes and lighting conditions - within the space of all 256^(3x224x224) possible 224-sized square images, it might still provide a strong prior for natural images. To analyze this `augmented image prior' hypothesis, we develop a simple framework for training neural networks from scratch using a single image and augmentations using knowledge distillation from a supervised pretrained teacher. With this, we find the answer to the above question to be: `surprisingly, a lot'. In quantitative terms, we find accuracies of 94%/74% on CIFAR-10/100, 69% on ImageNet, and by extending this method to video and audio, 51% on Kinetics-400 and 84% on SpeechCommands. In extensive analyses spanning 13 datasets, we disentangle the effect of augmentations, choice of data and network architectures and also provide qualitative evaluations that include lucid `panda neurons' in networks that have never even seen one.
△ Less
Submitted 24 January, 2023; v1 submitted 1 December, 2021;
originally announced December 2021.
-
Flavor number dependence of QCD at finite density by the complex Langevin method
Authors:
Yusuke Namekawa,
Yuhma Asano,
Yuta Ito,
Takashi Kaneko,
Hideo Matsufuru,
Jun Nishimura,
Asato Tsuchiya,
Shoichiro Tsutsui,
Takeru Yokota
Abstract:
We discuss the flavor number dependence of QCD at low temperature and high density by the complex Langevin method. In our previous work, the complex Langevin method is confirmed to satisfy the criterion for correct convergence in certain regions, such as $μ_{\rm q} / T = 5.2-7.2$ on $8^3 \times 16$ and $μ_{\rm q} / T = 1.6-9.6$ on $16^3 \times 32$ using $N_{\rm f} = 4$ staggered fermion at…
▽ More
We discuss the flavor number dependence of QCD at low temperature and high density by the complex Langevin method. In our previous work, the complex Langevin method is confirmed to satisfy the criterion for correct convergence in certain regions, such as $μ_{\rm q} / T = 5.2-7.2$ on $8^3 \times 16$ and $μ_{\rm q} / T = 1.6-9.6$ on $16^3 \times 32$ using $N_{\rm f} = 4$ staggered fermion at $β= 5.7$. We extend this study to more realistic flavor cases, $N_{\rm f} = 2, 2 + 1, 3$, using Wilson fermions. We present the flavor number dependence of the validity regions of the complex Langevin method and the quark number.
△ Less
Submitted 30 November, 2021;
originally announced December 2021.
-
Color superconductivity in a small box: a complex Langevin study
Authors:
Shoichiro Tsutsui,
Yuhma Asano,
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Asato Tsuchiya,
Takeru Yokota
Abstract:
It is expected that the color superconductivity (CSC) phase appears in QCD at low temperature and high density. On the basis of the lattice perturbation theory, a possible parameter region in which the CSC occurs has been predicted. In this work, we perform complex Langevin simulation on an $8^3\times 128$ lattice using four-flavor staggered fermions. We find, in particular, that the quark number…
▽ More
It is expected that the color superconductivity (CSC) phase appears in QCD at low temperature and high density. On the basis of the lattice perturbation theory, a possible parameter region in which the CSC occurs has been predicted. In this work, we perform complex Langevin simulation on an $8^3\times 128$ lattice using four-flavor staggered fermions. We find, in particular, that the quark number has plateaux with respect to the chemical potential similar to our previous study, indicating the formation of the Fermi sphere. A diquark-antidiquark operator, which is an order parameter of color superconductivity, is formulated on the lattice using the U(1) noise. Our result for this operator is found to fluctuate violently when the Fermi surface coincides with the energy levels of quarks. We also discuss partial restoration of the chiral symmetry at high density.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
Perturbative predictions for color superconductivity on the lattice
Authors:
Takeru Yokota,
Yuhma Asano,
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Asato Tsuchiya,
Shoichiro Tsutsui
Abstract:
We develop a new method to investigate color superconductivity (CSC) on the lattice based on the Thouless criterion, which amounts to solving the linearized gap equation without imposing any ansatz on the structure of the Cooper pairs. We perform explicit calculations at the one-loop level with the staggered fermions on a $8^3 \times 128$ lattice and the Wilson fermions on a $4^3 \times 128$ latti…
▽ More
We develop a new method to investigate color superconductivity (CSC) on the lattice based on the Thouless criterion, which amounts to solving the linearized gap equation without imposing any ansatz on the structure of the Cooper pairs. We perform explicit calculations at the one-loop level with the staggered fermions on a $8^3 \times 128$ lattice and the Wilson fermions on a $4^3 \times 128$ lattice, which enables us to obtain the critical $β(=6/g^2)$ as a function of the quark chemical potential $μ$, below which the CSC phase is expected to appear. The obtained critical $β$ has sharp peaks at the values of $μ$ corresponding to the discretized energy levels of quarks similarly to what was observed in previous studies on simplified effective models. From the solution to the linearized gap equation, one can read off the flavor and spatial structures of the Cooper pairs at the critical $β$. In the case of massless staggered fermion, in particular, we find that the chiral $\mathrm{U}(1)$ symmetry of the staggered fermions is spontaneously broken by the condensation of the Cooper pairs.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
PASS: An ImageNet replacement for self-supervised pretraining without humans
Authors:
Yuki M. Asano,
Christian Rupprecht,
Andrew Zisserman,
Andrea Vedaldi
Abstract:
Computer vision has long relied on ImageNet and other large datasets of images sampled from the Internet for pretraining models. However, these datasets have ethical and technical shortcomings, such as containing personal information taken without consent, unclear license usage, biases, and, in some cases, even problematic image content. On the other hand, state-of-the-art pretraining is nowadays…
▽ More
Computer vision has long relied on ImageNet and other large datasets of images sampled from the Internet for pretraining models. However, these datasets have ethical and technical shortcomings, such as containing personal information taken without consent, unclear license usage, biases, and, in some cases, even problematic image content. On the other hand, state-of-the-art pretraining is nowadays obtained with unsupervised methods, meaning that labelled datasets such as ImageNet may not be necessary, or perhaps not even optimal, for model pretraining. We thus propose an unlabelled dataset PASS: Pictures without humAns for Self-Supervision. PASS only contains images with CC-BY license and complete attribution metadata, addressing the copyright issue. Most importantly, it contains no images of people at all, and also avoids other types of images that are problematic for data protection or ethics. We show that PASS can be used for pretraining with methods such as MoCo-v2, SwAV and DINO. In the transfer learning setting, it yields similar downstream performances to ImageNet pretraining even on tasks that involve humans, such as human pose estimation. PASS does not make existing datasets obsolete, as for instance it is insufficient for benchmarking. However, it shows that model pretraining is often possible while using safer data, and it also provides the basis for a more robust evaluation of pretraining methods.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Quasiparticle on Bogoliubov Fermi Surface and Odd-Frequency Cooper Pair
Authors:
Dakyeong Kim,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
We discuss a close relationship between a quasiparticle on the Bogoliubov Fermi surface and an odd-frequency Cooper pair in a superconductor in which a Cooper pair consisting of two j=3/2 electrons forms the pseudospin-quintet even-parity pair potential with breaking time-reversal symmetry. It has been established in a single-band superconductor that a low-energy quasiparticle below the supercondu…
▽ More
We discuss a close relationship between a quasiparticle on the Bogoliubov Fermi surface and an odd-frequency Cooper pair in a superconductor in which a Cooper pair consisting of two j=3/2 electrons forms the pseudospin-quintet even-parity pair potential with breaking time-reversal symmetry. It has been established in a single-band superconductor that a low-energy quasiparticle below the superconducting gap accompanies an odd-frequency Cooper pair. In this paper, we show that an odd-frequency pair characterized by chirality coexists with a quasiparticle on the Bogoliubov Fermi surface. The symmetry of odd-frequency Cooper pairs is analyzed in detail by taking realistic pair potentials into account in a cubic superconductor.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
Authors:
Hannah Rose Kirk,
Yennie Jun,
Paulius Rauba,
Gal Wachtel,
Ruining Li,
Xingjian Bai,
Noah Broestl,
Martin Doff-Sotta,
Aleksandar Shtedritski,
Yuki M. Asano
Abstract:
Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful m…
▽ More
Hateful memes pose a unique challenge for current machine learning systems because their message is derived from both text- and visual-modalities. To this effect, Facebook released the Hateful Memes Challenge, a dataset of memes with pre-extracted text captions, but it is unclear whether these synthetic examples generalize to `memes in the wild'. In this paper, we collect hateful and non-hateful memes from Pinterest to evaluate out-of-sample performance on models pre-trained on the Facebook dataset. We find that memes in the wild differ in two key aspects: 1) Captions must be extracted via OCR, injecting noise and diminishing performance of multimodal models, and 2) Memes are more diverse than `traditional memes', including screenshots of conversations or text on a plain background. This paper thus serves as a reality check for the current benchmark of hateful meme detection and its applicability for detecting real world hate.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
Towards Universal Neural Network Potential for Material Discovery Applicable to Arbitrary Combination of 45 Elements
Authors:
So Takamoto,
Chikashi Shinagawa,
Daisuke Motoki,
Kosuke Nakago,
Wenwen Li,
Iori Kurata,
Taku Watanabe,
Yoshihiro Yayama,
Hiroki Iriguchi,
Yusuke Asano,
Tasuku Onodera,
Takafumi Ishii,
Takao Kudo,
Hideki Ono,
Ryohto Sawada,
Ryuichiro Ishitani,
Marc Ong,
Taiki Yamaguchi,
Toshiki Kataoka,
Akihide Hayashi,
Nontawat Charoenphakdee,
Takeshi Ibuka
Abstract:
Computational material discovery is under intense study owing to its ability to explore the vast space of chemical systems. Neural network potentials (NNPs) have been shown to be particularly effective in conducting atomistic simulations for such purposes. However, existing NNPs are generally designed for narrow target materials, making them unsuitable for broader applications in material discover…
▽ More
Computational material discovery is under intense study owing to its ability to explore the vast space of chemical systems. Neural network potentials (NNPs) have been shown to be particularly effective in conducting atomistic simulations for such purposes. However, existing NNPs are generally designed for narrow target materials, making them unsuitable for broader applications in material discovery. To overcome this issue, we have developed a universal NNP called PreFerred Potential (PFP), which is able to handle any combination of 45 elements. Particular emphasis is placed on the datasets, which include a diverse set of virtual structures used to attain the universality. We demonstrated the applicability of PFP in selected domains: lithium diffusion in LiFeSO${}_4$F, molecular adsorption in metal-organic frameworks, an order-disorder transition of Cu-Au alloys, and material discovery for a Fischer-Tropsch catalyst. They showcase the power of PFP, and this technology provides a highly useful tool for material discovery.
△ Less
Submitted 1 April, 2022; v1 submitted 28 June, 2021;
originally announced June 2021.
-
Kee** Your Eye on the Ball: Trajectory Attention in Video Transformers
Authors:
Mandela Patrick,
Dylan Campbell,
Yuki M. Asano,
Ishan Misra,
Florian Metze,
Christoph Feichtenhofer,
Andrea Vedaldi,
João F. Henriques
Abstract:
In video transformers, the time dimension is often treated in the same way as the two spatial dimensions. However, in a scene where objects or the camera may move, a physical point imaged at one location in frame $t$ may be entirely unrelated to what is found at that location in frame $t+k$. These temporal correspondences should be modeled to facilitate learning about dynamic scenes. To this end,…
▽ More
In video transformers, the time dimension is often treated in the same way as the two spatial dimensions. However, in a scene where objects or the camera may move, a physical point imaged at one location in frame $t$ may be entirely unrelated to what is found at that location in frame $t+k$. These temporal correspondences should be modeled to facilitate learning about dynamic scenes. To this end, we propose a new drop-in block for video transformers -- trajectory attention -- that aggregates information along implicitly determined motion paths. We additionally propose a new method to address the quadratic dependence of computation and memory on the input size, which is particularly important for high resolution or long videos. While these ideas are useful in a range of settings, we apply them to the specific task of video action recognition with a transformer model and obtain state-of-the-art results on the Kinetics, Something--Something V2, and Epic-Kitchens datasets. Code and models are available at: https://github.com/facebookresearch/Motionformer
△ Less
Submitted 23 October, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
Multi-view 3D Reconstruction of a Texture-less Smooth Surface of Unknown Generic Reflectance
Authors:
Ziang Cheng,
Hongdong Li,
Yuta Asano,
Yinqiang Zheng,
Imari Sato
Abstract:
Recovering the 3D geometry of a purely texture-less object with generally unknown surface reflectance (e.g. non-Lambertian) is regarded as a challenging task in multi-view reconstruction. The major obstacle revolves around establishing cross-view correspondences where photometric constancy is violated. This paper proposes a simple and practical solution to overcome this challenge based on a co-loc…
▽ More
Recovering the 3D geometry of a purely texture-less object with generally unknown surface reflectance (e.g. non-Lambertian) is regarded as a challenging task in multi-view reconstruction. The major obstacle revolves around establishing cross-view correspondences where photometric constancy is violated. This paper proposes a simple and practical solution to overcome this challenge based on a co-located camera-light scanner device. Unlike existing solutions, we do not explicitly solve for correspondence. Instead, we argue the problem is generally well-posed by multi-view geometrical and photometric constraints, and can be solved from a small number of input views. We formulate the reconstruction task as a joint energy minimization over the surface geometry and reflectance. Despite this energy is highly non-convex, we develop an optimization algorithm that robustly recovers globally optimal shape and reflectance even from a random initialization. Extensive experiments on both simulated and real data have validated our method, and possible future extensions are discussed.
△ Less
Submitted 24 May, 2021;
originally announced May 2021.
-
Effects of polymers on the cavitating flow around a cylinder: A Large-scale molecular dynamics analysis
Authors:
Yuta Asano,
Hiroshi Watanabe,
Hiroshi Noguchi
Abstract:
The cavitation flow of linear-polymer solutions around a cylinder is studied by performing a large-scale molecular dynamics simulation. The addition of polymer chains remarkably suppresses the cavitation. The polymers are stretched into a linear shape near the cylinder and entrained in the vortex behind the cylinder. As the polymers stretch, the elongational viscosity increases, which suppresses t…
▽ More
The cavitation flow of linear-polymer solutions around a cylinder is studied by performing a large-scale molecular dynamics simulation. The addition of polymer chains remarkably suppresses the cavitation. The polymers are stretched into a linear shape near the cylinder and entrained in the vortex behind the cylinder. As the polymers stretch, the elongational viscosity increases, which suppresses the vortex formation. Furthermore, the polymers exhibit an entropic elasticity owing to the stretching. This elastic energy increases the local temperature, which inhibits the cavitation inception. These effects of polymers result in the dramatic suppression of cavitation.
△ Less
Submitted 24 June, 2021; v1 submitted 16 May, 2021;
originally announced May 2021.
-
Self-supervised object detection from audio-visual correspondence
Authors:
Triantafyllos Afouras,
Yuki M. Asano,
Francois Fagan,
Andrea Vedaldi,
Florian Metze
Abstract:
We tackle the problem of learning object detectors without supervision. Differently from weakly-supervised object detection, we do not assume image-level class labels. Instead, we extract a supervisory signal from audio-visual data, using the audio component to "teach" the object detector. While this problem is related to sound source localisation, it is considerably harder because the detector mu…
▽ More
We tackle the problem of learning object detectors without supervision. Differently from weakly-supervised object detection, we do not assume image-level class labels. Instead, we extract a supervisory signal from audio-visual data, using the audio component to "teach" the object detector. While this problem is related to sound source localisation, it is considerably harder because the detector must classify the objects by type, enumerate each instance of the object, and do so even when the object is silent. We tackle this problem by first designing a self-supervised framework with a contrastive objective that jointly learns to classify and localise objects. Then, without using any supervision, we simply use these self-supervised labels and boxes to train an image-based object detector. With this, we outperform previous unsupervised and weakly-supervised detectors for the task of object detection and sound source localization. We also show that we can align this detector to ground-truth classes with as little as one label per pseudo-class, and show how our method can learn to detect generic objects that go beyond instruments, such as airplanes and cats.
△ Less
Submitted 9 July, 2022; v1 submitted 13 April, 2021;
originally announced April 2021.
-
Strong anomalous proximity effect from spin-singlet superconductors
Authors:
Satoshi Ikegaya,
Jaechul Lee,
Andreas P. Schnyder,
Yasuhiro Asano
Abstract:
The proximity effect from a spin-triplet $p_x$-wave superconductor to a dirty normal-metal has been shown to result in various unusual electromagnetic properties, reflecting a cooperative relation between topologically protected zero-energy quasiparticles and odd-frequency Cooper pairs. However, because of a lack of candidate materials for spin-triplet $p_x$-wave superconductors, observing this ef…
▽ More
The proximity effect from a spin-triplet $p_x$-wave superconductor to a dirty normal-metal has been shown to result in various unusual electromagnetic properties, reflecting a cooperative relation between topologically protected zero-energy quasiparticles and odd-frequency Cooper pairs. However, because of a lack of candidate materials for spin-triplet $p_x$-wave superconductors, observing this effect has been difficult. In this paper, we demonstrate that the anomalous proximity effect, which is essentially equivalent to that of a spin-triplet $p_x$-wave superconductor, can occur in a semiconductor/high-$T_c$ cuprate superconductor hybrid device in which two potentials coexist: a spin-singlet $d$-wave pair potential and a spin--orbit coupling potential sustaining the persistent spin-helix state. As a result, we propose an alternative and promising route to observe the anomalous proximity effect related to the profound nature of topologically protected quasiparticles and odd-frequency Cooper pairs.
△ Less
Submitted 17 June, 2021; v1 submitted 9 April, 2021;
originally announced April 2021.
-
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Authors:
Mandela Patrick,
Yuki M. Asano,
Bernie Huang,
Ishan Misra,
Florian Metze,
Joao Henriques,
Andrea Vedaldi
Abstract:
The quality of the image representations obtained from self-supervised learning depends strongly on the type of data augmentations used in the learning formulation. Recent papers have ported these methods from still images to videos and found that leveraging both audio and video signals yields strong gains; however, they did not find that spatial augmentations such as crop**, which are very impo…
▽ More
The quality of the image representations obtained from self-supervised learning depends strongly on the type of data augmentations used in the learning formulation. Recent papers have ported these methods from still images to videos and found that leveraging both audio and video signals yields strong gains; however, they did not find that spatial augmentations such as crop**, which are very important for still images, work as well for videos. In this paper, we improve these formulations in two ways unique to the spatio-temporal aspect of videos. First, for space, we show that spatial augmentations such as crop** do work well for videos too, but that previous implementations, due to the high processing and memory cost, could not do this at a scale sufficient for it to work well. To address this issue, we first introduce Feature Crop, a method to simulate such augmentations much more efficiently directly in feature space. Second, we show that as opposed to naive average pooling, the use of transformer-based attention improves performance significantly, and is well suited for processing feature crops. Combining both of our discoveries into a new method, Space-Time Crop & Attend (STiCA) we achieve state-of-the-art performance across multiple video-representation learning benchmarks. In particular, we achieve new state-of-the-art accuracies of 67.0% on HMDB-51 and 93.1% on UCF-101 when pre-training on Kinetics-400.
△ Less
Submitted 27 October, 2021; v1 submitted 18 March, 2021;
originally announced March 2021.
-
Privacy-preserving Object Detection
Authors:
Peiyang He,
Charlie Griffin,
Krzysztof Kacprzyk,
Artjom Joosen,
Michael Collyer,
Aleksandar Shtedritski,
Yuki M. Asano
Abstract:
Privacy considerations and bias in datasets are quickly becoming high-priority issues that the computer vision community needs to face. So far, little attention has been given to practical solutions that do not involve collection of new datasets. In this work, we show that for object detection on COCO, both anonymizing the dataset by blurring faces, as well as swap** faces in a balanced manner a…
▽ More
Privacy considerations and bias in datasets are quickly becoming high-priority issues that the computer vision community needs to face. So far, little attention has been given to practical solutions that do not involve collection of new datasets. In this work, we show that for object detection on COCO, both anonymizing the dataset by blurring faces, as well as swap** faces in a balanced manner along the gender and skin tone dimension, can retain object detection performances while preserving privacy and partially balancing bias.
△ Less
Submitted 11 March, 2021;
originally announced March 2021.
-
Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models
Authors:
Hannah Kirk,
Yennie Jun,
Haider Iqbal,
Elias Benussi,
Filippo Volpin,
Frederic A. Dreyer,
Aleksandar Shtedritski,
Yuki M. Asano
Abstract:
The capabilities of natural language models trained on large-scale data have increased immensely over the past few years. Open source libraries such as HuggingFace have made these models easily available and accessible. While prior research has identified biases in large language models, this paper considers biases contained in the most popular versions of these models when applied `out-of-the-box…
▽ More
The capabilities of natural language models trained on large-scale data have increased immensely over the past few years. Open source libraries such as HuggingFace have made these models easily available and accessible. While prior research has identified biases in large language models, this paper considers biases contained in the most popular versions of these models when applied `out-of-the-box' for downstream tasks. We focus on generative language models as they are well-suited for extracting biases inherited from training data. Specifically, we conduct an in-depth analysis of GPT-2, which is the most downloaded text generation model on HuggingFace, with over half a million downloads per month. We assess biases related to occupational associations for different protected categories by intersecting gender with religion, sexuality, ethnicity, political affiliation, and continental name origin. Using a template-based data collection pipeline, we collect 396K sentence completions made by GPT-2 and find: (i) The machine-predicted jobs are less diverse and more stereotypical for women than for men, especially for intersections; (ii) Intersectional interactions are highly relevant for occupational associations, which we quantify by fitting 262 logistic models; (iii) For most occupations, GPT-2 reflects the skewed gender and ethnicity distribution found in US Labor Bureau data, and even pulls the societally-skewed distribution towards gender parity in cases where its predictions deviate from real labor market observations. This raises the normative question of what language models should learn - whether they should reflect or correct for existing inequalities.
△ Less
Submitted 27 October, 2021; v1 submitted 8 February, 2021;
originally announced February 2021.
-
Josephson effect of superconductors with $J=3/2$ electrons
Authors:
Dakyeong Kim,
Shingo Kobayashi,
Yasuhiro Asano
Abstract:
The angular momentum of an electron is characterized well by pseudospin with $J=3/2$ in the presence of strong spin-orbit interactions. We study theoretically the Josephson effect of superconductors in which such two $J=3/2$ electrons form a Cooper pair. Within even-parity symmetry class, pseudospin-quintet pairing states with $J=2$ can exist as well as pseudospin-singlet state with $J=0$. We focu…
▽ More
The angular momentum of an electron is characterized well by pseudospin with $J=3/2$ in the presence of strong spin-orbit interactions. We study theoretically the Josephson effect of superconductors in which such two $J=3/2$ electrons form a Cooper pair. Within even-parity symmetry class, pseudospin-quintet pairing states with $J=2$ can exist as well as pseudospin-singlet state with $J=0$. We focus especially on the Josephson selection rule among these even-parity superconductors. We find that the selection rule between quintet states is severer than that between spin-triplet states formed by two $S=1/2$ electrons. The effects of a pseudospin-active interface on the selection rule are discussed as well as those of odd-frequency Cooper pairs generated by pseudospin dependent band structures.
△ Less
Submitted 19 January, 2021;
originally announced January 2021.
-
Odd-parity pairing correlations in a d-wave superconductor
Authors:
Jaechul Lee,
Satoshi Ikegaya,
Yasuhiro Asano
Abstract:
We theoretically study the effects of spin-orbit interactions on symmetry of a Cooper pair in a spin-singlet d-wave superconductor in two-dimension. The pairing symmetry is analyzed in terms of the anomalous Green's function which is obtained by solving the Gor'kov equation analytically. A spin-orbit interaction induces a spin-triplet p-wave pairing correlation in a uniform superconductor. An odd-…
▽ More
We theoretically study the effects of spin-orbit interactions on symmetry of a Cooper pair in a spin-singlet d-wave superconductor in two-dimension. The pairing symmetry is analyzed in terms of the anomalous Green's function which is obtained by solving the Gor'kov equation analytically. A spin-orbit interaction induces a spin-triplet p-wave pairing correlation in a uniform superconductor. An odd-frequency spin-triplet s-wave pairing correlation appears at a surface of such superconductor as a result of breaking inversion symmetry locally. We also discuss a close relationship among the odd-frequency pairing correlation, chirality of surface bound states at the zero energy, and the anomalous proximity effect. The obtained results enable us to design a superconductor which causes the strong anomalous proximity effect.
△ Less
Submitted 4 March, 2021; v1 submitted 20 November, 2020;
originally announced November 2020.
-
Geo-Graph-Indistinguishability: Location Privacy on Road Networks Based on Differential Privacy
Authors:
Shun Takagi,
Yang Cao,
Yasuhito Asano,
Masatoshi Yoshikawa
Abstract:
In recent years, concerns about location privacy are increasing with the spread of location-based services (LBSs). Many methods to protect location privacy have been proposed in the past decades. Especially, perturbation methods based on Geo-Indistinguishability (Geo-I), which randomly perturb a true location to a pseudolocation, are getting attention due to its strong privacy guarantee inherited…
▽ More
In recent years, concerns about location privacy are increasing with the spread of location-based services (LBSs). Many methods to protect location privacy have been proposed in the past decades. Especially, perturbation methods based on Geo-Indistinguishability (Geo-I), which randomly perturb a true location to a pseudolocation, are getting attention due to its strong privacy guarantee inherited from differential privacy. However, Geo-I is based on the Euclidean plane even though many LBSs are based on road networks (e.g. ride-sharing services). This causes unnecessary noise and thus an insufficient tradeoff between utility and privacy for LBSs on road networks. To address this issue, we propose a new privacy notion, Geo-Graph-Indistinguishability (GG-I), for locations on a road network to achieve a better tradeoff. We propose Graph-Exponential Mechanism (GEM), which satisfies GG-I. Moreover, we formalize the optimization problem to find the optimal GEM in terms of the tradeoff. However, the computational complexity of a naive method to find the optimal solution is prohibitive, so we propose a greedy algorithm to find an approximate solution in an acceptable amount of time. Finally, our experiments show that our proposed mechanism outperforms a Geo-I's mechanism with respect to the tradeoff.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Support-set bottlenecks for video-text representation learning
Authors:
Mandela Patrick,
Po-Yao Huang,
Yuki Asano,
Florian Metze,
Alexander Hauptmann,
João Henriques,
Andrea Vedaldi
Abstract:
The dominant paradigm for learning video-text representations -- noise contrastive learning -- increases the similarity of the representations of pairs of samples that are known to be related, such as text and video from the same sample, and pushes away the representations of all other pairs. We posit that this last behaviour is too strict, enforcing dissimilar representations even for samples tha…
▽ More
The dominant paradigm for learning video-text representations -- noise contrastive learning -- increases the similarity of the representations of pairs of samples that are known to be related, such as text and video from the same sample, and pushes away the representations of all other pairs. We posit that this last behaviour is too strict, enforcing dissimilar representations even for samples that are semantically-related -- for example, visually similar videos or ones that share the same depicted action. In this paper, we propose a novel method that alleviates this by leveraging a generative model to naturally push these related samples together: each sample's caption must be reconstructed as a weighted combination of other support samples' visual representations. This simple idea ensures that representations are not overly-specialized to individual samples, are reusable across the dataset, and results in representations that explicitly encode semantics shared between samples, unlike noise contrastive learning. Our proposed method outperforms others by a large margin on MSR-VTT, VATEX and ActivityNet, and MSVD for video-to-text and text-to-video retrieval.
△ Less
Submitted 14 January, 2021; v1 submitted 6 October, 2020;
originally announced October 2020.
-
Molecular Dynamics Simulation of Soundwave Propagation in a Simple Fluid
Authors:
Yuta Asano,
Hiroshi Watanabe,
Hiroshi Noguchi
Abstract:
A molecular dynamics (MD) simulation was performed to study the propagation of soundwaves in a fluid. Soundwaves are generated by a sinusoidally oscillating wall and annihilated by a locally applied Langevin thermostat near the opposite wall. The waveform changes from sinusoidal to sawtooth with increasing wave amplitude. For low-frequency sounds, the simulation results show very good agreement wi…
▽ More
A molecular dynamics (MD) simulation was performed to study the propagation of soundwaves in a fluid. Soundwaves are generated by a sinusoidally oscillating wall and annihilated by a locally applied Langevin thermostat near the opposite wall. The waveform changes from sinusoidal to sawtooth with increasing wave amplitude. For low-frequency sounds, the simulation results show very good agreement with Burgers equation without any fitting parameters. In contrast, for highfrequency sounds, significant deviations are obtained because of acoustic streaming. The speed of sound can be directly determined from the Fourier transform of a waveform with high accuracy. Although obtaining the attenuation rate directly from the simulation results is difficult because of the nonlinear effects of the wave amplitude, it can be estimated via Burgers equation. The results demonstrate that MD simulations are a useful tool for the quantitative analysis of soundwaves.
△ Less
Submitted 16 September, 2020; v1 submitted 24 July, 2020;
originally announced July 2020.
-
Labelling unlabelled videos from scratch with multi-modal self-supervision
Authors:
Yuki M. Asano,
Mandela Patrick,
Christian Rupprecht,
Andrea Vedaldi
Abstract:
A large part of the current success of deep learning lies in the effectiveness of data -- more precisely: labelled data. Yet, labelling a dataset with human annotation continues to carry high costs, especially for videos. While in the image domain, recent methods have allowed to generate meaningful (pseudo-) labels for unlabelled datasets without supervision, this development is missing for the vi…
▽ More
A large part of the current success of deep learning lies in the effectiveness of data -- more precisely: labelled data. Yet, labelling a dataset with human annotation continues to carry high costs, especially for videos. While in the image domain, recent methods have allowed to generate meaningful (pseudo-) labels for unlabelled datasets without supervision, this development is missing for the video domain where learning feature representations is the current focus. In this work, we a) show that unsupervised labelling of a video dataset does not come for free from strong feature encoders and b) propose a novel clustering method that allows pseudo-labelling of a video dataset without any human annotations, by leveraging the natural correspondence between the audio and visual modalities. An extensive analysis shows that the resulting clusters have high semantic overlap to ground truth human labels. We further introduce the first benchmarking results on unsupervised labelling of common video datasets Kinetics, Kinetics-Sound, VGG-Sound and AVE.
△ Less
Submitted 28 February, 2021; v1 submitted 24 June, 2020;
originally announced June 2020.
-
Emergent Geometries from the BMN Matrix Model
Authors:
Yuhma Asano
Abstract:
We review recent results of emergent geometries in the BMN matrix model, a one-dimensional gauge theory considered as a non-perturbative formulation of M-theory on the plane-wave geometry. A key to understand the emergent geometries is the eigenvalue distribution of a BPS operator. Gauge-theory calculation shows that the BPS operator reproduces the corresponding supergravity solutions in the gauge…
▽ More
We review recent results of emergent geometries in the BMN matrix model, a one-dimensional gauge theory considered as a non-perturbative formulation of M-theory on the plane-wave geometry. A key to understand the emergent geometries is the eigenvalue distribution of a BPS operator. Gauge-theory calculation shows that the BPS operator reproduces the corresponding supergravity solutions in the gauge/gravity duality and also brane geometries in the M-brane picture. At finite temperatures, these geometries should be realised in a non-trivial way. Monte Carlo simulations of this gauge theory revealed two types of phase transitions: the confinement/deconfinement transition and the Myers transition, which provide insights into the emergence of the geometries. Especially, the numerical results qualitatively agree with the critical temperature of the confinement/deconfinement transition predicted on the gravity side.
△ Less
Submitted 4 May, 2020; v1 submitted 27 April, 2020;
originally announced April 2020.
-
Superconductivity in Cu-doped Bi$_2$Se$_3$ with potential disorder
Authors:
Takumi Sato,
Yasuhiro Asano
Abstract:
We study the effects of random nonmagnetic impurities on superconducting transition temperature $T_c$ in a Cu-doped Bi$_2$Se$_3$, for which four types of pair potentials have been proposed. Although all the candidates belong to $s$-wave symmetry, two orbital degree of freedom in electronic structures enriches the symmetry variety of a Cooper pair such as even-orbital-parity and odd-orbital-parity.…
▽ More
We study the effects of random nonmagnetic impurities on superconducting transition temperature $T_c$ in a Cu-doped Bi$_2$Se$_3$, for which four types of pair potentials have been proposed. Although all the candidates belong to $s$-wave symmetry, two orbital degree of freedom in electronic structures enriches the symmetry variety of a Cooper pair such as even-orbital-parity and odd-orbital-parity. We consider realistic electronic structures of Cu-doped Bi$_2$Se$_3$ by using tight-binding Hamiltonian on a hexagonal lattice and consider effects of impurity scatterings through the self-energy of the Green's function within the Born approximation. We find that even-orbital-parity spin-singlet superconductivity is basically robust even in the presence of impurities. The degree of the robustness depends on the electronic structures in the normal state and on the pairing symmetry in orbital space. On the other hand, two odd-orbital-parity spin-triplet order parameters are always fragile in the presence of potential disorder.
△ Less
Submitted 20 April, 2020;
originally announced April 2020.
-
The nonperturbative phase diagram of the bosonic BMN matrix model
Authors:
Samuel Kováčik,
Denjoe O'Connor,
Yuhma Asano
Abstract:
We study the thermal phase transition of the bosonic BMN model which is a mass deformed version of the bosonic part of the BFSS model. Our results connect the massless region of the phase diagram described by the bosonic BFSS model with the large-mass region, where the model is analytically solvable. We observe that at finite value of the matrix size $N$, the critical region is smeared over a smal…
▽ More
We study the thermal phase transition of the bosonic BMN model which is a mass deformed version of the bosonic part of the BFSS model. Our results connect the massless region of the phase diagram described by the bosonic BFSS model with the large-mass region, where the model is analytically solvable. We observe that at finite value of the matrix size $N$, the critical region is smeared over a small temperature range. The model has a single critical temperature, which arises as the large $N$ limit of two apparent transitions at finite $N$. We emphasise the vital role played by finite $N$ corrections in the confined phase and illustrate this with a novel treatment of the noninteracting Gaussian model.
△ Less
Submitted 14 February, 2022; v1 submitted 13 April, 2020;
originally announced April 2020.
-
Nodal Andreev Spectra in Multi-Majorana Three-Terminal Josephson Junctions
Authors:
Keimei Sakurai,
Maria Teresa Mercaldo,
Shingo Kobayashi,
Ai Yamakage,
Satoshi Ikegaya,
Tetsuro Habe,
Panagiotis Kotetes,
Mario Cuoco,
Yasuhiro Asano
Abstract:
We investigate the Andreev-bound-state (ABS) spectra of three-terminal Josephson junctions which consist of 1D topological superconductors (TSCs) harboring multiple zero-energy edge Majorana bound states (MBSs) protected by chiral symmetry. Our theoretical analysis relies on the exact numerical diagonalization of the Bogoliubov-de Gennes (BdG) Hamiltonian describing the three interfaced TSCs, comp…
▽ More
We investigate the Andreev-bound-state (ABS) spectra of three-terminal Josephson junctions which consist of 1D topological superconductors (TSCs) harboring multiple zero-energy edge Majorana bound states (MBSs) protected by chiral symmetry. Our theoretical analysis relies on the exact numerical diagonalization of the Bogoliubov-de Gennes (BdG) Hamiltonian describing the three interfaced TSCs, complemented by an effective low-energy description solely based on the coupling of the interfacial MBSs arising before the leads get contacted. Considering the 2D synthetic space spanned by the two independent superconducting phase differences, we demonstrate that the ABS spectra may contain either point or line nodes, and identify $\mathbb{Z}_2$ topological invariants to classify them. We show that the resulting type of nodes depends on the number of preexisting interfacial MBSs, with nodal lines necessarily appearing when two TSCs harbor an unequal number of MBSs. Specifically, the precise number of interfacial MBSs determines the periodicity of the spectrum under $2π$-slidings of the phase differences and, as a result, also controls the shape of the nodal lines in synthetic space. When chiral symmetry is preserved, the lines are open and coincide with high-symmetry lines of synthetic space, while when it is violated the lines can also transform into loops and chains. The nodal spectra are robust by virtue of the inherent particle-hole symmetry of the BdG Hamiltonian, and give rise to distinctive experimental signatures that we identify.
△ Less
Submitted 30 March, 2020;
originally announced March 2020.
-
On Compositions of Transformations in Contrastive Self-Supervised Learning
Authors:
Mandela Patrick,
Yuki M. Asano,
Polina Kuznetsova,
Ruth Fong,
João F. Henriques,
Geoffrey Zweig,
Andrea Vedaldi
Abstract:
In the image domain, excellent representations can be learned by inducing invariance to content-preserving transformations via noise contrastive learning. In this paper, we generalize contrastive learning to a wider set of transformations, and their compositions, for which either invariance or distinctiveness is sought. We show that it is not immediately obvious how existing methods such as SimCLR…
▽ More
In the image domain, excellent representations can be learned by inducing invariance to content-preserving transformations via noise contrastive learning. In this paper, we generalize contrastive learning to a wider set of transformations, and their compositions, for which either invariance or distinctiveness is sought. We show that it is not immediately obvious how existing methods such as SimCLR can be extended to do so. Instead, we introduce a number of formal requirements that all contrastive formulations must satisfy, and propose a practical construction which satisfies these requirements. In order to maximise the reach of this analysis, we express all components of noise contrastive formulations as the choice of certain generalized transformations of the data (GDTs), including data sampling. We then consider videos as an example of data in which a large variety of transformations are applicable, accounting for the extra modalities -- for which we analyze audio and text -- and the dimension of time. We find that being invariant to certain transformations and distinctive to others is critical to learning effective video representations, improving the state-of-the-art for multiple benchmarks by a large margin, and even surpassing supervised pretraining.
△ Less
Submitted 27 October, 2021; v1 submitted 9 March, 2020;
originally announced March 2020.
-
The Confining Transition in the Bosonic BMN Matrix Model
Authors:
Yuhma Asano,
Samuel Kováčik,
Denjoe O'Connor
Abstract:
We study the confining/deconfining phase transition in the mass deformed Yang-Mills matrix model which is obtained by the dimensional reduction of the bosonic sector of the four-dimensional maximally supersymmetric Yang-Mills theory compactified on the three sphere, i.e. the bosonic BMN model. The $1/D$ (with $D$ the number of matrices) expansion suggests that the model may have two closely separa…
▽ More
We study the confining/deconfining phase transition in the mass deformed Yang-Mills matrix model which is obtained by the dimensional reduction of the bosonic sector of the four-dimensional maximally supersymmetric Yang-Mills theory compactified on the three sphere, i.e. the bosonic BMN model. The $1/D$ (with $D$ the number of matrices) expansion suggests that the model may have two closely separated transitions. However, using a second order lattice formulation of the model we find that for the small value of the mass parameter, $μ=2$, those two apparent critical temperatures merge at large $N$, leaving only a single weakly first-order phase transition, in agreement with recent numerical results for $μ=0$ (the bosonic BFSS model).
△ Less
Submitted 19 May, 2020; v1 submitted 11 January, 2020;
originally announced January 2020.
-
Identification of Spin-Triplet Superconductivity through a Helical-Chiral Phase Transition in Sr$_2$RuO$_4$ Thin-Films
Authors:
S. Ikegaya,
K. Yada,
Y. Tanaka,
S. Kashiwaya,
Y. Asano,
D. Manske
Abstract:
Despite much effort for over the two decades, the paring symmetry of a Sr$_2$RuO$_4$ superconductor has been still unclear. In this Rapid Communication, motivated by the recent rapid progress in fabrication techniques for Sr$_2$RuO$_4$ thin-films, we propose a promising strategy for identifying the spin-triplet superconductivity in the thin-film geometry by employing an antisymmetric spin-orbit co…
▽ More
Despite much effort for over the two decades, the paring symmetry of a Sr$_2$RuO$_4$ superconductor has been still unclear. In this Rapid Communication, motivated by the recent rapid progress in fabrication techniques for Sr$_2$RuO$_4$ thin-films, we propose a promising strategy for identifying the spin-triplet superconductivity in the thin-film geometry by employing an antisymmetric spin-orbit coupling potential and a Zeeman potential due to an external magnetic field. We demonstrate that a spin-triplet superconducting thin-film undergoes a phase transition from a helical state to a chiral state by increasing the applied magnetic field. This phase transition is accompanied by a drastic change in the property of surface Andreev bound states. As a consequence, the helical-chiral phase transition, which is unique to the spin-triplet superconductors, can be detected through a sudden change in a tunneling conductance spectrum of a normal-metal/superconductor junction. Importantly, our proposal is constructed by combining fundamental and rigid concepts regarding physics of spin-triplet superconductivity.
△ Less
Submitted 31 December, 2019;
originally announced December 2019.
-
Josephson effect in two-band superconductors
Authors:
Akihiro Sasaki,
Satoshi Ikegaya,
Tetsuro Habe,
Alexander A. Golubov,
Yasuhiro Asano
Abstract:
We study theoretically the Josephson effect between two time-reversal two-band superconductors, where we assume the equal-time spin-singlet $s$-wave pair potential in each conduction band. %as well as the band asymmetry and the band hybridization in the normal state. The superconducting phase at the first band $\varphi_1$ and that at the second band $\varphi_2$ characterize a two-band superconduct…
▽ More
We study theoretically the Josephson effect between two time-reversal two-band superconductors, where we assume the equal-time spin-singlet $s$-wave pair potential in each conduction band. %as well as the band asymmetry and the band hybridization in the normal state. The superconducting phase at the first band $\varphi_1$ and that at the second band $\varphi_2$ characterize a two-band superconducting state. We consider a Josephson junction where an insulating barrier separates two such two-band superconductors. By applying the tunnel Hamiltonian description, the Josephson current is calculated in terms of the anomalous Green's function on either side of the junction. We find that the Josephson current consists of three components which depend on three types of phase differences across the junction: the phase difference at the first band $δ\varphi_1$, the phase difference at the second band $δ\varphi_2$, and the difference at the center-of-mass phase $δ(\varphi_1+\varphi_2)/2$. A Cooper pairs generated by the band hybridization carries the last current component. In some cases, the current-phase relationship deviates from the sinusoidal function as a result of time-reversal symmetry breaking down.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
Effects of Cavitation on Karman Vortex Behind Circular-Cylinder Arrays
Authors:
Yuta Asano,
Hiroshi Watanabe,
Hiroshi Noguchi
Abstract:
The effects of cavitation on the flow around a circular-cylinder array are studied by using a molecular dynamics simulation. Cavitation significantly affects on vortex shedding characteristics. As the cavitation develops,the vibration acting on the cylinders decreases and eventually disappears. The further cavitation development generates a longer vapor region next to the cylinders, and the vortex…
▽ More
The effects of cavitation on the flow around a circular-cylinder array are studied by using a molecular dynamics simulation. Cavitation significantly affects on vortex shedding characteristics. As the cavitation develops,the vibration acting on the cylinders decreases and eventually disappears. The further cavitation development generates a longer vapor region next to the cylinders, and the vortex streets are formed at further positions from the cylinders. The neighboring Karman vortexes are synchronized in the antiphase in the absence of the cavitation. This synchronization is weakened by the cavitation, and an asymmetric wake mode can be induced. These findings help mechanical designs of fluid machinery that include cylinder arrays.
△ Less
Submitted 14 November, 2019;
originally announced November 2019.
-
Self-labelling via simultaneous clustering and representation learning
Authors:
Yuki Markus Asano,
Christian Rupprecht,
Andrea Vedaldi
Abstract:
Combining clustering and representation learning is one of the most promising approaches for unsupervised learning of deep neural networks. However, doing so naively leads to ill posed learning problems with degenerate solutions. In this paper, we propose a novel and principled learning formulation that addresses these issues. The method is obtained by maximizing the information between labels and…
▽ More
Combining clustering and representation learning is one of the most promising approaches for unsupervised learning of deep neural networks. However, doing so naively leads to ill posed learning problems with degenerate solutions. In this paper, we propose a novel and principled learning formulation that addresses these issues. The method is obtained by maximizing the information between labels and input data indices. We show that this criterion extends standard crossentropy minimization to an optimal transport problem, which we solve efficiently for millions of input images and thousands of labels using a fast variant of the Sinkhorn-Knopp algorithm. The resulting method is able to self-label visual data so as to train highly competitive image representations without manual labels. Our method achieves state of the art representation learning performance for AlexNet and ResNet-50 on SVHN, CIFAR-10, CIFAR-100 and ImageNet and yields the first self-supervised AlexNet that outperforms the supervised Pascal VOC detection baseline. Code and models are available.
△ Less
Submitted 19 February, 2020; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Dipole oscillation of a trapped Bose--Fermi-mixture gas in collisionless and hydrodynamic regimes
Authors:
Yoji Asano,
Shohei Watabe,
Tetsuro Nikuni
Abstract:
Dipole oscillation is studied in a normal phase of a trapped Bose--Fermi-mixture gas composed of single-species bosons and single-species fermions. Applying the moment method to the linearized Boltzmann equation, we derive a closed set of equations of motion for the center-of-mass position and momentum of both components. By solving the coupled equations, we reveal the behavior of dipole modes in…
▽ More
Dipole oscillation is studied in a normal phase of a trapped Bose--Fermi-mixture gas composed of single-species bosons and single-species fermions. Applying the moment method to the linearized Boltzmann equation, we derive a closed set of equations of motion for the center-of-mass position and momentum of both components. By solving the coupled equations, we reveal the behavior of dipole modes in the transition between the collisionless regime and the hydrodynamic regime. We find that two oscillating modes in the collisionless regime have distinct fates in the hydrodynamic regime: one collisionless mode shows a crossover to a hydrodynamic in-phase mode, and the other collisionless mode shows a transition to two purely damped modes. The temperature dependence of these dipole modes are also discussed.
△ Less
Submitted 30 January, 2020; v1 submitted 23 September, 2019;
originally announced September 2019.