-
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models
Authors:
Xiyu Wang,
Yufei Wang,
Satoshi Tsutsui,
Weisi Lin,
Bihan Wen,
Alex C. Kot
Abstract:
Diffusion-based models for story visualization have shown promise in generating content-coherent images for storytelling tasks. However, how to effectively integrate new characters into existing narratives while maintaining character consistency remains an open problem, particularly with limited data. Two major limitations hinder the progress: (1) the absence of a suitable benchmark due to potenti…
▽ More
Diffusion-based models for story visualization have shown promise in generating content-coherent images for storytelling tasks. However, how to effectively integrate new characters into existing narratives while maintaining character consistency remains an open problem, particularly with limited data. Two major limitations hinder the progress: (1) the absence of a suitable benchmark due to potential character leakage and inconsistent text labeling, and (2) the challenge of distinguishing between new and old characters, leading to ambiguous results. To address these challenges, we introduce the NewEpisode benchmark, comprising refined datasets designed to evaluate generative models' adaptability in generating new stories with fresh characters using just a single example story. The refined dataset involves refined text prompts and eliminates character leakage. Additionally, to mitigate the character confusion of generated results, we propose EpicEvo, a method that customizes a diffusion-based visual story generation model with a single story featuring the new characters seamlessly integrating them into established character dynamics. EpicEvo introduces a novel adversarial character alignment module to align the generated images progressively in the diffusive process, with exemplar images of new characters, while applying knowledge distillation to prevent forgetting of characters and background details. Our evaluation quantitatively demonstrates that EpicEvo outperforms existing baselines on the NewEpisode benchmark, and qualitative studies confirm its superior customization of visual story generation in diffusion models. In summary, EpicEvo provides an effective way to incorporate new characters using only one example story, unlocking new possibilities for applications such as serialized cartoons.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Basis Function Dependence of Estimation Precision for Synchrotron-Radiation-Based Mössbauer Spectroscopy
Authors:
Binsheu Shieh,
Ryo Masuda,
Satoshi Tsutsui,
Shun Katakami,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Mössbauer spectroscopy is a technique employed to investigate the microscopic properties of materials using transitions between energy levels in the nuclei. Conventionally, in synchrotron-radiation-based Mössbauer spectroscopy, the measurement window is decided by the researcher heuristically, although this decision has a significant impact on the shape of the measurement spectra. In this paper, w…
▽ More
Mössbauer spectroscopy is a technique employed to investigate the microscopic properties of materials using transitions between energy levels in the nuclei. Conventionally, in synchrotron-radiation-based Mössbauer spectroscopy, the measurement window is decided by the researcher heuristically, although this decision has a significant impact on the shape of the measurement spectra. In this paper, we propose a method for evaluating the precision of the spectral position by introducing Bayesian estimation. The proposed method makes it possible to select the best measurement window by calculating the precision of Mössbauer spectroscopy from the data. Based on the results, the precision of the Mössbauer center shifts improved by more than three times compared with the results achieved with the conventional simple fitting method using the Lorentzian function.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces
Authors:
Juan Hu,
Xin Liao,
Difei Gao,
Satoshi Tsutsui,
Qian Wang,
Zheng Qin,
Mike Zheng Shou
Abstract:
Deepfake videos are becoming increasingly realistic, showing few tampering traces on facial areasthat vary between frames. Consequently, existing Deepfake detection methods struggle to detect unknown domain Deepfake videos while accurately locating the tampered region. To address thislimitation, we propose Delocate, a novel Deepfake detection model that can both recognize andlocalize unknown domai…
▽ More
Deepfake videos are becoming increasingly realistic, showing few tampering traces on facial areasthat vary between frames. Consequently, existing Deepfake detection methods struggle to detect unknown domain Deepfake videos while accurately locating the tampered region. To address thislimitation, we propose Delocate, a novel Deepfake detection model that can both recognize andlocalize unknown domain Deepfake videos. Ourmethod consists of two stages named recoveringand localization. In the recovering stage, the modelrandomly masks regions of interest (ROIs) and reconstructs real faces without tampering traces, leading to a relatively good recovery effect for realfaces and a poor recovery effect for fake faces. Inthe localization stage, the output of the recoveryphase and the forgery ground truth mask serve assupervision to guide the forgery localization process. This process strategically emphasizes the recovery phase of fake faces with poor recovery, facilitating the localization of tampered regions. Ourextensive experiments on four widely used benchmark datasets demonstrate that Delocate not onlyexcels in localizing tampered areas but also enhances cross-domain detection performance.
△ Less
Submitted 9 May, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
-
Structural and Dynamical Changes in a Gd-Co Metallic Glass by Cryogenic Rejuvenation
Authors:
Shinya Hosokawa,
Jens R. Stellhorn,
Laszlo Pusztai,
Yoshikatsu Yamazaki,
**g Jiang,
Hidemi Kato,
Tetsu Ichitsubo,
Eisuke Magome,
Nils Blanc,
Nathalie Boudet,
Koji Ohara,
Satoshi Tsutsui,
Hiroshi Uchiyama,
Alfred Q. R. Baron
Abstract:
To experimentally clarify the changes in structural and dynamic heterogeneities in a metallic glass (MG), Gd65Co35, by rejuvenation with a temperature cycling (cryogenic rejuvenation), high-energy x-ray diffraction (HEXRD), anomalous x-ray scattering (AXS), and inelastic x-ray scattering (IXS) experiments were carried out. By a repeated temperature change between liquid N2 and room temperatures 40…
▽ More
To experimentally clarify the changes in structural and dynamic heterogeneities in a metallic glass (MG), Gd65Co35, by rejuvenation with a temperature cycling (cryogenic rejuvenation), high-energy x-ray diffraction (HEXRD), anomalous x-ray scattering (AXS), and inelastic x-ray scattering (IXS) experiments were carried out. By a repeated temperature change between liquid N2 and room temperatures 40 times, tiny but clear structural changes are observed by HEXRD even in the first neighboring range. Partial structural information obtained by AXS reveals that slight movements of the Gd and Co atoms occur in the first- and second-neighboring shells around the central Gd atom. The concentration inhomogeneity in the nm size drastically increases for the Gd atoms by the temperature cycling, while the other heterogeneities are negligible. A distinct change was detected in a microscopic elastic property by IXS: The width of longitudinal acoustic excitation broadens by about 20%, indicating an increase of the elastic heterogeneity of this MG by the thermal treatments. These static and dynamic results explicitly clarify the features of the cryogenic rejuvenation effect experimentally.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Bias-preserving computation with the bit-flip code
Authors:
Shoichiro Tsutsui,
Keita Kanno
Abstract:
We explore the feasibility of fault-tolerant quantum computation using the bit-flip repetition code in a biased noise channel where only the bit-flip error can occur. While several logic gates can potentially produce phase-flip errors even in such a channel, we propose bias-preserving implementation of $S$, $H$, $\mathrm{CZ}$, and $R_z$ gates. We demonstrate that our scheme improves the computatio…
▽ More
We explore the feasibility of fault-tolerant quantum computation using the bit-flip repetition code in a biased noise channel where only the bit-flip error can occur. While several logic gates can potentially produce phase-flip errors even in such a channel, we propose bias-preserving implementation of $S$, $H$, $\mathrm{CZ}$, and $R_z$ gates. We demonstrate that our scheme improves the computational precision in several tasks such as the time evolution of quantum systems and variational quantum eigensolver.
△ Less
Submitted 3 June, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
Recap: Detecting Deepfake Video with Unpredictable Tampered Traces via Recovering Faces and Map** Recovered Faces
Authors:
Juan Hu,
Xin Liao,
Difei Gao,
Satoshi Tsutsui,
Qian Wang,
Zheng Qin,
Mike Zheng Shou
Abstract:
The exploitation of Deepfake techniques for malicious intentions has driven significant research interest in Deepfake detection. Deepfake manipulations frequently introduce random tampered traces, leading to unpredictable outcomes in different facial regions. However, existing detection methods heavily rely on specific forgery indicators, and as the forgery mode improves, these traces become incre…
▽ More
The exploitation of Deepfake techniques for malicious intentions has driven significant research interest in Deepfake detection. Deepfake manipulations frequently introduce random tampered traces, leading to unpredictable outcomes in different facial regions. However, existing detection methods heavily rely on specific forgery indicators, and as the forgery mode improves, these traces become increasingly randomized, resulting in a decline in the detection performance of methods reliant on specific forgery traces. To address the limitation, we propose Recap, a novel Deepfake detection model that exposes unspecific facial part inconsistencies by recovering faces and enlarges the differences between real and fake by map** recovered faces. In the recovering stage, the model focuses on randomly masking regions of interest (ROIs) and reconstructing real faces without unpredictable tampered traces, resulting in a relatively good recovery effect for real faces while a poor recovery effect for fake faces. In the map** stage, the output of the recovery phase serves as supervision to guide the facial map** process. This map** process strategically emphasizes the map** of fake faces with poor recovery, leading to a further deterioration in their representation, while enhancing and refining the map** of real faces with good representation. As a result, this approach significantly amplifies the discrepancies between real and fake videos. Our extensive experiments on standard benchmarks demonstrate that Recap is effective in multiple scenarios.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes
Authors:
Satoshi Tsutsui,
Winnie Pang,
Bihan Wen
Abstract:
The examination of blood samples at a microscopic level plays a fundamental role in clinical diagnostics, influencing a wide range of medical conditions. For instance, an in-depth study of White Blood Cells (WBCs), a crucial component of our blood, is essential for diagnosing blood-related diseases such as leukemia and anemia. While multiple datasets containing WBC images have been proposed, they…
▽ More
The examination of blood samples at a microscopic level plays a fundamental role in clinical diagnostics, influencing a wide range of medical conditions. For instance, an in-depth study of White Blood Cells (WBCs), a crucial component of our blood, is essential for diagnosing blood-related diseases such as leukemia and anemia. While multiple datasets containing WBC images have been proposed, they mostly focus on cell categorization, often lacking the necessary morphological details to explain such categorizations, despite the importance of explainable artificial intelligence (XAI) in medical domains. This paper seeks to address this limitation by introducing comprehensive annotations for WBC images. Through collaboration with pathologists, a thorough literature review, and manual inspection of microscopic images, we have identified 11 morphological attributes associated with the cell and its components (nucleus, cytoplasm, and granules). We then annotated ten thousand WBC images with these attributes. Moreover, we conduct experiments to predict these attributes from images, providing insights beyond basic WBC classification. As the first public dataset to offer such extensive annotations, we also illustrate specific applications that can benefit from our attribute annotations. Overall, our dataset paves the way for interpreting WBC recognition models, further advancing XAI in the fields of pathology and hematology.
△ Less
Submitted 25 December, 2023; v1 submitted 23 June, 2023;
originally announced June 2023.
-
Mover: Mask and Recovery based Facial Part Consistency Aware Method for Deepfake Video Detection
Authors:
Juan Hu,
Xin Liao,
Difei Gao,
Satoshi Tsutsui,
Qian Wang,
Zheng Qin,
Mike Zheng Shou
Abstract:
Deepfake techniques have been widely used for malicious purposes, prompting extensive research interest in develo** Deepfake detection methods. Deepfake manipulations typically involve tampering with facial parts, which can result in inconsistencies across different parts of the face. For instance, Deepfake techniques may change smiling lips to an upset lip, while the eyes remain smiling. Existi…
▽ More
Deepfake techniques have been widely used for malicious purposes, prompting extensive research interest in develo** Deepfake detection methods. Deepfake manipulations typically involve tampering with facial parts, which can result in inconsistencies across different parts of the face. For instance, Deepfake techniques may change smiling lips to an upset lip, while the eyes remain smiling. Existing detection methods depend on specific indicators of forgery, which tend to disappear as the forgery patterns are improved. To address the limitation, we propose Mover, a new Deepfake detection model that exploits unspecific facial part inconsistencies, which are inevitable weaknesses of Deepfake videos. Mover randomly masks regions of interest (ROIs) and recovers faces to learn unspecific features, which makes it difficult for fake faces to be recovered, while real faces can be easily recovered. Specifically, given a real face image, we first pretrain a masked autoencoder to learn facial part consistency by dividing faces into three parts and randomly masking ROIs, which are then recovered based on the unmasked facial parts. Furthermore, to maximize the discrepancy between real and fake videos, we propose a novel model with dual networks that utilize the pretrained encoder and masked autoencoder, respectively. 1) The pretrained encoder is finetuned for capturing the encoding of inconsistent information in the given video. 2) The pretrained masked autoencoder is utilized for map** faces and distinguishing real and fake videos. Our extensive experiments on standard benchmarks demonstrate that Mover is highly effective.
△ Less
Submitted 10 May, 2023;
originally announced May 2023.
-
Computation of Green's function by local variational quantum compilation
Authors:
Shota Kanasugi,
Shoichiro Tsutsui,
Yuya O. Nakagawa,
Kazunori Maruyama,
Hirotaka Oshima,
Shintaro Sato
Abstract:
Computation of the Green's function is crucial to study the properties of quantum many-body systems such as strongly correlated systems. Although the high-precision calculation of the Green's function is a notoriously challenging task on classical computers, the development of quantum computers may enable us to compute the Green's function with high accuracy even for classically-intractable large-…
▽ More
Computation of the Green's function is crucial to study the properties of quantum many-body systems such as strongly correlated systems. Although the high-precision calculation of the Green's function is a notoriously challenging task on classical computers, the development of quantum computers may enable us to compute the Green's function with high accuracy even for classically-intractable large-scale systems. Here, we propose an efficient method to compute the real-time Green's function based on the local variational quantum compilation (LVQC) algorithm, which simulates the time evolution of a large-scale quantum system using a low-depth quantum circuit constructed through optimization on a smaller-size subsystem. Our method requires shallow quantum circuits to calculate the Green's function and can be utilized on both near-term noisy intermediate-scale and long-term fault-tolerant quantum computers depending on the computational resources we have. We perform a numerical simulation of the Green's function for the one- and two-dimensional Fermi-Hubbard model up to $4\times4$ sites lattice (32 qubits) and demonstrate the validity of our protocol compared to a standard method based on the Trotter decomposition. We finally present a detailed estimation of the gate count for the large-scale Fermi-Hubbard model, which also illustrates the advantage of our method over the Trotter decomposition.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
Benchmarking White Blood Cell Classification Under Domain Shift
Authors:
Satoshi Tsutsui,
Zhengyang Su,
Bihan Wen
Abstract:
Recognizing the types of white blood cells (WBCs) in microscopic images of human blood smears is a fundamental task in the fields of pathology and hematology. Although previous studies have made significant contributions to the development of methods and datasets, few papers have investigated benchmarks or baselines that others can easily refer to. For instance, we observed notable variations in t…
▽ More
Recognizing the types of white blood cells (WBCs) in microscopic images of human blood smears is a fundamental task in the fields of pathology and hematology. Although previous studies have made significant contributions to the development of methods and datasets, few papers have investigated benchmarks or baselines that others can easily refer to. For instance, we observed notable variations in the reported accuracies of the same Convolutional Neural Network (CNN) model across different studies, yet no public implementation exists to reproduce these results. In this paper, we establish a benchmark for WBC recognition. Our results indicate that CNN-based models achieve high accuracy when trained and tested under similar imaging conditions. However, their performance drops significantly when tested under different conditions. Moreover, the ResNet classifier, which has been widely employed in previous work, exhibits an unreasonably poor generalization ability under domain shifts due to batch normalization. We investigate this issue and suggest some alternative normalization techniques that can mitigate it. We make fully-reproducible code publicly available\footnote{\url{https://github.com/apple2373/wbc-benchmark}}.
△ Less
Submitted 19 May, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Mover: Mask and Recovery based Facial Part Consistency Aware Method for Deepfake Video Detection
Authors:
Juan Hu,
Xin Liao,
Difei Gao,
Satoshi Tsutsui,
Qian Wang,
Zheng Qin,
Mike Zheng Shou
Abstract:
Deepfake techniques have been widely used for malicious purposes, prompting extensive research interest in develo** Deepfake detection methods. Deepfake manipulations typically involve tampering with facial parts, which can result in inconsistencies across different parts of the face. For instance, Deepfake techniques may change smiling lips to an upset lip, while the eyes remain smiling. Existi…
▽ More
Deepfake techniques have been widely used for malicious purposes, prompting extensive research interest in develo** Deepfake detection methods. Deepfake manipulations typically involve tampering with facial parts, which can result in inconsistencies across different parts of the face. For instance, Deepfake techniques may change smiling lips to an upset lip, while the eyes remain smiling. Existing detection methods depend on specific indicators of forgery, which tend to disappear as the forgery patterns are improved. To address the limitation, we propose Mover, a new Deepfake detection model that exploits unspecific facial part inconsistencies, which are inevitable weaknesses of Deepfake videos. Mover randomly masks regions of interest (ROIs) and recovers faces to learn unspecific features, which makes it difficult for fake faces to be recovered, while real faces can be easily recovered. Specifically, given a real face image, we first pretrain a masked autoencoder to learn facial part consistency by dividing faces into three parts and randomly masking ROIs, which are then recovered based on the unmasked facial parts. Furthermore, to maximize the discrepancy between real and fake videos, we propose a novel model with dual networks that utilize the pretrained encoder and masked autoencoder, respectively. 1) The pretrained encoder is finetuned for capturing the encoding of inconsistent information in the given video. 2) The pretrained masked autoencoder is utilized for map** faces and distinguishing real and fake videos. Our extensive experiments on standard benchmarks demonstrate that Mover is highly effective.
△ Less
Submitted 5 May, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Color superconductivity on the lattice -- analytic predictions from QCD in a small box
Authors:
Takeru Yokota,
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Asato Tsuchiya,
Shoichiro Tsutsui
Abstract:
We investigate color superconductivity on the lattice using the gap equation for the Cooper pair condensate. The weak coupling analysis is justified by choosing the physical size of the lattice to be smaller than the QCD scale, while kee** the aspect ratio of the lattice small enough to suppress thermal excitations. In the vicinity of the critical coupling constant that separates the superconduc…
▽ More
We investigate color superconductivity on the lattice using the gap equation for the Cooper pair condensate. The weak coupling analysis is justified by choosing the physical size of the lattice to be smaller than the QCD scale, while kee** the aspect ratio of the lattice small enough to suppress thermal excitations. In the vicinity of the critical coupling constant that separates the superconducting phase and the normal phase, the gap equation can be linearized, and by solving the corresponding eigenvalue problem, we obtain the critical point and the Cooper pair condensate without assuming its explicit form. The momentum components of the condensate suggest spatially isotropic s-wave superconductivity with Cooper pairs formed by quarks near the Fermi surface. The chiral symmetry in the massless limit is spontaneously broken by the Cooper pair condensate, which turns out to be dominated by the scalar and the pseudo-scalar components. Our results provide useful predictions, in particular, for future lattice simulations based on methods to overcome the sign problem such as the complex Langevin method.
△ Less
Submitted 3 March, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Cluster Toroidal Multipoles Formed by Electric-Quadrupole and Magnetic-Octupole Trimers: A Possible Scenario for Hidden Orders in Ca$_5$Ir$_3$O$_{12}$
Authors:
Satoru Hayami,
Satoshi Tsutsui,
Hiroki Hanate,
Nobumoto Nagasawa,
Yoshitaka Yoda,
Kazuyuki Matsuhira
Abstract:
Cluster multipole orderings composed of atomic high-rank multipole moments are theoretically investigated with a 5$d$-electron compound Ca$_5$Ir$_3$O$_{12}$ in mind. Ca$_5$Ir$_3$O$_{12}$ exhibits two hidden orders: One is an intermediate-temperature phase with time-reversal symmetry and the other is a low-temperature phase without time-reversal symmetry. By performing the symmetry and augmented mu…
▽ More
Cluster multipole orderings composed of atomic high-rank multipole moments are theoretically investigated with a 5$d$-electron compound Ca$_5$Ir$_3$O$_{12}$ in mind. Ca$_5$Ir$_3$O$_{12}$ exhibits two hidden orders: One is an intermediate-temperature phase with time-reversal symmetry and the other is a low-temperature phase without time-reversal symmetry. By performing the symmetry and augmented multipole analyses for a $d$-orbital model under the hexagonal point group $D_{\rm 3h}$, we find that the 120$^{\circ}$-type ordering of the electric quadrupole corresponds to cluster electric toroidal dipole ordering with the electric ferroaxial moment, which can become the microscopic origin of the intermediate-temperature phase in Ca$_5$Ir$_3$O$_{12}$. Furthermore, based on ${}^{193}$Ir synchrotron-radiation-based Mössbauer spectroscopy, we propose that the low-temperature phase in Ca$_5$Ir$_3$O$_{12}$ is regarded as a coexisting state with cluster electric toroidal dipole and cluster magnetic toroidal quadrupole, the latter of which is formed by the 120$^{\circ}$-type ordering of the magnetic octupole and accompanies a small uniform magnetization as a secondary effect. Our results provide a clue to two hidden phases in Ca$_5$Ir$_3$O$_{12}$.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Density-Induced Hadron-Quark Crossover via the Formation of Cooper Triples
Authors:
Hiroyuki Tajima,
Shoichiro Tsutsui,
Takahiro M. Doi,
Kei Iida
Abstract:
We discuss the hadron--quark crossover accompanied by the formation of Cooper triples (three-body counterpart of Cooper pairs) by analogy with the Bose--Einstein condensate to Bardeen--Cooper--Schrieffer crossover in two-component fermionic systems. Such a crossover is different from a phase transition, which often involves symmetry breaking. We calculate the in-medium three-body energy from the t…
▽ More
We discuss the hadron--quark crossover accompanied by the formation of Cooper triples (three-body counterpart of Cooper pairs) by analogy with the Bose--Einstein condensate to Bardeen--Cooper--Schrieffer crossover in two-component fermionic systems. Such a crossover is different from a phase transition, which often involves symmetry breaking. We calculate the in-medium three-body energy from the three-body $T$-matrix with a phenomenological three-body force characterizing a bound hadronic state in vacuum. With increasing density, the hadronic bound-state pole smoothly undergoes a crossover toward the Cooper triple phase where the in-medium three-body clusters coexist with the quark Fermi sea. The relation to the quarkyonic matter model can also be found in a natural manner.
△ Less
Submitted 1 February, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Multigap Superconductivity in the Filled-Skutterudite Compound LaRu$_4$As$_{12}$ probed by muon spin rotation
Authors:
A. Bhattacharyya,
D. T. Adroja,
M. M. Koza,
S. Tsutsui,
T. Cichorek,
A. D. Hillier
Abstract:
Muon spin rotation ($μ$SR) and inelastic X-ray scattering (IXS) were used to investigate the superconducting properties of the filled-skutterudite compound LaRu$_{4}$As$_{12}$. A two-gap isotropic ($s+s$)-wave model can explain the temperature dependence of the superfluid density. Zero field $μ$SR measurements confirm that the time-reversal symmetry does not break upon entering the superconducting…
▽ More
Muon spin rotation ($μ$SR) and inelastic X-ray scattering (IXS) were used to investigate the superconducting properties of the filled-skutterudite compound LaRu$_{4}$As$_{12}$. A two-gap isotropic ($s+s$)-wave model can explain the temperature dependence of the superfluid density. Zero field $μ$SR measurements confirm that the time-reversal symmetry does not break upon entering the superconducting state. The measurements of lattice dynamics at 2, 20 and 300 K revealed temperature dependencies of the phonon modes that do not follow strictly a hardening of phonon frequencies upon cooling as expected within the quasi-harmonic picture. The 20~K data rather mark a turning point for the majority of the phonon frequencies. Indeed a hardening is observed approaching 20~K from above, while for a few branches a weak softening is visible upon further cooling to 2~K. The observed dispersion relations of phonon modes throughout the Brillouin zone matches with the DFT prediction quite closely. Our results point out that cubic LaRu$_{4}$As$_{12}$ is a good reference material for studying multiband superconductivity, including those with lower crystallographic symmetries such as iron arsenide-based superconductors.
△ Less
Submitted 23 September, 2022;
originally announced September 2022.
-
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Authors:
Xizhe Xue,
Dongdong Yu,
Lingqiao Liu,
Yu Liu,
Satoshi Tsutsui,
Ying Li,
Zehuan Yuan,
** Song,
Mike Zheng Shou
Abstract:
Open-World Instance Segmentation (OWIS) is an emerging research topic that aims to segment class-agnostic object instances from images. The mainstream approaches use a two-stage segmentation framework, which first locates the candidate object bounding boxes and then performs instance segmentation. In this work, we instead promote a single-stage framework for OWIS. We argue that the end-to-end trai…
▽ More
Open-World Instance Segmentation (OWIS) is an emerging research topic that aims to segment class-agnostic object instances from images. The mainstream approaches use a two-stage segmentation framework, which first locates the candidate object bounding boxes and then performs instance segmentation. In this work, we instead promote a single-stage framework for OWIS. We argue that the end-to-end training process in the single-stage framework can be more convenient for directly regularizing the localization of class-agnostic object pixels. Based on the single-stage instance segmentation framework, we propose a regularization model to predict foreground pixels and use its relation to instance segmentation to construct a cross-task consistency loss. We show that such a consistency loss could alleviate the problem of incomplete instance annotation -- a common problem in the existing OWIS datasets. We also show that the proposed loss lends itself to an effective solution to semi-supervised OWIS that could be considered an extreme case that all object annotations are absent for some images. Our extensive experiments demonstrate that the proposed method achieves impressive results in both fully-supervised and semi-supervised settings. Compared to SOTA methods, the proposed method significantly improves the $AP_{100}$ score by 4.75\% in UVO$\rightarrow$UVO setting and 4.05\% in COCO$\rightarrow$UVO setting. In the case of semi-supervised learning, our model learned with only 30\% labeled data, even outperforms its fully-supervised counterpart with 50\% labeled data. The code will be released soon.
△ Less
Submitted 18 October, 2022; v1 submitted 18 August, 2022;
originally announced August 2022.
-
Action Recognition based on Cross-Situational Action-object Statistics
Authors:
Satoshi Tsutsui,
Xizi Wang,
Guangyuan Weng,
Yayun Zhang,
David Crandall,
Chen Yu
Abstract:
Machine learning models of visual action recognition are typically trained and tested on data from specific situations where actions are associated with certain objects. It is an open question how action-object associations in the training set influence a model's ability to generalize beyond trained situations. We set out to identify properties of training data that lead to action recognition mode…
▽ More
Machine learning models of visual action recognition are typically trained and tested on data from specific situations where actions are associated with certain objects. It is an open question how action-object associations in the training set influence a model's ability to generalize beyond trained situations. We set out to identify properties of training data that lead to action recognition models with greater generalization ability. To do this, we take inspiration from a cognitive mechanism called cross-situational learning, which states that human learners extract the meaning of concepts by observing instances of the same concept across different situations. We perform controlled experiments with various types of action-object associations, and identify key properties of action-object co-occurrence in training data that lead to better classifiers. Given that these properties are missing in the datasets that are typically used to train action classifiers in the computer vision literature, our work provides useful insights on how we should best construct datasets for efficiently training for better generalization.
△ Less
Submitted 15 August, 2022;
originally announced August 2022.
-
Novel View Synthesis for High-fidelity Headshot Scenes
Authors:
Satoshi Tsutsui,
Weijia Mao,
Si**g Lin,
Yunyi Zhu,
Murong Ma,
Mike Zheng Shou
Abstract:
Rendering scenes with a high-quality human face from arbitrary viewpoints is a practical and useful technique for many real-world applications. Recently, Neural Radiance Fields (NeRF), a rendering technique that uses neural networks to approximate classical ray tracing, have been considered as one of the promising approaches for synthesizing novel views from a sparse set of images. We find that Ne…
▽ More
Rendering scenes with a high-quality human face from arbitrary viewpoints is a practical and useful technique for many real-world applications. Recently, Neural Radiance Fields (NeRF), a rendering technique that uses neural networks to approximate classical ray tracing, have been considered as one of the promising approaches for synthesizing novel views from a sparse set of images. We find that NeRF can render new views while maintaining geometric consistency, but it does not properly maintain skin details, such as moles and pores. These details are important particularly for faces because when we look at an image of a face, we are much more sensitive to details than when we look at other objects. On the other hand, 3D Morpable Models (3DMMs) based on traditional meshes and textures can perform well in terms of skin detail despite that it has less precise geometry and cannot cover the head and the entire scene with background. Based on these observations, we propose a method to use both NeRF and 3DMM to synthesize a high-fidelity novel view of a scene with a face. Our method learns a Generative Adversarial Network (GAN) to mix a NeRF-synthesized image and a 3DMM-rendered image and produces a photorealistic scene with a face preserving the skin details. Experiments with various real-world scenes demonstrate the effectiveness of our approach. The code will be available on https://github.com/showlab/headshot .
△ Less
Submitted 31 May, 2022;
originally announced May 2022.
-
Bayesian Inference on Hamiltonian Selections for Mössbauer Spectroscopy
Authors:
Ryota Moriguchi,
Satoshi Tsutsui,
Shun Katakami,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Mössbauer spectroscopy, which provides knowledge related to electronic states in materials, has been applied to various fields such as condensed matter physics and material sciences. In conventional spectral analyses based on least-square fitting, hyperfine interactions in materials have been determined from the shape of observed spectra. In conventional spectral analyses, it is difficult to discu…
▽ More
Mössbauer spectroscopy, which provides knowledge related to electronic states in materials, has been applied to various fields such as condensed matter physics and material sciences. In conventional spectral analyses based on least-square fitting, hyperfine interactions in materials have been determined from the shape of observed spectra. In conventional spectral analyses, it is difficult to discuss the validity of the hyperfine interactions and the estimated values. We propose a spectral analysis method based on Bayesian inference for the selection of hyperfine interactions and the estimation of Mössbauer parameters. An appropriate Hamiltonian has been selected by comparing Bayesian free energy among possible Hamiltonians. We have estimated the Mössbauer parameters and evaluated their estimated values by calculating the posterior distribution of each Mössbauer parameter with confidence intervals. We have also discussed the accuracy of the spectral analyses to elucidate the noise intensity dependence of numerical experiments.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Reinforcing Generated Images via Meta-learning for One-Shot Fine-Grained Visual Recognition
Authors:
Satoshi Tsutsui,
Yanwei Fu,
David Crandall
Abstract:
One-shot fine-grained visual recognition often suffers from the problem of having few training examples for new fine-grained classes. To alleviate this problem, off-the-shelf image generation techniques based on Generative Adversarial Networks (GANs) can potentially create additional training images. However, these GAN-generated images are often not helpful for actually improving the accuracy of o…
▽ More
One-shot fine-grained visual recognition often suffers from the problem of having few training examples for new fine-grained classes. To alleviate this problem, off-the-shelf image generation techniques based on Generative Adversarial Networks (GANs) can potentially create additional training images. However, these GAN-generated images are often not helpful for actually improving the accuracy of one-shot fine-grained recognition. In this paper, we propose a meta-learning framework to combine generated images with original images, so that the resulting "hybrid" training images improve one-shot learning. Specifically, the generic image generator is updated by a few training instances of novel classes, and a Meta Image Reinforcing Network (MetaIRNet) is proposed to conduct one-shot fine-grained recognition as well as image reinforcement. Our experiments demonstrate consistent improvement over baselines on one-shot fine-grained image classification benchmarks. Furthermore, our analysis shows that the reinforced images have more diversity compared to the original and GAN-generated images.
△ Less
Submitted 22 April, 2022;
originally announced April 2022.
-
LO-mode phonon of KCl and NaCl at 300 K by inelastic X ray scattering measurements and first principles calculations
Authors:
Atsushi Togo,
Hiroyuki Hayashi,
Terumasa Tadano,
Satoshi Tsutsui,
Isao Tanaka
Abstract:
Longitudinal-optical (LO) mode phonon branches of KCl and NaCl were measured using inelastic X-ray scattering (IXS) at 300 K and calculated by the first-principles phonon calculation with the stochastic self-consistent harmonic approximation. Spectral shapes of the IXS measurements and calculated spectral functions agreed well. We analyzed the calculated spectral functions that provide higher reso…
▽ More
Longitudinal-optical (LO) mode phonon branches of KCl and NaCl were measured using inelastic X-ray scattering (IXS) at 300 K and calculated by the first-principles phonon calculation with the stochastic self-consistent harmonic approximation. Spectral shapes of the IXS measurements and calculated spectral functions agreed well. We analyzed the calculated spectral functions that provide higher resolutions of the spectra than the IXS measurements. Due to strong anharmonicity, the spectral functions of these phonon branches have several peaks and the LO modes along $Γ$--L paths are disconnected.
△ Less
Submitted 22 April, 2022; v1 submitted 15 April, 2022;
originally announced April 2022.
-
Bulk charge density wave and electron-phonon coupling in superconducting copper oxychlorides
Authors:
Laura Chaix,
Blair W. Lebert,
Hu Miao,
Alessandro Nicolaou,
Flora Yakhou,
H Cercellier,
Stéphane Grenier,
N Brookes,
A Sulpice,
S Tsutsui,
A Bossak,
L Paolasini,
D Santos-Cottin,
H Yamamoto,
I Yamada,
M Azuma,
T Nishikubo,
T Yamamoto,
M Katsumata,
M Dean,
Matteo d'Astuto
Abstract:
Bulk charge density waves are now reported in nearly all high-temperature superconducting cuprates, with the noticeable exception of one particular family: the copper oxychlorides. Here, we used resonant inelastic X-ray scattering to reveal a bulk charge density waves in these materials. Combining resonant inelastic X-ray scattering with non-resonant inelastic X-ray scattering, we investigate the…
▽ More
Bulk charge density waves are now reported in nearly all high-temperature superconducting cuprates, with the noticeable exception of one particular family: the copper oxychlorides. Here, we used resonant inelastic X-ray scattering to reveal a bulk charge density waves in these materials. Combining resonant inelastic X-ray scattering with non-resonant inelastic X-ray scattering, we investigate the interplay between the lattice excitations and the charge density wave, and evidence the phonon anomalies of the Cu-O bond-stretching mode at the charge density wave wave-vector. We propose that such electron-phonon anomalies occur in the presence of dispersive charge excitations emanating from the charge density wave and interacting with the Cu-O bond-stretching phonon. Our results pave the way for future studies, combining both bulk and surface probes, to investigate the static and dynamical properties of the charge density wave in the copper oxychloride family.
△ Less
Submitted 27 April, 2022; v1 submitted 29 March, 2022;
originally announced March 2022.
-
Flavor number dependence of QCD at finite density by the complex Langevin method
Authors:
Yusuke Namekawa,
Yuhma Asano,
Yuta Ito,
Takashi Kaneko,
Hideo Matsufuru,
Jun Nishimura,
Asato Tsuchiya,
Shoichiro Tsutsui,
Takeru Yokota
Abstract:
We discuss the flavor number dependence of QCD at low temperature and high density by the complex Langevin method. In our previous work, the complex Langevin method is confirmed to satisfy the criterion for correct convergence in certain regions, such as $μ_{\rm q} / T = 5.2-7.2$ on $8^3 \times 16$ and $μ_{\rm q} / T = 1.6-9.6$ on $16^3 \times 32$ using $N_{\rm f} = 4$ staggered fermion at…
▽ More
We discuss the flavor number dependence of QCD at low temperature and high density by the complex Langevin method. In our previous work, the complex Langevin method is confirmed to satisfy the criterion for correct convergence in certain regions, such as $μ_{\rm q} / T = 5.2-7.2$ on $8^3 \times 16$ and $μ_{\rm q} / T = 1.6-9.6$ on $16^3 \times 32$ using $N_{\rm f} = 4$ staggered fermion at $β= 5.7$. We extend this study to more realistic flavor cases, $N_{\rm f} = 2, 2 + 1, 3$, using Wilson fermions. We present the flavor number dependence of the validity regions of the complex Langevin method and the quark number.
△ Less
Submitted 30 November, 2021;
originally announced December 2021.
-
Color superconductivity in a small box: a complex Langevin study
Authors:
Shoichiro Tsutsui,
Yuhma Asano,
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Asato Tsuchiya,
Takeru Yokota
Abstract:
It is expected that the color superconductivity (CSC) phase appears in QCD at low temperature and high density. On the basis of the lattice perturbation theory, a possible parameter region in which the CSC occurs has been predicted. In this work, we perform complex Langevin simulation on an $8^3\times 128$ lattice using four-flavor staggered fermions. We find, in particular, that the quark number…
▽ More
It is expected that the color superconductivity (CSC) phase appears in QCD at low temperature and high density. On the basis of the lattice perturbation theory, a possible parameter region in which the CSC occurs has been predicted. In this work, we perform complex Langevin simulation on an $8^3\times 128$ lattice using four-flavor staggered fermions. We find, in particular, that the quark number has plateaux with respect to the chemical potential similar to our previous study, indicating the formation of the Fermi sphere. A diquark-antidiquark operator, which is an order parameter of color superconductivity, is formulated on the lattice using the U(1) noise. Our result for this operator is found to fluctuate violently when the Fermi surface coincides with the energy levels of quarks. We also discuss partial restoration of the chiral symmetry at high density.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
Perturbative predictions for color superconductivity on the lattice
Authors:
Takeru Yokota,
Yuhma Asano,
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Asato Tsuchiya,
Shoichiro Tsutsui
Abstract:
We develop a new method to investigate color superconductivity (CSC) on the lattice based on the Thouless criterion, which amounts to solving the linearized gap equation without imposing any ansatz on the structure of the Cooper pairs. We perform explicit calculations at the one-loop level with the staggered fermions on a $8^3 \times 128$ lattice and the Wilson fermions on a $4^3 \times 128$ latti…
▽ More
We develop a new method to investigate color superconductivity (CSC) on the lattice based on the Thouless criterion, which amounts to solving the linearized gap equation without imposing any ansatz on the structure of the Cooper pairs. We perform explicit calculations at the one-loop level with the staggered fermions on a $8^3 \times 128$ lattice and the Wilson fermions on a $4^3 \times 128$ lattice, which enables us to obtain the critical $β(=6/g^2)$ as a function of the quark chemical potential $μ$, below which the CSC phase is expected to appear. The obtained critical $β$ has sharp peaks at the values of $μ$ corresponding to the discretized energy levels of quarks similarly to what was observed in previous studies on simplified effective models. From the solution to the linearized gap equation, one can read off the flavor and spatial structures of the Cooper pairs at the critical $β$. In the case of massless staggered fermion, in particular, we find that the chiral $\mathrm{U}(1)$ symmetry of the staggered fermions is spontaneously broken by the condensation of the Cooper pairs.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
AVA-AVD: Audio-Visual Speaker Diarization in the Wild
Authors:
Eric Zhongcong Xu,
Zeyang Song,
Satoshi Tsutsui,
Chao Feng,
Mang Ye,
Mike Zheng Shou
Abstract:
Audio-visual speaker diarization aims at detecting "who spoke when" using both auditory and visual signals. Existing audio-visual diarization datasets are mainly focused on indoor environments like meeting rooms or news studios, which are quite different from in-the-wild videos in many scenarios such as movies, documentaries, and audience sitcoms. To develop diarization methods for these challengi…
▽ More
Audio-visual speaker diarization aims at detecting "who spoke when" using both auditory and visual signals. Existing audio-visual diarization datasets are mainly focused on indoor environments like meeting rooms or news studios, which are quite different from in-the-wild videos in many scenarios such as movies, documentaries, and audience sitcoms. To develop diarization methods for these challenging videos, we create the AVA Audio-Visual Diarization (AVA-AVD) dataset. Our experiments demonstrate that adding AVA-AVD into training set can produce significantly better diarization models for in-the-wild videos despite that the data is relatively small. Moreover, this benchmark is challenging due to the diverse scenes, complicated acoustic conditions, and completely off-screen speakers. As a first step towards addressing the challenges, we design the Audio-Visual Relation Network (AVR-Net) which introduces a simple yet effective modality mask to capture discriminative information based on face visibility. Experiments show that our method not only can outperform state-of-the-art methods but is more robust as varying the ratio of off-screen speakers. Our data and code has been made publicly available at https://github.com/showlab/AVA-AVD.
△ Less
Submitted 16 July, 2022; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Experimental Observation of Mesoscopic Fluctuations to Identify Origin of Thermodynamic Anomalies of Ambient Liquid Water
Authors:
Yukio Kajihara,
Masanori Inui,
Kazuhiro Matsuda,
Daisuke Ishikawa,
Satoshi Tsutsui,
Alfred Q. R. Baron
Abstract:
We report a new experimental approach for observing mesoscopic fluctuations underlying the thermodynamic anomalies of ambient liquid water. In this approach, two sound velocity measurements with different frequencies, namely inelastic X-ray scattering (IXS) in THz band and ultrasonic (US) in MHz band, are required to investigate the relaxation phenomenon with the characteristic frequency between t…
▽ More
We report a new experimental approach for observing mesoscopic fluctuations underlying the thermodynamic anomalies of ambient liquid water. In this approach, two sound velocity measurements with different frequencies, namely inelastic X-ray scattering (IXS) in THz band and ultrasonic (US) in MHz band, are required to investigate the relaxation phenomenon with the characteristic frequency between the two aforementioned frequencies. We performed IXS measurements to obtain the IXS sound velocity of liquid water from the ambient conditions to the supercritical region of liquid-gas phase transition (LGT) and compared the results with the US sound velocity in the literature. We found that the ratio of the two sound velocities, Sf, which corresponds to the relaxation intensity, exhibits a simple but significant change. Two distinct rises were observed in the high-temperature and low-temperature regions, implying that two relaxation phenomena exist: in the high-temperature region, a peak was observed near the LGT critical ridge line, which was linked with changes in the density fluctuation and isochoric and isobaric specific heat capacities; in the low-temperature region, Sf increased toward the low-temperature region, which was linked with the change in the isochoric heat capacity. We concluded that these two relaxation phenomena are originated from critical fluctuations of liquid-gas phase transition (LGT) and liquid-liquid phase transition, respectively. The linkage between Sf and isochoric heat capacity in the low-temperature region proves that the relaxation is the cause of the well-known heat capacity anomaly of ambient liquid water. In this study, both LGT and LLT critical fluctuations were observed, and the relationship between thermodynamics and the critical fluctuations was comprehensively discussed.
△ Less
Submitted 10 January, 2023; v1 submitted 12 November, 2021;
originally announced November 2021.
-
How You Move Your Head Tells What You Do: Self-supervised Video Representation Learning with Egocentric Cameras and IMU Sensors
Authors:
Satoshi Tsutsui,
Ruta Desai,
Karl Ridgeway
Abstract:
Understanding users' activities from head-mounted cameras is a fundamental task for Augmented and Virtual Reality (AR/VR) applications. A typical approach is to train a classifier in a supervised manner using data labeled by humans. This approach has limitations due to the expensive annotation cost and the closed coverage of activity labels. A potential way to address these limitations is to use s…
▽ More
Understanding users' activities from head-mounted cameras is a fundamental task for Augmented and Virtual Reality (AR/VR) applications. A typical approach is to train a classifier in a supervised manner using data labeled by humans. This approach has limitations due to the expensive annotation cost and the closed coverage of activity labels. A potential way to address these limitations is to use self-supervised learning (SSL). Instead of relying on human annotations, SSL leverages intrinsic properties of data to learn representations. We are particularly interested in learning egocentric video representations benefiting from the head-motion generated by users' daily activities, which can be easily obtained from IMU sensors embedded in AR/VR devices. Towards this goal, we propose a simple but effective approach to learn video representation by learning to tell the corresponding pairs of video clip and head-motion. We demonstrate the effectiveness of our learned representation for recognizing egocentric activities of people and dogs.
△ Less
Submitted 4 October, 2021;
originally announced October 2021.
-
Three-body crossover from a Cooper triple to bound trimer state in three-component Fermi gases near a triatomic resonance
Authors:
Hiroyuki Tajima,
Shoichiro Tsutsui,
Takahiro M. Doi,
Kei Iida
Abstract:
We theoretically investigate ground-state properties of a three-component Fermi gas with pairwise contact interactions between different components near a triatomic resonance where bound trimers are about to appear. Using variational equations for in-medium two- and three-body cluster states in three dimensions, we elucidate the competition of pair and triple formations due to the Fermi surface ef…
▽ More
We theoretically investigate ground-state properties of a three-component Fermi gas with pairwise contact interactions between different components near a triatomic resonance where bound trimers are about to appear. Using variational equations for in-medium two- and three-body cluster states in three dimensions, we elucidate the competition of pair and triple formations due to the Fermi surface effects. We present the ground-state phase diagram that exhibits transition from a Cooper pair to Cooper triple state and crossover from a Cooper triple to tightly bound trimer state at negative scattering lengths. This three-body crossover is analogous to the Bardeen-Cooper-Schrieffer to Bose-Einstein condensation crossover observed in a two-component Fermi gas. We predict that the threshold scattering length $a_{-}$ for three-body states can be shifted towards the weak-coupling side due to the emergence of Cooper triples.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
Unitary $p$-wave Fermi gas in one dimension
Authors:
Hiroyuki Tajima,
Shoichiro Tsutsui,
Takahiro M. Doi,
Kei Iida
Abstract:
We elucidate universal many-body properties of a one-dimensional, two-component ultracold Fermi gas near the $p$-wave Feshbach resonance. The low-energy scattering in this system can be characterized by two parameters, that is, $p$-wave scattering length and effective range. At the unitarity limit where the $p$-wave scattering length diverges and the effective range is reduced to zero without conf…
▽ More
We elucidate universal many-body properties of a one-dimensional, two-component ultracold Fermi gas near the $p$-wave Feshbach resonance. The low-energy scattering in this system can be characterized by two parameters, that is, $p$-wave scattering length and effective range. At the unitarity limit where the $p$-wave scattering length diverges and the effective range is reduced to zero without conflicting with the causality bound, the system obeys universal thermodynamics as observed in a unitary Fermi gas with contact $s$-wave interaction in three dimensions. It is in contrast to a Fermi gas with the $p$-wave resonance in three dimensions in which the effective range is inevitably finite. We present the universal equation of state in this unitary $p$-wave Fermi gas within the many-body $T$-matrix approach as well as the virial expansion method. Moreover, we examine the single-particle spectral function in the high-density regime where the virial expansion is no longer valid. On the basis of the Hartree-like self-energy shift at the divergent scattering length, we conjecture that the equivalence of the Bertsch parameter across spatial dimensions holds even for a one-dimensional unitary $p$-wave Fermi gas.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
Quantum hydrodynamics from local thermal pure states
Authors:
Shoichiro Tsutsui,
Masaru Hongo,
Shintaro Sato,
Takahiro Sagawa
Abstract:
We provide a pure state formulation for hydrodynamic dynamics of isolated quantum many-body systems. A pure state describing quantum systems in local thermal equilibrium is constructed, which we call a local thermal pure quantum ($\ell$TPQ) state. We show that the thermodynamic functional and the expectation values of local operators (including a real-time correlation function) calculated from the…
▽ More
We provide a pure state formulation for hydrodynamic dynamics of isolated quantum many-body systems. A pure state describing quantum systems in local thermal equilibrium is constructed, which we call a local thermal pure quantum ($\ell$TPQ) state. We show that the thermodynamic functional and the expectation values of local operators (including a real-time correlation function) calculated from the $\ell$TPQ state converge to those from a local Gibbs ensemble in the large fluid-cell limit. As a numerical demonstration, we investigate a one-dimensional spin chain and observe the hydrodynamic relaxation obeying the Fourier's law. We further prove the second law of thermodynamics and the quantum fluctuation theorem, which are also validated numerically. The $\ell$TPQ formulation gives a useful theoretical basis to describe the emergent hydrodynamic behavior of quantum many-body systems furnished with a numerical efficiency, being applicable to both the non-relativistic and relativistic regimes.
△ Less
Submitted 5 July, 2021; v1 submitted 24 June, 2021;
originally announced June 2021.
-
Reverse-engineer the Distributional Structure of Infant Egocentric Views for Training Generalizable Image Classifiers
Authors:
Satoshi Tsutsui,
David Crandall,
Chen Yu
Abstract:
We analyze egocentric views of attended objects from infants. This paper shows 1) empirical evidence that children's egocentric views have more diverse distributions compared to adults' views, 2) we can computationally simulate the infants' distribution, and 3) the distribution is beneficial for training more generalized image classifiers not only for infant egocentric vision but for third-person…
▽ More
We analyze egocentric views of attended objects from infants. This paper shows 1) empirical evidence that children's egocentric views have more diverse distributions compared to adults' views, 2) we can computationally simulate the infants' distribution, and 3) the distribution is beneficial for training more generalized image classifiers not only for infant egocentric vision but for third-person computer vision.
△ Less
Submitted 12 June, 2021;
originally announced June 2021.
-
Complex Langevin study for polarons in an attractively interacting one-dimensional two-component Fermi gas
Authors:
Takahiro M. Doi,
Hiroyuki Tajima,
Shoichiro Tsutsui
Abstract:
We investigate a polaronic excitation in a one-dimensional spin-1/2 Fermi gas with contact attractive interactions, using the complex Langevin method, which is a promising approach to evade a possible sign problem in quantum Monte Carlo simulations. We found that the complex Langevin method works correctly in a wide range of temperature, interaction strength, and population imbalance. The Fermi po…
▽ More
We investigate a polaronic excitation in a one-dimensional spin-1/2 Fermi gas with contact attractive interactions, using the complex Langevin method, which is a promising approach to evade a possible sign problem in quantum Monte Carlo simulations. We found that the complex Langevin method works correctly in a wide range of temperature, interaction strength, and population imbalance. The Fermi polaron energy extracted from the two-point imaginary Green's function is not sensitive to the temperature and the impurity concentration in the parameter region we considered. Our results show a good agreement with the solution of the thermodynamic Bethe ansatz at zero temperature.
△ Less
Submitted 23 May, 2021;
originally announced May 2021.
-
Isotropic parallel antiferromagnetism in the magnetic-field-induced charge-ordered state of SmRu$_4$P$_{12}$ caused by $p$-$f$ hybridization
Authors:
T. Matsumura,
S. Michimura,
T. Inami,
C. H. Lee,
M. Matsuda,
H. Nakao,
M. Mizumaki,
N. Kawamura,
M. Tsukagoshi,
S. Tsutsui,
H. Sugawara,
K. Fushiya,
T. D. Matsuda,
R. Higashinaka,
Y. Aoki
Abstract:
Nature of the field-induced charge ordered phase (phase II) of SmRu$_4$P$_{12}$ has been investigated by resonant x-ray diffraction (RXD) and polarized neutron diffraction (PND), focusing on the relationship between the atomic displacements and the antiferromagnetic (AFM) moments of Sm. From the analysis of the interference between the non-resonant Thomson scattering and the resonant magnetic scat…
▽ More
Nature of the field-induced charge ordered phase (phase II) of SmRu$_4$P$_{12}$ has been investigated by resonant x-ray diffraction (RXD) and polarized neutron diffraction (PND), focusing on the relationship between the atomic displacements and the antiferromagnetic (AFM) moments of Sm. From the analysis of the interference between the non-resonant Thomson scattering and the resonant magnetic scattering, combined with the spectral function obtained from x-ray magnetic circular dichroism, it is shown that the AFM moment of Sm prefers to be parallel to the field ($m_{\text{AF}} \parallel H$), giving rise to large and small moment sites around which the P$_{12}$ and Ru cage contract and expand, respectively. This is associated with the formation of the staggered ordering of the $Γ_7$-like and $Γ_8$-like crystal-field states, providing a strong piece of evidence for the charge order. PND was also performed to obtain complementary and unambiguous conclusion. In addition, isotropic and continuous nature of the phase II is demonstrated by the field-direction invariance of the interference spectrum in RXD. Crucial role of the $p$-$f$ hybridization is shown by resonant soft x-ray diffraction at the P $K$-edge ($1s\leftrightarrow 3p$), where we detected a resonance due to the spin polarized $3p$ orbitals reflecting the AFM order of Sm.
△ Less
Submitted 29 December, 2020;
originally announced December 2020.
-
Cooper Triples in Attractive Three-Component Fermions: Implication for Hadron-Quark Crossover
Authors:
Hiroyuki Tajima,
Shoichiro Tsutsui,
Takahiro M. Doi,
Kei Iida
Abstract:
We investigate many-body properties of equally populated three-component fermions with attractive three-body contact interaction in one dimension. A diagrammatic approach suggests the possible occurrence of Cooper triples at low temperature, which are three-body counterparts of Cooper pairs with a two-body attraction. We develop a minimal framework that bridges the crossover from tightly-bound tri…
▽ More
We investigate many-body properties of equally populated three-component fermions with attractive three-body contact interaction in one dimension. A diagrammatic approach suggests the possible occurrence of Cooper triples at low temperature, which are three-body counterparts of Cooper pairs with a two-body attraction. We develop a minimal framework that bridges the crossover from tightly-bound trimers to Cooper triples with increasing chemical potential and show how the formation of Cooper triples occurs in the grand-canonical phase diagram. Moreover, we argue that this non-trivial crossover is similar to the hadron-quark crossover proposed in dense matter. A coexistence of medium-induced triples and the underlying Fermi sea at positive chemical potential is analogous to quarkyonic matter consisting of baryonic excitations and the underlying quark Fermi sea. The comparison with the existing quantum Monte Carlo results implies that the emergence of these kinds of three-body states can be a microscopic origin of the peak of the sound velocity along the crossover.
△ Less
Submitted 8 October, 2021; v1 submitted 7 December, 2020;
originally announced December 2020.
-
Whose hand is this? Person Identification from Egocentric Hand Gestures
Authors:
Satoshi Tsutsui,
Yanwei Fu,
David Crandall
Abstract:
Recognizing people by faces and other biometrics has been extensively studied in computer vision. But these techniques do not work for identifying the wearer of an egocentric (first-person) camera because that person rarely (if ever) appears in their own first-person view. But while one's own face is not frequently visible, their hands are: in fact, hands are among the most common objects in one's…
▽ More
Recognizing people by faces and other biometrics has been extensively studied in computer vision. But these techniques do not work for identifying the wearer of an egocentric (first-person) camera because that person rarely (if ever) appears in their own first-person view. But while one's own face is not frequently visible, their hands are: in fact, hands are among the most common objects in one's own field of view. It is thus natural to ask whether the appearance and motion patterns of people's hands are distinctive enough to recognize them. In this paper, we systematically study the possibility of Egocentric Hand Identification (EHI) with unconstrained egocentric hand gestures. We explore several different visual cues, including color, shape, skin texture, and depth maps to identify users' hands. Extensive ablation experiments are conducted to analyze the properties of hands that are most distinctive. Finally, we show that EHI can improve generalization of other tasks, such as gesture recognition, by training adversarially to encourage these models to ignore differences between users.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
Complex Langevin calculations in QCD at finite density
Authors:
Yuta Ito,
Hideo Matsufuru,
Yusuke Namekawa,
Jun Nishimura,
Shinji Shimasaki,
Asato Tsuchiya,
Shoichiro Tsutsui
Abstract:
We demonstrate that the complex Langevin method (CLM) enables calculations in QCD at finite density in a parameter regime in which conventional methods, such as the density of states method and the Taylor expansion method, are not applicable due to the severe sign problem. Here we use the plaquette gauge action with $β= 5.7$ and four-flavor staggered fermions with degenerate quark mass…
▽ More
We demonstrate that the complex Langevin method (CLM) enables calculations in QCD at finite density in a parameter regime in which conventional methods, such as the density of states method and the Taylor expansion method, are not applicable due to the severe sign problem. Here we use the plaquette gauge action with $β= 5.7$ and four-flavor staggered fermions with degenerate quark mass $m a = 0.01$ and nonzero quark chemical potential $μ$. We confirm that a sufficient condition for correct convergence is satisfied for $μ/T = 5.2 - 7.2$ on a $8^3 \times 16$ lattice and $μ/T = 1.6 - 9.6$ on a $16^3 \times 32$ lattice. In particular, the expectation value of the quark number is found to have a plateau with respect to $μ$ with the height of 24 for both lattices. This plateau can be understood from the Fermi distribution of quarks, and its height coincides with the degrees of freedom of a single quark with zero momentum, which is 3 (color) $\times$ 4 (flavor) $\times$ 2 (spin) $=24$. Our results may be viewed as the first step towards the formation of the Fermi sphere, which plays a crucial role in color superconductivity conjectured from effective theories.
△ Less
Submitted 5 November, 2020; v1 submitted 17 July, 2020;
originally announced July 2020.
-
A Computational Model of Early Word Learning from the Infant's Point of View
Authors:
Satoshi Tsutsui,
Arjun Chandrasekaran,
Md Alimoor Reza,
David Crandall,
Chen Yu
Abstract:
Human infants have the remarkable ability to learn the associations between object names and visual objects from inherently ambiguous experiences. Researchers in cognitive science and developmental psychology have built formal models that implement in-principle learning algorithms, and then used pre-selected and pre-cleaned datasets to test the abilities of the models to find statistical regularit…
▽ More
Human infants have the remarkable ability to learn the associations between object names and visual objects from inherently ambiguous experiences. Researchers in cognitive science and developmental psychology have built formal models that implement in-principle learning algorithms, and then used pre-selected and pre-cleaned datasets to test the abilities of the models to find statistical regularities in the input data. In contrast to previous modeling approaches, the present study used egocentric video and gaze data collected from infant learners during natural toy play with their parents. This allowed us to capture the learning environment from the perspective of the learner's own point of view. We then used a Convolutional Neural Network (CNN) model to process sensory data from the infant's point of view and learn name-object associations from scratch. As the first model that takes raw egocentric video to simulate infant word learning, the present study provides a proof of principle that the problem of early word learning can be solved, using actual visual data perceived by infant learners. Moreover, we conducted simulation experiments to systematically determine how visual, perceptual, and attentional properties of infants' sensory experiences may affect word learning.
△ Less
Submitted 4 June, 2020;
originally announced June 2020.
-
Low-Dimensional Fluctuations and Pseudogap in Gaudin-Yang Fermi Gases
Authors:
Hiroyuki Tajima,
Shoichiro Tsutsui,
Takahiro M. Doi
Abstract:
Pseudogap is a ubiquitous phenomenon in strongly correlated systems such as high-$T_{\rm c}$ superconductors, ultracold atoms and nuclear physics. While pairing fluctuations inducing the pseudogap are known to be enhanced in low-dimensional systems, such effects have not been explored well in one of the most fundamental 1D models, that is, Gaudin-Yang model. In this work, we show that the pseudoga…
▽ More
Pseudogap is a ubiquitous phenomenon in strongly correlated systems such as high-$T_{\rm c}$ superconductors, ultracold atoms and nuclear physics. While pairing fluctuations inducing the pseudogap are known to be enhanced in low-dimensional systems, such effects have not been explored well in one of the most fundamental 1D models, that is, Gaudin-Yang model. In this work, we show that the pseudogap effect can be visible in the single-particle excitation in this system using a diagrammatic approach. Fermionic single-particle spectra exhibit a unique crossover from the double-particle dispersion to pseudogap state with increasing the attractive interaction and the number density at finite temperature. Surprisingly, our results of thermodynamic quantities in unpolarized and polarized gases show an excellent agreement with the recent quantum Monte Carlo and complex Langevin results, even in the region where the pseudogap appears.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Exploring the QCD phase diagram at finite density by the complex Langevin method on a $16^3\times 32$ lattice
Authors:
Shoichiro Tsutsui,
Yuta Ito,
Hideo Matsufuru,
Jun Nishimura,
Shinji Shimasaki,
Asato Tsuchiya
Abstract:
We explore the QCD phase diagram at finite density with four-flavor staggered fermions using the complex Langevin method, which is a promising approach to overcome the sign problem. In our previous work on an $8^3 \times 16$ lattice at $β= 5.7$ with the quark mass $m = 0.01$, we have found that the baryon number density has a clear plateau as a function of the chemical potential. In this study, we…
▽ More
We explore the QCD phase diagram at finite density with four-flavor staggered fermions using the complex Langevin method, which is a promising approach to overcome the sign problem. In our previous work on an $8^3 \times 16$ lattice at $β= 5.7$ with the quark mass $m = 0.01$, we have found that the baryon number density has a clear plateau as a function of the chemical potential. In this study, we use a $16^3 \times 32$ lattice to reduce finite volume effects and find that the plateau structure survives. Moreover, the number of quarks in the plateau region turns out to be 24, which is exactly the same as the one obtained previously on the $8^3 \times 16$ lattice. We provide a simple interpretation of this number, which suggests that the Fermi sphere is starting to form.
△ Less
Submitted 1 December, 2019;
originally announced December 2019.
-
Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition
Authors:
Satoshi Tsutsui,
Yanwei Fu,
David Crandall
Abstract:
One-shot fine-grained visual recognition often suffers from the problem of training data scarcity for new fine-grained classes. To alleviate this problem, an off-the-shelf image generator can be applied to synthesize additional training images, but these synthesized images are often not helpful for actually improving the accuracy of one-shot fine-grained recognition. This paper proposes a meta-lea…
▽ More
One-shot fine-grained visual recognition often suffers from the problem of training data scarcity for new fine-grained classes. To alleviate this problem, an off-the-shelf image generator can be applied to synthesize additional training images, but these synthesized images are often not helpful for actually improving the accuracy of one-shot fine-grained recognition. This paper proposes a meta-learning framework to combine generated images with original images, so that the resulting ``hybrid'' training images can improve one-shot learning. Specifically, the generic image generator is updated by a few training instances of novel classes, and a Meta Image Reinforcing Network (MetaIRNet) is proposed to conduct one-shot fine-grained recognition as well as image reinforcement. The model is trained in an end-to-end manner, and our experiments demonstrate consistent improvement over baselines on one-shot fine-grained image classification benchmarks.
△ Less
Submitted 17 November, 2019;
originally announced November 2019.
-
Active Object Manipulation Facilitates Visual Object Learning: An Egocentric Vision Study
Authors:
Satoshi Tsutsui,
Dian Zhi,
Md Alimoor Reza,
David Crandall,
Chen Yu
Abstract:
Inspired by the remarkable ability of the infant visual learning system, a recent study collected first-person images from children to analyze the `training data' that they receive. We conduct a follow-up study that investigates two additional directions. First, given that infants can quickly learn to recognize a new object without much supervision (i.e. few-shot learning), we limit the number of…
▽ More
Inspired by the remarkable ability of the infant visual learning system, a recent study collected first-person images from children to analyze the `training data' that they receive. We conduct a follow-up study that investigates two additional directions. First, given that infants can quickly learn to recognize a new object without much supervision (i.e. few-shot learning), we limit the number of training images. Second, we investigate how children control the supervision signals they receive during learning based on hand manipulation of objects. Our experimental results suggest that supervision with hand manipulation is better than without hands, and the trend is consistent even when a small number of images is available.
△ Less
Submitted 4 June, 2019;
originally announced June 2019.
-
Phonon anomalies with do** in superconducting oxychlorides Ca2-xCuO2Cl2
Authors:
Blair W. Lebert,
Hajime Yamamoto,
Masaki Azuma,
Rolf Heid,
Satoshi Tsutsui,
Hiroshi Uchiyama,
Alfred Q. R. Baron,
Benoît Baptiste,
Matteo d'Astuto
Abstract:
We measure the dispersion of the Cu-O bond-stretching phonon mode in the high-temperature superconducting parent compound Ca$_2$CuO$_2$Cl$_2$. Our density functional theory calculations predict a cosine-shaped bending of the dispersion along both the ($ξ$00) and ($ξξ$0) directions, while comparison with previous results on Ca$_{1.84}$CuO$_2$Cl$_2$ show it only along ($ξ$00), suggesting an anisotro…
▽ More
We measure the dispersion of the Cu-O bond-stretching phonon mode in the high-temperature superconducting parent compound Ca$_2$CuO$_2$Cl$_2$. Our density functional theory calculations predict a cosine-shaped bending of the dispersion along both the ($ξ$00) and ($ξξ$0) directions, while comparison with previous results on Ca$_{1.84}$CuO$_2$Cl$_2$ show it only along ($ξ$00), suggesting an anisotropic effect which is not reproduced in calculation at optimal do**. Comparison with isostructural La$_{2-x}$Sr$_x$CuO$_4$ suggests that these calculations reproduce well the overdoped regime, however they overestimate the do** effect on the Cu-O bond-stretching mode at optimal do**.
△ Less
Submitted 17 April, 2019;
originally announced April 2019.
-
Distribution of solutions of the fastest apparent convergence condition in optimized perturbation theory and its relation to anti-Stokes lines
Authors:
Shoichiro Tsutsui,
Takahiro M. Doi
Abstract:
We discuss fundamental properties of the fastest apparent convergence (FAC) condition which is used as a variational criterion in optimized perturbation theory (OPT). We examine an integral representation of the FAC condition and a distribution of the zeros of the integral in a complex artificial parameter space on the basis of theory of Lefschetz thimbles. We find that the zeros accumulate on a c…
▽ More
We discuss fundamental properties of the fastest apparent convergence (FAC) condition which is used as a variational criterion in optimized perturbation theory (OPT). We examine an integral representation of the FAC condition and a distribution of the zeros of the integral in a complex artificial parameter space on the basis of theory of Lefschetz thimbles. We find that the zeros accumulate on a certain line segment so-called anti-Stokes line in the limit $K \to \infty$, where $K$ is a truncation order of a perturbation series. This phenomenon gives an underlying mechanism that physical quantities calculated by OPT can be insensitive to the choice of the artificial parameter.
△ Less
Submitted 14 January, 2019;
originally announced January 2019.
-
Exploring the phase diagram of finite density QCD at low temperature by the complex Langevin method
Authors:
Yuta Ito,
Hideo Matsufuru,
Jun Nishimura,
Shinji Shimasaki,
Asato Tsuchiya,
Shoichiro Tsutsui
Abstract:
Monte Carlo studies of QCD at finite density suffer from the sign problem, which becomes easily uncontrollable as the chemical potential $μ$ is increased even for a moderate lattice size. In this work we make an attempt to approach the high density low temperature region by the complex Langevin method (CLM) using four-flavor staggered fermions with reasonably small quark mass on a $8^3 \times 16$…
▽ More
Monte Carlo studies of QCD at finite density suffer from the sign problem, which becomes easily uncontrollable as the chemical potential $μ$ is increased even for a moderate lattice size. In this work we make an attempt to approach the high density low temperature region by the complex Langevin method (CLM) using four-flavor staggered fermions with reasonably small quark mass on a $8^3 \times 16$ lattice. Unlike the previous work on a $4^3 \times 8$ lattice, the criterion for correct convergence is satisfied within a wide range of $μ$ without using the deformation technique. In particular, the baryon number density exhibits a plateau behavior consistent with the formation of eight baryons, and it starts to grow gradually at some $μ$.
△ Less
Submitted 5 December, 2018; v1 submitted 30 November, 2018;
originally announced November 2018.
-
Can the complex Langevin method see the deconfinement phase transition in QCD at finite density?
Authors:
Shoichiro Tsutsui,
Yuta Ito,
Hideo Matsufuru,
Jun Nishimura,
Shinji Shimasaki,
Asato Tsuchiya
Abstract:
Exploring the phase diagram of QCD at finite density is a challenging problem since first-principle calculations based on standard Monte Carlo methods suffer from the sign problem. As a promising approach to this issue, the complex Langevin method (CLM) has been pursued intensively.In this work, we investigate the applicability of the CLM in the vicinity of the deconfinement phase transition using…
▽ More
Exploring the phase diagram of QCD at finite density is a challenging problem since first-principle calculations based on standard Monte Carlo methods suffer from the sign problem. As a promising approach to this issue, the complex Langevin method (CLM) has been pursued intensively.In this work, we investigate the applicability of the CLM in the vicinity of the deconfinement phase transition using the four-flavor staggered fermions. In particular, we look for a signal of the expected first order phase transition within the validity region of the CLM.
△ Less
Submitted 19 November, 2018;
originally announced November 2018.
-
Evidence of a structural quantum critical point in (Ca$_{x}$Sr$_{1-x}$)$_3$Rh$_4$Sn$_{13}$ from a lattice dynamics study
Authors:
Y. W. Cheung,
Y. J. Hu,
M. Imai,
Y. Tanioku,
H. Kanagawa,
J. Murakawa,
K. Moriyama,
W. Zhang,
K. T. Lai,
K. Yoshimura,
F. M. Grosche,
K. Kaneko,
S. Tsutsui,
Swee K. Goh
Abstract:
Approaching a quantum critical point (QCP) has been an effective route to stabilize superconductivity. While the role of magnetic QCPs has been extensively discussed, similar exploration of a structural QCP is scarce due to the lack of suitable systems with a continuous structural transition that can be conveniently tuned to 0~K. Using inelastic X-ray scattering, we examine the phonon spectrum of…
▽ More
Approaching a quantum critical point (QCP) has been an effective route to stabilize superconductivity. While the role of magnetic QCPs has been extensively discussed, similar exploration of a structural QCP is scarce due to the lack of suitable systems with a continuous structural transition that can be conveniently tuned to 0~K. Using inelastic X-ray scattering, we examine the phonon spectrum of the nonmagnetic quasi-skutterudite (Ca$_{x}$Sr$_{1-x}$)$_3$Rh$_4$Sn$_{13}$, which represents a precious system to explore the interplay between structural instabilities and superconductivity by tuning the Ca concentration $x$. We unambiguously detect the softening of phonon modes around the M point on cooling towards the structural transition. Intriguingly, at $x=0.85$, the soft mode energy squared at the M point extrapolates to zero at $(-5.7 \pm 7.7)$~K, providing the first compelling microscopic evidence of a structural QCP in (Ca$_{x}$Sr$_{1-x}$)$_3$Rh$_4$Sn$_{13}$. The enhanced phonon density-of-states at low energy provides the essential ingredient for realizing strong-coupling superconductivity near the structural QCP.
△ Less
Submitted 4 October, 2018;
originally announced October 2018.
-
edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
Authors:
Zheng Gao,
Gang Fu,
Chun** Ouyang,
Satoshi Tsutsui,
Xiaozhong Liu,
Jeremy Yang,
Christopher Gessner,
Brian Foote,
David Wild,
Qi Yu,
Ying Ding
Abstract:
Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is extending this methodology for richly heterogeneous graphs and knowledge domains. The biomedical sciences are such a domai…
▽ More
Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is extending this methodology for richly heterogeneous graphs and knowledge domains. The biomedical sciences are such a domain, reflecting the complexity of biology, with entities such as genes, proteins, drugs, diseases, and phenotypes, and relationships such as gene co-expression, biochemical regulation, and biomolecular inhibition or activation. Therefore, the semantics of edges and nodes are critical for representation learning and knowledge discovery in real world biomedical problems. In this paper, we propose the edge2vec model, which represents graphs considering edge semantics. An edge-type transition matrix is trained by an Expectation-Maximization approach, and a stochastic gradient descent model is employed to learn node embedding on a heterogeneous graph via the trained transition matrix. edge2vec is validated on three biomedical domain tasks: biomedical entity classification, compound-gene bioactivity prediction, and biomedical information retrieval. Results show that by considering edge-types into node embedding learning in heterogeneous graphs, \textbf{edge2vec}\ significantly outperforms state-of-the-art models on all three tasks. We propose this method for its added value relative to existing graph analytical methodology, and in the real world context of biomedical knowledge discovery applicability.
△ Less
Submitted 27 May, 2019; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Combining Pyramid Pooling and Attention Mechanism for Pelvic MR Image Semantic Segmentaion
Authors:
Ting-Ting Liang,
Satoshi Tsutsui,
Liangcai Gao,
**g-**g Lu,
Mengyan Sun
Abstract:
One of the time-consuming routine work for a radiologist is to discern anatomical structures from tomographic images. For assisting radiologists, this paper develops an automatic segmentation method for pelvic magnetic resonance (MR) images. The task has three major challenges 1) A pelvic organ can have various sizes and shapes depending on the axial image, which requires local contexts to segment…
▽ More
One of the time-consuming routine work for a radiologist is to discern anatomical structures from tomographic images. For assisting radiologists, this paper develops an automatic segmentation method for pelvic magnetic resonance (MR) images. The task has three major challenges 1) A pelvic organ can have various sizes and shapes depending on the axial image, which requires local contexts to segment correctly. 2) Different organs often have quite similar appearance in MR images, which requires global context to segment. 3) The number of available annotated images are very small to use the latest segmentation algorithms. To address the challenges, we propose a novel convolutional neural network called Attention-Pyramid network (APNet) that effectively exploits both local and global contexts, in addition to a data-augmentation technique that is particularly effective for MR images. In order to evaluate our method, we construct fine-grained (50 pelvic organs) MR image segmentation dataset, and experimentally confirm the superior performance of our techniques over the state-of-the-art image segmentation methods.
△ Less
Submitted 28 June, 2018; v1 submitted 1 June, 2018;
originally announced June 2018.
-
Electron-Phonon Coupling Mode in Excitonic Insulator
Authors:
Akitoshi Nakano,
Takumi Hasegawa,
Shinya Tamura,
Naoyuki Katayama,
Satoshi Tsutsui,
Hiroshi Sawa
Abstract:
Ta2NiSe5 is considered a promising excitonic insulator (EI) candidate with slight phonon contributions, since it exhibits a tiny orthorhombic-to-monoclinic structural distortion at 328 K without any superlattice structure. Our synchrotron inelastic x-ray scattering measurements reveal strong electron-optical-phonon coupling occurring at temperatures higher than the transition temperature. Density…
▽ More
Ta2NiSe5 is considered a promising excitonic insulator (EI) candidate with slight phonon contributions, since it exhibits a tiny orthorhombic-to-monoclinic structural distortion at 328 K without any superlattice structure. Our synchrotron inelastic x-ray scattering measurements reveal strong electron-optical-phonon coupling occurring at temperatures higher than the transition temperature. Density functional theoretical calculations indicate that two coupled optical modes arise due to the vibration of Ta and Se ions. Further, the two modes are frozen such that Ta and Se approach each other, forming atomic-displacement-type electric dipoles in the monoclinic phase. The characteristic of electronic toroidal moment formation by the antiferro-arrangements of electric dipoles is the universality of EI between Ta2NiSe5 and 1T-TiSe2.
△ Less
Submitted 1 March, 2018;
originally announced March 2018.