-
Gravitational Waves in Metastable Supersymmetry Breaking
Authors:
Chong-Sun Chu,
Asuka Ito
Abstract:
If supersymmetry is broken in metastable vacua, it is not clear why we are now in there rather than supersymmetric vacua. Moreover, it is natural to expect that we were in supersymmetric vacua, which have higher symmetry than metastable vacua, in the early universe. In this paper, we reexamine and improve the previous analysis on the cosmological evolution of the vacuum structure in the ISS model…
▽ More
If supersymmetry is broken in metastable vacua, it is not clear why we are now in there rather than supersymmetric vacua. Moreover, it is natural to expect that we were in supersymmetric vacua, which have higher symmetry than metastable vacua, in the early universe. In this paper, we reexamine and improve the previous analysis on the cosmological evolution of the vacuum structure in the ISS model of metastable supersymmetry breaking by taking into account constraints on the reheating temperature, which is needed to avoid the overproduction of gravitinos. It turns out that the desired phase transition from a supersymmetric vacuum to a metastable vacuum is allowed only in the light gravitino mass region $m_{3/2} < 4.7$ eV. This is achieved by either rolling down potential or tunneling processes depending on the reheating temperature. We show that when the tunneling processes are realized, abundant gravitational waves could be produced from collisions of runaway bubbles. The resulting gravitational waves are detectable with the future gravitational wave interferometers like LISA and DECIGO.
△ Less
Submitted 5 July, 2022; v1 submitted 26 January, 2022;
originally announced January 2022.
-
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Authors:
Zhuoyuan Mao,
Chenhui Chu,
Sadao Kurohashi
Abstract:
In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. JASS focuses on masking and reordering Japanese linguistic units kn…
▽ More
In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. JASS focuses on masking and reordering Japanese linguistic units known as bunsetsu, whereas ENSS is proposed based on phrase structure masking and reordering tasks. Experiments on ASPEC Japanese--English & Japanese--Chinese, Wikipedia Japanese--Chinese, News English--Korean corpora demonstrate that JASS and ENSS outperform MASS and other existing language-agnostic pre-training methods by up to +2.9 BLEU points for the Japanese--English tasks, up to +7.0 BLEU points for the Japanese--Chinese tasks and up to +1.3 BLEU points for English--Korean tasks. Empirical analysis, which focuses on the relationship between individual parts in JASS and ENSS, reveals the complementary nature of the subtasks of JASS and ENSS. Adequacy evaluation using LASER, human evaluation, and case studies reveals that our proposed methods significantly outperform pre-training methods without injected linguistic knowledge and they have a larger positive impact on the adequacy as compared to the fluency. We release codes here: https://github.com/Mao-KU/JASS/tree/master/linguistically-driven-pretraining.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Authors:
Yihang Li,
Shuichiro Shimizu,
Weiqi Gu,
Chenhui Chu,
Sadao Kurohashi
Abstract:
Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations. We introduce VISA, a new dataset that consists of 40k Japanese-English parallel sentence pairs and corresponding video clips with the following key features: (1)…
▽ More
Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations. We introduce VISA, a new dataset that consists of 40k Japanese-English parallel sentence pairs and corresponding video clips with the following key features: (1) the parallel sentences are subtitles from movies and TV episodes; (2) the source subtitles are ambiguous, which means they have multiple possible translations with different meanings; (3) we divide the dataset into Polysemy and Omission according to the cause of ambiguity. We show that VISA is challenging for the latest MMT system, and we hope that the dataset can facilitate MMT research. The VISA dataset is available at: https://github.com/ku-nlp/VISA.
△ Less
Submitted 26 May, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Towards Trustworthy DeFi Oracles: Past,Present and Future
Authors:
Yinjie Zhao,
Xin Kang,
Tieyan Li,
Cheng-Kang Chu,
Haiguang Wang
Abstract:
With the rapid development of blockchain technology in recent years, all kinds of blockchain-based applications have emerged. Among them, the decentralized finance (DeFi) is one of the most successful applications, which is regarded as the future of finance. The great success of DeFi relies on the real-world data which is not directly available on the blockchain. Besides, due to the deterministic…
▽ More
With the rapid development of blockchain technology in recent years, all kinds of blockchain-based applications have emerged. Among them, the decentralized finance (DeFi) is one of the most successful applications, which is regarded as the future of finance. The great success of DeFi relies on the real-world data which is not directly available on the blockchain. Besides, due to the deterministic nature of blockchain,the blockchain cannot directly obtain in-deterministic data from the outside world (off-chain). Thus, oracles have appeared as a viable solution to feed off-chain data to blockchain applications. In this paper, we carryout a comprehensive study on oracles, especially on DeFi oracles. We first briefly introduce the application scenarios of DeFi oracles, and then we talk about the past of DeFi oracles by categorizing them into several types based on their design features. After that, we introduce five popular DeFi oracles currently in use(such as Chainlink and Band Protocol), with the focus on their system architecture, data validation process,and their incentive mechanisms. We compare these present DeFi oracles from their data trustworthiness,data source trustworthiness and their overall trust models. Finally, we propose a set of metrics for designing trustworthiness DeFi oracles, and propose a potential trust architecture and a few promising techniques for building trustworthiness oracles.
△ Less
Submitted 7 January, 2022;
originally announced January 2022.
-
Transferring Domain-Agnostic Knowledge in Video Question Answering
Authors:
Tianran Wu,
Noa Garcia,
Mayu Otani,
Chenhui Chu,
Yuta Nakashima,
Haruo Takemura
Abstract:
Video question answering (VideoQA) is designed to answer a given question based on a relevant video clip. The current available large-scale datasets have made it possible to formulate VideoQA as the joint understanding of visual and language information. However, this training procedure is costly and still less competent with human performance. In this paper, we investigate a transfer learning met…
▽ More
Video question answering (VideoQA) is designed to answer a given question based on a relevant video clip. The current available large-scale datasets have made it possible to formulate VideoQA as the joint understanding of visual and language information. However, this training procedure is costly and still less competent with human performance. In this paper, we investigate a transfer learning method by the introduction of domain-agnostic knowledge and domain-specific knowledge. First, we develop a novel transfer learning framework, which finetunes the pre-trained model by applying domain-agnostic knowledge as the medium. Second, we construct a new VideoQA dataset with 21,412 human-generated question-answer samples for comparable transfer of knowledge. Our experiments show that: (i) domain-agnostic knowledge is transferable and (ii) our proposed transfer learning framework can boost VideoQA performance effectively.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Conformal Boundary Condition and Massive Gravitons in AdS/BCFT
Authors:
Chong-Sun Chu,
Rong-Xin Miao
Abstract:
According to Witten [1], the conformal boundary condition of gravity, which specifies the conformal geometry of the boundary and the trace of the extrinsic curvature, is elliptic and leads to well-defined perturbation theory of gravity about any classical solution. The conformal boundary condition was previously considered in [2, 3] in the context of AdS/BCFT, wherein the equation of motion of the…
▽ More
According to Witten [1], the conformal boundary condition of gravity, which specifies the conformal geometry of the boundary and the trace of the extrinsic curvature, is elliptic and leads to well-defined perturbation theory of gravity about any classical solution. The conformal boundary condition was previously considered in [2, 3] in the context of AdS/BCFT, wherein the equation of motion of the end-of-the-world was derived and emphasized. In this paper, we investigate further other consequences of the conformal boundary condition in AdS/BCFT. We derive the boundary central charges of the holographic Weyl anomaly and show that they are exactly the same for conformal boundary condition and Dirichlet boundary condition. We analysis the metric perturbation with conformal boundary condition (CBC), Dirichlet boundary condition (DBC) and Neumann boundary condition (NBC) imposed on the end-of-the-world brane and show that they admit an interpretation as the fluctuation of the extrinsic curvature (case of CBC and DBC) and the induced metric (case of NBC) of Q respectively. In all cases, the fluctuation modes are massive, which are closely relevant to the massive island formation in the literature. Our results reveal that there are non-trivial gravitational dynamics from extrinsic curvatures on the conformal and Dirichlet branes, which may have interesting applications to the island. We also discuss, in passing, the localization of gravitons in brane world theory. We find that, contrary to NBC, the graviton for CBC/DBC is located on the brane with non-positive tension instead of non-negative tension.
△ Less
Submitted 19 January, 2022; v1 submitted 6 October, 2021;
originally announced October 2021.
-
MeetDot: Videoconferencing with Live Translation Captions
Authors:
Arkady Arkhangorodsky,
Christopher Chu,
Scot Fang,
Yiqi Huang,
Denglin Jiang,
Ajay Nagesh,
Boliang Zhang,
Kevin Knight
Abstract:
We present MeetDot, a videoconferencing system with live translation captions overlaid on screen. The system aims to facilitate conversation between people who speak different languages, thereby reducing communication barriers between multilingual participants. Currently, our system supports speech and captions in 4 languages and combines automatic speech recognition (ASR) and machine translation…
▽ More
We present MeetDot, a videoconferencing system with live translation captions overlaid on screen. The system aims to facilitate conversation between people who speak different languages, thereby reducing communication barriers between multilingual participants. Currently, our system supports speech and captions in 4 languages and combines automatic speech recognition (ASR) and machine translation (MT) in a cascade. We use the re-translation strategy to translate the streamed speech, resulting in caption flicker. Additionally, our system has very strict latency requirements to have acceptable call quality. We implement several features to enhance user experience and reduce their cognitive load, such as smooth scrolling captions and reducing caption flicker. The modular architecture allows us to integrate different ASR and MT services in our backend. Our system provides an integrated evaluation suite to optimize key intrinsic evaluation metrics such as accuracy, latency and erasure. Finally, we present an innovative cross-lingual word-guessing game as an extrinsic evaluation metric to measure end-to-end system performance. We plan to make our system open-source for research purposes.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
IBEX: An open and extensible method for high content multiplex imaging of diverse tissues
Authors:
Andrea J. Radtke,
Colin J. Chu,
Ziv Yaniv,
Li Yao,
James Marr,
Rebecca T. Beuschel,
Hiroshi Ichise,
Anita Gola,
Juraj Kabat,
Bradley Lowekamp,
Emily Speranza,
Joshua Croteau,
Nishant Thakur,
Danny Jonigk,
Jeremy Davis,
Jonathan M. Hernandez,
Ronald N. Germain
Abstract:
High content imaging is needed to catalogue the variety of cellular phenotypes and multi-cellular ecosystems present in metazoan tissues. We recently developed Iterative Bleaching Extends multi-pleXity (IBEX), an iterative immunolabeling and chemical bleaching method that enables multiplexed imaging (>65 parameters) in diverse tissues, including human organs relevant for international consortia ef…
▽ More
High content imaging is needed to catalogue the variety of cellular phenotypes and multi-cellular ecosystems present in metazoan tissues. We recently developed Iterative Bleaching Extends multi-ple** of diverse tissues in support of a Human Reference Atlas or other such applications.
△ Less
Submitted 23 July, 2021;
originally announced July 2021.
-
Energy-Efficient Accelerator Design for Deformable Convolution Networks
Authors:
Dawen Xu,
Cheng Chu,
Cheng Liu,
Ying Wang,
Huawei Li,
Xiaowei Li,
Kwang-Ting Cheng
Abstract:
Deformable convolution networks (DCNs) proposed to address the image recognition with geometric or photometric variations typically involve deformable convolution that convolves on arbitrary locations of input features. The locations change with different inputs and induce considerable dynamic and irregular memory accesses which cannot be handled by classic neural network accelerators (NNAs). More…
▽ More
Deformable convolution networks (DCNs) proposed to address the image recognition with geometric or photometric variations typically involve deformable convolution that convolves on arbitrary locations of input features. The locations change with different inputs and induce considerable dynamic and irregular memory accesses which cannot be handled by classic neural network accelerators (NNAs). Moreover, bilinear interpolation (BLI) operation that is required to obtain deformed features in DCNs also cannot be deployed on existing NNAs directly. Although a general purposed processor (GPP) seated along with classic NNAs can process the deformable convolution, the processing on GPP can be extremely slow due to the lack of parallel computing capability.
To address the problem, we develop a DCN accelerator on existing NNAs to support both the standard convolution and deformable convolution. Specifically, for the dynamic and irregular accesses in DCNs, we have both the input and output features divided into tiles and build a tile dependency table (TDT) to track the irregular tile dependency at runtime. With the TDT, we further develop an on-chip tile scheduler to handle the dynamic and irregular accesses efficiently. In addition, we propose a novel map** strategy to enable parallel BLI processing on NNAs and apply layer fusion techniques for more energy-efficient DCN processing. According to our experiments, the proposed accelerator achieves orders of magnitude higher performance and energy efficiency compared to the typical computing architectures including ARM, ARM+TPU, and GPU with 6.6\% chip area penalty to a classic NNA.
△ Less
Submitted 6 July, 2021;
originally announced July 2021.
-
A Picture May Be Worth a Hundred Words for Visual Question Answering
Authors:
Yusuke Hirota,
Noa Garcia,
Mayu Otani,
Chenhui Chu,
Yuta Nakashima,
Ittetsu Taniguchi,
Takao Onoye
Abstract:
How far can we go with textual representations for understanding pictures? In image understanding, it is essential to use concise but detailed image representations. Deep visual features extracted by vision models, such as Faster R-CNN, are prevailing used in multiple tasks, and especially in visual question answering (VQA). However, conventional deep visual features may struggle to convey all the…
▽ More
How far can we go with textual representations for understanding pictures? In image understanding, it is essential to use concise but detailed image representations. Deep visual features extracted by vision models, such as Faster R-CNN, are prevailing used in multiple tasks, and especially in visual question answering (VQA). However, conventional deep visual features may struggle to convey all the details in an image as we humans do. Meanwhile, with recent language models' progress, descriptive text may be an alternative to this problem. This paper delves into the effectiveness of textual representations for image understanding in the specific context of VQA. We propose to take description-question pairs as input, instead of deep visual features, and fed them into a language-only Transformer model, simplifying the process and the computational cost. We also experiment with data augmentation techniques to increase the diversity in the training set and avoid learning statistical bias. Extensive evaluations have shown that textual representations require only about a hundred words to compete with deep visual features on both VQA 2.0 and VQA-CP v2.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
Joint Determination of Reactor Antineutrino Spectra from $^{235}$U and $^{239}$Pu Fission by Daya Bay and PROSPECT
Authors:
Daya Bay Collaboration,
PROSPECT Collaboration,
F. P. An,
M. Andriamirado,
A. B. Balantekin,
H. R. Band,
C. D. Bass,
D. E. Bergeron,
D. Berish,
M. Bishai,
S. Blyth,
N. S. Bowden,
C. D. Bryan,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu
, et al. (217 additional authors not shown)
Abstract:
A joint determination of the reactor antineutrino spectra resulting from the fission of $^{235}$U and $^{239}$Pu has been carried out by the Daya Bay and PROSPECT collaborations. This Letter reports the level of consistency of $^{235}$U spectrum measurements from the two experiments and presents new results from a joint analysis of both data sets. The measurements are found to be consistent. The c…
▽ More
A joint determination of the reactor antineutrino spectra resulting from the fission of $^{235}$U and $^{239}$Pu has been carried out by the Daya Bay and PROSPECT collaborations. This Letter reports the level of consistency of $^{235}$U spectrum measurements from the two experiments and presents new results from a joint analysis of both data sets. The measurements are found to be consistent. The combined analysis reduces the degeneracy between the dominant $^{235}$U and $^{239}$Pu isotopes and improves the uncertainty of the $^{235}$U spectral shape to about 3\%. The ${}^{235}$U and $^{239}$Pu antineutrino energy spectra are unfolded from the jointly deconvolved reactor spectra using the Wiener-SVD unfolding method, providing a data-based reference for other reactor antineutrino experiments and other applications. This is the first measurement of the $^{235}$U and $^{239}$Pu spectra based on the combination of experiments at low- and highly enriched uranium reactors.
△ Less
Submitted 22 February, 2022; v1 submitted 23 June, 2021;
originally announced June 2021.
-
A Game-Theoretic Taxonomy of Visual Concepts in DNNs
Authors:
Xu Cheng,
Chuntung Chu,
Yi Zheng,
Jie Ren,
Quanshi Zhang
Abstract:
In this paper, we rethink how a DNN encodes visual concepts of different complexities from a new perspective, i.e. the game-theoretic multi-order interactions between pixels in an image. Beyond the categorical taxonomy of objects and the cognitive taxonomy of textures and shapes, we provide a new taxonomy of visual concepts, which helps us interpret the encoding of shapes and textures, in terms of…
▽ More
In this paper, we rethink how a DNN encodes visual concepts of different complexities from a new perspective, i.e. the game-theoretic multi-order interactions between pixels in an image. Beyond the categorical taxonomy of objects and the cognitive taxonomy of textures and shapes, we provide a new taxonomy of visual concepts, which helps us interpret the encoding of shapes and textures, in terms of concept complexities. In this way, based on multi-order interactions, we find three distinctive signal-processing behaviors of DNNs encoding textures. Besides, we also discover the flexibility for a DNN to encode shapes is lower than the flexibility of encoding textures. Furthermore, we analyze how DNNs encode outlier samples, and explore the impacts of network architectures on interactions. Additionally, we clarify the crucial role of the multi-order interactions in real-world applications. The code will be released when the paper is accepted.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
On the Trust and Trust Modelling for the Future Fully-Connected Digital World: A Comprehensive Study
Authors:
Hannah Lim **g Ting,
Xin Kang,
Tieyan Li,
Haiguang Wang,
Cheng-Kang Chu
Abstract:
With the fast development of digital technologies, we are running into a digital world. The relationship among people and the connections among things become more and more complex, and new challenges arise. To tackle these challenges, trust-a soft security mechanism-is considered as a promising technology. Thus, in this survey, we do a comprehensive study on the trust and trust modelling for the f…
▽ More
With the fast development of digital technologies, we are running into a digital world. The relationship among people and the connections among things become more and more complex, and new challenges arise. To tackle these challenges, trust-a soft security mechanism-is considered as a promising technology. Thus, in this survey, we do a comprehensive study on the trust and trust modelling for the future digital world. We revisit the definitions and properties of trust, and analysis the trust theories and discuss their impact on digital trust modelling. We analyze the digital world and its corresponding environment where people, things, and infrastructure connect with each other. We detail the challenges that require trust in these digital scenarios. Under our analysis of trust and the digital world, we define different types of trust relationships and find out the factors that are needed to ensure a fully representative model. Next, to meet the challenges of digital trust modelling, comprehensive trust model evaluation criteria are proposed, and potential securities and privacy issues of trust modelling are analyzed. Finally, we provide a wide-ranging analysis of different methodologies, mathematical theories, and how they can be applied to trust modelling.
△ Less
Submitted 14 June, 2021;
originally announced June 2021.
-
HyCA: A Hybrid Computing Architecture for Fault Tolerant Deep Learning
Authors:
Cheng Liu,
Cheng Chu,
Dawen Xu,
Ying Wang,
Qianlong Wang,
Huawei Li,
Xiaowei Li,
Kwang-Ting Cheng
Abstract:
Hardware faults on the regular 2-D computing array of a typical deep learning accelerator (DLA) can lead to dramatic prediction accuracy loss. Prior redundancy design approaches typically have each homogeneous redundant processing element (PE) to mitigate faulty PEs for a limited region of the 2-D computing array rather than the entire computing array to avoid the excessive hardware overhead. Howe…
▽ More
Hardware faults on the regular 2-D computing array of a typical deep learning accelerator (DLA) can lead to dramatic prediction accuracy loss. Prior redundancy design approaches typically have each homogeneous redundant processing element (PE) to mitigate faulty PEs for a limited region of the 2-D computing array rather than the entire computing array to avoid the excessive hardware overhead. However, they fail to recover the computing array when the number of faulty PEs in any region exceeds the number of redundant PEs in the same region. The mismatch problem deteriorates when the fault injection rate rises and the faults are unevenly distributed. To address the problem, we propose a hybrid computing architecture (HyCA) for fault-tolerant DLAs. It has a set of dot-production processing units (DPPUs) to recompute all the operations that are mapped to the faulty PEs despite the faulty PE locations. According to our experiments, HyCA shows significantly higher reliability, scalability, and performance with less chip area penalty when compared to the conventional redundancy approaches. Moreover, by taking advantage of the flexible recomputing, HyCA can also be utilized to scan the entire 2-D computing array and detect the faulty PEs effectively at runtime.
△ Less
Submitted 27 October, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Lightweight Cross-Lingual Sentence Representation Learning
Authors:
Zhuoyuan Mao,
Prakhar Gupta,
Pei Wang,
Chenhui Chu,
Martin Jaggi,
Sadao Kurohashi
Abstract:
Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture wit…
▽ More
Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture with just 2 layers for generating memory-efficient cross-lingual sentence representations. We explore different training tasks and observe that current cross-lingual training tasks leave a lot to be desired for this shallow architecture. To ameliorate this, we propose a novel cross-lingual language model, which combines the existing single-word masked language model with the newly proposed cross-lingual token-level reconstruction task. We further augment the training task by the introduction of two computationally-lite sentence-level contrastive learning tasks to enhance the alignment of cross-lingual sentence representation space, which compensates for the learning bottleneck of the lightweight transformer for generative tasks. Our comparisons with competing models on cross-lingual sentence retrieval and multilingual document classification confirm the effectiveness of the newly proposed training tasks for a shallow model.
△ Less
Submitted 27 May, 2022; v1 submitted 28 May, 2021;
originally announced May 2021.
-
GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph
Authors:
Cheikh Brahim El Vaigh,
Noa Garcia,
Benjamin Renoust,
Chenhui Chu,
Yuta Nakashima,
Hajime Nagahara
Abstract:
The rise of digitization of cultural documents offers large-scale contents, opening the road for development of AI systems in order to preserve, search, and deliver cultural heritage. To organize such cultural content also means to classify them, a task that is very familiar to modern computer science. Contextual information is often the key to structure such real world data, and we propose to use…
▽ More
The rise of digitization of cultural documents offers large-scale contents, opening the road for development of AI systems in order to preserve, search, and deliver cultural heritage. To organize such cultural content also means to classify them, a task that is very familiar to modern computer science. Contextual information is often the key to structure such real world data, and we propose to use it in form of a knowledge graph. Such a knowledge graph, combined with content analysis, enhances the notion of proximity between artworks so it improves the performances in classification tasks. In this paper, we propose a novel use of a knowledge graph, that is constructed on annotated data and pseudo-labeled data. With label propagation, we boost artwork classification by training a model using a graph convolutional network, relying on the relationships between entities of the knowledge graph. Following a transductive learning framework, our experiments show that relying on a knowledge graph modeling the relations between labeled data and unlabeled data allows to achieve state-of-the-art results on multiple classification tasks on a dataset of paintings, and on a dataset of Buddha statues. Additionally, we show state-of-the-art results for the difficult case of dealing with unbalanced data, with the limitation of disregarding classes with extremely low degrees in the knowledge graph.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
An Integrated Deep Learning and Dynamic Programming Method for Predicting Tumor Suppressor Genes, Oncogenes, and Fusion from PDB Structures
Authors:
Nishanth. Anandanadarajah,
C. H. Chu,
R. Loganantharaj
Abstract:
Mutations in proto-oncogenes (ONGO) and the loss of regulatory function of tumor suppression genes (TSG) are the common underlying mechanism for uncontrolled tumor growth. While cancer is a heterogeneous complex of distinct diseases, finding the potentiality of the genes related functionality to ONGO or TSG through computational studies can help develop drugs that target the disease. This paper pr…
▽ More
Mutations in proto-oncogenes (ONGO) and the loss of regulatory function of tumor suppression genes (TSG) are the common underlying mechanism for uncontrolled tumor growth. While cancer is a heterogeneous complex of distinct diseases, finding the potentiality of the genes related functionality to ONGO or TSG through computational studies can help develop drugs that target the disease. This paper proposes a classification method that starts with a preprocessing stage to extract the feature map sets from the input 3D protein structural information. The next stage is a deep convolutional neural network stage (DCNN) that outputs the probability of functional classification of genes. We explored and tested two approaches: in Approach 1, all filtered and cleaned 3D-protein-structures (PDB) are pooled together, whereas in Approach 2, the primary structures and their corresponding PDBs are separated according to the genes' primary structural information. Following the DCNN stage, a dynamic programming-based method is used to determine the final prediction of the primary structures' functionality. We validated our proposed method using the COSMIC online database. For the ONGO vs TSG classification problem, the AUROC of the DCNN stage for Approach 1 and Approach 2 DCNN are 0.978 and 0.765, respectively. The AUROCs of the final genes' primary structure functionality classification for Approach 1 and Approach 2 are 0.989, and 0.879, respectively. For comparison, the current state-of-the-art reported AUROC is 0.924.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
BasisNet: Two-stage Model Synthesis for Efficient Inference
Authors:
Mingda Zhang,
Chun-Te Chu,
Andrey Zhmoginov,
Andrew Howard,
Brendan Jou,
Yukun Zhu,
Li Zhang,
Rebecca Hwa,
Adriana Kovashka
Abstract:
In this work, we present BasisNet which combines recent advancements in efficient neural network architectures, conditional computation, and early termination in a simple new form. Our approach incorporates a lightweight model to preview the input and generate input-dependent combination coefficients, which later controls the synthesis of a more accurate specialist model to make final prediction.…
▽ More
In this work, we present BasisNet which combines recent advancements in efficient neural network architectures, conditional computation, and early termination in a simple new form. Our approach incorporates a lightweight model to preview the input and generate input-dependent combination coefficients, which later controls the synthesis of a more accurate specialist model to make final prediction. The two-stage model synthesis strategy can be applied to any network architectures and both stages are jointly trained. We also show that proper training recipes are critical for increasing generalizability for such high capacity neural networks. On ImageNet classification benchmark, our BasisNet with MobileNets as backbone demonstrated clear advantage on accuracy-efficiency trade-off over several strong baselines. Specifically, BasisNet-MobileNetV3 obtained 80.3% top-1 accuracy with only 290M Multiply-Add operations, halving the computational cost of previous state-of-the-art without sacrificing accuracy. With early termination, the average cost can be further reduced to 198M MAdds while maintaining accuracy of 80.0% on ImageNet.
△ Less
Submitted 6 May, 2021;
originally announced May 2021.
-
Induced Quantized Spin Current in Vacuum
Authors:
Chong-Sun Chu,
Chun-Hei Leung
Abstract:
We uncover a fundamental effect of the QED vacuum in an external electromagnetic (EM) field. We show that the quantized vacuum of electrons is spin polarized by the EM field and manifests as a vacuum spin current. An experiment is proposed to measure the spin torque exerted by the spin current by measuring the twisted angle of the director axis of a nematic liquid crystal.
We uncover a fundamental effect of the QED vacuum in an external electromagnetic (EM) field. We show that the quantized vacuum of electrons is spin polarized by the EM field and manifests as a vacuum spin current. An experiment is proposed to measure the spin torque exerted by the spin current by measuring the twisted angle of the director axis of a nematic liquid crystal.
△ Less
Submitted 16 September, 2021; v1 submitted 30 April, 2021;
originally announced May 2021.
-
Camera View Adjustment Prediction for Improving Image Composition
Authors:
Yu-Chuan Su,
Raviteja Vemulapalli,
Ben Weiss,
Chun-Te Chu,
Philip Andrew Mansfield,
Lior Shapira,
Colvin Pitts
Abstract:
Image composition plays an important role in the quality of a photo. However, not every camera user possesses the knowledge and expertise required for capturing well-composed photos. While post-capture crop** can improve the composition sometimes, it does not work in many common scenarios in which the photographer needs to adjust the camera view to capture the best shot. To address this issue, w…
▽ More
Image composition plays an important role in the quality of a photo. However, not every camera user possesses the knowledge and expertise required for capturing well-composed photos. While post-capture crop** can improve the composition sometimes, it does not work in many common scenarios in which the photographer needs to adjust the camera view to capture the best shot. To address this issue, we propose a deep learning-based approach that provides suggestions to the photographer on how to adjust the camera view before capturing. By optimizing the composition before a photo is captured, our system helps photographers to capture better photos. As there is no publicly-available dataset for this task, we create a view adjustment dataset by repurposing existing image crop** datasets. Furthermore, we propose a two-stage semi-supervised approach that utilizes both labeled and unlabeled images for training a view adjustment model. Experiment results show that the proposed semi-supervised approach outperforms the corresponding supervised alternatives, and our user study results show that the suggested view adjustment improves image composition 79% of the time.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Pressure-induced high-temperature superconductivity retained at ambient
Authors:
Liangzi Deng,
Trevor Bontke,
Rabin Dahal,
Yu Xie,
Bin Gao,
Xue Li,
Ketao Yin,
Melissa Gooch,
Donald Rolston,
Tong Chen,
Zheng Wu,
Yanming Ma,
Pengcheng Dai,
Ching-Wu Chu
Abstract:
To raise the superconducting-transition temperature (Tc) has been the driving force for the long, sustained effort in superconductivity research. Recent progress in hydrides with Tcs up to 287 K under 267 GPa has heralded a new era of room-temperature superconductivity (RTS) with immense technological promise. Indeed, RTS has lifted the temperature barrier for the ubiquitous application of superco…
▽ More
To raise the superconducting-transition temperature (Tc) has been the driving force for the long, sustained effort in superconductivity research. Recent progress in hydrides with Tcs up to 287 K under 267 GPa has heralded a new era of room-temperature superconductivity (RTS) with immense technological promise. Indeed, RTS has lifted the temperature barrier for the ubiquitous application of superconductivity. Unfortunately, formidable pressure is required to attain such high Tcs. The most effective relief to this impasse is to remove the pressure needed while retaining the pressure-induced Tc without pressure. Here we show such a possibility in the pure and doped high-temperature superconductor (HTS) FeSe by retaining, at ambient via pressure-quenching (PQ), its Tc up to 37 K (quadrupling that of a pristine FeSe) and other pressure-induced phases. We have also observed that some phases remain stable without pressure at up to 300 K and for at least 7 days. The observations are in qualitative agreement with our ab initio simulations using the solid-state nudged elastic band (SSNEB) method. We strongly believe that the PQ technique developed here can be adapted to the RTS hydrides and other materials of value with minimal effort.
△ Less
Submitted 12 April, 2021;
originally announced April 2021.
-
Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models
Authors:
Dheevatsa Mudigere,
Yuchen Hao,
Jianyu Huang,
Zhihao Jia,
Andrew Tulloch,
Srinivas Sridharan,
Xing Liu,
Mustafa Ozdal,
Jade Nie,
Jongsoo Park,
Liang Luo,
Jie Amy Yang,
Leon Gao,
Dmytro Ivchenko,
Aarti Basant,
Yuxi Hu,
Jiyan Yang,
Ehsan K. Ardestani,
Xiaodong Wang,
Rakesh Komuravelli,
Ching-Hsiang Chu,
Serhat Yilmaz,
Huayu Li,
Jiyuan Qian,
Zhuobo Feng
, et al. (28 additional authors not shown)
Abstract:
Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pa…
▽ More
Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pair it with the new evolution of Zion platform, namely ZionEX. We demonstrate the capability to train very large DLRMs with up to 12 Trillion parameters and show that we can attain 40X speedup in terms of time to solution over previous systems. We achieve this by (i) designing the ZionEX platform with dedicated scale-out network, provisioned with high bandwidth, optimal topology and efficient transport (ii) implementing an optimized PyTorch-based training stack supporting both model and data parallelism (iii) develo** sharding algorithms capable of hierarchical partitioning of the embedding tables along row, column dimensions and load balancing them across multiple workers; (iv) adding high-performance core operators while retaining flexibility to support optimizers with fully deterministic updates (v) leveraging reduced precision communications, multi-level memory hierarchy (HBM+DDR+SSD) and pipelining. Furthermore, we develop and briefly comment on distributed data ingestion and other supporting services that are required for the robust and efficient end-to-end training in production environments.
△ Less
Submitted 26 February, 2023; v1 submitted 11 April, 2021;
originally announced April 2021.
-
Dopamine Transporter SPECT Image Classification for Neurodegenerative Parkinsonism via Diffusion Maps and Machine Learning Classifiers
Authors:
Jun-En Ding,
Chi-Hsiang Chu,
Mong-Na Lo Huang,
Chien-Ching Hsu
Abstract:
Neurodegenerative parkinsonism can be assessed by dopamine transporter single photon emission computed tomography (DaT-SPECT). Although generating images is time consuming, these images can show interobserver variability and they have been visually interpreted by nuclear medicine physicians to date. Accordingly, this study aims to provide an automatic and robust method based on Diffusion Maps and…
▽ More
Neurodegenerative parkinsonism can be assessed by dopamine transporter single photon emission computed tomography (DaT-SPECT). Although generating images is time consuming, these images can show interobserver variability and they have been visually interpreted by nuclear medicine physicians to date. Accordingly, this study aims to provide an automatic and robust method based on Diffusion Maps and machine learning classifiers to classify the SPECT images into two types, namely Normal and Abnormal DaT-SPECT image groups. In the proposed method, the 3D images of N patients are mapped to an N by N pairwise distance matrix and are visualized in Diffusion Maps coordinates. The images of the training set are embedded into a low-dimensional space by using diffusion maps. Moreover, we use Nyström's out-of-sample extension, which embeds new sample points as the testing set in the reduced space. Testing samples in the embedded space are then classified into two types through the ensemble classifier with Linear Discriminant Analysis (LDA) and voting procedure through twenty-five-fold cross-validation results. The feasibility of the method is demonstrated via Parkinsonism Progression Markers Initiative (PPMI) dataset of 1097 subjects and a clinical cohort from Kaohsiung Chang Gung Memorial Hospital (KCGMH-TW) of 630 patients. We compare performances using Diffusion Maps with those of three alternative manifold methods for dimension reduction, namely Locally Linear Embedding (LLE), Isomorphic Map** Algorithm (Isomap), and Kernel Principal Component Analysis (Kernel PCA). We also compare results using 2D and 3D CNN methods. The diffusion maps method has an average accuracy of 98% for the PPMI and 90% for the KCGMH-TW dataset with twenty-five fold cross-validation results. It outperforms the other three methods concerning the overall accuracy and the robustness in the training and testing samples.
△ Less
Submitted 7 May, 2021; v1 submitted 6 April, 2021;
originally announced April 2021.
-
Generalized Darmois-Israel junction conditions
Authors:
Chong-Sun Chu,
H. S. Tan
Abstract:
We present a general method to derive the appropriate Darmois-Israel junction conditions for gravitational theories with higher-order derivative terms by integrating the bulk equations of motion across the singular hypersurface. In higher derivative theories, the field equations can contain terms which are more singular than the Dirac delta distribution. To handle them appropriately, we formulate…
▽ More
We present a general method to derive the appropriate Darmois-Israel junction conditions for gravitational theories with higher-order derivative terms by integrating the bulk equations of motion across the singular hypersurface. In higher derivative theories, the field equations can contain terms which are more singular than the Dirac delta distribution. To handle them appropriately, we formulate a regularization procedure based on representing the delta function as the limit of a sequence of classical functions. This procedure involves imposing suitable constraints on the extrinsic curvature such that the field equations are compatible with the singular source being a delta distribution. As explicit examples of our approach, we demonstrate in detail how to obtain the generalized junction conditions for quadratic gravity, $\mathcal{F}(R)$ theories, a 4D low-energy effective action in string theory and action terms that are Euler densities. Our results are novel, and refine the accuracy of previously claimed results in $\mathcal{F} (R)$ theories and quadratic gravity. In particular, when the coupling constants of quadratic gravity are those for the Gauss-Bonnet case, our junction conditions reduce to the known ones for the latter obtained independently by boundary variation of a surface term in the action. Finally, we briefly discuss a couple of applications to thin-shell wormholes and stellar models.
△ Less
Submitted 24 April, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Inferring the Type of Phase Transitions Undergone in Epileptic Seizures Using Random Graph Hidden Markov Models for Percolation in Noisy Dynamic Networks
Authors:
Xiao**g Zhu,
Heather Shappell,
Mark A. Kramer,
Catherine J. Chu,
Eric D. Kolaczyk
Abstract:
In clinical neuroscience, epileptic seizures have been associated with the sudden emergence of coupled activity across the brain. The resulting functional networks - in which edges indicate strong enough coupling between brain regions - are consistent with the notion of percolation, which is a phenomenon in complex networks corresponding to the sudden emergence of a giant connected component. Trad…
▽ More
In clinical neuroscience, epileptic seizures have been associated with the sudden emergence of coupled activity across the brain. The resulting functional networks - in which edges indicate strong enough coupling between brain regions - are consistent with the notion of percolation, which is a phenomenon in complex networks corresponding to the sudden emergence of a giant connected component. Traditionally, work has concentrated on noise-free percolation with a monotonic process of network growth, but real-world networks are more complex. We develop a class of random graph hidden Markov models (RG-HMMs) for characterizing percolation regimes in noisy, dynamically evolving networks in the presence of edge birth and edge death, as well as noise. This class is used to understand the type of phase transitions undergone in a seizure, and in particular, distinguishing between different percolation regimes in epileptic seizures. We develop a hypothesis testing framework for inferring putative percolation mechanisms. As a necessary precursor, we present an EM algorithm for estimating parameters from a sequence of noisy networks only observed at a longitudinal subsampling of time points. Our results suggest that different types of percolation can occur in human seizures. The type inferred may suggest tailored treatment strategies and provide new insights into the fundamental science of epilepsy.
△ Less
Submitted 23 February, 2021;
originally announced February 2021.
-
Antineutrino Energy Spectrum Unfolding Based on the Daya Bay Measurement and Its Applications
Authors:
Daya Bay collaboration,
F. P. An,
A. B. Balantekin,
H. R. Band,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
Y. Y. Ding,
M. V. Diwan,
T. Dohnal,
J. Dove
, et al. (162 additional authors not shown)
Abstract:
The prediction of reactor antineutrino spectra will play a crucial role as reactor experiments enter the precision era. The positron energy spectrum of 3.5 million antineutrino inverse beta decay reactions observed by the Daya Bay experiment, in combination with the fission rates of fissile isotopes in the reactor, is used to extract the positron energy spectra resulting from the fission of specif…
▽ More
The prediction of reactor antineutrino spectra will play a crucial role as reactor experiments enter the precision era. The positron energy spectrum of 3.5 million antineutrino inverse beta decay reactions observed by the Daya Bay experiment, in combination with the fission rates of fissile isotopes in the reactor, is used to extract the positron energy spectra resulting from the fission of specific isotopes. This information can be used to produce a precise, data-based prediction of the antineutrino energy spectrum in other reactor antineutrino experiments with different fission fractions than Daya Bay. The positron energy spectra are unfolded to obtain the antineutrino energy spectra by removing the contribution from detector response with the Wiener-SVD unfolding method. Consistent results are obtained with other unfolding methods. A technique to construct a data-based prediction of the reactor antineutrino energy spectrum is proposed and investigated. Given the reactor fission fractions, the technique can predict the energy spectrum to a 2% precision. In addition, we illustrate how to perform a rigorous comparison between the unfolded antineutrino spectrum and a theoretical model prediction that avoids the input model bias of the unfolding method.
△ Less
Submitted 6 July, 2021; v1 submitted 8 February, 2021;
originally announced February 2021.
-
High Frequency Radio Observations of Two Magnetars, PSR J1622$-$4950 and 1E 1547.0$-$5408
Authors:
Che-Yen Chu,
C. -Y. Ng,
Albert K. H. Kong,
Hsiang-Kuang Chang
Abstract:
We investigated the radio spectra of two magnetars, PSR J1622$-$4950 and 1E 1547.0$-$5408, using observations from the Australia Telescope Compact Array and the Atacama Large Millimeter/submillimeter Array taken in 2017. Our observations of PSR J1622$-$4950 show a steep spectrum with a spectral index of $-$1.3 $\pm$ 0.2 in the range of 5.5-45 GHz during its re-activating X-ray outburst in 2017. By…
▽ More
We investigated the radio spectra of two magnetars, PSR J1622$-$4950 and 1E 1547.0$-$5408, using observations from the Australia Telescope Compact Array and the Atacama Large Millimeter/submillimeter Array taken in 2017. Our observations of PSR J1622$-$4950 show a steep spectrum with a spectral index of $-$1.3 $\pm$ 0.2 in the range of 5.5-45 GHz during its re-activating X-ray outburst in 2017. By comparing the data taken at different epochs, we found significant enhancement in the radio flux density. The spectrum of 1E 1547.0$-$5408 was inverted in the range of 43-95 GHz, suggesting a spectral peak at a few hundred gigahertz. Moreover, we obtained the X-ray and radio data of radio magnetars, PSR J1622$-$4950 and SGR J1745$-$2900, from literature and found two interesting properties. First, radio emission is known to be associated with X-ray outburst but has different evolution. We further found that the rising time of the radio emission is much longer than that of the X-ray during the outburst. Second, the radio magnetars may have double peak spectra at a few GHz and a few hundred GHz. This could indicate that the emission mechanism is different in the cm and the sub-mm bands. These two phenomenons could provide a hint to understand the origin of radio emission and its connection with the X-ray properties.
△ Less
Submitted 4 February, 2021;
originally announced February 2021.
-
Understanding the Role of Scene Graphs in Visual Question Answering
Authors:
Vinay Damodaran,
Sharanya Chakravarthy,
Akshay Kumar,
Anjana Umapathy,
Teruko Mitamura,
Yuta Nakashima,
Noa Garcia,
Chenhui Chu
Abstract:
Visual Question Answering (VQA) is of tremendous interest to the research community with important applications such as aiding visually impaired users and image-based search. In this work, we explore the use of scene graphs for solving the VQA task. We conduct experiments on the GQA dataset which presents a challenging set of questions requiring counting, compositionality and advanced reasoning ca…
▽ More
Visual Question Answering (VQA) is of tremendous interest to the research community with important applications such as aiding visually impaired users and image-based search. In this work, we explore the use of scene graphs for solving the VQA task. We conduct experiments on the GQA dataset which presents a challenging set of questions requiring counting, compositionality and advanced reasoning capability, and provides scene graphs for a large number of images. We adopt image + question architectures for use with scene graphs, evaluate various scene graph generation techniques for unseen images, propose a training curriculum to leverage human-annotated and auto-generated scene graphs, and build late fusion architectures to learn from multiple image representations. We present a multi-faceted study into the use of scene graphs for VQA, making this work the first of its kind.
△ Less
Submitted 16 January, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
Asymmetric self-play for automatic goal discovery in robotic manipulation
Authors:
OpenAI OpenAI,
Matthias Plappert,
Raul Sampedro,
Tao Xu,
Ilge Akkaya,
Vineet Kosaraju,
Peter Welinder,
Ruben D'Sa,
Arthur Petron,
Henrique P. d. O. Pinto,
Alex Paino,
Hyeonwoo Noh,
Lilian Weng,
Qiming Yuan,
Casey Chu,
Wojciech Zaremba
Abstract:
We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play a game. Alice is asked to propose challenging goals and Bob aims to solve them. We show that this method can discover highly diverse and complex goals without an…
▽ More
We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play a game. Alice is asked to propose challenging goals and Bob aims to solve them. We show that this method can discover highly diverse and complex goals without any human priors. Bob can be trained with only sparse rewards, because the interaction between Alice and Bob results in a natural curriculum and Bob can learn from Alice's trajectory when relabeled as a goal-conditioned demonstration. Finally, our method scales, resulting in a single policy that can generalize to many unseen tasks such as setting a table, stacking blocks, and solving simple puzzles. Videos of a learned policy is available at https://robotics-self-play.github.io.
△ Less
Submitted 13 January, 2021;
originally announced January 2021.
-
Role of Crown in Tree Resistance Against High Winds
Authors:
Hsin-Huei Li,
Yu-Chuan Cheng,
Kai-Jie Yang,
Chia-Ren Chu,
Tzay-Ming Hong
Abstract:
Rather than using wooden sticks to simulate the breakage of trees in high winds as in most research, we employed fresh samples with branches and leaves to certify the crucial role played by the tree crown. By using the blowdown wind tunnel with a maximum wind speed of 60 m/s, we purposely reduce the number of leaves and show that the drag force will drop by as much as two thirds when half pruned.…
▽ More
Rather than using wooden sticks to simulate the breakage of trees in high winds as in most research, we employed fresh samples with branches and leaves to certify the crucial role played by the tree crown. By using the blowdown wind tunnel with a maximum wind speed of 60 m/s, we purposely reduce the number of leaves and show that the drag force will drop by as much as two thirds when half pruned. Based on real observations, we model the leaf by an open and full cone in the presence of light and strong wind, and calculate how their corresponding cross-sectional area and drag force vary with wind speed. Different power-law relations are predicted and confirmed by experiments for these properties before and after the formation of a full cone. Compared to the empirical value of 1/3 and 3/4, our simple model gave 2/5 and 2/3 for the power-law exponent of cross-sectional area at low and high winds. Discrepancy can be accounted for by including further details, such as the reorientation of open cones and the movement of branches.
△ Less
Submitted 5 January, 2021;
originally announced January 2021.
-
A Gleason-Kahane-Żelazko theorem for reproducing kernel Hilbert spaces
Authors:
Cheng Chu,
Michael Hartz,
Javad Mashreghi,
Thomas Ransford
Abstract:
We establish the following Hilbert-space analogue of the Gleason-Kahane-Żelazko theorem. If $\mathcal{H}$ is a reproducing kernel Hilbert space with a normalized complete Pick kernel, and if $Λ$ is a linear functional on $\mathcal{H}$ such that $Λ(1)=1$ and $Λ(f)\ne0$ for all cyclic functions $f\in\mathcal{H}$, then $Λ$ is multiplicative, in the sense that $Λ(fg)=Λ(f)Λ(g)$ for all…
▽ More
We establish the following Hilbert-space analogue of the Gleason-Kahane-Żelazko theorem. If $\mathcal{H}$ is a reproducing kernel Hilbert space with a normalized complete Pick kernel, and if $Λ$ is a linear functional on $\mathcal{H}$ such that $Λ(1)=1$ and $Λ(f)\ne0$ for all cyclic functions $f\in\mathcal{H}$, then $Λ$ is multiplicative, in the sense that $Λ(fg)=Λ(f)Λ(g)$ for all $f,g\in\mathcal{H}$ such that $fg\in\mathcal{H}$. Moreover $Λ$ is automatically continuous. We give examples to show that the theorem fails if the hypothesis of a complete Pick kernel is omitted. We also discuss conditions under which $Λ$ has to be a point evaluation.
△ Less
Submitted 20 August, 2021; v1 submitted 4 November, 2020;
originally announced November 2020.
-
Pressure induced superconductivity in MnSe
Authors:
T. L. Hung,
C. H. Huang,
L. Z. Deng,
M. N. Ou,
Y. Y. Chen,
M. K. Wu,
S. Y. Huyan,
C. W. Chu,
P. J. Chen,
T. K. Lee
Abstract:
The rich phenomena in the FeSe and related compounds have attracted great interests as it provides fertile material to gain further insight into the mechanism of high temperature superconductivity. A natural follow-up work was to look into the possibility of superconductivity in MnSe. It was shown that MnP becomes superconducting with Tc ~ 1 K under pressure. We demonstrated in this work that high…
▽ More
The rich phenomena in the FeSe and related compounds have attracted great interests as it provides fertile material to gain further insight into the mechanism of high temperature superconductivity. A natural follow-up work was to look into the possibility of superconductivity in MnSe. It was shown that MnP becomes superconducting with Tc ~ 1 K under pressure. We demonstrated in this work that high pressure can effectively suppress the complex magnetic characters of MnSe crystal when observed at ambient condition. MnSe under pressure is found to undergo several structural transformations: the cubic phase first partially transforms to the hexagonal phase at about 12 GPa, the crystal exhibits the coexistence of cubic, hexagonal and orthorhombic phases from 16 GPa to 30 GPa, and above 30 GPa the crystal shows a single orthorhombic phase. Superconductivity with Tc ~ 5 K was first observed at pressure ~12 GPa by magnetic measurements (~16 GPa by resistive measurements). The highest Tc is ~ 9 K (magnetic result) at ~35 GPa. Our observations suggest the observed superconductivity may closely relate to the pressure-induced structural change. However, the interface between the metallic and insulating boundaries may also play an important role to the pressure induced superconductivity in MnSe.
△ Less
Submitted 3 November, 2020;
originally announced November 2020.
-
A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences
Authors:
Andrew Merritt,
Chenhui Chu,
Yuki Arase
Abstract:
Multimodal neural machine translation (NMT) has become an increasingly important area of research over the years because additional modalities, such as image data, can provide more context to textual data. Furthermore, the viability of training multimodal NMT models without a large parallel corpus continues to be investigated due to low availability of parallel sentences with images, particularly…
▽ More
Multimodal neural machine translation (NMT) has become an increasingly important area of research over the years because additional modalities, such as image data, can provide more context to textual data. Furthermore, the viability of training multimodal NMT models without a large parallel corpus continues to be investigated due to low availability of parallel sentences with images, particularly for English-Japanese data. However, this void can be filled with comparable sentences that contain bilingual terms and parallel phrases, which are naturally created through media such as social network posts and e-commerce product descriptions. In this paper, we propose a new multimodal English-Japanese corpus with comparable sentences that are compiled from existing image captioning datasets. In addition, we supplement our comparable sentences with a smaller parallel corpus for validation and test purposes. To test the performance of this comparable sentence translation scenario, we train several baseline NMT models with our comparable corpus and evaluate their English-Japanese translation performance. Due to low translation scores in our baseline experiments, we believe that current multimodal NMT models are not designed to effectively utilize comparable sentence data. Despite this, we hope for our corpus to be used to further research into multimodal NMT with comparable sentences.
△ Less
Submitted 17 October, 2020;
originally announced October 2020.
-
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview
Authors:
Alena Butryna,
Shan-Hui Cathy Chu,
Isin Demirsahin,
Alexander Gutkin,
Linne Ha,
Fei He,
Martin Jansche,
Cibu Johny,
Anna Katanova,
Oddur Kjartansson,
Chenfang Li,
Tatiana Merkulova,
Yin May Oo,
Knot Pipatsrisawat,
Clara Rivera,
Supheakmungkol Sarin,
Pasindu de Silva,
Keshan Sodimana,
Richard Sproat,
Theeraphol Wattanavekin,
Jaka Aris Eko Wibawa
Abstract:
This paper presents an overview of a program designed to address the growing need for develo** freely available speech resources for under-represented languages. At present we have released 38 datasets for building text-to-speech and automatic speech recognition applications for languages and dialects of South and Southeast Asia, Africa, Europe and South America. The paper describes the methodol…
▽ More
This paper presents an overview of a program designed to address the growing need for develo** freely available speech resources for under-represented languages. At present we have released 38 datasets for building text-to-speech and automatic speech recognition applications for languages and dialects of South and Southeast Asia, Africa, Europe and South America. The paper describes the methodology used for develo** such corpora and presents some of our findings that could benefit under-represented language communities.
△ Less
Submitted 13 October, 2020;
originally announced October 2020.
-
Lexically Cohesive Neural Machine Translation with Copy Mechanism
Authors:
Vipul Mishra,
Chenhui Chu,
Yuki Arase
Abstract:
Lexically cohesive translations preserve consistency in word choices in document-level translation. We employ a copy mechanism into a context-aware neural machine translation model to allow copying words from previous translation outputs. Different from previous context-aware neural machine translation models that handle all the discourse phenomena implicitly, our model explicitly addresses the le…
▽ More
Lexically cohesive translations preserve consistency in word choices in document-level translation. We employ a copy mechanism into a context-aware neural machine translation model to allow copying words from previous translation outputs. Different from previous context-aware neural machine translation models that handle all the discourse phenomena implicitly, our model explicitly addresses the lexical cohesion problem by boosting the probabilities to output words consistently. We conduct experiments on Japanese to English translation using an evaluation dataset for discourse translation. The results showed that the proposed model significantly improved lexical cohesion compared to previous context-aware models.
△ Less
Submitted 11 October, 2020;
originally announced October 2020.
-
Constructing a Visual Relationship Authenticity Dataset
Authors:
Chenhui Chu,
Yuto Takebayashi,
Mishra Vipul,
Yuta Nakashima
Abstract:
A visual relationship denotes a relationship between two objects in an image, which can be represented as a triplet of (subject; predicate; object). Visual relationship detection is crucial for scene understanding in images. Existing visual relationship detection datasets only contain true relationships that correctly describe the content in an image. However, distinguishing false visual relations…
▽ More
A visual relationship denotes a relationship between two objects in an image, which can be represented as a triplet of (subject; predicate; object). Visual relationship detection is crucial for scene understanding in images. Existing visual relationship detection datasets only contain true relationships that correctly describe the content in an image. However, distinguishing false visual relationships from true ones is also crucial for image understanding and grounded natural language processing. In this paper, we construct a visual relationship authenticity dataset, where both true and false relationships among all objects appeared in the captions in the Flickr30k entities image caption dataset are annotated. The dataset is available at https://github.com/codecreator2053/VR_ClassifiedDataset. We hope that this dataset can promote the study on both vision and language understanding.
△ Less
Submitted 11 October, 2020;
originally announced October 2020.
-
MEEP: An Open-Source Platform for Human-Human Dialog Collection and End-to-End Agent Training
Authors:
Arkady Arkhangorodsky,
Amittai Axelrod,
Christopher Chu,
Scot Fang,
Yiqi Huang,
Ajay Nagesh,
Xing Shi,
Boliang Zhang,
Kevin Knight
Abstract:
We create a new task-oriented dialog platform (MEEP) where agents are given considerable freedom in terms of utterances and API calls, but are constrained to work within a push-button environment. We include facilities for collecting human-human dialog corpora, and for training automatic agents in an end-to-end fashion. We demonstrate MEEP with a dialog assistant that lets users specify trip desti…
▽ More
We create a new task-oriented dialog platform (MEEP) where agents are given considerable freedom in terms of utterances and API calls, but are constrained to work within a push-button environment. We include facilities for collecting human-human dialog corpora, and for training automatic agents in an end-to-end fashion. We demonstrate MEEP with a dialog assistant that lets users specify trip destinations.
△ Less
Submitted 9 October, 2020;
originally announced October 2020.
-
Solving Historical Dictionary Codes with a Neural Language Model
Authors:
Christopher Chu,
Raphael Valenti,
Kevin Knight
Abstract:
We solve difficult word-based substitution codes by constructing a decoding lattice and searching that lattice with a neural language model. We apply our method to a set of enciphered letters exchanged between US Army General James Wilkinson and agents of the Spanish Crown in the late 1700s and early 1800s, obtained from the US Library of Congress. We are able to decipher 75.1% of the cipher-word…
▽ More
We solve difficult word-based substitution codes by constructing a decoding lattice and searching that lattice with a neural language model. We apply our method to a set of enciphered letters exchanged between US Army General James Wilkinson and agents of the Spanish Crown in the late 1700s and early 1800s, obtained from the US Library of Congress. We are able to decipher 75.1% of the cipher-word tokens correctly.
△ Less
Submitted 9 October, 2020;
originally announced October 2020.
-
Learning to Pronounce Chinese Without a Pronunciation Dictionary
Authors:
Christopher Chu,
Scot Fang,
Kevin Knight
Abstract:
We demonstrate a program that learns to pronounce Chinese text in Mandarin, without a pronunciation dictionary. From non-parallel streams of Chinese characters and Chinese pinyin syllables, it establishes a many-to-many map** between characters and pronunciations. Using unsupervised methods, the program effectively deciphers writing into speech. Its token-level character-to-syllable accuracy is…
▽ More
We demonstrate a program that learns to pronounce Chinese text in Mandarin, without a pronunciation dictionary. From non-parallel streams of Chinese characters and Chinese pinyin syllables, it establishes a many-to-many map** between characters and pronunciations. Using unsupervised methods, the program effectively deciphers writing into speech. Its token-level character-to-syllable accuracy is 89%, which significantly exceeds the 22% accuracy of prior work.
△ Less
Submitted 9 October, 2020;
originally announced October 2020.
-
A Dataset and Baselines for Visual Question Answering on Art
Authors:
Noa Garcia,
Chentao Ye,
Zihua Liu,
Qingtao Hu,
Mayu Otani,
Chenhui Chu,
Yuta Nakashima,
Teruko Mitamura
Abstract:
Answering questions related to art pieces (paintings) is a difficult task, as it implies the understanding of not only the visual information that is shown in the picture, but also the contextual knowledge that is acquired through the study of the history of art. In this work, we introduce our first attempt towards building a new dataset, coined AQUA (Art QUestion Answering). The question-answer (…
▽ More
Answering questions related to art pieces (paintings) is a difficult task, as it implies the understanding of not only the visual information that is shown in the picture, but also the contextual knowledge that is acquired through the study of the history of art. In this work, we introduce our first attempt towards building a new dataset, coined AQUA (Art QUestion Answering). The question-answer (QA) pairs are automatically generated using state-of-the-art question generation methods based on paintings and comments provided in an existing art understanding dataset. The QA pairs are cleansed by crowdsourcing workers with respect to their grammatical correctness, answerability, and answers' correctness. Our dataset inherently consists of visual (painting-based) and knowledge (comment-based) questions. We also present a two-branch model as baseline, where the visual and knowledge questions are handled independently. We extensively compare our baseline model against the state-of-the-art models for question answering, and we provide a comprehensive study about the challenges and potential future directions for visual question answering on art.
△ Less
Submitted 28 August, 2020;
originally announced August 2020.
-
Novel polymorphic phase of BaCu2As2: impact of flux for new phase formation in crystal growth
Authors:
Hanlin Wu,
Sheng Li,
Zheng Wu,
Xiqu Wang,
Gareth A. Ofenstein,
Sunah Kwon,
Moon J. Kim,
Paul C. W. Chu,
Bing Lv
Abstract:
In this work, we have thoroughly studied the effects of flux composition and temperature on the crystal growth of the BaCu2As2 compound. While Pb and CuAs self-flux produce the well-known α-phase ThCr2Si2-type structure (Z=2), a new polymorphic phase of BaCu2As2 (\b{eta} phase) with a much larger c lattice parameter (Z=10), which could be considered an intergrowth of the ThCr2Si2- and CaBe2Ge2-typ…
▽ More
In this work, we have thoroughly studied the effects of flux composition and temperature on the crystal growth of the BaCu2As2 compound. While Pb and CuAs self-flux produce the well-known α-phase ThCr2Si2-type structure (Z=2), a new polymorphic phase of BaCu2As2 (\b{eta} phase) with a much larger c lattice parameter (Z=10), which could be considered an intergrowth of the ThCr2Si2- and CaBe2Ge2-type structures, has been discovered via Sn flux growth. We have characterized this structure through single-crystal X-ray diffraction, transmission electron microscopy (TEM), and scanning transmission electron microscopy (STEM) studies. Furthermore, we compare this new polymorphic intergrowth structure with the α-phase BaCu2As2 (ThCr2Si2 type with Z=2) and the \b{eta}-phase BaCu2Sb2 (intergrowth of ThCr2Si2 and CaBe2Ge2 types with Z=6), both with the same space group I4/mmm. Electrical transport studies reveal p-type carriers and magnetoresistivity up to 22% at 5 K and under a magnetic field of 7 T. Our work suggests a new route for the discovery of new polymorphic structures through flux and temperature control during material synthesis.
△ Less
Submitted 20 August, 2020;
originally announced August 2020.
-
TR-GAN: Topology Ranking GAN with Triplet Loss for Retinal Artery/Vein Classification
Authors:
Wenting Chen,
Shuang Yu,
Junde Wu,
Kai Ma,
Cheng Bian,
Chunyan Chu,
Linlin Shen,
Yefeng Zheng
Abstract:
Retinal artery/vein (A/V) classification lays the foundation for the quantitative analysis of retinal vessels, which is associated with potential risks of various cardiovascular and cerebral diseases. The topological connection relationship, which has been proved effective in improving the A/V classification performance for the conventional graph based method, has not been exploited by the deep le…
▽ More
Retinal artery/vein (A/V) classification lays the foundation for the quantitative analysis of retinal vessels, which is associated with potential risks of various cardiovascular and cerebral diseases. The topological connection relationship, which has been proved effective in improving the A/V classification performance for the conventional graph based method, has not been exploited by the deep learning based method. In this paper, we propose a Topology Ranking Generative Adversarial Network (TR-GAN) to improve the topology connectivity of the segmented arteries and veins, and further to boost the A/V classification performance. A topology ranking discriminator based on ordinal regression is proposed to rank the topological connectivity level of the ground-truth, the generated A/V mask and the intentionally shuffled mask. The ranking loss is further back-propagated to the generator to generate better connected A/V masks. In addition, a topology preserving module with triplet loss is also proposed to extract the high-level topological features and further to narrow the feature distance between the predicted A/V mask and the ground-truth. The proposed framework effectively increases the topological connectivity of the predicted A/V masks and achieves state-of-the-art A/V classification performance on the publicly available AV-DRIVE dataset.
△ Less
Submitted 29 July, 2020;
originally announced July 2020.
-
Difficulty-aware Glaucoma Classification with Multi-Rater Consensus Modeling
Authors:
Shuang Yu,
Hong-Yu Zhou,
Kai Ma,
Cheng Bian,
Chunyan Chu,
Hanruo Liu,
Yefeng Zheng
Abstract:
Medical images are generally labeled by multiple experts before the final ground-truth labels are determined. Consensus or disagreement among experts regarding individual images reflects the gradeability and difficulty levels of the image. However, when being used for model training, only the final ground-truth label is utilized, while the critical information contained in the raw multi-rater grad…
▽ More
Medical images are generally labeled by multiple experts before the final ground-truth labels are determined. Consensus or disagreement among experts regarding individual images reflects the gradeability and difficulty levels of the image. However, when being used for model training, only the final ground-truth label is utilized, while the critical information contained in the raw multi-rater gradings regarding the image being an easy/hard case is discarded. In this paper, we aim to take advantage of the raw multi-rater gradings to improve the deep learning model performance for the glaucoma classification task. Specifically, a multi-branch model structure is proposed to predict the most sensitive, most specifical and a balanced fused result for the input images. In order to encourage the sensitivity branch and specificity branch to generate consistent results for consensus labels and opposite results for disagreement labels, a consensus loss is proposed to constrain the output of the two branches. Meanwhile, the consistency/inconsistency between the prediction results of the two branches implies the image being an easy/hard case, which is further utilized to encourage the balanced fusion branch to concentrate more on the hard cases. Compared with models trained only with the final ground-truth labels, the proposed method using multi-rater consensus information has achieved superior performance, and it is also able to estimate the difficulty levels of individual input images when making the prediction.
△ Less
Submitted 29 July, 2020;
originally announced July 2020.
-
When Classical Chinese Meets Machine Learning: Explaining the Relative Performances of Word and Sentence Segmentation Tasks
Authors:
Chao-Lin Liu,
Chang-Ting Chu,
Wei-Ting Chang,
Ti-Yong Zheng
Abstract:
We consider three major text sources about the Tang Dynasty of China in our experiments that aim to segment text written in classical Chinese. These corpora include a collection of Tang Tomb Biographies, the New Tang Book, and the Old Tang Book. We show that it is possible to achieve satisfactory segmentation results with the deep learning approach. More interestingly, we found that some of the re…
▽ More
We consider three major text sources about the Tang Dynasty of China in our experiments that aim to segment text written in classical Chinese. These corpora include a collection of Tang Tomb Biographies, the New Tang Book, and the Old Tang Book. We show that it is possible to achieve satisfactory segmentation results with the deep learning approach. More interestingly, we found that some of the relative superiority that we observed among different designs of experiments may be explainable. The relative relevance among the training corpora provides hints/explanation for the observed differences in segmentation results that were achieved when we employed different combinations of corpora to train the classifiers.
△ Less
Submitted 21 July, 2020;
originally announced July 2020.
-
Fermion-boson many-body interplay in a frustrated kagome paramagnet
Authors:
J. -X. Yin,
Nana Shumiya,
Sougata Mardanya,
Qi Wang,
S. S. Zhang,
Hung-Ju Tien,
Daniel Multer,
Yuxiao Jiang,
Guangming Cheng,
Nan Yao,
Shangfei Wu,
Desheng Wu,
Liangzi Deng,
Zhipeng Ye,
Rui He,
Guoqing Chang,
Zhonghao Liu,
Kun Jiang,
Ziqiang Wang,
Titus Neupert,
Amit Agarwal,
Tay-Rong Chang,
Ching-Wu Chu,
Hechang Lei,
M. Zahid Hasan
Abstract:
Kagome-net, appearing in areas of fundamental physics, materials, photonic and cold-atom systems, hosts frustrated fermionic and bosonic excitations. However, it is extremely rare to find a system to study both fermionic and bosonic modes to gain insights into their many-body interplay. Here we use state-of-the-art scanning tunneling microscopy and spectroscopy to discover unusual electronic coupl…
▽ More
Kagome-net, appearing in areas of fundamental physics, materials, photonic and cold-atom systems, hosts frustrated fermionic and bosonic excitations. However, it is extremely rare to find a system to study both fermionic and bosonic modes to gain insights into their many-body interplay. Here we use state-of-the-art scanning tunneling microscopy and spectroscopy to discover unusual electronic coupling to flat-band phonons in a layered kagome paramagnet. Our results reveal the kagome structure with unprecedented atomic resolution and observe the striking bosonic mode interacting with dispersive kagome electrons near the Fermi surface. At this mode energy, the fermionic quasi-particle dispersion exhibits a pronounced renormalization, signaling a giant coupling to bosons. Through a combination of self-energy analysis, first-principles calculation, and a lattice vibration model, we present evidence that this mode arises from the geometrically frustrated phonon flat-band, which is the lattice analog of kagome electron flat-band. Our findings provide the first example of kagome bosonic mode (flat-band phonon) in electronic excitations and its strong interaction with fermionic degrees of freedom in kagome-net materials.
△ Less
Submitted 1 July, 2020;
originally announced July 2020.
-
Optimization of the JUNO liquid scintillator composition using a Daya Bay antineutrino detector
Authors:
Daya Bay,
JUNO collaborations,
:,
A. Abusleme,
T. Adam,
S. Ahmad,
S. Aiello,
M. Akram,
N. Ali,
F. P. An,
G. P. An,
Q. An,
G. Andronico,
N. Anfimov,
V. Antonelli,
T. Antoshkina,
B. Asavapibhop,
J. P. A. M. de André,
A. Babic,
A. B. Balantekin,
W. Baldini,
M. Baldoncini,
H. R. Band,
A. Barresi,
E. Baussan
, et al. (642 additional authors not shown)
Abstract:
To maximize the light yield of the liquid scintillator (LS) for the Jiangmen Underground Neutrino Observatory (JUNO), a 20 t LS sample was produced in a pilot plant at Daya Bay. The optical properties of the new LS in various compositions were studied by replacing the gadolinium-loaded LS in one antineutrino detector. The concentrations of the fluor, PPO, and the wavelength shifter, bis-MSB, were…
▽ More
To maximize the light yield of the liquid scintillator (LS) for the Jiangmen Underground Neutrino Observatory (JUNO), a 20 t LS sample was produced in a pilot plant at Daya Bay. The optical properties of the new LS in various compositions were studied by replacing the gadolinium-loaded LS in one antineutrino detector. The concentrations of the fluor, PPO, and the wavelength shifter, bis-MSB, were increased in 12 steps from 0.5 g/L and <0.01 mg/L to 4 g/L and 13 mg/L, respectively. The numbers of total detected photoelectrons suggest that, with the optically purified solvent, the bis-MSB concentration does not need to be more than 4 mg/L. To bridge the one order of magnitude in the detector size difference between Daya Bay and JUNO, the Daya Bay data were used to tune the parameters of a newly developed optical model. Then, the model and tuned parameters were used in the JUNO simulation. This enabled to determine the optimal composition for the JUNO LS: purified solvent LAB with 2.5 g/L PPO, and 1 to 4 mg/L bis-MSB.
△ Less
Submitted 1 July, 2020;
originally announced July 2020.
-
Search For Electron-Antineutrinos Associated With Gravitational-Wave Events GW150914, GW151012, GW151226, GW170104, GW170608, GW170814, and GW170817 at Daya Bay
Authors:
F. P. An,
A. B. Balantekin,
H. R. Band,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
Y. Y. Ding,
M. V. Diwan,
T. Dohnal,
J. Dove,
M. Dvorak
, et al. (161 additional authors not shown)
Abstract:
Providing a possible connection between neutrino emission and gravitational-wave (GW) bursts is important to our understanding of the physical processes that occur when black holes or neutron stars merge. In the Daya Bay experiment, using data collected from December 2011 to August 2017, a search has been performed for electron-antineutrino signals coinciding with detected GW events, including GW1…
▽ More
Providing a possible connection between neutrino emission and gravitational-wave (GW) bursts is important to our understanding of the physical processes that occur when black holes or neutron stars merge. In the Daya Bay experiment, using data collected from December 2011 to August 2017, a search has been performed for electron-antineutrino signals coinciding with detected GW events, including GW150914, GW151012, GW151226, GW170104, GW170608, GW170814, and GW170817. We used three time windows of $\mathrm{\pm 10~s}$, $\mathrm{\pm 500~s}$, and $\mathrm{\pm 1000~s}$ relative to the occurrence of the GW events, and a neutrino energy range of 1.8 to 100 MeV to search for correlated neutrino candidates. The detected electron-antineutrino candidates are consistent with the expected background rates for all the three time windows. Assuming monochromatic spectra, we found upper limits (90% confidence level) on electron-antineutrino fluence of $(1.13~-~2.44) \times 10^{11}~\rm{cm^{-2}}$ at 5 MeV to $8.0 \times 10^{7}~\rm{cm^{-2}}$ at 100 MeV for the three time windows. Under the assumption of a Fermi-Dirac spectrum, the upper limits were found to be $(5.4~-~7.0)\times 10^{9}~\rm{cm^{-2}}$ for the three time windows.
△ Less
Submitted 14 September, 2020; v1 submitted 27 June, 2020;
originally announced June 2020.
-
Siegel domains over Finsler symmetric cones
Authors:
Cho-Ho Chu
Abstract:
Let $Ω$ be a proper open cone in a real Banach space $V$. We show that the tube domain $V \oplus iΩ$ over $Ω$ is biholomorphic to a bounded symmetric domain if and only if $Ω$ is a normal linearly homogeneous Finsler symmetric cone, which is equivalent to the condition that $V$ is a unital JB-algebra in an equivalent norm and $Ω$ is the interior of $\{v^2: v\in V\}$.
Let $Ω$ be a proper open cone in a real Banach space $V$. We show that the tube domain $V \oplus iΩ$ over $Ω$ is biholomorphic to a bounded symmetric domain if and only if $Ω$ is a normal linearly homogeneous Finsler symmetric cone, which is equivalent to the condition that $V$ is a unital JB-algebra in an equivalent norm and $Ω$ is the interior of $\{v^2: v\in V\}$.
△ Less
Submitted 11 June, 2020;
originally announced June 2020.
-
Weyl Anomaly induced Fermi Condensation and Holography
Authors:
Chong-Sun Chu,
Rong-Xin Miao
Abstract:
Recently it is found that, due to Weyl anomaly, a background scalar field induces a non-trivial Fermi condensation for theories with Yukawa couplings. For simplicity, the paper consider only scalar type Yukawa coupling and, in the BCFT case, only for a specific boundary condition. In these cases, the Weyl anomaly takes on a simple special form. In this paper, we generalize the results to more gene…
▽ More
Recently it is found that, due to Weyl anomaly, a background scalar field induces a non-trivial Fermi condensation for theories with Yukawa couplings. For simplicity, the paper consider only scalar type Yukawa coupling and, in the BCFT case, only for a specific boundary condition. In these cases, the Weyl anomaly takes on a simple special form. In this paper, we generalize the results to more general situations. First, we obtain general expressions of Weyl anomaly due to a background scalar and pseudo scalar field in general 4d BCFTs. Then, we derive the general form of Fermi condensation from the Weyl anomaly. It is remarkable that, in general, Fermi condensation is non-zero even if there was not a non-vanishing scalar field background. Finally, we verify our results with free BCFT with Yukawa coupling to scalar and pseudo-scalar background potential with general chiral bag boundary condition and with holographic BCFT. In particular, we obtain the shape and curvature dependence of the Fermi condensate from the holographic one point function.
△ Less
Submitted 26 August, 2020; v1 submitted 26 May, 2020;
originally announced May 2020.
-
Keyed Non-Parametric Hypothesis Tests
Authors:
Yao Cheng,
Cheng-Kang Chu,
Hsiao-Ying Lin,
Marius Lombard-Platet,
David Naccache
Abstract:
The recent popularity of machine learning calls for a deeper understanding of AI security. Amongst the numerous AI threats published so far, poisoning attacks currently attract considerable attention. In a poisoning attack the opponent partially tampers the dataset used for learning to mislead the classifier during the testing phase.
This paper proposes a new protection strategy against poisonin…
▽ More
The recent popularity of machine learning calls for a deeper understanding of AI security. Amongst the numerous AI threats published so far, poisoning attacks currently attract considerable attention. In a poisoning attack the opponent partially tampers the dataset used for learning to mislead the classifier during the testing phase.
This paper proposes a new protection strategy against poisoning attacks. The technique relies on a new primitive called keyed non-parametric hypothesis tests allowing to evaluate under adversarial conditions the training input's conformance with a previously learned distribution $\mathfrak{D}$. To do so we use a secret key $κ$ unknown to the opponent.
Keyed non-parametric hypothesis tests differs from classical tests in that the secrecy of $κ$ prevents the opponent from misleading the keyed test into concluding that a (significantly) tampered dataset belongs to $\mathfrak{D}$.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.