Search | arXiv e-print repository

Open-Source Conversational AI with SpeechBrain 1.0

Authors: Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu, Sangeet Sagar, et al. (5 additional authors not shown)

Abstract: SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more. It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper presents SpeechBrain 1.0, a significant milestone in the evolution of the toolkit, which now has over 200 recipes for speech, audio, and language processing tasks, and more than 100 models available on Hugging Face. SpeechBrain 1.0 introduces new technologies to support diverse learning modalities, Large Language Model (LLM) integration, and advanced decoding strategies, along with novel models, tasks, and modalities. It also includes a new benchmark repository, offering researchers a unified platform for evaluating models across diverse tasks.

Submitted 2 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

Comments: Submitted to JMLR (Machine Learning Open Source Software)

arXiv:2404.05142 [pdf]

Moiré superlattices of antimonene on a Bi(111) substrate with van Hove singularity and Rashba-type spin polarization

Authors: Tomonori Nakamura, Yitao Chen, Ryohei Nemoto, Wenxuan Qian, Yuto Fukushima, Kaishu Kawaguchi, Ryo Mori, Takeshi Kondo, Youhei Yamaji, Shunsuke Tsuda, Koichiro Yaji, Takashi Uchihashi

Abstract: Moiré superlattices consisting of two-dimensional (2D) materials have attracted immense attention because of emergent phenomena such as flat band-induced Mott insulating states and unconventional superconductivity. However, the effects of spin-orbit coupling (SOC) on them have not been fully explored yet. Here we show that single- and double-bilayer (BL) Sb honeycomb lattices, referred to as antimonene, form moiré superlattices on a Bi(111) substrate due to a lattice mismatch. Scanning tunnelling microscopy (STM) measurements reveal the presence of spectral peaks near the Fermi level, which are spatially modulated with the moiré period. Angle-resolved photoemission spectroscopy (ARPES) combined with density functional theory (DFT) calculations clarifies the surface band structure with saddle points near the Fermi level, which allows us to attribute the observed STM spectral peaks to the van Hove singularity. Spin-resolved ARPES measurements also show that the observed surface states are Rashba-type spin-polarized. The present work has significant implications that Fermi surface instability and symmetry breaking may emerge at low temperatures, where spin degree of freedom and electron correlation will also play important roles.

Submitted 7 April, 2024; originally announced April 2024.

Comments: 30 pages including Supplementary Information

arXiv:2404.00907 [pdf, ps, other]

The long-time behavior of solutions of a three-component reaction-diffusion model for the population dynamics of farmers and hunter-gatherers: the different motility case

Authors: Dongyuan Xiao, Ryunosuke Mori

Abstract: In this paper, we investigate the spreading properties of solutions of the Aoki-Shida-Shigesada model. This model is a three-component reaction-diffusion system that delineates the geographical expansion of an initially localized population of farmers into a region occupied by hunter-gatherers. By considering the scenario where farmers and hunter-gatherers possess identical motility, Aoki et al. previously concluded, through numerical simulations and some formal linearization arguments, that there are four different types of spreading behaviors depending on the parameter values. In this paper, we concentrate on the general case for which farmers and hunter-gatherers possess different motility. By providing more sophisticated estimates, we not only theoretically justify the spreading speed of the Aoki-Shida-Shigesada model, but also establish sharp estimates for the long-time behaviors of solutions. These estimates enable us to validate all four types of spreading behaviors observed by Aoki et al.

Submitted 1 April, 2024; originally announced April 2024.

arXiv:2402.05874 [pdf, ps, other]

Complexity of graph-state preparation by Clifford circuits

Authors: Soh Kumabe, Ryuhei Mori, Yusei Yoshimura

Abstract: In this work, we study the complexity of graph-state preparation. We consider general quantum algorithms consisting of the Clifford operations on at most two qubits for graph-state preparations. We define the CZ-complexity of graph state $|G\rangle$ as the minimum number of two-qubit Clifford operations (excluding single-qubit Clifford operations) for generating $|G\rangle$ from a trivial state $|0\rangle^{\otimes n}$. We first prove that a graph state $|G\rangle$ is generated by at most $t$ two-qubit Clifford operations if and only if $|G\rangle$ is generated by at most $t$ controlled-Z (CZ) operations. We next prove that a graph state $|G\rangle$ is generated from another graph state $|H\rangle$ by $t$ CZ operations if and only if the graph $G$ is generated from $H$ by some combinatorial graph transformation with cost $t$. As the main results, we show a connection between the CZ-complexity of graph state $|G\rangle$ and the rank-width of the graph $G$. Indeed, we prove that for any graph $G$ with $n$ vertices and rank-width $r$, 1. The CZ-complexity of $|G\rangle$ is $O(rn\log n)$. 2. If $G$ is connected, the CZ-complexity of $|G\rangle$ is at least $n + r - 2$. We also show the existence of graph states whose CZ-complexities are close to the upper and lower bounds. Finally, we present quantum algorithms preparing $|G\rangle$ with $O(n)$ CZ-complexity when $G$ is included in special classes of graphs, namely, cographs, interval graphs, permutation graphs and circle graphs.

Submitted 8 February, 2024; originally announced February 2024.

Comments: 23 pages

arXiv:2311.05953 [pdf, other]

doi 10.1103/PhysRevB.109.L241114

Photoemission angular distribution beyond the single wavevector description of photoelectron final states

Authors: Hiroaki Tanaka, Shota Okazaki, Yuto Fukushima, Kaishu Kawaguchi, Ayumi Harasawa, Takushi Iimori, Fumio Komori, Masashi Arita, Ryo Mori, Kenta Kuroda, Takao Sasagawa, Takeshi Kondo

Abstract: We develop a simulation procedure for angle-resolved photoemission spectroscopy (ARPES), where a photoelectron wave function is set to be an outgoing plane wave in a vacuum associated with the emitted photoelectron wave packet. ARPES measurements on the transition metal dichalcogenide $1T$-$\mathrm{Ti}\mathrm{S}_2$ are performed, and our simulations exhibit good agreement with experiments. Analysis of our calculated final state wave functions quantitatively visualizes that they include various waves due to the boundary condition and the uneven crystal potential. These results show that a more detailed investigation of the photoelectron final states is necessary to fully explain the photon-energy- and light-polarization-dependent ARPES spectra.

Submitted 22 June, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

Comments: 7+18 pages, 4+18 figures

Journal ref: Phys. Rev. B 109, L241114 (2024)

arXiv:2309.17294 [pdf, other]

Time-space dynamics of income segregation: a case study of Milan's neighbourhoods

Authors: Lavinia Rossi Mori, Vittorio Loreto, Riccardo Di Clemente

Abstract: Traditional approaches to urban income segregation focus on static residential patterns, often failing to capture the dynamic nature of social mixing at the neighborhood level. Leveraging high-resolution location-based data from mobile phones, we capture the interplay of three different income groups (high, medium, low) based on their daily routines. We propose a three-dimensional space to analyze social mixing, which is embedded in the temporal dynamics of urban activities. This framework offers a more detailed perspective on social interactions, closely linked to the geographical features of each neighborhood. While residential areas fail to encourage social mixing in the nighttime, the working hours foster inclusion, with the city center showing a heightened level of interaction. As evening sets in, leisure areas emerge as potential facilitators for social interactions, depending on urban features such as public transport and a variety of Points Of Interest. These characteristics significantly modulate the magnitude and type of social stratification involved in social mixing, also underscoring the significance of urban design in either bridging or widening socio-economic divides.

Submitted 28 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

arXiv:2308.01024 [pdf, other]

doi 10.3390/technologies11050143

Dual-Matrix Domain-Wall: A Novel Technique for Generating Permutations by QUBO and Ising Models with Quadratic Sizes

Authors: Koji Nakano, Shunsuke Tsukiyama, Yasuaki Ito, Takashi Yazane, Junko Yano, Takumi Kato, Shiro Ozaki, Rie Mori, Ryota Katsuki

Abstract: The Ising model is defined by an objective function using a quadratic formula of qubit variables. The problem of an Ising model aims to determine the qubit values of the variables that minimize the objective function, and many optimization problems can be reduced to this problem. In this paper, we focus on optimization problems related to permutations, where the goal is to find the optimal permutation out of the $n!$ possible permutations of $n$ elements. To represent these problems as Ising models, a commonly employed approach is to use a kernel that utilizes one-hot encoding to find any one of the $n!$ permutations as the optimal solution. However, this kernel contains a large number of quadratic terms and high absolute coefficient values. The main contribution of this paper is the introduction of a novel permutation encoding technique called dual-matrix domain-wall, which significantly reduces the number of quadratic terms and the maximum absolute coefficient values in the kernel. Surprisingly, our dual-matrix domain-wall encoding reduces the quadratic term count and maximum absolute coefficient values from $n^3-n^2$ and $2n-4$ to $6n^2-12n+4$ and $2$, respectively. We also demonstrate the applicability of our encoding technique to partial permutations and Quadratic Unconstrained Binary Optimization (QUBO) models. Furthermore, we discuss a family of permutation problems that can be efficiently implemented using Ising/QUBO models with our dual-matrix domain-wall encoding.

Submitted 1 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

Comments: 26 pages, 9 figures
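
To make the $n^3-n^2$ quadratic-term count quoted in the abstract above concrete, here is a minimal Python sketch. It assumes that the "commonly employed" one-hot kernel is the usual row-and-column penalty $\sum_i (1-\sum_j x_{ij})^2 + \sum_j (1-\sum_i x_{ij})^2$ over binary variables $x_{ij}$; the abstract does not spell the kernel out, and the function names below are illustrative only. The sketch does not implement the dual-matrix domain-wall encoding itself, and it does not check the coefficient bound $2n-4$.

```python
from itertools import permutations


def one_hot_permutation_qubo(n):
    """Build the assumed one-hot permutation kernel as a QUBO.

    Variables are (row, col) pairs of an n x n binary matrix. The kernel is the
    sum over every row and every column of (1 - sum of its variables)^2.
    Returns (linear, quadratic, constant) coefficient containers.
    """
    linear, quadratic, constant = {}, {}, 0
    lines = [[(i, j) for j in range(n)] for i in range(n)]   # rows
    lines += [[(i, j) for i in range(n)] for j in range(n)]  # columns
    for line in lines:
        constant += 1
        for a, var in enumerate(line):
            # For binary x, -2x + x^2 collapses to -x.
            linear[var] = linear.get(var, 0) - 1
            for other in line[a + 1:]:
                key = (var, other)
                quadratic[key] = quadratic.get(key, 0) + 2
    return linear, quadratic, constant


def energy(assignment, linear, quadratic, constant):
    """Evaluate the QUBO on a dict mapping each variable to 0 or 1."""
    total = constant
    total += sum(c * assignment[v] for v, c in linear.items())
    total += sum(c * assignment[u] * assignment[v] for (u, v), c in quadratic.items())
    return total


def permutation_assignment(perm):
    """Encode a permutation as an n x n one-hot (permutation) matrix."""
    n = len(perm)
    return {(i, j): int(perm[i] == j) for i in range(n) for j in range(n)}


if __name__ == "__main__":
    for n in range(2, 7):
        _, quadratic, _ = one_hot_permutation_qubo(n)
        print(n, len(quadratic), n**3 - n**2)
```

Under this assumed kernel, every permutation matrix has energy 0 and every other assignment has strictly positive energy, which is what lets any of the $n!$ permutations appear as an optimal solution.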

arXiv:2305.18102 [pdf, other]

Linearly dispersive bands at the onset of correlations in K$_x$C$_{60}$ films

Authors: Ping Ai, Luca Moreschini, Ryo Mori, Drew W. Latzke, Jonathan D. Denlinger, Alex Zettl, Claudia Ojeda-Aristizabal, Alessandra Lanzara

Abstract: Molecular crystals are a flexible platform to induce novel electronic phases. Due to the weak forces between molecules, intermolecular distances can be varied over relatively larger ranges than interatomic distances in atomic crystals. On the other hand, the hopping terms are generally small, which results in narrow bands, strong correlations and heavy electrons. Here, by growing K$_x$C$_{60}$ fullerides on hexagonal layered Bi$_2$Se$_3$, we show that upon doping the series undergoes a Mott transition from a molecular insulator to a correlated metal, and an in-gap state evolves into highly dispersive Dirac-like fermions at half filling, where superconductivity occurs. This picture challenges the commonly accepted description of the low energy quasiparticles as appearing from a gradual electron doping of the conduction states, and suggests an intriguing parallel with the more famous family of the cuprate superconductors. More generally, it indicates that molecular crystals offer a viable route to engineer electron-electron interactions.

Submitted 29 May, 2023; originally announced May 2023.

Comments: 5 pages, 4 figures. Accepted at Physical Review Research

arXiv:2301.09371 [pdf, other]

Understanding the Frequency Dependence of Capacitance Measurements of Irradiated Silicon Detectors

Authors: Sven Mägdefessel, Riccardo Mori, Niels Sorgenfrei, Ulrich Parzefall

Abstract: Capacitance-voltage (CV) measurements are a widely used technique in silicon detector physics. They give direct information about the full depletion voltage and the effective doping concentration. However, for highly irradiated sensors, the measured data differ significantly from the usual shape, which makes the extraction of the aforementioned parameters less precise or even impossible. We present an explanation for the observed frequency dependence and, based on that, a method to extract the desired sensor parameters.

Submitted 23 January, 2023; originally announced January 2023.

arXiv:2212.11507 [pdf, other]

Supervised Anomaly Detection Method Combining Generative Adversarial Networks and Three-Dimensional Data in Vehicle Inspections

Authors: Yohei Baba, Takuro Hoshi, Ryosuke Mori, Gaurang Gavai

Abstract: The external visual inspections of rolling stock's underfloor equipment are currently being performed via human visual inspection. In this study, we attempt to partly automate visual inspection by investigating anomaly inspection algorithms that use image processing technology. As railroad maintenance studies tend to have little anomaly data, unsupervised learning methods are usually preferred for anomaly detection; however, training cost and accuracy are still challenges. Additionally, a researcher created anomalous images from normal images by adding noise, etc., but the anomaly targeted in this study is the rotation of piping cocks, which was difficult to create using noise. Therefore, in this study, we propose a new method that uses style conversion via generative adversarial networks on three-dimensional computer graphics and imitates anomaly images to apply anomaly detection based on supervised learning. The geometry-consistent style conversion model was used to convert the image, and because of this the color and texture of the image were successfully made to imitate the real image while maintaining the anomalous shape. Using the generated anomaly images as supervised data, the anomaly detection model can be easily trained without complex adjustments and successfully detects anomalies.

Submitted 22 December, 2022; originally announced December 2022.

Comments: 6 pages, 12 figures

arXiv:2207.03069 [pdf, other]

doi 10.1109/IPDPSW59300.2023.00060

Diverse Adaptive Bulk Search: a Framework for Solving QUBO Problems on Multiple GPUs

Authors: Koji Nakano, Daisuke Takafuji, Yasuaki Ito, Takashi Yazane, Junko Yano, Shiro Ozaki, Ryota Katsuki, Rie Mori

Abstract: Quadratic Unconstrained Binary Optimization (QUBO) is a combinatorial optimization to find an optimal binary solution vector that minimizes the energy value defined by a quadratic formula of binary variables in the vector. As many NP-hard problems can be reduced to QUBO problems, considerable research has gone into developing QUBO solvers running on various computing platforms such as quantum devices, ASICs, FPGAs, GPUs, and optical fibers. This paper presents a framework called Diverse Adaptive Bulk Search (DABS), which has the potential to find optimal solutions of many types of QUBO problems. Our DABS solver employs a genetic algorithm-based search algorithm featuring three diverse strategies: multiple search algorithms, multiple genetic operations, and multiple solution pools. During the execution of the solver, search algorithms and genetic operations that succeeded in finding good solutions are automatically selected to obtain better solutions. Moreover, search algorithms traverse between different solution pools to find good solutions. We have implemented our DABS solver to run on multiple GPUs. Experimental evaluations using eight NVIDIA A100 GPUs confirm that our DABS solver succeeds in finding optimal or potentially optimal solutions for three types of QUBO problems.

Submitted 17 March, 2023; v1 submitted 6 July, 2022; originally announced July 2022.
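
The QUBO problem statement in the abstract above (minimize an energy given by a quadratic formula of binary variables) can be illustrated with a short Python sketch. This is not the DABS solver: it is plain exhaustive enumeration, feasible only for a handful of variables. It assumes the coefficients are stored in an upper-triangular matrix `W`, with diagonal entries acting as linear terms; the matrix and function names are hypothetical.

```python
from itertools import product


def qubo_energy(x, W):
    """Energy sum_{i <= j} W[i][j] * x[i] * x[j] for a binary vector x."""
    n = len(x)
    return sum(W[i][j] * x[i] * x[j] for i in range(n) for j in range(i, n))


def brute_force_qubo(W):
    """Return a minimum-energy binary vector by trying all 2^n candidates."""
    n = len(W)
    return min(product((0, 1), repeat=n), key=lambda x: qubo_energy(x, W))


if __name__ == "__main__":
    # Toy instance: each variable is rewarded (-1) but adjacent pairs are penalized (+2).
    W = [
        [-1, 2, 0],
        [0, -1, 2],
        [0, 0, -1],
    ]
    best = brute_force_qubo(W)
    print(best, qubo_energy(best, W))
```

Because the search space doubles with every added variable, enumeration like this stops being practical almost immediately, which is the motivation for heuristic solvers of the kind the abstract describes.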

arXiv:2205.15478 [pdf, ps, other]

doi 10.1109/TCOMM.2023.3244924

Quantum Algorithm for Higher-Order Unconstrained Binary Optimization and MIMO Maximum Likelihood Detection

Authors: Masaya Norimoto, Ryuhei Mori, Naoki Ishikawa

Abstract: In this paper, we propose a quantum algorithm that supports a real-valued higher-order unconstrained binary optimization (HUBO) problem. This algorithm is based on the Grover adaptive search that originally supported HUBO with integer coefficients. Next, as an application example, we formulate multiple-input multiple-output maximum likelihood detection as a HUBO problem with real-valued coefficients, where we use the Gray-coded bit-to-symbol mapping specified in the 5G standard. The proposed approach allows us to construct an efficient quantum circuit for the detection problem and to analyze specific numbers of required qubits and quantum gates, whereas other conventional studies have assumed that such a circuit is feasible as a quantum oracle. To further accelerate the quantum algorithm, we also derive a probability distribution of the objective function value and determine a unique threshold to sample better states. Assuming future fault-tolerant quantum computing, our proposed algorithm has the potential for significantly reducing query complexity in the classical domain and providing a quadratic speedup in the quantum domain.

Submitted 16 February, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

Comments: 14 pages, 14 figures, accepted for publication in IEEE Transactions on Communications

arXiv:2202.09684 [pdf, other]

Energy reconstruction of hadronic showers at the CERN PS and SPS using the Semi-Digital Hadronic Calorimeter

Authors: I. Laktineh, B. Liu, D. Boumediene, Y. W. Baek, D-W. Kim, S. C. Lee, B. G. Min, S. W. Park, Y. Deguchi, K. Kawagoe, Y. Miura, R. Mori, I. Sekiya, T. Suehara, T. Yoshioka, L. Caponetto, C. Combaret, G. Garillot, G. Grenier, J-C. Ianigro, T. Kurca, I. Laktineh, B. Liu, B. Li, N. Lumb, et al. (53 additional authors not shown)

Abstract: The CALICE Semi-Digital Hadronic CALorimeter (SDHCAL) is the first technological prototype in a family of high-granularity calorimeters developed by the CALICE Collaboration to equip the experiments of future lepton colliders. The SDHCAL is a sampling calorimeter using stainless steel for absorber and Glass Resistive Plate Chambers (GRPC) as a sensitive medium. The GRPC are read out by 1 cm $\times$ 1 cm pickup pads combined to a multi-threshold electronics. The prototype was exposed to hadron beams in both the CERN PS and the SPS beamlines in 2015 allowing the test of the SDHCAL in a large energy range from 3 GeV to 80 GeV. After introducing the method used to select the hadrons of our data and reject the muon and electron contamination, we present the energy reconstruction approach that we apply to the data collected from both beamlines and we discuss the response linearity and the energy resolution of the SDHCAL. The results obtained in the two beamlines confirm the excellent SDHCAL performance observed with the data collected with the same prototype in the SPS beamline in 2012. They also show the stability of the SDHCAL in different beam conditions and different time periods.

Submitted 19 February, 2022; originally announced February 2022.

Comments: 21 pages, 23 figures

Report number: CALICE-PUB-2022-001

arXiv:2111.03981 [pdf, other]

doi 10.3390/ma15082744

Observation of a flat and extended surface state in a topological semimetal

Authors: Ryo Mori, Kefeng Wang, Takahiro Morimoto, Jonathan D. Denlinger, Johnpierre Paglione, Alessandra Lanzara

Abstract: A topological flatband, also known as drumhead states, is an ideal platform to drive new exotic topological quantum phases. Using angle-resolved photoemission spectroscopy experiments, we reveal the emergence of a highly localized possible drumhead surface state in a topological semimetal BaAl4 and provide its full energy and momentum space topology. We find that the observed surface state is highly localized in momentum, inside a square-shaped bulk Dirac nodal loop, and in energy, leading to a flat band and a peak in the density of states. These results establish this class of materials as a possible experimental realization of drumhead surface states and provide an important reference for future studies of fundamental physics of topological quantum phase transition.

Submitted 6 November, 2021; originally announced November 2021.

arXiv:2109.06380 [pdf, other]

Brakke's formulation of velocity and the second order regularity property

Authors: Ryunosuke Mori, Eita Tomimatsu, Yoshihiro Tonegawa

Abstract: Suppose that a family of $k$-dimensional surfaces in $\mathbb R^n$ evolves by the motion law of $v=h+u^\perp$ in the sense of Brakke's formulation of velocity, where $v$ is the normal velocity vector, $h$ is the generalized mean curvature vector and $u^\perp$ is the normal projection of a given vector field $u$ in a dimensionally sharp integrability class. When the flow is locally close to a time-independent $k$-dimensional plane in a weak sense of measure in space-time, it is represented as a graph of a $C^{1,\alpha}$ function over the plane. On the other hand, it is not known if the graph satisfies the PDE of $v=h+u^\perp$ pointwise in general. For this problem, when $k=n-1$ and under the additional assumption that the distributional time derivative of the graph is a signed Radon measure, it is proved that the graph satisfies the PDE pointwise. An application to a short-time existence theorem for a surface evolution problem is given.

Submitted 12 January, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

Comments: 16 pages, 1 figure

MSC Class: 53E10

arXiv:2107.03948 [pdf, other]

Lower bounds on the error probability of multiple quantum channel discrimination by the Bures angle and the trace distance

Authors: Ryo Ito, Ryuhei Mori

Abstract: Quantum channel discrimination is a fundamental problem in quantum information science. In this study, we consider general quantum channel discrimination problems, and derive the lower bounds of the error probability. Our lower bounds are based on the triangle inequalities of the Bures angle and the trace distance. As a consequence of the lower bound based on the Bures angle, we prove the optimality of Grover's search if the number of marked elements is fixed to some integer $\ell$. This result generalizes Zalka's result for $\ell=1$. We also present several numerical results in which our lower bounds based on the trace distance outperform recently obtained lower bounds.

Submitted 1 August, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

Comments: 17 pages, 6 figures

arXiv:2106.04624 [pdf, other]

SpeechBrain: A General-Purpose Speech Toolkit

Authors: Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio

Abstract: SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the research and development of neural speech processing technologies by being simple, flexible, user-friendly, and well-documented. This paper describes the core architecture designed to support several tasks of common interest, allowing users to naturally conceive, compare and share novel speech processing pipelines. SpeechBrain achieves competitive or state-of-the-art performance in a wide range of speech benchmarks. It also provides training recipes, pretrained models, and inference scripts for popular speech datasets, as well as tutorials which allow anyone with basic Python proficiency to familiarize themselves with speech technologies.

Submitted 8 June, 2021; originally announced June 2021.

Comments: Preprint

arXiv:2104.14384 [pdf, other]

Quantum speedups for dynamic programming on $n$-dimensional lattice graphs

Authors: Adam Glos, Martins Kokainis, Ryuhei Mori, Jevgēnijs Vihrovs

Abstract: Motivated by the quantum speedup for dynamic programming on the Boolean hypercube by Ambainis et al. (2019), we investigate which graphs admit a similar quantum advantage. In this paper, we examine a generalization of the Boolean hypercube graph, the $n$-dimensional lattice graph $Q(D,n)$ with vertices in $\{0,1,\ldots,D\}^n$. We study the complexity of the following problem: given a subgraph $G$ of $Q(D,n)$ via query access to the edges, determine whether there is a path from $0^n$ to $D^n$. While the classical query complexity is $\widetilde{\Theta}((D+1)^n)$, we show a quantum algorithm with complexity $\widetilde O(T_D^n)$, where $T_D < D+1$. The first few values of $T_D$ are $T_1 \approx 1.817$, $T_2 \approx 2.660$, $T_3 \approx 3.529$, $T_4 \approx 4.421$, $T_5 \approx 5.332$. We also prove that $T_D \geq \frac{D+1}{\mathrm e}$, thus for general $D$, this algorithm does not provide, for example, a speedup, polynomial in the size of the lattice. While the presented quantum algorithm is a natural generalization of the known quantum algorithm for $D=1$ by Ambainis et al., the analysis of complexity is rather complicated. For the precise analysis, we use the saddle-point method, which is a common tool in analytic combinatorics, but has not been widely used in this field. We then show an implementation of this algorithm with time complexity $\text{poly}(n)^{\log n} T_D^n$, and apply it to the Set Multicover problem. In this problem, $m$ subsets of $[n]$ are given, and the task is to find the smallest number of these subsets that cover each element of $[n]$ at least $D$ times. While the time complexity of the best known classical algorithm is $O(m(D+1)^n)$, the time complexity of our quantum algorithm is $\text{poly}(m,n)^{\log n} T_D^n$.

Submitted 7 May, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

arXiv:2103.08076 [pdf, other]

doi 10.1038/s41535-021-00404-8

Correlation-Driven Electron-Hole Asymmetry in Graphene Field Effect Devices

Authors: Nicholas Dale, Ryo Mori, M. Iqbal Bakti Utama, Jonathan D. Denlinger, Conrad Stansbury, Claudia G. Fatuzzo, Sihan Zhao, Kyunghoon Lee, Takashi Taniguchi, Kenji Watanabe, Chris Jozwiak, Aaron Bostwick, Eli Rotenberg, Roland J. Koch, Feng Wang, Alessandra Lanzara

Abstract: Electron-hole asymmetry is a fundamental property in solids that can determine the nature of quantum phase transitions and the regime of operation for devices. The observation of electron-hole asymmetry in graphene and recently in the phase diagram of bilayer graphene has spurred interest into whether it stems from disorder or from fundamental interactions such as correlations. Here, we report an effective new way to access electron-hole asymmetry in 2D materials by directly measuring the quasiparticle self-energy in graphene/Boron Nitride field effect devices. As the chemical potential moves from the hole to the electron doped side, we see an increased strength of electronic correlations manifested by an increase in the band velocity and inverse quasiparticle lifetime. These results suggest that electronic correlations play an intrinsic role in driving electron hole asymmetry in graphene and provide a new insight for asymmetries in more strongly correlated materials.

Submitted 14 March, 2021; originally announced March 2021.

Comments: 22 pages, 7 figures

Journal ref: npj Quantum Materials 7, 9 (2022)

arXiv:2102.01960 [pdf, ps, other]

Quantum supremacy and hardness of estimating output probabilities of quantum circuits

Authors: Yasuhiro Kondo, Ryuhei Mori, Ramis Movassagh

Abstract: Motivated by the recent experimental demonstrations of quantum supremacy, proving the hardness of the output of random quantum circuits is an imperative near term goal. We prove under the complexity theoretical assumption of the non-collapse of the polynomial hierarchy that approximating the output probabilities of random quantum circuits to within $\exp(-Ω(m\log m))$ additive error is hard for any classical computer, where $m$ is the number of gates in the quantum computation. More precisely, we show that the above problem is $\#\mathsf{P}$-hard under $\mathsf{BPP}^{\mathsf{NP}}$ reduction. In the recent experiments, the quantum circuit has $n$-qubits and the architecture is a two-dimensional grid of size $\sqrt{n}\times\sqrt{n}$. Indeed for constant depth circuits approximating the output probabilities to within $2^{-Ω(n\log{n})}$ is hard. For circuits of depth $\log{n}$ or $\sqrt{n}$ for which the anti-concentration property holds, approximating the output probabilities to within $2^{-Ω(n\log^2{n})}$ and $2^{-Ω(n^{3/2}\log n)}$ is hard respectively. We then show that the hardness results extend to any open neighborhood of an arbitrary (fixed) circuit including the trivial circuit with identity gates. We made an effort to find the best proofs and proved these results from first principles, which do not use the standard techniques such as the Berlekamp--Welch algorithm, the usual Paturi's lemma, and Rakhmanov's result.

Submitted 10 December, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

Comments: 10 pages + References + short appendix. Has 2 figures. v3: New material added and changed the original title in v1 "Fine-Grained Analysis and Improved Robustness of Quantum Supremacy for Random Circuit Sampling"

Journal ref: 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS)

arXiv:2102.01013 [pdf, other]

doi 10.1109/ICASSP39728.2021.9413581

End2End Acoustic to Semantic Transduction

Authors: Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Antoine Caubrière, Yannick Estève, Sylvain Meignier

Abstract: In this paper, we propose a novel end-to-end sequence-to-sequence spoken language understanding model using an attention mechanism. It reliably selects contextual acoustic features in order to hypothesize semantic contents. An initial architecture capable of extracting all pronounced words and concepts from acoustic spans is designed and tested. With a shallow fusion language model, this system reaches a 13.6 concept error rate (CER) and an 18.5 concept value error rate (CVER) on the French MEDIA corpus, achieving an absolute 2.8 points reduction compared to the state-of-the-art. Then, an original model is proposed for hypothesizing concepts and their values. This transduction reaches a 15.4 CER and a 21.6 CVER without any new type of context.

Submitted 1 February, 2021; originally announced February 2021.

Comments: Accepted at IEEE ICASSP 2021

Journal ref: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

arXiv:2008.08822 [pdf, ps, other]

A Simple and Fast Algorithm for Computing the $N$-th Term of a Linearly Recurrent Sequence

Authors: Alin Bostan, Ryuhei Mori

Abstract: We present a simple and fast algorithm for computing the $N$-th term of a given linearly recurrent sequence. Our new algorithm uses $O(\mathsf{M}(d) \log N)$ arithmetic operations, where $d$ is the order of the recurrence, and $\mathsf{M}(d)$ denotes the number of arithmetic operations for computing the product of two polynomials of degree $d$. The state-of-the-art algorithm, due to Charles Fiduccia (1985), has the same arithmetic complexity up to a constant factor. Our algorithm is simpler, faster and obtained by a totally different method. We also discuss several algorithmic applications, notably to polynomial modular exponentiation, powering of matrices and high-order lifting.

Submitted 20 August, 2020; originally announced August 2020.

Comments: 34 pages
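
The abstract above does not spell out the method, so the following is only a rough sketch of how an $O(\mathsf{M}(d) \log N)$ computation of this kind can look. It assumes the sequence is given by a rational generating function $P(x)/Q(x)$ with $Q(0) \neq 0$, and repeatedly multiplies numerator and denominator by $Q(-x)$ so that the denominator becomes even and $N$ can be halved. The names `poly_mul` and `nth_term` are hypothetical, and the quadratic-time `poly_mul` stands in for the fast polynomial multiplication that the $\mathsf{M}(d)$ bound presumes.

```python
from fractions import Fraction


def poly_mul(a, b):
    """Schoolbook product of two coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def nth_term(p, q, n):
    """Return the coefficient of x^n in p(x)/q(x), assuming q[0] != 0.

    p and q are lists of integers, lowest degree first.
    """
    p, q = list(p), list(q)
    while n > 0:
        # q(-x): flip the sign of the odd-degree coefficients.
        q_neg = [c if i % 2 == 0 else -c for i, c in enumerate(q)]
        u = poly_mul(p, q_neg)
        # q(x) * q(-x) is even, so only the coefficients of u whose
        # parity matches n can contribute to the n-th term.
        p = u[n % 2::2] or [0]
        q = poly_mul(q, q_neg)[0::2]
        n //= 2
    return Fraction(p[0], q[0])


# Fibonacci numbers have generating function x / (1 - x - x^2).
print(nth_term([0, 1], [1, -1, -1], 10))
```

Each loop iteration halves `n` and keeps the degrees of `p` and `q` bounded, which is where the $\log N$ factor comes from in this sketch.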

arXiv:2007.12571 [pdf, other]

Crystalline symmetry-protected non-trivial topology in prototype compound BaAl$_4$

Authors: Kefeng Wang, Ryo Mori, Zhijun Wang, Limin Wang, Jonathan Han Son Ma, Drew W. Latzke, David E. Graf, Jonathan D. Denlinger, Daniel Campbell, B. Andrei Bernevig, Alessandra Lanzara, Johnpierre Paglione

Abstract: The BaAl$_4$ prototype crystal structure is the most populous of all structure types, and is the building block for a diverse set of sub-structures including the famous ThCr$_2$Si$_2$ family that hosts high-temperature superconductivity and numerous magnetic and strongly correlated electron systems. The MA$_4$ family of materials (M=Sr, Ba, Eu; A=Al, Ga, In) themselves present an intriguing set of ground states including charge and spin orders, but have largely been considered as uninteresting metals. Using electronic structure calculations, symmetry analysis and topological quantum chemistry techniques, we predict the exemplary compound BaAl$_4$ to harbor a three-dimensional Dirac spectrum with non-trivial topology and possible nodal lines crossing the Brillouin zone, wherein one pair of semi-Dirac points with linear dispersion along the $k_z$ direction and quadratic dispersion along the $k_x/k_y$ direction resides on the rotational axis with $C_{4v}$ point group symmetry. Electrical transport measurements reveal the presence of an extremely large, unsaturating positive magnetoresistance in BaAl$_4$ despite an uncompensated band structure, and quantum oscillations and angle-resolved photoemission spectroscopy measurements confirm the predicted multiband semimetal structure with pockets of Dirac holes and a Van Hove singularity (VHS) remarkably consistent with the theoretical prediction. We thus present BaAl$_4$ as a new topological semimetal, casting its prototype status into a new role as building block for a vast array of new topological materials.

Submitted 24 July, 2020; originally announced July 2020.

Comments: 11 pages, 5 figures

arXiv:2004.02972 [pdf, other]

doi 10.1088/1748-0221/15/10/P10009

Particle Identification Using Boosted Decision Trees in the Semi-Digital Hadronic Calorimeter Prototype

Authors: D. Boumediene, A. Pingault, M. Tytgat, B. Bilki, D. Northacker, Y. Onel, G. Cho, D-W. Kim, S. C. Lee, W. Park, S. Vallecorsa, Y. Deguchi, K. Kawagoe, Y. Miura, R. Mori, I. Sekiya, T. Suehara, T. Yoshioka, L. Caponetto, C. Combaret, R. Ete, G. Garillot, G. Grenier, J-C. Ianigro, T. Kurca, I. Laktineh, et al. (65 additional authors not shown)

Abstract: The CALICE Semi-Digital Hadronic CALorimeter (SDHCAL) prototype using Glass Resistive Plate Chambers as a sensitive medium is the first technological prototype of a family of high-granularity calorimeters developed by the CALICE collaboration to equip the experiments of future leptonic colliders. It was exposed to beams of hadrons, electrons and muons several times in the CERN PS and SPS beamlines between 2012 and 2018. We present here a new method of particle identification within the SDHCAL using the Boosted Decision Trees (BDT) method applied to the data collected in 2015. The performance of the method is tested first with Geant4-based simulated events and then on the data collected by the SDHCAL in the energy range between 10 and 80~GeV with 10~GeV energy steps. The BDT method is then used to reject the electrons and muons that contaminate the SPS hadron beams.

Submitted 6 April, 2020; originally announced April 2020.

Report number: CALICE-PUB-2020-001

arXiv:2002.06780 [pdf, other]

doi 10.1088/1748-0221/15/05/C05051

Study of silicon sensors for precise timing measurement

Authors: Y. Deguchi, K. Kawagoe, E. Mestre, R. Mori, T. Suehara, T. Yoshioka

Abstract: Silicon sensors with high time resolution can help particle identification in the International Linear Collider (ILC). We are studying Low Gain Avalanche Diodes (LGADs) as a high timing resolution sensor. As a step towards developing LGADs, we are now focusing on characterizing Avalanche Photo Diodes (APDs), because APDs have the same multiplication structure as LGADs. We studied the characteristics of APDs with particles from radioisotopes.

Submitted 17 February, 2020; originally announced February 2020.

arXiv:2002.06534 [pdf, other]

doi 10.1088/1748-0221/15/05/C05033

Study of Position Sensitive Silicon Detector (PSD) for SiW-ECAL at ILC

Authors: Y. Uesugi, R. Mori, H. Yamashiro, T. Suehara, T. Yoshioka, K. Kawagoe

Abstract: We are developing position sensitive silicon detectors (PSDs) which have an electrode at each of four corners so that incident position of a charged particle can be obtained with signal from the electrodes. It is expected that the position resolution of the electromagnetic calorimeter (ECAL) of the ILD detector will be improved by introducing PSDs to detection layers. We have been developing the PSDs for several years. In the previous production we found that the charge separation is not optimally done due to the readout impedance. To solve the issue, we produced new PSDs with higher surface resistance with an additional resistive layer on the surface. We also implemented several techniques to decrease position distortion and increase signal-to-noise ratio which are essential for the optimal position resolution. The measurements on the prototype sensors are ongoing, including radiation source measurement and laser measurement using an ASIC for silicon pad detectors.

Submitted 25 March, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

Comments: 7 pages, 8 figures. Talk presented at the Calorimetry for the High Energy Frontier 2019 (CHEF2019), Fukuoka, Japan, 25-29 November 2019

Journal ref: 2020 JINST 15 C05033

arXiv:2002.06012 [pdf, other]

doi 10.1109/ICASSP40776.2020.9053247

Dialogue history integration into end-to-end signal-to-concept spoken language understanding systems

Authors: Natalia Tomashenko, Christian Raymond, Antoine Caubriere, Renato De Mori, Yannick Esteve

Abstract: This work investigates the embeddings for representing dialog history in spoken language understanding (SLU) systems. We focus on the scenario when the semantic information is extracted directly from the speech signal by means of a single end-to-end neural network model. We proposed to integrate dialogue history into an end-to-end signal-to-concept SLU system. The dialog history is represented in the form of dialog history embedding vectors (so-called h-vectors) and is provided as an additional information to end-to-end SLU models in order to improve the system performance. Three following types of h-vectors are proposed and experimentally evaluated in this paper: (1) supervised-all embeddings predicting bag-of-concepts expected in the answer of the user from the last dialog system response; (2) supervised-freq embeddings focusing on predicting only a selected set of semantic concept (corresponding to the most frequent errors in our experiments); and (3) unsupervised embeddings. Experiments on the MEDIA corpus for the semantic slot filling task demonstrate that the proposed h-vectors improve the model performance.

Submitted 14 February, 2020; originally announced February 2020.

Comments: Accepted for ICASSP 2020 (Submitted: October 21, 2019)

Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

arXiv:1907.00529 [pdf, ps, other]

Exponential-time quantum algorithms for graph coloring problems

Authors: Kazuya Shimizu, Ryuhei Mori

Abstract: The fastest known classical algorithm deciding the $k$-colorability of $n$-vertex graph requires running time $Ω(2^n)$ for $k\ge 5$. In this work, we present an exponential-space quantum algorithm computing the chromatic number with running time $O(1.9140^n)$ using quantum random access memory (QRAM). Our approach is based on Ambainis et al.'s quantum dynamic programming with applications of Grover's search to branching algorithms. We also present a polynomial-space quantum algorithm not using QRAM for the graph $20$-coloring problem with running time $O(1.9575^n)$. In the polynomial-space quantum algorithm, we essentially show $(4-ε)^n$-time classical algorithms that can be improved quadratically by Grover's search.

Submitted 30 June, 2019; originally announced July 2019.

Comments: 14 pages

arXiv:1906.08043 [pdf, other]

Real to H-space Encoder for Speech Recognition

Authors: Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato De Mori

Abstract: Deep neural networks (DNNs) and more precisely recurrent neural networks (RNNs) are at the core of modern automatic speech recognition systems, due to their efficiency to process input sequences. Recently, it has been shown that different input representations, based on multidimensional algebras, such as complex and quaternion numbers, are able to bring to neural networks a more natural, compressive and powerful representation of the input signal by outperforming common real-valued NNs. Indeed, quaternion-valued neural networks (QNNs) better learn both internal dependencies, such as the relation between the Mel-filter-bank value of a specific time frame and its time derivatives, and global dependencies, describing the relations that exist between time frames. Nonetheless, QNNs are limited to quaternion-valued input signals, and it is difficult to benefit from this powerful representation with real-valued input data. This paper proposes to tackle this weakness by introducing a real-to-quaternion encoder that allows QNNs to process any one dimensional input features, such as traditional Mel-filter-banks for automatic speech recognition.

Submitted 17 June, 2019; originally announced June 2019.

Comments: Accepted at INTERSPEECH 2019

arXiv:1902.06161 [pdf, other]

doi 10.1016/j.nima.2019.04.111

Characterisation of different stages of hadronic showers using the CALICE Si-W ECAL physics prototype

Authors: CALICE Collaboration, G. Eigen, T. Price, N. K. Watson, A. Winter, Y. Do, A. Khan, D. Kim, G. C. Blazey, A. Dyshkant, K. Francis, V. Zutshi, K. Kawagoe, Y. Miura, R. Mori, I. Sekiya, T. Suehara, T. Yoshioka, J. Apostolakis, J. Giraud, D. Grondin, J. -Y. Hostachy, O. Bach, V. Bocharnikov, E. Brianne, et al. (81 additional authors not shown)

Abstract: A detailed investigation of hadronic interactions is performed using $π^-$-mesons with energies in the range 2--10 GeV incident on a high granularity silicon-tungsten electromagnetic calorimeter. The data were recorded at FNAL in 2008. The region in which the $π^-$-mesons interact with the detector material and the produced secondary particles are characterised using a novel track-finding algorithm that reconstructs tracks within hadronic showers in a calorimeter in the absence of a magnetic field. The principle of carrying out detector monitoring and calibration using secondary tracks is also demonstrated.

Submitted 18 September, 2019; v1 submitted 16 February, 2019; originally announced February 2019.

Comments: 21 pages, 21 figures

Report number: CALICE-PUB-2019-002

Journal ref: Nucl.Instrum.Meth. A937 (2019) 41-52

arXiv:1901.08818 [pdf, other]

doi 10.1016/j.nima.2019.05.013

Analysis of Testbeam Data of the Highly Granular RPC-Steel CALICE Digital Hadron Calorimeter and Validation of Geant4 Monte Carlo Models

Authors: CALICE Collaboration, M. Chefdeville, J. Repond, J. Schlereth, J. R. Smith, D. Trojand, L. Xia, Q. Zhang, J. Apostolakis, C. Grefe, V. Ivantchenko, G. Folger, A. Ribon, V. Uzhinskiy, G. C. Blazey, A. Dyshkant, K. Francis, V. Zutshi, O. Bach, V. Bocharnikov, E. Brianne, K. Gadow, P. Göttlicher, O. Hartbrich, D. Heuchel, et al. (71 additional authors not shown)

Abstract: We present a study of the response of the highly granular Digital Hadronic Calorimeter with steel absorbers, the Fe-DHCAL, to positrons, muons, and pions with momenta ranging from 2 to 60 GeV/c. Developed in the context of the CALICE collaboration, this hadron calorimeter utilises Resistive Plate Chambers as active media, interspersed with steel absorber plates. With a transverse granularity of $1\,\times\,1\,$cm$^{2}$ and a longitudinal segmentation of 38 layers, the calorimeter counted 350,208 readout channels, each read out with single-bit resolution (digital readout). The data were recorded in the Fermilab test beam in 2010-11. The analysis includes measurements of the calorimeter response and the energy resolution to positrons and muons, as well as detailed studies of various shower shape quantities. The results are compared to simulations based on Geant4, which utilise different electromagnetic and hadronic physics lists.

Submitted 25 January, 2019; originally announced January 2019.

Report number: CALICE-PUB-2019-001

arXiv:1812.09321 [pdf, other]

Multiple topic identification in telephone conversations

Authors: Xavier Bost, Marc El Bèze, Renato De Mori

Abstract: This paper deals with the automatic analysis of conversations between a customer and an agent in a call centre of a customer care service. The purpose of the analysis is to hypothesize themes about problems and complaints discussed in the conversation. Themes are defined by the application documentation topics. A conversation may contain mentions that are irrelevant for the application purpose and multiple themes whose mentions may be interleaved portions of a conversation that cannot be well defined. Two methods are proposed for multiple theme hypothesization. One of them is based on a cosine similarity measure using a bag of features extracted from the entire conversation. The other method introduces the concept of thematic density distributed around specific word positions in a conversation. In addition to automatically selected words, word bi-grams with possible gaps between successive words are also considered and selected. Experimental results show that the results obtained with the proposed methods outperform the results obtained with support vector machines on the same data. Furthermore, using the theme skeleton of a conversation from which thematic densities are derived, it will be possible to extract components of an automatic conversation report to be used for improving the service performance. Index Terms: multi-topic audio document classification, human/human conversation analysis, speech analytics, distance bigrams

Submitted 29 December, 2018; v1 submitted 21 December, 2018; originally announced December 2018.

Comments: arXiv admin note: text overlap with arXiv:1812.07207

Journal ref: Interspeech, Aug 2013, Lyon, France

arXiv:1812.07207 [pdf, other]

doi 10.1016/j.csl.2015.03.006

Multiple topic identification in human/human conversations

Authors: X. Bost, G. Senay, M. El-Bèze, R. De Mori

Abstract: The paper deals with the automatic analysis of real-life telephone conversations between an agent and a customer of a customer care service (ccs). The application domain is the public transportation system in Paris and the purpose is to collect statistics about customer problems in order to monitor the service and decide priorities on the intervention for improving user satisfaction. Of primary importance for the analysis is the detection of themes that are the object of customer problems. Themes are defined in the application requirements and are part of the application ontology that is implicit in the ccs documentation. Due to variety of customer population, the structure of conversations with an agent is unpredictable. A conversation may be about one or more themes. Theme mentions can be interleaved with mentions of facts that are irrelevant for the application purpose. Furthermore, in certain conversations theme mentions are localized in specific conversation segments while in other conversations mentions cannot be localized. As a consequence, approaches to feature extraction with and without mention localization are considered. Application domain relevant themes identified by an automatic procedure are expressed by specific sentences whose words are hypothesized by an automatic speech recognition (asr) system. The asr system is error prone. The word error rates can be very high for many reasons. Among them it is worth mentioning unpredictable background noise, speaker accent, and various types of speech disfluencies. As the application task requires the composition of proportions of theme mentions, a sequential decision strategy is introduced in this paper for performing a survey of the large amount of conversations made available in a given time period. The strategy has to sample the conversations to form a survey containing enough data analyzed with high accuracy so that proportions can be estimated with sufficient accuracy. Due to the unpredictable type of theme mentions, it is appropriate to consider methods for theme hypothesization based on global as well as local feature extraction. Two systems based on each type of feature extraction will be considered by the strategy. One of the four methods is novel. It is based on a new definition of density of theme mentions and on the localization of high density zones whose boundaries do not need to be precisely detected. The sequential decision strategy starts by grouping theme hypotheses into sets of different expected accuracy and coverage levels. For those sets for which accuracy can be improved with a consequent increase of coverage a new system with new features is introduced. Its execution is triggered only when specific preconditions are met on the hypotheses generated by the basic four systems. Experimental results are provided on a corpus collected in the call center of the Paris transportation system known as ratp. The results show that surveys with high accuracy and coverage can be composed with the proposed strategy and systems. This makes it possible to apply a previously published proportion estimation approach that takes into account hypothesization errors.

Submitted 29 December, 2018; v1 submitted 18 December, 2018; originally announced December 2018.

Journal ref: Computer Speech & Language, 2015, 34 (1), pp.18-42

arXiv:1811.09678 [pdf, other]

Speech recognition with quaternion neural networks

Authors: Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Renato De Mori

Abstract: Neural network architectures are at the core of powerful automatic speech recognition systems (ASR). However, while recent researches focus on novel model architectures, the acoustic input features remain almost unchanged. Traditional ASR systems rely on multidimensional acoustic features such as the Mel filter bank energies alongside with the first, and second order derivatives to characterize time-frames that compose the signal sequence. Considering that these components describe three different views of the same element, neural networks have to learn both the internal relations that exist within these features, and external or global dependencies that exist between the time-frames. Quaternion-valued neural networks (QNN), recently received an important interest from researchers to process and learn such relations in multidimensional spaces. Indeed, quaternion numbers and QNNs have shown their efficiency to process multidimensional inputs as entities, to encode internal dependencies, and to solve many tasks with up to four times less learning parameters than real-valued models. We propose to investigate modern quaternion-valued models such as convolutional and recurrent quaternion neural networks in the context of speech recognition with the TIMIT dataset. The experiments show that QNNs always outperform real-valued equivalent models with way less free parameters, leading to a more efficient, compact, and expressive representation of the relevant information.

Submitted 21 November, 2018; originally announced November 2018.

Comments: NIPS 2018 (IRASL). arXiv admin note: text overlap with arXiv:1806.04418

arXiv:1811.02566 [pdf, other]

Bidirectional Quaternion Long-Short Term Memory Recurrent Neural Networks for Speech Recognition

Authors: Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato De Mori

Abstract: Recurrent neural networks (RNN) are at the core of modern automatic speech recognition (ASR) systems. In particular, long-short term memory (LSTM) recurrent neural networks have achieved state-of-the-art results in many speech recognition tasks, due to their efficient representation of long and short term dependencies in sequences of inter-dependent features. Nonetheless, internal dependencies within the elements composing multidimensional features are weakly considered by traditional real-valued representations. We propose a novel quaternion long-short term memory (QLSTM) recurrent neural network that takes into account both the external relations between the features composing a sequence, and these internal latent structural dependencies with the quaternion algebra. QLSTMs are compared to LSTMs during a memory copy-task and a realistic application of speech recognition on the Wall Street Journal (WSJ) dataset. QLSTM reaches better performance in both experiments with up to $2.8$ times fewer learning parameters, leading to a more expressive representation of the information.

Submitted 6 November, 2018; originally announced November 2018.

Comments: Submitted at ICASSP 2019. arXiv admin note: text overlap with arXiv:1806.04418

arXiv:1806.07789 [pdf, other]

Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

Authors: Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio

Abstract: Recently, the connectionist temporal classification (CTC) model coupled with recurrent (RNN) or convolutional neural networks (CNN) made it easier to train speech recognition systems in an end-to-end fashion. However, in real-valued models, time frame components such as mel-filter-bank energies and the cepstral coefficients obtained from them, together with their first and second order derivatives, are processed as individual elements, while a natural alternative is to process such components as composed entities. We propose to group such elements in the form of quaternions and to process these quaternions using the established quaternion algebra. Quaternion numbers and quaternion neural networks have shown their efficiency to process multidimensional inputs as entities, to encode internal dependencies, and to solve many tasks with fewer learning parameters than real-valued models. This paper proposes to integrate multiple feature views in a quaternion-valued convolutional neural network (QCNN), to be used for sequence-to-sequence mapping with the CTC model. Promising results are reported using simple QCNNs in phoneme recognition experiments with the TIMIT corpus. More precisely, QCNNs obtain a lower phoneme error rate (PER) with fewer learning parameters than a competing model based on real-valued CNNs.

Submitted 20 June, 2018; originally announced June 2018.

Comments: Accepted at INTERSPEECH 2018

arXiv:1806.04418 [pdf, other]

Quaternion Recurrent Neural Networks

Authors: Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Chiheb Trabelsi, Renato De Mori, Yoshua Bengio

Abstract: Recurrent neural networks (RNNs) are powerful architectures to model sequential data, due to their capability to learn short and long-term dependencies between the basic elements of a sequence. Nonetheless, popular tasks such as speech or image recognition involve multi-dimensional input features that are characterized by strong internal dependencies between the dimensions of the input vector. We propose a novel quaternion recurrent neural network (QRNN), along with a quaternion long-short term memory neural network (QLSTM), that take into account both the external relations and these internal structural dependencies with the quaternion algebra. Similarly to capsules, quaternions allow the QRNN to code internal dependencies by composing and processing multidimensional features as single entities, while the recurrent operation reveals correlations between the elements composing the sequence. We show that both QRNN and QLSTM achieve better performance than RNN and LSTM in a realistic application of automatic speech recognition. Finally, we show that QRNN and QLSTM reduce by a maximum factor of 3.3x the number of free parameters needed, compared to real-valued RNNs and LSTMs, to reach better results, leading to a more compact representation of the relevant information.

Submitted 7 January, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

Comments: ICLR Update - Full rework

arXiv:1803.09947 [pdf, ps, other]

Periodic Fourier representation of Boolean functions

Authors: Ryuhei Mori

Abstract: In this work, we consider a new type of Fourier-like representation of a Boolean function $f\colon\{+1,-1\}^n\to\{+1,-1\}$ \[ f(x) = \cos\left(π\sum_{S\subseteq[n]}φ_S \prod_{i\in S} x_i\right). \] This representation, which we call the periodic Fourier representation of a Boolean function, is closely related to a certain type of multipartite Bell inequalities and non-adaptive measurement-based quantum computation with linear side-processing ($\mathrm{NMQC}_\oplus$). The minimum number of non-zero coefficients in the above representation, which we call the periodic Fourier sparsity, is equal to the required number of qubits for the exact computation of $f$ by $\mathrm{NMQC}_\oplus$. Periodic Fourier representations are not unique, and can be directly obtained both from the Fourier representation and the $\mathbb{F}_2$-polynomial representation. In this work, we first show that Boolean functions related to $\mathbb{Z}/4\mathbb{Z}$-polynomials have small periodic Fourier sparsities. Second, we show that the periodic Fourier sparsity is at least $2^{\mathrm{deg}_{\mathbb{F}_2}(f)}-1$, which means that $\mathrm{NMQC}_\oplus$ efficiently computes a Boolean function $f$ if and only if the $\mathbb{F}_2$-degree of $f$ is small. Furthermore, we show that any symmetric Boolean function, e.g., $\mathsf{AND}_n$, $\mathsf{Mod}^3_n$, $\mathsf{Maj}_n$, etc., can be exactly computed by depth-2 $\mathrm{NMQC}_\oplus$ using a polynomial number of qubits, which implies exponential gaps between $\mathrm{NMQC}_\oplus$ and depth-2 $\mathrm{NMQC}_\oplus$.

Submitted 26 March, 2019; v1 submitted 27 March, 2018; originally announced March 2018.

Comments: 18 pages, 2 figures, 2 tables

arXiv:1801.00081 [pdf, ps, other]

Validity of formal expansions for singularly perturbed competition-diffusion systems

Authors: Ryunosuke Mori

Abstract: We consider a two-species competition-diffusion system involving a small parameter $\varepsilon>0$ and discuss the validity of formal asymptotic expansions of solutions near the sharp interface limit $\varepsilon\approx0$. We assume that the corresponding ODE system has two stable equilibria. As in the scalar Allen--Cahn equation, it is known that the motion of the sharp interfaces of such systems is governed by the mean curvature flow with a driving force. The formal expansion also suggests that the profile of the transition layers converges to that of a traveling wave solution as $\varepsilon\rightarrow0$. In this paper, we rigorously verify this latter ansatz for a large class of initial data. The proof relies on a rescaling argument, the super--subsolution method and a Liouville type theorem for eternal solutions of parabolic systems. Roughly speaking, the Liouville type theorem states that any eternal solution that lies between two traveling waves is itself a traveling wave. The same Liouville type theorem was established for the scalar Allen--Cahn equation by Berestycki and Hamel. In view of their importance, we prove the Liouville type theorems in a rather general framework, not only for two-species competition-diffusion systems but also for $m$-species cooperation-diffusion systems possibly with time periodic or spatially periodic coefficients.

Submitted 29 December, 2017; originally announced January 2018.

arXiv:1712.09778 [pdf, other]

doi 10.1112/plms.12243

A variational problem associated with the minimal speed of traveling waves for spatially periodic KPP type equations

Authors: Dongyuan Xiao, Ryunosuke Mori

Abstract: We consider a variational problem associated with the minimal speed of pulsating traveling waves of the equation $u_t=u_{xx}+b(x)(1-u)u$, $x\in{\mathbb R},\ t>0$, where the coefficient $b(x)$ is nonnegative and periodic in $x\in{\mathbb R}$ with a period $L>0$. It is known that there exists a quantity $c^*(b)>0$ such that a pulsating traveling wave with the average speed $c>0$ exists if and only if $c\geq c^*(b)$. The quantity $c^*(b)$ is the so-called minimal speed of pulsating traveling waves. In this paper, we study the problem of maximizing $c^*(b)$ by varying the coefficient $b(x)$ under some constraints. We prove the existence of the maximizer under a certain assumption of the constraint and derive the Euler--Lagrange equation which the maximizer satisfies under $L^2$ constraint $\int_0^L b(x)^2dx=β$. The limit problems of the solution of this Euler--Lagrange equation as $L\rightarrow0$ and as $β\rightarrow0$ are also considered. Moreover, we also consider the variational problem in a certain class of step functions under $L^p$ constraint $\int_0^L b(x)^pdx=β$ when $L$ or $β$ tends to infinity.

Submitted 28 December, 2017; originally announced December 2017.

arXiv:1712.09590 [pdf, other]

On mean curvature flow with driving force starting as singular initial hypersurface

Authors: Ryunosuke Mori, Longjie Zhang

Abstract: We consider an axisymmetric closed hypersurface evolving by its mean curvature with a driving force, starting from a singular initial hypersurface. We study this problem by the level set method. We give some criteria to judge whether the interface evolution is fattening or non-fattening.

Submitted 27 December, 2017; originally announced December 2017.

Comments: arXiv admin note: text overlap with arXiv:1703.10707

arXiv:1711.01594 [pdf, other]

doi 10.1088/1748-0221/13/03/T03004

Prototyping of petalets for the Phase-II Upgrade of the silicon strip tracking detector of the ATLAS Experiment

Authors: S. Kuehn, V. Benítez, J. Fernández-Tejero, C. Fleta, M. Lozano, M. Ullán, H. Lacker, L. Rehnisch, D. Sperlich, D. Ariza, I. Bloch, S. Díez, I. Gregor, J. Keller, K. Lohwasser, L. Poley, V. Prahl, N. Zakharchuk, M. Hauser, K. Jakobs, K. Mahboubi, R. Mori, U. Parzefall, J. Bernabéu, C. Lacasta , et al. (9 additional authors not shown)

Abstract: In the high luminosity era of the Large Hadron Collider, the HL-LHC, the instantaneous luminosity is expected to reach unprecedented values, resulting in about 200 proton-proton interactions in a typical bunch crossing. To cope with the resultant increase in occupancy, bandwidth and radiation damage, the ATLAS Inner Detector will be replaced by an all-silicon system, the Inner Tracker (ITk). The ITk consists of a silicon pixel and a strip detector and exploits the concept of modularity. Prototyping and testing of various strip detector components has been carried out. This paper presents the developments and results obtained with reduced-size structures equivalent to those foreseen to be used in the forward region of the silicon strip detector. Referred to as petalets, these structures are built around a composite sandwich with embedded cooling pipes and electrical tapes for routing the signals and power. Detector modules built using electronic flex boards and silicon strip sensors are glued on both the front and back side surfaces of the carbon structure. Details are given on the assembly, testing and evaluation of several petalets. Measurement results of both mechanical and electrical quantities are shown. Moreover, an outlook is given for improved prototyping plans for large structures.

Submitted 5 November, 2017; originally announced November 2017.

Comments: 22 pages for submission for Journal of Instrumentation

arXiv:1706.05184 [pdf, ps, other]

Average Length of Cycles in Rectangular Lattice

Authors: Ryuhei Mori

Abstract: We study the number of cycles and their average length in the $L\times N$ lattice by using the classical transfer matrix method. In this work, we derive a bivariate generating function $G_3(y, z)$ in which the coefficient of $y^i z^j$ is the number of cycles of length $i$ in the $3\times j$ lattice. By using the bivariate generating function, we show that the average length of cycles in the $3\times N$ lattice is $αN + β + o(1)$ where $α$ and $β$ are some algebraic numbers approximately equal to 3.166 and 0.961, respectively. We discuss generalizations of this method for $L\ge 4$, and obtain a generating function of the number of cycles in the $L\times N$ lattice for $L$ up to 7.

Submitted 16 June, 2017; originally announced June 2017.

Comments: 8 pages, 3 figures, 1 table

arXiv:1705.09515 [pdf, other]

ASR error management for improving spoken language understanding

Authors: Edwin Simonnet, Sahar Ghannay, Nathalie Camelin, Yannick Estève, Renato De Mori

Abstract: This paper addresses the problem of automatic speech recognition (ASR) error detection and its use for improving spoken language understanding (SLU) systems. In this study, the SLU task consists in automatically extracting, from ASR transcriptions, semantic concepts and concept/value pairs in, e.g., a touristic information system. An approach is proposed for enriching the set of semantic labels with error-specific labels and for using a recently proposed neural approach based on word embeddings to compute well-calibrated ASR confidence measures. Experimental results are reported showing that it is possible to significantly decrease the Concept/Value Error Rate with a state-of-the-art system, outperforming previously published results on the same experimental data. It is also shown that, by combining an SLU approach based on conditional random fields with a neural encoder/decoder attention-based architecture, it is possible to effectively identify confidence islands and uncertain semantic output segments useful for deciding appropriate error handling actions by the dialogue manager strategy.

Submitted 26 May, 2017; originally announced May 2017.

Comments: Interspeech 2017, Aug 2017, Stockholm, Sweden. 2017

arXiv:1702.03402 [pdf, ps, other]

Parallel Long Short-Term Memory for Multi-stream Classification

Authors: Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato De Mori

Abstract: Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome with respect to a sequence of inputs. Recurrent hidden cells allow DNN-based models such as Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) to manage long-term dependencies. Nevertheless, these RNNs process a single input stream in one (LSTM) or two (Bidirectional LSTM) directions. But most of the information available nowadays comes from multistream or multimedia documents, and requires RNNs to process this information synchronously during training. This paper presents an original LSTM-based architecture, named Parallel LSTM (PLSTM), that processes multiple parallel synchronized input sequences in order to predict a common output. The proposed PLSTM method could be used for parallel sequence classification purposes. The PLSTM approach is evaluated on an automatic telecast genre sequence classification task and compared with different state-of-the-art architectures. Results show that the proposed PLSTM method outperforms the baseline n-gram models as well as the state-of-the-art LSTM approach.

Submitted 11 February, 2017; originally announced February 2017.

Comments: 2016 IEEE Workshop on Spoken Language Technology

arXiv:1701.04521 [pdf, other]

Sum of squares lower bounds for refuting any CSP

Authors: Pravesh K. Kothari, Ryuhei Mori, Ryan O'Donnell, David Witmer

Abstract: Let $P:\{0,1\}^k \to \{0,1\}$ be a nontrivial $k$-ary predicate. Consider a random instance of the constraint satisfaction problem $\mathrm{CSP}(P)$ on $n$ variables with $Δn$ constraints, each being $P$ applied to $k$ randomly chosen literals. Provided the constraint density satisfies $Δ\gg 1$, such an instance is unsatisfiable with high probability. The \emph{refutation} problem is to efficiently find a proof of unsatisfiability. We show that whenever the predicate $P$ supports a $t$-\emph{wise uniform} probability distribution on its satisfying assignments, the sum of squares (SOS) algorithm of degree $d = Θ(\frac{n}{Δ^{2/(t-1)} \log Δ})$ (which runs in time $n^{O(d)}$) \emph{cannot} refute a random instance of $\mathrm{CSP}(P)$. In particular, the polynomial-time SOS algorithm requires $\widetilde{Ω}(n^{(t+1)/2})$ constraints to refute random instances of CSP$(P)$ when $P$ supports a $t$-wise uniform distribution on its satisfying assignments. Together with recent work of Lee et al. [LRS15], our result also implies that \emph{any} polynomial-size semidefinite programming relaxation for refutation requires at least $\widetilde{Ω}(n^{(t+1)/2})$ constraints. Our results (which also extend with no change to CSPs over larger alphabets) subsume all previously known lower bounds for semialgebraic refutation of random CSPs. For every constraint predicate~$P$, they give a three-way hardness tradeoff between the density of constraints, the SOS degree (hence running time), and the strength of the refutation. By recent algorithmic results of Allen et al. [AOW15] and Raghavendra et al. [RRS16], this full three-way tradeoff is \emph{tight}, up to lower-order factors.

Submitted 16 January, 2017; originally announced January 2017.

Comments: 39 pages, 1 figure

MSC Class: 68Q17 ACM Class: G.1.6; F.4.1

arXiv:1701.04327 [pdf, ps, other]

Better Protocol for XOR Game using Communication Protocol and Nonlocal Boxes

Authors: Ryuhei Mori

Abstract: Buhrman showed that an efficient communication protocol implies a reliable XOR game protocol. This idea rederives Linial and Shraibman's lower bounds on communication complexity, which were derived by using factorization norms, with a worse constant factor but in a much more intuitive way. In this work, we improve and generalize Buhrman's idea, and obtain a class of lower bounds for classical communication complexity that includes the exact Linial and Shraibman lower bound as a special case. In the proof, we explicitly construct a protocol for the XOR game from a classical communication protocol by using the concept of nonlocal boxes and Pawłowski et al.'s elegant protocol, which was used for showing the violation of information causality in superquantum theories.

Submitted 28 April, 2017; v1 submitted 16 January, 2017; originally announced January 2017.

Comments: 21 pages + the title page

arXiv:1610.03029 [pdf, ps, other]

Lower bounds for CSP refutation by SDP hierarchies

Authors: Ryuhei Mori, David Witmer

Abstract: For a $k$-ary predicate $P$, a random instance of CSP$(P)$ with $n$ variables and $m$ constraints is unsatisfiable with high probability when $m \gg n$. The natural algorithmic task in this regime is \emph{refutation}: finding a proof that a given random instance is unsatisfiable. Recent work of Allen et al. suggests that the difficulty of refuting CSP$(P)$ using an SDP is determined by a parameter $\mathrm{cmplx}(P)$, the smallest $t$ for which there does not exist a $t$-wise uniform distribution over satisfying assignments to $P$. In particular they show that random instances of CSP$(P)$ with $m \gg n^{\mathrm{cmplx}(P)/2}$ can be refuted efficiently using an SDP. In this work, we give evidence that $n^{\mathrm{cmplx}(P)/2}$ constraints are also \emph{necessary} for refutation using SDPs. Specifically, we show that if $P$ supports a $(t-1)$-wise uniform distribution over satisfying assignments, then the Sherali-Adams$_+$ and Lovász-Schrijver$_+$ SDP hierarchies cannot refute a random instance of CSP$(P)$ in polynomial time for any $m \leq n^{t/2-ε}$.

Submitted 10 October, 2016; originally announced October 2016.

arXiv:1606.06192 [pdf]

doi 10.1038/nmat4488

A Novel Quasi-One-Dimensional Topological Insulator in Bismuth Iodide $β$-Bi$_4$I$_4$

Authors: Gabriel Autès, Anna Isaeva, Luca Moreschini, Jens C. Johannsen, Andrea Pisoni, Ryo Mori, Wentao Zhang, Taisia G. Filatova, Alexey N. Kuznetsov, László Forró, Wouter Van den Broek, Yeongkwan Kim, Keun Su Kim, Alessandra Lanzara, Jonathan D. Denlinger, Eli Rotenberg, Aaron Bostwick, Marco Grioni, Oleg V. Yazyev

Abstract: Recent progress in the field of topological states of matter(1,2) has largely been initiated by the discovery of bismuth and antimony chalcogenide bulk topological insulators (TIs)(3-6), followed by closely related ternary compounds(7-16) and predictions of several weak TIs(17-19). However, both the conceptual richness of Z$_2$ classification of TIs as well as their structural and compositional diversity are far from being fully exploited. Here, a new Z$_2$ topological insulator is theoretically predicted and experimentally confirmed in the $β$-phase of quasi-one-dimensional bismuth iodide Bi$_4$I$_4$. The electronic structure of $β$-Bi$_4$I$_4$, characterized by Z$_2$ invariants (1;110), is in proximity of both the weak TI phase (0;001) and the trivial insulator phase (0;000). Our angle-resolved photoemission spectroscopy measurements on the (001) surface reveal a highly anisotropic band-crossing feature located at the point of the surface Brillouin zone and showing no dispersion with the photon energy, thus being fully consistent with the theoretical prediction.

Submitted 20 June, 2016; originally announced June 2016.

Journal ref: Nature Materials 15, 154-158 (2016)

arXiv:1606.05119 [pdf, other]

Average Shortest Path Length of Graphs of Diameter 3

Authors: Nobutaka Shimizu, Ryuhei Mori

Abstract: A network topology with low average shortest path length (ASPL) provides efficient data transmission while the number of nodes and the number of links incident to each node are often limited due to physical constraints. In this paper, we consider the construction of low ASPL graphs under these constraints by using stochastic local search (SLS) algorithms. Since the ASPL cannot be calculated efficiently, the ASPL is not suitable as the evaluation function of SLS algorithms. We first derive an equality and bounds for the ASPL of graphs of diameter 3. Then, we propose to use the simplest upper bound, represented by the number of triangles and squares in the graph, as an evaluation function for graphs of diameter 3. We show that the proposed evaluation function can be evaluated in O(1) time as the number of nodes and the maximum degree tend to infinity by using some data tables. By using simulated annealing with the proposed evaluation function, we construct low ASPL regular graphs of diameter 3 with 10 000 nodes.

Submitted 16 June, 2016; originally announced June 2016.

Comments: 6 pages, 2 figures
