-
EEND-M2F: Masked-attention mask transformers for speaker diarization
Abstract: In this paper, we make the explicit connection between image segmentation methods and end-to-end diarization methods. From these insights, we propose a novel, fully end-to-end diarization model, EEND-M2F, based on the Mask2Former architecture. Speaker representations are computed in parallel using a stack of transformer decoders, in which irrelevant frames are explicitly masked from the cross atte… ▽ More
Submitted 23 January, 2024; originally announced January 2024.
Comments: 14 pages, 2 figures
-
Transformer Attractors for Robust and Efficient End-to-End Neural Diarization
Abstract: End-to-end neural diarization with encoder-decoder based attractors (EEND-EDA) is a method to perform diarization in a single neural network. EDA handles the diarization of a flexible number of speakers by using an LSTM-based encoder-decoder that generates a set of speaker-wise attractors in an autoregressive manner. In this paper, we propose to replace EDA with a transformer-based attractor calcu… ▽ More
Submitted 11 December, 2023; originally announced December 2023.
Comments: 8 pages, 1 figure, ASRU2023
-
Gaussian Process Priors for Systems of Linear Partial Differential Equations with Constant Coefficients
Abstract: Partial differential equations (PDEs) are important tools to model physical systems and including them into machine learning models is an important way of incorporating physical knowledge. Given any system of linear PDEs with constant coefficients, we propose a family of Gaussian process (GP) priors, which we call EPGP, such that all realizations are exact solutions of this system. We apply the Eh… ▽ More
Submitted 2 November, 2023; v1 submitted 29 December, 2022; originally announced December 2022.
Comments: 26 pages, 8 figures; ICML 2023 (oral); updated with expanded appendices and ancillary files. Code available at https://github.com/haerski/EPGP. For animations, see https://mathrepo.mis.mpg.de/EPGP/index.html. For a presentation see https://icml.cc/virtual/2023/oral/25571. The paper and all ancillary files are released under CC-BY
MSC Class: 60G15; 13N10; 13P25; 60-08; 35G35
Journal ref: ICML 2023 (oral); PMLR 202:12587-12615, 2023
-
arXiv:2104.10146 [pdf, ps, other]
Linear PDE with Constant Coefficients
Abstract: We discuss practical methods for computing the space of solutions to an arbitrary homogeneous linear system of partial differential equations with constant coefficients. These rest on the Fundamental Principle of Ehrenpreis-Palamodov from the 1960s. We develop this further using recent advances in computational commutative algebra.
Submitted 12 October, 2021; v1 submitted 20 April, 2021; originally announced April 2021.
Comments: 31 pages, 1 figure
MSC Class: 13N10; 14-04; 14Q15; 35G35; 35C15
-
arXiv:2006.13881 [pdf, ps, other]
Noetherian operators and primary decomposition
Abstract: Noetherian operators are differential operators that encode primary components of a polynomial ideal. We develop a framework, as well as algorithms, for computing Noetherian operators with local dual spaces, both symbolically and numerically. For a primary ideal, such operators provide an alternative representation to one given by a set of generators. This description fits well with numerical alge… ▽ More
Submitted 24 June, 2020; originally announced June 2020.
Comments: 17 pages, codebase available at https://github.com/haerski/NoetherianOperators
MSC Class: 14Q15; 14-04; 13N05; 65L80; 65D05