Showing 1–27 of 27 results for author: Weyde, T

  1. arXiv:2405.00555  [pdf, other]

    cs.LG

    Derivative-based regularization for regression

    Authors: Enrico Lopedoto, Maksim Shekhunov, Vitaly Aksenov, Kizito Salako, Tillman Weyde

    Abstract: In this work, we introduce a novel approach to regularization in multivariable regression problems. Our regularizer, called DLoss, penalises differences between the model's derivatives and derivatives of the data generating function as estimated from the training data. We call these estimated derivatives data derivatives. The goal of our method is to align the model to the data, not only in terms…

    Submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2311.16004  [pdf, other]

    q-fin.ST cs.LG

    Improved Data Generation for Enhanced Asset Allocation: A Synthetic Dataset Approach for the Fixed Income Universe

    Authors: Szymon Kubiak, Tillman Weyde, Oleksandr Galkin, Dan Philps, Ram Gopal

    Abstract: We present a novel process for generating synthetic datasets tailored to assess asset allocation methods and construct portfolios within the fixed income universe. Our approach begins by enhancing the CorrGAN model to generate synthetic correlation matrices. Subsequently, we propose an Encoder-Decoder model that samples additional data conditioned on a given correlation matrix. The resulting synth…

    Submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2305.13258  [pdf, other]

    cs.AI

    NeSy4VRD: A Multifaceted Resource for Neurosymbolic AI Research using Knowledge Graphs in Visual Relationship Detection

    Authors: David Herron, Ernesto Jiménez-Ruiz, Giacomo Tarroni, Tillman Weyde

    Abstract: NeSy4VRD is a multifaceted resource designed to support the development of neurosymbolic AI (NeSy) research. NeSy4VRD re-establishes public access to the images of the VRD dataset and couples them with an extensively revised, quality-improved version of the VRD visual relationship annotations. Crucially, NeSy4VRD provides a well-aligned, companion OWL ontology that describes the dataset domain. It…

    Submitted 22 May, 2023; originally announced May 2023.

  4. arXiv:2304.03639  [pdf, other]

    cs.LG cs.CL cs.FL cs.NE

    Theoretical Conditions and Empirical Failure of Bracket Counting on Long Sequences with Linear Recurrent Networks

    Authors: Nadine El-Naggar, Pranava Madhyastha, Tillman Weyde

    Abstract: Previous work has established that RNNs with an unbounded activation function have the capacity to count exactly. However, it has also been shown that RNNs are challenging to train effectively and generally do not learn exact counting behaviour. In this paper, we focus on this problem by studying the simplest possible RNN, a linear single-cell network. We conduct a theoretical analysis of linear R…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 17th Conference of the European Chapter of the Association for Computational Linguistics Student Research Workshop (EACL 2023 SRW)

  5. arXiv:2301.10799  [pdf, other]

    cs.CL

    Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering

    Authors: Chenxi Whitehouse, Tillman Weyde, Pranava Madhyastha

    Abstract: The field of visual question answering (VQA) has recently seen a surge in research focused on providing explanations for predicted answers. However, current systems mostly rely on separate models to predict answers and generate explanations, leading to less grounded and frequently inconsistent results. To address this, we propose a multitask learning approach towards a Unified Model for Answer and…

    Submitted 13 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Findings of EACL 2023

  6. arXiv:2211.16429  [pdf, other]

    cs.NE cs.FL cs.LG

    Exploring the Long-Term Generalization of Counting Behavior in RNNs

    Authors: Nadine El-Naggar, Pranava Madhyastha, Tillman Weyde

    Abstract: In this study, we investigate the generalization of LSTM, ReLU and GRU models on counting tasks over long sequences. Previous theoretical work has established that RNNs with ReLU activation and LSTMs have the capacity for counting with suitable configuration, while GRUs have limitations that prevent correct counting over longer sequences. Despite this and some positive empirical results for LSTMs…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published in I Can't Believe It's Not Better: Understanding Deep Learning Through Empirical Falsification Workshop at NeurIPS 2022

  7. arXiv:2208.00792  [pdf, other]

    cs.SD cs.LG eess.AS

    Jazz Contrafact Detection

    Authors: C. Bunks, T. Weyde

    Abstract: In jazz, a contrafact is a new melody composed over an existing, but often reharmonized chord progression. Because reharmonization can introduce a wide range of variations, detecting contrafacts is a challenging task. This paper develops a novel vector-space model to represent chord progressions, and uses it for contrafact detection. The process applies principles from music theory to reduce the d…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 8 pages, 6 figures, 4 tables

  8. arXiv:2206.03224  [pdf]

    cs.CY cs.AI cs.HC

    The Beyond the Fence Musical and Computer Says Show Documentary

    Authors: Simon Colton, Maria Teresa Llano, Rose Hepworth, John Charnley, Catherine V. Gale, Archie Baron, Francois Pachet, Pierre Roy, Pablo Gervas, Nick Collins, Bob Sturm, Tillman Weyde, Daniel Wolff, James Robert Lloyd

    Abstract: During 2015 and early 2016, the cultural application of Computational Creativity research and practice took a big leap forward, with a project where multiple computational systems were used to provide advice and material for a new musical theatre production. Billed as the world's first 'computer musical... conceived by computer and substantially crafted by computer', Beyond The Fence was staged in…

    Submitted 11 May, 2022; originally announced June 2022.

    Journal ref: The Seventh International Conference on Computational Creativity, ICCC 2016

  9. arXiv:2204.02385  [pdf, other]

    eess.AS cs.LG cs.SD

    Learning Speech Emotion Representations in the Quaternion Domain

    Authors: Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

    Abstract: The modeling of human emotion expression in speech signals is an important, yet challenging task. The high resource demand of speech emotion recognition models, combined with the general scarcity of emotion-labelled data, are obstacles to the development and application of effective solutions in this field. In this paper, we present an approach to jointly circumvent these difficulties. Our meth…

    Submitted 3 March, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted for Publication in IEEE/ACM Transactions on Audio, Speech and Language Processing

  10. arXiv:2204.00458  [pdf, other]

    cs.CL

    Evaluation of Fake News Detection with Knowledge-Enhanced Language Models

    Authors: Chenxi Whitehouse, Tillman Weyde, Pranava Madhyastha, Nikos Komninos

    Abstract: Recent advances in fake news detection have exploited the success of large-scale pre-trained language models (PLMs). The predominant state-of-the-art approaches are based on fine-tuning PLMs on labelled fake news datasets. However, large-scale PLMs are generally not trained on structured factual data and hence may not possess priors that are grounded in factually accurate knowledge. The use of exi…

    Submitted 13 February, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: Proceedings of AAAI-ICWSM 2022

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media 16 (2022) 1425-1429

  11. arXiv:2103.12864  [pdf, other]

    cs.SD eess.AS

    Learned complex masks for multi-instrument source separation

    Authors: Andreas Jansson, Rachel M. Bittner, Nicola Montecchio, Tillman Weyde

    Abstract: Music source separation in the time-frequency domain is commonly achieved by applying a soft or binary mask to the magnitude component of (complex) spectrograms. The phase component is usually not estimated, but instead copied from the mixture and applied to the magnitudes of the estimated isolated sources. While this method has several practical advantages, it imposes an upper bound on the perfor…

    Submitted 23 March, 2021; originally announced March 2021.

  12. arXiv:2103.06198  [pdf, other]

    cs.CL cs.AI cs.LG

    Relational Weight Priors in Neural Networks for Abstract Pattern Learning and Language Modelling

    Authors: Radha Kopparti, Tillman Weyde

    Abstract: Deep neural networks have become the dominant approach in natural language processing (NLP). However, in recent years, it has become apparent that there are shortcomings in systematicity that limit the performance and data efficiency of deep learning in NLP. These shortcomings can be clearly shown in lower-level artificial tasks, mostly on synthetic data. Abstract patterns are the best known examp…

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: 29 pages

  13. arXiv:2006.06494  [pdf, other]

    cs.LG cs.NE cs.SD eess.AS stat.ML

    Anti-Transfer Learning for Task Invariance in Convolutional Neural Networks for Speech Processing

    Authors: Eric Guizzo, Tillman Weyde, Giacomo Tarroni

    Abstract: We introduce the novel concept of anti-transfer learning for speech processing with convolutional neural networks. While transfer learning assumes that the learning process for a target task will benefit from re-using representations learned for another task, anti-transfer avoids the learning of representations that have been learned for an orthogonal task, i.e., one that is not relevant and poten…

    Submitted 13 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Neural Networks Journal

  14. arXiv:2003.03375  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Multi-Time-Scale Convolution for Emotion Recognition from Speech Audio Signals

    Authors: Eric Guizzo, Tillman Weyde, Jack Barnett Leveson

    Abstract: Robustness against temporal variations is important for emotion recognition from speech audio, since emotion is expressed through complex spectral patterns that can exhibit significant local dilation and compression on the time axis depending on speaker and context. To address this and potentially other tasks, we introduce the multi-time-scale (MTS) method to create flexibility towards temporal v…

    Submitted 6 March, 2020; originally announced March 2020.

  15. arXiv:2003.03125  [pdf, other]

    cs.LG stat.ML

    Weight Priors for Learning Identity Relations

    Authors: Radha Kopparti, Tillman Weyde

    Abstract: Learning abstract and systematic relations has been an open issue in neural network learning for over 30 years. It has been shown recently that neural networks do not learn relations based on identity and are unable to generalize well to unseen data. The Relation Based Pattern (RBP) approach has been proposed as a solution for this problem. In this work, we extend RBP by realizing it as a Bayesian…

    Submitted 19 May, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: Proceedings of KR2ML @ NeurIPS 2019, Vancouver, Canada

    Journal ref: Proceedings of KR2ML @ NeurIPS 2019, Vancouver, Canada

  16. arXiv:1911.04489  [pdf, other]

    cs.LG q-fin.CP q-fin.PM stat.ML

    Making Good on LSTMs' Unfulfilled Promise

    Authors: Daniel Philps, Artur d'Avila Garcez, Tillman Weyde

    Abstract: LSTMs promise much to financial time-series analysis, temporal and cross-sectional inference, but we find that they do not deliver in a real-world financial management task. We examine an alternative called Continual Learning (CL), a memory-augmented approach, which can provide transparent explanations, i.e. which memory did what and when. This work has implications for many financial applications…

    Submitted 8 December, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada. arXiv admin note: text overlap with arXiv:1812.02340

  17. arXiv:1910.10071  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Improving singing voice separation with the Wave-U-Net using Minimum Hyperspherical Energy

    Authors: Joaquin Perez-Lapillo, Oleksandr Galkin, Tillman Weyde

    Abstract: In recent years, deep learning has surpassed traditional approaches to the problem of singing voice separation. The Wave-U-Net is a recent deep network architecture that operates directly on the time domain. The standard Wave-U-Net is trained with data augmentation and early stopping to prevent overfitting. Minimum hyperspherical energy (MHE) regularization has recently proven to increase generali…

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Paper submitted to ICASSP 2020 conference

  18. arXiv:1906.08362  [pdf, other]

    cs.AI

    Trepan Reloaded: A Knowledge-driven Approach to Explaining Artificial Neural Networks

    Authors: Roberto Confalonieri, Tillman Weyde, Tarek R. Besold, Fermín Moscoso del Prado Martín

    Abstract: Explainability in Artificial Intelligence has been revived as a topic of active research by the need of conveying safety and trust to users in the 'how' and 'why' of automated decision-making. Whilst a plethora of approaches have been developed for post-hoc explainability, only a few focus on how to use domain knowledge, and how this influences the understandability of global explanations from the…

    Submitted 21 November, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

  19. arXiv:1906.05449  [pdf, other]

    cs.LG stat.ML

    Factors for the Generalisation of Identity Relations by Neural Networks

    Authors: Radha Kopparti, Tillman Weyde

    Abstract: Many researchers implicitly assume that neural networks learn relations and generalise them to new unseen data. It has been shown recently, however, that the generalisation of feed-forward networks fails for identity relations. The proposed solution for this problem is to create an inductive bias with Differential Rectifier (DR) units. In this work we explore various factors in the neural network a…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: ICML 2019 Workshop on Understanding and Improving Generalization in Deep Learning, Long Beach, California, 2019

    Journal ref: ICML 2019 Workshop on Understanding and Improving Generalization in Deep Learning, Long Beach, California, 201

  20. arXiv:1812.02616  [pdf, other]

    cs.LG stat.ML

    Modelling Identity Rules with Neural Networks

    Authors: Tillman Weyde, Radha Manisha Kopparti

    Abstract: In this paper, we show that standard feed-forward and recurrent neural networks fail to learn abstract patterns based on identity rules. We propose Relation Based Pattern (RBP) extensions to neural network structures that solve this problem and answer, as well as raise, questions about integrating structures for inductive bias into neural networks. Examples of abstract patterns are the sequence pa…

    Submitted 20 May, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: To be Published in Journal of Applied Logic

    MSC Class: 62M45 ACM Class: I.2.6

    Journal ref: Journal of applied logics (Online) ISSN 2631-9829 Volume 6, Number 4, June 2019

  21. arXiv:1812.02340  [pdf, other]

    cs.LG cs.AI q-fin.CP q-fin.PM q-fin.TR stat.ML

    Continual Learning Augmented Investment Decisions

    Authors: Daniel Philps, Tillman Weyde, Artur d'Avila Garcez, Roy Batchelor

    Abstract: Investment decisions can benefit from incorporating an accumulated knowledge of the past to drive future decision making. We introduce Continual Learning Augmentation (CLA) which is based on an explicit memory structure and a feed forward neural network (FFNN) base model and used to drive long term financial investment decisions. We demonstrate that our approach improves accuracy in investment dec…

    Submitted 25 January, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: NeurIPS 2018 Workshop on Challenges and Opportunities for AI in Financial Services: the Impact of Fairness, Explainability, Accuracy, and Privacy, Montreal, Canada. This is a non-archival publication - the authors may submit revisions and extensions of this paper to other publication venues

  22. arXiv:1812.01662  [pdf, other]

    cs.LG stat.ML

    Feed-Forward Neural Networks Need Inductive Bias to Learn Equality Relations

    Authors: Tillman Weyde, Radha Manisha Kopparti

    Abstract: Basic binary relations such as equality and inequality are fundamental to relational data structures. Neural networks should learn such relations and generalise to new unseen data. We show in this study, however, that this generalisation fails with standard feed-forward networks on binary vectors. Even when trained with maximal training data, standard networks do not reliably detect equality. We in…

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: Relational Representation Learning Workshop, NeurIPS 2018

    Journal ref: Relational Representation Learning (R2L) Workshop, NeurIPS 2018

  23. arXiv:1811.11307  [pdf, other]

    cs.SD cs.LG cs.NE eess.AS eess.SP

    Improved Speech Enhancement with the Wave-U-Net

    Authors: Craig Macartney, Tillman Weyde

    Abstract: We study the use of the Wave-U-Net architecture for speech enhancement, a model introduced by Stoller et al. for the separation of music vocals and accompaniment. This end-to-end learning method for audio source separation operates directly in the time domain, permitting the integrated modelling of phase information and being able to take large temporal contexts into account. Our experiments show t…

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: 5 pages (including 1 for References), 1 figure, 2 tables

  24. arXiv:1811.07738  [pdf, other]

    cs.CV

    M2U-Net: Effective and Efficient Retinal Vessel Segmentation for Resource-Constrained Environments

    Authors: Tim Laibacher, Tillman Weyde, Sepehr Jalali

    Abstract: In this paper, we present a novel neural network architecture for retinal vessel segmentation that improves over the state of the art on two benchmark datasets, is the first to run in real time on high resolution images, and its small memory and processing requirements make it deployable in mobile and embedded systems. The M2U-Net has a new encoder-decoder architecture that is inspired by the U-Ne…

    Submitted 23 April, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  25. arXiv:1710.02245  [pdf, other]

    cs.LG stat.ML

    Linear-Time Sequence Classification using Restricted Boltzmann Machines

    Authors: Son N. Tran, Srikanth Cherla, Artur Garcez, Tillman Weyde

    Abstract: Classification of sequence data is the topic of interest for dynamic Bayesian models and Recurrent Neural Networks (RNNs). While the former can explicitly model the temporal dependencies between class variables, the latter have a capability of learning representations. Several attempts have been made to improve performance by combining these two approaches or increasing the processing capability o…

    Submitted 8 March, 2018; v1 submitted 5 October, 2017; originally announced October 2017.

  26. arXiv:1604.01806  [pdf, ps, other]

    cs.LG

    Generalising the Discriminative Restricted Boltzmann Machine

    Authors: Srikanth Cherla, Son N Tran, Tillman Weyde, Artur d'Avila Garcez

    Abstract: We present a novel theoretical result that generalises the Discriminative Restricted Boltzmann Machine (DRBM). While originally the DRBM was defined assuming the {0, 1}-Bernoulli distribution in each of its hidden units, this result makes it possible to derive cost functions for variants of the DRBM that utilise other distributions, including some that are often encountered in the literature. This…

    Submitted 6 April, 2016; originally announced April 2016.

    Comments: Submitted to ECML 2016 conference track

  27. arXiv:1411.1623  [pdf, ps, other]

    cs.LG

    A Hybrid Recurrent Neural Network For Music Transcription

    Authors: Siddharth Sigtia, Emmanouil Benetos, Nicolas Boulanger-Lewandowski, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon

    Abstract: We investigate the problem of incorporating higher-level symbolic score-like information into Automatic Music Transcription (AMT) systems to improve their performance. We use recurrent neural networks (RNNs) and their variants as music language models (MLMs) and present a generative architecture for combining these models with predictions from a frame level acoustic classifier. We also compare dif…

    Submitted 6 November, 2014; originally announced November 2014.