Skip to main content

Showing 1–19 of 19 results for author: Twomey, N

.
  1. arXiv:2312.10238  [pdf, other

    cs.LG stat.ML

    Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood

    Authors: Weisong Yang, Rafael Poyiadzi, Niall Twomey, Raul Santos Rodriguez

    Abstract: In supervised learning, automatically assessing the quality of the labels before any learning takes place remains an open research question. In certain particular cases, hypothesis testing procedures have been proposed to assess whether a given instance-label dataset is contaminated with class-conditional label noise, as opposed to uniform label noise. The existing theory builds on the asymptotic… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  2. arXiv:2311.10049  [pdf, other

    cs.LG cs.AI

    Inherently Interpretable Time Series Classification via Multiple Instance Learning

    Authors: Joseph Early, Gavin KC Cheung, Kurt Cutajar, Hanting Xie, Jas Kandola, Niall Twomey

    Abstract: Conventional Time Series Classification (TSC) methods are often black boxes that obscure inherent interpretation of their decision-making processes. In this work, we leverage Multiple Instance Learning (MIL) to overcome this issue, and propose a new framework called MILLET: Multiple Instance Learning for Locally Explainable Time series classification. We apply MILLET to existing deep learning TSC… ▽ More

    Submitted 16 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Published at ICLR 2024. 29 pages (9 main, 3 ref, 17 appendix)

  3. Low-count Time Series Anomaly Detection

    Authors: Philipp Renz, Kurt Cutajar, Niall Twomey, Gavin K. C. Cheung, Hanting Xie

    Abstract: Low-count time series describe sparse or intermittent events, which are prevalent in large-scale online platforms that capture and monitor diverse data types. Several distinct challenges surface when modelling low-count time series, particularly low signal-to-noise ratios (when anomaly signatures are provably undetectable), and non-uniform performance (when average metrics are not representative o… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 6 pages, 7 figures, to be published in IEEE 2023 Workshop on Machine Learning for Signal Processing (MLSP)

    Journal ref: 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)

  4. arXiv:2203.10170  [pdf, other

    cs.CY cs.LG

    Equitable Ability Estimation in Neurodivergent Student Populations with Zero-Inflated Learner Models

    Authors: Niall Twomey, Sarah McMullan, Anat Elhalal, Rafael Poyiadzi, Luis Vaquero

    Abstract: At present, the educational data mining community lacks many tools needed for ensuring equitable ability estimation for Neurodivergent (ND) learners. On one hand, most learner models are susceptible to under-estimating ND ability since confounding contexts cannot be held accountable (eg consider dyslexia and text-heavy assessments), and on the other, few (if any) existing datasets are suited for a… ▽ More

    Submitted 9 May, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

  5. arXiv:2112.01243  [pdf, other

    cs.CY

    Towards Continuous Compounding Effects and Agile Practices in Educational Experimentation

    Authors: Luis M. Vaquero, Niall Twomey, Miguel Patricio Dias, Massimo Camplani, Robert Hardman

    Abstract: Randomised control trials are currently the definitive gold standard approach for formal educational experiments. Although conclusions from these experiments are highly credible, their relatively slow experimentation rate, high expense and rigid framework can be seen to limit scope on: 1. $\textit{metrics}$: automation of the consistent rigorous computation of hundreds of metrics for every experim… ▽ More

    Submitted 17 November, 2021; originally announced December 2021.

  6. arXiv:2105.05710  [pdf, other

    cs.IR

    Evaluation of Field-Aware Neural Ranking Models for Recipe Search

    Authors: Kentaro Takiguchi, Mikhail Fain, Niall Twomey, Luis M Vaquero

    Abstract: Explicitly modelling field interactions and correlations in complex document structures has recently gained popularity in neural document embedding and retrieval tasks. Although this requires the specification of bespoke task-dependent models, encouraging empirical results are beginning to emerge. We present the first in-depth analyses of non-linear multi-field interaction (NL-MFI) ranking in the… ▽ More

    Submitted 8 July, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

  7. Backretrieval: An Image-Pivoted Evaluation Metric for Cross-Lingual Text Representations Without Parallel Corpora

    Authors: Mikhail Fain, Niall Twomey, Danushka Bollegala

    Abstract: Cross-lingual text representations have gained popularity lately and act as the backbone of many tasks such as unsupervised machine translation and cross-lingual information retrieval, to name a few. However, evaluation of such representations is difficult in the domains beyond standard benchmarks due to the necessity of obtaining domain-specific parallel language data across different pairs of la… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: SIGIR 2021

  8. arXiv:2103.02630  [pdf, other

    cs.LG

    Hypothesis Testing for Class-Conditional Label Noise

    Authors: Rafael Poyiadzi, Weisong Yang, Niall Twomey, Raul Santos-Rodriguez

    Abstract: In this paper we provide machine learning practitioners with tools to answer the question: is there class-conditional noise in my labels? In particular, we present hypothesis tests to check whether a given dataset of instance-label pairs has been corrupted with class-conditional label noise, as opposed to uniform label noise, with the former biasing learning, while the latter -- under mild conditi… ▽ More

    Submitted 31 May, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: 15 pages, 4 figures

  9. arXiv:2011.09580  [pdf, ps, other

    cs.IR cs.AI

    Non-Linear Multiple Field Interactions Neural Document Ranking

    Authors: Kentaro Takiguchi, Niall Twomey, Luis M. Vaquero

    Abstract: Ranking tasks are usually based on the text of the main body of the page and the actions (clicks) of users on the page. There are other elements that could be leveraged to better contextualise the ranking experience (e.g. text in other fields, query made by the user, images, etc). We present one of the first in-depth analyses of field interaction for multiple field ranking in two separate datasets… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

  10. Towards Multi-Language Recipe Personalisation and Recommendation

    Authors: Niall Twomey, Mikhail Fain, Andrey Ponikar, Nadine Sarraf

    Abstract: Multi-language recipe personalisation and recommendation is an under-explored field of information retrieval in academic and production systems. The existing gaps in our current understanding are numerous, even on fundamental questions such as whether consistent and high-quality recipe recommendation can be delivered across languages. In this paper, we introduce the multi-language recipe recommend… ▽ More

    Submitted 18 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: 5 tables

    Journal ref: Fourteenth ACM Conference on Recommender Systems (RecSys 2020)

  11. arXiv:2007.03615  [pdf, other

    cs.CY cs.LG eess.SP stat.ML

    Detecting Signatures of Early-stage Dementia with Behavioural Models Derived from Sensor Data

    Authors: Rafael Poyiadzi, Weisong Yang, Yoav Ben-Shlomo, Ian Craddock, Liz Coulthard, Raul Santos-Rodriguez, James Selwood, Niall Twomey

    Abstract: There is a pressing need to automatically understand the state and progression of chronic neurological diseases such as dementia. The emergence of state-of-the-art sensing platforms offers unprecedented opportunities for indirect and automatic evaluation of disease state through the lens of behavioural monitoring. This paper specifically seeks to characterise behavioural signatures of mild cogniti… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted by the 1st edition of HELPLINE: Artificial Intelligence for Health, Personalized Medicine and Wellbeing

  12. arXiv:1911.12763  [pdf, other

    cs.CV

    Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest Neighbours Baselines to SoTA

    Authors: Mikhail Fain, Niall Twomey, Andrey Ponikar, Ryan Fox, Danushka Bollegala

    Abstract: We propose a novel non-parametric method for cross-modal recipe retrieval which is applied on top of precomputed image and text embeddings. By combining our method with standard approaches for building image and text encoders, trained independently with a self-supervised classification objective, we create a baseline model which outperforms most existing methods on a challenging image-to-recipe ta… ▽ More

    Submitted 13 July, 2021; v1 submitted 28 November, 2019; originally announced November 2019.

  13. arXiv:1908.02858  [pdf, other

    cs.LG eess.SY stat.ML

    HyperStream: a Workflow Engine for Streaming Data

    Authors: Tom Diethe, Meelis Kull, Niall Twomey, Kacper Sokol, Hao Song, Miquel Perello-Nieto, Emma Tonkin, Peter Flach

    Abstract: This paper describes HyperStream, a large-scale, flexible and robust software package, written in the Python language, for processing streaming data with workflow creation capabilities. HyperStream overcomes the limitations of other computational engines and provides high-level interfaces to execute complex nesting, fusion, and prediction both in online and offline forms in streaming environments.… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

  14. arXiv:1905.13658  [pdf, other

    cs.LG stat.ML

    Ordinal Regression as Structured Classification

    Authors: Niall Twomey, Rafael Poyiadzi, Callum Mann, Raúl Santos-Rodríguez

    Abstract: This paper extends the class of ordinal regression models with a structured interpretation of the problem by applying a novel treatment of encoded labels. The net effect of this is to transform the underlying problem from an ordinal regression task to a (structured) classification task which we solve with conditional random fields, thereby achieving a coherent and probabilistic model in which all… ▽ More

    Submitted 31 May, 2019; originally announced May 2019.

  15. arXiv:1905.09905  [pdf, other

    cs.LG stat.ML

    Neural ODEs with stochastic vector field mixtures

    Authors: Niall Twomey, Michał Kozłowski, Raúl Santos-Rodríguez

    Abstract: It was recently shown that neural ordinary differential equation models cannot solve fundamental and seemingly straightforward tasks even with high-capacity vector field representations. This paper introduces two other fundamental tasks to the set that baseline methods cannot solve, and proposes mixtures of stochastic vector fields as a model class that is capable of solving these essential proble… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

  16. arXiv:1810.10328  [pdf, other

    cs.LG stat.ML

    Label Propagation for Learning with Label Proportions

    Authors: Rafael Poyiadzi, Raul Santos-Rodriguez, Niall Twomey

    Abstract: Learning with Label Proportions (LLP) is the problem of recovering the underlying true labels given a dataset when the data is presented in the form of bags. This paradigm is particularly suitable in contexts where providing individual labels is expensive and label aggregates are more easily obtained. In the healthcare domain, it is a burden for a patient to keep a detailed diary of their daily ro… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: Accepted to MLSP 2018

  17. arXiv:1805.11907  [pdf, other

    cs.OH

    A Guide to the SPHERE 100 Homes Study Dataset

    Authors: Atis Elsts, Tilo Burghardt, Dallan Byrne, Massimo Camplani, Dima Damen, Xenofon Fafoutis, Sion Hannuna, William Harwin, Michael Holmes, Balazs Janko, Victor Ponce Lopez, Alessandro Masullo, Majid Mirmehdi, George Oikonomou, Robert Piechocki, R. Simon Sherratt, Emma Tonkin, Niall Twomey, Antonis Vafeas, Przemyslaw Woznowski, Ian Craddock

    Abstract: The SPHERE project has developed a multi-modal sensor platform for health and behavior monitoring in residential environments. So far, the SPHERE platform has been deployed for data collection in approximately 50 homes for duration up to one year. This technical document describes the format and the expected content of the SPHERE dataset(s) under preparation. It includes a list of some data qualit… ▽ More

    Submitted 30 October, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

  18. arXiv:1702.01209  [pdf, other

    stat.ML cs.HC

    Probabilistic Sensor Fusion for Ambient Assisted Living

    Authors: Tom Diethe, Niall Twomey, Meelis Kull, Peter Flach, Ian Craddock

    Abstract: There is a widely-accepted need to revise current forms of health-care provision, with particular interest in sensing systems in the home. Given a multiple-modality sensor platform with heterogeneous network connectivity, as is under development in the Sensor Platform for HEalthcare in Residential Environment (SPHERE) Interdisciplinary Research Collaboration (IRC), we face specific challenges rela… ▽ More

    Submitted 3 February, 2017; originally announced February 2017.

    Comments: Journal article. 19 pages; 7 figures

  19. arXiv:1603.00797  [pdf, other

    cs.CY cs.HC

    The SPHERE Challenge: Activity Recognition with Multimodal Sensor Data

    Authors: Niall Twomey, Tom Diethe, Meelis Kull, Hao Song, Massimo Camplani, Sion Hannuna, Xenofon Fafoutis, Ni Zhu, Pete Woznowski, Peter Flach, Ian Craddock

    Abstract: This paper outlines the Sensor Platform for HEalthcare in Residential Environment (SPHERE) project and details the SPHERE challenge that will take place in conjunction with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD) between March and July 2016. The SPHERE challenge is an activity recognition competition where predictions are made from vid… ▽ More

    Submitted 17 March, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: Paper describing dataset. 11 pages; 4 figures