Showing 1–11 of 11 results for author: Domijan, K

  1. arXiv:2404.09863

    stat.ME stat.CO

    sfislands: An R Package for Accommodating Islands and Disjoint Zones in Areal Spatial Modelling

    Authors: Kevin Horan, Katarina Domijan, Chris Brunsdon

    Abstract: Fitting areal models which use a spatial weights matrix to represent relationships between geographical units can be a cumbersome task, particularly when these units are not well-behaved. The two chief aims of sfislands are to simplify the process of creating an appropriate neighbourhood matrix, and to quickly visualise the predictions of subsequent models. The package uses visual aids in the form…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 27 pages, 26 figures

  2. What is to be gained by ensemble models in analysis of spectroscopic data?

    Authors: Katarina Domijan

    Abstract: An empirical study was carried out to compare different implementations of ensemble models aimed at improving prediction in spectroscopic data. A wide range of candidate models were fitted to benchmark datasets from regression and classification settings. A statistical analysis using linear mixed model was carried out on prediction performance criteria resulting from model fits over random splits…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 14 pages, 8 figures, 1 algorithm

    Journal ref: Chemometrics and Intelligent Laboratory Systems, Volume 241, 2023, 104936, ISSN 0169-7439

  3. Subjective assessment of the impact of a content adaptive optimiser for compressing 4K HDR content with AV1

    Authors: Vibhoothi, Angeliki Katsenou, François Pitié, Katarina Domijan, Anil Kokaram

    Abstract: Since 2015 video dimensionality has expanded to higher spatial and temporal resolutions and a wider colour gamut. This High Dynamic Range (HDR) content has gained traction in the consumer space as it delivers an enhanced quality of experience. At the same time, the complexity of codecs is growing. This has driven the development of tools for content-adaptive optimisation that achieve optimal rate-…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted Camera-ready version for the ICIP 2023 Paper

  4. arXiv:2210.04479

    q-bio.QM stat.AP

    Classification of cow diet based on milk mid infrared spectra: a data analysis competition at the "International workshop of spectroscopy and chemometrics 2022"

    Authors: Maria Frizzarin, Giulio Visentin, Alessandro Ferragina, Elena Hayes, Antonio Bevilacqua, Bhaskar Dhariyal, Katarina Domijan, Hussain Khan, Georgiana Ifrim, Thach Le Nguyen, Joe Meagher, Laura Menchetti, Ashish Singh, Suzy Whoriskey, Robert Williamson, Martina Zappaterra, Alessandro Casa

    Abstract: In April 2022, the Vistamilk SFI Research Centre organized the second edition of the "International Workshop on Spectroscopy and Chemometrics - Applications in Food and Agriculture". Within this event, a data challenge was organized among participants of the workshop. Such data competition aimed at developing a prediction model to discriminate dairy cows' diet based on milk spectral information co…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 27 pages, 9 figures

  5. arXiv:2204.07207

    stat.ME cs.LG

    Hierarchical Embedded Bayesian Additive Regression Trees

    Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

    Abstract: We propose a simple yet powerful extension of Bayesian Additive Regression Trees which we name Hierarchical Embedded BART (HE-BART). The model allows for random effects to be included at the terminal node level of a set of regression trees, making HE-BART a non-parametric alternative to mixed effects models which avoids the need for the user to specify the structure of the random effects in the mo…

    Submitted 24 April, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

  6. Mid infrared spectroscopy and milk quality traits: a data analysis competition at the "International Workshop on Spectroscopy and Chemometrics 2021"

    Authors: Maria Frizzarin, Antonio Bevilacqua, Bhaskar Dhariyal, Katarina Domijan, Federico Ferraccioli, Elena Hayes, Georgiana Ifrim, Agnieszka Konkolewska, Thach Le Nguyen, Uche Mbaka, Giovanna Ranzato, Ashish Singh, Marco Stefanucci, Alessandro Casa

    Abstract: A chemometric data analysis challenge has been arranged during the first edition of the "International Workshop on Spectroscopy and Chemometrics", organized by the Vistamilk SFI Research Centre and held online in April 2021. The aim of the competition was to build a calibration model in order to predict milk quality traits exploiting the information contained in mid-infrared spectra only. Three di…

    Submitted 19 September, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: 17 pages, 6 figures, 6 tables

    Journal ref: Chemometrics and Intelligent Laboratory Systems, 2021, Volume 219, 104442

  7. arXiv:2101.06986

    stat.ML cs.LG

    Interactive slice visualization for exploring machine learning models

    Authors: Catherine B. Hurley, Mark O'Connell, Katarina Domijan

    Abstract: Machine learning models fit complex algorithms to arbitrarily large datasets. These algorithms are well-known to be high on performance and low on interpretability. We use interactive visualization of slices of predictor space to address the interpretability deficit; in effect opening up the black-box of machine learning algorithms, for the purpose of interrogating, explaining, validating and comp…

    Submitted 7 September, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: 35 pages, 11 figures

  8. arXiv:2006.07515

    stat.ML cs.IR cs.LG

    Generalizing Gain Penalization for Feature Selection in Tree-based Models

    Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

    Abstract: We develop a new approach for feature selection via gain penalization in tree-based models. First, we show that previous methods do not perform sufficient regularization and often exhibit sub-optimal out-of-sample performance, especially when correlated features are present. Instead, we develop a new gain penalization idea that exhibits a general local-global regularization for tree-based models.…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 13 pages, 2 figures

  9. arXiv:1905.07302

    stat.ML cs.LG eess.SP

    Comparison of Machine Learning Models in Food Authentication Studies

    Authors: Manokamna Singh, Katarina Domijan

    Abstract: The underlying objective of food authentication studies is to determine whether unknown food samples have been correctly labelled. In this paper we study three near infrared (NIR) spectroscopic datasets from food samples of different types: meat samples (labelled by species), olive oil samples (labelled by their geographic origin) and honey samples (labelled as pure or adulterated by different adu…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: Accepted for 2019 30th Irish Signals and Systems Conference (ISSC)

  10. arXiv:1812.02652

    astro-ph.SR

    Solar flare forecasting from magnetic feature properties generated by Solar Monitor Active Region Tracker

    Authors: Katarina Domijan, D. Shaun Bloomfield, Francois Pitie

    Abstract: We study the predictive capabilities of magnetic feature properties (MF) generated by Solar Monitor Active Region Tracker (SMART) for solar flare forecasting from two datasets: the full dataset of SMART detections from 1996 to 2010 that has been previously studied by Ahmed et al. (2011) and a subset of that dataset which only includes detections that are NOAA active regions (ARs). Main contributio…

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: Accepted for publication in Solar Physics. 22 pages, 6 figures, 8 tables

  11. arXiv:1610.00290

    stat.OT

    Conditional Visualization for Statistical Models: An Introduction to the condvis Package in R

    Authors: Mark O'Connell, Catherine B. Hurley, Katarina Domijan

    Abstract: The condvis package is for interactive visualization of sections in data space, showing fitted models on the section, and observed data near the section. The primary goal is the interpretation of complex models, and showing how the observed data support the fitted model. There is a video accompaniment to this paper available at https://www.youtube.com/watch?v=rKFq7xwgdX0. This is a preprint versio…

    Submitted 2 October, 2016; originally announced October 2016.