Showing 1–6 of 6 results for author: Moran, S

Searching in archive eess.
  1. arXiv:2403.19509  [pdf]

    cs.CL cs.SD eess.AS

    Phonetic Segmentation of the UCLA Phonetics Lab Archive

    Authors: Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

    Abstract: Research in speech technologies and comparative linguistics depends on access to diverse and accessible speech data. The UCLA Phonetics Lab Archive is one of the earliest multilingual speech corpora, with long-form audio recordings and phonetic transcriptions for 314 languages (Ladefoged et al., 2009). Recently, 95 of these languages were time-aligned with word-level phonetic transcriptions (Li et…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  2. arXiv:2301.02214  [pdf, other]

    eess.AS cs.SD

    Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks

    Authors: Zifan Jiang, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira, Steven Moran

    Abstract: We present a novel approach to automatically detect and classify great ape calls from continuous raw audio recordings collected during field research. Our method leverages deep pretrained and sequential neural networks, including wav2vec 2.0 and LSTM, and is validated on three data sets from three different great ape lineages (orangutans, chimpanzees, and bonobos). The recordings were collected by…

    Submitted 21 June, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: This paper is published as: Jiang, Zifan, Adrian Soldati, Isaac Schamberg, Adriano R. Lameira and Steven Moran. Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks. In Proceedings of the 20th International Congress of Phonetic Sciences (ICPhS 2023), 3100-3104, Prague, Czech Republic (https://guarant.cz/icphs2023/508.pdf)

  3. arXiv:2203.13680  [pdf, other]

    eess.IV cs.CV cs.LG

    ST-FL: Style Transfer Preprocessing in Federated Learning for COVID-19 Segmentation

    Authors: Antonios Georgiadis, Varun Babbar, Fran Silavong, Sean Moran, Rob Otter

    Abstract: Chest Computational Tomography (CT) scans present low cost, speed and objectivity for COVID-19 diagnosis and deep learning methods have shown great promise in assisting the analysis and interpretation of these images. Most hospitals or countries can train their own models using in-house data, however empirical evidence shows that those models perform poorly when tested on new unseen cases, surfaci…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 5 pages, 1 figure, full version (15 pages, 13 figures) to be published in SPIE: Medical Imaging 2022 Proceedings

  4. arXiv:2007.09187  [pdf, other]

    eess.IV

    Low Light Video Enhancement using Synthetic Data Produced with an Intermediate Domain Mapping

    Authors: Danai Triantafyllidou, Sean Moran, Steven McDonagh, Sarah Parisot, Gregory Slabaugh

    Abstract: Advances in low-light video RAW-to-RGB translation are opening up the possibility of fast low-light imaging on commodity devices (e.g. smartphone cameras) without the need for a tripod. However, it is challenging to collect the required paired short-long exposure frames to learn a supervised mapping. Current approaches require a specialised rig or the use of static videos with no subject or object…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted to ECCV 2020

  5. arXiv:1911.13175  [pdf, other]

    eess.IV cs.CV stat.ML

    CURL: Neural Curve Layers for Global Image Enhancement

    Authors: Sean Moran, Steven McDonagh, Gregory Slabaugh

    Abstract: We present a novel approach to adjust global image properties such as colour, saturation, and luminance using human-interpretable image enhancement curves, inspired by the Photoshop curves tool. Our method, dubbed neural CURve Layers (CURL), is designed as a multi-colour space neural retouching block trained jointly in three different colour spaces (HSV, CIELab, RGB) guided by a novel multi-colour…

    Submitted 23 October, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: Accepted to ICPR 2020

  6. arXiv:1909.05249  [pdf, other]

    eess.IV cs.CV

    NODE: Extreme Low Light Raw Image Denoising using a Noise Decomposition Network

    Authors: Hao Guan, Liu Liu, Sean Moran, Fenglong Song, Gregory Slabaugh

    Abstract: Denoising extreme low light images is a challenging task due to the high noise level. When the illumination is low, digital cameras increase the ISO (electronic gain) to amplify the brightness of captured data. However, this in turn amplifies the noise, arising from read, shot, and defective pixel sources. In the raw domain, read and shot noise are effectively modelled using Gaussian and Poisson d…

    Submitted 11 September, 2019; originally announced September 2019.