-
TCuPGAN: A novel framework developed for optimizing human-machine interactions in citizen science
Authors:
Ramanakumar Sankar,
Kameswara Mantha,
Lucy Fortson,
Helen Spiers,
Thomas Pengo,
Douglas Mashek,
Myat Mo,
Mark Sanders,
Trace Christensen,
Jeffrey Salisbury,
Laura Trouille
Abstract:
In the era of big data in scientific research, there is a necessity to leverage techniques which reduce human effort in labeling and categorizing large datasets by involving sophisticated machine tools. To combat this problem, we present a novel, general purpose model for 3D segmentation that leverages patch-wise adversariality and Long Short-Term Memory to encode sequential information. Using thi…
▽ More
In the era of big data in scientific research, there is a necessity to leverage techniques which reduce human effort in labeling and categorizing large datasets by involving sophisticated machine tools. To combat this problem, we present a novel, general purpose model for 3D segmentation that leverages patch-wise adversariality and Long Short-Term Memory to encode sequential information. Using this model alongside citizen science projects which use 3D datasets (image cubes) on the Zooniverse platforms, we propose an iterative human-machine optimization framework where only a fraction of the 2D slices from these cubes are seen by the volunteers. We leverage the patch-wise discriminator in our model to provide an estimate of which slices within these image cubes have poorly generalized feature representations, and correspondingly poor machine performance. These images with corresponding machine proposals would be presented to volunteers on Zooniverse for correction, leading to a drastic reduction in the volunteer effort on citizen science projects. We trained our model on ~2300 liver tissue 3D electron micrographs. Lipid droplets were segmented within these images through human annotation via the `Etch A Cell - Fat Checker' citizen science project, hosted on the Zooniverse platform. In this work, we demonstrate this framework and the selection methodology which resulted in a measured reduction in volunteer effort by more than 60%. We envision this type of joint human-machine partnership will be of great use on future Zooniverse projects.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Capturing and Explaining Trajectory Singularities using Composite Signal Neural Networks
Authors:
Hippolyte Dubois,
Patrick Le Callet,
Michael Hornberger,
Hugo J. Spiers,
Antoine Coutrot
Abstract:
Spatial trajectories are ubiquitous and complex signals. Their analysis is crucial in many research fields, from urban planning to neuroscience. Several approaches have been proposed to cluster trajectories. They rely on hand-crafted features, which struggle to capture the spatio-temporal complexity of the signal, or on Artificial Neural Networks (ANNs) which can be more efficient but less interpr…
▽ More
Spatial trajectories are ubiquitous and complex signals. Their analysis is crucial in many research fields, from urban planning to neuroscience. Several approaches have been proposed to cluster trajectories. They rely on hand-crafted features, which struggle to capture the spatio-temporal complexity of the signal, or on Artificial Neural Networks (ANNs) which can be more efficient but less interpretable. In this paper we present a novel ANN architecture designed to capture the spatio-temporal patterns characteristic of a set of trajectories, while taking into account the demographics of the navigators. Hence, our model extracts markers linked to both behaviour and demographics. We propose a composite signal analyser (CompSNN) combining three simple ANN modules. Each of these modules uses different signal representations of the trajectory while remaining interpretable. Our CompSNN performs significantly better than its modules taken in isolation and allows to visualise which parts of the signal were most useful to discriminate the trajectories.
△ Less
Submitted 7 May, 2020; v1 submitted 24 March, 2020;
originally announced March 2020.
-
A Survey of Crowdsourcing in Medical Image Analysis
Authors:
Silas Ørting,
Andrew Doyle,
Arno van Hilten,
Matthias Hirth,
Oana Inel,
Christopher R. Madan,
Panagiotis Mavridis,
Helen Spiers,
Veronika Cheplygina
Abstract:
Rapid advances in image processing capabilities have been seen across many domains, fostered by the application of machine learning algorithms to "big-data". However, within the realm of medical image analysis, advances have been curtailed, in part, due to the limited availability of large-scale, well-annotated datasets. One of the main reasons for this is the high cost often associated with produ…
▽ More
Rapid advances in image processing capabilities have been seen across many domains, fostered by the application of machine learning algorithms to "big-data". However, within the realm of medical image analysis, advances have been curtailed, in part, due to the limited availability of large-scale, well-annotated datasets. One of the main reasons for this is the high cost often associated with producing large amounts of high-quality meta-data. Recently, there has been growing interest in the application of crowdsourcing for this purpose; a technique that has proven effective for creating large-scale datasets across a range of disciplines, from computer vision to astrophysics. Despite the growing popularity of this approach, there has not yet been a comprehensive literature review to provide guidance to researchers considering using crowdsourcing methodologies in their own medical imaging analysis. In this survey, we review studies applying crowdsourcing to the analysis of medical images, published prior to July 2018. We identify common approaches, challenges and considerations, providing guidance of utility to researchers adopting this approach. Finally, we discuss future opportunities for development within this emerging domain.
△ Less
Submitted 4 September, 2019; v1 submitted 25 February, 2019;
originally announced February 2019.
-
A transient search using combined human and machine classifications
Authors:
Darryl E. Wright,
Chris J. Lintott,
Stephen J. Smartt,
Ken W. Smith,
Lucy Fortson,
Laura Trouille,
Campbell R. Allen,
Melanie Beck,
Mark C. Bouslog,
Amy Boyer,
K. C. Chambers,
Heather Flewelling,
Will Granger,
Eugene A. Magnier,
Adam McMaster,
Grant R. M. Miller,
James E. O'Donnell,
Helen Spiers,
John L. Tonry,
Marten Veldthuis,
Richard J. Wainscoat,
Chris Waters,
Mark Willman,
Zach Wolfenbarger,
Dave R. Young
Abstract:
Large modern surveys require efficient review of data in order to find transient sources such as supernovae, and to distinguish such sources from artefacts and noise. Much effort has been put into the development of automatic algorithms, but surveys still rely on human review of targets. This paper presents an integrated system for the identification of supernovae in data from Pan-STARRS1, combini…
▽ More
Large modern surveys require efficient review of data in order to find transient sources such as supernovae, and to distinguish such sources from artefacts and noise. Much effort has been put into the development of automatic algorithms, but surveys still rely on human review of targets. This paper presents an integrated system for the identification of supernovae in data from Pan-STARRS1, combining classifications from volunteers participating in a citizen science project with those from a convolutional neural network. The unique aspect of this work is the deployment, in combination, of both human and machine classifications for near real-time discovery in an astronomical project. We show that the combination of the two methods outperforms either one used individually. This result has important implications for the future development of transient searches, especially in the era of LSST and other large-throughput surveys.
△ Less
Submitted 17 July, 2017;
originally announced July 2017.