-
Exploring the use of a Large Language Model for data extraction in systematic reviews: a rapid feasibility study
Authors:
Lena Schmidt,
Kaitlyn Hair,
Sergio Graziozi,
Fiona Campbell,
Claudia Kapp,
Alireza Khanteymoori,
Dawn Craig,
Mark Engelbert,
James Thomas
Abstract:
This paper describes a rapid feasibility study of using GPT-4, a large language model (LLM), to (semi)automate data extraction in systematic reviews. Despite the recent surge of interest in LLMs, there is still a lack of understanding of how to design LLM-based automation tools and how to robustly evaluate their performance. During the 2023 Evidence Synthesis Hackathon we conducted two feasibility studies. In the first, we automatically extracted study characteristics from human clinical, animal, and social science domain studies, using two studies from each category for prompt development and ten for evaluation. In the second, we used the LLM to predict Participants, Interventions, Controls and Outcomes (PICOs) labelled within 100 abstracts in the EBM-NLP dataset. Overall, results indicated an accuracy of around 80%, with some variability between domains (82% for human clinical, 80% for animal, and 72% for studies of human social sciences). Causal inference methods and study design were the data extraction items with the most errors. In the PICO study, participants and intervention/control showed high accuracy (>80%), while outcomes were more challenging. Evaluation was done manually; scoring methods such as BLEU and ROUGE showed limited value. We observed variability in the LLM's predictions and changes in response quality. This paper presents a template for future evaluations of LLMs in the context of data extraction for systematic review automation. Our results show that there might be value in using LLMs, for example as second or third reviewers. However, caution is advised when integrating models such as GPT-4 into tools. Further research on stability and reliability in practical settings is warranted for each type of data that is processed by the LLM.
Submitted 23 May, 2024;
originally announced May 2024.
-
Kpc-Scale Neutral Iron K$α$ Emission in the Starburst-AGN NGC 4945: a Relic AGN Outflow?
Authors:
Kimberly A. Weaver,
Jenna M. Cann,
Lynne Valencic,
Ryan W. Pfeifle,
K. D. Kuntz,
Joel F. Campbell,
Kimberly Engle,
Ryan Tanner,
Edmund Hodges-Kluck,
Isabella Carlton,
Miranda McCarthy
Abstract:
NGC 4945 contains a well-known heavily obscured active galactic nucleus (AGN) at its core, with prior reports of strong nuclear and off-nuclear neutral Fe K$α$ emission due to the AGN activity. We report the discovery of very extended Fe K$α$ emission with the XMM-Newton EPIC pn in a $\sim5$ kpc by $\sim10$ kpc region that is misaligned with the plane of the inclined optical galaxy disk by $\sim60$ degrees in projection. After a careful consideration of the crowded center of the galaxy and numerous unresolved hard X-ray sources present, we estimate that $\sim15$% of the Fe K$α$ is extended on kpc-sized scales. The overall size and misalignment of the region follows an unusual pattern of radio polarization that is not typical of starbursts or normal disk galaxies but has been interpreted as possibly due to AGN activity. We suggest that the extended Fe K$α$ emission arose from a period of AGN eruption several million years ago - a relic of a past AGN ejection episode.
Submitted 4 April, 2024;
originally announced April 2024.
-
Query Refinement for Diverse Top-$k$ Selection
Authors:
Felix S. Campbell,
Alon Silberstein,
Julia Stoyanovich,
Yuval Moskovitch
Abstract:
Database queries are often used to select and rank items as decision support for many applications. As automated decision-making tools become more prevalent, there is a growing recognition of the need to diversify their outcomes. In this paper, we define and study the problem of modifying the selection conditions of an ORDER BY query so that the result of the modified query closely fits some user-defined notion of diversity while simultaneously maintaining the intent of the original query. We show the hardness of this problem and propose a Mixed Integer Linear Programming (MILP) based solution. We further present optimizations designed to enhance the scalability and applicability of the solution in real-life scenarios. We investigate the performance characteristics of our algorithm and show its efficiency and the usefulness of our optimizations.
Submitted 27 March, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models
Authors:
Joseph Goodier,
Neill D. F. Campbell
Abstract:
Out-of-Distribution detection between dataset pairs has been extensively explored with generative models. We show that likelihood-based Out-of-Distribution detection can be extended to diffusion models by leveraging the fact that they, like other likelihood-based generative models, are dramatically affected by the input sample complexity. Currently, all Out-of-Distribution detection methods with Diffusion Models are reconstruction-based. We propose a new likelihood ratio for Out-of-Distribution detection with Deep Denoising Diffusion Models, which we call the Complexity Corrected Likelihood Ratio. Our likelihood ratio is constructed using Evidence Lower-Bound evaluations from an individual model at various noising levels. We present results that are comparable to state-of-the-art Out-of-Distribution detection methods with generative models.
Submitted 26 October, 2023;
originally announced October 2023.
-
The Robust Semantic Segmentation UNCV2023 Challenge Results
Authors:
Xuanlong Yu,
Yi Zuo,
Zitao Wang,
Xiaowen Zhang,
Jiaxuan Zhao,
Yuting Yang,
Licheng Jiao,
Rui Peng,
Xinyi Wang,
Junpei Zhang,
Kexin Zhang,
Fang Liu,
Roberto Alcover-Couso,
Juan C. SanMiguel,
Marcos Escudero-Viñolo,
Hanlin Tian,
Kenta Matsui,
Tianhao Wang,
Fahmy Adan,
Zhitong Gao,
Xuming He,
Quentin Bouniot,
Hossein Moghaddam,
Shyam Nandan Rai,
Fabio Cermelli
, et al. (12 additional authors not shown)
Abstract:
This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty quantification methodologies presented at prominent conferences in the fields of computer vision and machine learning and journals over the past few years. Within this document, the challenge is introduced, shedding light on its purpose and objectives, which primarily revolved around enhancing the robustness of semantic segmentation in urban scenes under varying natural adversarial conditions. The report then delves into the top-performing solutions. Moreover, the document aims to provide a comprehensive overview of the diverse solutions deployed by all participants. By doing so, it seeks to offer readers a deeper insight into the array of strategies that can be leveraged to effectively handle the inherent uncertainties associated with autonomous driving and semantic segmentation, especially within urban environments.
Submitted 27 September, 2023;
originally announced September 2023.
-
Numerical and Experimental Study on the Addition of Surface Roughness to Micro-Propellers
Authors:
Justin P Cooke,
Matthew F Campbell,
Edward B Steager,
Igor Bargatin,
Mark H Yim,
George I Park
Abstract:
Micro aerial vehicles are making a large impact in applications such as search-and-rescue, package delivery, and recreation. Unfortunately, these diminutive drones are currently constrained to carrying small payloads, in large part because they use propellers optimized for larger aircraft and inviscid flow regimes. Fully realizing the potential of emerging microflyers requires next-generation propellers that are specifically designed for low-Reynolds number conditions and that include new features advantageous in highly viscous flows. One aspect that has received limited attention in the literature is the addition of roughness to propeller blades as a method of reducing drag and increasing thrust. To investigate this possibility, we used large eddy simulation to conduct a numerical investigation of smooth and rough propellers. Our results indicate that roughness produces a 2% increase in thrust and a 5% decrease in power relative to a baseline smooth propeller operating at the same Reynolds number of $Re_c = 6500$, held constant by rotational speed. We corroborated our numerical findings using thrust-stand-based experiments of 3D-printed propellers identical to those of the numerical simulations. Our study confirms that surface roughness is an additional parameter within the design space for micro-propellers that will lead to unprecedented drone efficiencies and payloads.
Submitted 29 June, 2023;
originally announced June 2023.
-
Photophoretic Light-flyers with Germanium Coatings as Selective Absorbers
Authors:
Zhipeng Lu,
Gulzhan Aldan,
Danielle Levin,
Matthew F. Campbell,
Igor Bargatin
Abstract:
The goal of ultrathin lightweight photophoretic flyers, or light-flyers for short, is to levitate continuously in Earth's upper atmosphere using only sunlight for propulsive power. We previously reported light-flyers that levitated by utilizing differences in thermal accommodation coefficient (TAC) between the top and bottom of a thin film, made possible by coating their lower surfaces with carbon nanotubes (CNTs). Such designs, though successful, were limited due to their high thermal emissivity (>0.5), which prevented them from achieving high temperatures and resulted in their transferring relatively low amounts of momentum to the surrounding gas. To address this issue, we have developed light-flyers with undoped germanium layers that selectively absorb nearly 80% of visible light but are mostly transparent in the thermal infrared, with an average thermal emissivity of <0.1. Our experiments show that germanium-coated light-flyers could levitate at up to 43% lower light irradiances than mylar-CNT disks with identical sizes. In addition, we simulated our experiments using a combined first-principles-empirical model, allowing us to predict that our 2-cm-diameter disk-shaped germanium-coated light-flyers can levitate in the mesosphere (altitudes 68-78 km) under natural sunlight (1.36 kW/m$^2$). Similar ultrathin selective-absorber coatings can also be applied to three-dimensional light-flyers shaped like solar balloons, allowing them to carry significant payloads and thereby revolutionize long-term atmospheric exploration of Earth or Mars.
Submitted 30 May, 2023;
originally announced May 2023.
-
Compressed Sensing MRI Reconstruction Regularized by VAEs with Structured Image Covariance
Authors:
Margaret Duff,
Ivor J. A. Simpson,
Matthias J. Ehrhardt,
Neill D. F. Campbell
Abstract:
Objective: This paper investigates how generative models, trained on ground-truth images, can be used as priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned regularization will provide complex data-driven priors to inverse problems while still retaining the control and insight of a variational regularization method. Moreover, unsupervised learning, without paired training data, allows the learned regularizer to remain flexible to changes in the forward problem such as noise level, sampling pattern or coil sensitivities in MRI.
Approach: We utilize variational autoencoders (VAEs) that generate not only an image but also a covariance uncertainty matrix for each image. The covariance can model changing uncertainty dependencies caused by structure in the image, such as edges or objects, and provides a new distance metric from the manifold of learned images.
Main results: We evaluate these novel generative regularizers on retrospectively sub-sampled real-valued MRI measurements from the fastMRI dataset. We compare our proposed learned regularization against other unlearned regularization approaches and unsupervised and supervised deep learning methods.
Significance: Our results show that the proposed method is competitive with other state-of-the-art methods and behaves consistently with changing sampling patterns and noise levels.
Submitted 16 June, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Analysing Training-Data Leakage from Gradients through Linear Systems and Gradient Matching
Authors:
Cangxiong Chen,
Neill D. F. Campbell
Abstract:
Recent works have demonstrated that it is possible to reconstruct training images and their labels from gradients of an image-classification model when its architecture is known. Unfortunately, there is still an incomplete theoretical understanding of the efficacy and failure of these gradient-leakage attacks. In this paper, we propose a novel framework to analyse training-data leakage from gradients that draws insights from both analytic and optimisation-based gradient-leakage attacks. We formulate the reconstruction problem as solving a linear system from each layer iteratively, accompanied by corrections using gradient matching. Under this framework, we claim that the solubility of the reconstruction problem is primarily determined by that of the linear system at each layer. As a result, we are able to partially attribute the leakage of the training data in a deep network to its architecture. We also propose a metric to measure the level of security of a deep learning model against gradient-based attacks on the training data.
Submitted 20 October, 2022;
originally announced October 2022.
-
Longitudinal Acoustic Speech Tracking Following Pediatric Traumatic Brain Injury
Authors:
Camille Noufi,
Adam C. Lammert,
Daryush D. Mehta,
James R. Williamson,
Gregory Ciccarelli,
Douglas Sturim,
Jordan R. Green,
Thomas F. Quatieri,
Thomas F. Campbell
Abstract:
Recommendations for common outcome measures following pediatric traumatic brain injury (TBI) support the integration of instrumental measurements alongside perceptual assessment in recovery and treatment plans. A comprehensive set of sensitive, robust and non-invasive measurements is therefore essential in assessing variations in speech characteristics over time following pediatric TBI. In this article, we study the changes in the acoustic speech patterns of a pediatric cohort of ten subjects diagnosed with severe TBI. We extract a diverse set of both well-known and novel acoustic features from child speech recorded throughout the year after the child produced intelligible words. These features are analyzed individually and by speech subsystem, within-subject and across the cohort. As a group, older children exhibit highly significant (p<0.01) increases in pitch variation and phoneme diversity, shortened pause length, and steadying articulation rate variability. Younger children exhibit similar steadied rate variability alongside an increase in formant-based articulation complexity. Correlation analysis of the feature set with age and comparisons to normative developmental data confirm that age at injury plays a significant role in framing the recovery trajectory. Nearly all speech features significantly change (p<0.05) for the cohort as a whole, confirming that acoustic measures supplementing perceptual assessment are needed to identify efficacious treatment targets for speech therapy following TBI.
Submitted 9 September, 2022;
originally announced September 2022.
-
Minimizing the Ground Effect for Photophoretically Levitating Disks
Authors:
Zhipeng Lu,
Miranda Stern,
**qiao Li,
David Candia,
Lorenzo Yao-Bate,
Thomas J. Celenza,
Mohsen Azadi,
Matthew F. Campbell,
Igor Bargatin
Abstract:
Photophoretic levitation is a propulsion mechanism in which lightweight objects can be lifted and controlled through their interactions with light. Since photophoretic forces on macroscopic objects are usually maximized at low pressures, they may be tested in vacuum chambers in close proximity to the chamber floor and walls. We report here experimental evidence that the terrain under levitating microflyers, including the chamber floor or the launchpad from which microflyers lift off, can greatly increase the photophoretic lift forces relative to their free-space (mid-air) values. To characterize this so-called "ground effect" during vacuum chamber tests, we introduced a new miniature launchpad composed of three J-shaped (candy-cane-like) wires that minimized a microflyer's extraneous interactions with underlying surfaces. We compared our new launchpads to previously used wire-mesh launchpads for simple levitating mylar-based disks with diameters of 2, 4, and 8 cm. Importantly, wire-mesh launchpads increased the photophoretic lift force by up to sixfold. A significant ground effect was also associated with the bottom of the vacuum chamber, particularly when the distance to the bottom surface was less than the diameter of the levitating disk. We provide guidelines to minimize the ground effect in vacuum chamber experiments, which are necessary to test photophoretic microflyers intended for high-altitude exploration and surveillance on Earth or on Mars.
Submitted 1 September, 2022;
originally announced September 2022.
-
Learning Structured Gaussians to Approximate Deep Ensembles
Authors:
Ivor J. A. Simpson,
Sara Vicente,
Neill D. F. Campbell
Abstract:
This paper proposes using a sparse-structured multivariate Gaussian to provide a closed-form approximator for the output of probabilistic ensemble models used for dense image prediction tasks. This is achieved through a convolutional neural network that predicts the mean and covariance of the distribution, where the inverse covariance is parameterised by a sparsely structured Cholesky matrix. Similarly to distillation approaches, our single network is trained to maximise the probability of samples from pre-trained probabilistic models; in this work we use a fixed ensemble of networks. Once trained, our compact representation can be used to efficiently draw spatially correlated samples from the approximated output distribution. Importantly, this approach captures the uncertainty and structured correlations in the predictions explicitly in a formal distribution, rather than implicitly through sampling alone. This allows direct introspection of the model, enabling visualisation of the learned structure. Moreover, this formulation provides two further benefits: estimation of a sample probability, and the introduction of arbitrary spatial conditioning at test time. We demonstrate the merits of our approach on monocular depth estimation and show that the advantages of our approach are obtained with comparable quantitative performance.
Submitted 29 March, 2022;
originally announced March 2022.
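The abstract of "Learning Structured Gaussians to Approximate Deep Ensembles" mentions two operations on a Gaussian whose inverse covariance is parameterised by a Cholesky factor: drawing correlated samples and estimating a sample probability. The following is a minimal sketch of both, not the authors' implementation. It assumes the precision matrix is written as L Lᵀ with L lower-triangular and positive on the diagonal, uses small dense lists in place of the sparse per-pixel structure a network would predict, and all function names are hypothetical.

```python
import math
import random


def sample_from_precision_cholesky(mean, chol, rng):
    """Draw one sample from N(mean, (L L^T)^{-1}).

    `chol` is the lower-triangular factor L as a list of rows (an assumed
    layout for this sketch; the paper uses a sparse structure).
    """
    n = len(mean)
    z = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # Back-substitution for L^T x = z. Since (L^T)[i][j] = L[j][i], the
    # system is upper-triangular and Cov(x) = L^{-T} L^{-1} = (L L^T)^{-1}.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        acc = z[i] - sum(chol[j][i] * x[j] for j in range(i + 1, n))
        x[i] = acc / chol[i][i]
    return [m + xi for m, xi in zip(mean, x)]


def log_prob(sample, mean, chol):
    """Log density of `sample` under the same Gaussian."""
    n = len(mean)
    d = [s - m for s, m in zip(sample, mean)]
    # r = L^T d, so r.r equals the quadratic form d^T (L L^T) d.
    r = [sum(chol[j][i] * d[j] for j in range(i, n)) for i in range(n)]
    # det(L L^T) = prod(diag(L))^2, hence the factor of 2.
    log_det_precision = 2.0 * sum(math.log(chol[i][i]) for i in range(n))
    quad = sum(v * v for v in r)
    return 0.5 * log_det_precision - 0.5 * quad - 0.5 * n * math.log(2.0 * math.pi)


# Toy example: two "pixels" coupled through one off-diagonal entry.
rng = random.Random(0)
mean = [0.0, 0.0]
chol = [[1.0, 0.0], [-0.5, 1.0]]
samples = [sample_from_precision_cholesky(mean, chol, rng) for _ in range(20000)]
```

Both operations only need triangular solves and products with L, which is why a sparse factor keeps them cheap even for image-sized outputs.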
-
Efficient Answering of Historical What-if Queries
Authors:
Felix S. Campbell,
Bahareh Sadat Arab,
Boris Glavic
Abstract:
We introduce historical what-if queries, a novel type of what-if analysis that determines the effect of a hypothetical change to the transactional history of a database. For example, "how would revenue be affected if we would have charged an additional $6 for shipping?" Such queries may lead to more actionable insights than traditional what-if queries as their results can be used to inform future actions, e.g., increasing shipping fees. We develop efficient techniques for answering historical what-if queries, i.e., determining how a modified history affects the current database state. Our techniques are based on reenactment, a replay technique for transactional histories. We optimize this process using program and data slicing techniques that determine which updates and what data can be excluded from reenactment without affecting the result. Using an implementation of our techniques in Mahif (a Middleware for Answering Historical what-IF queries) we demonstrate their effectiveness experimentally.
Submitted 24 March, 2022;
originally announced March 2022.
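To make the idea of a historical what-if query from "Efficient Answering of Historical What-if Queries" concrete, here is a naive baseline sketch: replay the full history twice, once as recorded and once with one statement swapped, and report the rows that differ. This is deliberately not the paper's reenactment or slicing technique, which exists precisely to avoid such a full replay. The `Update` type, the `replay` and `what_if` functions, and the toy shipping-fee data are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]


@dataclass(frozen=True)
class Update:
    """One statement in the history: rewrite every row matching `condition`."""
    condition: Callable[[Row], bool]
    change: Callable[[Row], Row]


def replay(initial: List[Row], history: List[Update]) -> List[Row]:
    """Apply the updates in order to a copy of the initial state."""
    rows = [dict(r) for r in initial]
    for upd in history:
        rows = [upd.change(r) if upd.condition(r) else r for r in rows]
    return rows


def what_if(initial: List[Row], history: List[Update], position: int,
            replacement: Update) -> List[Tuple[Row, Row]]:
    """Return (actual, hypothetical) pairs for rows affected by the change.

    Rows are compared by position, which is valid here because updates
    never insert or delete rows.
    """
    actual = replay(initial, history)
    modified = history[:position] + [replacement] + history[position + 1:]
    hypothetical = replay(initial, modified)
    return [(a, h) for a, h in zip(actual, hypothetical) if a != h]


# Toy history: small orders pay a 5.0 shipping fee, then totals are computed.
initial = [
    {"price": 30.0, "fee": 0.0, "total": 0.0},
    {"price": 80.0, "fee": 0.0, "total": 0.0},
]
history = [
    Update(lambda r: r["price"] < 50, lambda r: {**r, "fee": 5.0}),
    Update(lambda r: True, lambda r: {**r, "total": r["price"] + r["fee"]}),
]
# Hypothetical change: charge an additional 6.0 for shipping.
replacement = Update(lambda r: r["price"] < 50, lambda r: {**r, "fee": 11.0})
changed = what_if(initial, history, 0, replacement)
```

In this toy run only the 30.0 order is affected, since the 80.0 order never matched the fee update. Identifying such untouched data and statements up front is the kind of saving the abstract attributes to data and program slicing.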
-
Understanding Training-Data Leakage from Gradients in Neural Networks for Image Classification
Authors:
Cangxiong Chen,
Neill D. F. Campbell
Abstract:
Federated learning of deep learning models for supervised tasks, e.g. image classification and segmentation, has found many applications: for example in human-in-the-loop tasks such as film post-production where it enables sharing of domain expertise of human artists in an efficient and effective fashion. In many such applications, we need to protect the training data from being leaked when gradients are shared in the training process due to IP or privacy concerns. Recent works have demonstrated that it is possible to reconstruct the training data from gradients for an image-classification model when its architecture is known. However, there is still an incomplete theoretical understanding of the efficacy and failure of such attacks. In this paper, we analyse the source of training-data leakage from gradients. We formulate the problem of training data reconstruction as solving an optimisation problem iteratively for each layer. The layer-wise objective function is primarily defined by weights and gradients from the current layer as well as the output from the reconstruction of the subsequent layer, but it might also involve a 'pull-back' constraint from the preceding layer. Training data can be reconstructed when we solve the problem backward from the output of the network through each layer. Based on this formulation, we are able to attribute the potential leakage of the training data in a deep network to its architecture. We also propose a metric to measure the level of security of a deep learning model against gradient-based attacks on the training data.
Submitted 19 November, 2021;
originally announced November 2021.
-
Aligned Multi-Task Gaussian Process
Authors:
Olga Mikheeva,
Ieva Kazlauskaite,
Adam Hartshorne,
Hedvig Kjellström,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
Multi-task learning requires accurate identification of the correlations between tasks. In real-world time-series, tasks are rarely perfectly temporally aligned; traditional multi-task models do not account for this and subsequent errors in correlation estimation will result in poor predictive performance and uncertainty quantification. We introduce a method that automatically accounts for temporal misalignment in a unified generative model that improves predictive performance. Our method uses Gaussian processes (GPs) to model the correlations both within and between the tasks. Building on the previous work by Kazlauskaite et al. [2019], we include a separate monotonic warp of the input data to model temporal misalignment. In contrast to previous work, we formulate a lower bound that accounts for uncertainty in both the estimates of the warping process and the underlying functions. Also, our new take on a monotonic stochastic process, with efficient path-wise sampling for the warp functions, allows us to perform full Bayesian inference in the model rather than MAP estimates. Missing data experiments, on synthetic and real time-series, demonstrate the advantages of accounting for misalignments (vs standard unaligned method) as well as modelling the uncertainty in the warping process (vs baseline MAP alignment approach).
Submitted 29 October, 2021;
originally announced October 2021.
-
Regularising Inverse Problems with Generative Machine Learning Models
Authors:
Margaret Duff,
Neill D. F. Campbell,
Matthias J. Ehrhardt
Abstract:
Deep neural network approaches to inverse imaging problems have produced impressive results in the last few years. In this paper, we consider the use of generative models in a variational regularisation approach to inverse problems. The considered regularisers penalise images that are far from the range of a generative model that has learned to produce images similar to a training dataset. We name this family generative regularisers. The success of generative regularisers depends on the quality of the generative model and so we propose a set of desired criteria to assess generative models and guide future research. In our numerical experiments, we evaluate three common generative models, autoencoders, variational autoencoders and generative adversarial networks, against our desired criteria. We also test three different generative regularisers on the inverse problems of deblurring, deconvolution, and tomography. We show that restricting solutions of the inverse problem to lie exactly in the range of a generative model can give good results but that allowing small deviations from the range of the generator produces more consistent results.
Submitted 18 June, 2022; v1 submitted 22 July, 2021;
originally announced July 2021.
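The contrast drawn in the abstract of "Regularising Inverse Problems with Generative Machine Learning Models", between restricting solutions to the range of the generator and allowing small deviations from it, can be written as two variational problems. The notation below is an assumption made for illustration (forward operator $A$, measurements $y$, generator $G$, latent code $z$, weights $\lambda, \mu > 0$), and the paper's exact functionals may differ.

```latex
% Assumed notation: A forward operator, y measurements, G generator, z latent code.
\begin{align}
  \text{(hard constraint)}\quad
    & \hat{x} = G(\hat{z}), \qquad
      \hat{z} \in \arg\min_{z}\; \tfrac{1}{2}\,\|A\,G(z) - y\|_2^2 + \mu\,\|z\|_2^2, \\
  \text{(soft constraint)}\quad
    & \hat{x} \in \arg\min_{x}\; \tfrac{1}{2}\,\|A x - y\|_2^2
      + \lambda \min_{z}\Big( \|x - G(z)\|_2^2 + \mu\,\|z\|_2^2 \Big).
\end{align}
```

In the first problem the reconstruction lies exactly in the range of $G$. In the second, $x$ may leave the range at a cost controlled by $\lambda$, which corresponds to the small deviations the abstract reports as giving more consistent results.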
-
Multi-scale photonic emissivity engineering for relativistic lightsail thermal regulation
Authors:
John Brewer,
Matthew F. Campbell,
Pawan Kumar,
Sachin Kulkarni,
Deep Jariwala,
Igor Bargatin,
Aaswath P. Raman
Abstract:
The Breakthrough Starshot Initiative aims to send a gram-scale probe to Proxima Centuri B using a laser-accelerated lightsail traveling at relativistic speeds. Thermal management is a key lightsail design objective because of the intense laser powers required but has generally been considered secondary to accelerative performance. Here, we demonstrate nanophotonic photonic crystal slab reflectors composed of 2H-phase molybdenum disulfide and crystalline silicon nitride, highlight the inverse relationship between the thermal band extinction coefficient and the lightsail's maximum temperature, and examine the trade-off between the acceleration distance and setting realistic sail thermal limits, ultimately realizing a thermally endurable acceleration minimum distance of 16.3~Gm. We additionally demonstrate multi-scale photonic structures featuring thermal-wavelength-scale Mie resonant geometries, and characterize their broadband Mie resonance-driven emissivity enhancement and acceleration distance reduction. Our results highlight new possibilities in simultaneously controlling optical and thermal response over broad wavelength ranges in ultralight nanophotonic structures.
△ Less
Submitted 14 September, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Relativistic light sails need to billow
Authors:
Matthew F. Campbell,
John Brewer,
Deep Jariwala,
Aaswath Raman,
Igor Bargatin
Abstract:
We argue that light sails that are rapidly accelerated to relativistic velocities by lasers must be significantly curved in order to reduce their mechanical stresses and avoid tears. Using an integrated opto-thermo-mechanical model, we show that the diameter and radius of curvature of a circular light sail should be comparable in magnitude, both on the order of a few meters in optimal designs for gram-scale payloads. Moreover, when sufficient laser power is available, a sail's acceleration length decreases and its chip payload capacity increases as its curvature increases. Our findings provide guidance for emerging light sail design programs, which herald a new era of interstellar space exploration.
Submitted 22 May, 2021;
originally announced May 2021.
-
Interpretable Visualization and Higher-Order Dimension Reduction for ECoG Data
Authors:
Kelly Geyer,
Frederick Campbell,
Andersen Chang,
John Magnotti,
Michael Beauchamp,
Genevera I. Allen
Abstract:
ElectroCOrticoGraphy (ECoG) technology measures electrical activity in the human brain via electrodes placed directly on the cortical surface during neurosurgery. Through its capability to record activity at a fast temporal resolution, ECoG experiments have allowed scientists to better understand how the human brain processes speech. By its nature, ECoG data is difficult for neuroscientists to directly interpret for two major reasons. Firstly, ECoG data tends to be large in size, as each individual experiment yields data up to several gigabytes. Secondly, ECoG data has a complex, higher-order nature. After signal processing, this type of data may be organized as a 4-way tensor with dimensions representing trials, electrodes, frequency, and time. In this paper, we develop an interpretable dimension reduction approach called Regularized Higher Order Principal Components Analysis, as well as an extension to Regularized Higher Order Partial Least Squares, that allows neuroscientists to explore and visualize ECoG data. Our approach employs a sparse and functional Candecomp-Parafac (CP) decomposition that incorporates sparsity to select relevant electrodes and frequency bands, as well as smoothness over time and frequency, yielding directly interpretable factors. We demonstrate the performance and interpretability of our method with an ECoG case study on audio and visual processing of human speech.
Submitted 12 December, 2020; v1 submitted 15 November, 2020;
originally announced November 2020.
-
Black-box density function estimation using recursive partitioning
Authors:
Erik Bodin,
Zhenwen Dai,
Neill D. F. Campbell,
Carl Henrik Ek
Abstract:
We present a novel approach to Bayesian inference and general Bayesian computation that is defined through a sequential decision loop. Our method defines a recursive partitioning of the sample space. It neither relies on gradients nor requires any problem-specific tuning, and is asymptotically exact for any density function with a bounded domain. The output is an approximation to the whole density function including the normalisation constant, via partitions organised in efficient data structures. Such approximations may be used for evidence estimation or fast posterior sampling, but also as building blocks to treat a larger class of estimation problems. The algorithm shows competitive performance to recent state-of-the-art methods on synthetic and real-world problems including parameter inference for gravitational-wave physics.
Submitted 8 June, 2021; v1 submitted 26 October, 2020;
originally announced October 2020.
-
DiverseNet: When One Right Answer is not Enough
Authors:
Michael Firman,
Neill D. F. Campbell,
Lourdes Agapito,
Gabriel J. Brostow
Abstract:
Many structured prediction tasks in machine vision have a collection of acceptable answers, instead of one definitive ground truth answer. Segmentation of images, for example, is subject to human labeling bias. Similarly, there are multiple possible pixel values that could plausibly complete occluded image regions. State-of-the-art supervised learning methods are typically optimized to make a single test-time prediction for each query, failing to find other modes in the output space. Existing methods that allow for sampling often sacrifice speed or accuracy.
We introduce a simple method for training a neural network, which enables diverse structured predictions to be made for each test-time query. For a single input, we learn to predict a range of possible answers. We compare favorably to methods that seek diversity through an ensemble of networks. Such stochastic multiple choice learning faces mode collapse, where one or more ensemble members fail to receive any training signal. Our best performing solution can be deployed for various tasks, and just involves small modifications to the existing single-mode architecture, loss function, and training regime. We demonstrate that our method results in quantitative improvements across three challenging tasks: 2D image completion, 3D volume estimation, and flow prediction.
Submitted 24 August, 2020;
originally announced August 2020.
-
Controlled photophoretic levitation of nanostructured thin films for near-space flight
Authors:
Mohsen Azadi,
George A. Popov,
Zhipeng Lu,
Andy G. Eskenazi,
Ji Won Bang,
Matthew F. Campbell,
Howard Hu,
Igor Bargatin
Abstract:
We report light-driven levitation of macroscopic polymer films whose bottom surface is engineered to maximize the thermal accommodation coefficient. Specifically, we levitated centimeter-scale disks made of commercial 0.5-micron-thick mylar film coated with carbon nanotubes on one side. When illuminated with light intensity comparable to natural sunlight, the polymer disk heats up and interacts with incident gas molecules differently on the top and bottom sides, producing a net recoil force. This lift force is maximized at gas pressures corresponding to Knudsen number on the order of 0.3, and correspondingly, we observed the levitation of 0.6-cm-diameter disks in a vacuum chamber at pressures between 10 and 30 Pa. Moreover, we controlled the flight of the disks using a shaped beam that optically trapped the levitating disks. Our experimentally validated theoretical model predicts that the lift forces can be many times the weight of the films, allowing payloads of up to 10 milligrams for sunlight-powered low-cost microflyers in the upper atmosphere at altitudes of 50-100 km.
Submitted 13 May, 2020;
originally announced May 2020.
-
Field Evaluation of Column CO2 Retrievals from Intensity-Modulated Continuous-Wave Differential Absorption Lidar Measurements during ACT-America
Authors:
Joel F. Campbell,
Bing Lin,
Michael D. Obland,
Jeremy Dobler,
Wayne Erxleben,
Doug McGregor,
Chris O'Dell,
Emily Bell,
Sandip Pal,
Brad Weir,
Tai-Fang Fan,
Susan Kooi,
Abigail Corbett,
Kenneth Davis,
Iouli Gordon,
Roman Kochanov
Abstract:
We present an evaluation of airborne Intensity-Modulated Continuous-Wave (IM-CW) lidar measurements of atmospheric column CO2 mole fractions during the ACT-America project. This lidar system transmits online and offline wavelengths simultaneously on the 1.57111-µm CO2 absorption line, with each modulated wavelength using orthogonal swept frequency waveforms. After the spectral characteristics of this system were calibrated through short-path measurements, we used the HITRAN spectroscopic database to derive the average-column CO2 mixing ratio (XCO2) from the lidar measured optical depths. Based on in situ measurements of meteorological parameters and CO2 concentrations for calibration data, we demonstrate that our lidar CO2 measurements were consistent from season to season and had an absolute calibration error (standard deviation) of 0.80 ppm when compared to XCO2 values derived from in situ measurements. By using a 10-second or longer moving average, a long-term stability of 1 ppm or better was obtained. The estimated CO2 measurement precisions for 0.1-s, 1-s, 10-s, and 60-s averages were determined to be 3.4 ppm (0.84%), 1.2 ppm (0.30%), 0.43 ppm (0.10%), and 0.26 ppm (0.063%), respectively. These correspond to measurement signal-to-noise ratios of 120, 330, 950, and 1600, respectively. The drift in XCO2 over one hour of flight time was found to be below our detection limit of about 0.1 ppm. These analyses demonstrate that the measurement stability, precision and accuracy are all well below the thresholds needed to study synoptic-scale variations in atmospheric XCO2.
Submitted 24 March, 2020;
originally announced March 2020.
-
Compositional uncertainty in deep Gaussian processes
Authors:
Ivan Ustyuzhaninov,
Ieva Kazlauskaite,
Markus Kaiser,
Erik Bodin,
Neill D. F. Campbell,
Carl Henrik Ek
Abstract:
Gaussian processes (GPs) are nonparametric priors over functions. Fitting a GP implies computing a posterior distribution of functions consistent with the observed data. Similarly, deep Gaussian processes (DGPs) should allow us to compute a posterior distribution of compositions of multiple functions giving rise to the observations. However, exact Bayesian inference is intractable for DGPs, motivating the use of various approximations. We show that the application of simplifying mean-field assumptions across the hierarchy leads to the layers of a DGP collapsing to near-deterministic transformations. We argue that such an inference scheme is suboptimal, not taking advantage of the potential of the model to discover the compositional structure in the data. To address this issue, we examine alternative variational inference schemes allowing for dependencies across different layers and discuss their advantages and limitations.
Submitted 25 February, 2020; v1 submitted 17 September, 2019;
originally announced September 2019.
-
Modulating Surrogates for Bayesian Optimization
Authors:
Erik Bodin,
Markus Kaiser,
Ieva Kazlauskaite,
Zhenwen Dai,
Neill D. F. Campbell,
Carl Henrik Ek
Abstract:
Bayesian optimization (BO) methods often rely on the assumption that the objective function is well-behaved, but in practice, this is seldom true for real-world objectives even if noise-free observations can be collected. Common approaches, which try to model the objective as precisely as possible, often fail to make progress by spending too many evaluations modeling irrelevant details. We address this issue by proposing surrogate models that focus on the well-behaved structure in the objective function, which is informative for search, while ignoring detrimental structure that is challenging to model from few observations. First, we demonstrate that surrogate models with appropriate noise distributions can absorb challenging structures in the objective function by treating them as irreducible uncertainty. Secondly, we show that a latent Gaussian process is an excellent surrogate for this purpose, comparing with Gaussian processes with standard noise distributions. We perform numerous experiments on a range of BO benchmarks and find that our approach improves reliability and performance when faced with challenging objective functions.
Submitted 8 September, 2020; v1 submitted 26 June, 2019;
originally announced June 2019.
-
Monotonic Gaussian Process Flow
Authors:
Ivan Ustyuzhaninov,
Ieva Kazlauskaite,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
We propose a new framework for imposing monotonicity constraints in a Bayesian nonparametric setting based on numerical solutions of stochastic differential equations. We derive a nonparametric model of monotonic functions that allows for interpretable priors and principled quantification of hierarchical uncertainty. We demonstrate the efficacy of the proposed model by providing competitive results to other probabilistic monotonic models on a number of benchmark functions. In addition, we consider the utility of a monotonic random process as a part of a hierarchical probabilistic model; we examine the task of temporal alignment of time-series data where it is beneficial to use a monotonic random process in order to preserve the uncertainty in the temporal warpings.
Submitted 25 February, 2020; v1 submitted 30 May, 2019;
originally announced May 2019.
-
Gaussian Process Deep Belief Networks: A Smooth Generative Model of Shape with Uncertainty Propagation
Authors:
Alessandro Di Martino,
Erik Bodin,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
The shape of an object is an important characteristic for many vision problems such as segmentation, detection and tracking. Being independent of appearance, it is possible to generalize to a large range of objects from only small amounts of data. However, shapes represented as silhouette images are challenging to model due to complicated likelihood functions leading to intractable posteriors. In this paper we present a generative model of shapes which provides a low dimensional latent encoding which importantly resides on a smooth manifold with respect to the silhouette images. The proposed model propagates uncertainty in a principled manner allowing it to learn from small amounts of data and providing predictions with associated uncertainty. We provide experiments that show how our proposed model provides favorable quantitative results compared with the state-of-the-art while simultaneously providing a representation that resides on a low-dimensional interpretable manifold.
Submitted 13 December, 2018;
originally announced December 2018.
-
The GAN that Warped: Semantic Attribute Editing with Unpaired Data
Authors:
Garoe Dorta,
Sara Vicente,
Neill D. F. Campbell,
Ivor J. A. Simpson
Abstract:
Deep neural networks have recently been used to edit images with great success, in particular for faces. However, they are often limited to only being able to work at a restricted range of resolutions. Many methods are so flexible that face edits can often result in an unwanted loss of identity. This work proposes to learn how to perform semantic image edits through the application of smooth warp fields. Previous approaches that attempted to use warping for semantic edits required paired data, i.e. example images of the same subject with different semantic attributes. In contrast, we employ recent advances in Generative Adversarial Networks that allow our model to be trained with unpaired data. We demonstrate face editing at very high resolutions (4k images) with a single forward pass of a deep network at a lower resolution. We also show that our edits are substantially better at preserving the subject's identity. The robustness of our approach is demonstrated by showing plausible image editing results on the Cub200 birds dataset. To our knowledge this has not been previously accomplished, due to the challenging nature of the dataset.
Submitted 5 March, 2020; v1 submitted 30 November, 2018;
originally announced November 2018.
-
Sequence Alignment with Dirichlet Process Mixtures
Authors:
Ieva Kazlauskaite,
Ivan Ustyuzhaninov,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
We present a probabilistic model for unsupervised alignment of high-dimensional time-warped sequences based on the Dirichlet Process Mixture Model (DPMM). We follow the approach introduced in (Kazlauskaite, 2018) of simultaneously representing each data sequence as a composition of a true underlying function and a time-warping, both of which are modelled using Gaussian processes (GPs) (Rasmussen, 2005), and aligning the underlying functions using an unsupervised alignment method. In (Kazlauskaite, 2018) the alignment is performed using the GP latent variable model (GP-LVM) (Lawrence, 2005) as a model of sequences, while our main contribution is extending this approach to using DPMM, which allows us to align the sequences temporally and cluster them at the same time. We show that the DPMM achieves competitive results in comparison to the GP-LVM on synthetic and real-world data sets, and discuss the different properties of the estimated underlying functions and the time-warps favoured by these models.
Submitted 26 November, 2018;
originally announced November 2018.
-
DP-GP-LVM: A Bayesian Non-Parametric Model for Learning Multivariate Dependency Structures
Authors:
Andrew R. Lawrence,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
We present a non-parametric Bayesian latent variable model capable of learning dependency structures across dimensions in a multivariate setting. Our approach is based on flexible Gaussian process priors for the generative mappings and interchangeable Dirichlet process priors to learn the structure. The introduction of the Dirichlet process as a specific structural prior allows our model to circumvent issues associated with previous Gaussian process latent variable models. Inference is performed by deriving an efficient variational bound on the marginal log-likelihood of the model.
Submitted 12 July, 2018;
originally announced July 2018.
-
Training VAEs Under Structured Residuals
Authors:
Garoe Dorta,
Sara Vicente,
Lourdes Agapito,
Neill D. F. Campbell,
Ivor Simpson
Abstract:
Variational auto-encoders (VAEs) are a popular and powerful deep generative model. Previous works on VAEs have assumed a factorized likelihood model, whereby the output uncertainty of each pixel is assumed to be independent. This approximation is clearly limited, as demonstrated by observing a residual image from a VAE reconstruction, which often possesses a high level of structure. This paper demonstrates a novel scheme to incorporate a structured Gaussian likelihood prediction network within the VAE that allows the residual correlations to be modeled. Our novel architecture, with minimal increase in complexity, incorporates the covariance matrix prediction within the VAE. We also propose a new mechanism for allowing structured uncertainty on color images. Furthermore, we provide a scheme for effectively training this model, and include some suggestions for improving performance in terms of efficiency or modeling longer range correlations.
Submitted 31 July, 2018; v1 submitted 3 April, 2018;
originally announced April 2018.
-
Gaussian Process Latent Variable Alignment Learning
Authors:
Ieva Kazlauskaite,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
We present a model that can automatically learn alignments between high-dimensional data in an unsupervised manner. Our proposed method casts alignment learning in a framework where both alignment and data are modelled simultaneously. Further, we automatically infer groupings of different types of sequences within the same dataset. We derive a probabilistic model built on non-parametric priors that allows for flexible warps while at the same time providing means to specify interpretable constraints. We demonstrate the efficacy of our approach with superior quantitative performance to the state-of-the-art approaches and provide examples to illustrate the versatility of our model in automatic inference of sequence groupings, absent from previous approaches, as well as easy specification of high level priors for different modalities of data.
Submitted 1 March, 2019; v1 submitted 7 March, 2018;
originally announced March 2018.
-
Structured Uncertainty Prediction Networks
Authors:
Garoe Dorta,
Sara Vicente,
Lourdes Agapito,
Neill D. F. Campbell,
Ivor Simpson
Abstract:
This paper is the first work to propose a network to predict a structured uncertainty distribution for a synthesized image. Previous approaches have been mostly limited to predicting diagonal covariance matrices. Our novel model learns to predict a full Gaussian covariance matrix for each reconstruction, which permits efficient sampling and likelihood evaluation.
We demonstrate that our model can accurately reconstruct ground truth correlated residual distributions for synthetic datasets and generate plausible high frequency samples for real face images. We also illustrate the use of these predicted covariances for structure preserving image denoising.
Submitted 23 March, 2018; v1 submitted 20 February, 2018;
originally announced February 2018.
-
Nonparametric Inference for Auto-Encoding Variational Bayes
Authors:
Erik Bodin,
Iman Malik,
Carl Henrik Ek,
Neill D. F. Campbell
Abstract:
We would like to learn latent representations that are low-dimensional and highly interpretable. A model that has these characteristics is the Gaussian Process Latent Variable Model. The benefits and drawbacks of the GP-LVM are complementary to those of the Variational Autoencoder: the former provides interpretable low-dimensional latent representations while the latter is able to handle large amounts of data and can use non-Gaussian likelihoods. Our inspiration for this paper is to marry these two approaches and reap the benefits of both. In order to do so we will introduce a novel approximate inference scheme inspired by the GP-LVM and the VAE. We show experimentally that the approximation allows the capacity of the generative bottle-neck (Z) of the VAE to be arbitrarily large without losing a highly interpretable representation, so that reconstruction quality is not limited by Z, while a low-dimensional space can still be used for ancestral sampling and as a means to reason about the embedded data.
Submitted 18 December, 2017;
originally announced December 2017.
-
Latent Gaussian Process Regression
Authors:
Erik Bodin,
Neill D. F. Campbell,
Carl Henrik Ek
Abstract:
We introduce Latent Gaussian Process Regression which is a latent variable extension allowing modelling of non-stationary multi-modal processes using GPs. The approach is built on extending the input space of a regression problem with a latent variable that is used to modulate the covariance function over the training data. We show how our approach can be used to model multi-modal and non-stationary processes. We exemplify the approach on a set of synthetic data and provide results on real data from motion capture and geostatistics.
Submitted 16 September, 2017; v1 submitted 18 July, 2017;
originally announced July 2017.
-
Responsive Action-based Video Synthesis
Authors:
Corneliu Ilisescu,
Halil Aytac Kanaci,
Matteo Romagnoli,
Neill D. F. Campbell,
Gabriel J. Brostow
Abstract:
We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered, interactively. Like audio, video is easy to sample from the real world but hard to segment into clean reusable elements. Reusing a video clip means non-linear editing and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-end approach enables previewing and easy iteration.
We convert static-camera videos into loopable sequences, synthesizing them in response to simple end-user requests. This is hard because a) users want essentially semantic-level control over the synthesized video content, and b) automatic loop-finding is brittle and leaves users limited opportunity to work through problems. We propose a human-in-the-loop system where adding effort gives the user progressively more creative control. Artists help us evaluate how our trigger interfaces can be used for authoring of videos and video-performances.
Submitted 20 May, 2017;
originally announced May 2017.
-
Within Group Variable Selection through the Exclusive Lasso
Authors:
Frederick Campbell,
Genevera I. Allen
Abstract:
Many data sets consist of variables with an inherent group structure. The problem of group selection has been well studied, but in this paper, we seek to do the opposite: our goal is to select at least one variable from each group in the context of predictive regression modeling. This problem is NP-hard, but we study the tightest convex relaxation: a composite penalty that is a combination of the $\ell_1$ and $\ell_2$ norms. Our so-called Exclusive Lasso method performs structured variable selection by ensuring that at least one variable is selected from each group. We study our method's statistical properties and develop computationally scalable algorithms for fitting the Exclusive Lasso. We study the effectiveness of our method via simulations as well as using NMR spectroscopy data. Here, we use the Exclusive Lasso to select the appropriate chemical shift from a dictionary of possible chemical shifts for each molecule in the biological sample.
Submitted 27 May, 2015;
originally announced May 2015.
-
Hierarchical Subquery Evaluation for Active Learning on a Graph
Authors:
Oisin Mac Aodha,
Neill D. F. Campbell,
Jan Kautz,
Gabriel J. Brostow
Abstract:
To train good supervised and semi-supervised object classifiers, it is critical that we not waste the time of the human experts who are providing the training labels. Existing active learning strategies can have uneven performance, being efficient on some datasets but wasteful on others, or inconsistent just between runs on the same dataset. We propose perplexity based graph construction and a new hierarchical subquery evaluation algorithm to combat this variability, and to release the potential of Expected Error Reduction.
Under some specific circumstances, Expected Error Reduction has been one of the strongest-performing informativeness criteria for active learning. Until now, it has also been prohibitively costly to compute for sizeable datasets. We demonstrate our highly practical algorithm, comparing it to other active learning measures on classification datasets that vary in sparsity, dimensionality, and size. Our algorithm is consistent over multiple runs and achieves high accuracy, while querying the human expert for labels at a frequency that matches their desired time budget.
Submitted 30 April, 2015;
originally announced April 2015.
-
Design of a speed meter interferometer proof-of-principle experiment
Authors:
C. Gräf,
B. W. Barr,
A. S. Bell,
F. Campbell,
A. V. Cumming,
S. L. Danilishin,
N. A. Gordon,
G. D. Hammond,
J. Hennig,
E. A. Houston,
S. H. Huttner,
R. A. Jones,
S. S. Leavey,
H. Lück,
J. Macarthur,
M. Marwick,
S. Rigby,
R. Schilling,
B. Sorazu,
A. Spencer,
S. Steinlechner,
K. A. Strain,
S. Hild
Abstract:
The second generation of large scale interferometric gravitational wave detectors will be limited by quantum noise over a wide frequency range in their detection band. Further sensitivity improvements for future upgrades or new detectors beyond the second generation motivate the development of measurement schemes to mitigate the impact of quantum noise in these instruments. Two strands of development are being pursued to reach this goal, focusing both on modifications of the well-established Michelson detector configuration and development of different detector topologies. In this paper, we present the design of the world's first Sagnac speed meter interferometer which is currently being constructed at the University of Glasgow. With this proof-of-principle experiment we aim to demonstrate the theoretically predicted lower quantum noise in a Sagnac interferometer compared to an equivalent Michelson interferometer, to qualify Sagnac speed meters for further research towards an implementation in a future generation large scale gravitational wave detector, such as the planned Einstein Telescope observatory.
Submitted 11 September, 2014; v1 submitted 12 May, 2014;
originally announced May 2014.
-
Advanced sine wave modulation of continuous wave laser system for atmospheric CO2 differential absorption measurements
Authors:
Joel F. Campbell,
Bing Lin,
Amin R. Nehrir
Abstract:
In this theoretical study, modulation techniques are developed to support the Active Sensing of CO2 Emissions over Nights, Days, and Seasons (ASCENDS) mission. A CW lidar system using sine waves modulated by ML pseudo random noise codes is described for making simultaneous online/offline differential absorption measurements. Amplitude and Phase Shift Keying (PSK) modulated IM carriers, in addition to a hybrid pulse technique are investigated that exhibit optimal autocorrelation properties. A method is presented to bandwidth limit the ML sequence based on a filter implemented in terms of Jacobi theta functions that does not significantly degrade the resolution or introduce side lobes as a means of reducing aliasing and IM carrier bandwidth.
Submitted 4 January, 2014; v1 submitted 13 September, 2013;
originally announced September 2013.
-
Non-linear swept frequency technique for CO2 measurements using a CW laser system
Authors:
Joel F. Campbell
Abstract:
A system using non-linear multi-swept sine waves is described, which employs multi-channel, multi-swept orthogonal waves to separate channels and make multiple, simultaneous online/offline CO2 measurements. An analytic expression and a systematic method for determining the orthogonal frequencies for the unswept, linear swept and non-linear swept cases are presented. It is shown that one may reduce sidelobes of the autocorrelation function while preserving cross channel orthogonality, for thin cloud rejection.
Submitted 20 March, 2013;
originally announced March 2013.
-
A Low Cost Remote Sensing System Using PC and Stereo Equipment
Authors:
Joel F. Campbell,
Michael A. Flood,
Narasimha S. Prasad,
Wade D. Hodson
Abstract:
A system using a personal computer, speaker, and a microphone is used to detect objects, and make crude measurements using a carrier modulated by a pseudorandom noise (PN) code. This system can be constructed using a personal computer and audio equipment commonly found in the laboratory or at home, or more sophisticated equipment that can be purchased at reasonable cost. We demonstrate its value as an instructional tool for teaching concepts of remote sensing and digital signal processing.
Submitted 18 September, 2011; v1 submitted 10 June, 2011;
originally announced June 2011.
-
State-space based mass event-history model I: many decision-making agents with one target
Authors:
Hsieh Fushing,
Li Zhu,
David I. Shapiro-Ilan,
James F. Campbell,
Edwin E. Lewis
Abstract:
A dynamic decision-making system that includes a mass of indistinguishable agents could manifest impressive heterogeneity. This kind of nonhomogeneity is postulated to result from macroscopic behavioral tactics employed by almost all involved agents. A State-Space Based (SSB) mass event-history model is developed here to explore the potential existence of such macroscopic behaviors. By imposing an unobserved internal state-space variable into the system, each individual's event-history is made into a composition of a common state duration and an individual specific time to action. With the common state modeling of the macroscopic behavior, parametric statistical inferences are derived under the current-status data structure and conditional independence assumptions. Identifiability and computation related problems are also addressed. From the dynamic perspectives of system-wise heterogeneity, this SSB mass event-history model is shown to be very distinct from a random effect model via the Principal Component Analysis (PCA) in a numerical experiment. Real data showing the mass invasion by two species of parasitic nematode into two species of host larvae are also analyzed. The analysis results are not only found to be coherent in the context of the biology of the nematode as a parasite, but also include new quantitative interpretations.
Submitted 27 January, 2009;
originally announced January 2009.
-
Mid-Infrared Photometry and Spectra of Three High Mass Protostellar Candidates at IRAS 18151-1208 and IRAS 20343+4129
Authors:
M. F. Campbell,
T. K. Sridharan,
H. Beuther,
J. H. Lacy,
J. L. Hora,
Q. Zhu,
M. Kassis,
M. Saito,
J. M. De Buizer,
S. H. Fung,
L. C. Johnson
Abstract:
We present arcsecond-scale mid-ir photometry (in the 10.5 micron N band and at 24.8 microns), and low resolution spectra in the N band (R~100) of a candidate high mass protostellar object (HMPO) in IRAS 18151-1208 and of two HMPO candidates in IRAS 20343+4129, IRS 1 and IRS 3. In addition we present high resolution mid-ir spectra (R~80000) of the two HMPO candidates in IRAS 20343+4129. These data are fitted with simple models to estimate the masses of gas and dust associated with the mid-ir emitting clumps, the column densities of overlying absorbing dust and gas, the luminosities of the HMPO candidates, and the likely spectral type of the HMPO candidate for which [Ne II] 12.8 micron emission was detected (IRAS 20343+4129 IRS 3). We suggest that IRAS 18151-1208 is a pre-ultracompact HII region HMPO, IRAS 20343+4129 IRS 1 is an embedded young stellar object with the luminosity of a B3 star, and IRAS 20343+4129 IRS 3 is a B2 ZAMS star that has formed an ultracompact HII region and disrupted its natal envelope.
Submitted 19 October, 2007;
originally announced October 2007.