Search | arXiv e-print repository

Sparse Nystrom Approximation of Currents and Varifolds

Authors: Allen Paul, Neill Campbell, Tony Shardlow

Abstract: We derive an algorithm for compression of the currents and varifolds representations of shapes, using the Nystrom approximation in Reproducing Kernel Hilbert Spaces. Our method is faster than existing compression techniques, and comes with theoretical guarantees on the rate of convergence of the compressed approximation, as a function of the smoothness of the associated shape representation. The o… ▽ More We derive an algorithm for compression of the currents and varifolds representations of shapes, using the Nystrom approximation in Reproducing Kernel Hilbert Spaces. Our method is faster than existing compression techniques, and comes with theoretical guarantees on the rate of convergence of the compressed approximation, as a function of the smoothness of the associated shape representation. The obtained compression are shown to be useful for down-line tasks such as nonlinear shape registration in the Large Deformation Metric Map** (LDDMM) framework, even for very high compression ratios. The performance of our algorithm is demonstrated on large-scale shape data from modern geometry processing datasets, and is shown to be fast and scalable with rapid error decay. △ Less

Submitted 14 June, 2024; originally announced June 2024.

MSC Class: 65D18; 65D15; 68W20; 68W25

arXiv:2404.00172 [pdf, other]

Universal Bovine Identification via Depth Data and Deep Metric Learning

Authors: Asheesh Sharma, Lucy Randewich, William Andrew, Sion Hannuna, Neill Campbell, Siobhan Mullan, Andrew W. Dowsey, Melvyn Smith, Mark Hansen, Tilo Burghardt

Abstract: This paper proposes and evaluates, for the first time, a top-down (dorsal view), depth-only deep learning system for accurately identifying individual cattle and provides associated code, datasets, and training weights for immediate reproducibility. An increase in herd size skews the cow-to-human ratio at the farm and makes the manual monitoring of individuals more challenging. Therefore, real-tim… ▽ More This paper proposes and evaluates, for the first time, a top-down (dorsal view), depth-only deep learning system for accurately identifying individual cattle and provides associated code, datasets, and training weights for immediate reproducibility. An increase in herd size skews the cow-to-human ratio at the farm and makes the manual monitoring of individuals more challenging. Therefore, real-time cattle identification is essential for the farms and a crucial step towards precision livestock farming. Underpinned by our previous work, this paper introduces a deep-metric learning method for cattle identification using depth data from an off-the-shelf 3D camera. The method relies on CNN and MLP backbones that learn well-generalised embedding spaces from the body shape to differentiate individuals -- requiring neither species-specific coat patterns nor close-up muzzle prints for operation. The network embeddings are clustered using a simple algorithm such as $k$-NN for highly accurate identification, thus eliminating the need to retrain the network for enrolling new individuals. We evaluate two backbone architectures, ResNet, as previously used to identify Holstein Friesians using RGB images, and PointNet, which is specialised to operate on 3D point clouds. We also present CowDepth2023, a new dataset containing 21,490 synchronised colour-depth image pairs of 99 cows, to evaluate the backbones. Both ResNet and PointNet architectures, which consume depth maps and point clouds, respectively, led to high accuracy that is on par with the coat pattern-based backbone. △ Less

Submitted 29 March, 2024; originally announced April 2024.

Comments: LaTeX, 38 pages, 14 figures, 3 tables

arXiv:2403.16547 [pdf, other]

Frequency Comb Enhancement via the Self-Crystallization of Vectorial Cavity Solitons

Authors: Graeme Neil Campbell, Lewis Hill, Pascal Del'Haye, Gian-Luca Oppo

Abstract: Long range interactions between dark vectorial temporal cavity solitons are induced though the spontaneous symmetry breaking of orthogonally polarized fields in ring resonators. Turing patterns of alternating polarizations form between adjacent solitons, pushing them apart so that a random distribution of solitons along the cavity length reaches equal equilibrium distances. Enhancement of the freq… ▽ More Long range interactions between dark vectorial temporal cavity solitons are induced though the spontaneous symmetry breaking of orthogonally polarized fields in ring resonators. Turing patterns of alternating polarizations form between adjacent solitons, pushing them apart so that a random distribution of solitons along the cavity length reaches equal equilibrium distances. Enhancement of the frequency comb is achieved through the spontaneous formation of regularly spaced soliton crystals, 'self-crystallization', with greater power and spacing of the spectral lines for increasing soliton numbers. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2402.10673 [pdf, other]

Controlled light distribution with coupled microresonator chains via Kerr symmetry breaking

Authors: Alekhya Ghosh, Arghadeep Pal, Lewis Hill, Graeme N Campbell, Toby Bi, Yao**g Zhang, Abdullah Alabbadi, Shuangyou Zhang, Gian-Luca Oppo, Pascal Del'Haye

Abstract: Within optical microresonators, the Kerr interaction of photons can lead to symmetry breaking of optical modes. In a ring resonator, this leads to the interesting effect that light preferably circulates in one direction or in one polarization state. Applications of this effect range from chip-integrated optical diodes to nonlinear polarization controllers and optical gyroscopes. In this work, we s… ▽ More Within optical microresonators, the Kerr interaction of photons can lead to symmetry breaking of optical modes. In a ring resonator, this leads to the interesting effect that light preferably circulates in one direction or in one polarization state. Applications of this effect range from chip-integrated optical diodes to nonlinear polarization controllers and optical gyroscopes. In this work, we study Kerr-nonlinearity-induced symmetry breaking of light states in coupled resonator optical waveguides (CROWs). We discover a new type of controllable symmetry breaking that leads to emerging patterns of dark and bright resonators within the chains. Beyond stationary symmetry broken states, we observe periodic oscillations, switching and chaotic fluctuations of circulating powers in the resonators. Our findings are of interest for controlled multiplexing of light in photonic integrated circuits, neuromorphic computing, topological photonics and soliton frequency combs in coupled resonators. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: 15 pages, 11 figures

arXiv:2310.17432 [pdf, other]

Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models

Authors: Joseph Goodier, Neill D. F. Campbell

Abstract: Out-of-Distribution detection between dataset pairs has been extensively explored with generative models. We show that likelihood-based Out-of-Distribution detection can be extended to diffusion models by leveraging the fact that they, like other likelihood-based generative models, are dramatically affected by the input sample complexity. Currently, all Out-of-Distribution detection methods with D… ▽ More Out-of-Distribution detection between dataset pairs has been extensively explored with generative models. We show that likelihood-based Out-of-Distribution detection can be extended to diffusion models by leveraging the fact that they, like other likelihood-based generative models, are dramatically affected by the input sample complexity. Currently, all Out-of-Distribution detection methods with Diffusion Models are reconstruction-based. We propose a new likelihood ratio for Out-of-Distribution detection with Deep Denoising Diffusion Models, which we call the Complexity Corrected Likelihood Ratio. Our likelihood ratio is constructed using Evidence Lower-Bound evaluations from an individual model at various noising levels. We present results that are comparable to state-of-the-art Out-of-Distribution detection methods with generative models. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 9 pages (main paper), 3 pages (acknowledgements & references), 3 figures, 2 tables, 1 algorithm, work accepted for BMVC 2023

arXiv:2309.15478 [pdf, other]

The Robust Semantic Segmentation UNCV2023 Challenge Results

Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli , et al. (12 additional authors not shown)

Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q… ▽ More This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty quantification methodologies presented at prominent conferences in the fields of computer vision and machine learning and journals over the past few years. Within this document, the challenge is introduced, shedding light on its purpose and objectives, which primarily revolved around enhancing the robustness of semantic segmentation in urban scenes under varying natural adversarial conditions. The report then delves into the top-performing solutions. Moreover, the document aims to provide a comprehensive overview of the diverse solutions deployed by all participants. By doing so, it seeks to offer readers a deeper insight into the array of strategies that can be leveraged to effectively handle the inherent uncertainties associated with autonomous driving and semantic segmentation, especially within urban environments. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

arXiv:2308.13180 [pdf]

Electronic-grade epitaxial (111) KTaO3 heterostructures

Authors: Jieun Kim, Muqing Yu, Jung-Woo Lee, Shun-Li Shang, Gi-Yeop Kim, Pratap Pal, **sol Seo, Neil Campbell, Kitae Eom, Ranjani Ramachandran, Mark S. Rzchowski, Sang Ho Oh, Si-Young Choi, Zi-Kui Liu, Jeremy Levy, Chang-Beom Eom

Abstract: KTaO3 has recently attracted attention as a model system to study the interplay of quantum paraelectricity, spin-orbit coupling, and superconductivity. However, the high and low vapor pressures of potassium and tantalum present processing challenges to creating interfaces clean enough to reveal the intrinsic quantum properties. Here, we report superconducting heterostructures based on electronic-g… ▽ More KTaO3 has recently attracted attention as a model system to study the interplay of quantum paraelectricity, spin-orbit coupling, and superconductivity. However, the high and low vapor pressures of potassium and tantalum present processing challenges to creating interfaces clean enough to reveal the intrinsic quantum properties. Here, we report superconducting heterostructures based on electronic-grade epitaxial (111) KTaO3 thin films. Electrical and structural characterizations reveal that two-dimensional electron gas at the heterointerface between amorphous LaAlO3 and KTaO3 thin film exhibits significantly higher electron mobility, superconducting transition temperature and critical current density than those in bulk single crystal KTaO3-based heterostructures owing to cleaner interface in KTaO3 thin films. Our hybrid approach may enable epitaxial growth of other alkali metal-based oxides that lie beyond the capabilities of conventional methods. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2306.02946 [pdf, other]

Dark solitons in Fabry-Perot resonators with Kerr media and normal dispersion

Authors: Graeme Neil Campbell, Lewis Hill, Pascal Del'Haye, Gian-Luca Oppo

Abstract: Ranges of existence and stability of dark cavity-soliton stationary states in a Fabry-Perot resonator with a Kerr nonlinear medium and normal dispersion are determined. The Fabry-Perot configuration introduces nonlocal coupling that shifts the cavity detuning by the round trip average power of the intracavity field. When compared with ring resonators described by the Lugiato-Lefever equation, nonl… ▽ More Ranges of existence and stability of dark cavity-soliton stationary states in a Fabry-Perot resonator with a Kerr nonlinear medium and normal dispersion are determined. The Fabry-Perot configuration introduces nonlocal coupling that shifts the cavity detuning by the round trip average power of the intracavity field. When compared with ring resonators described by the Lugiato-Lefever equation, nonlocal coupling leads to strongly detuned dark cavity solitons that exist over a wide range of detunings. This shift is a consequence of the counterpropagation of intracavity fields inherent to Fabry-Perot resonators. At difference with ring resonators, the existence and stability of dark soliton solutions are dependent on the size and number of solitons in the cavity. We investigate the effect of nonlocal coupling of Fabry-Perot resonators on multiple dark solitons and demonstrate long range interactions and synchronization of temporal oscillations. △ Less

Submitted 5 June, 2023; originally announced June 2023.

arXiv:2210.14586 [pdf, other]

doi 10.1088/1361-6560/ace49a

Compressed Sensing MRI Reconstruction Regularized by VAEs with Structured Image Covariance

Authors: Margaret Duff, Ivor J. A. Simpson, Matthias J. Ehrhardt, Neill D. F. Campbell

Abstract: Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned regularization will provide complex data-driven priors to inverse problems while still retaining the control and insight of a variational regularization method. M… ▽ More Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned regularization will provide complex data-driven priors to inverse problems while still retaining the control and insight of a variational regularization method. Moreover, unsupervised learning, without paired training data, allows the learned regularizer to remain flexible to changes in the forward problem such as noise level, sampling pattern or coil sensitivities in MRI. Approach: We utilize variational autoencoders (VAEs) that generate not only an image but also a covariance uncertainty matrix for each image. The covariance can model changing uncertainty dependencies caused by structure in the image, such as edges or objects, and provides a new distance metric from the manifold of learned images. Main results: We evaluate these novel generative regularizers on retrospectively sub-sampled real-valued MRI measurements from the fastMRI dataset. We compare our proposed learned regularization against other unlearned regularization approaches and unsupervised and supervised deep learning methods. Significance: Our results show that the proposed method is competitive with other state-of-the-art methods and behaves consistently with changing sampling patterns and noise levels. △ Less

Submitted 16 June, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

Journal ref: Phys. Med. Biol. 68 16500 (2023)

arXiv:2210.13231 [pdf, other]

Analysing Training-Data Leakage from Gradients through Linear Systems and Gradient Matching

Authors: Cangxiong Chen, Neill D. F. Campbell

Abstract: Recent works have demonstrated that it is possible to reconstruct training images and their labels from gradients of an image-classification model when its architecture is known. Unfortunately, there is still an incomplete theoretical understanding of the efficacy and failure of these gradient-leakage attacks. In this paper, we propose a novel framework to analyse training-data leakage from gradie… ▽ More Recent works have demonstrated that it is possible to reconstruct training images and their labels from gradients of an image-classification model when its architecture is known. Unfortunately, there is still an incomplete theoretical understanding of the efficacy and failure of these gradient-leakage attacks. In this paper, we propose a novel framework to analyse training-data leakage from gradients that draws insights from both analytic and optimisation-based gradient-leakage attacks. We formulate the reconstruction problem as solving a linear system from each layer iteratively, accompanied by corrections using gradient matching. Under this framework, we claim that the solubility of the reconstruction problem is primarily determined by that of the linear system at each layer. As a result, we are able to partially attribute the leakage of the training data in a deep network to its architecture. We also propose a metric to measure the level of security of a deep learning model against gradient-based attacks on the training data. △ Less

Submitted 20 October, 2022; originally announced October 2022.

Comments: To appear at the 33rd British Machine Vision Conference 2022

arXiv:2204.10905 [pdf, other]

Label a Herd in Minutes: Individual Holstein-Friesian Cattle Identification

Authors: **g Gao, Tilo Burghardt, Neill W. Campbell

Abstract: We describe a practically evaluated approach for training visual cattle ID systems for a whole farm requiring only ten minutes of labelling effort. In particular, for the task of automatic identification of individual Holstein-Friesians in real-world farm CCTV, we show that self-supervision, metric learning, cluster analysis, and active learning can complement each other to significantly reduce th… ▽ More We describe a practically evaluated approach for training visual cattle ID systems for a whole farm requiring only ten minutes of labelling effort. In particular, for the task of automatic identification of individual Holstein-Friesians in real-world farm CCTV, we show that self-supervision, metric learning, cluster analysis, and active learning can complement each other to significantly reduce the annotation requirements usually needed to train cattle identification frameworks. Evaluating the approach on the test portion of the publicly available Cows2021 dataset, for training we use 23,350 frames across 435 single individual tracklets generated by automated oriented cattle detection and tracking in operational farm footage. Self-supervised metric learning is first employed to initialise a candidate identity space where each tracklet is considered a distinct entity. Grou** entities into equivalence classes representing cattle identities is then performed by automated merging via cluster analysis and active learning. Critically, we identify the inflection point at which automated choices cannot replicate improvements based on human intervention to reduce annotation to a minimum. Experimental results show that cluster analysis and a few minutes of labelling after automated self-supervision can improve the test identification accuracy of 153 identities to 92.44% (ARI=0.93) from the 74.9% (ARI=0.754) obtained by self-supervision only. These promising results indicate that a tailored combination of human and machine reasoning in visual cattle ID pipelines can be highly effective whilst requiring only minimal labelling effort. We provide all key source code and network weights with this paper for easy result reproduction. △ Less

Submitted 22 April, 2022; originally announced April 2022.

Comments: ICIAP Workshop on Learning in Precision Livestock Farming (accepted). 10 pages, 7 figures

arXiv:2203.15485 [pdf, other]

Learning Structured Gaussians to Approximate Deep Ensembles

Authors: Ivor J. A. Simpson, Sara Vicente, Neill D. F. Campbell

Abstract: This paper proposes using a sparse-structured multivariate Gaussian to provide a closed-form approximator for the output of probabilistic ensemble models used for dense image prediction tasks. This is achieved through a convolutional neural network that predicts the mean and covariance of the distribution, where the inverse covariance is parameterised by a sparsely structured Cholesky matrix. Simi… ▽ More This paper proposes using a sparse-structured multivariate Gaussian to provide a closed-form approximator for the output of probabilistic ensemble models used for dense image prediction tasks. This is achieved through a convolutional neural network that predicts the mean and covariance of the distribution, where the inverse covariance is parameterised by a sparsely structured Cholesky matrix. Similarly to distillation approaches, our single network is trained to maximise the probability of samples from pre-trained probabilistic models, in this work we use a fixed ensemble of networks. Once trained, our compact representation can be used to efficiently draw spatially correlated samples from the approximated output distribution. Importantly, this approach captures the uncertainty and structured correlations in the predictions explicitly in a formal distribution, rather than implicitly through sampling alone. This allows direct introspection of the model, enabling visualisation of the learned structure. Moreover, this formulation provides two further benefits: estimation of a sample probability, and the introduction of arbitrary spatial conditioning at test time. We demonstrate the merits of our approach on monocular depth estimation and show that the advantages of our approach are obtained with comparable quantitative performance. △ Less

Submitted 29 March, 2022; originally announced March 2022.

Comments: Accepted at CVPR 2022

arXiv:2203.14375 [pdf, other]

Switching fronts, plateaus and Kerr oscillations of counterpropagating light in ring resonators

Authors: Graeme N. Campbell, Shuangyou Zhang, Leonardo Del Bino, Pascal Del'Haye, Gian-Luca Oppo

Abstract: We characterise stationary fronts and dark solitons for counterpropagating waves in micro-ring and fibre resonators with two input fields, normal dispersion and nonlocal coupling. These features are different from those in systems with local coupling in that their existence and stability are due to a careful balance of the areas of offset from homogeneous solutions. When scanning one of the two ca… ▽ More We characterise stationary fronts and dark solitons for counterpropagating waves in micro-ring and fibre resonators with two input fields, normal dispersion and nonlocal coupling. These features are different from those in systems with local coupling in that their existence and stability are due to a careful balance of the areas of offset from homogeneous solutions. When scanning one of the two cavity detunings, stable solutions formed plateaus separated by two fronts are present in one of the counter-propagating fields with the power of the other field being homogeneous. Two front plateau solutions have a one-to-one correspondence to solutions of a Lugiato-Lefever equation at the unique Maxwell point. By defining effective detunings and for fixed values of the input powers where the fronts are found, we determine expressions for both the Maxwell point and the distance of the stable fronts as functions of detunings and input powers of both fields in good agreement with numerical simulations. For certain values of the detunings we find multi-stable states of plateaus with fronts, oscillating homogeneous states and non-oscillating homogeneous states of the counter-propagating fields. Robustness and parameter ranges of these unusual dynamical states coexisting with stable non-homogeneous front solutions are provided. △ Less

Submitted 27 March, 2022; originally announced March 2022.

arXiv:2111.10350 [pdf, other]

doi 10.32374/atom.2020.2.4

Original Research By Young Twinkle Students (ORBYTS): Ephemeris Refinement of Transiting Exoplanets III

Authors: Billy Edwards, Cynthia S. K. Ho, Hannah L. M. Osborne, Nabeeha Deen, Ellie Hathorn, Solomon Johnson, Jiya Patel, Varun Vogireddy, Ansh Waddon, Ayuub Ahmed, Muhammad Bham, Nathan Campbell, Zahra Chummun, Nicholas Crossley, Reggie Dunsdon, Robert Hayes, Haroon Malik, Frank Marsden, Lois Mayfield, Liston Mitchell, Agnes Prosser, Valentina Rabrenovic, Emma Smith, Rico Thomas, Anastasia Kokori , et al. (4 additional authors not shown)

Abstract: We report photometric follow-up observations of thirteen exoplanets (HATS-1 b, HATS2 b, HATS-3 b, HAT-P-18 b, HAT-P-27 b, HAT-P-30 b, HAT-P-55 b, KELT-4A b, WASP-25 b, WASP-42 b, WASP-57 b, WASP-61 b and WASP-123 b), as part of the Original Research By Young Twinkle Students (ORBYTS) programme. All these planets are potentially viable targets for atmospheric characterisation and our data, which we… ▽ More We report photometric follow-up observations of thirteen exoplanets (HATS-1 b, HATS2 b, HATS-3 b, HAT-P-18 b, HAT-P-27 b, HAT-P-30 b, HAT-P-55 b, KELT-4A b, WASP-25 b, WASP-42 b, WASP-57 b, WASP-61 b and WASP-123 b), as part of the Original Research By Young Twinkle Students (ORBYTS) programme. All these planets are potentially viable targets for atmospheric characterisation and our data, which were taken using the LCOGT network of ground-based telescopes, will be combined with observations from other users of ExoClock to ensure that the transit times of these planets continue to be well-known, far into the future. △ Less

Submitted 20 July, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

Comments: Accepted for publication in the Astronomy Theory, Observations and Methods Journal. Secondary school students (16-17 y/o) performed the majority of the analysis, as well as writing much of the paper, as part of the ORBYTS programme

Journal ref: Astronomy Theory, Observations and Methods Journal, Vol. 2, No. 1, August 2021

arXiv:2111.10178 [pdf, other]

Understanding Training-Data Leakage from Gradients in Neural Networks for Image Classification

Authors: Cangxiong Chen, Neill D. F. Campbell

Abstract: Federated learning of deep learning models for supervised tasks, e.g. image classification and segmentation, has found many applications: for example in human-in-the-loop tasks such as film post-production where it enables sharing of domain expertise of human artists in an efficient and effective fashion. In many such applications, we need to protect the training data from being leaked when gradie… ▽ More Federated learning of deep learning models for supervised tasks, e.g. image classification and segmentation, has found many applications: for example in human-in-the-loop tasks such as film post-production where it enables sharing of domain expertise of human artists in an efficient and effective fashion. In many such applications, we need to protect the training data from being leaked when gradients are shared in the training process due to IP or privacy concerns. Recent works have demonstrated that it is possible to reconstruct the training data from gradients for an image-classification model when its architecture is known. However, there is still an incomplete theoretical understanding of the efficacy and failure of such attacks. In this paper, we analyse the source of training-data leakage from gradients. We formulate the problem of training data reconstruction as solving an optimisation problem iteratively for each layer. The layer-wise objective function is primarily defined by weights and gradients from the current layer as well as the output from the reconstruction of the subsequent layer, but it might also involve a 'pull-back' constraint from the preceding layer. Training data can be reconstructed when we solve the problem backward from the output of the network through each layer. Based on this formulation, we are able to attribute the potential leakage of the training data in a deep network to its architecture. We also propose a metric to measure the level of security of a deep learning model against gradient-based attacks on the training data. △ Less

Submitted 19 November, 2021; originally announced November 2021.

arXiv:2110.15761 [pdf, other]

Aligned Multi-Task Gaussian Process

Authors: Olga Mikheeva, Ieva Kazlauskaite, Adam Hartshorne, Hedvig Kjellström, Carl Henrik Ek, Neill D. F. Campbell

Abstract: Multi-task learning requires accurate identification of the correlations between tasks. In real-world time-series, tasks are rarely perfectly temporally aligned; traditional multi-task models do not account for this and subsequent errors in correlation estimation will result in poor predictive performance and uncertainty quantification. We introduce a method that automatically accounts for tempora… ▽ More Multi-task learning requires accurate identification of the correlations between tasks. In real-world time-series, tasks are rarely perfectly temporally aligned; traditional multi-task models do not account for this and subsequent errors in correlation estimation will result in poor predictive performance and uncertainty quantification. We introduce a method that automatically accounts for temporal misalignment in a unified generative model that improves predictive performance. Our method uses Gaussian processes (GPs) to model the correlations both within and between the tasks. Building on the previous work by Kazlauskaiteet al. [2019], we include a separate monotonic warp of the input data to model temporal misalignment. In contrast to previous work, we formulate a lower bound that accounts for uncertainty in both the estimates of the war** process and the underlying functions. Also, our new take on a monotonic stochastic process, with efficient path-wise sampling for the warp functions, allows us to perform full Bayesian inference in the model rather than MAP estimates. Missing data experiments, on synthetic and real time-series, demonstrate the advantages of accounting for misalignments (vs standard unaligned method) as well as modelling the uncertainty in the war** process(vs baseline MAP alignment approach). △ Less

Submitted 29 October, 2021; originally announced October 2021.

arXiv:2110.03640 [pdf]

Local atomic configuration control of superconductivity in the undoped pnictide parent compound BaFe2As2

Authors: Jong-Hoon Kang, Philip J. Ryan, Jong-Woo Kim, Jonathon Schad, Jacob P. Podkaminer, Neil Campbell, Joseph Suttle, Tae Heon Kim, Liang Luo, Di Cheng, Yesusa G. Collantes, Eric E. Hellstrom, Jigang Wang, Robert McDermott, Mark S. Rzchowski, Chang-Beom Eom

Abstract: Emergent superconductivity is strongly correlated with the symmetry of local atomic configuration in the parent compounds of iron-based superconductors. While chemical do** or hydrostatic pressure can change the local geometry, these conventional approaches do not provide a clear pathway in tuning the detailed atomic arrangement predictably, due to the parent compounds complicated structural def… ▽ More Emergent superconductivity is strongly correlated with the symmetry of local atomic configuration in the parent compounds of iron-based superconductors. While chemical do** or hydrostatic pressure can change the local geometry, these conventional approaches do not provide a clear pathway in tuning the detailed atomic arrangement predictably, due to the parent compounds complicated structural deformation in the presence of the tetragonal-to-orthorhombic phase transition. Here, we demonstrate a systematic approach to manipulate the local structural configurations in BaFe2As2 epitaxial thin films by controlling two independent structural factors orthorhombicity (in-plane anisotropy) and tetragonality (out-of-plane/in-plane balance) from lattice parameters. We tune superconductivity without chemical do** utilizing both structural factors separately, controlling local tetrahedral coordination in designed thin film heterostructures with substrate clam** and bi-axial strain. We further show that this allows quantitative control of both the structural phase transition, associated magnetism, and superconductivity in the parent material BaFe2As2. This approach will advance the development of tunable thin film superconductors in reduced dimension. △ Less

Submitted 7 October, 2021; originally announced October 2021.

arXiv:2110.02305 [pdf]

Oxide two-dimensional electron gas with high mobility at room-temperature

Authors: Kitae Eom, Hanjong Paik, **sol Seo, Neil Campbell, Evgeny Y. Tsymbal, Sang Ho Oh, Mark Rzchowski, Darrell G. Schlom, Chang-beom Eom

Abstract: The prospect of 2-dimensional electron gases (2DEGs) possessing high mobility at room temperature in wide-bandgap perovskite stannates is enticing for oxide electronics, particularly to realize transparent and high-electron mobility transistors. Nonetheless only a small number of studies to date report 2DEGs in BaSnO3-based heterostructures. Here, we report 2DEG formation at the LaScO3/BaSnO3 (LSO… ▽ More The prospect of 2-dimensional electron gases (2DEGs) possessing high mobility at room temperature in wide-bandgap perovskite stannates is enticing for oxide electronics, particularly to realize transparent and high-electron mobility transistors. Nonetheless only a small number of studies to date report 2DEGs in BaSnO3-based heterostructures. Here, we report 2DEG formation at the LaScO3/BaSnO3 (LSO/BSO) interface with a room-temperature mobility of 60 cm2/V s at a carrier concentration of 1.7x1013 cm-2. This is an order of magnitude higher mobility at room temperature than achieved in SrTiO3-based 2DEGs. We achieved this by combining a thick BSO buffer layer with an ex-situ high-temperature treatment, which not only reduces the dislocation density but also produces a SnO2-terminated atomically flat surface, followed by the growth of an overlying BSO/LSO interface. Using weak-beam dark field imaging and in-line electron holography technique, we reveal a reduction of the threading dislocation density, and provide direct evidence for the spatial confinement of a 2DEG at the BSO/LSO interface. Our work opens a new pathway to explore the exciting physics of stannate-based 2DEGs at application-relevant temperatures for oxide nanoelectronics. △ Less

Submitted 5 October, 2021; originally announced October 2021.

Comments: 21 pages, 5 figures

arXiv:2107.11191 [pdf, other]

Regularising Inverse Problems with Generative Machine Learning Models

Authors: Margaret Duff, Neill D. F. Campbell, Matthias J. Ehrhardt

Abstract: Deep neural network approaches to inverse imaging problems have produced impressive results in the last few years. In this paper, we consider the use of generative models in a variational regularisation approach to inverse problems. The considered regularisers penalise images that are far from the range of a generative model that has learned to produce images similar to a training dataset. We name… ▽ More Deep neural network approaches to inverse imaging problems have produced impressive results in the last few years. In this paper, we consider the use of generative models in a variational regularisation approach to inverse problems. The considered regularisers penalise images that are far from the range of a generative model that has learned to produce images similar to a training dataset. We name this family \textit{generative regularisers}. The success of generative regularisers depends on the quality of the generative model and so we propose a set of desired criteria to assess generative models and guide future research. In our numerical experiments, we evaluate three common generative models, autoencoders, variational autoencoders and generative adversarial networks, against our desired criteria. We also test three different generative regularisers on the inverse problems of deblurring, deconvolution, and tomography. We show that restricting solutions of the inverse problem to lie exactly in the range of a generative model can give good results but that allowing small deviations from the range of the generator produces more consistent results. △ Less

Submitted 18 June, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

arXiv:2105.12753 [pdf, other]

doi 10.3847/1538-4357/abec74

Extended X-ray emission associated with the radio lobes and the environments of 60 radio galaxies

Authors: Ajay Gill, Michelle M. Boyce, Christopher P. O'Dea, Stefi A. Baum, Preeti Kharb, Neil Campbell, Grant R. Tremblay, Suman Kundu

Abstract: This paper studied the faint, diffuse extended X-ray emission associated with the radio lobes and the hot gas in the intracluster medium (ICM) environment for a sample of radio galaxies. We used shallow ($\sim 10$ ks) archival Chandra observations for 60 radio galaxies (7 FR I and 53 FR II) with $0.0222 \le z \le 1.785$ selected from the 298 extragalactic radio sources identified in the 3CR catalo… ▽ More This paper studied the faint, diffuse extended X-ray emission associated with the radio lobes and the hot gas in the intracluster medium (ICM) environment for a sample of radio galaxies. We used shallow ($\sim 10$ ks) archival Chandra observations for 60 radio galaxies (7 FR I and 53 FR II) with $0.0222 \le z \le 1.785$ selected from the 298 extragalactic radio sources identified in the 3CR catalog. We used Bayesian statistics to look for any asymmetry in the extended X-ray emission between regions that contain the radio lobes and regions that contain the hot gas in the ICM. In the Chandra broadband ($0.5 - 7.0$ keV), which has the highest detected X-ray flux and the highest signal-to-noise ratio, we found that the non-thermal X-ray emission from the radio lobes dominates the thermal X-ray emission from the environment for $\sim 77\%$ of the sources in our sample. We also found that the relative amount of on-jet axis non-thermal emission from the radio lobes tends to increase with redshift compared to the off-jet axis thermal emission from the environment. This suggests that the dominant X-ray mechanism for the non-thermal X-ray emission in the radio lobes is due to the inverse Compton upscattering of cosmic microwave background (CMB) seed photons by relativistic electrons in the radio lobes, a process for which the observed flux is roughly redshift independent due to the increasing CMB energy density with increasing redshift. △ Less

Submitted 26 May, 2021; originally announced May 2021.

Comments: 19 pages, 18 figures

Journal ref: The Astrophysical Journal, Volume 912, Number 2, pp 19, 2021

arXiv:2105.01938 [pdf, other]

Towards Self-Supervision for Video Identification of Individual Holstein-Friesian Cattle: The Cows2021 Dataset

Authors: **g Gao, Tilo Burghardt, William Andrew, Andrew W. Dowsey, Neill W. Campbell

Abstract: In this paper we publish the largest identity-annotated Holstein-Friesian cattle dataset Cows2021 and a first self-supervision framework for video identification of individual animals. The dataset contains 10,402 RGB images with labels for localisation and identity as well as 301 videos from the same herd. The data shows top-down in-barn imagery, which captures the breed's individually distinctive… ▽ More In this paper we publish the largest identity-annotated Holstein-Friesian cattle dataset Cows2021 and a first self-supervision framework for video identification of individual animals. The dataset contains 10,402 RGB images with labels for localisation and identity as well as 301 videos from the same herd. The data shows top-down in-barn imagery, which captures the breed's individually distinctive black and white coat pattern. Motivated by the labelling burden involved in constructing visual cattle identification systems, we propose exploiting the temporal coat pattern appearance across videos as a self-supervision signal for animal identity learning. Using an individual-agnostic cattle detector that yields oriented bounding-boxes, rotation-normalised tracklets of individuals are formed via tracking-by-detection and enriched via augmentations. This produces a `positive' sample set per tracklet, which is paired against a `negative' set sampled from random cattle of other videos. Frame-triplet contrastive learning is then employed to construct a metric latent space. The fitting of a Gaussian Mixture Model to this space yields a cattle identity classifier. Results show an accuracy of Top-1 57.0% and Top-4: 76.9% and an Adjusted Rand Index: 0.53 compared to the ground truth. Whilst supervised training surpasses this benchmark by a large margin, we conclude that self-supervision can nevertheless play a highly effective role in speeding up labelling efforts when initially constructing supervision information. We provide all data and full source code alongside an analysis and evaluation of the system. △ Less

Submitted 5 May, 2021; originally announced May 2021.

Comments: 6 pages, 8 figures, 1 table, dataset will be available, code will be available

arXiv:2010.13632 [pdf, other]

Black-box density function estimation using recursive partitioning

Authors: Erik Bodin, Zhenwen Dai, Neill D. F. Campbell, Carl Henrik Ek

Abstract: We present a novel approach to Bayesian inference and general Bayesian computation that is defined through a sequential decision loop. Our method defines a recursive partitioning of the sample space. It neither relies on gradients nor requires any problem-specific tuning, and is asymptotically exact for any density function with a bounded domain. The output is an approximation to the whole density… ▽ More We present a novel approach to Bayesian inference and general Bayesian computation that is defined through a sequential decision loop. Our method defines a recursive partitioning of the sample space. It neither relies on gradients nor requires any problem-specific tuning, and is asymptotically exact for any density function with a bounded domain. The output is an approximation to the whole density function including the normalisation constant, via partitions organised in efficient data structures. Such approximations may be used for evidence estimation or fast posterior sampling, but also as building blocks to treat a larger class of estimation problems. The algorithm shows competitive performance to recent state-of-the-art methods on synthetic and real-world problems including parameter inference for gravitational-wave physics. △ Less

Submitted 8 June, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

Comments: International Conference on Machine Learning (ICML) 2021

arXiv:2009.09453 [pdf]

Route to in situ synthesis of epitaxial Pr2Ir2O7 thin films guided by thermodynamic calculations

Authors: Lu Guo, Shun-Li Shang, Neil Campbell, Mark Rzchowski, Zi-Kui Liu, Chang-Beom Eom, +These two authors equally contributed to this work

Abstract: In situ growth of pyrochlore iridate thin films has been a long-standing challenge due to the low reactivity of Ir at low temperatures and the vaporization of volatile gas species such as IrO3(g) and IrO2(g) at high temperatures and high oxygen partial pressures. To address this challenge, we combine thermodynamic analysis of the Pr-Ir-O2 system with experimental results from the conventional phys… ▽ More In situ growth of pyrochlore iridate thin films has been a long-standing challenge due to the low reactivity of Ir at low temperatures and the vaporization of volatile gas species such as IrO3(g) and IrO2(g) at high temperatures and high oxygen partial pressures. To address this challenge, we combine thermodynamic analysis of the Pr-Ir-O2 system with experimental results from the conventional physical vapor deposition (PVD) technique of co-sputtering. Our results indicate that only high growth temperatures yield films with crystallinity sufficient for utilizing and tailoring the desired topological electronic properties. Thermodynamic calculations indicate that high deposition temperatures and high partial pressures of gas species O2(g) and IrO3(g), are required to stabilize Pr2Ir2O7. We further find that the gas species partial pressure requirements are beyond that achievable by any conventional PVD technique. We experimentally show that conventional PVD growth parameters produce exclusively Pr3IrO7, which conclusion we reproduce with theoretical calculations. Our findings provide solid evidence that in situ synthesis of Pr2Ir2O7 thin films is fettered by the inability to grow with oxygen partial pressure on the order of 10 Torr, a limitation inherent to the PVD process. Thus, we suggest high-pressure techniques, in particular chemical vapor deposition (CVD), as a route to synthesis of Pr2Ir2O7, as this can support thin film deposition under the high pressure needed for in situ stabilization of Pr2Ir2O7. △ Less

Submitted 22 September, 2020; v1 submitted 20 September, 2020; originally announced September 2020.

arXiv:2008.10634 [pdf, other]

DiverseNet: When One Right Answer is not Enough

Authors: Michael Firman, Neill D. F. Campbell, Lourdes Agapito, Gabriel J. Brostow

Abstract: Many structured prediction tasks in machine vision have a collection of acceptable answers, instead of one definitive ground truth answer. Segmentation of images, for example, is subject to human labeling bias. Similarly, there are multiple possible pixel values that could plausibly complete occluded image regions. State-of-the art supervised learning methods are typically optimized to make a sing… ▽ More Many structured prediction tasks in machine vision have a collection of acceptable answers, instead of one definitive ground truth answer. Segmentation of images, for example, is subject to human labeling bias. Similarly, there are multiple possible pixel values that could plausibly complete occluded image regions. State-of-the art supervised learning methods are typically optimized to make a single test-time prediction for each query, failing to find other modes in the output space. Existing methods that allow for sampling often sacrifice speed or accuracy. We introduce a simple method for training a neural network, which enables diverse structured predictions to be made for each test-time query. For a single input, we learn to predict a range of possible answers. We compare favorably to methods that seek diversity through an ensemble of networks. Such stochastic multiple choice learning faces mode collapse, where one or more ensemble members fail to receive any training signal. Our best performing solution can be deployed for various tasks, and just involves small modifications to the existing single-mode architecture, loss function, and training regime. We demonstrate that our method results in quantitative improvements across three challenging tasks: 2D image completion, 3D volume estimation, and flow prediction. △ Less

Submitted 24 August, 2020; originally announced August 2020.

Comments: Presented at CVPR 2018

arXiv:2006.09205 [pdf, other]

doi 10.1016/j.compag.2021.106133

Visual Identification of Individual Holstein-Friesian Cattle via Deep Metric Learning

Authors: William Andrew, **g Gao, Siobhan Mullan, Neill Campbell, Andrew W Dowsey, Tilo Burghardt

Abstract: Holstein-Friesian cattle exhibit individually-characteristic black and white coat patterns visually akin to those arising from Turing's reaction-diffusion systems. This work takes advantage of these natural markings in order to automate visual detection and biometric identification of individual Holstein-Friesians via convolutional neural networks and deep metric learning techniques. Existing appr… ▽ More Holstein-Friesian cattle exhibit individually-characteristic black and white coat patterns visually akin to those arising from Turing's reaction-diffusion systems. This work takes advantage of these natural markings in order to automate visual detection and biometric identification of individual Holstein-Friesians via convolutional neural networks and deep metric learning techniques. Existing approaches rely on markings, tags or wearables with a variety of maintenance requirements, whereas we present a totally hands-off method for the automated detection, localisation, and identification of individual animals from overhead imaging in an open herd setting, i.e. where new additions to the herd are identified without re-training. We propose the use of SoftMax-based reciprocal triplet loss to address the identification problem and evaluate the techniques in detail against fixed herd paradigms. We find that deep metric learning systems show strong performance even when many cattle unseen during system training are to be identified and re-identified -- achieving 93.8% accuracy when trained on just half of the population. This work paves the way for facilitating the non-intrusive monitoring of cattle applicable to precision farming and surveillance for automated productivity, health and welfare monitoring, and to veterinary research such as behavioural analysis, disease outbreak tracing, and more. Key parts of the source code, network weights and datasets are available publicly. △ Less

Submitted 14 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

Comments: 41 pages, 18 figures, 2 tables; Submitted to Computers and Electronics in Agriculture ; Source code and network weights available at https://github.com/CWOA/MetricLearningIdentification ; OpenCows2020 dataset available at https://doi.org/10.5523/bris.10m32xl88x2b61zlkkgz3fml17

ACM Class: I.4.8; I.4.9

Journal ref: Computers and Electronics in Agriculture 185, 106133 (2021)

arXiv:2001.00277 [pdf]

doi 10.1073/pnas.2001123117

Superconductivity in Undoped BaFe2As2 by Tetrahedral Geometry Design

Authors: J. H. Kang, J. -W. Kim, P. J. Ryan, L. Xie, L. Guo, C. Sundahl, J. Schad, N. Campbell, Y. G. Collantes, E. E. Hellstrom, M. S. Rzchowski, C. B. Eom

Abstract: Fe-based superconductors exhibit a diverse interplay between charge, orbital, and magnetic ordering1-4. Variations in atomic geometry affect electron hop** between Fe atoms5,6 and the Fermi surface topology, influencing magnetic frustration and the pairing mechanism through changes of orbital overlap and occupancies7-11. Here, we experimentally demonstrate a systematic approach to realize superc… ▽ More Fe-based superconductors exhibit a diverse interplay between charge, orbital, and magnetic ordering1-4. Variations in atomic geometry affect electron hop** between Fe atoms5,6 and the Fermi surface topology, influencing magnetic frustration and the pairing mechanism through changes of orbital overlap and occupancies7-11. Here, we experimentally demonstrate a systematic approach to realize superconductivity without chemical do** in BaFe2As2, employing geometric design within an epitaxial heterostructure. We control both tetragonality and orthorhombicity in BaFe2As2 through superlattice engineering, which we experimentally find to induce superconductivity when the As-Fe-As bond angle approaches that in a regular tetrahedron. This approach of superlattice design could lead to insights into low dimensional superconductivity in Fe-based superconductors. △ Less

Submitted 2 January, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

arXiv:1912.12586 [pdf]

doi 10.1038/s41467-020-17999-4

Controlling spin current polarization through non-collinear antiferromagnetism

Authors: T. Nan, C. X. Quintela, J. Irwin, G. Gurung, D. F. Shao, J. Gibbons, N. Campbell, K. Song, S. Y. Choi, L. Guo, R. D. Johnson, P. Manuel, R. V. Chopdekar, I. Hallsteinsen, T. Tybell, P. J. Ryan, J. W. Kim, Y. S. Choi, P. G. Radaelli, D. C. Ralph, E. Y. Tsymba, M. S. Rzchowski, C. B. Eom

Abstract: The spin-Hall effect describes the interconversion of charge currents and spin currents, enabling highly efficient manipulation of magnetization for spintronics. Symmetry conditions generally restrict polarizations of these spin currents to be orthogonal to both the charge and spin flows. Spin polarizations can deviate from such direction in nonmagnetic materials only when the crystalline symmetry… ▽ More The spin-Hall effect describes the interconversion of charge currents and spin currents, enabling highly efficient manipulation of magnetization for spintronics. Symmetry conditions generally restrict polarizations of these spin currents to be orthogonal to both the charge and spin flows. Spin polarizations can deviate from such direction in nonmagnetic materials only when the crystalline symmetry is reduced11. Here we experimentally show control of the spin polarization direction by using a non-collinear antiferromagnet Mn$_{3}$GaN, in which the triangular spin structure creates a low magnetic symmetry state while maintaining a high crystalline symmetry. We demonstrate that epitaxial Mn3GaN/Permalloy heterostructures can generate unique types of spinHall torques at room temperature corresponding to unconventional spin polarizations collinear to spin currents or charge currents which are forbidden in any sample with two-fold rotational symmetry. Our results demonstrate an approach based on spin-structure design for controlling spinorbit torque, paving the way for further progress in the emergent field of antiferromagnetic spintronics. △ Less

Submitted 29 December, 2019; originally announced December 2019.

arXiv:1912.12583 [pdf]

Epitaxial antiperovskite/perovskite heterostructures for materials design

Authors: Camilo X. Quintela, Kyung Song, Ding-Fu Shao, Lin Xie, Tianxiang Nan, Tula R. Paudel, Neil Campbell, Xiaoqing Pan, Mark S. Rzchowski, Evgeny Y. Tsymbal, Si-Young Choi, Chang-Beom Eom

Abstract: We demonstrate fabrication of atomically sharp interfaces between nitride antiperovskite Mn$_{3}$GaN and oxide perovskites (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)O$_{3}$ (LSAT) and SrTiO$_{3}$ as paradigms of nitride-antiperovskite/oxide-perovskite heterostructures. Using a combination of scanning transmission electron microscopy (STEM), atomic-resolution spectroscopic techniques, and first… ▽ More We demonstrate fabrication of atomically sharp interfaces between nitride antiperovskite Mn$_{3}$GaN and oxide perovskites (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)O$_{3}$ (LSAT) and SrTiO$_{3}$ as paradigms of nitride-antiperovskite/oxide-perovskite heterostructures. Using a combination of scanning transmission electron microscopy (STEM), atomic-resolution spectroscopic techniques, and first-principle calculations, we investigated the atomic-scale structure, composition, and boding at the interface. We show that the epitaxial growth between the antiperovskite and perovskite compounds is mediated by a coherent interfacial monolayer that connects the two anti-structures. We anticipate our results to be a major step for the development of functional antiperovskite/perovskite heterostructures opening to harness a combination of their functional properties including topological properties for ultra low power applications. △ Less

Submitted 29 December, 2019; originally announced December 2019.

arXiv:1912.12401 [pdf]

doi 10.1103/PhysRevB.101.104405

Spontaneous Hall Effect enhanced by local Ir moments in epitaxial Pr$_2$Ir$_2$O$_7$ thin films

Authors: Lu Guo, Neil Campbell, Yongseong Choi, Jong-Woo Kim, Philip J. Ryan, Huaixun Huyan, Linze Li, Tianxiang Nan, Jong-Hong Kang, Chris Sundahl, Xiaoqing Pan, M. S. Rzchowski, Chang-Beom Eom

Abstract: Rare earth pyrochlore Iridates (RE2Ir2O7) consist of two interpenetrating cation sublattices, the RE with highly-frustrated magnetic moments, and the Iridium with extended conduction orbitals significantly mixed by spin-orbit interactions. The coexistence and coupling of these two sublattices create a landscape for discovery and manipulation of quantum phenomena such as the topological Hall effect… ▽ More Rare earth pyrochlore Iridates (RE2Ir2O7) consist of two interpenetrating cation sublattices, the RE with highly-frustrated magnetic moments, and the Iridium with extended conduction orbitals significantly mixed by spin-orbit interactions. The coexistence and coupling of these two sublattices create a landscape for discovery and manipulation of quantum phenomena such as the topological Hall effect, massless conduction bands, and quantum criticality. Thin films allow extended control of the material system via symmetry-lowering effects such as strain. While bulk Pr2Ir2O7 shows a spontaneous hysteretic Hall effect below 1.5K, we observe the effect at elevated temperatures up to 15K in epitaxial thin films on (111) YSZ substrates synthesized via solid phase epitaxy. Similar to the bulk, the lack of observable long-range magnetic order in the thin films points to a topological origin. We use synchrotron-based element-specific x-ray diffraction (XRD) and x-ray magnetic circular dichroism (XMCD) to compare powders and thin films to attribute the spontaneous Hall effect in the films to localization of the Ir moments. We link the thin film Ir local moments to lattice distortions absent in the bulk-like powders. We conclude that the elevated-temperature spontaneous Hall effect is caused by the topological effect originating either from the Ir or Pr sublattice, with interaction strength enhanced by the Ir local moments. This spontaneous Hall effect with weak net moment highlights the effect of vanishingly small lattice distortions as a means to discover topological phenomena in metallic frustrated magnetic materials. △ Less

Submitted 27 December, 2019; originally announced December 2019.

Journal ref: Phys. Rev. B 101, 104405 (2020)

arXiv:1909.07698 [pdf, other]

Compositional uncertainty in deep Gaussian processes

Authors: Ivan Ustyuzhaninov, Ieva Kazlauskaite, Markus Kaiser, Erik Bodin, Neill D. F. Campbell, Carl Henrik Ek

Abstract: Gaussian processes (GPs) are nonparametric priors over functions. Fitting a GP implies computing a posterior distribution of functions consistent with the observed data. Similarly, deep Gaussian processes (DGPs) should allow us to compute a posterior distribution of compositions of multiple functions giving rise to the observations. However, exact Bayesian inference is intractable for DGPs, motiva… ▽ More Gaussian processes (GPs) are nonparametric priors over functions. Fitting a GP implies computing a posterior distribution of functions consistent with the observed data. Similarly, deep Gaussian processes (DGPs) should allow us to compute a posterior distribution of compositions of multiple functions giving rise to the observations. However, exact Bayesian inference is intractable for DGPs, motivating the use of various approximations. We show that the application of simplifying mean-field assumptions across the hierarchy leads to the layers of a DGP collapsing to near-deterministic transformations. We argue that such an inference scheme is suboptimal, not taking advantage of the potential of the model to discover the compositional structure in the data. To address this issue, we examine alternative variational inference schemes allowing for dependencies across different layers and discuss their advantages and limitations. △ Less

Submitted 25 February, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

Comments: 17 pages

arXiv:1906.11152 [pdf, other]

Modulating Surrogates for Bayesian Optimization

Authors: Erik Bodin, Markus Kaiser, Ieva Kazlauskaite, Zhenwen Dai, Neill D. F. Campbell, Carl Henrik Ek

Abstract: Bayesian optimization (BO) methods often rely on the assumption that the objective function is well-behaved, but in practice, this is seldom true for real-world objectives even if noise-free observations can be collected. Common approaches, which try to model the objective as precisely as possible, often fail to make progress by spending too many evaluations modeling irrelevant details. We address… ▽ More Bayesian optimization (BO) methods often rely on the assumption that the objective function is well-behaved, but in practice, this is seldom true for real-world objectives even if noise-free observations can be collected. Common approaches, which try to model the objective as precisely as possible, often fail to make progress by spending too many evaluations modeling irrelevant details. We address this issue by proposing surrogate models that focus on the well-behaved structure in the objective function, which is informative for search, while ignoring detrimental structure that is challenging to model from few observations. First, we demonstrate that surrogate models with appropriate noise distributions can absorb challenging structures in the objective function by treating them as irreducible uncertainty. Secondly, we show that a latent Gaussian process is an excellent surrogate for this purpose, comparing with Gaussian processes with standard noise distributions. We perform numerous experiments on a range of BO benchmarks and find that our approach improves reliability and performance when faced with challenging objective functions. △ Less

Submitted 8 September, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

Journal ref: 37th International Conference On Machine Learning (ICML 2020)

arXiv:1905.12930 [pdf, other]

Monotonic Gaussian Process Flow

Authors: Ivan Ustyuzhaninov, Ieva Kazlauskaite, Carl Henrik Ek, Neill D. F. Campbell

Abstract: We propose a new framework for imposing monotonicity constraints in a Bayesian nonparametric setting based on numerical solutions of stochastic differential equations. We derive a nonparametric model of monotonic functions that allows for interpretable priors and principled quantification of hierarchical uncertainty. We demonstrate the efficacy of the proposed model by providing competitive result… ▽ More We propose a new framework for imposing monotonicity constraints in a Bayesian nonparametric setting based on numerical solutions of stochastic differential equations. We derive a nonparametric model of monotonic functions that allows for interpretable priors and principled quantification of hierarchical uncertainty. We demonstrate the efficacy of the proposed model by providing competitive results to other probabilistic monotonic models on a number of benchmark functions. In addition, we consider the utility of a monotonic random process as a part of a hierarchical probabilistic model; we examine the task of temporal alignment of time-series data where it is beneficial to use a monotonic random process in order to preserve the uncertainty in the temporal war**s. △ Less

Submitted 25 February, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

Comments: Proceedings of the 23nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020 (14 pages)

arXiv:1812.05477 [pdf, other]

Gaussian Process Deep Belief Networks: A Smooth Generative Model of Shape with Uncertainty Propagation

Authors: Alessandro Di Martino, Erik Bodin, Carl Henrik Ek, Neill D. F. Campbell

Abstract: The shape of an object is an important characteristic for many vision problems such as segmentation, detection and tracking. Being independent of appearance, it is possible to generalize to a large range of objects from only small amounts of data. However, shapes represented as silhouette images are challenging to model due to complicated likelihood functions leading to intractable posteriors. In… ▽ More The shape of an object is an important characteristic for many vision problems such as segmentation, detection and tracking. Being independent of appearance, it is possible to generalize to a large range of objects from only small amounts of data. However, shapes represented as silhouette images are challenging to model due to complicated likelihood functions leading to intractable posteriors. In this paper we present a generative model of shapes which provides a low dimensional latent encoding which importantly resides on a smooth manifold with respect to the silhouette images. The proposed model propagates uncertainty in a principled manner allowing it to learn from small amounts of data and providing predictions with associated uncertainty. We provide experiments that show how our proposed model provides favorable quantitative results compared with the state-of-the-art while simultaneously providing a representation that resides on a low-dimensional interpretable manifold. △ Less

Submitted 13 December, 2018; originally announced December 2018.

arXiv:1811.12784 [pdf, other]

The GAN that Warped: Semantic Attribute Editing with Unpaired Data

Authors: Garoe Dorta, Sara Vicente, Neill D. F. Campbell, Ivor J. A. Simpson

Abstract: Deep neural networks have recently been used to edit images with great success, in particular for faces. However, they are often limited to only being able to work at a restricted range of resolutions. Many methods are so flexible that face edits can often result in an unwanted loss of identity. This work proposes to learn how to perform semantic image edits through the application of smooth warp… ▽ More Deep neural networks have recently been used to edit images with great success, in particular for faces. However, they are often limited to only being able to work at a restricted range of resolutions. Many methods are so flexible that face edits can often result in an unwanted loss of identity. This work proposes to learn how to perform semantic image edits through the application of smooth warp fields. Previous approaches that attempted to use war** for semantic edits required paired data, i.e. example images of the same subject with different semantic attributes. In contrast, we employ recent advances in Generative Adversarial Networks that allow our model to be trained with unpaired data. We demonstrate face editing at very high resolutions (4k images) with a single forward pass of a deep network at a lower resolution. We also show that our edits are substantially better at preserving the subject's identity. The robustness of our approach is demonstrated by showing plausible image editing results on the Cub200 birds dataset. To our knowledge this has not been previously accomplished, due the challenging nature of the dataset. △ Less

Submitted 5 March, 2020; v1 submitted 30 November, 2018; originally announced November 2018.

Comments: CVPR 2020

arXiv:1811.10689 [pdf, other]

Sequence Alignment with Dirichlet Process Mixtures

Authors: Ieva Kazlauskaite, Ivan Ustyuzhaninov, Carl Henrik Ek, Neill D. F. Campbell

Abstract: We present a probabilistic model for unsupervised alignment of high-dimensional time-warped sequences based on the Dirichlet Process Mixture Model (DPMM). We follow the approach introduced in (Kazlauskaite, 2018) of simultaneously representing each data sequence as a composition of a true underlying function and a time-war**, both of which are modelled using Gaussian processes (GPs) (Rasmussen,… ▽ More We present a probabilistic model for unsupervised alignment of high-dimensional time-warped sequences based on the Dirichlet Process Mixture Model (DPMM). We follow the approach introduced in (Kazlauskaite, 2018) of simultaneously representing each data sequence as a composition of a true underlying function and a time-war**, both of which are modelled using Gaussian processes (GPs) (Rasmussen, 2005), and aligning the underlying functions using an unsupervised alignment method. In (Kazlauskaite, 2018) the alignment is performed using the GP latent variable model (GP-LVM) (Lawrence, 2005) as a model of sequences, while our main contribution is extending this approach to using DPMM, which allows us to align the sequences temporally and cluster them at the same time. We show that the DPMM achieves competitive results in comparison to the GP-LVM on synthetic and real-world data sets, and discuss the different properties of the estimated underlying functions and the time-warps favoured by these models. △ Less

Submitted 26 November, 2018; originally announced November 2018.

Comments: 6 pages, 3 figures, "All Of Bayesian Nonparametrics" Workshop at the 32nd Annual Conference on Neural Information Processing Systems (BNP@NeurIPS2018)

arXiv:1808.06650 [pdf]

doi 10.1073/pnas.1812822116

Anisotropic spin-orbit torque generation in epitaxial SrIrO3 by symmetry design

Authors: T. Nan, T. J. Anderson, J. Gibbons, K. Hwang, N. Campbell, H. Zhou, Y. Q. Dong, G. Y. Kim, N. Reynolds, X. J. Wang, N. X. Sun, S. Y. Choi, M. S. Rzchowski, Yong Baek Kim, D. C. Ralph, C. B. Eom

Abstract: Spin-orbit coupling (SOC), the interaction between the electron spin and the orbital angular momentum, can unlock rich phenomena at interfaces, in particular interconverting spin and charge currents. Conventional heavy metals have been extensively explored due to their strong SOC of conduction electrons. However, spin-orbit effects in classes of materials such as epitaxial 5d-electron transition m… ▽ More Spin-orbit coupling (SOC), the interaction between the electron spin and the orbital angular momentum, can unlock rich phenomena at interfaces, in particular interconverting spin and charge currents. Conventional heavy metals have been extensively explored due to their strong SOC of conduction electrons. However, spin-orbit effects in classes of materials such as epitaxial 5d-electron transition metal complex oxides, which also host strong SOC, remain largely unreported. In addition to strong SOC, these complex oxides can also provide the additional tuning knob of epitaxy to control the electronic structure and the engineering of spin-to-charge conversion by crystalline symmetry. Here, we demonstrate room-temperature generation of spin-orbit torque on a ferromagnet with extremely high efficiency via the spin-Hall effect in epitaxial metastable perovskite SrIrO3. We first predict a large intrinsic spin-Hall conductivity in orthorhombic bulk SrIrO3 arising from the Berry curvature in the electronic band structure. By manipulating the intricate interplay between SOC and crystalline symmetry, we control the spin-Hall torque ratio by engineering the tilt of the corner-sharing oxygen octahedra in perovskite SrIrO3 through epitaxial strain. This allows the presence of an anisotropic spin-Hall effect due to a characteristic structural anisotropy in SrIrO3 with orthorhombic symmetry. Our experimental findings demonstrate the heteroepitaxial symmetry design approach to engineer spin-orbit effects. We therefore anticipate that these epitaxial 5d transition-metal oxide thin films can be an ideal building block for low-power spintronics. △ Less

Submitted 20 August, 2018; originally announced August 2018.

arXiv:1807.04833 [pdf, ps, other]

DP-GP-LVM: A Bayesian Non-Parametric Model for Learning Multivariate Dependency Structures

Authors: Andrew R. Lawrence, Carl Henrik Ek, Neill D. F. Campbell

Abstract: We present a non-parametric Bayesian latent variable model capable of learning dependency structures across dimensions in a multivariate setting. Our approach is based on flexible Gaussian process priors for the generative map**s and interchangeable Dirichlet process priors to learn the structure. The introduction of the Dirichlet process as a specific structural prior allows our model to circum… ▽ More We present a non-parametric Bayesian latent variable model capable of learning dependency structures across dimensions in a multivariate setting. Our approach is based on flexible Gaussian process priors for the generative map**s and interchangeable Dirichlet process priors to learn the structure. The introduction of the Dirichlet process as a specific structural prior allows our model to circumvent issues associated with previous Gaussian process latent variable models. Inference is performed by deriving an efficient variational bound on the marginal log-likelihood on the model. △ Less

Submitted 12 July, 2018; originally announced July 2018.

arXiv:1804.01050 [pdf, other]

Training VAEs Under Structured Residuals

Authors: Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson

Abstract: Variational auto-encoders (VAEs) are a popular and powerful deep generative model. Previous works on VAEs have assumed a factorized likelihood model, whereby the output uncertainty of each pixel is assumed to be independent. This approximation is clearly limited as demonstrated by observing a residual image from a VAE reconstruction, which often possess a high level of structure. This paper demons… ▽ More Variational auto-encoders (VAEs) are a popular and powerful deep generative model. Previous works on VAEs have assumed a factorized likelihood model, whereby the output uncertainty of each pixel is assumed to be independent. This approximation is clearly limited as demonstrated by observing a residual image from a VAE reconstruction, which often possess a high level of structure. This paper demonstrates a novel scheme to incorporate a structured Gaussian likelihood prediction network within the VAE that allows the residual correlations to be modeled. Our novel architecture, with minimal increase in complexity, incorporates the covariance matrix prediction within the VAE. We also propose a new mechanism for allowing structured uncertainty on color images. Furthermore, we provide a scheme for effectively training this model, and include some suggestions for improving performance in terms of efficiency or modeling longer range correlations. △ Less

Submitted 31 July, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

Comments: Simplified training methodology, added more results

arXiv:1803.02603 [pdf, other]

Gaussian Process Latent Variable Alignment Learning

Authors: Ieva Kazlauskaite, Carl Henrik Ek, Neill D. F. Campbell

Abstract: We present a model that can automatically learn alignments between high-dimensional data in an unsupervised manner. Our proposed method casts alignment learning in a framework where both alignment and data are modelled simultaneously. Further, we automatically infer grou**s of different types of sequences within the same dataset. We derive a probabilistic model built on non-parametric priors tha… ▽ More We present a model that can automatically learn alignments between high-dimensional data in an unsupervised manner. Our proposed method casts alignment learning in a framework where both alignment and data are modelled simultaneously. Further, we automatically infer grou**s of different types of sequences within the same dataset. We derive a probabilistic model built on non-parametric priors that allows for flexible warps while at the same time providing means to specify interpretable constraints. We demonstrate the efficacy of our approach with superior quantitative performance to the state-of-the-art approaches and provide examples to illustrate the versatility of our model in automatic inference of sequence grou**s, absent from previous approaches, as well as easy specification of high level priors for different modalities of data. △ Less

Submitted 1 March, 2019; v1 submitted 7 March, 2018; originally announced March 2018.

Comments: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019 (13 pages, 11 figures)

arXiv:1802.07079 [pdf, other]

Structured Uncertainty Prediction Networks

Authors: Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson

Abstract: This paper is the first work to propose a network to predict a structured uncertainty distribution for a synthesized image. Previous approaches have been mostly limited to predicting diagonal covariance matrices. Our novel model learns to predict a full Gaussian covariance matrix for each reconstruction, which permits efficient sampling and likelihood evaluation. We demonstrate that our model ca… ▽ More This paper is the first work to propose a network to predict a structured uncertainty distribution for a synthesized image. Previous approaches have been mostly limited to predicting diagonal covariance matrices. Our novel model learns to predict a full Gaussian covariance matrix for each reconstruction, which permits efficient sampling and likelihood evaluation. We demonstrate that our model can accurately reconstruct ground truth correlated residual distributions for synthetic datasets and generate plausible high frequency samples for real face images. We also illustrate the use of these predicted covariances for structure preserving image denoising. △ Less

Submitted 23 March, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

Comments: CVPR 2018 (final version)

arXiv:1801.03864 [pdf, other]

doi 10.1103/PhysRevMaterials.2.041801

Superconductivity-localization interplay and fluctuation magnetoresistance in epitaxial BaPb$_{1-x}$Bi$_x$O$_3$ thin films

Authors: David T. Harris, Neil Campbell, Reinhard Uecker, Mario Brützam, Darrell G. Schlom, Alex Levchenko, Mark S. Rzchowski, Chang-Beom Eom

Abstract: BaPb$_{1-x}$Bi$_x$O$_3$ is a superconductor, with transition temperature $T_c=11$ K, whose parent compound BaBiO$_3$ possess a charge ordering phase and perovskite crystal structure reminiscent of the cuprates. The lack of magnetism simplifies the BaPb$_{1-x}$Bi$_{x}$O$_3$ phase diagram, making this system an ideal platform for contrasting high-$T_c$ systems with isotropic superconductors. Here we… ▽ More BaPb$_{1-x}$Bi$_x$O$_3$ is a superconductor, with transition temperature $T_c=11$ K, whose parent compound BaBiO$_3$ possess a charge ordering phase and perovskite crystal structure reminiscent of the cuprates. The lack of magnetism simplifies the BaPb$_{1-x}$Bi$_{x}$O$_3$ phase diagram, making this system an ideal platform for contrasting high-$T_c$ systems with isotropic superconductors. Here we use high-quality epitaxial thin films and magnetotransport to demonstrate superconducting fluctuations that extend well beyond $T_c$. For the thickest films (thickness above $\sim100$ nm) this region extends to $\sim27$ K, well above the bulk $T_c$ and remarkably close to the higher $T_c$ of Ba$_{1-x}$K$_x$BiO$_3$ ($T_c=31$ K). We drive the system through a superconductor-insulator transition by decreasing thickness and find the observed $T_c$ correlates strongly with disorder. This material manifests strong fluctuations across a wide range of thicknesses, temperatures, and disorder presenting new opportunities for understanding the precursor of superconductivity near the 2D-3D dimensionality crossover. △ Less

Submitted 23 March, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

Comments: 20 pages, 8 figures

Journal ref: Phys. Rev. Materials 2, 041801 (2018)

arXiv:1712.06536 [pdf, other]

Nonparametric Inference for Auto-Encoding Variational Bayes

Authors: Erik Bodin, Iman Malik, Carl Henrik Ek, Neill D. F. Campbell

Abstract: We would like to learn latent representations that are low-dimensional and highly interpretable. A model that has these characteristics is the Gaussian Process Latent Variable Model. The benefits and negative of the GP-LVM are complementary to the Variational Autoencoder, the former provides interpretable low-dimensional latent representations while the latter is able to handle large amounts of da… ▽ More We would like to learn latent representations that are low-dimensional and highly interpretable. A model that has these characteristics is the Gaussian Process Latent Variable Model. The benefits and negative of the GP-LVM are complementary to the Variational Autoencoder, the former provides interpretable low-dimensional latent representations while the latter is able to handle large amounts of data and can use non-Gaussian likelihoods. Our inspiration for this paper is to marry these two approaches and reap the benefits of both. In order to do so we will introduce a novel approximate inference scheme inspired by the GP-LVM and the VAE. We show experimentally that the approximation allows the capacity of the generative bottle-neck (Z) of the VAE to be arbitrarily large without losing a highly interpretable representation, allowing reconstruction quality to be unlimited by Z at the same time as a low-dimensional space can be used to perform ancestral sampling from as well as a means to reason about the embedded data. △ Less

Submitted 18 December, 2017; originally announced December 2017.

Comments: Presented at NIPS 2017 Workshop on Advances in Approximate Bayesian Inference

arXiv:1707.05534 [pdf, other]

Latent Gaussian Process Regression

Authors: Erik Bodin, Neill D. F. Campbell, Carl Henrik Ek

Abstract: We introduce Latent Gaussian Process Regression which is a latent variable extension allowing modelling of non-stationary multi-modal processes using GPs. The approach is built on extending the input space of a regression problem with a latent variable that is used to modulate the covariance function over the training data. We show how our approach can be used to model multi-modal and non-stationa… ▽ More We introduce Latent Gaussian Process Regression which is a latent variable extension allowing modelling of non-stationary multi-modal processes using GPs. The approach is built on extending the input space of a regression problem with a latent variable that is used to modulate the covariance function over the training data. We show how our approach can be used to model multi-modal and non-stationary processes. We exemplify the approach on a set of synthetic data and provide results on real data from motion capture and geostatistics. △ Less

Submitted 16 September, 2017; v1 submitted 18 July, 2017; originally announced July 2017.

arXiv:1705.07273 [pdf, other]

doi 10.1145/3025453.3025880

Responsive Action-based Video Synthesis

Authors: Corneliu Ilisescu, Halil Aytac Kanaci, Matteo Romagnoli, Neill D. F. Campbell, Gabriel J. Brostow

Abstract: We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered, interactively. Like audio, video is easy to sample from the real world but hard to segment into clean reusable elements. Reusing a video clip means non-linear editing and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-… ▽ More We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered, interactively. Like audio, video is easy to sample from the real world but hard to segment into clean reusable elements. Reusing a video clip means non-linear editing and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-end approach enables previewing and easy iteration. We convert static-camera videos into loopable sequences, synthesizing them in response to simple end-user requests. This is hard because a) users want essentially semantic-level control over the synthesized video content, and b) automatic loop-finding is brittle and leaves users limited opportunity to work through problems. We propose a human-in-the-loop system where adding effort gives the user progressively more creative control. Artists help us evaluate how our trigger interfaces can be used for authoring of videos and video-performances. △ Less

Submitted 20 May, 2017; originally announced May 2017.

Comments: 10 pages, 12 figures, 1 table, accepted and published in Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems

ACM Class: H.5.2

arXiv:1702.01287 [pdf, other]

Doubly-Attentive Decoder for Multi-modal Neural Machine Translation

Authors: Iacer Calixto, Qun Liu, Nick Campbell

Abstract: We introduce a Multi-modal Neural Machine Translation model in which a doubly-attentive decoder naturally incorporates spatial visual features obtained using pre-trained convolutional neural networks, bridging the gap between image description and translation. Our decoder learns to attend to source-language words and parts of an image independently by means of two separate attention mechanisms as… ▽ More We introduce a Multi-modal Neural Machine Translation model in which a doubly-attentive decoder naturally incorporates spatial visual features obtained using pre-trained convolutional neural networks, bridging the gap between image description and translation. Our decoder learns to attend to source-language words and parts of an image independently by means of two separate attention mechanisms as it generates words in the target language. We find that our model can efficiently exploit not just back-translated in-domain multi-modal data but also large general-domain text-only MT corpora. We also report state-of-the-art results on the Multi30k data set. △ Less

Submitted 4 February, 2017; originally announced February 2017.

Comments: 8 pages (11 including references), 2 figures

ACM Class: I.2.7

arXiv:1702.01101 [pdf, other]

Multilingual Multi-modal Embeddings for Natural Language Processing

Authors: Iacer Calixto, Qun Liu, Nick Campbell

Abstract: We propose a novel discriminative model that learns embeddings from multilingual and multi-modal data, meaning that our model can take advantage of images and descriptions in multiple languages to improve embedding quality. To that end, we introduce a modification of a pairwise contrastive estimation optimisation function as our training objective. We evaluate our embeddings on an image-sentence r… ▽ More We propose a novel discriminative model that learns embeddings from multilingual and multi-modal data, meaning that our model can take advantage of images and descriptions in multiple languages to improve embedding quality. To that end, we introduce a modification of a pairwise contrastive estimation optimisation function as our training objective. We evaluate our embeddings on an image-sentence ranking (ISR), a semantic textual similarity (STS), and a neural machine translation (NMT) task. We find that the additional multilingual signals lead to improvements on both the ISR and STS tasks, and the discriminative cost can also be used in re-ranking $n$-best lists produced by NMT models, yielding strong improvements. △ Less

Submitted 3 February, 2017; originally announced February 2017.

Comments: 4 pages (5 including references), no figures

ACM Class: I.2.7

arXiv:1701.06521 [pdf, other]

Incorporating Global Visual Features into Attention-Based Neural Machine Translation

Authors: Iacer Calixto, Qun Liu, Nick Campbell

Abstract: We introduce multi-modal, attention-based neural machine translation (NMT) models which incorporate visual features into different parts of both the encoder and the decoder. We utilise global image features extracted using a pre-trained convolutional neural network and incorporate them (i) as words in the source sentence, (ii) to initialise the encoder hidden state, and (iii) as additional data to… ▽ More We introduce multi-modal, attention-based neural machine translation (NMT) models which incorporate visual features into different parts of both the encoder and the decoder. We utilise global image features extracted using a pre-trained convolutional neural network and incorporate them (i) as words in the source sentence, (ii) to initialise the encoder hidden state, and (iii) as additional data to initialise the decoder hidden state. In our experiments, we evaluate how these different strategies to incorporate global image features compare and which ones perform best. We also study the impact that adding synthetic multi-modal, multilingual data brings and find that the additional data have a positive impact on multi-modal models. We report new state-of-the-art results and our best models also significantly improve on a comparable phrase-based Statistical MT (PBSMT) model trained on the Multi30k data set according to all metrics evaluated. To the best of our knowledge, it is the first time a purely neural model significantly improves over a PBSMT model on all metrics evaluated on this data set. △ Less

Submitted 23 January, 2017; originally announced January 2017.

Comments: 8 pages (11 including references), 5 figures

ACM Class: I.2.7

arXiv:1605.07807 [pdf, other]

doi 10.1103/PhysRevLett.118.030502

Quantum State Discrimination Using the Minimum Average Number of Copies

Authors: Sergei Slussarenko, Morgan M. Weston, Jun-Gang Li, Nicholas Campbell, Howard M. Wiseman, Geoff J. Pryde

Abstract: In the task of discriminating between nonorthogonal quantum states from multiple copies, the key parameters are the error probability and the resources (number of copies) used. Previous studies have considered the task of minimizing the average error probability for fixed resources. Here we introduce a new state discrimination task: minimizing the average resources for a fixed admissible error pro… ▽ More In the task of discriminating between nonorthogonal quantum states from multiple copies, the key parameters are the error probability and the resources (number of copies) used. Previous studies have considered the task of minimizing the average error probability for fixed resources. Here we introduce a new state discrimination task: minimizing the average resources for a fixed admissible error probability. We show that this new task is not performed optimally by previously known strategies, and derive and experimentally test a detection scheme that performs better. △ Less

Submitted 20 February, 2017; v1 submitted 25 May, 2016; originally announced May 2016.

Comments: 5 pages, 4 figures, supplementary material in ancillary files

Journal ref: Phys. Rev. Lett. 118, 030502 (2017)

arXiv:1603.01607 [pdf, other]

Computing Shortest Paths Using A*, Landmarks, and Polygon Inequalities (Abstract)

Authors: Newton Campbell Jr

Abstract: We introduce a new heuristic for the A* algorithm that references a data structure much smaller than the one required by the ALT heuristic. This heuristic's benefits are permitted by a new approach for computing lower bounds using generalized polygon inequalities, leveraging distance information from two landmarks as opposed to the common single landmark paradigm. In this paper, we demonstrate tha… ▽ More We introduce a new heuristic for the A* algorithm that references a data structure much smaller than the one required by the ALT heuristic. This heuristic's benefits are permitted by a new approach for computing lower bounds using generalized polygon inequalities, leveraging distance information from two landmarks as opposed to the common single landmark paradigm. In this paper, we demonstrate that this heuristic stores a reduced amount of preprocessing information in comparison to previous landmark algorithms while performing faster search queries. △ Less

Submitted 4 March, 2016; originally announced March 2016.

Comments: Abstract for poster presented at the SIAM 2015 Workshop on Network Science in Snowbird, UT

arXiv:1603.00963 [pdf, other]

Using Quadrilaterals to Compute the Shortest Path

Authors: Newton Campbell Jr

Abstract: We introduce a new heuristic for the A* algorithm that references a data structure significantly smaller than that of ALT. We characterize the behavior of this new heuristic based on a dual landmark configuration that leverages quadrilateral inequalities to identify the lower bound for shortest path. Using this approach, we demonstrate both the utility and detriments of using polygon inequalities… ▽ More We introduce a new heuristic for the A* algorithm that references a data structure significantly smaller than that of ALT. We characterize the behavior of this new heuristic based on a dual landmark configuration that leverages quadrilateral inequalities to identify the lower bound for shortest path. Using this approach, we demonstrate both the utility and detriments of using polygon inequalities aside from the triangle inequality to establish lower bounds for shortest path queries. While this new heuristic does not dominate previous heuristics based on triangle inequalities, the inverse is true, as well. Further, we demonstrate that an A* heuristic function does not necessarily outperform another heuristic that it dominates. In comparison to other landmark methods, the new heuristic maintains a larger average search space while commonly decreasing the number of computed arithmetic operations. The new heuristic can significantly outperform previous methods, particularly in graphs with larger path lengths. The characterization of the use of these inequalities for bounding offers insight into its applications in other theoretical spaces. △ Less

Submitted 2 March, 2016; originally announced March 2016.

Showing 1–50 of 51 results for author: Campbell, N