Search | arXiv e-print repository

BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity

Authors: Zahra Gharaee, Scott C. Lowe, ZeMing Gong, Pablo Millan Arias, Nicholas Pellegrino, Austin T. Wang, Joakim Bruslund Haurum, Iuliia Zarubiieva, Lila Kari, Dirk Steinke, Graham W. Taylor, Paul Fieguth, Angel X. Chang

Abstract: As part of an ongoing worldwide effort to comprehend and monitor insect biodiversity, this paper presents the BIOSCAN-5M Insect dataset to the machine learning community and establish several benchmark tasks. BIOSCAN-5M is a comprehensive dataset containing multi-modal information for over 5 million insect specimens, and it significantly expands existing image-based biological datasets by includin… ▽ More As part of an ongoing worldwide effort to comprehend and monitor insect biodiversity, this paper presents the BIOSCAN-5M Insect dataset to the machine learning community and establish several benchmark tasks. BIOSCAN-5M is a comprehensive dataset containing multi-modal information for over 5 million insect specimens, and it significantly expands existing image-based biological datasets by including taxonomic labels, raw nucleotide barcode sequences, assigned barcode index numbers, and geographical information. We propose three benchmark experiments to demonstrate the impact of the multi-modal data types on the classification and clustering accuracy. First, we pretrain a masked language model on the DNA barcode sequences of the BIOSCAN-5M dataset, and demonstrate the impact of using this large reference library on species- and genus-level classification performance. Second, we propose a zero-shot transfer learning task applied to images and DNA barcodes to cluster feature embeddings obtained from self-supervised learning, to investigate whether meaningful clusters can be derived from these representation embeddings. Third, we benchmark multi-modality by performing contrastive learning on DNA barcodes, image data, and taxonomic information. This yields a general shared embedding space enabling taxonomic classification using multiple types of information and modalities. The code repository of the BIOSCAN-5M Insect dataset is available at https://github.com/zahrag/BIOSCAN-5M. △ Less

Submitted 24 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.02465 [pdf, other]

An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders

Authors: Scott C. Lowe, Joakim Bruslund Haurum, Sageev Oore, Thomas B. Moeslund, Graham W. Taylor

Abstract: Can pretrained models generalize to new datasets without any retraining? We deploy pretrained image models on datasets they were not trained for, and investigate whether their embeddings form meaningful clusters. Our suite of benchmarking experiments use encoders pretrained solely on ImageNet-1k with either supervised or self-supervised training techniques, deployed on image datasets that were not… ▽ More Can pretrained models generalize to new datasets without any retraining? We deploy pretrained image models on datasets they were not trained for, and investigate whether their embeddings form meaningful clusters. Our suite of benchmarking experiments use encoders pretrained solely on ImageNet-1k with either supervised or self-supervised training techniques, deployed on image datasets that were not seen during training, and clustered with conventional clustering algorithms. This evaluation provides new insights into the embeddings of self-supervised models, which prioritize different features to supervised models. Supervised encoders typically offer more utility than SSL encoders within the training domain, and vice-versa far outside of it, however, fine-tuned encoders demonstrate the opposite trend. Clustering provides a way to evaluate the utility of self-supervised learned representations orthogonal to existing methods such as kNN. Additionally, we find the silhouette score when measured in a UMAP-reduced space is highly correlated with clustering performance, and can therefore be used as a proxy for clustering performance on data with no ground truth labels. Our code implementation is available at \url{https://github.com/scottclowe/zs-ssl-clustering/}. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2405.17537 [pdf, other]

BIOSCAN-CLIP: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

Authors: ZeMing Gong, Austin T. Wang, Joakim Bruslund Haurum, Scott C. Lowe, Graham W. Taylor, Angel X. Chang

Abstract: Measuring biodiversity is crucial for understanding ecosystem health. While prior works have developed machine learning models for the taxonomic classification of photographic images and DNA separately, in this work, we introduce a multimodal approach combining both, using CLIP-style contrastive learning to align images, DNA barcodes, and textual data in a unified embedding space. This allows for… ▽ More Measuring biodiversity is crucial for understanding ecosystem health. While prior works have developed machine learning models for the taxonomic classification of photographic images and DNA separately, in this work, we introduce a multimodal approach combining both, using CLIP-style contrastive learning to align images, DNA barcodes, and textual data in a unified embedding space. This allows for accurate classification of both known and unknown insect species without task-specific fine-tuning, leveraging contrastive learning for the first time to fuse DNA and image data. Our method surpasses previous single-modality approaches in accuracy by over 11% on zero-shot learning tasks, showcasing its effectiveness in biodiversity studies. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 16 pages with 9 figures

arXiv:2405.05241 [pdf, other]

BenthicNet: A global compilation of seafloor images for deep learning applications

Authors: Scott C. Lowe, Benjamin Misiuk, Isaac Xu, Shakhboz Abdulazizov, Amit R. Baroi, Alex C. Bastos, Merlin Best, Vicki Ferrini, Ariell Friedman, Deborah Hart, Ove Hoegh-Guldberg, Daniel Ierodiaconou, Julia Mackin-McLaughlin, Kathryn Markey, Pedro S. Menandro, Jacquomo Monk, Shreya Nemani, John O'Brien, Elizabeth Oh, Luba Y. Reshitnyk, Katleen Robert, Chris M. Roelfsema, Jessica A. Sameoto, Alexandre C. G. Schimel, Jordan A. Thomson , et al. (4 additional authors not shown)

Abstract: Advances in underwater imaging enable the collection of extensive seafloor image datasets that are necessary for monitoring important benthic ecosystems. The ability to collect seafloor imagery has outpaced our capacity to analyze it, hindering expedient mobilization of this crucial environmental information. Recent machine learning approaches provide opportunities to increase the efficiency with… ▽ More Advances in underwater imaging enable the collection of extensive seafloor image datasets that are necessary for monitoring important benthic ecosystems. The ability to collect seafloor imagery has outpaced our capacity to analyze it, hindering expedient mobilization of this crucial environmental information. Recent machine learning approaches provide opportunities to increase the efficiency with which seafloor image datasets are analyzed, yet large and consistent datasets necessary to support development of such approaches are scarce. Here we present BenthicNet: a global compilation of seafloor imagery designed to support the training and evaluation of large-scale image recognition models. An initial set of over 11.4 million images was collected and curated to represent a diversity of seafloor environments using a representative subset of 1.3 million images. These are accompanied by 2.6 million annotations translated to the CATAMI scheme, which span 190,000 of the images. A large deep learning model was trained on this compilation and preliminary results suggest it has utility for automating large and small-scale image analysis tasks. The compilation and model are made openly available for use by the scientific community at https://doi.org/10.20383/103.0614. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2405.00820 [pdf, other]

HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond

Authors: Stefan Abi-Karam, Rishov Sarkar, Allison Seigler, Sean Lowe, Zhigang Wei, Hanqiu Chen, Nanditha Rao, Lizy John, Aman Arora, Cong Hao

Abstract: Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present challenges. Existing datasets have limitations in terms of benchmark coverage, design space enumeration, vendor extens… ▽ More Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present challenges. Existing datasets have limitations in terms of benchmark coverage, design space enumeration, vendor extensibility, or lack of reproducible and extensible software for dataset construction. Many works also lack user-friendly ways to add more designs, limiting wider adoption of such datasets. In response to these challenges, we introduce HLSFactory, a comprehensive framework designed to facilitate the curation and generation of high-quality HLS design datasets. HLSFactory has three main stages: 1) a design space expansion stage to elaborate single HLS designs into large design spaces using various optimization directives across multiple vendor tools, 2) a design synthesis stage to execute HLS and FPGA tool flows concurrently across designs, and 3) a data aggregation stage for extracting standardized data into packaged datasets for ML usage. This tripartite architecture ensures broad design space coverage via design space expansion and supports multiple vendor tools. Users can contribute to each stage with their own HLS designs and synthesis results and extend the framework itself with custom frontends and tool flows. We also include an initial set of built-in designs from common HLS benchmarks curated open-source HLS designs. We showcase the versatility and multi-functionality of our framework through six case studies: I) Design space sampling; II) Fine-grained parallelism backend speedup; III) Targeting Intel's HLS flow; IV) Adding new auxiliary designs; V) Integrating published HLS data; VI) HLS tool version regression benchmarking. Code at https://github.com/sharc-lab/HLSFactory. △ Less

Submitted 17 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

Comments: Edit to "Section V.E" for proper attribution of open-source HLSyn, AutoDSE, and the Merlin compiler

arXiv:2402.05627 [pdf, other]

Binding Dynamics in Rotating Features

Authors: Sindy Löwe, Francesco Locatello, Max Welling

Abstract: In human cognition, the binding problem describes the open question of how the brain flexibly integrates diverse information into cohesive object representations. Analogously, in machine learning, there is a pursuit for models capable of strong generalization and reasoning by learning object-centric representations in an unsupervised manner. Drawing from neuroscientific theories, Rotating Features… ▽ More In human cognition, the binding problem describes the open question of how the brain flexibly integrates diverse information into cohesive object representations. Analogously, in machine learning, there is a pursuit for models capable of strong generalization and reasoning by learning object-centric representations in an unsupervised manner. Drawing from neuroscientific theories, Rotating Features learn such representations by introducing vector-valued features that encapsulate object characteristics in their magnitudes and object affiliation in their orientations. The "$χ$-binding" mechanism, embedded in every layer of the architecture, has been shown to be crucial, but remains poorly understood. In this paper, we propose an alternative "cosine binding" mechanism, which explicitly computes the alignment between features and adjusts weights accordingly, and we show that it achieves equivalent performance. This allows us to draw direct connections to self-attention and biological neural processes, and to shed light on the fundamental dynamics for object-centric representations to emerge in Rotating Features. △ Less

Submitted 8 February, 2024; originally announced February 2024.

arXiv:2311.16943 [pdf, other]

Image segmentation with traveling waves in an exactly solvable recurrent neural network

Authors: Luisa H. B. Liboni, Roberto C. Budzinski, Alexandra N. Busch, Sindy Löwe, Thomas A. Keller, Max Welling, Lyle E. Muller

Abstract: We study image segmentation using spatiotemporal dynamics in a recurrent neural network where the state of each unit is given by a complex number. We show that this network generates sophisticated spatiotemporal dynamics that can effectively divide an image into groups according to a scene's structural characteristics. Using an exact solution of the recurrent network's dynamics, we present a preci… ▽ More We study image segmentation using spatiotemporal dynamics in a recurrent neural network where the state of each unit is given by a complex number. We show that this network generates sophisticated spatiotemporal dynamics that can effectively divide an image into groups according to a scene's structural characteristics. Using an exact solution of the recurrent network's dynamics, we present a precise description of the mechanism underlying object segmentation in this network, providing a clear mathematical interpretation of how the network performs this task. We then demonstrate a simple algorithm for object segmentation that generalizes across inputs ranging from simple geometric objects in grayscale images to natural images. Object segmentation across all images is accomplished with one recurrent neural network that has a single, fixed set of weights. This demonstrates the expressive potential of recurrent neural networks when constructed using a mathematical approach that brings together their structure, dynamics, and computation. △ Less

Submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.02401 [pdf, other]

BarcodeBERT: Transformers for Biodiversity Analysis

Authors: Pablo Millan Arias, Niousha Sadjadi, Monireh Safari, ZeMing Gong, Austin T. Wang, Scott C. Lowe, Joakim Bruslund Haurum, Iuliia Zarubiieva, Dirk Steinke, Lila Kari, Angel X. Chang, Graham W. Taylor

Abstract: Understanding biodiversity is a global challenge, in which DNA barcodes - short snippets of DNA that cluster by species - play a pivotal role. In particular, invertebrates, a highly diverse and under-explored group, pose unique taxonomic complexities. We explore machine learning approaches, comparing supervised CNNs, fine-tuned foundation models, and a DNA barcode-specific masking strategy across… ▽ More Understanding biodiversity is a global challenge, in which DNA barcodes - short snippets of DNA that cluster by species - play a pivotal role. In particular, invertebrates, a highly diverse and under-explored group, pose unique taxonomic complexities. We explore machine learning approaches, comparing supervised CNNs, fine-tuned foundation models, and a DNA barcode-specific masking strategy across datasets of varying complexity. While simpler datasets and tasks favor supervised CNNs or fine-tuned transformers, challenging species-level identification demands a paradigm shift towards self-supervised pretraining. We propose BarcodeBERT, the first self-supervised method for general biodiversity analysis, leveraging a 1.5 M invertebrate DNA barcode reference library. This work highlights how dataset specifics and coverage impact model selection, and underscores the role of self-supervised pretraining in achieving high-accuracy DNA barcode-based identification at the species and genus level. Indeed, without the fine-tuning step, BarcodeBERT pretrained on a large DNA barcode dataset outperforms DNABERT and DNABERT-2 on multiple downstream classification tasks. The code repository is available at https://github.com/Kari-Genomics-Lab/BarcodeBERT △ Less

Submitted 4 November, 2023; originally announced November 2023.

Comments: Main text: 5 pages, Total: 9 pages, 2 figures, accepted at the 4th Workshop on Self-Supervised Learning: Theory and Practice (NeurIPS 2023)

arXiv:2307.10455 [pdf, other]

A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset

Authors: Zahra Gharaee, ZeMing Gong, Nicholas Pellegrino, Iuliia Zarubiieva, Joakim Bruslund Haurum, Scott C. Lowe, Jaclyn T. A. McKeown, Chris C. Y. Ho, Joschka McLeod, Yi-Yun C Wei, Jireh Agda, Sujeevan Ratnasingham, Dirk Steinke, Angel X. Chang, Graham W. Taylor, Paul Fieguth

Abstract: In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled insect images, the BIOSCAN-Insect Dataset. Each record is taxonomically classified by an expert, and also has associated genetic information including raw nucleotide barcode sequences and assigned barcode index numbers, which are genetically-based proxies for species classification. This paper presents a c… ▽ More In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled insect images, the BIOSCAN-Insect Dataset. Each record is taxonomically classified by an expert, and also has associated genetic information including raw nucleotide barcode sequences and assigned barcode index numbers, which are genetically-based proxies for species classification. This paper presents a curated million-image dataset, primarily to train computer-vision models capable of providing image-based taxonomic assessment, however, the dataset also presents compelling characteristics, the study of which would be of interest to the broader machine learning community. Driven by the biological nature inherent to the dataset, a characteristic long-tailed class-imbalance distribution is exhibited. Furthermore, taxonomic labelling is a hierarchical classification scheme, presenting a highly fine-grained classification problem at lower levels. Beyond spurring interest in biodiversity research within the machine learning community, progress on creating an image-based taxonomic classifier will also further the ultimate goal of all BIOSCAN research: to lay the foundation for a comprehensive survey of global biodiversity. This paper introduces the dataset and explores the classification task through the implementation and analysis of a baseline classifier. △ Less

Submitted 13 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

arXiv:2306.10064 [pdf, other]

doi 10.1121/10.0024467

Computing leaky Lamb waves for waveguides between elastic half-spaces using spectral collocation

Authors: Evripides Georgiades, Michael J. S. Lowe, Richard V. Craster

Abstract: In non-destructive evaluation guided wave inspections, the elastic structure to be inspected is often embedded within other elastic media and the ensuing leaky waves are complex and non-trivial to compute; we consider the canonical example of an elastic waveguide surrounded by other elastic materials that demonstrates the fundamental issues with calculating the leaky waves in such systems. Due to… ▽ More In non-destructive evaluation guided wave inspections, the elastic structure to be inspected is often embedded within other elastic media and the ensuing leaky waves are complex and non-trivial to compute; we consider the canonical example of an elastic waveguide surrounded by other elastic materials that demonstrates the fundamental issues with calculating the leaky waves in such systems. Due to the complex wavenumber solutions required to represent them, leaky waves pose significant challenges to existing numerical methods, with methods that spatially discretise the field to retrieve them suffering from the exponential growth of their amplitude far into the surrounding media. We present a spectral collocation method yielding an accurate and efficient identification of these modes, leaking into elastic half-spaces. We discretise the elastic domains and, depending on the exterior bulk wavespeeds, select appropriate map**s of the discretised domain to complex paths, in which the numerical solution decays and the physics of the problem are preserved. By iterating through all possible radiation cases, the full set of dispersion and attenuation curves are successfully retrieved and validated, where possible, against the commercially available software DISPERSE. As an independent validation, dispersion curves are obtained from finite element simulations of time-dependent waves using Fourier analysis. △ Less

Submitted 3 July, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Journal ref: J. Acoust. Soc. Am. 155 (2024) 629-639

arXiv:2306.09643 [pdf, other]

BISCUIT: Causal Representation Learning from Binary Interactions

Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

Abstract: Identifying the causal variables of an environment and how to intervene on them is of core value in applications such as robotics and embodied AI. While an agent can commonly interact with the environment and may implicitly perturb the behavior of some of these causal variables, often the targets it affects remain unknown. In this paper, we show that causal variables can still be identified for ma… ▽ More Identifying the causal variables of an environment and how to intervene on them is of core value in applications such as robotics and embodied AI. While an agent can commonly interact with the environment and may implicitly perturb the behavior of some of these causal variables, often the targets it affects remain unknown. In this paper, we show that causal variables can still be identified for many common setups, e.g., additive Gaussian noise models, if the agent's interactions with a causal variable can be described by an unknown binary variable. This happens when each causal variable has two different mechanisms, e.g., an observational and an interventional one. Using this identifiability result, we propose BISCUIT, a method for simultaneously learning causal variables and their corresponding binary interaction variables. On three robotic-inspired datasets, BISCUIT accurately identifies causal variables and can even be scaled to complex, realistic environments for embodied AI. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Comments: Published in: Uncertainty in Artificial Intelligence (UAI 2023). Project page: https://phlippe.github.io/BISCUIT/

arXiv:2306.00600 [pdf, other]

Rotating Features for Object Discovery

Authors: Sindy Löwe, Phillip Lippe, Francesco Locatello, Max Welling

Abstract: The binding problem in human cognition, concerning how the brain represents and connects objects within a fixed network of neural connections, remains a subject of intense debate. Most machine learning efforts addressing this issue in an unsupervised setting have focused on slot-based methods, which may be limiting due to their discrete nature and difficulty to express uncertainty. Recently, the C… ▽ More The binding problem in human cognition, concerning how the brain represents and connects objects within a fixed network of neural connections, remains a subject of intense debate. Most machine learning efforts addressing this issue in an unsupervised setting have focused on slot-based methods, which may be limiting due to their discrete nature and difficulty to express uncertainty. Recently, the Complex AutoEncoder was proposed as an alternative that learns continuous and distributed object-centric representations. However, it is only applicable to simple toy data. In this paper, we present Rotating Features, a generalization of complex-valued features to higher dimensions, and a new evaluation procedure for extracting objects from distributed representations. Additionally, we show the applicability of our approach to pre-trained features. Together, these advancements enable us to scale distributed object-centric representations from simple toy to real-world data. We believe this work advances a new paradigm for addressing the binding problem in machine learning and has the potential to inspire further innovation in the field. △ Less

Submitted 17 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

Comments: Oral presentation at NeurIPS 2023

arXiv:2304.10189 [pdf, other]

Investigation of the Influence of Macrozones in Titanium Alloys on the Propagation and Scattering of Ultrasound

Authors: Wei Yi Yeoh, Bo Lan, Michael J. S. Lowe

Abstract: The presence of macrozones (or micro-textured regions) in Ti-6Al-4V (Ti64) was shown to be a potential cause to the onset of cold dwell fatigue which reduces fatigue life significantly. Past research has demonstrated the potential of using ultrasonic testing for macrozone characterisation, with the variation of ultrasound attenuation, backscatter, and velocity in the presence of macrozones. Howeve… ▽ More The presence of macrozones (or micro-textured regions) in Ti-6Al-4V (Ti64) was shown to be a potential cause to the onset of cold dwell fatigue which reduces fatigue life significantly. Past research has demonstrated the potential of using ultrasonic testing for macrozone characterisation, with the variation of ultrasound attenuation, backscatter, and velocity in the presence of macrozones. However, due to the complexity of the microstructure, some physical phenomena that were observed are still not well understood. In this study, we propose the use of Finite Element (FE) polycrystalline models to provide us with a means to systematically study the wave-macrozone interaction. Through this investigation performed using two-dimensional (2D) models, we are able to identify important correlations between macrozone characteristics (size, shape, and texture) and ultrasound responses (attenuation, backscatter, and velocity). The observed behaviours are then validated experimentally, and we also highlight how this understanding can potentially aid with the characterisation of macrozones in Ti-64 samples. △ Less

Submitted 20 April, 2023; originally announced April 2023.

Comments: 20 pages, 21 figures, 2 tables

arXiv:2303.14664 [pdf]

A plastics hierarchy of fates: sustainable choices for a circular future

Authors: Kristoffer Kortsen, Siobhan Kilbride, Stephen R. Lowe, Adam Peirce, Michael P. Shaver

Abstract: Plastics are ubiquitous in modern society, but the linear model of produce, use, and dispose results in massive amounts of resource consumption and pollution. Landfill and incineration of plastic waste is endemic, with no consensus on a clear path towards a more sustainable model. Progress is often hampered by a lack of clarity on what choices will enable a more sustainable circular plastics econo… ▽ More Plastics are ubiquitous in modern society, but the linear model of produce, use, and dispose results in massive amounts of resource consumption and pollution. Landfill and incineration of plastic waste is endemic, with no consensus on a clear path towards a more sustainable model. Progress is often hampered by a lack of clarity on what choices will enable a more sustainable circular plastics economy. Our Plastics Hierarchy of Fates tool addresses this by bringing together scattered information on the end-of-life fates of plastics in a more accessible format. This tool will support manufacturing, processing, and policy decisions in the push for a more sustainable future. Potential sorting and recycling decisions for plastics waste are discussed in the hierarchy and the consequences of different decisions are highlighted. The hierarchy is meant to inform potential outcomes but can also be used to help shape future interventions. △ Less

Submitted 26 March, 2023; originally announced March 2023.

Comments: to access the interactive tool, see https://lucid.app/documents/embedded/0ad93c05-0179-40bb-9468-63e25fc4dfae#

arXiv:2206.06977 [pdf, other]

doi 10.1121/10.0013897

Leaky wave characterisation using spectral methods

Authors: Evripides Georgiades, Michael J. S. Lowe, Richard V. Craster

Abstract: Leaky waves are an important class of waves, particularly for guiding waves along structures embedded within another medium; a mismatch in wavespeeds often leads to leakage of energy from the waveguide, or interface, into the medium, which consequently attenuates the guided wave. The accurate and efficient identification of theoretical solutions for leaky waves is a key requirement for the choices… ▽ More Leaky waves are an important class of waves, particularly for guiding waves along structures embedded within another medium; a mismatch in wavespeeds often leads to leakage of energy from the waveguide, or interface, into the medium, which consequently attenuates the guided wave. The accurate and efficient identification of theoretical solutions for leaky waves is a key requirement for the choices of modes and frequencies required for non-destructive evaluation inspection techniques. We choose a typical situation to study: an elastic waveguide with a fluid on either side. Historically, leaky waves are identified via root-finding methods that have issues with conditioning, or, numerical methods` that struggle with the exponential growth of solutions at infinity. By building upon a spectral collocation method, we show how it can be adjusted to find exponentially growing solutions, i.e. leaky waves, leading to an accurate, fast and efficient identification of their dispersion properties. The key concept required is a map**, in the fluid region, that allows for exponential growth of the physical solution at infinity, whilst the mapped numerical setting decays. We illustrate this by studying leaky Lamb waves in an elastic waveguide immersed between two different fluids and verify this using the commercially available software Disperse. △ Less

Submitted 8 June, 2022; originally announced June 2022.

Journal ref: J. Acoust. Soc. Am. 152 (2022) 1487-1497

arXiv:2206.06169 [pdf, other]

Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems

Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

Abstract: Causal representation learning is the task of identifying the underlying causal variables and their relations from high-dimensional observations, such as images. Recent work has shown that one can reconstruct the causal variables from temporal sequences of observations under the assumption that there are no instantaneous causal relations between them. In practical applications, however, our measur… ▽ More Causal representation learning is the task of identifying the underlying causal variables and their relations from high-dimensional observations, such as images. Recent work has shown that one can reconstruct the causal variables from temporal sequences of observations under the assumption that there are no instantaneous causal relations between them. In practical applications, however, our measurement or frame rate might be slower than many of the causal effects. This effectively creates "instantaneous" effects and invalidates previous identifiability results. To address this issue, we propose iCITRIS, a causal representation learning method that allows for instantaneous effects in intervened temporal sequences when intervention targets can be observed, e.g., as actions of an agent. iCITRIS identifies the potentially multidimensional causal variables from temporal observations, while simultaneously using a differentiable causal discovery method to learn their causal graph. In experiments on three datasets of interactive systems, iCITRIS accurately identifies the causal variables and their causal graph. △ Less

Submitted 7 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

Comments: Published at International Conference on Learning Representations (ICLR), 2023

arXiv:2204.02075 [pdf, other]

Complex-Valued Autoencoders for Object Discovery

Authors: Sindy Löwe, Phillip Lippe, Maja Rudolph, Max Welling

Abstract: Object-centric representations form the basis of human perception, and enable us to reason about the world and to systematically generalize to new settings. Currently, most works on unsupervised object discovery focus on slot-based approaches, which explicitly separate the latent representations of individual objects. While the result is easily interpretable, it usually requires the design of invo… ▽ More Object-centric representations form the basis of human perception, and enable us to reason about the world and to systematically generalize to new settings. Currently, most works on unsupervised object discovery focus on slot-based approaches, which explicitly separate the latent representations of individual objects. While the result is easily interpretable, it usually requires the design of involved architectures. In contrast to this, we propose a comparatively simple approach - the Complex AutoEncoder (CAE) - that creates distributed object-centric representations. Following a coding scheme theorized to underlie object representations in biological neurons, its complex-valued activations represent two messages: their magnitudes express the presence of a feature, while the relative phase differences between neurons express which features should be bound together to create joint object representations. In contrast to previous approaches using complex-valued activations for object discovery, we present a fully unsupervised approach that is trained end-to-end - resulting in significant improvements in performance and efficiency. Further, we show that the CAE achieves competitive or better unsupervised object discovery performance on simple multi-object datasets compared to a state-of-the-art slot-based approach while being up to 100 times faster to train. △ Less

Submitted 18 November, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

Comments: Published in Transactions on Machine Learning Research (TMLR)

arXiv:2202.09648 [pdf, other]

doi 10.3389/fmars.2022.867857

Echofilter: A Deep Learning Segmentation Model Improves the Automation, Standardization, and Timeliness for Post-Processing Echosounder Data in Tidal Energy Streams

Authors: Scott C. Lowe, Louise P. McGarry, Jessica Douglas, Jason Newport, Sageev Oore, Christopher Whidden, Daniel J. Hasselman

Abstract: Understanding the abundance and distribution of fish in tidal energy streams is important to assess risks presented by introducing tidal energy devices to the habitat. However tidal current flows suitable for tidal energy are often highly turbulent, complicating the interpretation of echosounder data. The portion of the water column contaminated by returns from entrained air must be excluded from… ▽ More Understanding the abundance and distribution of fish in tidal energy streams is important to assess risks presented by introducing tidal energy devices to the habitat. However tidal current flows suitable for tidal energy are often highly turbulent, complicating the interpretation of echosounder data. The portion of the water column contaminated by returns from entrained air must be excluded from data used for biological analyses. Application of a single conventional algorithm to identify the depth-of-penetration of entrained air is insufficient for a boundary that is discontinuous, depth-dynamic, porous, and varies with tidal flow speed. Using a case study at a tidal energy demonstration site in the Bay of Fundy, we describe the development and application of a deep machine learning model with a U-Net based architecture. Our model, Echofilter, was highly responsive to the dynamic range of turbulence conditions and sensitive to the fine-scale nuances in the boundary position, producing an entrained-air boundary line with an average error of 0.33m on mobile downfacing and 0.5-1.0m on stationary upfacing data, less than half that of existing algorithmic solutions. The model's overall annotations had a high level of agreement with the human segmentation, with an intersection-over-union score of 99% for mobile downfacing recordings and 92-95% for stationary upfacing recordings. This resulted in a 50% reduction in the time required for manual edits when compared to the time required to manually edit the line placement produced by the currently available algorithms. Because of the improved initial automated placement, the implementation of the models permits an increase in the standardization and repeatability of line placement. △ Less

Submitted 18 August, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

Journal ref: Front. Mar. Sci. 9:867857 (2022)

arXiv:2202.03169 [pdf, other]

CITRIS: Causal Identifiability from Temporal Intervened Sequences

Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

Abstract: Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recen… ▽ More Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recent literature, CITRIS exploits temporality and observing intervention targets to identify scalar and multidimensional causal factors, such as 3D rotation angles. Furthermore, by introducing a normalizing flow, CITRIS can be easily extended to leverage and disentangle representations obtained by already pretrained autoencoders. Extending previous results on scalar causal factors, we prove identifiability in a more general setting, in which only some components of a causal factor are affected by interventions. In experiments on 3D rendered image sequences, CITRIS outperforms previous methods on recovering the underlying causal variables. Moreover, using pretrained autoencoders, CITRIS can even generalize to unseen instantiations of causal factors, opening future research areas in sim-to-real generalization for causal representation learning. △ Less

Submitted 15 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: Accepted at the International Conference on Machine Learning (ICML), 2022

arXiv:2202.01895 [pdf]

doi 10.1098/rsta.2021.0382

Appraising scattering theories for polycrystals of any symmetry using finite elements

Authors: Ming Huang, Stanislav I. Rokhlin, Michael J. S. Lowe

Abstract: This paper uses 3D grain-scale finite element (FE) simulations to appraise the classical scattering theory of plane longitudinal wave propagation in untextured polycrystals with statistically equiaxed grains belonging to the seven crystal symmetries. As revealed from the results of 10,390 materials, the classical theory has a linear relationship with the elastic scattering factor at the quasi-stat… ▽ More This paper uses 3D grain-scale finite element (FE) simulations to appraise the classical scattering theory of plane longitudinal wave propagation in untextured polycrystals with statistically equiaxed grains belonging to the seven crystal symmetries. As revealed from the results of 10,390 materials, the classical theory has a linear relationship with the elastic scattering factor at the quasi-static velocity limit, whereas the reference FE and self-consistent (SC) results generally exhibit a quadratic relationship. As supported by the results of 90 materials, such order difference also extends to the attenuation and phase velocity, leading to larger differences between the classical theory and the FE results for more strongly scattering materials. Alternatively, two approximate models are proposed to achieve more accurate calculations by including an additional quadratic term. One model uses quadratic coefficients from quasi-static SC velocity fits and is thus symmetry-specific, while the other uses theoretically-determined coefficients and is valid for any individual material. These simple models generally deliver more accurate attenuation and phase velocity (particularly the second model) than the classical theory, especially for strongly scattering materials. However, the models are invalid for the attenuation of materials with negative quadratic coefficients. △ Less

Submitted 1 February, 2022; originally announced February 2022.

Comments: 26 pages, 6 figures, 5 tables, submitted to Philosophical Transactions of the Royal Society A

arXiv:2111.14913 [pdf]

doi 10.1098/rspa.2021.0850

Finite element and semi-analytical study of elastic wave propagation in strongly scattering polycrystals

Authors: Ming Huang, Peter Huthwaite, Stanislav I. Rokhlin, Michael J. S. Lowe

Abstract: This work studies scattering-induced elastic wave attenuation and phase velocity variation in 3D untextured cubic polycrystals with statistically equiaxed grains using the theoretical second-order approximation (SOA) and Born approximation models and the grain-scale finite element (FE) model, pushing the boundary towards strongly scattering materials. The results for materials with Zener anisotrop… ▽ More This work studies scattering-induced elastic wave attenuation and phase velocity variation in 3D untextured cubic polycrystals with statistically equiaxed grains using the theoretical second-order approximation (SOA) and Born approximation models and the grain-scale finite element (FE) model, pushing the boundary towards strongly scattering materials. The results for materials with Zener anisotropy indices A>1 show a good agreement between the theoretical and FE models in the transition and stochastic regions. In the Rayleigh regime, the agreement is reasonable for common structural materials with 1<A<3.2 but it deteriorates as A increases. The wavefields and signals from FE modelling show the emergence of very strong scattering at low frequencies for strongly scattering materials that cannot be fully accounted for by the theoretical models. To account for such strong scattering at A>1, a semi-analytical model is proposed by iterating the far-field Born approximation and optimising the iterative coefficient. The proposed model agrees remarkably well with the FE model across all studied materials with greatly differing microstructures; the model validity also extends to the quasi-static velocity limit. For polycrystals with A<1, it is found that the agreement between the SOA and FE results is excellent for all studied materials and the correction of the model is not needed. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: 26 pages, 9 figures, 3 tables, submitted to Proceedings of the Royal Society A

Journal ref: Proceedings of the Royal Society A 478(2022): 20210850

arXiv:2111.01742 [pdf, ps, other]

LogAvgExp Provides a Principled and Performant Global Pooling Operator

Authors: Scott C. Lowe, Thomas Trappenberg, Sageev Oore

Abstract: We seek to improve the pooling operation in neural networks, by applying a more theoretically justified operator. We demonstrate that LogSumExp provides a natural OR operator for logits. When one corrects for the number of elements inside the pooling operator, this becomes $\text{LogAvgExp} := \log(\text{mean}(\exp(x)))$. By introducing a single temperature parameter, LogAvgExp smoothly transition… ▽ More We seek to improve the pooling operation in neural networks, by applying a more theoretically justified operator. We demonstrate that LogSumExp provides a natural OR operator for logits. When one corrects for the number of elements inside the pooling operator, this becomes $\text{LogAvgExp} := \log(\text{mean}(\exp(x)))$. By introducing a single temperature parameter, LogAvgExp smoothly transitions from the max of its operands to the mean (found at the limiting cases $t \to 0^+$ and $t \to +\infty$). We experimentally tested LogAvgExp, both with and without a learnable temperature parameter, in a variety of deep neural network architectures for computer vision. △ Less

Submitted 2 November, 2021; originally announced November 2021.

arXiv:2110.11940 [pdf, other]

Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators

Authors: Scott C. Lowe, Robert Earle, Jason d'Eon, Thomas Trappenberg, Sageev Oore

Abstract: The choice of activation functions and their motivation is a long-standing issue within the neural network community. Neuronal representations within artificial neural networks are commonly understood as logits, representing the log-odds score of presence of features within the stimulus. We derive logit-space operators equivalent to probabilistic Boolean logic-gates AND, OR, and XNOR for independe… ▽ More The choice of activation functions and their motivation is a long-standing issue within the neural network community. Neuronal representations within artificial neural networks are commonly understood as logits, representing the log-odds score of presence of features within the stimulus. We derive logit-space operators equivalent to probabilistic Boolean logic-gates AND, OR, and XNOR for independent probabilities. Such theories are important to formalize more complex dendritic operations in real neurons, and these operations can be used as activation functions within a neural network, introducing probabilistic Boolean-logic as the core operation of the neural network. Since these functions involve taking multiple exponents and logarithms, they are computationally expensive and not well suited to be directly used within neural networks. Consequently, we construct efficient approximations named $\text{AND}_\text{AIL}$ (the AND operator Approximate for Independent Logits), $\text{OR}_\text{AIL}$, and $\text{XNOR}_\text{AIL}$, which utilize only comparison and addition operations, have well-behaved gradients, and can be deployed as activation functions in neural networks. Like MaxOut, $\text{AND}_\text{AIL}$ and $\text{OR}_\text{AIL}$ are generalizations of ReLU to two-dimensions. While our primary aim is to formalize dendritic computations within a logit-space probabilistic-Boolean framework, we deploy these new activation functions, both in isolation and in conjunction to demonstrate their effectiveness on a variety of tasks including image classification, transfer learning, abstract reasoning, and compositional zero-shot learning. △ Less

Submitted 29 November, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

Journal ref: Neural Information Processing Systems (2022)

arXiv:2107.07820 [pdf, other]

Contrastive Predictive Coding for Anomaly Detection

Authors: Puck de Haan, Sindy Löwe

Abstract: Reliable detection of anomalies is crucial when deploying machine learning models in practice, but remains challenging due to the lack of labeled data. To tackle this challenge, contrastive learning approaches are becoming increasingly popular, given the impressive results they have achieved in self-supervised representation learning settings. However, while most existing contrastive anomaly detec… ▽ More Reliable detection of anomalies is crucial when deploying machine learning models in practice, but remains challenging due to the lack of labeled data. To tackle this challenge, contrastive learning approaches are becoming increasingly popular, given the impressive results they have achieved in self-supervised representation learning settings. However, while most existing contrastive anomaly detection and segmentation approaches have been applied to images, none of them can use the contrastive losses directly for both anomaly detection and segmentation. In this paper, we close this gap by making use of the Contrastive Predictive Coding model (arXiv:1807.03748). We show that its patch-wise contrastive loss can directly be interpreted as an anomaly score, and how this allows for the creation of anomaly segmentation masks. The resulting model achieves promising results for both anomaly detection and segmentation on the challenging MVTec-AD dataset. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: 7 pages, ICML 2021 Workshop on Uncertainty and Robustness in Deep Learning

arXiv:2011.10287 [pdf, other]

Learning Object-Centric Video Models by Contrasting Sets

Authors: Sindy Löwe, Klaus Greff, Rico Jonschkowski, Alexey Dosovitskiy, Thomas Kipf

Abstract: Contrastive, self-supervised learning of object representations recently emerged as an attractive alternative to reconstruction-based training. Prior approaches focus on contrasting individual object representations (slots) against one another. However, a fundamental problem with this approach is that the overall contrastive loss is the same for (i) representing a different object in each slot, as… ▽ More Contrastive, self-supervised learning of object representations recently emerged as an attractive alternative to reconstruction-based training. Prior approaches focus on contrasting individual object representations (slots) against one another. However, a fundamental problem with this approach is that the overall contrastive loss is the same for (i) representing a different object in each slot, as it is for (ii) (re-)representing the same object in all slots. Thus, this objective does not inherently push towards the emergence of object-centric representations in the slots. We address this problem by introducing a global, set-based contrastive loss: instead of contrasting individual slot representations against one another, we aggregate the representations and contrast the joined sets against one another. Additionally, we introduce attention-based encoders to this contrastive setup which simplifies training and provides interpretable object masks. Our results on two synthetic video datasets suggest that this approach compares favorably against previous contrastive methods in terms of reconstruction, future prediction and object separation performance. △ Less

Submitted 20 November, 2020; originally announced November 2020.

Comments: NeurIPS 2020 Workshop on Object Representations for Learning and Reasoning

arXiv:2006.10833 [pdf, other]

Amortized Causal Discovery: Learning to Infer Causal Graphs from Time-Series Data

Authors: Sindy Löwe, David Madras, Richard Zemel, Max Welling

Abstract: On time-series data, most causal discovery methods fit a new model whenever they encounter samples from a new underlying causal graph. However, these samples often share relevant information which is lost when following this approach. Specifically, different samples may share the dynamics which describe the effects of their causal relations. We propose Amortized Causal Discovery, a novel framework… ▽ More On time-series data, most causal discovery methods fit a new model whenever they encounter samples from a new underlying causal graph. However, these samples often share relevant information which is lost when following this approach. Specifically, different samples may share the dynamics which describe the effects of their causal relations. We propose Amortized Causal Discovery, a novel framework that leverages such shared dynamics to learn to infer causal relations from time-series data. This enables us to train a single, amortized model that infers causal relations across samples with different underlying causal graphs, and thus leverages the shared dynamics information. We demonstrate experimentally that this approach, implemented as a variational model, leads to significant improvements in causal discovery performance, and show how it can be extended to perform well under added noise and hidden confounding. △ Less

Submitted 21 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: Accepted as a conference paper at CLeaR 2022

arXiv:1911.07721 [pdf, other]

Program synthesis performance constrained by non-linear spatial relations in Synthetic Visual Reasoning Test

Authors: Lu Yihe, Scott C. Lowe, Penelope A. Lewis, Mark C. W. van Rossum

Abstract: Despite remarkable advances in automated visual recognition by machines, some visual tasks remain challenging for machines. Fleuret et al. (2011) introduced the Synthetic Visual Reasoning Test (SVRT) to highlight this point, which required classification of images consisting of randomly generated shapes based on hidden abstract rules using only a few examples. Ellis et al. (2015) demonstrated that… ▽ More Despite remarkable advances in automated visual recognition by machines, some visual tasks remain challenging for machines. Fleuret et al. (2011) introduced the Synthetic Visual Reasoning Test (SVRT) to highlight this point, which required classification of images consisting of randomly generated shapes based on hidden abstract rules using only a few examples. Ellis et al. (2015) demonstrated that a program synthesis approach could solve some of the SVRT problems with unsupervised, few-shot learning, whereas they remained challenging for several convolutional neural networks trained with thousands of examples. Here we re-considered the human and machine experiments, because they followed different protocols and yielded different statistics. We thus proposed a quantitative reintepretation of the data between the protocols, so that we could make fair comparison between human and machine performance. We improved the program synthesis classifier by correcting the image parsings, and compared the results to the performance of other machine agents and human subjects. We grouped the SVRT problems into different types by the two aspects of the core characteristics for classification: shape specification and location relation. We found that the program synthesis classifier could not solve problems involving shape distances, because it relied on symbolic computation which scales poorly with input dimension and adding distances into such computation would increase the dimension combinatorially with the number of shapes in an image. Therefore, although the program synthesis classifier is capable of abstract reasoning, its performance is highly constrained by the accessible information in image parsings. △ Less

Submitted 19 November, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

arXiv:1907.04352 [pdf, other]

Exploring Conditioning for Generative Music Systems with Human-Interpretable Controls

Authors: Nicholas Meade, Nicholas Barreyre, Scott C. Lowe, Sageev Oore

Abstract: Performance RNN is a machine-learning system designed primarily for the generation of solo piano performances using an event-based (rather than audio) representation. More specifically, Performance RNN is a long short-term memory (LSTM) based recurrent neural network that models polyphonic music with expressive timing and dynamics (Oore et al., 2018). The neural network uses a simple language mode… ▽ More Performance RNN is a machine-learning system designed primarily for the generation of solo piano performances using an event-based (rather than audio) representation. More specifically, Performance RNN is a long short-term memory (LSTM) based recurrent neural network that models polyphonic music with expressive timing and dynamics (Oore et al., 2018). The neural network uses a simple language model based on the Musical Instrument Digital Interface (MIDI) file format. Performance RNN is trained on the e-Piano Junior Competition Dataset (International Piano e-Competition, 2018), a collection of solo piano performances by expert pianists. As an artistic tool, one of the limitations of the original model has been the lack of useable controls. The standard form of Performance RNN can generate interesting pieces, but little control is provided over what specifically is generated. This paper explores a set of conditioning-based controls used to influence the generation process. △ Less

Submitted 3 August, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

Journal ref: International Conference on Computational Creativity, 2019

arXiv:1906.05926 [pdf, other]

N-body Approach to the Traveling Salesman Problem (TSP)

Authors: Johnny Seay, Edwin Gonzalez, Stephen Lowe, Jesse Crawford, Bryant Wyatt

Abstract: In the Traveling Salesman Problem (TSP), a list of cities and the distances between them are given. The goal is to find the shortest possible route that visits each city exactly once and returns to the original city. The TSP has a wide range of applications in many different industries including, but not limited to, optimizing mail and ship** routes, guiding industrial machines, map** genomes,… ▽ More In the Traveling Salesman Problem (TSP), a list of cities and the distances between them are given. The goal is to find the shortest possible route that visits each city exactly once and returns to the original city. The TSP has a wide range of applications in many different industries including, but not limited to, optimizing mail and ship** routes, guiding industrial machines, map** genomes, and improving autonomous vehicles. For centuries, traveling salesmen, politicians, and circuit preachers have tackled their own versions of the problem. Within the last century, the TSP has become one of the most important problems in the fields of mathematics and computer science. The time to find an exact solution is often impractically long, which has led to the development of numerous approximation techniques, ranging from linear programming methods to nature-inspired models. Here, we present a novel N-body approach to the TSP. △ Less

Submitted 13 June, 2019; originally announced June 2019.

Comments: 19 pages, 19 figures, 2 tables

MSC Class: 70F10 (Primary) 90C27; 90C59 (Secondary) ACM Class: G.2.1; I.6.0

arXiv:1905.11786 [pdf, other]

Putting An End to End-to-End: Gradient-Isolated Learning of Representations

Authors: Sindy Löwe, Peter O'Connor, Bastiaan S. Veeling

Abstract: We propose a novel deep learning method for local self-supervised representation learning that does not require labels nor end-to-end backpropagation but exploits the natural order in data instead. Inspired by the observation that biological neural networks appear to learn without backpropagating a global error signal, we split a deep neural network into a stack of gradient-isolated modules. Each… ▽ More We propose a novel deep learning method for local self-supervised representation learning that does not require labels nor end-to-end backpropagation but exploits the natural order in data instead. Inspired by the observation that biological neural networks appear to learn without backpropagating a global error signal, we split a deep neural network into a stack of gradient-isolated modules. Each module is trained to maximally preserve the information of its inputs using the InfoNCE bound from Oord et al. [2018]. Despite this greedy training, we demonstrate that each module improves upon the output of its predecessor, and that the representations created by the top module yield highly competitive results on downstream classification tasks in the audio and visual domain. The proposal enables optimizing modules asynchronously, allowing large-scale distributed training of very deep neural networks on unlabelled datasets. △ Less

Submitted 27 January, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: Honorable Mention for Outstanding New Directions Paper Award at NeurIPS 2019

arXiv:1807.02011 [pdf, other]

doi 10.5220/0007364503720380

Improving Unsupervised Defect Segmentation by Applying Structural Similarity to Autoencoders

Authors: Paul Bergmann, Sindy Löwe, Michael Fauser, David Sattlegger, Carsten Steger

Abstract: Convolutional autoencoders have emerged as popular methods for unsupervised defect segmentation on image data. Most commonly, this task is performed by thresholding a pixel-wise reconstruction error based on an $\ell^p$ distance. This procedure, however, leads to large residuals whenever the reconstruction encompasses slight localization inaccuracies around edges. It also fails to reveal defective… ▽ More Convolutional autoencoders have emerged as popular methods for unsupervised defect segmentation on image data. Most commonly, this task is performed by thresholding a pixel-wise reconstruction error based on an $\ell^p$ distance. This procedure, however, leads to large residuals whenever the reconstruction encompasses slight localization inaccuracies around edges. It also fails to reveal defective regions that have been visually altered when intensity values stay roughly consistent. We show that these problems prevent these approaches from being applied to complex real-world scenarios and that it cannot be easily avoided by employing more elaborate architectures such as variational or feature matching autoencoders. We propose to use a perceptual loss function based on structural similarity which examines inter-dependencies between local image regions, taking into account luminance, contrast and structural information, instead of simply comparing single pixel values. It achieves significant performance gains on a challenging real-world dataset of nanofibrous materials and a novel dataset of two woven fabrics over the state of the art approaches for unsupervised defect segmentation that use pixel-wise reconstruction error metrics. △ Less

Submitted 1 February, 2019; v1 submitted 5 July, 2018; originally announced July 2018.

arXiv:1502.00045 [pdf, other]

Domain-Type-Guided Refinement Selection Based on Sliced Path Prefixes

Authors: Dirk Beyer, Stefan Löwe, Philipp Wendler

Abstract: Abstraction is a successful technique in software verification, and interpolation on infeasible error paths is a successful approach to automatically detect the right level of abstraction in counterexample-guided abstraction refinement. Because the interpolants have a significant influence on the quality of the abstraction, and thus, the effectiveness of the verification, an algorithm for deriving… ▽ More Abstraction is a successful technique in software verification, and interpolation on infeasible error paths is a successful approach to automatically detect the right level of abstraction in counterexample-guided abstraction refinement. Because the interpolants have a significant influence on the quality of the abstraction, and thus, the effectiveness of the verification, an algorithm for deriving the best possible interpolants is desirable. We present an analysis-independent technique that makes it possible to extract several alternative sequences of interpolants from one given infeasible error path, if there are several reasons for infeasibility in the error path. We take as input the given infeasible error path and apply a slicing technique to obtain a set of error paths that are more abstract than the original error path but still infeasible, each for a different reason. The (more abstract) constraints of the new paths can be passed to a standard interpolation engine, in order to obtain a set of interpolant sequences, one for each new path. The analysis can then choose from this set of interpolant sequences and select the most appropriate, instead of being bound to the single interpolant sequence that the interpolation engine would normally return. For example, we can select based on domain types of variables in the interpolants, prefer to avoid loop counters, or compare with templates for potential loop invariants, and thus control what kind of information occurs in the abstraction of the program. We implemented the new algorithm in the open-source verification framework CPAchecker and show that our proof-technique-independent approach yields a significant improvement of the effectiveness and efficiency of the verification process. △ Less

Submitted 30 January, 2015; originally announced February 2015.

Comments: 10 pages, 5 figures, 1 table, 4 algorithms

Report number: MIP-1501

arXiv:1308.6647 [pdf, ps, other]

doi 10.1017/pasa.2013.30

Representing the "butterfly" projection in FITS - projection code XPH

Authors: Mark R. Calabretta, Stuart R. Lowe

Abstract: The "butterfly" projection is constructed as the polar layout of the HEALPix projection with (H,K) = (4,3). This short article formalises its representation in FITS. The "butterfly" projection is constructed as the polar layout of the HEALPix projection with (H,K) = (4,3). This short article formalises its representation in FITS. △ Less

Submitted 30 August, 2013; originally announced August 2013.

Comments: 2 pages, 1 figure. Accepted by Publications of the Astronomical Society of Australia (PASA) with open access

arXiv:1307.2029 [pdf, ps, other]

doi 10.1088/1748-0221/8/07/T07001

In-flight calibration and verification of the Planck-LFI instrument

Authors: Anna Gregorio, Francesco Cuttaia, Aniello Mennella, Marco Bersanelli, Michele Maris, Peter Meinhold, Maura Sandri, Luca Terenzi, Maurizio Tomasi, Fabrizio Villa, Marco Frailis, Gianluca Morgante, Dave Pearson, Andrea Zacchei, Paola Battaglia, Reginald Christophe Butler, Richard Davis, Cristian Franceschet, Enrico Franceschi, Samuele Galeotta, Rodrigo Leonardi, Steve Lowe, Nazzareno Mandolesi, Frederick Melot, Luis Mendes , et al. (18 additional authors not shown)

Abstract: In this paper we discuss the Planck-LFI in-flight calibration campaign. After a brief overview of the ground test campaigns, we describe in detail the calibration and performance verification (CPV) phase, carried out in space during and just after the cool-down of LFI. We discuss in detail the functionality verification, the tuning of the front-end and warm electronics, the preliminary performance… ▽ More In this paper we discuss the Planck-LFI in-flight calibration campaign. After a brief overview of the ground test campaigns, we describe in detail the calibration and performance verification (CPV) phase, carried out in space during and just after the cool-down of LFI. We discuss in detail the functionality verification, the tuning of the front-end and warm electronics, the preliminary performance assessment and the thermal susceptibility tests. The logic, sequence, goals and results of the in-flight tests are discussed. All the calibration activities were successfully carried out and the instrument response was comparable to the one observed on ground. For some channels the in-flight tuning activity allowed us to improve significantly the noise performance. △ Less

Submitted 8 July, 2013; originally announced July 2013.

Comments: Long technical paper on Planck LFI in flight calibration campaign: 109 pages in this (not final) version, 100 page in the final JINST version

MSC Class: 85-05 (primary) ACM Class: B.4.4; B.8.0; J.2

Journal ref: A Gregorio et al 2013 JINST 8 T07001

arXiv:1305.6915 [pdf, other]

Reusing Precisions for Efficient Regression Verification

Authors: Dirk Beyer, Stefan Löwe, Evgeny Novikov, Andreas Stahlbauer, Philipp Wendler

Abstract: Continuous testing during development is a well-established technique for software-quality assurance. Continuous model checking from revision to revision is not yet established as a standard practice, because the enormous resource consumption makes its application impractical. Model checkers compute a large number of verification facts that are necessary for verifying if a given specification hold… ▽ More Continuous testing during development is a well-established technique for software-quality assurance. Continuous model checking from revision to revision is not yet established as a standard practice, because the enormous resource consumption makes its application impractical. Model checkers compute a large number of verification facts that are necessary for verifying if a given specification holds. We have identified a category of such intermediate results that are easy to store and efficient to reuse: abstraction precisions. The precision of an abstract domain specifies the level of abstraction that the analysis works on. Precisions are thus a precious result of the verification effort and it is a waste of resources to throw them away after each verification run. In particular, precisions are small and thus easy to store; they are easy to process and have a large impact on resource consumption. We experimentally show the impact of precision reuse on industrial verification problems, namely, 59 device drivers with 1119 revisions from the Linux kernel. △ Less

Submitted 29 May, 2013; originally announced May 2013.

Comments: 14 pages, 2 figures, 6 tables

Report number: MIP-1302

arXiv:1303.5062 [pdf, other]

doi 10.1051/0004-6361/201321529

Planck 2013 results. I. Overview of products and scientific results

Authors: Planck Collaboration, P. A. R. Ade, N. Aghanim, M. I. R. Alves, C. Armitage-Caplan, M. Arnaud, M. Ashdown, F. Atrio-Barandela, J. Aumont, H. Aussel, C. Baccigalupi, A. J. Banday, R. B. Barreiro, R. Barrena, M. Bartelmann, J. G. Bartlett, N. Bartolo, S. Basak, E. Battaner, R. Battye, K. Benabed, A. Benoît, A. Benoit-Lévy, J. -P. Bernard, M. Bersanelli , et al. (376 additional authors not shown)

Abstract: The ESA's Planck satellite, dedicated to studying the early Universe and its subsequent evolution, was launched 14 May 2009 and has been scanning the microwave and submillimetre sky continuously since 12 August 2009. This paper gives an overview of the mission and its performance, the processing, analysis, and characteristics of the data, the scientific results, and the science data products and p… ▽ More The ESA's Planck satellite, dedicated to studying the early Universe and its subsequent evolution, was launched 14 May 2009 and has been scanning the microwave and submillimetre sky continuously since 12 August 2009. This paper gives an overview of the mission and its performance, the processing, analysis, and characteristics of the data, the scientific results, and the science data products and papers in the release. The science products include maps of the CMB and diffuse extragalactic foregrounds, a catalogue of compact Galactic and extragalactic sources, and a list of sources detected through the SZ effect. The likelihood code used to assess cosmological models against the Planck data and a lensing likelihood are described. Scientific results include robust support for the standard six-parameter LCDM model of cosmology and improved measurements of its parameters, including a highly significant deviation from scale invariance of the primordial power spectrum. The Planck values for these parameters and others derived from them are significantly different from those previously determined. Several large-scale anomalies in the temperature distribution of the CMB, first detected by WMAP, are confirmed with higher confidence. Planck sets new limits on the number and mass of neutrinos, and has measured gravitational lensing of CMB anisotropies at greater than 25 sigma. Planck finds no evidence for non-Gaussianity in the CMB. Planck's results agree well with results from the measurements of baryon acoustic oscillations. Planck finds a lower Hubble constant than found in some more local measures. Some tension is also present between the amplitude of matter fluctuations derived from CMB data and that derived from SZ data. The Planck and WMAP power spectra are offset from each other by an average level of about 2% around the first acoustic peak. △ Less

Submitted 5 June, 2014; v1 submitted 20 March, 2013; originally announced March 2013.

arXiv:1212.6542 [pdf, other]

Explicit-Value Analysis Based on CEGAR and Interpolation

Authors: Dirk Beyer, Stefan Löwe

Abstract: Abstraction, counterexample-guided refinement, and interpolation are techniques that are essential to the success of predicate-based program analysis. These techniques have not yet been applied together to explicit-value program analysis. We present an approach that integrates abstraction and interpolation-based refinement into an explicit-value analysis, i.e., a program analysis that tracks expli… ▽ More Abstraction, counterexample-guided refinement, and interpolation are techniques that are essential to the success of predicate-based program analysis. These techniques have not yet been applied together to explicit-value program analysis. We present an approach that integrates abstraction and interpolation-based refinement into an explicit-value analysis, i.e., a program analysis that tracks explicit values for a specified set of variables (the precision). The algorithm uses an abstract reachability graph as central data structure and a path-sensitive dynamic approach for precision adjustment. We evaluate our algorithm on the benchmark set of the Competition on Software Verification 2012 (SV-COMP'12) to show that our new approach is highly competitive. In addition, we show that combining our new approach with an auxiliary predicate analysis scores significantly higher than the SV-COMP'12 winner. △ Less

Submitted 28 December, 2012; originally announced December 2012.

Comments: 12 pages, 5 figures, 3 tables, 4 algorithms

Report number: MIP-1205

arXiv:1108.2307 [pdf, ps, other]

doi 10.1088/0004-637X/741/1/53

Statistical Studies of Giant Pulse Emission from the Crab Pulsar

Authors: Walid A. Majid, Charles J. Naudet, Stephen T. Lowe, Thomas B. H. Kuiper

Abstract: We have observed the Crab pulsar with the Deep Space Network (DSN) Goldstone 70 m antenna at 1664 MHz during three observing epochs for a total of 4 hours. Our data analysis has detected more than 2500 giant pulses, with flux densities ranging from 0.1 kJy to 150 kJy and pulse widths from 125 ns (limited by our bandwidth) to as long as 100 microseconds, with median power amplitudes and widths of 1… ▽ More We have observed the Crab pulsar with the Deep Space Network (DSN) Goldstone 70 m antenna at 1664 MHz during three observing epochs for a total of 4 hours. Our data analysis has detected more than 2500 giant pulses, with flux densities ranging from 0.1 kJy to 150 kJy and pulse widths from 125 ns (limited by our bandwidth) to as long as 100 microseconds, with median power amplitudes and widths of 1 kJy and 2 microseconds respectively. The most energetic pulses in our sample have energy fluxes of approximately 100 kJy-microsecond. We have used this large sample to investigate a number of giant-pulse emission properties in the Crab pulsar, including correlations among pulse flux density, width, energy flux, phase and time of arrival. We present a consistent accounting of the probability distributions and threshold cuts in order to reduce pulse-width biases. The excellent sensitivity obtained has allowed us to probe further into the population of giant pulses. We find that a significant portion, no less than 50%, of the overall pulsed energy flux at our observing frequency is emitted in the form of giant pulses. △ Less

Submitted 10 August, 2011; originally announced August 2011.

Comments: 19 pages, 17 figures; to be published in Astrophysical Journal

arXiv:1106.3766 [pdf, ps, other]

doi 10.1111/j.1365-2966.2011.19241.x

Sunyaev Zel'dovich observations of a statistically complete sample of galaxy clusters with OCRA-p

Authors: Katy Lancaster, Mark Birkinshaw, Marcin P. Gawronski, Richard Battye, Ian Browne, Richard Davis, Paul Giles, Roman Feiler, Andrzej Kus, Bartosz Lew, Stuart Lowe, Ben Maughan, Abdulaziz Mohammad, Bogna Pazderska, Eugeniusz Pazderski, Mike Peel, Boud Roukema, Peter Wilkinson

Abstract: We present 30 GHz Sunyaev Zel'dovich observations of a statistically complete sample of galaxy clusters with OCRA-p. The clusters are the 18 most X-ray luminous clusters at z > 0.2 in the ROSAT Brightest Cluster Sample. We correct for contaminant radio sources via supplementary observations with the Green Bank Telescope, also at 30 GHz, and remove a cluster that is contaminated by an unresolved X-… ▽ More We present 30 GHz Sunyaev Zel'dovich observations of a statistically complete sample of galaxy clusters with OCRA-p. The clusters are the 18 most X-ray luminous clusters at z > 0.2 in the ROSAT Brightest Cluster Sample. We correct for contaminant radio sources via supplementary observations with the Green Bank Telescope, also at 30 GHz, and remove a cluster that is contaminated by an unresolved X-ray source. All 17 remaining clusters have central SZ effects with Comptonisation parameter y_0 exceeding 1.9x10^-4, and 13 are detected at significance > 3 sigma. We use our data to examine scalings between y_0 and X-ray temperature, X-ray luminosity, and the X-ray mass proxy Y_X, and find good agreement with predictions from self-similar models of cluster formation, with an intrinsic scatter in y_0 of about 25%. We also comment on the success of the observations in the face of the contaminant source population, and the implications for upcoming cm-wave surveys. △ Less

Submitted 19 June, 2011; originally announced June 2011.

Comments: 14 pages, 2 figures, accepted by MNRAS

arXiv:1101.2038 [pdf, other]

doi 10.1051/0004-6361/201116480

Planck early results. III. First assessment of the Low Frequency Instrument in-flight performance

Authors: A. Mennella, M. Bersanelli, R. C. Butler, A. Curto, F. Cuttaia, R. J. Davis, J. Dick, M. Frailis, S. Galeotta, A. Gregorio, H. Kurki-Suonio, C. R. Lawrence, S. Leach, J. P. Leahy, S. Lowe, D. Maino, N. Mandolesi, M. Maris, E. Martínez-González, P. R. Meinhold, G. Morgante, D. Pearson, F. Perrotta, G. Polenta, T. Poutanen , et al. (136 additional authors not shown)

Abstract: The scientific performance of the Planck Low Frequency Instrument (LFI) after one year of in-orbit operation is presented. We describe the main optical parameters and discuss photometric calibration, white noise sensitivity, and noise properties. A preliminary evaluation of the impact of the main systematic effects is presented. For each of the performance parameters, we outline the methods used t… ▽ More The scientific performance of the Planck Low Frequency Instrument (LFI) after one year of in-orbit operation is presented. We describe the main optical parameters and discuss photometric calibration, white noise sensitivity, and noise properties. A preliminary evaluation of the impact of the main systematic effects is presented. For each of the performance parameters, we outline the methods used to obtain them from the flight data and provide a comparison with pre-launch ground assessments, which are essentially confirmed in flight. △ Less

Submitted 19 December, 2011; v1 submitted 11 January, 2011; originally announced January 2011.

Comments: Published version

Journal ref: A&A Vol 536, A3 (Dec 2011)

arXiv:1101.2022 [pdf, other]

doi 10.1051/0004-6361/201116464

Planck Early Results: The Planck mission

Authors: Planck Collaboration, P. A. R. Ade, N. Aghanim, M. Arnaud, M. Ashdown, J. Aumont, C. Baccigalupi, M. Baker, A. Balbi, A. J. Banday, R. B. Barreiro, J. G. Bartlett, E. Battaner, K. Benabed, K. Bennett, A. Benoît, J. -P. Bernard, M. Bersanelli, R. Bhatia, J. J. Bock, A. Bonaldi, J. R. Bond, J. Borrill, F. R. Bouchet, T. Bradshaw , et al. (250 additional authors not shown)

Abstract: The European Space Agency's Planck satellite was launched on 14 May 2009, and has been surveying the sky stably and continuously since 13 August 2009. Its performance is well in line with expectations, and it will continue to gather scientific data until the end of its cryogenic lifetime. We give an overview of the history of Planck in its first year of operations, and describe some of the key per… ▽ More The European Space Agency's Planck satellite was launched on 14 May 2009, and has been surveying the sky stably and continuously since 13 August 2009. Its performance is well in line with expectations, and it will continue to gather scientific data until the end of its cryogenic lifetime. We give an overview of the history of Planck in its first year of operations, and describe some of the key performance aspects of the satellite. This paper is part of a package submitted in conjunction with Planck's Early Release Compact Source Catalogue, the first data product based on Planck to be released publicly. The package describes the scientific performance of the Planck payload, and presents results on a variety of astrophysical topics related to the sources included in the Catalogue, as well as selected topics on diffuse emission. △ Less

Submitted 16 June, 2011; v1 submitted 11 January, 2011; originally announced January 2011.

Comments: This is part of a package of Planck papers labelled in their titles as "Planck Early Results". The whole package can also be downloaded from http://www.rssd.esa.int/Planck. This paper was accepted by Astronomy & Astrophysics on 31 May 2011

arXiv:1007.5242 [pdf, other]

doi 10.1111/j.1365-2966.2010.17640.x

One Centimetre Receiver Array-prototype observations of the CRATES sources at 30 GHz

Authors: M. W. Peel, M. P. Gawronski, R. A. Battye, M. Birkinshaw, I. W. A. Browne, R. J. Davis, R. Feiler, A. J. Kus, K. Lancaster, S. R. Lowe, B. M. Pazderska, E. Pazderski, B. F. Roukema, P. N. Wilkinson

Abstract: Knowledge of the population of radio sources in the range ~2-200 GHz is important for understanding their effects on measurements of the Cosmic Microwave Background power spectrum. We report measurements of the 30 GHz flux densities of 605 radio sources from the Combined Radio All-sky Targeted Eight-GHz Survey (CRATES), which have been made with the One Centimetre Receiver Array prototype (OCRA-p)… ▽ More Knowledge of the population of radio sources in the range ~2-200 GHz is important for understanding their effects on measurements of the Cosmic Microwave Background power spectrum. We report measurements of the 30 GHz flux densities of 605 radio sources from the Combined Radio All-sky Targeted Eight-GHz Survey (CRATES), which have been made with the One Centimetre Receiver Array prototype (OCRA-p) on the Torun 32-m telescope. The flux densities of sources that were also observed by WMAP and previous OCRA surveys are in broad agreement with those reported here, however a number of sources display intrinsic variability. We find a good correlation between the 30 GHz and Fermi gamma-ray flux densities for common sources. We examine the radio spectra of all observed sources and report a number of Gigahertz-peaked and inverted spectrum sources. These measurements will be useful for comparison to those from the Low Frequency Instrument of the Planck satellite, which will make some of its most sensitive observations in the region covered here. △ Less

Submitted 17 May, 2011; v1 submitted 29 July, 2010; originally announced July 2010.

Comments: 21 pages (9 pages of text, 12 pages of table), 7 figures. Erratum appended to end (page 20). Accepted by MNRAS. The definitive version is available at www.blackwell-synergy.com

Journal ref: Monthly Notices of the Royal Astronomical Society, 2011, Volume 410, Issue 4, pp. 2690-2697

arXiv:1005.2541 [pdf, other]

doi 10.1051/0004-6361/200912860

Planck pre-launch status: calibration of the Low Frequency Instrument flight model radiometers

Authors: F. Villa, L. Terenzi, M. Sandri, P. Meinhold, T. Poutanen, P. Battaglia, C. Franceschet, N. Hughes, M. Laaninen, P. Lapolla, M. Bersanelli, R. C. Butler, F. Cuttaia, O. D'Arcangelo, M. Frailis, E. Franceschi, S. Galeotta, A. Gregorio, R. Leonardi, S. R. Lowe, N. Mandolesi, M. Maris, L. Mendes, A. Mennella, G. Morgante , et al. (49 additional authors not shown)

Abstract: The Low Frequency Instrument (LFI) on-board the ESA Planck satellite carries eleven radiometer subsystems, called Radiometer Chain Assemblies (RCAs), each composed of a pair of pseudo-correlation receivers. We describe the on-ground calibration campaign performed to qualify the flight model RCAs and to measure their pre-launch performances. Each RCA was calibrated in a dedicated flight-like cryoge… ▽ More The Low Frequency Instrument (LFI) on-board the ESA Planck satellite carries eleven radiometer subsystems, called Radiometer Chain Assemblies (RCAs), each composed of a pair of pseudo-correlation receivers. We describe the on-ground calibration campaign performed to qualify the flight model RCAs and to measure their pre-launch performances. Each RCA was calibrated in a dedicated flight-like cryogenic environment with the radiometer front-end cooled to 20K and the back-end at 300K, and with an external input load cooled to 4K. A matched load simulating a blackbody at different temperatures was placed in front of the sky horn to derive basic radiometer properties such as noise temperature, gain, and noise performance, e.g. 1/f noise. The spectral response of each detector was measured as was their susceptibility to thermal variation. All eleven LFI RCAs were calibrated. Instrumental parameters measured in these tests, such as noise temperature, bandwidth, radiometer isolation, and linearity, provide essential inputs to the Planck-LFI data analysis. △ Less

Submitted 14 May, 2010; originally announced May 2010.

Comments: 15 pages, 18 figures. Accepted for publication in Astronomy and Astrophysics

arXiv:1001.4838 [pdf, other]

doi 10.1088/1748-0221/4/12/T12021

A systematic approach to the Planck LFI end-to-end test and its application to the DPC Level 1 pipeline

Authors: M. Frailis, M. Maris, A. Zacchei, N. Morisset, R. Rohlfs, M. Meharga, P. Binko, M. Turler, S. Galeotta, F. Gasparo, E. Franceschi, R. C. Butler, O. D'Arcangelo, S. Fogliani, A. Gregorio, S. R. Lowe, G. Maggio, M. Malaspina, N. Mandolesi, P. Manzato, F. Pasian, F. Perrotta, M. Sandri, L. Terenzi, M. Tomasi , et al. (1 additional authors not shown)

Abstract: The Level 1 of the Planck LFI Data Processing Centre (DPC) is devoted to the handling of the scientific and housekee** telemetry. It is a critical component of the Planck ground segment which has to strictly commit to the project schedule to be ready for the launch and flight operations. In order to guarantee the quality necessary to achieve the objectives of the Planck mission, the design and… ▽ More The Level 1 of the Planck LFI Data Processing Centre (DPC) is devoted to the handling of the scientific and housekee** telemetry. It is a critical component of the Planck ground segment which has to strictly commit to the project schedule to be ready for the launch and flight operations. In order to guarantee the quality necessary to achieve the objectives of the Planck mission, the design and development of the Level 1 software has followed the ESA Software Engineering Standards. A fundamental step in the software life cycle is the Verification and Validation of the software. The purpose of this work is to show an example of procedures, test development and analysis successfully applied to a key software project of an ESA mission. We present the end-to-end validation tests performed on the Level 1 of the LFI-DPC, by detailing the methods used and the results obtained. Different approaches have been used to test the scientific and housekee** data processing. Scientific data processing has been tested by injecting signals with known properties directly into the acquisition electronics, in order to generate a test dataset of real telemetry data and reproduce as much as possible nominal conditions. For the HK telemetry processing, validation software have been developed to inject known parameter values into a set of real housekee** packets and perform a comparison with the corresponding timelines generated by the Level 1. With the proposed validation and verification procedure, where the on-board and ground processing are viewed as a single pipeline, we demonstrated that the scientific and housekee** processing of the Planck-LFI raw data is correct and meets the project requirements. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: 20 pages, 7 figures; this paper is part of the Prelaunch status LFI papers published on JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/**st

Journal ref: M Frailis et al 2009 JINST 4 T12021

arXiv:1001.4778 [pdf, ps, other]

doi 10.1088/1748-0221/4/12/T12006

Planck-LFI: Design and Performance of the 4 Kelvin Reference Load Unit

Authors: Luca Valenziano, Francesco Cuttaia, Adriano De Rosa, Luca Terenzi, Alberto Brighenti, GianPaolo Cazzola, Anna Garbesi, Sergio Mariotti, Giordano Orsi, Luca Pagan, Francesco Cavaliere, Roberto Lapini, Matteo Biggi, Enzo Panagin, Battaglia Paola, Chris Butler, Marco Bersanelli, Ocleto D'Arcangelo, Steve Levin, Nazzareno Mandolesi, Aniello Mennella, Gianluca Morgante, Gabriele Morigi, Maura Sandri, Alessandro Simonetto , et al. (13 additional authors not shown)

Abstract: The LFI radiometers use a pseudo-correlation design where the signal from the sky is continuously compared with a stable reference signal, provided by a cryogenic reference load system. The reference unit is composed by small pyramidal horns, one for each radiometer, 22 in total, facing small absorbing targets, made of a commercial resin ECCOSORB CR (TM), cooled to approximately 4.5 K. Horns and… ▽ More The LFI radiometers use a pseudo-correlation design where the signal from the sky is continuously compared with a stable reference signal, provided by a cryogenic reference load system. The reference unit is composed by small pyramidal horns, one for each radiometer, 22 in total, facing small absorbing targets, made of a commercial resin ECCOSORB CR (TM), cooled to approximately 4.5 K. Horns and targets are separated by a small gap to allow thermal decoupling. Target and horn design is optimized for each of the LFI bands, centered at 70, 44 and 30 GHz. Pyramidal horns are either machined inside the radiometer 20K module or connected via external electro-formed bended waveguides. The requirement of high stability of the reference signal imposed a careful design for the radiometric and thermal properties of the loads. Materials used for the manufacturing have been characterized for thermal, RF and mechanical properties. We describe in this paper the design and the performance of the reference system. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: This is an author-created, un-copyedited version of an article accepted for publication in JINST. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The definitive publisher authenticated version is available online at [10.1088/1748-0221/4/12/T12006]. 14 pages, 34 figures

Journal ref: 2009 JINST 4 T12006

arXiv:1001.4743 [pdf, ps, other]

doi 10.1088/1748-0221/4/12/T12002

Design, development and verification of the 30 and 44 GHz front-end modules for the Planck Low Frequency Instrument

Authors: R. J. Davis, A. Wilkinson, R. D. Davies, W. F. Winder, N. Roddis, E. J. Blackhurst, D. Lawson, S. R. Lowe, C. Baines, M. Butlin, A. Galtress, D. Shepherd, B. Aja, E. Artal, M. Bersanelli, R. C. Butler, C. Castelli, F. Cuttaia, O. D'Arcangelo, T. Gaier, R. Hoyland, D. Kettle, R. Leonardi, N. Mandolesi, A. Mennella , et al. (6 additional authors not shown)

Abstract: We give a description of the design, construction and testing of the 30 and 44 GHz Front End Modules (FEMs) for the Low Frequency Instrument (LFI) of the Planck mission to be launched in 2009. The scientific requirements of the mission determine the performance parameters to be met by the FEMs, including their linear polarization characteristics. The FEM design is that of a differential pseudo… ▽ More We give a description of the design, construction and testing of the 30 and 44 GHz Front End Modules (FEMs) for the Low Frequency Instrument (LFI) of the Planck mission to be launched in 2009. The scientific requirements of the mission determine the performance parameters to be met by the FEMs, including their linear polarization characteristics. The FEM design is that of a differential pseudo-correlation radiometer in which the signal from the sky is compared with a 4-K blackbody load. The Low Noise Amplifier (LNA) at the heart of the FEM is based on indium phosphide High Electron Mobility Transistors (HEMTs). The radiometer incorporates a novel phase-switch design which gives excellent amplitude and phase match across the band. The noise temperature requirements are met within the measurement errors at the two frequencies. For the most sensitive LNAs, the noise temperature at the band centre is 3 and 5 times the quantum limit at 30 and 44 GHz respectively. For some of the FEMs, the noise temperature is still falling as the ambient temperature is reduced to 20 K. Stability tests of the FEMs, including a measurement of the 1/f knee frequency, also meet mission requirements. The 30 and 44 GHz FEMs have met or bettered the mission requirements in all critical aspects. The most sensitive LNAs have reached new limits of noise temperature for HEMTs at their band centres. The FEMs have well-defined linear polarization characteristcs. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: 39 pages, 33 figures (33 EPS files), 12 tables. Planck LFI technical papers published by JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/1748-0221

Journal ref: R J Davis et al 2009 JINST 4 T12002

arXiv:1001.4730 [pdf, ps, other]

doi 10.1088/1748-0221/4/12/T12019

Level 1 on-ground telemetry handling in Planck LFI

Authors: A. Zacchei, M. Frailis, M. Maris, N. Morisset, R. Rohlfs, M. Meharga, P. Binko, M. Turler, S. Galeotta, F. Gasparo, E. Franceschi, R. C. Butler, F. Cuttaia, O. D'Arcangelo, S. Fogliani, A. Gregorio, R. Leonardi, S. R. Lowe, D. Maino, G. Maggio, M. Malaspina, N. Mandolesi, P. Manzato, P. Meinhold, L. Mendes , et al. (9 additional authors not shown)

Abstract: The Planck Low Frequency Instrument (LFI) will observe the Cosmic Microwave Background (CMB) by covering the frequency range 30-70 GHz in three bands. The primary instrument data source are the temperature samples acquired by the 22 radiometers mounted on the Planck focal plane. Such samples represent the scientific data of LFI. In addition, the LFI instrument generates the so called housekeepin… ▽ More The Planck Low Frequency Instrument (LFI) will observe the Cosmic Microwave Background (CMB) by covering the frequency range 30-70 GHz in three bands. The primary instrument data source are the temperature samples acquired by the 22 radiometers mounted on the Planck focal plane. Such samples represent the scientific data of LFI. In addition, the LFI instrument generates the so called housekee** data by sampling regularly the on-board sensors and registers. The housekee** data provides information on the overall health status of the instrument and on the scientific data quality. The scientific and housekee** data are collected on-board into telemetry packets compliant with the ESA Packet Telemetry standards. They represent the primary input to the first processing level of the LFI Data Processing Centre. In this work we show the software systems which build the LFI Level 1. A real-time assessment system, based on the ESA SCOS 2000 generic mission control system, has the main purpose of monitoring the housekee** parameters of LFI and detect possible anomalies. A telemetry handler system processes the housekee** and scientific telemetry of LFI, generating timelines for each acquisition chain and each housekee** parameter. Such timelines represent the main input to the subsequent processing levels of the LFI DPC. A telemetry quick-look system allows the real-time visualization of the LFI scientific and housekee** data, by also calculating quick statistical functions and fast Fourier transforms. The LFI Level 1 has been designed to support all the mission phases, from the instrument ground tests and calibration to the flight operations, and developed according to the ESA engineering standards. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: This paper is part of the Prelaunch status LFI papers published on JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/**st

Journal ref: 2009 JINST 4 T12019

arXiv:1001.4648 [pdf, ps, other]

doi 10.1088/1748-0221/4/12/T12013

Planck-LFI radiometers tuning

Authors: Francesco Cuttaia, Aniello Mennella, Luca Stringhetti, Michele Maris, Luca Terenzi, Maurizio Tomasi, Fabrizio Villa, Marco Bersanelli, Christopher Reginald Butler, Benedetta Cappellini, Leticia Perez Cuevas, Ocleto D'Arcangelo, Richard Davis, Marco Frailis, Cristian Franceschet, Enrico Franceschi, Anna Gregorio, Roger Hoyland, Rodrigo Leonardi, Stuart Lowe, Nazzareno Mandolesi, Peter Meinhold, Luis Mendes, Neil Roddis, Maura Sandri , et al. (11 additional authors not shown)

Abstract: "This paper is part of the Prelaunch status LFI papers published on JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/**st" This paper describes the Planck Low Frequency Instrument tuning activities performed through the ground test campaigns, from Unit to Satellite Levels. Tuning is key to achieve the best possible instrument performance and tuning parameters strongly depend on thermal… ▽ More "This paper is part of the Prelaunch status LFI papers published on JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/**st" This paper describes the Planck Low Frequency Instrument tuning activities performed through the ground test campaigns, from Unit to Satellite Levels. Tuning is key to achieve the best possible instrument performance and tuning parameters strongly depend on thermal and electrical conditions. For this reason tuning has been repeated several times during ground tests and it has been repeated in flight before starting nominal operations. The paper discusses the tuning philosophy, the activities and the obtained results, highlighting developments and changes occurred during test campaigns. The paper concludes with an overview of tuning performed during the satellite cryogenic test campaign (Summer 2008) and of the plans for the just started in-flight calibration. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: This is an author-created, un-copyedited version of an article accepted for publication in JINST. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The definitive publisher authenticated version is available online at http://dx.doi.org/10.1088/1748-0221/4/12/T12013]

Journal ref: Journal of Instrumentation, Volume 4, Issue 12, pp. T12013 (2009)

arXiv:1001.4642 [pdf, other]

doi 10.1088/1748-0221/4/12/T12020

Off-line radiometric analysis of Planck/LFI data

Authors: M. Tomasi, A. Mennella, S. Galeotta, S. R. Lowe, L. Mendes, R. Leonardi, F. Villa, B. Cappellini, A. Gregorio, P. Meinhold, M. Sandri, F. Cuttaia, L. Terenzi, M. Maris, L. Valenziano, M. J. Salmon, M. Bersanelli, P. Binko, R. C. Butler, O. D'Arcangelo, S. Fogliani, M. Frailis, E. Franceschi, F. Gasparo, G. Maggio , et al. (13 additional authors not shown)

Abstract: The Planck Low Frequency Instrument (LFI) is an array of 22 pseudo-correlation radiometers on-board the Planck satellite to measure temperature and polarization anisotropies in the Cosmic Microwave Background (CMB) in three frequency bands (30, 44 and 70 GHz). To calibrate and verify the performances of the LFI, a software suite named LIFE has been developed. Its aims are to provide a common pla… ▽ More The Planck Low Frequency Instrument (LFI) is an array of 22 pseudo-correlation radiometers on-board the Planck satellite to measure temperature and polarization anisotropies in the Cosmic Microwave Background (CMB) in three frequency bands (30, 44 and 70 GHz). To calibrate and verify the performances of the LFI, a software suite named LIFE has been developed. Its aims are to provide a common platform to use for analyzing the results of the tests performed on the single components of the instrument (RCAs, Radiometric Chain Assemblies) and on the integrated Radiometric Array Assembly (RAA). Moreover, its analysis tools are designed to be used during the flight as well to produce periodic reports on the status of the instrument. The LIFE suite has been developed using a multi-layered, cross-platform approach. It implements a number of analysis modules written in RSI IDL, each accessing the data through a portable and heavily optimized library of functions written in C and C++. One of the most important features of LIFE is its ability to run the same data analysis codes both using ground test data and real flight data as input. The LIFE software suite has been successfully used during the RCA/RAA tests and the Planck Integrated System Tests. Moreover, the software has also passed the verification for its in-flight use during the System Operations Verification Tests, held in October 2008. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: Planck LFI technical papers published by JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/1748-0221

Journal ref: 2009 JINST 4 T12020

arXiv:1001.4610 [pdf, ps, other]

doi 10.1088/1748-0221/4/12/T12011

The linearity response of the Planck-LFI flight model receivers

Authors: A. Mennella, F. Villa, L. Terenzi, F. Cuttaia, P. Battaglia, M. Bersanelli, R. C. Butler, O. D'Arcangelo, E. Artal, R. Davis, M. Frailis, C. Franceschet, S. Galeotta, A. Gregorio, N. Hughes, P. Jukkala, D. Kettle, V. -H. Kilpiä, M. Laaninen, P. M. Lapolla, R. Leonardi, P. Leutenegger, S. Lowe, N. Mandolesi, M. Maris , et al. (15 additional authors not shown)

Abstract: In this paper we discuss the linearity response of the Planck-LFI receivers, with particular reference to signal compression measured on the 30 and 44 GHz channels. In the article we discuss the various sources of compression and present a model that accurately describes data measured during tests performed with individual radiomeric chains. After discussing test results we present the best para… ▽ More In this paper we discuss the linearity response of the Planck-LFI receivers, with particular reference to signal compression measured on the 30 and 44 GHz channels. In the article we discuss the various sources of compression and present a model that accurately describes data measured during tests performed with individual radiomeric chains. After discussing test results we present the best parameter set representing the receiver response and discuss the impact of non linearity on in-flight calibration, which is shown to be negligible. △ Less

Submitted 26 January, 2010; originally announced January 2010.

Comments: this paper is part of the Prelaunch status LFI papers published on JINST: http://www.iop.org/EJ/journal/-page=extra.proc5/**st; This is an author-created, un-copyedited version of an article accepted for publication in JINST. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The definitive publisher authenticated version is available online at 10.1088/1748-0221/4/12/T12011.

Journal ref: 2009 JINST 4 T12011

Showing 1–50 of 60 results for author: Löwe, S