-
Dense-Resolution Network for Point Cloud Classification and Segmentation
Authors:
Shi Qiu,
Saeed Anwar,
Nick Barnes
Abstract:
Point cloud analysis is attracting attention from Artificial Intelligence research since it can be widely used in applications such as robotics, Augmented Reality, self-driving. However, it is always challenging due to irregularities, unorderedness, and sparsity. In this article, we propose a novel network named Dense-Resolution Network (DRNet) for point cloud analysis. Our DRNet is designed to le…
▽ More
Point cloud analysis is attracting attention from Artificial Intelligence research since it can be widely used in applications such as robotics, Augmented Reality, self-driving. However, it is always challenging due to irregularities, unorderedness, and sparsity. In this article, we propose a novel network named Dense-Resolution Network (DRNet) for point cloud analysis. Our DRNet is designed to learn local point features from the point cloud in different resolutions. In order to learn local point groups more effectively, we present a novel grou** method for local neighborhood searching and an error-minimizing module for capturing local features. In addition to validating the network on widely used point cloud segmentation and classification benchmarks, we also test and visualize the performance of the components. Comparing with other state-of-the-art methods, our network shows superiority on ModelNet40, ShapeNet synthetic and ScanObjectNN real point cloud datasets.
△ Less
Submitted 17 November, 2020; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Attention Based Real Image Restoration
Authors:
Saeed Anwar,
Nick Barnes,
Lars Petersson
Abstract:
Deep convolutional neural networks perform better on images containing spatially invariant degradations, also known as synthetic degradations; however, their performance is limited on real-degraded photographs and requires multiple-stage network modeling. To advance the practicability of restoration algorithms, this paper proposes a novel single-stage blind real image restoration network (R$^2$Net…
▽ More
Deep convolutional neural networks perform better on images containing spatially invariant degradations, also known as synthetic degradations; however, their performance is limited on real-degraded photographs and requires multiple-stage network modeling. To advance the practicability of restoration algorithms, this paper proposes a novel single-stage blind real image restoration network (R$^2$Net) by employing a modular architecture. We use a residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies. Furthermore, the evaluation in terms of quantitative metrics and visual quality for four restoration tasks i.e. Denoising, Super-resolution, Raindrop Removal, and JPEG Compression on 11 real degraded datasets against more than 30 state-of-the-art algorithms demonstrate the superiority of our R$^2$Net. We also present the comparison on three synthetically generated degraded datasets for denoising to showcase the capability of our method on synthetics denoising. The codes, trained models, and results are available on https://github.com/saeed-anwar/R2Net.
△ Less
Submitted 1 October, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders
Authors:
**g Zhang,
Deng-** Fan,
Yuchao Dai,
Saeed Anwar,
Fatemeh Sadat Saleh,
Tong Zhang,
Nick Barnes
Abstract:
In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection methods treat the saliency detection task as a point estimation problem, and produce a single saliency map following a deterministic learning pipeline. Inspired by the saliency data labeling process, we propose probab…
▽ More
In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection methods treat the saliency detection task as a point estimation problem, and produce a single saliency map following a deterministic learning pipeline. Inspired by the saliency data labeling process, we propose probabilistic RGB-D saliency detection network via conditional variational autoencoders to model human annotation uncertainty and generate multiple saliency maps for each input image by sampling in the latent space. With the proposed saliency consensus process, we are able to generate an accurate saliency map based on these multiple predictions. Quantitative and qualitative evaluations on six challenging benchmark datasets against 18 competing algorithms demonstrate the effectiveness of our approach in learning the distribution of saliency maps, leading to a new state-of-the-art in RGB-D saliency detection.
△ Less
Submitted 13 April, 2020;
originally announced April 2020.
-
A Systematic Evaluation: Fine-Grained CNN vs. Traditional CNN Classifiers
Authors:
Saeed Anwar,
Nick Barnes,
Lars Petersson
Abstract:
To make the best use of the underlying minute and subtle differences, fine-grained classifiers collect information about inter-class variations. The task is very challenging due to the small differences between the colors, viewpoint, and structure in the same class entities. The classification becomes more difficult due to the similarities between the differences in viewpoint with other classes an…
▽ More
To make the best use of the underlying minute and subtle differences, fine-grained classifiers collect information about inter-class variations. The task is very challenging due to the small differences between the colors, viewpoint, and structure in the same class entities. The classification becomes more difficult due to the similarities between the differences in viewpoint with other classes and differences with its own. In this work, we investigate the performance of the landmark general CNN classifiers, which presented top-notch results on large scale classification datasets, on the fine-grained datasets, and compare it against state-of-the-art fine-grained classifiers. In this paper, we pose two specific questions: (i) Do the general CNN classifiers achieve comparable results to fine-grained classifiers? (ii) Do general CNN classifiers require any specific information to improve upon the fine-grained ones? Throughout this work, we train the general CNN classifiers without introducing any aspect that is specific to fine-grained datasets. We show an extensive evaluation on six datasets to determine whether the fine-grained classifier is able to elevate the baseline in their experiments.
△ Less
Submitted 2 November, 2021; v1 submitted 24 March, 2020;
originally announced March 2020.
-
Reducing the Sim-to-Real Gap for Event Cameras
Authors:
Timo Stoffregen,
Cedric Scheerlinck,
Davide Scaramuzza,
Tom Drummond,
Nick Barnes,
Lindsay Kleeman,
Robert Mahony
Abstract:
Event cameras are paradigm-shifting novel sensors that report asynchronous, per-pixel brightness changes called 'events' with unparalleled low latency. This makes them ideal for high speed, high dynamic range scenes where conventional cameras would fail. Recent work has demonstrated impressive results using Convolutional Neural Networks (CNNs) for video reconstruction and optic flow with events. W…
▽ More
Event cameras are paradigm-shifting novel sensors that report asynchronous, per-pixel brightness changes called 'events' with unparalleled low latency. This makes them ideal for high speed, high dynamic range scenes where conventional cameras would fail. Recent work has demonstrated impressive results using Convolutional Neural Networks (CNNs) for video reconstruction and optic flow with events. We present strategies for improving training data for event based CNNs that result in 20-40% boost in performance of existing state-of-the-art (SOTA) video reconstruction networks retrained with our method, and up to 15% for optic flow networks. A challenge in evaluating event based video reconstruction is lack of quality ground truth images in existing datasets. To address this, we present a new High Quality Frames (HQF) dataset, containing events and ground truth frames from a DAVIS240C that are well-exposed and minimally motion-blurred. We evaluate our method on HQF + several existing major event camera datasets.
△ Less
Submitted 22 August, 2020; v1 submitted 19 March, 2020;
originally announced March 2020.
-
Any-Shot Object Detection
Authors:
Shafin Rahman,
Salman Khan,
Nick Barnes,
Fahad Shahbaz Khan
Abstract:
Previous work on novel object detection considers zero or few-shot settings where none or few examples of each category are available for training. In real world scenarios, it is less practical to expect that 'all' the novel classes are either unseen or {have} few-examples. Here, we propose a more realistic setting termed 'Any-shot detection', where totally unseen and few-shot categories can simul…
▽ More
Previous work on novel object detection considers zero or few-shot settings where none or few examples of each category are available for training. In real world scenarios, it is less practical to expect that 'all' the novel classes are either unseen or {have} few-examples. Here, we propose a more realistic setting termed 'Any-shot detection', where totally unseen and few-shot categories can simultaneously co-occur during inference. Any-shot detection offers unique challenges compared to conventional novel object detection such as, a high imbalance between unseen, few-shot and seen object classes, susceptibility to forget base-training while learning novel classes and distinguishing novel classes from the background. To address these challenges, we propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes. Our core idea is to use class semantics as prototypes for object detection, a formulation that naturally minimizes knowledge forgetting and mitigates the class-imbalance in the label space. Besides, we propose a rebalanced loss function that emphasizes difficult few-shot cases but avoids overfitting on the novel classes to allow detection of totally unseen classes. Without bells and whistles, our framework can also be used solely for Zero-shot detection and Few-shot detection tasks. We report extensive experiments on Pascal VOC and MS-COCO datasets where our approach is shown to provide significant improvements.
△ Less
Submitted 15 March, 2020;
originally announced March 2020.
-
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
Authors:
Moshiur R. Farazi,
Salman H. Khan,
Nick Barnes
Abstract:
Visual Question Answering (VQA) has emerged as a Visual Turing Test to validate the reasoning ability of AI agents. The pivot to existing VQA models is the joint embedding that is learned by combining the visual features from an image and the semantic features from a given question. Consequently, a large body of literature has focused on develo** complex joint embedding strategies coupled with v…
▽ More
Visual Question Answering (VQA) has emerged as a Visual Turing Test to validate the reasoning ability of AI agents. The pivot to existing VQA models is the joint embedding that is learned by combining the visual features from an image and the semantic features from a given question. Consequently, a large body of literature has focused on develo** complex joint embedding strategies coupled with visual attention mechanisms to effectively capture the interplay between these two modalities. However, modelling the visual and semantic features in a high dimensional (joint embedding) space is computationally expensive, and more complex models often result in trivial improvements in the VQA accuracy. In this work, we systematically study the trade-off between the model complexity and the performance on the VQA task. VQA models have a diverse architecture comprising of pre-processing, feature extraction, multimodal fusion, attention and final classification stages. We specifically focus on the effect of "multi-modal fusion" in VQA models that is typically the most expensive step in a VQA pipeline. Our thorough experimental evaluation leads us to two proposals, one optimized for minimal complexity and the other one optimized for state-of-the-art VQA performance.
△ Less
Submitted 20 January, 2020;
originally announced January 2020.
-
Spectral-GANs for High-Resolution 3D Point-cloud Generation
Authors:
Sameera Ramasinghe,
Salman Khan,
Nick Barnes,
Stephen Gould
Abstract:
Point-clouds are a popular choice for vision and graphics tasks due to their accurate shape description and direct acquisition from range-scanners. This demands the ability to synthesize and reconstruct high-quality point-clouds. Current deep generative models for 3D data generally work on simplified representations (e.g., voxelized objects) and cannot deal with the inherent redundancy and irregul…
▽ More
Point-clouds are a popular choice for vision and graphics tasks due to their accurate shape description and direct acquisition from range-scanners. This demands the ability to synthesize and reconstruct high-quality point-clouds. Current deep generative models for 3D data generally work on simplified representations (e.g., voxelized objects) and cannot deal with the inherent redundancy and irregularity in point-clouds. A few recent efforts on 3D point-cloud generation offer limited resolution and their complexity grows with the increase in output resolution. In this paper, we develop a principled approach to synthesize 3D point-clouds using a spectral-domain Generative Adversarial Network (GAN). Our spectral representation is highly structured and allows us to disentangle various frequency bands such that the learning task is simplified for a GAN model. As compared to spatial-domain generative approaches, our formulation allows us to generate arbitrary number of points high-resolution point-clouds with minimal computational overhead. Furthermore, we propose a fully differentiable block to transform from {the} spectral to the spatial domain and back, thereby allowing us to integrate knowledge from well-established spatial models. We demonstrate that Spectral-GAN performs well for point-cloud generation task. Additionally, it can learn {a} highly discriminative representation in an unsupervised fashion and can be used to accurately reconstruct 3D objects.
△ Less
Submitted 19 July, 2020; v1 submitted 4 December, 2019;
originally announced December 2019.
-
Representation Learning on Unit Ball with 3D Roto-Translational Equivariance
Authors:
Sameera Ramasinghe,
Salman Khan,
Nick Barnes,
Stephen Gould
Abstract:
Convolution is an integral operation that defines how the shape of one function is modified by another function. This powerful concept forms the basis of hierarchical feature learning in deep neural networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball (…
▽ More
Convolution is an integral operation that defines how the shape of one function is modified by another function. This powerful concept forms the basis of hierarchical feature learning in deep neural networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball ($\mathbb{B}^3$)---entails unique challenges. In this work, we propose a novel `\emph{volumetric convolution}' operation that can effectively model and convolve arbitrary functions in $\mathbb{B}^3$. We develop a theoretical framework for \emph{volumetric convolution} based on Zernike polynomials and efficiently implement it as a differentiable and an easily pluggable layer in deep networks. By construction, our formulation leads to the derivation of a novel formula to measure the symmetry of a function in $\mathbb{B}^3$ around an arbitrary axis, that is useful in function analysis tasks. We demonstrate the efficacy of proposed volumetric convolution operation on one viable use case i.e., 3D object recognition.
△ Less
Submitted 29 November, 2019;
originally announced December 2019.
-
Geometric Back-projection Network for Point Cloud Classification
Authors:
Shi Qiu,
Saeed Anwar,
Nick Barnes
Abstract:
As the basic task of point cloud analysis, classification is fundamental but always challenging. To address some unsolved problems of existing methods, we propose a network that captures geometric features of point clouds for better representations. To achieve this, on the one hand, we enrich the geometric information of points in low-level 3D space explicitly. On the other hand, we apply CNN-base…
▽ More
As the basic task of point cloud analysis, classification is fundamental but always challenging. To address some unsolved problems of existing methods, we propose a network that captures geometric features of point clouds for better representations. To achieve this, on the one hand, we enrich the geometric information of points in low-level 3D space explicitly. On the other hand, we apply CNN-based structures in high-level feature spaces to learn local geometric context implicitly. Specifically, we leverage an idea of error-correcting feedback structure to capture the local features of point clouds comprehensively. Furthermore, an attention module based on channel affinity assists the feature map to avoid possible redundancy by emphasizing its distinct channels. The performance on both synthetic and real-world point clouds datasets demonstrate the superiority and applicability of our network. Comparing with other state-of-the-art methods, our approach balances accuracy and efficiency.
△ Less
Submitted 13 April, 2021; v1 submitted 28 November, 2019;
originally announced November 2019.
-
Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes
Authors:
Sameera Ramasinghe,
Salman Khan,
Nick Barnes,
Stephen Gould
Abstract:
Existing networks directly learn feature representations on 3D point clouds for shape analysis. We argue that 3D point clouds are highly redundant and hold irregular (permutation-invariant) structure, which makes it difficult to achieve inter-class discrimination efficiently. In this paper, we propose a two-faceted solution to this problem that is seamlessly integrated in a single `Blended Convolu…
▽ More
Existing networks directly learn feature representations on 3D point clouds for shape analysis. We argue that 3D point clouds are highly redundant and hold irregular (permutation-invariant) structure, which makes it difficult to achieve inter-class discrimination efficiently. In this paper, we propose a two-faceted solution to this problem that is seamlessly integrated in a single `Blended Convolution and Synthesis' layer. This fully differentiable layer performs two critical tasks in succession. In the first step, it projects the input 3D point clouds into a latent 3D space to synthesize a highly compact and more inter-class discriminative point cloud representation. Since, 3D point clouds do not follow a Euclidean topology, standard 2/3D Convolutional Neural Networks offer limited representation capability. Therefore, in the second step, it uses a novel 3D convolution operator functioning inside the unit ball ($\mathbb{B}^3$) to extract useful volumetric features. We extensively derive formulae to achieve both translation and rotation of our novel convolution kernels. Finally, using the proposed techniques we present an extremely light-weight, end-to-end architecture that achieves compelling results on 3D shape recognition and retrieval.
△ Less
Submitted 19 July, 2020; v1 submitted 24 August, 2019;
originally announced August 2019.
-
Question-Agnostic Attention for Visual Question Answering
Authors:
Moshiur R Farazi,
Salman H Khan,
Nick Barnes
Abstract:
Visual Question Answering (VQA) models employ attention mechanisms to discover image locations that are most relevant for answering a specific question. For this purpose, several multimodal fusion strategies have been proposed, ranging from relatively simple operations (e.g., linear sum) to more complex ones (e.g., Block). The resulting multimodal representations define an intermediate feature spa…
▽ More
Visual Question Answering (VQA) models employ attention mechanisms to discover image locations that are most relevant for answering a specific question. For this purpose, several multimodal fusion strategies have been proposed, ranging from relatively simple operations (e.g., linear sum) to more complex ones (e.g., Block). The resulting multimodal representations define an intermediate feature space for capturing the interplay between visual and semantic features, that is helpful in selectively focusing on image content. In this paper, we propose a question-agnostic attention mechanism that is complementary to the existing question-dependent attention mechanisms. Our proposed model parses object instances to obtain an `object map' and applies this map on the visual features to generate Question-Agnostic Attention (QAA) features. In contrast to question-dependent attention approaches that are learned end-to-end, the proposed QAA does not involve question-specific training, and can be easily included in almost any existing VQA model as a generic light-weight pre-processing step, thereby adding minimal computation overhead for training. Further, when used in complement with the question-dependent attention, the QAA allows the model to focus on the regions containing objects that might have been overlooked by the learned attention representation. Through extensive evaluation on VQAv1, VQAv2 and TDIUC datasets, we show that incorporating complementary QAA allows state-of-the-art VQA models to perform better, and provides significant boost to simplistic VQA models, enabling them to performance on par with highly sophisticated fusion strategies.
△ Less
Submitted 5 September, 2020; v1 submitted 8 August, 2019;
originally announced August 2019.
-
Densely Residual Laplacian Super-Resolution
Authors:
Saeed Anwar,
Nick Barnes
Abstract:
Super-Resolution convolutional neural networks have recently demonstrated high-quality restoration for single images. However, existing algorithms often require very deep architectures and long training times. Furthermore, current convolutional neural networks for super-resolution are unable to exploit features at multiple scales and weigh them equally, limiting their learning capability. In this…
▽ More
Super-Resolution convolutional neural networks have recently demonstrated high-quality restoration for single images. However, existing algorithms often require very deep architectures and long training times. Furthermore, current convolutional neural networks for super-resolution are unable to exploit features at multiple scales and weigh them equally, limiting their learning capability. In this exposition, we present a compact and accurate super-resolution algorithm namely, Densely Residual Laplacian Network (DRLN). The proposed network employs cascading residual on the residual structure to allow the flow of low-frequency information to focus on learning high and mid-level features. In addition, deep supervision is achieved via the densely concatenated residual blocks settings, which also helps in learning from high-level complex features. Moreover, we propose Laplacian attention to model the crucial features to learn the inter and intra-level dependencies between the feature maps. Furthermore, comprehensive quantitative and qualitative evaluations on low-resolution, noisy low-resolution, and real historical image benchmark datasets illustrate that our DRLN algorithm performs favorably against the state-of-the-art methods visually and accurately.
△ Less
Submitted 1 July, 2019; v1 submitted 27 June, 2019;
originally announced June 2019.
-
Unsupervised Primitive Discovery for Improved 3D Generative Modeling
Authors:
Salman H. Khan,
Yulan Guo,
Munawar Hayat,
Nick Barnes
Abstract:
3D shape generation is a challenging problem due to the high-dimensional output space and complex part configurations of real-world objects. As a result, existing algorithms experience difficulties in accurate generative modeling of 3D shapes. Here, we propose a novel factorized generative model for 3D shape generation that sequentially transitions from coarse to fine scale shape generation. To th…
▽ More
3D shape generation is a challenging problem due to the high-dimensional output space and complex part configurations of real-world objects. As a result, existing algorithms experience difficulties in accurate generative modeling of 3D shapes. Here, we propose a novel factorized generative model for 3D shape generation that sequentially transitions from coarse to fine scale shape generation. To this end, we introduce an unsupervised primitive discovery algorithm based on a higher-order conditional random field model. Using the primitive parts for shapes as attributes, a parameterized 3D representation is modeled in the first stage. This representation is further refined in the next stage by adding fine scale details to shape. Our results demonstrate improved representation ability of the generative model and better quality samples of newly generated 3D shapes. Further, our primitive generation approach can accurately parse common objects into a simplified representation.
△ Less
Submitted 9 June, 2019;
originally announced June 2019.
-
CED: Color Event Camera Dataset
Authors:
Cedric Scheerlinck,
Henri Rebecq,
Timo Stoffregen,
Nick Barnes,
Robert Mahony,
Davide Scaramuzza
Abstract:
Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'. Event cameras offer advantages over conventional frame-based cameras in terms of latency, high dynamic range (HDR) and temporal resolution. Until recently, event cameras have been limited to outputting events in the intensity channel…
▽ More
Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'. Event cameras offer advantages over conventional frame-based cameras in terms of latency, high dynamic range (HDR) and temporal resolution. Until recently, event cameras have been limited to outputting events in the intensity channel, however, recent advances have resulted in the development of color event cameras, such as the Color-DAVIS346. In this work, we present and release the first Color Event Camera Dataset (CED), containing 50 minutes of footage with both color frames and events. CED features a wide variety of indoor and outdoor scenes, which we hope will help drive forward event-based vision research. We also present an extension of the event camera simulator ESIM that enables simulation of color events. Finally, we present an evaluation of three state-of-the-art image reconstruction methods that can be used to convert the Color-DAVIS346 into a continuous-time, HDR, color video camera to visualise the event stream, and for use in downstream vision applications.
△ Less
Submitted 24 April, 2019;
originally announced April 2019.
-
A Deep Journey into Super-resolution: A survey
Authors:
Saeed Anwar,
Salman Khan,
Nick Barnes
Abstract:
Deep convolutional networks based super-resolution is a fast-growing field with numerous practical applications. In this exposition, we extensively compare 30+ state-of-the-art super-resolution Convolutional Neural Networks (CNNs) over three classical and three recently introduced challenging datasets to benchmark single image super-resolution. We introduce a taxonomy for deep-learning based super…
▽ More
Deep convolutional networks based super-resolution is a fast-growing field with numerous practical applications. In this exposition, we extensively compare 30+ state-of-the-art super-resolution Convolutional Neural Networks (CNNs) over three classical and three recently introduced challenging datasets to benchmark single image super-resolution. We introduce a taxonomy for deep-learning based super-resolution networks that groups existing methods into nine categories including linear, residual, multi-branch, recursive, progressive, attention-based and adversarial designs. We also provide comparisons between the models in terms of network complexity, memory footprint, model input and output, learning details, the type of network losses and important architectural differences (e.g., depth, skip-connections, filters). The extensive evaluation performed, shows the consistent and rapid growth in the accuracy in the past few years along with a corresponding boost in model complexity and the availability of large-scale datasets. It is also observed that the pioneering methods identified as the benchmark have been significantly outperformed by the current contenders. Despite the progress in recent years, we identify several shortcomings of existing techniques and provide future research directions towards the solution of these open problems.
△ Less
Submitted 23 March, 2020; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Real Image Denoising with Feature Attention
Authors:
Saeed Anwar,
Nick Barnes
Abstract:
Deep convolutional neural networks perform better on images containing spatially invariant noise (synthetic noise); however, their performance is limited on real-noisy photographs and requires multiple stage network modeling. To advance the practicability of denoising algorithms, this paper proposes a novel single-stage blind real image denoising network (RIDNet) by employing a modular architectur…
▽ More
Deep convolutional neural networks perform better on images containing spatially invariant noise (synthetic noise); however, their performance is limited on real-noisy photographs and requires multiple stage network modeling. To advance the practicability of denoising algorithms, this paper proposes a novel single-stage blind real image denoising network (RIDNet) by employing a modular architecture. We use a residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies. Furthermore, the evaluation in terms of quantitative metrics and visual quality on three synthetic and four real noisy datasets against 19 state-of-the-art algorithms demonstrate the superiority of our RIDNet.
△ Less
Submitted 23 March, 2020; v1 submitted 15 April, 2019;
originally announced April 2019.
-
Volumetric Convolution: Automatic Representation Learning in Unit Ball
Authors:
Sameera Ramasinghe,
Salman Khan,
Nick Barnes
Abstract:
Convolution is an efficient technique to obtain abstract feature representations using hierarchical layers in deep networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball ($\mathbb{B}^3$)---entails unique challenges. In this work, we propose a novel `\emph{volumetric co…
▽ More
Convolution is an efficient technique to obtain abstract feature representations using hierarchical layers in deep networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball ($\mathbb{B}^3$)---entails unique challenges. In this work, we propose a novel `\emph{volumetric convolution}' operation that can effectively convolve arbitrary functions in $\mathbb{B}^3$. We develop a theoretical framework for \emph{volumetric convolution} based on Zernike polynomials and efficiently implement it as a differentiable and an easily pluggable layer for deep networks. Furthermore, our formulation leads to derivation of a novel formula to measure the symmetry of a function in $\mathbb{B}^3$ around an arbitrary axis, that is useful in 3D shape analysis tasks. We demonstrate the efficacy of proposed volumetric convolution operation on a possible use-case i.e., 3D object recognition task.
△ Less
Submitted 3 January, 2019;
originally announced January 2019.
-
Asynchronous Spatial Image Convolutions for Event Cameras
Authors:
Cedric Scheerlinck,
Nick Barnes,
Robert Mahony
Abstract:
Spatial convolution is arguably the most fundamental of 2D image processing operations. Conventional spatial image convolution can only be applied to a conventional image, that is, an array of pixel values (or similar image representation) that are associated with a single instant in time. Event cameras have serial, asynchronous output with no natural notion of an image frame, and each event arriv…
▽ More
Spatial convolution is arguably the most fundamental of 2D image processing operations. Conventional spatial image convolution can only be applied to a conventional image, that is, an array of pixel values (or similar image representation) that are associated with a single instant in time. Event cameras have serial, asynchronous output with no natural notion of an image frame, and each event arrives with a different timestamp. In this paper, we propose a method to compute the convolution of a linear spatial kernel with the output of an event camera. The approach operates on the event stream output of the camera directly without synthesising pseudo-image frames as is common in the literature. The key idea is the introduction of an internal state that directly encodes the convolved image information, which is updated asynchronously as each event arrives from the camera. The state can be read-off as-often-as and whenever required for use in higher level vision algorithms for real-time robotic systems. We demonstrate the application of our method to corner detection, providing an implementation of a Harris corner-response "state" that can be used in real-time for feature detection and tracking on robotic systems.
△ Less
Submitted 8 February, 2019; v1 submitted 2 December, 2018;
originally announced December 2018.
-
From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts
Authors:
Moshiur R Farazi,
Salman H Khan,
Nick Barnes
Abstract:
Current Visual Question Answering (VQA) systems can answer intelligent questions about `Known' visual content. However, their performance drops significantly when questions about visually and linguistically `Unknown' concepts are presented during inference (`Open-world' scenario). A practical VQA system should be able to deal with novel concepts in real world settings. To address this problem, we…
▽ More
Current Visual Question Answering (VQA) systems can answer intelligent questions about `Known' visual content. However, their performance drops significantly when questions about visually and linguistically `Unknown' concepts are presented during inference (`Open-world' scenario). A practical VQA system should be able to deal with novel concepts in real world settings. To address this problem, we propose an exemplar-based approach that transfers learning (i.e., knowledge) from previously `Known' concepts to answer questions about the `Unknown'. We learn a highly discriminative joint embedding space, where visual and semantic features are fused to give a unified representation. Once novel concepts are presented to the model, it looks for the closest match from an exemplar set in the joint embedding space. This auxiliary information is used alongside the given Image-Question pair to refine visual attention in a hierarchical fashion. Since handling the high dimensional exemplars on large datasets can be a significant challenge, we introduce an efficient matching scheme that uses a compact feature description for search and retrieval. To evaluate our model, we propose a new split for VQA, separating Unknown visual and semantic concepts from the training set. Our approach shows significant improvements over state-of-the-art VQA models on the proposed Open-World VQA dataset and standard VQA datasets.
△ Less
Submitted 30 November, 2018;
originally announced November 2018.
-
Polarity Loss for Zero-shot Object Detection
Authors:
Shafin Rahman,
Salman Khan,
Nick Barnes
Abstract:
Conventional object detection models require large amounts of training data. In comparison, humans can recognize previously unseen objects by merely knowing their semantic description. To mimic similar behaviour, zero-shot object detection aims to recognize and localize 'unseen' object instances by using only their semantic information. The model is first trained to learn the relationships between…
▽ More
Conventional object detection models require large amounts of training data. In comparison, humans can recognize previously unseen objects by merely knowing their semantic description. To mimic similar behaviour, zero-shot object detection aims to recognize and localize 'unseen' object instances by using only their semantic information. The model is first trained to learn the relationships between visual and semantic domains for seen objects, later transferring the acquired knowledge to totally unseen objects. This setting gives rise to the need for correct alignment between visual and semantic concepts, so that the unseen objects can be identified using only their semantic attributes. In this paper, we propose a novel loss function called 'Polarity loss', that promotes correct visual-semantic alignment for an improved zero-shot object detection. On one hand, it refines the noisy semantic embeddings via metric learning on a 'Semantic vocabulary' of related concepts to establish a better synergy between visual and semantic domains. On the other hand, it explicitly maximizes the gap between positive and negative predictions to achieve better discrimination between seen, unseen and background objects. Our approach is inspired by embodiment theories in cognitive science, that claim human semantic understanding to be grounded in past experiences (seen objects), related linguistic concepts (word vocabulary) and visual perception (seen/unseen object images). We conduct extensive evaluations on MS-COCO and Pascal VOC datasets, showing significant improvements over state of the art.
△ Less
Submitted 2 April, 2020; v1 submitted 21 November, 2018;
originally announced November 2018.
-
Continuous-time Intensity Estimation Using Event Cameras
Authors:
Cedric Scheerlinck,
Nick Barnes,
Robert Mahony
Abstract:
Event cameras provide asynchronous, data-driven measurements of local temporal contrast over a large dynamic range with extremely high temporal resolution. Conventional cameras capture low-frequency reference intensity information. These two sensor modalities provide complementary information. We propose a computationally efficient, asynchronous filter that continuously fuses image frames and even…
▽ More
Event cameras provide asynchronous, data-driven measurements of local temporal contrast over a large dynamic range with extremely high temporal resolution. Conventional cameras capture low-frequency reference intensity information. These two sensor modalities provide complementary information. We propose a computationally efficient, asynchronous filter that continuously fuses image frames and events into a single high-temporal-resolution, high-dynamic-range image state. In absence of conventional image frames, the filter can be run on events only. We present experimental results on high-speed, high-dynamic-range sequences, as well as on new ground truth datasets we generate to demonstrate the proposed algorithm outperforms existing state-of-the-art methods.
△ Less
Submitted 1 November, 2018;
originally announced November 2018.
-
Adversarial Training of Variational Auto-encoders for High Fidelity Image Generation
Authors:
Salman H. Khan,
Munawar Hayat,
Nick Barnes
Abstract:
Variational auto-encoders (VAEs) provide an attractive solution to image generation problem. However, they tend to produce blurred and over-smoothed images due to their dependence on pixel-wise reconstruction loss. This paper introduces a new approach to alleviate this problem in the VAE based generative models. Our model simultaneously learns to match the data, reconstruction loss and the latent…
▽ More
Variational auto-encoders (VAEs) provide an attractive solution to image generation problem. However, they tend to produce blurred and over-smoothed images due to their dependence on pixel-wise reconstruction loss. This paper introduces a new approach to alleviate this problem in the VAE based generative models. Our model simultaneously learns to match the data, reconstruction loss and the latent distributions of real and fake images to improve the quality of generated samples. To compute the loss distributions, we introduce an auto-encoder based discriminator model which allows an adversarial learning procedure. The discriminator in our model also provides perceptual guidance to the VAE by matching the learned similarity metric of the real and fake samples in the latent space. To stabilize the overall training process, our model uses an error feedback approach to maintain the equilibrium between competing networks in the model. Our experiments show that the generated samples from our proposed model exhibit a diverse set of attributes and facial expressions and scale up to high-resolution images very well.
△ Less
Submitted 26 April, 2018;
originally announced April 2018.
-
Deep Texture and Structure Aware Filtering Network for Image Smoothing
Authors:
Kaiyue Lu,
Shaodi You,
Nick Barnes
Abstract:
Image smoothing is a fundamental task in computer vision, that aims to retain salient structures and remove insignificant textures. In this paper, we aim to address the fundamental shortcomings of existing image smoothing methods, which cannot properly distinguish textures and structures with similar low-level appearance. While deep learning approaches have started to explore the preservation of s…
▽ More
Image smoothing is a fundamental task in computer vision, that aims to retain salient structures and remove insignificant textures. In this paper, we aim to address the fundamental shortcomings of existing image smoothing methods, which cannot properly distinguish textures and structures with similar low-level appearance. While deep learning approaches have started to explore the preservation of structure through image smoothing, existing work does not yet properly address textures. To this end, we generate a large dataset by blending natural textures with clean structure-only images, and then build a texture prediction network (TPN) that predicts the location and magnitude of textures. We then combine the TPN with a semantic structure prediction network (SPN) so that the final texture and structure aware filtering network (TSAFN) is able to identify the textures to remove ("texture-awareness") and the structures to preserve ("structure-awareness"). The proposed model is easy to understand and implement, and shows excellent performance on real images in the wild as well as our generated dataset.
△ Less
Submitted 7 May, 2018; v1 submitted 7 December, 2017;
originally announced December 2017.
-
Learning RGB-D Salient Object Detection using background enclosure, depth contrast, and top-down features
Authors:
Riku Shigematsu,
David Feng,
Shaodi You,
Nick Barnes
Abstract:
Recently, deep Convolutional Neural Networks (CNN) have demonstrated strong performance on RGB salient object detection. Although, depth information can help improve detection results, the exploration of CNNs for RGB-D salient object detection remains limited. Here we propose a novel deep CNN architecture for RGB-D salient object detection that exploits high-level, mid-level, and low level feature…
▽ More
Recently, deep Convolutional Neural Networks (CNN) have demonstrated strong performance on RGB salient object detection. Although, depth information can help improve detection results, the exploration of CNNs for RGB-D salient object detection remains limited. Here we propose a novel deep CNN architecture for RGB-D salient object detection that exploits high-level, mid-level, and low level features. Further, we present novel depth features that capture the ideas of background enclosure and depth contrast that are suitable for a learned approach. We show improved results compared to state-of-the-art RGB-D salient object detection methods. We also show that the low-level and mid-level depth features both contribute to improvements in the results. Especially, F-Score of our method is 0.848 on RGBD1000 dataset, which is 10.7% better than the second place.
△ Less
Submitted 10 May, 2017;
originally announced May 2017.
-
Perceptually Consistent Color-to-Gray Image Conversion
Authors:
Shaodi You,
Nick Barnes,
Janine Walker
Abstract:
In this paper, we propose a color to grayscale image conversion algorithm (C2G) that aims to preserve the perceptual properties of the color image as much as possible. To this end, we propose measures for two perceptual properties based on contemporary research in vision science: brightness and multi-scale contrast. The brightness measurement is based on the idea that the brightness of a grayscale…
▽ More
In this paper, we propose a color to grayscale image conversion algorithm (C2G) that aims to preserve the perceptual properties of the color image as much as possible. To this end, we propose measures for two perceptual properties based on contemporary research in vision science: brightness and multi-scale contrast. The brightness measurement is based on the idea that the brightness of a grayscale image will affect the perception of the probability of color information. The color contrast measurement is based on the idea that the contrast of a given pixel to its surroundings can be measured as a linear combination of color contrast at different scales. Based on these measures we propose a graph based optimization framework to balance the brightness and contrast measurements. To solve the optimization, an $\ell_1$-norm based method is provided which converts color discontinuities to brightness discontinuities. To validate our methods, we evaluate against the existing \cadik and Color250 datasets, and against NeoColor, a new dataset that improves over existing C2G datasets. NeoColor contains around 300 images from typical C2G scenarios, including: commercial photograph, printing, books, magazines, masterpiece artworks and computer designed graphics. We show improvements in metrics of performance, and further through a user study, we validate the performance of both the algorithm and the metric.
△ Less
Submitted 6 May, 2016;
originally announced May 2016.
-
Totally Corrective Multiclass Boosting with Binary Weak Learners
Authors:
Zhihui Hao,
Chunhua Shen,
Nick Barnes,
Bo Wang
Abstract:
In this work, we propose a new optimization framework for multiclass boosting learning. In the literature, AdaBoost.MO and AdaBoost.ECC are the two successful multiclass boosting algorithms, which can use binary weak learners. We explicitly derive these two algorithms' Lagrange dual problems based on their regularized loss functions. We show that the Lagrange dual formulations enable us to design…
▽ More
In this work, we propose a new optimization framework for multiclass boosting learning. In the literature, AdaBoost.MO and AdaBoost.ECC are the two successful multiclass boosting algorithms, which can use binary weak learners. We explicitly derive these two algorithms' Lagrange dual problems based on their regularized loss functions. We show that the Lagrange dual formulations enable us to design totally-corrective multiclass algorithms by using the primal-dual optimization technique. Experiments on benchmark data sets suggest that our multiclass boosting can achieve a comparable generalization capability with state-of-the-art, but the convergence speed is much faster than stage-wise gradient descent boosting. In other words, the new totally corrective algorithms can maximize the margin more aggressively.
△ Less
Submitted 20 September, 2010;
originally announced September 2010.
-
Asymmetric Totally-corrective Boosting for Real-time Object Detection
Authors:
Peng Wang,
Chunhua Shen,
Nick Barnes,
Hong Zheng,
Zhang Ren
Abstract:
Real-time object detection is one of the core problems in computer vision. The cascade boosting framework proposed by Viola and Jones has become the standard for this problem. In this framework, the learning goal for each node is asymmetric, which is required to achieve a high detection rate and a moderate false positive rate. We develop new boosting algorithms to address this asymmetric learning…
▽ More
Real-time object detection is one of the core problems in computer vision. The cascade boosting framework proposed by Viola and Jones has become the standard for this problem. In this framework, the learning goal for each node is asymmetric, which is required to achieve a high detection rate and a moderate false positive rate. We develop new boosting algorithms to address this asymmetric learning problem. We show that our methods explicitly optimize asymmetric loss objectives in a totally corrective fashion. The methods are totally corrective in the sense that the coefficients of all selected weak classifiers are updated at each iteration. In contract, conventional boosting like AdaBoost is stage-wise in that only the current weak classifier's coefficient is updated. At the heart of the totally corrective boosting is the column generation technique. Experiments on face detection show that our methods outperform the state-of-the-art asymmetric boosting methods.
△ Less
Submitted 15 September, 2010;
originally announced September 2010.
-
Totally Corrective Boosting for Regularized Risk Minimization
Authors:
Chunhua Shen,
Hanxi Li,
Nick Barnes
Abstract:
Consideration of the primal and dual problems together leads to important new insights into the characteristics of boosting algorithms. In this work, we propose a general framework that can be used to design new boosting algorithms. A wide variety of machine learning problems essentially minimize a regularized risk functional. We show that the proposed boosting framework, termed CGBoost, can accom…
▽ More
Consideration of the primal and dual problems together leads to important new insights into the characteristics of boosting algorithms. In this work, we propose a general framework that can be used to design new boosting algorithms. A wide variety of machine learning problems essentially minimize a regularized risk functional. We show that the proposed boosting framework, termed CGBoost, can accommodate various loss functions and different regularizers in a totally-corrective optimization fashion. We show that, by solving the primal rather than the dual, a large body of totally-corrective boosting algorithms can actually be efficiently solved and no sophisticated convex optimization solvers are needed. We also demonstrate that some boosting algorithms like AdaBoost can be interpreted in our framework--even their optimization is not totally corrective. We empirically show that various boosting algorithms based on the proposed framework perform similarly on the UCIrvine machine learning datasets [1] that we have used in the experiments.
△ Less
Submitted 11 December, 2011; v1 submitted 30 August, 2010;
originally announced August 2010.
-
The effects of superconductor-stabilizer interfacial resistance on quench of a pancake coil made out of coated conductor
Authors:
G. A. Levin,
W. A. Jones,
K. A. Novak,
P. N. Barnes
Abstract:
We present the results of numerical analysis of normal zone propagation in a stack of $YBa_2Cu_3O_{7-x}$ coated conductors which imitates a pancake coil. Our main purpose is to determine whether the quench protection quality of such coils can be substantially improved by increased contact resistance between the superconducting film and the stabilizer. We show that with increased contact resistance…
▽ More
We present the results of numerical analysis of normal zone propagation in a stack of $YBa_2Cu_3O_{7-x}$ coated conductors which imitates a pancake coil. Our main purpose is to determine whether the quench protection quality of such coils can be substantially improved by increased contact resistance between the superconducting film and the stabilizer. We show that with increased contact resistance the speed of normal zone propagation increases, the detection of a normal zone inside the coil becomes possible earlier, when the peak temperature inside the normal zone is lower, and stability margins shrink. Thus, increasing contact resistance may become a viable option for improving the prospects of coated conductors for high $T_c$ magnets applications.
△ Less
Submitted 28 July, 2010; v1 submitted 21 July, 2010;
originally announced July 2010.
-
Use of 2G coated conductors for efficient shielding of DC magnetic fields
Authors:
J. -F. Fagnard,
M. Dirickx,
G. A. Levin,
P. N. Barnes,
B. Vanderheyden,
P. Vanderbemden
Abstract:
This paper reports the results of an experimental investigation of the performance of two types of magnetic screens assembled from YBa2Cu3O7-d (YBCO) coated conductors. Since effective screening of the axial DC magnetic field requires the unimpeded flow of an azimuthal persistent current, we demonstrate a configuration of a screening shell made out of standard YBCO coated conductor capable to acco…
▽ More
This paper reports the results of an experimental investigation of the performance of two types of magnetic screens assembled from YBa2Cu3O7-d (YBCO) coated conductors. Since effective screening of the axial DC magnetic field requires the unimpeded flow of an azimuthal persistent current, we demonstrate a configuration of a screening shell made out of standard YBCO coated conductor capable to accomplish that. The screen allows the persistent current to flow in the predominantly azimuthal direction at a temperature of 77 K. The persistent screen, incorporating a single layer of superconducting film, can attenuate an external magnetic field of up to 5 mT by more than an order of magnitude. For comparison purposes, another type of screen which incorporates low critical temperature quasi-persistent joints was also built. The shielding technique we describe here appears to be especially promising for the realization of large scale high-Tc superconducting screens.
△ Less
Submitted 17 June, 2010;
originally announced June 2010.
-
The effects of superconductor-stabilizer interfacial resistance on quench of current-carrying coated conductor
Authors:
G. A. Levin,
P. N. Barnes,
K. A. Novak
Abstract:
We present the results of numerical analysis of a model of normal zone propagation in coated conductors. The main emphasis is on the effects of increased contact resistance between the superconducting film and the stabilizer on the speed of normal zone propagation, the maximum temperature rise inside the normal zone, and the stability margins. We show that with increasing contact resistance the…
▽ More
We present the results of numerical analysis of a model of normal zone propagation in coated conductors. The main emphasis is on the effects of increased contact resistance between the superconducting film and the stabilizer on the speed of normal zone propagation, the maximum temperature rise inside the normal zone, and the stability margins. We show that with increasing contact resistance the speed of normal zone propagation increases, the maximum temperature inside the normal zone decreases, and stability margins shrink. This may have an overall beneficial effect on quench protection quality of coated conductors. We also briefly discuss the propagation of solitons and development of the temperature modulation along the wire.
△ Less
Submitted 28 September, 2009;
originally announced September 2009.
-
Emergence of dissipative structures in current-carrying superconducting wires
Authors:
G. A. Levin,
P. N. Barnes,
J. P. Rodriguez,
J. A. Connors,
J. S. Bulmer
Abstract:
We discuss the emergence of a spontaneous temperature and critical current spatial modulation in current-carrying high temperature superconducting wire. The modulation of the critical current along the wire on a scale of 3 - 10 mm forces a fraction of the transport current to crisscross the resistive interface between the superconducting film and normal metal stabilizer attached to it. This gene…
▽ More
We discuss the emergence of a spontaneous temperature and critical current spatial modulation in current-carrying high temperature superconducting wire. The modulation of the critical current along the wire on a scale of 3 - 10 mm forces a fraction of the transport current to crisscross the resistive interface between the superconducting film and normal metal stabilizer attached to it. This generates additional heat that allows such a structure to be self sustainable. Stability and the conditions for experimental observation of this phenomenon are also discussed.
△ Less
Submitted 22 April, 2009;
originally announced April 2009.
-
Stability and normal zone propagation speed in YBCO coated conductors with increased interfacial resistance
Authors:
George A. Levin,
Paul N. Barnes,
Jose P. Rodriguez,
Jake A. Connors,
John S. Bulmer
Abstract:
We will discuss how stability and speed of normal zone propagation in YBCO-coated conductors is affected by interfacial resistance between the superconducting film and the stabilizer. Our numerical simulation has shown that the increased interfacial resistance substantially increases speed of normal zone propagation and decreases the stability margins. Optimization of the value of the resistance…
▽ More
We will discuss how stability and speed of normal zone propagation in YBCO-coated conductors is affected by interfacial resistance between the superconducting film and the stabilizer. Our numerical simulation has shown that the increased interfacial resistance substantially increases speed of normal zone propagation and decreases the stability margins. Optimization of the value of the resistance may lead to a better compromise between stability and quench protection requirements than what is found in currently manufactured coated conductors.
△ Less
Submitted 4 November, 2008; v1 submitted 25 August, 2008;
originally announced August 2008.
-
In-Field Critical Current of Type-II Superconductors Caused by Strain from Nano-scale Columnar Inclusions
Authors:
J. P. Rodriguez,
P. N. Barnes,
C. V. Varanasi
Abstract:
The results of a linear elasticity analysis yields that nano-rod inclusions aligned along the c axis of a thin film of YBa2Cu3O{7-delta}, such as BaZrO3 and BaSnO3, squeeze that matrix by pure shear. The sensitivity of the superconducting critical temperature in that material to the latter implies that the phase boundary separating the nano-rod inclusion from the superconductor acts as a collect…
▽ More
The results of a linear elasticity analysis yields that nano-rod inclusions aligned along the c axis of a thin film of YBa2Cu3O{7-delta}, such as BaZrO3 and BaSnO3, squeeze that matrix by pure shear. The sensitivity of the superconducting critical temperature in that material to the latter implies that the phase boundary separating the nano-rod inclusion from the superconductor acts as a collective pinning center for the vortex lattice that appears in external magnetic field. A dominant contribution to the in-field critical current can result. The elasticity analysis also finds that the growth of nano-rod inclusions can be weakly metastable when the inclusion is softer than the matrix.
△ Less
Submitted 21 October, 2008; v1 submitted 2 April, 2008;
originally announced April 2008.
-
Normal zone in $YBa_2Cu_3O_{6+x}$-coated conductors
Authors:
George A. Levin,
Paul N. Barnes
Abstract:
We consider the distribution of an electric field in YBCO-coated conductors for a situation in which the DC transport current is forced into the copper stabilizer due to a weak link -- a section of the superconducting film with a critical current less than the transport current. The electric field in the metal substrate is also discussed. The results are compared with recent experiments on norma…
▽ More
We consider the distribution of an electric field in YBCO-coated conductors for a situation in which the DC transport current is forced into the copper stabilizer due to a weak link -- a section of the superconducting film with a critical current less than the transport current. The electric field in the metal substrate is also discussed. The results are compared with recent experiments on normal zone propagation in coated conductors for which the substrate and stabilizer are insulated from each other. The potential difference between the substrate and stabilizer, and the electric field in the substrate outside the normal zone can be accounted for by a large screening length in the substrate, comparable to the length of the sample. During a quench, the electric field inside the interface between YBCO and stabilizer, as well as in the buffer layer, can be several orders of magnitude greater than the longitudinal macroscopic electric field inside the normal zone. We speculate on the possibility of using possible microscopic electric discharges caused by this large ($\sim $kV/cm) electric field as a means to detect a quench.
△ Less
Submitted 27 June, 2007;
originally announced June 2007.
-
Current sharing between superconducting film and normal metal
Authors:
George A. Levin,
Paul N. Barnes,
John S. Bulmer
Abstract:
A two-dimensional model is introduced that describes current sharing between the superconducting and normal metal layers in configuration typical of YBCO-coated conductors. The model is used to compare the effectiveness of surround stabilizer and more conventional one-sided stabilizer. When the resistance of the interface between the superconductor and stabilizer is low enough, the surround stab…
▽ More
A two-dimensional model is introduced that describes current sharing between the superconducting and normal metal layers in configuration typical of YBCO-coated conductors. The model is used to compare the effectiveness of surround stabilizer and more conventional one-sided stabilizer. When the resistance of the interface between the superconductor and stabilizer is low enough, the surround stabilizer is less effective than the one-sided stabilizer in stabilizing a hairline crack in the superconducting film.
△ Less
Submitted 7 March, 2007;
originally announced March 2007.
-
AC-Tolerant Multifilament Coated Conductors
Authors:
G. A. Levin,
P. N. Barnes,
N. Amemiya
Abstract:
We report the magnetization losses in an experimental multifilament coated conductor. A 4 mm wide and 10 cm long YBCO coated conductor was subdivided into eight 0.5 mm wide filaments by laser ablation and subjected to post-ablation treatment. As the result, the hysteresis loss was reduced, as expected, in proportion to the width of the filaments. However, the coupling loss was reduced dramatical…
▽ More
We report the magnetization losses in an experimental multifilament coated conductor. A 4 mm wide and 10 cm long YBCO coated conductor was subdivided into eight 0.5 mm wide filaments by laser ablation and subjected to post-ablation treatment. As the result, the hysteresis loss was reduced, as expected, in proportion to the width of the filaments. However, the coupling loss was reduced dramatically, and became practically negligible, in the range of a sweep rate up to 20 T/s. This represents a drastic improvement on previous multifilament conductors in which often the coupling losses became equal to the hysteresis loss at a sweep rate as low as 3-4 T/s. These results demonstrate that there is an effective and practical way to suppress coupling losses in coated multifilament conductors.
△ Less
Submitted 19 September, 2006;
originally announced September 2006.
-
Multifilament YBa2Cu3O6+x -coated conductors with minimized coupling losses
Authors:
G. A. Levin,
P. N. Barnes,
J. W. Kell,
N. Amemiya,
Z. Jiang,
K. Yoda,
F. Kimura
Abstract:
We report an experimental approach to making multifilament coated conductors with low losses in applied time-varying magnetic field. Previously, the multifilament conductors obtained for that purpose by laser ablation suffered from high coupling losses. Here we report how this problem can be solved. When the substrate metal in the grooves segregating the filaments is exposed to oxygen, it forms…
▽ More
We report an experimental approach to making multifilament coated conductors with low losses in applied time-varying magnetic field. Previously, the multifilament conductors obtained for that purpose by laser ablation suffered from high coupling losses. Here we report how this problem can be solved. When the substrate metal in the grooves segregating the filaments is exposed to oxygen, it forms high resistivity oxides that electrically insulate the stripes from each other and from the substrate. As the result, the coupling loss has become negligible over the entire range of tested parameters (magnetic field amplitudes B and frequencies f) available to us.
△ Less
Submitted 11 May, 2006;
originally announced May 2006.
-
Coupling losses and transverse resistivity of multifilament YBCO coated superconductors
Authors:
M. Polak,
L. Krempasky,
E. Usak,
L. Jansak,
E. Demencik,
G. A. Levin,
P. N. Barnes,
D. Wehler,
B. Moenter
Abstract:
We studied the magnetization losses of four different types of filamentary YBCO coated conductors. A 10 mm wide YBCO coated conductor was subdivided into 20 filaments by laser cutting. The separation of coupling loss from the total is possible because the energy loss per cycle in samples with electrically insulated filaments has a very small frequency dependence. We measured the frequency depend…
▽ More
We studied the magnetization losses of four different types of filamentary YBCO coated conductors. A 10 mm wide YBCO coated conductor was subdivided into 20 filaments by laser cutting. The separation of coupling loss from the total is possible because the energy loss per cycle in samples with electrically insulated filaments has a very small frequency dependence. We measured the frequency dependence of the total losses in the frequency range between 0.1 Hz and 500 Hz. The coupling loss was obtained from the total by subtracting the hysteresis loss. The latter was measured at low frequencies since only hysteresis loss is non-negligible at frequencies below 1 Hz. The transverse resistivity was determined from the coupling losses; it was assumed that the sample length is equal to half of the twist pitch. The values of transverse resistivity deduced from the loss data were compared with those obtained by the four-point measurements with current flowing perpendicular to the filaments. Preliminary results indicate that the current method of laser ablation creates electrical contacts between the superconducting filaments and the substrate. This was also confirmed by the Hall probe map** of the magnetic field in the vicinity of the tape. The measured transverse resistivity was close to the resistivity of the substrate (Hastelloy).
△ Less
Submitted 17 February, 2006;
originally announced February 2006.
-
Magnetization Losses in Multiply Connected YBa2Cu3O6+x Coated Conductors
Authors:
G. A. Levin,
P. N. Barnes,
Naoyuki Amemiya,
Satoshi Kasai,
Keiji Yoda,
Zhenan Jiang,
A. Polyanskii
Abstract:
We report the results of a magnetization losses study in experimental multifilament, multiply connected coated superconductors exposed to time-varying magnetic field. In these samples, the superconducting layer is divided into parallel stripes segregated by non-superconducting grooves. In order to facilitate the current sharing between the stripes and thus increase the reliability of the striate…
▽ More
We report the results of a magnetization losses study in experimental multifilament, multiply connected coated superconductors exposed to time-varying magnetic field. In these samples, the superconducting layer is divided into parallel stripes segregated by non-superconducting grooves. In order to facilitate the current sharing between the stripes and thus increase the reliability of the striated conductors, a sparse network of superconducting bridges is superimposed on the striated film. We find that the presence of the bridges does not substantially increase the magnetization losses, both hysteresis and coupling, as long as the number of bridges per length of the sample is not large. These results indicate that it is possible to find a reasonable compromise between the competing requirements of connectivity and loss reduction in an ac-tolerant version of the high temperature coated conductors specifically designed for ac power applications.
△ Less
Submitted 21 October, 2005;
originally announced October 2005.
-
The Integration of YBCO Coated Conductors into Magnets and Rotating Machinery
Authors:
G. A. Levin,
P. N. Barnes
Abstract:
The implementation of the 2nd generation high-Tc superconductors in power applications, such as electrical transformers, motors and generators requires superconducting wires that are superior to copper Litz wires at cryogenic temperatures in terms of losses in time-varying magnetic field, as well as in engineering current density. Another problem is to find a way to make practical coils and arma…
▽ More
The implementation of the 2nd generation high-Tc superconductors in power applications, such as electrical transformers, motors and generators requires superconducting wires that are superior to copper Litz wires at cryogenic temperatures in terms of losses in time-varying magnetic field, as well as in engineering current density. Another problem is to find a way to make practical coils and armatures out of flat tape-like conductors with low bending strain tolerance. We discuss several novel approaches to the construction of coils and armatures based specifically on the properties of coated conductors manufactured today.
△ Less
Submitted 13 September, 2005;
originally announced September 2005.
-
Concept of Multiply Connected Superconducting Tapes
Authors:
G. A. Levin,
P. N. Barnes
Abstract:
The possibility of a substantial reduction of weight and size of electrical generators is the main incentive behind the effort to develop superconducting armature windings based on coated conductors in the form of wide tapes with large aspect ratio. The main obstacle to the application of coated superconductors in stator windings is the large losses incurred due to the ac magnetic field produced…
▽ More
The possibility of a substantial reduction of weight and size of electrical generators is the main incentive behind the effort to develop superconducting armature windings based on coated conductors in the form of wide tapes with large aspect ratio. The main obstacle to the application of coated superconductors in stator windings is the large losses incurred due to the ac magnetic field produced by the rotor's dc coils of the field windings. In the range of frequencies typical for aircraft generators, the hysteretic losses in wide tapes are unacceptably high. They can be reduced by dividing the superconducting layer into multiple filaments separated by non-superconducting barriers. However, the lack of current sharing between the filaments makes the conductor vulnerable to the localized defects, so that a single blockage can impede the flow of transport current through the whole length of a given filament. We present estimates of reliability as well as the magnetization losses in multiply connected superconductors. In this type of superconducting tape, a sparse network of superconducting bridges, which allows for current sharing, connects the filaments. The trade-off between the different types of losses and the connectivity requirement imposes restrictions on the number of filaments and properties of the network of bridges.
△ Less
Submitted 16 November, 2004;
originally announced November 2004.
-
AC Loss in Striped (Filamentary) YBCO Coated Conductors Leading to Designs for High Frequencies and Field-Sweep Amplitudes
Authors:
M. D. Sumption,
P. N. Barnes,
E. W. Collings
Abstract:
AC losses of YBCO coated conductors are investigated by calculation and experiment for the higher frequency regime. Previous research using YBCO film deposited onto single crystal substrates demonstrated the effectiveness of filamentary subdivision as a technique for AC loss reduction. As a result of these studies the idea of subdividing YBCO coated conductors (both YBCO, overlayer, and even und…
▽ More
AC losses of YBCO coated conductors are investigated by calculation and experiment for the higher frequency regime. Previous research using YBCO film deposited onto single crystal substrates demonstrated the effectiveness of filamentary subdivision as a technique for AC loss reduction. As a result of these studies the idea of subdividing YBCO coated conductors (both YBCO, overlayer, and even underlayer) into such stripes suggested itself. The suggestion was implemented by burning grooves into samples of coated conductor using laser micromachining. Various machining parameters were investigated, and the stri** and slicing characteristics are presented. Loss measurements were performed on unstriped as well as striped samples by the pick-up coil technique at frequencies of from 50-200 Hz at field sweep amplitudes of up to 150 mT. The effect of soft ferromagnetic Fe shielding was also investigated. The results of the experiments form a starting point for a more general study of reduced loss coated conductor design (including hysteretic, coupling, normal eddy current, and transport losses) projected into higher ranges of frequency and field sweep amplitude with transformer and all cryogenic motor/generator applications in mind.
△ Less
Submitted 12 October, 2004;
originally announced October 2004.
-
Magnetization Losses in Multifilament Coated Superconductors
Authors:
G. A. Levin,
P. N. Barnes,
N. Amemiya,
S. Kasai,
K. Yoda,
Z. Jiang
Abstract:
We report the results of a study of the magnetization losses in experimental multifilament, as well as control (uniform), coated superconductors exposed to time-varying magnetic field of various frequencies. Both the hysteresis loss, proportional to the sweep rate of the applied magnetic field, and the coupling loss, proportional to the square of the sweep rate, have been observed. A scaling is…
▽ More
We report the results of a study of the magnetization losses in experimental multifilament, as well as control (uniform), coated superconductors exposed to time-varying magnetic field of various frequencies. Both the hysteresis loss, proportional to the sweep rate of the applied magnetic field, and the coupling loss, proportional to the square of the sweep rate, have been observed. A scaling is found that allows us to quantify each of these contributions and extrapolate the results of the experiment beyond the envelope of accessible field amplitude and frequency. The combined loss in the multifilament conductor is reduced by about 90% in comparison with the uniform conductor at full field penetration at sweep rate as high as 3T/s.
△ Less
Submitted 9 September, 2004;
originally announced September 2004.