-
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
Authors:
Ali N. Salman,
Zongyang Du,
Shreeram Suresh Chandra,
Ismail Rasim Ulgen,
Carlos Busso,
Berrak Sisman
Abstract:
Voice conversion (VC) research traditionally depends on scripted or acted speech, which lacks the natural spontaneity of real-life conversations. While natural speech data is limited for VC, our study focuses on filling in this gap. We introduce a novel data-sourcing pipeline that makes the release of a natural speech dataset for VC, named NaturalVoices. The pipeline extracts rich information in s…
▽ More
Voice conversion (VC) research traditionally depends on scripted or acted speech, which lacks the natural spontaneity of real-life conversations. While natural speech data is limited for VC, our study focuses on filling in this gap. We introduce a novel data-sourcing pipeline that makes the release of a natural speech dataset for VC, named NaturalVoices. The pipeline extracts rich information in speech such as emotion and signal-to-noise ratio (SNR) from raw podcast data, utilizing recent deep learning methods and providing flexibility and ease of use. NaturalVoices marks a large-scale, spontaneous, expressive, and emotional speech dataset, comprising over 3,800 hours speech sourced from the original podcasts in the MSP-Podcast dataset. Objective and subjective evaluations demonstrate the effectiveness of using our pipeline for providing natural and expressive data for VC, suggesting the potential of NaturalVoices for broader speech generation tasks.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Style Mixture of Experts for Expressive Text-To-Speech Synthesis
Authors:
Ahad Jawaid,
Shreeram Suresh Chandra,
Junchen Lu,
Berrak Sisman
Abstract:
Recent advances in style transfer text-to-speech (TTS) have improved the expressiveness of synthesized speech. Despite these advancements, encoding stylistic information from diverse and unseen reference speech remains challenging. This paper introduces StyleMoE, an approach that divides the embedding space, modeled by the style encoder, into tractable subsets handled by style experts. The propose…
▽ More
Recent advances in style transfer text-to-speech (TTS) have improved the expressiveness of synthesized speech. Despite these advancements, encoding stylistic information from diverse and unseen reference speech remains challenging. This paper introduces StyleMoE, an approach that divides the embedding space, modeled by the style encoder, into tractable subsets handled by style experts. The proposed method replaces the style encoder in a TTS system with a Mixture of Experts (MoE) layer. By utilizing a gating network to route reference speeches to different style experts, each expert specializes in aspects of the style space during optimization. Our experiments objectively and subjectively demonstrate the effectiveness of our proposed method in increasing the coverage of the style space for diverse and unseen styles. This approach can enhance the performance of existing state-of-the-art style transfer TTS models, marking the first study of MoE in style transfer TTS to our knowledge.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Exploring speech style spaces with language models: Emotional TTS without emotion labels
Authors:
Shreeram Suresh Chandra,
Zongyang Du,
Berrak Sisman
Abstract:
Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain. Learning emotional prosody implicitly presents a tough challenge due to the subjective nature of emotions. In this study, we propose a novel approach that leverages text awareness to acquire emotional styles without the need for explicit emotion labels or t…
▽ More
Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain. Learning emotional prosody implicitly presents a tough challenge due to the subjective nature of emotions. In this study, we propose a novel approach that leverages text awareness to acquire emotional styles without the need for explicit emotion labels or text prompts. We present TEMOTTS, a two-stage framework for E-TTS that is trained without emotion labels and is capable of inference without auxiliary inputs. Our proposed method performs knowledge transfer between the linguistic space learned by BERT and the emotional style space constructed by global style tokens. Our experimental results demonstrate the effectiveness of our proposed framework, showcasing improvements in emotional accuracy and naturalness. This is one of the first studies to leverage the emotional correlation between spoken content and expressive delivery for emotional TTS.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Machine Learning Applications in Traumatic Brain Injury: A Spotlight on Mild TBI
Authors:
Hanem Ellethy,
Shekhar S. Chandra,
Viktor Vegh
Abstract:
Traumatic Brain Injury (TBI) poses a significant global public health challenge, contributing to high morbidity and mortality rates and placing a substantial economic burden on healthcare systems worldwide. The diagnosis of TBI relies on clinical information along with Computed Tomography (CT) scans. Addressing the multifaceted challenges posed by TBI has seen the development of innovative, data-d…
▽ More
Traumatic Brain Injury (TBI) poses a significant global public health challenge, contributing to high morbidity and mortality rates and placing a substantial economic burden on healthcare systems worldwide. The diagnosis of TBI relies on clinical information along with Computed Tomography (CT) scans. Addressing the multifaceted challenges posed by TBI has seen the development of innovative, data-driven approaches, for this complex condition. Particularly noteworthy is the prevalence of mild TBI (mTBI), which constitutes the majority of TBI cases where conventional methods often fall short. As such, we review the state-of-the-art Machine Learning (ML) techniques applied to clinical information and CT scans in TBI, with a particular focus on mTBI. We categorize ML applications based on their data sources, and there is a spectrum of ML techniques used to date. Most of these techniques have primarily focused on diagnosis, with relatively few attempts at predicting the prognosis. This review may serve as a source of inspiration for future research studies aimed at improving the diagnosis of TBI using data-driven approaches and standard diagnostic data.
△ Less
Submitted 11 January, 2024; v1 submitted 7 January, 2024;
originally announced January 2024.
-
Enhancing mTBI Diagnosis with Residual Triplet Convolutional Neural Network Using 3D CT
Authors:
Hanem Ellethy,
Shekhar S. Chandra,
Viktor Vegh
Abstract:
Mild Traumatic Brain Injury (mTBI) is a common and challenging condition to diagnose accurately. Timely and precise diagnosis is essential for effective treatment and improved patient outcomes. Traditional diagnostic methods for mTBI often have limitations in terms of accuracy and sensitivity. In this study, we introduce an innovative approach to enhance mTBI diagnosis using 3D Computed Tomography…
▽ More
Mild Traumatic Brain Injury (mTBI) is a common and challenging condition to diagnose accurately. Timely and precise diagnosis is essential for effective treatment and improved patient outcomes. Traditional diagnostic methods for mTBI often have limitations in terms of accuracy and sensitivity. In this study, we introduce an innovative approach to enhance mTBI diagnosis using 3D Computed Tomography (CT) images and a metric learning technique trained with triplet loss. To address these challenges, we propose a Residual Triplet Convolutional Neural Network (RTCNN) model to distinguish between mTBI cases and healthy ones by embedding 3D CT scans into a feature space. The triplet loss function maximizes the margin between similar and dissimilar image pairs, optimizing feature representations. This facilitates better context placement of individual cases, aids informed decision-making, and has the potential to improve patient outcomes. Our RTCNN model shows promising performance in mTBI diagnosis, achieving an average accuracy of 94.3%, a sensitivity of 94.1%, and a specificity of 95.2%, as confirmed through a five-fold cross-validation. Importantly, when compared to the conventional Residual Convolutional Neural Network (RCNN) model, the RTCNN exhibits a significant improvement, showcasing a remarkable 22.5% increase in specificity, a notable 16.2% boost in accuracy, and an 11.3% enhancement in sensitivity. Moreover, RTCNN requires lower memory resources, making it not only highly effective but also resource-efficient in minimizing false positives while maximizing its diagnostic accuracy in distinguishing normal CT scans from mTBI cases. The quantitative performance metrics provided and utilization of occlusion sensitivity maps to visually explain the model's decision-making process further enhance the interpretability and transparency of our approach.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Single Image Compressed Sensing MRI via a Self-Supervised Deep Denoising Approach
Authors:
Marlon Bran Lorenzana,
Feng Liu,
Shekhar S. Chandra
Abstract:
Popular methods in compressed sensing (CS) are dependent on deep learning (DL), where large amounts of data are used to train non-linear reconstruction models. However, ensuring generalisability over and access to multiple datasets is challenging to realise for real-world applications. To address these concerns, this paper proposes a single image, self-supervised (SS) CS-MRI framework that enables…
▽ More
Popular methods in compressed sensing (CS) are dependent on deep learning (DL), where large amounts of data are used to train non-linear reconstruction models. However, ensuring generalisability over and access to multiple datasets is challenging to realise for real-world applications. To address these concerns, this paper proposes a single image, self-supervised (SS) CS-MRI framework that enables a joint deep and sparse regularisation of CS artefacts. The approach effectively dampens structured CS artefacts, which can be difficult to remove assuming sparse reconstruction, or relying solely on the inductive biases of CNN to produce noise-free images. Image quality is thereby improved compared to either approach alone. Metrics are evaluated using Cartesian 1D masks on a brain and knee dataset, with PSNR improving by 2-4dB on average.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Multi-scale MRI reconstruction via dilated ensemble networks
Authors:
Wendi Ma,
Marlon Bran Lorenzana,
Wei Dai,
Hongfu Sun,
Shekhar S. Chandra
Abstract:
As aliasing artefacts are highly structural and non-local, many MRI reconstruction networks use pooling to enlarge filter coverage and incorporate global context. However, this inadvertently impedes fine detail recovery as downsampling creates a resolution bottleneck. Moreover, real and imaginary features are commonly split into separate channels, discarding phase information particularly importan…
▽ More
As aliasing artefacts are highly structural and non-local, many MRI reconstruction networks use pooling to enlarge filter coverage and incorporate global context. However, this inadvertently impedes fine detail recovery as downsampling creates a resolution bottleneck. Moreover, real and imaginary features are commonly split into separate channels, discarding phase information particularly important to high frequency textures. In this work, we introduce an efficient multi-scale reconstruction network using dilated convolutions to preserve resolution and experiment with a complex-valued version using complex convolutions. Inspired by parallel dilated filters, multiple receptive fields are processed simultaneously with branches that see both large structural artefacts and fine local features. We also adopt dense residual connections for feature aggregation to efficiently increase scale and the deep cascade global architecture to reduce overfitting. The real-valued version of this model outperformed common reconstruction architectures as well as a state-of-the-art multi-scale network whilst being three times more efficient. The complex-valued network yielded better qualitative results when more phase information was present.
△ Less
Submitted 30 November, 2023; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Interpretable 3D Multi-Modal Residual Convolutional Neural Network for Mild Traumatic Brain Injury Diagnosis
Authors:
Hanem Ellethy,
Viktor Vegh,
Shekhar S. Chandra
Abstract:
Mild Traumatic Brain Injury (mTBI) is a significant public health challenge due to its high prevalence and potential for long-term health effects. Despite Computed Tomography (CT) being the standard diagnostic tool for mTBI, it often yields normal results in mTBI patients despite symptomatic evidence. This fact underscores the complexity of accurate diagnosis. In this study, we introduce an interp…
▽ More
Mild Traumatic Brain Injury (mTBI) is a significant public health challenge due to its high prevalence and potential for long-term health effects. Despite Computed Tomography (CT) being the standard diagnostic tool for mTBI, it often yields normal results in mTBI patients despite symptomatic evidence. This fact underscores the complexity of accurate diagnosis. In this study, we introduce an interpretable 3D Multi-Modal Residual Convolutional Neural Network (MRCNN) for mTBI diagnostic model enhanced with Occlusion Sensitivity Maps (OSM). Our MRCNN model exhibits promising performance in mTBI diagnosis, demonstrating an average accuracy of 82.4%, sensitivity of 82.6%, and specificity of 81.6%, as validated by a five-fold cross-validation process. Notably, in comparison to the CT-based Residual Convolutional Neural Network (RCNN) model, the MRCNN shows an improvement of 4.4% in specificity and 9.0% in accuracy. We show that the OSM offers superior data-driven insights into CT images compared to the Grad-CAM approach. These results highlight the efficacy of the proposed multi-modal model in enhancing the diagnostic precision of mTBI.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Ugly Ducklings or Swans: A Tiered Quadruplet Network with Patient-Specific Mining for Improved Skin Lesion Classification
Authors:
Nathasha Naranpanawa,
H. Peter Soyer,
Adam Mothershaw,
Gayan K. Kulatilleke,
Zongyuan Ge,
Brigid Betz-Stablein,
Shekhar S. Chandra
Abstract:
An ugly duckling is an obviously different skin lesion from surrounding lesions of an individual, and the ugly duckling sign is a criterion used to aid in the diagnosis of cutaneous melanoma by differentiating between highly suspicious and benign lesions. However, the appearance of pigmented lesions, can change drastically from one patient to another, resulting in difficulties in visual separation…
▽ More
An ugly duckling is an obviously different skin lesion from surrounding lesions of an individual, and the ugly duckling sign is a criterion used to aid in the diagnosis of cutaneous melanoma by differentiating between highly suspicious and benign lesions. However, the appearance of pigmented lesions, can change drastically from one patient to another, resulting in difficulties in visual separation of ugly ducklings. Hence, we propose DMT-Quadruplet - a deep metric learning network to learn lesion features at two tiers - patient-level and lesion-level. We introduce a patient-specific quadruplet mining approach together with a tiered quadruplet network, to drive the network to learn more contextual information both globally and locally between the two tiers. We further incorporate a dynamic margin within the patient-specific mining to allow more useful quadruplets to be mined within individuals. Comprehensive experiments show that our proposed method outperforms traditional classifiers, achieving 54% higher sensitivity than a baseline ResNet18 CNN and 37% higher than a naive triplet network in classifying ugly duckling lesions. Visualisation of the data manifold in the metric space further illustrates that DMT-Quadruplet is capable of classifying ugly duckling lesions in both patient-specific and patient-agnostic manner successfully.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Fractal Compressive Sensing
Authors:
Marlon Bran Lorenzana,
Benjamin Cottier,
Matthew Marques,
Andrew Kingston,
Shekhar S. Chandra
Abstract:
This paper introduces a sparse projection matrix composed of discrete (digital) periodic lines that create a pseudo-random (p.frac) sampling scheme. Our approach enables random Cartesian sampling whilst employing deterministic and one-dimensional (1D) trajectories derived from the discrete Radon transform (DRT). Unlike radial trajectories, DRT projections can be back-projected without interpolatio…
▽ More
This paper introduces a sparse projection matrix composed of discrete (digital) periodic lines that create a pseudo-random (p.frac) sampling scheme. Our approach enables random Cartesian sampling whilst employing deterministic and one-dimensional (1D) trajectories derived from the discrete Radon transform (DRT). Unlike radial trajectories, DRT projections can be back-projected without interpolation. Thus, we also propose a novel reconstruction method based on the exact projections of the DRT called finite Fourier reconstruction (FFR). We term this combined p.frac and FFR strategy, finite compressive sensing (FCS), with image recovery demonstrated on experimental and simulated data; image quality comparisons are made with Cartesian random sampling in 1D and two-dimensional (2D), as well as radial under-sampling in a more constrained experiment. Our experiments indicate FCS enables 3-5dB gain in peak signal-to-noise ratio (PSNR) for 2-, 4- and 8-fold under-sampling compared to 1D Cartesian random sampling. This paper aims to: Review common sampling strategies for compressed sensing (CS)-magnetic resonance imaging (MRI) to inform the motivation of a projective and Cartesian sampling scheme. Compare the incoherence of these sampling strategies and the proposed p.frac. Compare reconstruction quality of the sampling schemes under various reconstruction strategies to determine the suitability of p.frac for CS-MRI. It is hypothesised that because p.frac is a highly incoherent sampling scheme, that reconstructions will be of high quality compared to 1D Cartesian phase-encode under-sampling.
△ Less
Submitted 6 January, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review
Authors:
Fatima Al Zegair,
Nathasha Naranpanawa,
Brigid Betz-Stablein,
Monika Janda,
H. Peter Soyer,
Shekhar S. Chandra
Abstract:
Skin lesions known as naevi exhibit diverse characteristics such as size, shape, and colouration. The concept of an "Ugly Duckling Naevus" comes into play when monitoring for melanoma, referring to a lesion with distinctive features that sets it apart from other lesions in the vicinity. As lesions within the same individual typically share similarities and follow a predictable pattern, an ugly duc…
▽ More
Skin lesions known as naevi exhibit diverse characteristics such as size, shape, and colouration. The concept of an "Ugly Duckling Naevus" comes into play when monitoring for melanoma, referring to a lesion with distinctive features that sets it apart from other lesions in the vicinity. As lesions within the same individual typically share similarities and follow a predictable pattern, an ugly duckling naevus stands out as unusual and may indicate the presence of a cancerous melanoma. Computer-aided diagnosis (CAD) has become a significant player in the research and development field, as it combines machine learning techniques with a variety of patient analysis methods. Its aim is to increase accuracy and simplify decision-making, all while responding to the shortage of specialized professionals. These automated systems are especially important in skin cancer diagnosis where specialist availability is limited. As a result, their use could lead to life-saving benefits and cost reductions within healthcare. Given the drastic change in survival when comparing early stage to late-stage melanoma, early detection is vital for effective treatment and patient outcomes. Machine learning (ML) and deep learning (DL) techniques have gained popularity in skin cancer classification, effectively addressing challenges, and providing results equivalent to that of specialists. This article extensively covers modern Machine Learning and Deep Learning algorithms for detecting melanoma and suspicious naevi. It begins with general information on skin cancer and different types of naevi, then introduces AI, ML, DL, and CAD. The article then discusses the successful applications of various ML techniques like convolutional neural networks (CNN) for melanoma detection compared to dermatologists' performance. Lastly, it examines ML methods for UD naevus detection and identifying suspicious naevi.
△ Less
Submitted 5 September, 2023; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Structural and Magnetic Properties of V-Ti-Si Alloy Superconductors
Authors:
Asi Khandelwal,
L. S. Sharath Chandra,
Archna Sagdeo,
Rashmi Singh,
M. K. Chattopadhyay
Abstract:
The structural and magnetic properties of the as-cast and annealed V$_{0.6-x}$Si$_x$Ti$_{0.4}$ ($x$ = 0, 0.05, 0.10, 0.15) alloy superconductors are reported here. It is found that addition of silicon to the V-Ti alloys results in eutectic precipitation of Ti$_{5}$Si$_3$-phase in the body centred cubic (bcc) $β$-V-Ti matrix. In the as-cast V$_{0.6-x}$Si$_x$Ti$_{0.4}$ alloys, the superconducting tr…
▽ More
The structural and magnetic properties of the as-cast and annealed V$_{0.6-x}$Si$_x$Ti$_{0.4}$ ($x$ = 0, 0.05, 0.10, 0.15) alloy superconductors are reported here. It is found that addition of silicon to the V-Ti alloys results in eutectic precipitation of Ti$_{5}$Si$_3$-phase in the body centred cubic (bcc) $β$-V-Ti matrix. In the as-cast V$_{0.6-x}$Si$_x$Ti$_{0.4}$ alloys, the superconducting transition temperature (T$_{C}$) changes non-monotonically with increasing silicon content whereas after annealing, it is about 7.7 K for all the alloys. On the other hand, the upper critical field decreases and the coherence length increases after annealing in the x = 0.10 alloy. The variations in the superconducting properties in the alloys are related to the solubility of 6 at.% Si in the V$_{0.60}$Ti$_{0.40}$ alloy and the vanadium enrichment in the $β$ matrix due to the precipitation of Ti$_{5}$Si$_3$ phase.
△ Less
Submitted 4 August, 2023; v1 submitted 31 July, 2023;
originally announced July 2023.
-
TriFormer: A Multi-modal Transformer Framework For Mild Cognitive Impairment Conversion Prediction
Authors:
Linfeng Liu,
Junyan Lyu,
Siyu Liu,
Xiaoying Tang,
Shekhar S. Chandra,
Fatima A. Nasrallah
Abstract:
The prediction of mild cognitive impairment (MCI) conversion to Alzheimer's disease (AD) is important for early treatment to prevent or slow the progression of AD. To accurately predict the MCI conversion to stable MCI or progressive MCI, we propose Triformer, a novel transformer-based framework with three specialized transformers to incorporate multi-model data. Triformer uses I) an image transfo…
▽ More
The prediction of mild cognitive impairment (MCI) conversion to Alzheimer's disease (AD) is important for early treatment to prevent or slow the progression of AD. To accurately predict the MCI conversion to stable MCI or progressive MCI, we propose Triformer, a novel transformer-based framework with three specialized transformers to incorporate multi-model data. Triformer uses I) an image transformer to extract multi-view image features from medical scans, II) a clinical transformer to embed and correlate multi-modal clinical data, and III) a modality fusion transformer that produces an accurate prediction based on fusing the outputs from the image and clinical transformers. Triformer is evaluated on the Alzheimer's Disease Neuroimaging Initiative (ANDI)1 and ADNI2 datasets and outperforms previous state-of-the-art single and multi-modal methods.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Evidence-aware multi-modal data fusion and its application to total knee replacement prediction
Authors:
Xinwen Liu,
**g Wang,
S. Kevin Zhou,
Craig Engstrom,
Shekhar S. Chandra
Abstract:
Deep neural networks have been widely studied for predicting a medical condition, such as total knee replacement (TKR). It has shown that data of different modalities, such as imaging data, clinical variables and demographic information, provide complementary information and thus can improve the prediction accuracy together. However, the data sources of various modalities may not always be of high…
▽ More
Deep neural networks have been widely studied for predicting a medical condition, such as total knee replacement (TKR). It has shown that data of different modalities, such as imaging data, clinical variables and demographic information, provide complementary information and thus can improve the prediction accuracy together. However, the data sources of various modalities may not always be of high quality, and each modality may have only partial information of medical condition. Thus, predictions from different modalities can be opposite, and the final prediction may fail in the presence of such a conflict. Therefore, it is important to consider the reliability of each source data and the prediction output when making a final decision. In this paper, we propose an evidence-aware multi-modal data fusion framework based on the Dempster-Shafer theory (DST). The backbone models contain an image branch, a non-image branch and a fusion branch. For each branch, there is an evidence network that takes the extracted features as input and outputs an evidence score, which is designed to represent the reliability of the output from the current branch. The output probabilities along with the evidence scores from multiple branches are combined with the Dempster's combination rule to make a final prediction. Experimental results on the public OA initiative (OAI) dataset for the TKR prediction task show the superiority of the proposed fusion strategy on various backbone models.
△ Less
Submitted 24 March, 2023;
originally announced March 2023.
-
Explainable Semantic Medical Image Segmentation with Style
Authors:
Wei Dai,
Siyu Liu,
Craig B. Engstrom,
Shekhar S. Chandra
Abstract:
Semantic medical image segmentation using deep learning has recently achieved high accuracy, making it appealing to clinical problems such as radiation therapy. However, the lack of high-quality semantically labelled data remains a challenge leading to model brittleness to small shifts to input data. Most works require extra data for semi-supervised learning and lack the interpretability of the bo…
▽ More
Semantic medical image segmentation using deep learning has recently achieved high accuracy, making it appealing to clinical problems such as radiation therapy. However, the lack of high-quality semantically labelled data remains a challenge leading to model brittleness to small shifts to input data. Most works require extra data for semi-supervised learning and lack the interpretability of the boundaries of the training data distribution during training, which is essential for model deployment in clinical practice. We propose a fully supervised generative framework that can achieve generalisable segmentation with only limited labelled data by simultaneously constructing an explorable manifold during training. The proposed approach creates medical image style paired with a segmentation task driven discriminator incorporating end-to-end adversarial training. The discriminator is generalised to small domain shifts as much as permissible by the training data, and the generator automatically diversifies the training samples using a manifold of input features learnt during segmentation. All the while, the discriminator guides the manifold learning by supervising the semantic content and fine-grained features separately during the image diversification. After training, visualisation of the learnt manifold from the generator is available to interpret the model limits. Experiments on a fully semantic, publicly available pelvis dataset demonstrated that our method is more generalisable to shifts than other state-of-the-art methods while being more explainable using an explorable manifold.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
Towards Trustable Skin Cancer Diagnosis via Rewriting Model's Decision
Authors:
Siyuan Yan,
Zhen Yu,
Xuelin Zhang,
Dwarikanath Mahapatra,
Shekhar S. Chandra,
Monika Janda,
Peter Soyer,
Zongyuan Ge
Abstract:
Deep neural networks have demonstrated promising performance on image recognition tasks. However, they may heavily rely on confounding factors, using irrelevant artifacts or bias within the dataset as the cue to improve performance. When a model performs decision-making based on these spurious correlations, it can become untrustable and lead to catastrophic outcomes when deployed in the real-world…
▽ More
Deep neural networks have demonstrated promising performance on image recognition tasks. However, they may heavily rely on confounding factors, using irrelevant artifacts or bias within the dataset as the cue to improve performance. When a model performs decision-making based on these spurious correlations, it can become untrustable and lead to catastrophic outcomes when deployed in the real-world scene. In this paper, we explore and try to solve this problem in the context of skin cancer diagnosis. We introduce a human-in-the-loop framework in the model training process such that users can observe and correct the model's decision logic when confounding behaviors happen. Specifically, our method can automatically discover confounding factors by analyzing the co-occurrence behavior of the samples. It is capable of learning confounding concepts using easily obtained concept exemplars. By map** the black-box model's feature representation onto an explainable concept space, human users can interpret the concept and intervene via first order-logic instruction. We systematically evaluate our method on our newly crafted, well-controlled skin lesion dataset and several public skin lesion datasets. Experiments show that our method can effectively detect and remove confounding factors from datasets without any prior knowledge about the category distribution and does not require fully annotated concept labels. We also show that our method enables the model to focus on clinical-related concepts, improving the model's performance and trustworthiness during model inference.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
AliasNet: Alias Artefact Suppression Network for Accelerated Phase-Encode MRI
Authors:
Marlon E. Bran Lorenzana,
Shekhar S. Chandra,
Feng Liu
Abstract:
Sparse reconstruction is an important aspect of MRI, hel** to reduce acquisition time and improve spatial-temporal resolution. Popular methods are based mostly on compressed sensing (CS), which relies on the random sampling of k-space to produce incoherent (noise-like) artefacts. Due to hardware constraints, 1D Cartesian phase-encode under-sampling schemes are popular for 2D CS-MRI. However, 1D…
▽ More
Sparse reconstruction is an important aspect of MRI, hel** to reduce acquisition time and improve spatial-temporal resolution. Popular methods are based mostly on compressed sensing (CS), which relies on the random sampling of k-space to produce incoherent (noise-like) artefacts. Due to hardware constraints, 1D Cartesian phase-encode under-sampling schemes are popular for 2D CS-MRI. However, 1D under-sampling limits 2D incoherence between measurements, yielding structured aliasing artefacts (ghosts) that may be difficult to remove assuming a 2D sparsity model. Reconstruction algorithms typically deploy direction-insensitive 2D regularisation for these direction-associated artefacts. Recognising that phase-encode artefacts can be separated into contiguous 1D signals, we develop two decoupling techniques that enable explicit 1D regularisation and leverage the excellent 1D incoherence characteristics. We also derive a combined 1D + 2D reconstruction technique that takes advantage of spatial relationships within the image. Experiments conducted on retrospectively under-sampled brain and knee data demonstrate that combination of the proposed 1D AliasNet modules with existing 2D deep learned (DL) recovery techniques leads to an improvement in image quality. We also find AliasNet enables a superior scaling of performance compared to increasing the size of the original 2D network layers. AliasNet therefore improves the regularisation of aliasing artefacts arising from phase-encode under-sampling, by tailoring the network architecture to account for their expected appearance. The proposed 1D + 2D approach is compatible with any existing 2D DL recovery technique deployed for this application.
△ Less
Submitted 10 October, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
NBC-Softmax : Darkweb Author fingerprinting and migration tracking
Authors:
Gayan K. Kulatilleke,
Shekhar S. Chandra,
Marius Portmann
Abstract:
Metric learning aims to learn distances from the data, which enhances the performance of similarity-based algorithms. An author style detection task is a metric learning problem, where learning style features with small intra-class variations and larger inter-class differences is of great importance to achieve better performance. Recently, metric learning based on softmax loss has been used succes…
▽ More
Metric learning aims to learn distances from the data, which enhances the performance of similarity-based algorithms. An author style detection task is a metric learning problem, where learning style features with small intra-class variations and larger inter-class differences is of great importance to achieve better performance. Recently, metric learning based on softmax loss has been used successfully for style detection. While softmax loss can produce separable representations, its discriminative power is relatively poor. In this work, we propose NBC-Softmax, a contrastive loss based clustering technique for softmax loss, which is more intuitive and able to achieve superior performance. Our technique meets the criterion for larger number of samples, thus achieving block contrastiveness, which is proven to outperform pair-wise losses. It uses mini-batch sampling effectively and is scalable. Experiments on 4 darkweb social forums, with NBCSAuthor that uses the proposed NBC-Softmax for author and sybil detection, shows that our negative block contrastive approach constantly outperforms state-of-the-art methods using the same network architecture.
Our code is publicly available at : https://github.com/gayanku/NBC-Softmax
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
Automated anomaly-aware 3D segmentation of bones and cartilages in knee MR images from the Osteoarthritis Initiative
Authors:
Boyeong Woo,
Craig Engstrom,
William Baresic,
Jurgen Fripp,
Stuart Crozier,
Shekhar S. Chandra
Abstract:
In medical image analysis, automated segmentation of multi-component anatomical structures, which often have a spectrum of potential anomalies and pathologies, is a challenging task. In this work, we develop a multi-step approach using U-Net-based neural networks to initially detect anomalies (bone marrow lesions, bone cysts) in the distal femur, proximal tibia and patella from 3D magnetic resonan…
▽ More
In medical image analysis, automated segmentation of multi-component anatomical structures, which often have a spectrum of potential anomalies and pathologies, is a challenging task. In this work, we develop a multi-step approach using U-Net-based neural networks to initially detect anomalies (bone marrow lesions, bone cysts) in the distal femur, proximal tibia and patella from 3D magnetic resonance (MR) images of the knee in individuals with varying grades of osteoarthritis. Subsequently, the extracted data are used for downstream tasks involving semantic segmentation of individual bone and cartilage volumes as well as bone anomalies. For anomaly detection, the U-Net-based models were developed to reconstruct the bone profiles of the femur and tibia in images via inpainting so anomalous bone regions could be replaced with close to normal appearances. The reconstruction error was used to detect bone anomalies. A second anomaly-aware network, which was compared to anomaly-naïve segmentation networks, was used to provide a final automated segmentation of the femoral, tibial and patellar bones and cartilages from the knee MR images containing a spectrum of bone anomalies. The anomaly-aware segmentation approach provided up to 58% reduction in Hausdorff distances for bone segmentations compared to the results from the anomaly-naïve segmentation networks. In addition, the anomaly-aware networks were able to detect bone lesions in the MR images with greater sensitivity and specificity (area under the receiver operating characteristic curve [AUC] up to 0.896) compared to the anomaly-naïve segmentation networks (AUC up to 0.874).
△ Less
Submitted 1 December, 2022; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Cascaded Multi-Modal Mixing Transformers for Alzheimer's Disease Classification with Incomplete Data
Authors:
Linfeng Liu,
Siyu Liu,
Lu Zhang,
Xuan Vinh To,
Fatima Nasrallah,
Shekhar S. Chandra
Abstract:
Accurate medical classification requires a large number of multi-modal data, and in many cases, different feature types. Previous studies have shown promising results when using multi-modal data, outperforming single-modality models when classifying diseases such as Alzheimer's Disease (AD). However, those models are usually not flexible enough to handle missing modalities. Currently, the most com…
▽ More
Accurate medical classification requires a large number of multi-modal data, and in many cases, different feature types. Previous studies have shown promising results when using multi-modal data, outperforming single-modality models when classifying diseases such as Alzheimer's Disease (AD). However, those models are usually not flexible enough to handle missing modalities. Currently, the most common workaround is discarding samples with missing modalities which leads to considerable data under-utilization. Adding to the fact that labeled medical images are already scarce, the performance of data-driven methods like deep learning can be severely hampered. Therefore, a multi-modal method that can handle missing data in various clinical settings is highly desirable. In this paper, we present Multi-Modal Mixing Transformer (3MAT), a disease classification transformer that not only leverages multi-modal data but also handles missing data scenarios. In this work, we test 3MT for AD and Cognitively normal (CN) classification and mild cognitive impairment (MCI) conversion prediction to progressive MCI (pMCI) or stable MCI (sMCI) using clinical and neuroimaging data. The model uses a novel Cascaded Modality Transformer architecture with cross-attention to incorporate multi-modal information for more informed predictions. We propose a novel modality dropout mechanism to ensure an unprecedented level of modality independence and robustness to handle missing data scenarios. The result is a versatile network that enables the mixing of arbitrary numbers of modalities with different feature types and also ensures full data utilization missing data scenarios. The model is trained and evaluated on the ADNI dataset with the SOTRA performance and further evaluated with the AIBL dataset with missing data.
△ Less
Submitted 16 July, 2023; v1 submitted 1 October, 2022;
originally announced October 2022.
-
Efficient block contrastive learning via parameter-free meta-node approximation
Authors:
Gayan K. Kulatilleke,
Marius Portmann,
Shekhar S. Chandra
Abstract:
Contrastive learning has recently achieved remarkable success in many domains including graphs. However contrastive loss, especially for graphs, requires a large number of negative samples which is unscalable and computationally prohibitive with a quadratic time complexity. Sub-sampling is not optimal and incorrect negative sampling leads to sampling bias. In this work, we propose a meta-node base…
▽ More
Contrastive learning has recently achieved remarkable success in many domains including graphs. However contrastive loss, especially for graphs, requires a large number of negative samples which is unscalable and computationally prohibitive with a quadratic time complexity. Sub-sampling is not optimal and incorrect negative sampling leads to sampling bias. In this work, we propose a meta-node based approximation technique that can (a) proxy all negative combinations (b) in quadratic cluster size time complexity, (c) at graph level, not node level, and (d) exploit graph sparsity. By replacing node-pairs with additive cluster-pairs, we compute the negatives in cluster-time at graph level. The resulting Proxy approximated meta-node Contrastive (PamC) loss, based on simple optimized GPU operations, captures the full set of negatives, yet is efficient with a linear time complexity. By avoiding sampling, we effectively eliminate sample bias. We meet the criterion for larger number of samples, thus achieving block-contrastiveness, which is proven to outperform pair-wise losses. We use learnt soft cluster assignments for the meta-node constriction, and avoid possible heterophily and noise added during edge creation. Theoretically, we show that real world graphs easily satisfy conditions necessary for our approximation. Empirically, we show promising accuracy gains over state-of-the-art graph clustering on 6 benchmarks. Importantly, we gain substantially in efficiency; up to 3x in training time, 1.8x in inference time and over 5x in GPU memory reduction.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
Temperature and terahertz frequency dependence of the dielectric properties of Fe3O4 thin films deposited on Si substrate
Authors:
Ashish Khandelwal,
L. S. Sharath Chandra,
Shilpam Sharma,
Archna Sagdeo,
J. Jayabalan,
R. J. Choudhary,
M. K. Chattopadhyay
Abstract:
The Fe$_3$O$_4$/Si films are considered to be promising materials for THz spintronic applications due to their high temperature magnetic transition and semiconducting properties. In this article, we present the real part of the dielectric constant ($ε_1$) and the optical conductivity ($σ_1$) of Fe$_3$O$_4$ films of different thicknesses deposited on Si substrate (Fe$_3$O$_4$/Si) in the THz range a…
▽ More
The Fe$_3$O$_4$/Si films are considered to be promising materials for THz spintronic applications due to their high temperature magnetic transition and semiconducting properties. In this article, we present the real part of the dielectric constant ($ε_1$) and the optical conductivity ($σ_1$) of Fe$_3$O$_4$ films of different thicknesses deposited on Si substrate (Fe$_3$O$_4$/Si) in the THz range at temperatures 2- 300 K. Although the magnetization of the films with thickness $\geq$ 115 nm shows a clear change at the Verwey transition temperature T$_v$ = 121 K, their optical properties in the THz frequency range are drastically different from each other. We have shown that $σ_1$ is maximum and $ε_1$ is minimum when the Fe$^{+2}$/Fe$^{+3}$ ratio is equal to 0.54 which is the ratio of Fe+2/Fe+3 for pure Fe$_3$O$_4$. The $σ_1$ reduces and $ε_1$ increases at all temperatures when the Fe$^{+2}$/Fe$^{+3}$ ratio deviates from 0.54. We have shown that a slight change in the Fe$^{+2}$/Fe$^{+3}$ ratio can induce large changes in the optical properties which shall have implications in the application of the Fe3O4 films in THz spintronics.
△ Less
Submitted 23 September, 2022;
originally announced September 2022.
-
Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings
Authors:
Zhen Yu,
Toan Nguyen,
Yaniv Gal,
Lie Ju,
Shekhar S. Chandra,
Lei Zhang,
Paul Bonnington,
Victoria Mar,
Zhiyong Wang,
Zongyuan Ge
Abstract:
In practice, many medical datasets have an underlying taxonomy defined over the disease label space. However, existing classification algorithms for medical diagnoses often assume semantically independent labels. In this study, we aim to leverage class hierarchy with deep learning algorithms for more accurate and reliable skin lesion recognition. We propose a hyperbolic network to learn image embe…
▽ More
In practice, many medical datasets have an underlying taxonomy defined over the disease label space. However, existing classification algorithms for medical diagnoses often assume semantically independent labels. In this study, we aim to leverage class hierarchy with deep learning algorithms for more accurate and reliable skin lesion recognition. We propose a hyperbolic network to learn image embeddings and class prototypes jointly. The hyperbola provably provides a space for modeling hierarchical relations better than Euclidean geometry. Meanwhile, we restrict the distribution of hyperbolic prototypes with a distance matrix that is encoded from the class hierarchy. Accordingly, the learned prototypes preserve the semantic class relations in the embedding space and we can predict the label of an image by assigning its feature to the nearest hyperbolic class prototype. We use an in-house skin lesion dataset which consists of around 230k dermoscopic images on 65 skin diseases to verify our method. Extensive experiments provide evidence that our model can achieve higher accuracy with less severe classification errors than models without considering class relations.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
SCGC : Self-Supervised Contrastive Graph Clustering
Authors:
Gayan K. Kulatilleke,
Marius Portmann,
Shekhar S. Chandra
Abstract:
Graph clustering discovers groups or communities within networks. Deep learning methods such as autoencoders (AE) extract effective clustering and downstream representations but cannot incorporate rich structural information. While Graph Neural Networks (GNN) have shown great success in encoding graph structure, typical GNNs based on convolution or attention variants suffer from over-smoothing, no…
▽ More
Graph clustering discovers groups or communities within networks. Deep learning methods such as autoencoders (AE) extract effective clustering and downstream representations but cannot incorporate rich structural information. While Graph Neural Networks (GNN) have shown great success in encoding graph structure, typical GNNs based on convolution or attention variants suffer from over-smoothing, noise, heterophily, are computationally expensive and typically require the complete graph being present. Instead, we propose Self-Supervised Contrastive Graph Clustering (SCGC), which imposes graph-structure via contrastive loss signals to learn discriminative node representations and iteratively refined soft cluster labels. We also propose SCGC*, with a more effective, novel, Influence Augmented Contrastive (IAC) loss to fuse richer structural information, and half the original model parameters. SCGC(*) is faster with simple linear units, completely eliminate convolutions and attention of traditional GNNs, yet efficiently incorporates structure. It is impervious to layer depth and robust to over-smoothing, incorrect edges and heterophily. It is scalable by batching, a limitation in many prior GNN models, and trivially parallelizable. We obtain significant improvements over state-of-the-art on a wide range of benchmark graph datasets, including images, sensor data, text, and citation networks efficiently. Specifically, 20% on ARI and 18% on NMI for DBLP; overall 55% reduction in training time and overall, 81% reduction on inference time. Our code is available at : https://github.com/gayanku/SCGC
△ Less
Submitted 26 April, 2022;
originally announced April 2022.
-
Transformer Compressed Sensing via Global Image Tokens
Authors:
Marlon Bran Lorenzana,
Craig Engstrom,
Shekhar S. Chandra
Abstract:
Convolutional neural networks (CNN) have demonstrated outstanding Compressed Sensing (CS) performance compared to traditional, hand-crafted methods. However, they are broadly limited in terms of generalisability, inductive bias and difficulty to model long distance relationships. Transformer neural networks (TNN) overcome such issues by implementing an attention mechanism designed to capture depen…
▽ More
Convolutional neural networks (CNN) have demonstrated outstanding Compressed Sensing (CS) performance compared to traditional, hand-crafted methods. However, they are broadly limited in terms of generalisability, inductive bias and difficulty to model long distance relationships. Transformer neural networks (TNN) overcome such issues by implementing an attention mechanism designed to capture dependencies between inputs. However, high-resolution tasks typically require vision Transformers (ViT) to decompose an image into patch-based tokens, limiting inputs to inherently local contexts. We propose a novel image decomposition that naturally embeds images into low-resolution inputs. These Kaleidoscope tokens (KD) provide a mechanism for global attention, at the same computational cost as a patch-based approach. To showcase this development, we replace CNN components in a well-known CS-MRI neural network with TNN blocks and demonstrate the improvements afforded by KD. We also propose an ensemble of image tokens, which enhance overall image quality and reduces model size. Supplementary material is available: https://github.com/uqmarlonbran/TCS.git
△ Less
Submitted 12 July, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
Undersampled MRI Reconstruction with Side Information-Guided Normalisation
Authors:
Xinwen Liu,
**g Wang,
Cheng Peng,
Shekhar S. Chandra,
Feng Liu,
S. Kevin Zhou
Abstract:
Magnetic resonance (MR) images exhibit various contrasts and appearances based on factors such as different acquisition protocols, views, manufacturers, scanning parameters, etc. This generally accessible appearance-related side information affects deep learning-based undersampled magnetic resonance imaging (MRI) reconstruction frameworks, but has been overlooked in the majority of current works.…
▽ More
Magnetic resonance (MR) images exhibit various contrasts and appearances based on factors such as different acquisition protocols, views, manufacturers, scanning parameters, etc. This generally accessible appearance-related side information affects deep learning-based undersampled magnetic resonance imaging (MRI) reconstruction frameworks, but has been overlooked in the majority of current works. In this paper, we investigate the use of such side information as normalisation parameters in a convolutional neural network (CNN) to improve undersampled MRI reconstruction. Specifically, a Side Information-Guided Normalisation (SIGN) module, containing only few layers, is proposed to efficiently encode the side information and output the normalisation parameters. We examine the effectiveness of such a module on two popular reconstruction architectures, D5C5 and OUCR. The experimental results on both brain and knee images under various acceleration rates demonstrate that the proposed method improves on its corresponding baseline architectures with a significant margin.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
Automated volumetric and statistical shape assessment of cam-type morphology of the femoral head-neck region from 3D magnetic resonance images
Authors:
Jessica M. Bugeja,
Ying Xia,
Shekhar S. Chandra,
Nicholas J. Murphy,
Jillian Eyles,
Libby Spiers,
Stuart Crozier,
David J. Hunter,
Jurgen Fripp,
Craig Engstrom
Abstract:
Femoroacetabular im**ement (FAI) cam morphology is routinely assessed using two-dimensional alpha angles which do not provide specific data on cam size characteristics. The purpose of this study is to implement a novel, automated three-dimensional (3D) pipeline, CamMorph, for segmentation and measurement of cam volume, surface area and height from magnetic resonance (MR) images in patients with…
▽ More
Femoroacetabular im**ement (FAI) cam morphology is routinely assessed using two-dimensional alpha angles which do not provide specific data on cam size characteristics. The purpose of this study is to implement a novel, automated three-dimensional (3D) pipeline, CamMorph, for segmentation and measurement of cam volume, surface area and height from magnetic resonance (MR) images in patients with FAI. The CamMorph pipeline involves two processes: i) proximal femur segmentation using an approach integrating 3D U-net with focused shape modelling (FSM); ii) use of patient-specific anatomical information from 3D FSM to simulate healthy femoral bone models and pathological region constraints to identify cam bone mass. Agreement between manual and automated segmentation of the proximal femur was evaluated with the Dice similarity index (DSI) and surface distance measures. Independent t-tests or Mann-Whitney U rank tests were used to compare the femoral head volume, cam volume, surface area and height data between female and male patients with FAI. There was a mean DSI value of 0.964 between manual and automated segmentation of proximal femur volume. Compared to female FAI patients, male patients had a significantly larger mean femoral head volume (66.12cm3 v 46.02cm3, p<0.001). Compared to female FAI patients, male patients had a significantly larger mean cam volume (1136.87mm3 v 337.86mm3, p<0.001), surface area (657.36mm2 v 306.93mm2 , p<0.001), maximum-height (3.89mm v 2.23mm, p<0.001) and average-height (1.94mm v 1.00mm, p<0.001). Automated analyses of 3D MR images from patients with FAI using the CamMorph pipeline showed that, in comparison with female patients, male patients had significantly greater cam volume, surface area and height.
△ Less
Submitted 5 December, 2021;
originally announced December 2021.
-
Enhancement of functional properties of V$_{0.6}$Ti$_{0.4}$ alloy superconductor by the addition of yttrium
Authors:
SK. Ramjan,
L. S. Sharath Chandra,
Rashmi Singh,
P. Ganesh,
Archna Sagdeo,
M. K. Chattopadhyay
Abstract:
We show here that the yttrium is immiscible and precipitates with various sizes in the body centred cubic V$_{0.6}$Ti$_{0.4}$ alloy superconductor. The number and size of the precipitates are found to depend on the amount of yttrium added. Precipitates with various sizes up to 30~$μ$m are found in the V$_{0.6}$Ti$_{0.4}$ alloy containing 5 at.\% yttrium. The large amount of line disorders generate…
▽ More
We show here that the yttrium is immiscible and precipitates with various sizes in the body centred cubic V$_{0.6}$Ti$_{0.4}$ alloy superconductor. The number and size of the precipitates are found to depend on the amount of yttrium added. Precipitates with various sizes up to 30~$μ$m are found in the V$_{0.6}$Ti$_{0.4}$ alloy containing 5 at.\% yttrium. The large amount of line disorders generated by the addition of yttrium in this alloy are found to be effective in pinning the magnetic flux lines. While the superconducting transition temperature increases with the increasing amount of yttrium in the V$_{0.6}$Ti$_{0.4}$ alloy, the critical current density is maximum for the alloy containing 2 at. \% yttrium, where it is more than 7.5 times the parent alloy in fields higher than 1~T. We found that the effectiveness of each type of defect in pinning the flux lines is dependent on the temperature and the applied magnetic filed.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Temperature induced first order electronic topological transition in $β$-Ag$_2$Se
Authors:
L. S. Sharath Chandra,
SK. Ramjan,
Soma Banik,
Archna Sagdeo,
M. K. Chattopadhyay
Abstract:
$β$-Ag$_2$Se is a promising material for room temperature thermoelectric applications and magneto-resistive sensors. However, no attention was paid earlier to the hysteresis in the temperature dependence of resistivity ($ρ$($T$)). Here, we show that a broad hysteresis above 35 K is observed not only in $ρ$($T$), but also in other electronic properties such as Hall coefficient ($R_H$($T…
▽ More
$β$-Ag$_2$Se is a promising material for room temperature thermoelectric applications and magneto-resistive sensors. However, no attention was paid earlier to the hysteresis in the temperature dependence of resistivity ($ρ$($T$)). Here, we show that a broad hysteresis above 35 K is observed not only in $ρ$($T$), but also in other electronic properties such as Hall coefficient ($R_H$($T$)), Seebeck coefficient, thermal conductivity and ultraviolet photoelectron spectra (UPS). We also show that the hysteresis is not associated with a structural transition. The $ρ$($T$) and $R_H$($T$) show that $β$-Ag$_2$Se is semiconducting above 300 K, but metallicity is retained below 300 K. While electronic states are absent in the energy range from the Fermi level ($E_F$) to 0.4 eV below the $E_F$ at 300 K, a distinct Fermi edge is observed in the UPS at 15 K suggesting that the $β$-Ag$_2$Se undergoes an electronic topological transition from a high temperature semiconducting state to a low temperature metallic state. Our study reveals that a constant and moderately high thermoelectric figure of merit ($ZT$) in the range 300-395 K is observed due to the broad semiconductor to metal transition in $β$-Ag$_2$Se.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
FDGATII : Fast Dynamic Graph Attention with Initial Residual and Identity Map**
Authors:
Gayan K. Kulatilleke,
Marius Portmann,
Ryan Ko,
Shekhar S. Chandra
Abstract:
While Graph Neural Networks have gained popularity in multiple domains, graph-structured input remains a major challenge due to (a) over-smoothing, (b) noisy neighbours (heterophily), and (c) the suspended animation problem. To address all these problems simultaneously, we propose a novel graph neural network FDGATII, inspired by attention mechanism's ability to focus on selective information supp…
▽ More
While Graph Neural Networks have gained popularity in multiple domains, graph-structured input remains a major challenge due to (a) over-smoothing, (b) noisy neighbours (heterophily), and (c) the suspended animation problem. To address all these problems simultaneously, we propose a novel graph neural network FDGATII, inspired by attention mechanism's ability to focus on selective information supplemented with two feature preserving mechanisms. FDGATII combines Initial Residuals and Identity Map** with the more expressive dynamic self-attention to handle noise prevalent from the neighbourhoods in heterophilic data sets. By using sparse dynamic attention, FDGATII is inherently parallelizable in design, whist efficient in operation; thus theoretically able to scale to arbitrary graphs with ease. Our approach has been extensively evaluated on 7 datasets. We show that FDGATII outperforms GAT and GCN based benchmarks in accuracy and performance on fully supervised tasks, obtaining state-of-the-art results on Chameleon and Cornell datasets with zero domain-specific graph pre-processing, and demonstrate its versatility and fairness.
△ Less
Submitted 25 October, 2021; v1 submitted 21 October, 2021;
originally announced October 2021.
-
CAN3D: Fast 3D Medical Image Segmentation via Compact Context Aggregation
Authors:
Wei Dai,
Boyeong Woo,
Siyu Liu,
Matthew Marques,
Craig B. Engstrom,
Peter B. Greer,
Stuart Crozier,
Jason A. Dowling,
Shekhar S. Chandra
Abstract:
Direct automatic segmentation of objects from 3D medical imaging, such as magnetic resonance (MR) imaging, is challenging as it often involves accurately identifying a number of individual objects with complex geometries within a large volume under investigation. To address these challenges, most deep learning approaches typically enhance their learning capability by substantially increasing the c…
▽ More
Direct automatic segmentation of objects from 3D medical imaging, such as magnetic resonance (MR) imaging, is challenging as it often involves accurately identifying a number of individual objects with complex geometries within a large volume under investigation. To address these challenges, most deep learning approaches typically enhance their learning capability by substantially increasing the complexity or the number of trainable parameters within their models. Consequently, these models generally require long inference time on standard workstations operating clinical MR systems and are restricted to high-performance computing hardware due to their large memory requirement. Further, to fit 3D dataset through these large models using limited computer memory, trade-off techniques such as patch-wise training are often used which sacrifice the fine-scale geometric information from input images which could be clinically significant for diagnostic purposes. To address these challenges, we present a compact convolutional neural network with a shallow memory footprint to efficiently reduce the number of model parameters required for state-of-art performance. This is critical for practical employment as most clinical environments only have low-end hardware with limited computing power and memory. The proposed network can maintain data integrity by directly processing large full-size 3D input volumes with no patches required and significantly reduces the computational time required for both training and inference. We also propose a novel loss function with extra shape constraint to improve the accuracy for imbalanced classes in 3D MR images.
△ Less
Submitted 22 September, 2021; v1 submitted 12 September, 2021;
originally announced September 2021.
-
Bespoke Fractal Sampling Patterns for Discrete Fourier Space via the Kaleidoscope Transform
Authors:
Jacob M. White,
Stuart Crozier,
Shekhar S. Chandra
Abstract:
Sampling strategies are important for sparse imaging methodologies, especially those employing the discrete Fourier transform (DFT). Chaotic sensing is one such methodology that employs deterministic, fractal sampling in conjunction with finite, iterative reconstruction schemes to form an image from limited samples. Using a sampling pattern constructed entirely from periodic lines in DFT space, ch…
▽ More
Sampling strategies are important for sparse imaging methodologies, especially those employing the discrete Fourier transform (DFT). Chaotic sensing is one such methodology that employs deterministic, fractal sampling in conjunction with finite, iterative reconstruction schemes to form an image from limited samples. Using a sampling pattern constructed entirely from periodic lines in DFT space, chaotic sensing was found to outperform traditional compressed sensing for magnetic resonance imaging; however, only one such sampling pattern was presented and the reason for its fractal nature was not proven. Through the introduction of a novel image transform known as the kaleidoscope transform, which formalises and extends upon the concept of downsampling and concatenating an image with itself, this paper: (1) demonstrates a fundamental relationship between multiplication in modular arithmetic and downsampling; (2) provides a rigorous mathematical explanation for the fractal nature of the sampling pattern in the DFT; and (3) leverages this understanding to develop a collection of novel fractal sampling patterns for the 2D DFT with customisable properties. The ability to design tailor-made fractal sampling patterns expands the utility of the DFT in chaotic imaging and may form the basis for a bespoke chaotic sensing methodology, in which the fractal sampling matches the imaging task for improved reconstruction.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
A Survey on Machine Learning Algorithms for Applications in Cognitive Radio Networks
Authors:
Akshay Upadhye,
Purushothaman Saravanan,
Shreeram Suresh Chandra,
Sanjeev Gurugopinath
Abstract:
In this paper, we present a survey on the utility of machine learning (ML) algorithms for applications in cognitive radio networks (CRN). We start with a high-level overview of some of the major challenges in CRNs, and mention the ML architectures and algorithms that can be used to alleviate them. In particular, our focus is on two fundamental applications in CRNs, namely spectrum sensing -- with…
▽ More
In this paper, we present a survey on the utility of machine learning (ML) algorithms for applications in cognitive radio networks (CRN). We start with a high-level overview of some of the major challenges in CRNs, and mention the ML architectures and algorithms that can be used to alleviate them. In particular, our focus is on two fundamental applications in CRNs, namely spectrum sensing -- with non-cooperative and cooperative scenarios, and dynamic spectrum access -- with spectrum auction and prediction. We present a detailed study of recent advancements in the field of ML in CRNs for these applications, and briefly discuss the set of challenges in real-time implementation of ML techniques for CRNs.
△ Less
Submitted 18 June, 2021;
originally announced June 2021.
-
Interesting magnetic response of the nuclear fuel material UO2
Authors:
Sudip Pal,
L. S. Sharath Chandra,
M. K. Chattopadhyay,
S. B. Roy
Abstract:
Magnetic response of uranium dioxide (UO2) has been investigated through temperature and magnetic field dependent dc magnetization measurements. UO2 is a paramagnet at room temperature. The magnetic susceptibility, however, deviates from Curie-Weiss (CW) like paramagnetic behavior below T = 280 K. Further down the temperature UO2 undergoes phase transition to an antiferromagnetic state below TN =…
▽ More
Magnetic response of uranium dioxide (UO2) has been investigated through temperature and magnetic field dependent dc magnetization measurements. UO2 is a paramagnet at room temperature. The magnetic susceptibility, however, deviates from Curie-Weiss (CW) like paramagnetic behavior below T = 280 K. Further down the temperature UO2 undergoes phase transition to an antiferromagnetic state below TN = 30.6 K. The zero field cooled (ZFC) and field cooled (FC) magnetizations exhibit some distinct thermomagnetic irreversibility below TN. The temperature dependence of the FC magnetization is more like a ferromagnet, whereas ZFC magnetization exhibits distinct structures not usually observed in the antiferromagnets. In low applied magnetic field this thermomagnetic irreversibility in magnetization exists in a subtle way even in the paramagnetic regime above TN up to a fairly high temperature, but vanishes in high applied magnetic fields. Deviation from CW law and irreversibility between ZFC and FC magnetization indicate that the paramagnetic state above TN is not a trivial one. Magnetic response below TN changes significantly with the increase in the applied magnetic field. Thermomagnetic irreversibility in magnetization initially increases with the increase in the strength of applied magnetic field, but then gets reduced in the high applied fields. A subtle signature of a magnetic field induced phase transition is also observed in the isothermal magnetic field vartaion of magnetization. All these experimetal results highlight the non-trivial nature of the antiferromagnetic state in UO2
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
Deep Simultaneous Optimisation of Sampling and Reconstruction for Multi-contrast MRI
Authors:
Xinwen Liu,
**g Wang,
Fangfang Tang,
Shekhar S. Chandra,
Feng Liu,
Stuart Crozier
Abstract:
MRI images of the same subject in different contrasts contain shared information, such as the anatomical structure. Utilizing the redundant information amongst the contrasts to sub-sample and faithfully reconstruct multi-contrast images could greatly accelerate the imaging speed, improve image quality and shorten scanning protocols. We propose an algorithm that generates the optimised sampling pat…
▽ More
MRI images of the same subject in different contrasts contain shared information, such as the anatomical structure. Utilizing the redundant information amongst the contrasts to sub-sample and faithfully reconstruct multi-contrast images could greatly accelerate the imaging speed, improve image quality and shorten scanning protocols. We propose an algorithm that generates the optimised sampling pattern and reconstruction scheme of one contrast (e.g. T2-weighted image) when images with different contrast (e.g. T1-weighted image) have been acquired. The proposed algorithm achieves increased PSNR and SSIM with the resulting optimal sampling pattern compared to other acquisition patterns and single contrast methods.
△ Less
Submitted 30 March, 2021;
originally announced March 2021.
-
Coexisting superconductivity and ferromagnetism in the (V$_{0.60}$Ti$_{0.40}$)-Gd alloys
Authors:
Sabyasachi Paul,
SK. Ramjan,
L. S. Sharath Chandra,
M. K. Chattopadhyay
Abstract:
We present here, the effect of microstructure on the magnetic, electrical and thermal properties of (V$_{0.60}$Ti$_{0.40}$)-Gd alloys. The gadolinium is found to be immiscible and precipitates with a size $<$1.2~$μ$m in the (V$_{0.60}$Ti$_{0.40}$)-Gd alloys. These precipitates enhance the grain boundary density. The (V$_{0.60}$Ti$_{0.40}$)-Gd alloys become ferromagnetic below $T_{mc}$ = 295~K with…
▽ More
We present here, the effect of microstructure on the magnetic, electrical and thermal properties of (V$_{0.60}$Ti$_{0.40}$)-Gd alloys. The gadolinium is found to be immiscible and precipitates with a size $<$1.2~$μ$m in the (V$_{0.60}$Ti$_{0.40}$)-Gd alloys. These precipitates enhance the grain boundary density. The (V$_{0.60}$Ti$_{0.40}$)-Gd alloys become ferromagnetic below $T_{mc}$ = 295~K with an increase in the superconducting transition temperature ($T_{sc}$). Though the disorder increases with increasing Gd content, the electronic thermal conductivity ($κ_{e} (H = 0)$) reduces by at most 15\% which is in contrast with the 80\% decrease of the phononic thermal conductivity ($κ_{l} (H = 0)$). Our analysis suggests that the magnetic moments of Gd precipitates polarize the conduction electrons along and around the grain boundaries leading to increase in the mean free path of the electrons. The partial suppression of spin fluctuations in the (V$_{0.60}$Ti$_{0.40}$)-Gd alloy by the conduction electron polarization enhances the $T_{sc}$.
△ Less
Submitted 25 March, 2021;
originally announced March 2021.
-
Manipulating Medical Image Translation with Manifold Disentanglement
Authors:
Siyu Liu,
Jason A. Dowling,
Craig Engstrom,
Peter B. Greer,
Stuart Crozier,
Shekhar S. Chandra
Abstract:
Medical image translation (e.g. CT to MR) is a challenging task as it requires I) faithful translation of domain-invariant features (e.g. shape information of anatomical structures) and II) realistic synthesis of target-domain features (e.g. tissue appearance in MR). In this work, we propose Manifold Disentanglement Generative Adversarial Network (MDGAN), a novel image translation framework that e…
▽ More
Medical image translation (e.g. CT to MR) is a challenging task as it requires I) faithful translation of domain-invariant features (e.g. shape information of anatomical structures) and II) realistic synthesis of target-domain features (e.g. tissue appearance in MR). In this work, we propose Manifold Disentanglement Generative Adversarial Network (MDGAN), a novel image translation framework that explicitly models these two types of features. It employs a fully convolutional generator to model domain-invariant features, and it uses style codes to separately model target-domain features as a manifold. This design aims to explicitly disentangle domain-invariant features and domain-specific features while gaining individual control of both. The image translation process is formulated as a stylisation task, where the input is "stylised" (translated) into diverse target-domain images based on style codes sampled from the learnt manifold. We test MDGAN for multi-modal medical image translation, where we create two domain-specific manifold clusters on the manifold to translate segmentation maps into pseudo-CT and pseudo-MR images, respectively. We show that by traversing a path across the MR manifold cluster, the target output can be manipulated while still retaining the shape information from the input.
△ Less
Submitted 27 November, 2020;
originally announced November 2020.
-
Automatic lesion detection, segmentation and characterization via 3D multiscale morphological sifting in breast MRI
Authors:
Hang Min,
Darryl McClymont,
Shekhar S. Chandra,
Stuart Crozier,
Andrew P. Bradley
Abstract:
Previous studies on computer aided detection/diagnosis (CAD) in 4D breast magnetic resonance imaging (MRI) regard lesion detection, segmentation and characterization as separate tasks, and typically require users to manually select 2D MRI slices or regions of interest as the input. In this work, we present a breast MRI CAD system that can handle 4D multimodal breast MRI data, and integrate lesion…
▽ More
Previous studies on computer aided detection/diagnosis (CAD) in 4D breast magnetic resonance imaging (MRI) regard lesion detection, segmentation and characterization as separate tasks, and typically require users to manually select 2D MRI slices or regions of interest as the input. In this work, we present a breast MRI CAD system that can handle 4D multimodal breast MRI data, and integrate lesion detection, segmentation and characterization with no user intervention. The proposed CAD system consists of three major stages: region candidate generation, feature extraction and region candidate classification. Breast lesions are firstly extracted as region candidates using the novel 3D multiscale morphological sifting (MMS). The 3D MMS, which uses linear structuring elements to extract lesion-like patterns, can segment lesions from breast images accurately and efficiently. Analytical features are then extracted from all available 4D multimodal breast MRI sequences, including T1-, T2-weighted and DCE sequences, to represent the signal intensity, texture, morphological and enhancement kinetic characteristics of the region candidates. The region candidates are lastly classified as lesion or normal tissue by the random under-sampling boost (RUSboost), and as malignant or benign lesion by the random forest. Evaluated on a breast MRI dataset which contains a total of 117 cases with 95 malignant and 46 benign lesions, the proposed system achieves a true positive rate (TPR) of 0.90 at 3.19 false positives per patient (FPP) for lesion detection and a TPR of 0.91 at a FPP of 2.95 for identifying malignant lesions without any user intervention. The average dice similarity index (DSI) is 0.72 for lesion segmentation. Compared with previously proposed systems evaluated on the same breast MRI dataset, the proposed CAD system achieves a favourable performance in breast lesion detection and characterization.
△ Less
Submitted 7 July, 2020;
originally announced July 2020.
-
Generalisable 3D Fabric Architecture for Streamlined Universal Multi-Dataset Medical Image Segmentation
Authors:
Siyu Liu,
Wei Dai,
Craig Engstrom,
Jurgen Fripp,
Stuart Crozier,
Jason A. Dowling,
Shekhar S. Chandra
Abstract:
Data scarcity is common in deep learning models for medical image segmentation. Previous works proposed multi-dataset learning, either simultaneously or via transfer learning to expand training sets. However, medical image datasets have diverse-sized images and features, and develo** a model simultaneously for multiple datasets is challenging. This work proposes Fabric Image Representation Encod…
▽ More
Data scarcity is common in deep learning models for medical image segmentation. Previous works proposed multi-dataset learning, either simultaneously or via transfer learning to expand training sets. However, medical image datasets have diverse-sized images and features, and develo** a model simultaneously for multiple datasets is challenging. This work proposes Fabric Image Representation Encoding Network (FIRENet), a universal architecture for simultaneous multi-dataset segmentation and transfer learning involving arbitrary numbers of dataset(s). To handle different-sized image and feature, a 3D fabric module is used to encapsulate many multi-scale sub-architectures. An optimal combination of these sub-architectures can be implicitly learnt to best suit the target dataset(s). For diverse-scale feature extraction, a 3D extension of atrous spatial pyramid pooling (ASPP3D) is used in each fabric node for a fine-grained coverage of rich-scale image features. In the first experiment, FIRENet performed 3D universal bone segmentation of multiple musculoskeletal datasets of the human knee, shoulder and hip joints and exhibited excellent simultaneous multi-dataset segmentation performance. When tested for transfer learning, FIRENet further exhibited excellent single dataset performance (when pre-training on a prostate dataset), as well as significantly improved universal bone segmentation performance. The following experiment involves the simultaneous segmentation of the 10 Medical Segmentation Decathlon (MSD) challenge datasets. FIRENet demonstrated good multi-dataset segmentation results and inter-dataset adaptability of highly diverse image sizes. In both experiments, FIRENet's streamlined multi-dataset learning with one unified network that requires no hyper-parameter tuning.
△ Less
Submitted 28 November, 2022; v1 submitted 28 June, 2020;
originally announced June 2020.
-
Localization of electronic states resulting from electronic topological transitions in the Mo$_{1-x}$Re$_x$ alloys: A photoemission study
Authors:
L. S. Sharath Chandra,
Shyam Sundar,
Soma Banik,
SK. Ramjan,
M. K. Chattopadhyay,
S. N. Jha,
S. B. Roy
Abstract:
We present the results of resonant photoemission spectroscopy experiments on the Mo$_{1-x}$Re$_{x}$ alloy compositions spanning over two electronic topological transitions (ETT) at the critical concentrations $x_{C1}$ = 0.05 and $x_{C2}$ = 0.11. The photoelectrons show an additional resonance ($R3$) in the constant initial state (CIS) spectra of the alloys along with two resonances ($R1$ and $R2$)…
▽ More
We present the results of resonant photoemission spectroscopy experiments on the Mo$_{1-x}$Re$_{x}$ alloy compositions spanning over two electronic topological transitions (ETT) at the critical concentrations $x_{C1}$ = 0.05 and $x_{C2}$ = 0.11. The photoelectrons show an additional resonance ($R3$) in the constant initial state (CIS) spectra of the alloys along with two resonances ($R1$ and $R2$) which are similar to those observed in molybdenum. All the resonances show Fano-like line shapes. The asymmetry parameter $q$ of the resonances $R1$ and $R3$ of the alloys is observed to be large and negative. Our analysis suggests that the origin of large negative q is associated with phonon assisted inter band scattering between the Mo-like states and the narrow band that appeared due to the ETT.
△ Less
Submitted 26 May, 2020;
originally announced May 2020.
-
Grain refinement and enhancement of critical current density in the V_0.60Ti_0.40 alloy superconductors with Gd addition
Authors:
Sabyasachi Paul,
SK Ramjan,
R Venkatesh,
L S Sharath Chandra,
M K Chattopadhyay
Abstract:
The V-Ti alloys are promising materials as alternate to the commercial Nb-based superconductors for high current-high magnetic field applications. However, the critical current density (J_c) of these alloys are somewhat low due to their low grain-boundary density. We show here that grain refinement of the V-Ti alloys and enhancement of the J_c can be achieved by the addition of Gd into the system,…
▽ More
The V-Ti alloys are promising materials as alternate to the commercial Nb-based superconductors for high current-high magnetic field applications. However, the critical current density (J_c) of these alloys are somewhat low due to their low grain-boundary density. We show here that grain refinement of the V-Ti alloys and enhancement of the J_c can be achieved by the addition of Gd into the system, which precipitates as clusters along the grain boundaries. Both the J_c and the pinning force density (F_P) increase with the increasing Gd content up to 1 at. % Gd, where they are more than 20 times higher than those of the parent V_0.60Ti_0.40 alloy. Introduction of Gd into the system also leads to ferromagnetic (FM) correlations, and the alloys containing more than 0.5 at. % Gd exhibit spontaneous magnetization. In spite of the FM correlations, the superconducting transition temperature increases slightly with Gd-addition.
△ Less
Submitted 4 November, 2020; v1 submitted 10 December, 2019;
originally announced December 2019.
-
Two channel heat conduction in the superconducting state of the as-cast V$_{1-x}$Zr$_x$ alloys
Authors:
Sabyasachi Paul,
L. S. Sharath Chandra,
M. K. Chattopadhyay
Abstract:
We present here the temperature dependence of heat capacity ($C$($T$)) and thermal conductivity ($κ$($T$)) in the superconducting state as well as in the normal state of as-cast V$_{1-x}$Zr$_x$ alloys. Distinct jumps in the $C$($T$) of the alloys indicate the presence of three superconducting phases with transition temperatures $T_{C1}$ = 5.4~K, $T_{C2}$ = 8.2~K and $T_{C3}$ = 8.5~K. From the meta…
▽ More
We present here the temperature dependence of heat capacity ($C$($T$)) and thermal conductivity ($κ$($T$)) in the superconducting state as well as in the normal state of as-cast V$_{1-x}$Zr$_x$ alloys. Distinct jumps in the $C$($T$) of the alloys indicate the presence of three superconducting phases with transition temperatures $T_{C1}$ = 5.4~K, $T_{C2}$ = 8.2~K and $T_{C3}$ = 8.5~K. From the metallography micrographs, these three phases are identified to be $β$-V, $γ$-ZrV$_2$, and $γ'$-ZrV$_2$ respectively. Apart from these phases, $α$-Zr and $β$-Zr phases are also detected in these samples. The experimental $κ$($T$) in the superconducting state of these alloys is observed to be significantly higher than that expected theoretically. Our analysis suggests that the above observation is due to the coexistence of multiple superconducting and non superconducting phases which resulted in the two-parallel channels for the conduction of heat.
△ Less
Submitted 28 August, 2019;
originally announced August 2019.
-
Superconductivity in V$_{1-x}$Zr$_x$ alloys]{Evolution of high field superconductivity and high critical current density in the as-cast V$_{1-x}$Zr$_x$ alloys
Authors:
L. S. Sharath Chandra,
Sabyasachi Paul,
Ashish Khandelwal,
Archna Sagdeo,
R. Venkatesh,
Kranti Kumar,
A. Banerjee,
M. K. Chattopadhyay
Abstract:
We report here the structural, electrical and magnetic properties of as-cast V$_{1-x}$Zr$_x$ alloys ($x$ =0 - 0.4) at low temperatures. We observe that all the alloys undergo successive peritectic and eutectic reactions during cooling from the melt which leads to the formation of five phases, namely, a body centred cubic $β$-V phase, two phases with slightly different compositions having face cent…
▽ More
We report here the structural, electrical and magnetic properties of as-cast V$_{1-x}$Zr$_x$ alloys ($x$ =0 - 0.4) at low temperatures. We observe that all the alloys undergo successive peritectic and eutectic reactions during cooling from the melt which leads to the formation of five phases, namely, a body centred cubic $β$-V phase, two phases with slightly different compositions having face centred cubic ZrV$_2$ structure, a hexagonal closed packed $α$-Zr phase, and the $β$-Zr precipitates. The amount of each phase is found to be dependent on the concentration of zirconium in vanadium. The $β$-V and ZrV$_2$ phases show superconductivity below 5.3~K and 8.5~K respectively. We show that the critical current density is large for V-rich V$_{1-x}$Zr$_x$ alloys with $x >$ 0.1. The grain boundaries generated from the eutectic reaction, and the point defects formed due to the variation in the composition are found to be responsible for the pinning of flux lines in low and high magnetic fields respectively. Our studies reveal that the choice of the composition and the heat treatment which leads to eutectic reaction are important in improving the critical current density in this alloy system.
△ Less
Submitted 20 August, 2019;
originally announced August 2019.
-
Fully automatic computer-aided mass detection and segmentation via pseudo-color mammograms and Mask R-CNN
Authors:
Hang Min,
Devin Wilson,
Yinhuang Huang,
Siyu Liu,
Stuart Crozier,
Andrew P Bradley,
Shekhar S. Chandra
Abstract:
Mammographic mass detection and segmentation are usually performed as serial and separate tasks, with segmentation often only performed on manually confirmed true positive detections in previous studies. We propose a fully-integrated computer-aided detection (CAD) system for simultaneous mammographic mass detection and segmentation without user intervention. The proposed CAD only consists of a pse…
▽ More
Mammographic mass detection and segmentation are usually performed as serial and separate tasks, with segmentation often only performed on manually confirmed true positive detections in previous studies. We propose a fully-integrated computer-aided detection (CAD) system for simultaneous mammographic mass detection and segmentation without user intervention. The proposed CAD only consists of a pseudo-color image generation and a mass detection-segmentation stage based on Mask R-CNN. Grayscale mammograms are transformed into pseudo-color images based on multi-scale morphological sifting where mass-like patterns are enhanced to improve the performance of Mask R-CNN. Transfer learning with the Mask R-CNN is then adopted to simultaneously detect and segment masses on the pseudo-color images. Evaluated on the public dataset INbreast, the method outperforms the state-of-the-art methods by achieving an average true positive rate of 0.90 at 0.9 false positive per image and an average Dice similarity index of 0.88 for mass segmentation.
△ Less
Submitted 19 October, 2019; v1 submitted 28 June, 2019;
originally announced June 2019.
-
Teaching Programming Concepts by Develo** Games
Authors:
Kailash Chandra,
Shyamal Suhana Chandra
Abstract:
Learning to program could possibly be analogous to acquiring expertise in abstract mathematics, which may be boring or dull for a majority of students. Thus, among the countless options to approach learning coding [1-14], acquiring concepts through game creation could possibly be the most enriching experience for students. Consequently, it is important to select a lucid and familiar game for stude…
▽ More
Learning to program could possibly be analogous to acquiring expertise in abstract mathematics, which may be boring or dull for a majority of students. Thus, among the countless options to approach learning coding [1-14], acquiring concepts through game creation could possibly be the most enriching experience for students. Consequently, it is important to select a lucid and familiar game for students. Then, the following step is to choose a language that introduces the basic concepts of object-oriented programming really well. For this paper, we chose the game of Tic-Tac-Toe, which is straight-forward for most people. The programming language chosen here is C++.
△ Less
Submitted 12 March, 2019;
originally announced March 2019.
-
Thermal conductivity of strong coupling V$_{1-x}$Ti$_x$ superconductors in the Mott-Ioffe-Regel limit
Authors:
Sabyasachi Paul,
L. S. Sharath Chandra,
M. K. Chattopadhyay
Abstract:
We report an enhancement of thermal conductivity ($κ$) below the superconducting transition temperature ($T_C$) in the high carrier density $β$-V$_{1-x}$Ti$_x$ alloys. We find that the point defects generated when Ti is added to V reduce the electron mean free path down to the inter-atomic distances and make the high frequency phonons ineffective in carrying heat. In this Mott-Ioffe-Regel limit, t…
▽ More
We report an enhancement of thermal conductivity ($κ$) below the superconducting transition temperature ($T_C$) in the high carrier density $β$-V$_{1-x}$Ti$_x$ alloys. We find that the point defects generated when Ti is added to V reduce the electron mean free path down to the inter-atomic distances and make the high frequency phonons ineffective in carrying heat. In this Mott-Ioffe-Regel limit, the phonon thermal conductivity is dominated by the low frequency phonons limited by the scattering due to the electrons. The formation of Cooper pairs below the $T_C$ re-normalizes the phonon mean free path and enhances the $κ$.
△ Less
Submitted 4 October, 2018;
originally announced October 2018.
-
Study of Nb$_{0.18}$Re$_{0.82}$ non-centrosymmetric superconductor in the normal and superconducting states
Authors:
Shyam Sundar,
S. Salem-Sugui Jr.,
M. K. Chattopadhyay,
S. B. Roy,
L. S. Sharath Chandra,
L. F. Cohen,
L. Ghivelder
Abstract:
We examine the evidence for multiband superconductivity and non s-wave pairing in the non-centrosymmetric superconductor Nb$_{0.18}$Re$_{0.82}$, using electrical transport, magnetization and specific heat measurements. In the normal state, both the evolution of resistivity with temperature and with magnetic field support a multiband picture. In the superconducting state, the Werthamer, Helfand and…
▽ More
We examine the evidence for multiband superconductivity and non s-wave pairing in the non-centrosymmetric superconductor Nb$_{0.18}$Re$_{0.82}$, using electrical transport, magnetization and specific heat measurements. In the normal state, both the evolution of resistivity with temperature and with magnetic field support a multiband picture. In the superconducting state, the Werthamer, Helfand and Hohenberg (WHH) model cannot adequately describe the temperature dependence of the upper critical field, $H_{c2}(T)$, over the whole temperature range measured. In addition, the observed $H_{c2}(0)$ exceeds the Pauli limit, suggesting non-s-wave pairing. Interestingly, the Kadowaki-Woods ratio and Uemura plot reveal a behavior in Nb$_{0.18}$Re$_{0.82}$ which is similar to that found in unconventional superconductors. The temperature dependence of the lower critical field, $H_{c1}(T)$, follows an anomalous $T^3$ behavior and the derived normalized superfluid density ($ρ_s$) is well explained using a nodeless two-gap description. Phase-fluctuation analysis conducted on the reversible magnetization data, reveals a significant deviation from the mean-field conventional s-wave behavior. This trend is interpreted in terms of a non s-wave spin-triplet component in the pairing symmetry as might be anticipated in a non-centrosymmetric superconductor where anti-symmetric spin-orbit coupling plays a dominant role.
△ Less
Submitted 12 July, 2018;
originally announced July 2018.
-
Strong electron-phonon coupling and multiband effects in the superconducting $β$-phase Mo$_{1-x}$Re$_x$ alloys
Authors:
Shyam Sundar,
L S Sharath Chandra,
M K Chattopadhyay,
Sudhir K Pandey,
D Venkateshwarlu,
R Rawat,
V Ganesan,
S B Roy
Abstract:
Superconducting transition temperature $T_C$ of some of the cubic $β$-phase Mo$_{1-x}$Re$_x$ alloys with x > 0.10 is an order of magnitude higher than that in the elements Mo and Re. We investigate this rather enigmatic issue of the enhanced superconductivity with the help of experimental studies of the temperature dependent electrical resistivity ($ρ$(T)) and heat capacity (C$_P$(T)), as well as…
▽ More
Superconducting transition temperature $T_C$ of some of the cubic $β$-phase Mo$_{1-x}$Re$_x$ alloys with x > 0.10 is an order of magnitude higher than that in the elements Mo and Re. We investigate this rather enigmatic issue of the enhanced superconductivity with the help of experimental studies of the temperature dependent electrical resistivity ($ρ$(T)) and heat capacity (C$_P$(T)), as well as the theoretical estimation of electronic density of states (DOS) using band structure calculations. The $ρ$(T) in the normal state of the Mo$_{1-x}$Re$_x$ alloys with x > 0.15 is distinctly different from that of Mo and the alloys with x < 0.10. We have also observed that the Sommerfeld coefficient of electronic heat capacity $γ$, superconducting transition temperature $T_C$ and the DOS at the Fermi level show an abrupt change above x > 0.10. The analysis of these results indicates that the value of electron-phonon coupling constant λep required to explain the $T_C$ of the alloys with x > 0.10 is much higher than that estimated from $γ$. On the other hand the analysis of the results of the $ρ$(T) reveals the presence of phonon assisted inter-band s-d scattering in this composition range. We argue that a strong electron-phonon coupling arising due to the multiband effects is responsible for the enhanced $T_C$ in the $β$-phase Mo$_{1-x}$Re$_x$ alloys with x > 0.10.
△ Less
Submitted 28 March, 2018;
originally announced March 2018.
-
High field paramagnetic Meissner effect in Mo$_{100-x}$Re$_x$ alloy superconductors
Authors:
Shyam Sundar,
M K Chattopadhyay,
L S Sharath Chandra,
S B Roy
Abstract:
We have performed an experimental study on the temperature and field dependence of magnetization of a series of superconducting Mo$_{100-x}$Re$_x$ alloys. Our studies reveal the presence of high field paramagnetic effect (HFPME) in these low temperature superconductors. The results of our studies indicate that the HFPME in the Mo$_{100-x}$Re$_x$ alloys is related to the inhomogeneous distribution…
▽ More
We have performed an experimental study on the temperature and field dependence of magnetization of a series of superconducting Mo$_{100-x}$Re$_x$ alloys. Our studies reveal the presence of high field paramagnetic effect (HFPME) in these low temperature superconductors. The results of our studies indicate that the HFPME in the Mo$_{100-x}$Re$_x$ alloys is related to the inhomogeneous distribution of strong and weak flux-line pinning centers, and the flux compression resulting due to the same while cooling down the samples in the presence of high magnetic fields. The results are complemented by the studies on the temperature dependence of the electrical resistivity and heat capacity of these alloys in different constant magnetic fields. We compare our findings with the studies reported in literature on both low $T_C$ and high $T_C$ superconductors.
△ Less
Submitted 28 March, 2018;
originally announced March 2018.
-
Evidence of multiband superconductivity in the $β$-phase Mo$_{1-x}$Re$_x$ alloys
Authors:
Shyam Sundar,
L S Sharath Chandra,
M K Chattopadhyay,
S B Roy
Abstract:
We present a detailed study of the superconducting properties in the beta-phase Mo$_{1-x}$Re$_x$ (x = 0.25 and 0.4) solid solution alloys pursued through magnetization and heat capacity measurements. The temperature dependence of the upper critical field H$_{C2}$(T) in these binary alloys shows a deviation from the prediction of the Werthamer-Helfand-Hohenberg (WHH) theory. The temperature depende…
▽ More
We present a detailed study of the superconducting properties in the beta-phase Mo$_{1-x}$Re$_x$ (x = 0.25 and 0.4) solid solution alloys pursued through magnetization and heat capacity measurements. The temperature dependence of the upper critical field H$_{C2}$(T) in these binary alloys shows a deviation from the prediction of the Werthamer-Helfand-Hohenberg (WHH) theory. The temperature dependence of superfluid density estimated from the variation of lower critical field H$_{C1}$ with temperature, cannot be explained within the framework of a single superconducting energy gap. The heat capacity also shows an anomalous feature in its temperature dependence. All these results can be reasonably explained by considering the existence of two superconducting energy gaps in these Mo$_{1-x}$Re$_x$ alloys. Initial results of electronic structure calculations and resonant photoelectron spectroscopy measurements support this possibility and suggest that the Re-5d like states at the Fermi level may not intermix with the Mo-5p and 5s like states in the beta-phase Mo$_{1-x}$Re$_x$ alloys and contribute quite distinctly to the superconductivity of these alloys.
△ Less
Submitted 27 March, 2018;
originally announced March 2018.