-
Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy
Authors:
Sharib Ali,
Mariia Dmitrieva,
Noha Ghatwary,
Sophia Bano,
Gorkem Polat,
Alptekin Temizel,
Adrian Krenzer,
Amar Hekalo,
Yun Bo Guo,
Bogdan Matuszewski,
Mourad Gridach,
Irina Voiculescu,
Vishnusai Yoganand,
Arnav Chavan,
Aryan Raj,
Nhan T. Nguyen,
Dat Q. Tran,
Le Duy Huynh,
Nicolas Boutry,
Shahadate Rezvy,
Haijian Chen,
Yoon Ho Choi,
Anand Subramanian,
Velmurugan Balasubramanian,
Xiaohong W. Gao
, et al. (12 additional authors not shown)
Abstract:
The Endoscopy Computer Vision Challenge (EndoCV) is a crowd-sourcing initiative to address eminent problems in developing reliable computer aided detection and diagnosis endoscopy systems and suggest a pathway for clinical translation of technologies. Whilst endoscopy is a widely used diagnostic and treatment tool for hollow-organs, there are several core challenges often faced by endoscopists, mainly: 1) presence of multi-class artefacts that hinder their visual interpretation, and 2) difficulty in identifying subtle precancerous precursors and cancer abnormalities. Artefacts often affect the robustness of deep learning methods applied to the gastrointestinal tract organs as they can be confused with tissue of interest. EndoCV2020 challenges are designed to address research questions in these remits. In this paper, we present a summary of methods developed by the top 17 teams and provide an objective comparison of state-of-the-art methods and methods designed by the participants for two sub-challenges: i) artefact detection and segmentation (EAD2020), and ii) disease detection and segmentation (EDD2020). Multi-center, multi-organ, multi-class, and multi-modal clinical endoscopy datasets were compiled for both EAD2020 and EDD2020 sub-challenges. The out-of-sample generalization ability of detection algorithms was also evaluated. Whilst most teams focused on accuracy improvements, only a few methods hold credibility for clinical usability. The best performing teams provided solutions to tackle class imbalance, and variabilities in size, origin, modality and occurrences by exploring data augmentation, data fusion, and optimal class thresholding techniques.
Submitted 17 February, 2021; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Collaboratively Optimizing Power Scheduling and Mitigating Congestion using Local Pricing in a Receding Horizon Market
Authors:
Cornelis Jan van Leeuwen,
Joost Stam,
Arun Subramanian,
Koen Kok
Abstract:
A distributed, hierarchical, market based approach is introduced to solve the economic dispatch problem. The approach requires only a minimal amount of information to be shared between a central market operator and the end-users. Price signals from the market operator are sent down to end-user device agents, which in turn respond with power schedules. Intermediate congestion agents make sure that local power constraints are satisfied and any potential congestion is avoided by adding local pricing differences. Our results show that in 20% of the evaluated scenarios the solutions are identical to the global optimum when perfect knowledge is available. In the other 80% the results are not significantly worse, while providing a higher level of scalability and increasing the consumer's privacy.
Submitted 4 September, 2020;
originally announced September 2020.
-
Measures of Complexity for Large Scale Image Datasets
Authors:
Ameet Annasaheb Rahane,
Anbumani Subramanian
Abstract:
Large scale image datasets are a growing trend in the field of machine learning. However, it is hard to quantitatively understand or specify how various datasets compare to each other - i.e., if one dataset is more complex or harder to ``learn'' with respect to a deep-learning based network. In this work, we build a series of relatively computationally simple methods to measure the complexity of a dataset. Furthermore, we present an approach to demonstrate visualizations of high dimensional data, in order to assist with visual comparison of datasets. We present our analysis using four datasets from the autonomous driving research community - Cityscapes, IDD, BDD and Vistas. Using entropy based metrics, we present a rank-order complexity of these datasets, which we compare with an established rank-order with respect to deep learning.
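As a rough illustration of what an entropy-based complexity ranking could look like, the sketch below computes the Shannon entropy of pixel-intensity histograms and orders datasets by their mean entropy. The abstract does not specify the exact metrics used, so the function names `shannon_entropy` and `rank_by_entropy`, the use of intensity histograms, and the toy data are all assumptions made for illustration only.

```python
import math
import random
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy (in bits) of a flat sequence of pixel intensities."""
    counts = Counter(pixels)
    total = len(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_by_entropy(datasets):
    """Order dataset names from highest to lowest mean per-image entropy.

    `datasets` maps a name to a list of images, each a flat list of
    intensities. This stands in for a complexity ranking; it is not the
    paper's metric.
    """
    mean_entropy = {
        name: sum(shannon_entropy(img) for img in images) / len(images)
        for name, images in datasets.items()
    }
    return sorted(mean_entropy, key=mean_entropy.get, reverse=True)


# Toy data (hypothetical): constant images versus uniformly random images.
rng = random.Random(0)
flat = [[128] * 1024 for _ in range(4)]
noisy = [[rng.randrange(256) for _ in range(1024)] for _ in range(4)]
ranking = rank_by_entropy({"flat": flat, "noisy": noisy})
```

A constant image has zero entropy, so the random images rank first here. Real datasets would need a richer measure than a global histogram, which ignores spatial structure.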
Submitted 10 August, 2020;
originally announced August 2020.
-
Reinforcement Learning and its Connections with Neuroscience and Psychology
Authors:
Ajay Subramanian,
Sharad Chitlangia,
Veeky Baths
Abstract:
Reinforcement learning methods have recently been very successful at performing complex sequential tasks like playing Atari games, Go and Poker. These algorithms have outperformed humans in several tasks by learning from scratch, using only scalar rewards obtained through interaction with their environment. While there certainly has been considerable independent innovation to produce such results, many core ideas in reinforcement learning are inspired by phenomena in animal learning, psychology and neuroscience. In this paper, we comprehensively review a large number of findings in both neuroscience and psychology that evidence reinforcement learning as a promising candidate for modeling learning and decision making in the brain. In doing so, we construct a mapping between various classes of modern RL algorithms and specific findings in both neurophysiological and behavioral literature. We then discuss the implications of this observed relationship between RL, neuroscience and psychology and its role in advancing research in both AI and brain science.
Submitted 26 September, 2021; v1 submitted 25 June, 2020;
originally announced July 2020.
-
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Authors:
Ashish Arora,
Desh Raj,
Aswin Shanmugam Subramanian,
Ke Li,
Bar Ben-Yair,
Matthew Maciejewski,
Piotr Żelasko,
Paola García,
Shinji Watanabe,
Sanjeev Khudanpur
Abstract:
This paper summarizes the JHU team's efforts in tracks 1 and 2 of the CHiME-6 challenge for distant multi-microphone conversational speech diarization and recognition in everyday home environments. We explore multi-array processing techniques at each stage of the pipeline, such as multi-array guided source separation (GSS) for enhancement and acoustic model training data, posterior fusion for speech activity detection, PLDA score fusion for diarization, and lattice combination for automatic speech recognition (ASR). We also report results with different acoustic model architectures, and integrate other techniques such as online multi-channel weighted prediction error (WPE) dereverberation and variational Bayes-hidden Markov model (VB-HMM) based overlap assignment to deal with reverberation and overlapping speakers, respectively. As a result of these efforts, our ASR systems achieve a word error rate of 40.5% and 67.5% on tracks 1 and 2, respectively, on the evaluation set. This is an improvement of 10.8% and 10.4% absolute, over the challenge baselines for the respective tracks.
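The reported figures are absolute improvements, so the baseline word error rates they imply can be recovered by simple addition. The short sketch below does that arithmetic and also derives the corresponding relative reductions. The variable names are illustrative, and the baselines are inferred from the numbers in the abstract rather than quoted from the challenge.

```python
# Word error rates (%) and absolute improvements (%) as stated in the abstract.
achieved_wer = {"track1": 40.5, "track2": 67.5}
absolute_gain = {"track1": 10.8, "track2": 10.4}

# An absolute improvement is a difference in percentage points, so the
# implied baseline is the achieved WER plus the gain.
implied_baseline = {
    track: round(achieved_wer[track] + absolute_gain[track], 1)
    for track in achieved_wer
}

# The relative reduction expresses the same gain as a fraction of the baseline.
relative_reduction = {
    track: 100.0 * absolute_gain[track] / implied_baseline[track]
    for track in achieved_wer
}

for track in achieved_wer:
    print(track, implied_baseline[track], round(relative_reduction[track], 1))
```

The distinction matters when comparing systems: the same absolute gain is a smaller relative reduction on the harder track, where the baseline error is higher.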
Submitted 14 June, 2020;
originally announced June 2020.
-
The ELFIN Mission
Authors:
V. Angelopoulos,
E. Tsai,
L. Bingley,
C. Shaffer,
D. L. Turner,
A. Runov,
W. Li,
J. Liu,
A. V. Artemyev,
X. -J. Zhang,
R. J. Strangeway,
R. E. Wirz,
Y. Y. Shprits,
V. A. Sergeev,
R. P. Caron,
M. Chung,
P. Cruce,
W. Greer,
E. Grimes,
K. Hector,
M. J. Lawson,
D. Leneman,
E. V. Masongsong,
C. L. Russell,
C. Wilkins
, et al. (57 additional authors not shown)
Abstract:
The Electron Loss and Fields Investigation with a Spatio-Temporal Ambiguity-Resolving option (ELFIN-STAR, or simply: ELFIN) mission comprises two identical 3-Unit (3U) CubeSats on a polar (~93deg inclination), nearly circular, low-Earth (~450 km altitude) orbit. Launched on September 15, 2018, ELFIN is expected to have a >2.5 year lifetime. Its primary science objective is to resolve the mechanism of storm-time relativistic electron precipitation, for which electromagnetic ion cyclotron (EMIC) waves are a prime candidate. From its ionospheric vantage point, ELFIN uses its unique pitch-angle-resolving capability to determine whether measured relativistic electron pitch-angle and energy spectra within the loss cone bear the characteristic signatures of scattering by EMIC waves or whether such scattering may be due to other processes. Pairing identical ELFIN satellites with slowly-variable along-track separation allows disambiguation of spatial and temporal evolution of the precipitation over minutes-to-tens-of-minutes timescales, faster than the orbit period of a single low-altitude satellite (~90min). Each satellite carries an energetic particle detector for electrons (EPDE) that measures 50keV to 5MeV electrons with deltaE/E<40% and a fluxgate magnetometer (FGM) on a ~72cm boom that measures magnetic field waves (e.g., EMIC waves) in the range from DC to 5Hz Nyquist (nominally) with <0.3nT/sqrt(Hz) noise at 1Hz. The spinning satellites (T_spin~3s) are equipped with magnetorquers that permit spin-up/down and reorientation maneuvers. The spin axis is placed normal to the orbit plane, allowing full pitch-angle resolution twice per spin. An energetic particle detector for ions (EPDI) measures 250keV-5MeV ions, addressing secondary science. Funded initially by CalSpace and the University Nanosat Program, ELFIN was selected for flight with joint support from NSF and NASA between 2014 and 2018.
Submitted 16 June, 2020; v1 submitted 13 June, 2020;
originally announced June 2020.
-
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
Authors:
Wangyou Zhang,
Aswin Shanmugam Subramanian,
Xuankai Chang,
Shinji Watanabe,
Yanmin Qian
Abstract:
Despite successful applications of end-to-end approaches in multi-channel speech recognition, the performance still degrades severely when the speech is corrupted by reverberation. In this paper, we integrate the dereverberation module into the end-to-end multi-channel speech recognition system and explore two different frontend architectures. First, a multi-source mask-based weighted prediction error (WPE) module is incorporated in the frontend for dereverberation. Second, another novel frontend architecture is proposed, which extends the weighted power minimization distortionless response (WPD) convolutional beamformer to perform simultaneous separation and dereverberation. We derive a new formulation from the original WPD, which can handle multi-source input, and replace eigenvalue decomposition with the matrix inverse operation to make the back-propagation algorithm more stable. The above two architectures are optimized in a fully end-to-end manner, only using the speech recognition criterion. Experiments on both spatialized wsj1-2mix corpus and REVERB show that our proposed model outperformed the conventional methods in reverberant scenarios.
Submitted 26 October, 2020; v1 submitted 21 May, 2020;
originally announced May 2020.
-
CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Authors:
Shinji Watanabe,
Michael Mandel,
Jon Barker,
Emmanuel Vincent,
Ashish Arora,
Xuankai Chang,
Sanjeev Khudanpur,
Vimal Manohar,
Daniel Povey,
Desh Raj,
David Snyder,
Aswin Shanmugam Subramanian,
Jan Trmal,
Bar Ben Yair,
Christoph Boeddeker,
Zhaoheng Ni,
Yusuke Fujita,
Shota Horiguchi,
Naoyuki Kanda,
Takuya Yoshioka,
Neville Ryant
Abstract:
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further considers the problem of distant multi-microphone conversational speech diarization and recognition in everyday home environments. Speech material is the same as the previous CHiME-5 recordings except for accurate array synchronization. The material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech. This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2). Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.
Submitted 2 May, 2020; v1 submitted 20 April, 2020;
originally announced April 2020.
-
Inverse Design of Potential Singlet Fission Molecules using a Transfer Learning Based Approach
Authors:
Akshay Subramanian,
Utkarsh Saha,
Tejasvini Sharma,
Naveen K. Tailor,
Soumitra Satapathi
Abstract:
Singlet fission has emerged as one of the most exciting phenomena known to improve the efficiencies of different types of solar cells and has found uses in diverse optoelectronic applications. The range of available singlet fission molecules is, however, limited because, to undergo singlet fission, molecules have to satisfy certain energy conditions. Recent advances in material search using inverse design have enabled the prediction of materials for a wide range of applications, and inverse design has emerged as one of the most efficient methods in the discovery of suitable materials. It is particularly helpful in manipulating large datasets, uncovering hidden information from the molecular dataset and generating new structures. However, we seldom encounter large datasets in structure prediction problems in material science. In our work, we put forward inverse design of possible singlet fission molecules using a transfer learning based approach where we make use of a much larger ChEMBL dataset of structurally similar molecules to transfer the learned characteristics to the singlet fission dataset.
Submitted 17 March, 2020;
originally announced March 2020.
-
Attention-based ASR with Lightweight and Dynamic Convolutions
Authors:
Yuya Fujita,
Aswin Shanmugam Subramanian,
Motoi Omachi,
Shinji Watanabe
Abstract:
End-to-end (E2E) automatic speech recognition (ASR) with sequence-to-sequence models has gained attention because of its simple model training compared with conventional hidden Markov model based ASR. Recently, several studies report the state-of-the-art E2E ASR results obtained by Transformer. Compared to recurrent neural network (RNN) based E2E models, training of Transformer is more efficient and also achieves better performance on various tasks. However, self-attention used in Transformer requires computation quadratic in its input length. In this paper, we propose to apply lightweight and dynamic convolution to E2E ASR as an alternative architecture to the self-attention to make the computational order linear. We also propose joint training with connectionist temporal classification, convolution on the frequency axis, and combination with self-attention. With these techniques, the proposed architectures achieve better performance than RNN-based E2E model and performance competitive to state-of-the-art Transformer on various ASR benchmarks including noisy/reverberant tasks.
Submitted 19 February, 2020; v1 submitted 26 December, 2019;
originally announced December 2019.
-
3D Conditional Generative Adversarial Networks to enable large-scale seismic image enhancement
Authors:
Praneet Dutta,
Bruce Power,
Adam Halpert,
Carlos Ezequiel,
Aravind Subramanian,
Chanchal Chatterjee,
Sindhu Hari,
Kenton Prindle,
Vishal Vaddina,
Andrew Leach,
Raj Domala,
Laura Bandura,
Massimo Mascaro
Abstract:
We propose GAN-based image enhancement models for frequency enhancement of 2D and 3D seismic images. Seismic imagery is used to understand and characterize the Earth's subsurface for energy exploration. Because these images often suffer from resolution limitations and noise contamination, our proposed method performs large-scale seismic volume frequency enhancement and denoising. The enhanced images reduce uncertainty and improve decisions about issues, such as optimal well placement, that often rely on low signal-to-noise ratio (SNR) seismic volumes. We explored the impact of adding lithology class information to the models, resulting in improved performance on PSNR and SSIM metrics over a baseline model with no conditional information.
Submitted 15 November, 2019;
originally announced November 2019.
-
A simple and effective hybrid genetic search for the job sequencing and tool switching problem
Authors:
Jordana Mecler,
Anand Subramanian,
Thibaut Vidal
Abstract:
The job sequencing and tool switching problem (SSP) has been extensively studied in the field of operations research, due to its practical relevance and methodological interest. Given a machine that can load a limited number of tools simultaneously and a number of jobs that each require a subset of the available tools, the SSP seeks a job sequence that minimizes the number of tool switches in the machine. To solve this problem, we propose a simple and efficient hybrid genetic search based on a generic solution representation, a tailored decoding operator, efficient local searches and diversity management techniques. To guide the search, we introduce a secondary objective designed to break ties. These techniques allow the search to explore structurally different solutions and escape local optima. As shown in our computational experiments on classical benchmark instances, our algorithm significantly outperforms all previous approaches while remaining simple to understand and easy to implement. We finally report results on a new set of larger instances to stimulate future research and comparative analyses.
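To make the objective concrete, the sketch below counts tool switches for one fixed job sequence. It uses a lazy variant of the classical "keep tool needed soonest" eviction rule, which the abstract does not mention; the function names, the convention that filling an empty magazine slot is free, and the toy instance are assumptions. It is not the paper's hybrid genetic search or its decoding operator.

```python
def next_use(tool, jobs, start):
    """Index of the first job at or after `start` that needs `tool`."""
    for j in range(start, len(jobs)):
        if tool in jobs[j]:
            return j
    return len(jobs)  # never needed again


def count_tool_switches(jobs, capacity):
    """Count tool switches for a fixed job sequence.

    `jobs` is a list of sets of tool ids and `capacity` is the magazine
    size. Loading into an empty slot is treated as free (an assumed
    convention); replacing a loaded tool counts as one switch.
    """
    magazine = set()
    switches = 0
    for i, needed in enumerate(jobs):
        if len(needed) > capacity:
            raise ValueError("a job needs more tools than the magazine holds")
        for tool in sorted(needed - magazine):
            if len(magazine) >= capacity:
                # Evict the loaded tool, not needed now, whose next use is
                # furthest in the future.
                candidates = sorted(magazine - needed)
                victim = max(candidates, key=lambda t: next_use(t, jobs, i + 1))
                magazine.remove(victim)
                switches += 1
            magazine.add(tool)
    return switches


# Hypothetical instance: three jobs and a magazine with two slots.
sequence = [{1, 2}, {2, 3}, {1, 3}]
print(count_tool_switches(sequence, capacity=2))
```

A search over job sequences, such as the genetic search described above, would call an evaluation like this many times, which is why a cheap evaluation for a fixed sequence is useful.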
Submitted 10 October, 2019;
originally announced October 2019.
-
Enhancing Object Detection in Adverse Conditions using Thermal Imaging
Authors:
Kshitij Agrawal,
Anbumani Subramanian
Abstract:
Autonomous driving relies on deriving understanding of objects and scenes through images. These images are often captured by sensors in the visible spectrum. For improved detection capabilities we propose the use of thermal sensors to augment the vision capabilities of an autonomous vehicle. In this paper, we present our investigations on the fusion of visible and thermal spectrum images using a publicly available dataset, and use it to analyze the performance of object recognition on other known driving datasets. We present a comparison of object detection in night time imagery and qualitatively demonstrate that thermal images significantly improve detection accuracy.
Submitted 30 September, 2019;
originally announced September 2019.
-
Machine Learning for Stochastic Parameterization: Generative Adversarial Networks in the Lorenz '96 Model
Authors:
David John Gagne II,
Hannah M. Christensen,
Aneesh C. Subramanian,
Adam H. Monahan
Abstract:
Stochastic parameterizations account for uncertainty in the representation of unresolved sub-grid processes by sampling from the distribution of possible sub-grid forcings. Some existing stochastic parameterizations utilize data-driven approaches to characterize uncertainty, but these approaches require significant structural assumptions that can limit their scalability. Machine learning models, including neural networks, are able to represent a wide range of distributions and build optimized mappings between a large number of inputs and sub-grid forcings. Recent research on machine learning parameterizations has focused only on deterministic parameterizations. In this study, we develop a stochastic parameterization using the generative adversarial network (GAN) machine learning framework. The GAN stochastic parameterization is trained and evaluated on output from the Lorenz '96 model, which is a common baseline model for evaluating both parameterization and data assimilation techniques. We evaluate different ways of characterizing the input noise for the model and perform model runs with the GAN parameterization at weather and climate timescales. Some of the GAN configurations perform better than a baseline bespoke parameterization at both timescales, and the networks closely reproduce the spatio-temporal correlations and regimes of the Lorenz '96 system. We also find that in general those models which produce skillful forecasts are also associated with the best climate simulations.
Submitted 10 September, 2019;
originally announced September 2019.
-
Mean Spectral Normalization of Deep Neural Networks for Embedded Automation
Authors:
Anand Krishnamoorthy Subramanian,
Nak Young Chong
Abstract:
Deep Neural Networks (DNNs) have begun to thrive in the field of automation systems, owing to the recent advancements in standardising various aspects such as architecture, optimization techniques, and regularization. In this paper, we take a step towards a better understanding of Spectral Normalization (SN) and its potential for standardizing regularization of a wider range of Deep Learning models, following an empirical approach. We conduct several experiments to study their training dynamics, in comparison with the ubiquitous Batch Normalization (BN) and show that SN increases the gradient sparsity and controls the gradient variance. Furthermore, we show that SN suffers from a phenomenon we call the mean-drift effect, which degrades its performance. We then propose a weight reparameterization called Mean Spectral Normalization (MSN) to resolve the mean drift, thereby significantly improving the network's performance. Our model performs ~16% faster than BN in practice, and has fewer trainable parameters. We also show the performance of our MSN for small, medium, and large CNNs - 3-layer CNN, VGG7 and DenseNet-BC, respectively - and unsupervised image generation tasks using Generative Adversarial Networks (GANs) to evaluate its applicability for a broad range of embedded automation tasks.
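For readers unfamiliar with plain Spectral Normalization, the sketch below estimates the largest singular value of a weight matrix by power iteration and divides the weights by it. It covers standard SN only; the abstract does not give the details of the proposed MSN reparameterization, so none are attempted here. The helper names, the iteration count, and the toy matrix are assumptions, and the matrix is assumed to be non-zero.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def l2_norm(vector):
    return math.sqrt(sum(x * x for x in vector))


def spectral_norm(weights, n_iter=50, seed=0):
    """Estimate the largest singular value of `weights` by power iteration."""
    rng = random.Random(seed)
    transposed = [list(col) for col in zip(*weights)]
    v = [rng.gauss(0.0, 1.0) for _ in range(len(weights[0]))]
    for _ in range(n_iter):
        u = matvec(weights, v)
        u_norm = l2_norm(u)
        u = [x / u_norm for x in u]
        v = matvec(transposed, u)
        v_norm = l2_norm(v)
        v = [x / v_norm for x in v]
    return l2_norm(matvec(weights, v))


def spectrally_normalize(weights, n_iter=50):
    """Rescale `weights` so that its largest singular value is about 1."""
    sigma = spectral_norm(weights, n_iter=n_iter)
    return [[w / sigma for w in row] for row in weights]


# Toy weight matrix (hypothetical) with singular values 3 and 1.
W = [[3.0, 0.0], [0.0, 1.0]]
W_sn = spectrally_normalize(W)
```

Training implementations commonly run a single iteration per step and carry the vector over between steps; the fixed 50 iterations here simply keep the sketch self-contained.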
Submitted 9 July, 2019;
originally announced July 2019.
-
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Authors:
Aswin Shanmugam Subramanian,
Xiaofei Wang,
Shinji Watanabe,
Toru Taniguchi,
Dung Tran,
Yuya Fujita
Abstract:
Sequence-to-sequence (S2S) modeling is becoming a popular paradigm for automatic speech recognition (ASR) because of its ability to jointly optimize all the conventional ASR components in an end-to-end (E2E) fashion. This report investigates the ability of E2E ASR from standard close-talk to far-field applications by encompassing entire multichannel speech enhancement and ASR components within the S2S model. There have been previous studies on jointly optimizing neural beamforming alongside E2E ASR for denoising. It is clear from both recent challenge outcomes and successful products that far-field systems would be incomplete without solving both denoising and dereverberation simultaneously. This report uses a recently developed architecture for far-field ASR by composing neural extensions of dereverberation and beamforming modules with the S2S ASR module as a single differentiable neural network and also clearly defining the role of each subnetwork. The original implementation of this architecture was successfully applied to the noisy speech recognition task (CHiME-4), while we applied this implementation to noisy reverberant tasks (DIRHA and REVERB). Our investigation shows that the method achieves better performance than conventional pipeline methods on the DIRHA English dataset and comparable performance on the REVERB dataset. It also has additional advantages of being neither iterative nor requiring parallel noisy and clean speech data.
Submitted 28 April, 2019; v1 submitted 18 April, 2019;
originally announced April 2019.
-
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments
Authors:
Girish Varma,
Anbumani Subramanian,
Anoop Namboodiri,
Manmohan Chandraker,
C V Jawahar
Abstract:
While several datasets for autonomous navigation have become available in recent years, they tend to focus on structured driving environments. This usually corresponds to well-delineated infrastructure such as lanes, a small number of well-defined categories for traffic participants, low variation in object or background appearance and strict adherence to traffic rules. We propose IDD, a novel dataset for road scene understanding in unstructured environments where the above assumptions are largely not satisfied. It consists of 10,004 images, finely annotated with 34 classes collected from 182 drive sequences on Indian roads. The label set is expanded in comparison to popular benchmarks such as Cityscapes, to account for new classes. It also reflects label distributions of road scenes significantly different from existing datasets, with most classes displaying greater within-class diversity. Consistent with real driving behaviours, it also identifies new classes such as drivable areas besides the road. We propose a new four-level label hierarchy, which allows varying degrees of complexity and opens up possibilities for new training methods. Our empirical study provides an in-depth analysis of the label characteristics. State-of-the-art methods for semantic segmentation achieve much lower accuracies on our dataset, demonstrating its distinction compared to Cityscapes. Finally, we propose that our dataset is an ideal opportunity for new problems such as domain adaptation, few-shot learning and behaviour prediction in road scenes.
Submitted 26 November, 2018;
originally announced November 2018.
-
A Smart System for Selection of Optimal Product Images in E-Commerce
Authors:
Abon Chaudhuri,
Paolo Messina,
Samrat Kokkula,
Aditya Subramanian,
Abhinandan Krishnan,
Shreyansh Gandhi,
Alessandro Magnani,
Venkatesh Kandaswamy
Abstract:
In e-commerce, content quality of the product catalog plays a key role in delivering a satisfactory experience to the customers. In particular, visual content such as product images influences customers' engagement and purchase decisions. With the rapid growth of e-commerce and the advent of artificial intelligence, traditional content management systems are giving way to automated scalable systems. In this paper, we present a machine learning driven visual content management system for extremely large e-commerce catalogs. For a given product, the system aggregates images from various suppliers, understands and analyzes them to produce a superior image set with optimal image count and quality, and arranges them in an order tailored to the demands of the customers. The system makes use of an array of technologies, ranging from deep learning to traditional computer vision, at different stages of analysis. In this paper, we outline how the system works and discuss the unique challenges related to applying machine learning techniques to real-world data from e-commerce domain. We emphasize how we tune state-of-the-art image classification techniques to develop solutions custom made for a massive, diverse, and constantly evolving product catalog. We also provide the details of how we measure the system's impact on various customer engagement metrics.
Submitted 11 November, 2018;
originally announced November 2018.
-
One-Click Annotation with Guided Hierarchical Object Detection
Authors:
Adithya Subramanian,
Anbumani Subramanian
Abstract:
The increase in data collection has made data annotation an interesting and valuable task in the contemporary world. This paper presents a new methodology for quickly annotating data using click-supervision and hierarchical object detection. The proposed work is semi-automatic in nature, with the task of annotation split between the human and a neural network. We show that our improved method of annotation reduces the time, cost and mental stress on a human annotator. The research also highlights how our method performs better than the current approach in different circumstances such as variation in number of objects, object size and different datasets. Our approach also proposes a new method of using object detectors, making it suitable for the data annotation task. The experiment conducted on the PASCAL VOC dataset revealed that annotations created with our approach achieve a mAP of 0.995 and a recall of 0.903. Our approach has shown an overall improvement of 8.5% and 18.6% in mean average precision and recall, respectively, for KITTI, and of 69.6% and 36% for the CITYSCAPES dataset. The proposed framework is 3-4 times faster than the standard annotation method.
Submitted 1 October, 2018;
originally announced October 2018.
-
Learning End-to-end Autonomous Driving using Guided Auxiliary Supervision
Authors:
Ashish Mehta,
Adithya Subramanian,
Anbumani Subramanian
Abstract:
Learning to drive faithfully in highly stochastic urban settings remains an open problem. To that end, we propose a Multi-task Learning from Demonstration (MT-LfD) framework which uses supervised auxiliary task prediction to guide the main task of predicting the driving commands. Our framework involves an end-to-end trainable network for imitating the expert demonstrator's driving commands. The network intermediately predicts visual affordances and action primitives through direct supervision, which provides the aforementioned auxiliary supervised guidance. We demonstrate that such joint learning and supervised guidance facilitate hierarchical task decomposition, helping the agent learn faster and achieve better driving performance, and increase the transparency of the otherwise black-box end-to-end network. We run our experiments to validate the MT-LfD framework in CARLA, an open-source urban driving simulator. We introduce multiple non-player agents in CARLA and induce temporal noise in them for realistic stochasticity.
Submitted 30 August, 2018;
originally announced August 2018.
-
How To Extract Fashion Trends From Social Media? A Robust Object Detector With Support For Unsupervised Learning
Authors:
Vijay Gabale,
Anand Prabhu Subramanian
Abstract:
With the proliferation of social media, fashion inspired by celebrities, reputed designers and fashion influencers has shortened the cycle of fashion design and manufacturing. However, with the explosion of fashion-related content and the large number of user-generated fashion photos, it is an arduous task for fashion designers to wade through social media photos and create a digest of trending fashion. This necessitates deep parsing of fashion photos on social media to localize and classify multiple fashion items from a given fashion photo. While object detection competitions such as MSCOCO have thousands of samples for each of the object categories, it is quite difficult to get large labeled datasets for fast fashion items. Moreover, state-of-the-art object detectors do not have any functionality to ingest the large amount of unlabeled data available on social media in order to fine tune object detectors with labeled datasets. In this work, we show the application of a generic object detector, which can be pretrained in an unsupervised manner, on 24 categories from the recently released Open Images V4 dataset. We first train the base architecture of the object detector using unsupervised learning on 60K unlabeled photos from 24 categories gathered from social media, and then fine tune it on 8.2K labeled photos from the Open Images V4 dataset. On 300 x 300 image inputs, we achieve 72.7% mAP on a test dataset of 2.4K photos while performing 11% to 17% better than state-of-the-art object detectors. We show that this improvement is due to our choice of architecture, which lets us do unsupervised learning and performs significantly better in identifying small objects.
Submitted 28 June, 2018;
originally announced June 2018.
-
Deep Recurrent Neural Networks for Product Attribute Extraction in eCommerce
Authors:
Bodhisattwa Prasad Majumder,
Aditya Subramanian,
Abhinandan Krishnan,
Shreyansh Gandhi,
Ajinkya More
Abstract:
Extracting accurate attribute qualities from product titles is a vital component in delivering eCommerce customers a rewarding online shopping experience via an enriched faceted search. We demonstrate the potential of Deep Recurrent Networks in this domain, primarily models such as Bidirectional LSTMs and Bidirectional LSTM-CRF with or without an attention mechanism. These have improved overall F1 scores, as compared to the previous benchmarks (More et al.), by at least 0.0391, showcasing an overall precision of 97.94%, recall of 94.12% and an F1 score of 0.9599. This has allowed us to achieve significant coverage of important facets or attributes of products, which not only shows the efficacy of deep recurrent models over previous machine learning benchmarks but also greatly enhances the overall customer experience while shopping online.
Submitted 29 March, 2018;
originally announced March 2018.
-
Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline
Authors:
Szu-Jui Chen,
Aswin Shanmugam Subramanian,
Hainan Xu,
Shinji Watanabe
Abstract:
This paper describes a new baseline system for automatic speech recognition (ASR) in the CHiME-4 challenge to promote the development of noisy ASR in speech processing communities by providing 1) a state-of-the-art system with a simplified single system comparable to the complicated top systems in the challenge, and 2) a publicly available and reproducible recipe through the main repository in the Kaldi speech recognition toolkit. The proposed system adopts generalized eigenvalue beamforming with bidirectional long short-term memory (LSTM) mask estimation. We also propose to use a time delay neural network (TDNN) based on the lattice-free version of the maximum mutual information (LF-MMI) trained with augmented data from all six microphones plus the enhanced data after beamforming. Finally, we use an LSTM language model for lattice and n-best re-scoring. The final system achieved 2.74% WER for the real test set in the 6-channel track, which corresponds to the 2nd place in the challenge. In addition, the proposed baseline recipe includes four different speech enhancement measures for the simulation test set: short-time objective intelligibility measure (STOI), extended STOI (eSTOI), perceptual evaluation of speech quality (PESQ) and speech distortion ratio (SDR). Thus, the recipe also provides an experimental platform for speech enhancement studies with these performance measures.
Submitted 27 March, 2018;
originally announced March 2018.
-
Student-Teacher Learning for BLSTM Mask-based Speech Enhancement
Authors:
Aswin Shanmugam Subramanian,
Szu-Jui Chen,
Shinji Watanabe
Abstract:
Spectral mask estimation using bidirectional long short-term memory (BLSTM) neural networks has been widely used in various speech enhancement applications, and it has achieved great success when applied to multichannel enhancement techniques with a mask-based beamformer. However, when these masks are used for single channel speech enhancement they severely distort the speech signal and make it unsuitable for speech recognition. This paper proposes a student-teacher learning paradigm for single channel speech enhancement. The beamformed signal from multichannel enhancement is given as input to the teacher network to obtain soft masks. An additional cross-entropy loss term with the soft mask target is combined with the original loss, so that the student network with single-channel input is trained to mimic the soft mask obtained with multichannel input through beamforming. Experiments with the CHiME-4 challenge single channel track data show improvement in ASR performance.
Submitted 27 March, 2018;
originally announced March 2018.
-
A Hybrid Heuristic for a Broad Class of Vehicle Routing Problems with Heterogeneous Fleet
Authors:
Puca Huachi Vaz Penna,
Anand Subramanian,
Luiz Satoru Ochi,
Thibaut Vidal,
Christian Prins
Abstract:
We consider a family of Rich Vehicle Routing Problems (RVRP) which have the particularity to combine a heterogeneous fleet with other attributes, such as backhauls, multiple depots, split deliveries, site dependency, open routes, duration limits, and time windows. To efficiently solve these problems, we propose a hybrid metaheuristic which combines an iterated local search with variable neighborhood descent, for solution improvement, and a set partitioning formulation, to exploit the memory of the past search. Moreover, we investigate a class of combined neighborhoods which jointly modify the sequences of visits and perform either heuristic or optimal reassignments of vehicles to routes. To the best of our knowledge, this is the first unified approach for a large class of heterogeneous fleet RVRPs, capable of solving more than 12 problem variants. The efficiency of the algorithm is evaluated on 643 well-known benchmark instances, and 71.70% of the best known solutions are either retrieved or improved. Moreover, the proposed metaheuristic, which can be considered as a matheuristic, produces high quality solutions with low standard deviation in comparison with previous methods. Finally, we observe that the use of combined neighborhoods does not lead to significant quality gains. Contrary to intuition, the computational effort seems better spent on more intensive route optimization rather than on more intelligent and frequent fleet re-assignments.
Submitted 5 March, 2018;
originally announced March 2018.
-
SPINE: SParse Interpretable Neural Embeddings
Authors:
Anant Subramanian,
Danish Pruthi,
Harsh Jhamtani,
Taylor Berg-Kirkpatrick,
Eduard Hovy
Abstract:
Prediction without justification has limited utility. Much of the success of neural models can be attributed to their ability to learn rich, dense and expressive representations. While these representations capture the underlying complexity and latent trends in the data, they are far from being interpretable. We propose a novel variant of denoising k-sparse autoencoders that generates highly efficient and interpretable distributed word representations (word embeddings), beginning with existing word representations from state-of-the-art methods like GloVe and word2vec. Through large scale human evaluation, we report that our resulting word embeddings are much more interpretable than the original GloVe and word2vec embeddings. Moreover, our embeddings outperform existing popular word embeddings on a diverse suite of benchmark downstream tasks.
Submitted 23 November, 2017;
originally announced November 2017.
-
Spin order and dynamics in the diamond-lattice Heisenberg antiferromagnets CuRh2O4 and CoRh2O4
Authors:
L. Ge,
J. Flynn,
J. A. M. Paddison,
M. B. Stone,
S. Calder,
M. A. Subramanian,
A. P. Ramirez,
M. Mourigal
Abstract:
Antiferromagnetic insulators on the diamond lattice are candidate materials to host exotic magnetic phenomena ranging from spin-orbital entanglement to degenerate spiral ground-states and topological paramagnetism. Compared to other three-dimensional networks of magnetic ions, such as the geometrically frustrated pyrochlore lattice, the investigation of diamond-lattice magnetism in real materials is less mature. In this work, we characterize the magnetic properties of model A-site spinels CoRh2O4 (cobalt rhodite) and CuRh2O4 (copper rhodite) by means of thermo-magnetic and neutron scattering measurements and perform group theory analysis, Rietveld refinement, mean-field theory, and spin wave theory calculations to analyze the experimental results. Our investigation reveals that cubic CoRh2O4 is a canonical S=3/2 diamond-lattice Heisenberg antiferromagnet with a nearest neighbor exchange J = 0.63 meV and a Néel ordered ground-state below a temperature of 25 K. In tetragonally distorted CuRh2O4, competing exchange interactions between up to third nearest-neighbor spins lead to the development of an incommensurate spin helix at 24 K with a magnetic propagation vector k = (0,0,0.79). Strong reduction of the ordered moment is observed for the S=1/2 spins in CuRh2O4 and captured by our 1/S corrections to the staggered magnetization. Our work identifies CoRh2O4 and CuRh2O4 as reference materials to guide future work searching for exotic quantum behavior in diamond-lattice antiferromagnets.
Submitted 19 June, 2017;
originally announced June 2017.
-
High-throughput validation of ceRNA regulatory networks
Authors:
Hua-Sheng Chiu,
María Rodríguez Martínez,
Mukesh Bansal,
Aravind Subramanian,
Todd R. Golub,
Xuerui Yang,
Pavel Sumazin,
Andrea Califano
Abstract:
Background: MicroRNAs (miRNAs) play multiple roles in tumor biology [1]. Interestingly, reports from multiple groups suggest that miRNA targets may be coupled through competitive stoichiometric sequestration [2]. Specifically, computational models predicted [3, 4] and experimental assays confirmed [5, 6] that miRNA activity is dependent on miRNA target abundance, and consequently, changes to the abundance of some miRNA targets lead to changes to the regulation and abundance of their other targets. The resulting indirect regulatory influence between miRNA targets resembles competition and has been dubbed competitive endogenous RNA (ceRNA) [5, 7, 8]. Recent studies have questioned the physiological relevance of ceRNA interactions [9], researchers' ability to accurately predict these interactions [10], and the number of genes that are impacted by ceRNA interactions in specific cellular contexts [11]. Results: To address these concerns, we reverse engineered ceRNA networks (ceRNETs) in breast and prostate adenocarcinomas using context-specific TCGA profiles [12-14], and tested whether ceRNA interactions can predict the effects of RNAi-mediated gene silencing perturbations in PC3 and MCF7 cells. Our results, based on tests of thousands of inferred ceRNA interactions that are predicted to alter hundreds of cancer genes in each of the two tumor contexts, confirmed statistically significant effects for half of the predicted targets. Conclusions: Our results suggest that the expression of a significant fraction of cancer genes may be regulated by ceRNA interactions in each of the two tumor contexts.
Submitted 31 January, 2017;
originally announced January 2017.
-
Super-giant magnetoresistance at room-temperature in copper nanowires due to magnetic field modulation of potential barrier heights at nanowire-contact interfaces
Authors:
Md. I. Hossain,
M. Maksud,
N. K. R. Palapati,
A. Subramanian,
J. Atulasimha,
S. Bandyopadhyay
Abstract:
We have observed a super-giant (~10,000,000%) negative magnetoresistance at 39 mT field in Cu nanowires contacted with Au contact pads. In these nanowires, potential barriers form at the two Cu/Au interfaces because of Cu oxidation that results in an ultrathin copper oxide layer forming between Cu and Au. Current flows when electrons tunnel through, and/or thermionically emit over, these barriers. A magnetic field applied transverse to the direction of current flow along the wire deflects electrons toward one edge of the wire because of the Lorentz force, causing electron accumulation at that edge and depletion at the other. This lowers the potential barrier at the accumulated edge and raises it at the depleted edge, causing a super-giant magnetoresistance at room temperature.
Submitted 24 July, 2016;
originally announced July 2016.
-
A heuristic algorithm for a single vehicle static bike sharing rebalancing problem
Authors:
Fábio Cruz,
Anand Subramanian,
Bruno P. Bruck,
Manuel Iori
Abstract:
The static bike rebalancing problem (SBRP) concerns the task of repositioning bikes among stations in self-service bike-sharing systems. This problem can be seen as a variant of the one-commodity pickup and delivery vehicle routing problem, where multiple visits are allowed to be performed at each station, i.e., the demand of a station is allowed to be split. Moreover, a vehicle may temporarily drop its load at a station, leaving it in excess or, alternatively, collect more bikes from a station (even all of them), thus leaving it in default. Both cases require further visits in order to meet the actual demands of such station. This paper deals with a particular case of the SBRP, in which only a single vehicle is available and the objective is to find a least-cost route that meets the demand of all stations and does not violate the minimum (zero) and maximum (vehicle capacity) load limits along the tour. Therefore, the number of bikes to be collected or delivered at each station should be appropriately determined in order to respect such constraints. We propose an iterated local search (ILS) based heuristic to solve the problem. The ILS algorithm was tested on 980 benchmark instances from the literature and the results obtained are quite competitive when compared to other existing methods. Moreover, our heuristic was capable of finding most of the known optimal solutions and also of improving the results on a number of open instances.
Submitted 3 May, 2016; v1 submitted 2 May, 2016;
originally announced May 2016.
-
Transition from reconstruction towards thin film on the (110) surface of strontium titanate
Authors:
Z. Wang,
A. Loon,
A. Subramanian,
S. Gerhold,
E. McDermott,
J. A. Enterkin,
M. Hieckel,
B. C. Russell,
R. J. Green,
A. Moewes,
J. Guo,
P. Blaha,
M. R. Castell,
U. Diebold,
L. D. Marks
Abstract:
The surfaces of metal oxides are often reconstructed with a geometry and composition that is considerably different from a simple termination of the bulk. Such structures can also be viewed as ultrathin films, epitaxed on a substrate. Here, the reconstructions of the SrTiO3 (110) surface are studied combining scanning tunneling microscopy, transmission electron diffraction, and X-ray absorption spectroscopy, and analyzed with density functional theory calculations. While SrTiO3 (110) invariably terminates with an overlayer of titania, with increasing density its structure switches from nx1 to 2xn. At the same time the coordination of the Ti atoms changes from a network of corner-sharing tetrahedra to a double layer of edge-shared octahedra with bridging units of octahedrally coordinated strontium. This transition from the nx1 to 2xn reconstructions is a transition from a pseudomorphically stabilized tetrahedral network towards an octahedral titania thin film with stress-relief from octahedral strontia units at the surface.
Submitted 10 February, 2016;
originally announced February 2016.
-
Construction of Near-Capacity Protograph LDPC Code Sequences with Block-Error Thresholds
Authors:
Asit Kumar Pradhan,
Andrew Thangaraj,
Arunkumar Subramanian
Abstract:
Density evolution for protograph Low-Density Parity-Check (LDPC) codes is considered, and it is shown that the message-error rate falls double-exponentially with iterations whenever the degree-2 subgraph of the protograph is cycle-free and noise level is below threshold. Conditions for stability of protograph density evolution are established and related to the structure of the protograph. Using large-girth graphs, sequences of protograph LDPC codes with block-error threshold equal to bit-error threshold and block-error rate falling near-exponentially with blocklength are constructed deterministically. Small-sized protographs are optimized to obtain thresholds near capacity for binary erasure and binary-input Gaussian channels.
Submitted 23 October, 2015;
originally announced October 2015.
-
A unified heuristic and an annotated bibliography for a large class of earliness-tardiness scheduling problems
Authors:
Arthur Kramer,
Anand Subramanian
Abstract:
This work proposes a unified heuristic algorithm for a large class of earliness-tardiness (E-T) scheduling problems. We consider single/parallel machine E-T problems that may or may not consider some additional features such as idle time, setup times and release dates. In addition, we also consider those problems whose objective is to minimize either the total (average) weighted completion time or the total (average) weighted flow time, which arise as particular cases when the due dates of all jobs are either set to zero or to their associated release dates, respectively. The developed local search based metaheuristic framework is quite simple, but at the same time relies on sophisticated procedures for efficiently performing local search according to the characteristics of the problem. We present efficient move evaluation approaches for some parallel machine problems that generalize the existing ones for single machine problems. The algorithm was tested in hundreds of instances of several E-T problems and particular cases. The results obtained show that our unified heuristic is capable of producing high quality solutions when compared to the best ones available in the literature that were obtained by specific methods. Moreover, we provide an extensive annotated bibliography on the problems related to those considered in this work, where we not only indicate the approach(es) used in each publication, but we also point out the characteristics of the problem(s) considered. Beyond that, we classify the existing methods in different categories so as to have a better idea of the popularity of each type of solution procedure.
Submitted 10 January, 2017; v1 submitted 8 September, 2015;
originally announced September 2015.
-
Efficient local search limitation strategy for single machine total weighted tardiness scheduling with sequence-dependent setup times
Authors:
Anand Subramanian,
Katyanne Farias
Abstract:
This paper concerns the single machine total weighted tardiness scheduling with sequence-dependent setup times, usually referred as $1|s_{ij}|\sum w_jT_j$. In this $\mathcal{NP}$-hard problem, each job has an associated processing time, due date and a weight. For each pair of jobs $i$ and $j$, there may be a setup time before starting to process $j$ in case this job is scheduled immediately after…
▽ More
This paper concerns the single machine total weighted tardiness scheduling with sequence-dependent setup times, usually referred as $1|s_{ij}|\sum w_jT_j$. In this $\mathcal{NP}$-hard problem, each job has an associated processing time, due date and a weight. For each pair of jobs $i$ and $j$, there may be a setup time before starting to process $j$ in case this job is scheduled immediately after $i$. The objective is to determine a schedule that minimizes the total weighted tardiness, where the tardiness of a job is equal to its completion time minus its due date, in case the job is completely processed only after its due date, and is equal to zero otherwise. Due to its complexity, this problem is most commonly solved by heuristics. The aim of this work is to develop a simple yet effective limitation strategy that speeds up the local search procedure without a significant loss in the solution quality. Such strategy consists of a filtering mechanism that prevents unpromising moves to be evaluated. The proposed strategy has been embedded in a local search based metaheuristic from the literature and tested in classical benchmark instances. Computational experiments revealed that the limitation strategy enabled the metaheuristic to be extremely competitive when compared to other algorithms from the literature, since it allowed the use of a large number of neighborhood structures without a significant increase in the CPU time and, consequently, high quality solutions could be achieved in a matter of seconds. In addition, we analyzed the effectiveness of the proposed strategy in two other well-known metaheuristics. Further experiments were also carried out on benchmark instances of problem $1|s_{ij}|\sum T_j$.
Submitted 30 November, 2015; v1 submitted 23 January, 2015;
originally announced January 2015.
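The objective described in the scheduling abstract above can be illustrated with a minimal sketch. The function name `total_weighted_tardiness`, the list-based data layout, and the three-job toy instance are assumptions made for illustration and do not come from the paper; the sketch also assumes that no setup is incurred before the first job.

```python
from typing import Sequence


def total_weighted_tardiness(
    order: Sequence[int],
    proc: Sequence[int],
    due: Sequence[int],
    weight: Sequence[int],
    setup: Sequence[Sequence[int]],
) -> int:
    """Evaluate sum_j w_j * T_j for a given job sequence.

    setup[i][j] is the setup time incurred when job j directly follows job i.
    Assumption: the first job in the sequence has no setup time.
    """
    clock = 0
    total = 0
    prev = None
    for j in order:
        if prev is not None:
            clock += setup[prev][j]  # sequence-dependent setup
        clock += proc[j]             # completion time of job j
        tardiness = max(0, clock - due[j])
        total += weight[j] * tardiness
        prev = j
    return total


# Hypothetical toy instance with three jobs (not from the paper).
PROC = [3, 2, 4]
DUE = [4, 6, 5]
WEIGHT = [2, 1, 3]
SETUP = [
    [0, 1, 2],
    [1, 0, 1],
    [2, 1, 0],
]

if __name__ == "__main__":
    for order in ([0, 1, 2], [2, 0, 1]):
        print(order, total_weighted_tardiness(order, PROC, DUE, WEIGHT, SETUP))
```

A local search would call such an evaluation for each candidate move; the filtering mechanism mentioned in the abstract aims to avoid evaluating moves that are unlikely to improve this value.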
-
A speed and departure time optimization algorithm for the Pollution-Routing Problem
Authors:
Raphael Kramer,
Nelson Maculan,
Anand Subramanian,
Thibaut Vidal
Abstract:
We propose a new speed and departure time optimization algorithm for the Pollution-Routing Problem (PRP), which runs in quadratic time and returns a certified optimal schedule. This algorithm is embedded into an iterated local search-based metaheuristic to achieve a combined speed, scheduling and routing optimization. The start of the working day is set as a decision variable for individual routes, thus enabling a better assignment of human resources to required demands. Some routes that were evaluated as unprofitable can now appear as viable candidates later in the day, leading to a larger search space and further opportunities for distance optimization via better service consolidation. Extensive computational experiments on available PRP benchmark instances demonstrate the good performance of the algorithms. The flexible departure times from the depot help reduce the operational costs by 8.36% on the considered instances.
Submitted 31 March, 2018; v1 submitted 21 January, 2015;
originally announced January 2015.
-
Hybrid Metaheuristics for the Clustered Vehicle Routing Problem
Authors:
Thibaut Vidal,
Maria Battarra,
Anand Subramanian,
Güneş Erdoǧan
Abstract:
The Clustered Vehicle Routing Problem (CluVRP) is a variant of the Capacitated Vehicle Routing Problem in which customers are grouped into clusters. Each cluster has to be visited once, and a vehicle entering a cluster cannot leave it until all customers have been visited. This article presents two alternative hybrid metaheuristic algorithms for the CluVRP. The first algorithm is based on an Iterated Local Search algorithm, in which only feasible solutions are explored and problem-specific local search moves are utilized. The second algorithm is a Hybrid Genetic Search, for which the shortest Hamiltonian path between each pair of vertices within each cluster should be precomputed. Using this information, a sequence of clusters can be used as a solution representation and large neighborhoods can be efficiently explored by means of bi-directional dynamic programming, sequence concatenations, and appropriate data structures. Extensive computational experiments are performed on benchmark instances from the literature, as well as new large scale ones. Recommendations on promising algorithm choices are provided relative to average cluster size.
Submitted 26 April, 2014;
originally announced April 2014.
-
A matheuristic approach for the Pollution-Routing Problem
Authors:
Raphael Kramer,
Anand Subramanian,
Thibaut Vidal,
Lucídio dos Anjos Formiga Cabral
Abstract:
This paper deals with the Pollution-Routing Problem (PRP), a Vehicle Routing Problem (VRP) with environmental considerations, recently introduced in the literature by [Bektas and Laporte (2011), Transport. Res. B-Meth. 45 (8), 1232-1250]. The objective is to minimize operational and environmental costs while respecting capacity constraints and service time windows. Costs are based on driver wages and fuel consumption, which depends on many factors, such as travel distance and vehicle load. The vehicle speeds are considered as decision variables. They complement routing decisions, impacting the total cost, the travel time between locations, and thus the set of feasible routes. We propose a method which combines a local search-based metaheuristic with an integer programming approach over a set covering formulation and a recursive speed-optimization algorithm. This hybridization enables a tighter integration of route and speed decisions. Moreover, two other "green" VRP variants, the Fuel Consumption VRP (FCVRP) and the Energy Minimizing VRP (EMVRP), are addressed. The proposed method compares very favorably with previous algorithms from the literature and many new improved solutions are reported.
Submitted 18 April, 2014;
originally announced April 2014.
-
On Solving Manufacturing Cell Formation via Bicluster Editing
Authors:
Rian G. S. Pinheiro,
Ivan C. Martins,
Fábio Protti,
Luiz S. Ochi,
Luidi G. Simonetti,
Anand Subramanian
Abstract:
This work investigates the Bicluster Graph Editing Problem (BGEP) and how it can be applied to solve the Manufacturing Cell Formation Problem (MCFP). We develop an exact method for the BGEP that consists of a Branch-and-Cut approach combined with a special separation algorithm based on dynamic programming. We also describe a new preprocessing procedure for the BGEP derived from theoretical results on vertex distances in the input graph. Computational experiments performed on randomly generated instances with various levels of difficulty show that our separation algorithm accelerates the convergence speed, and our preprocessing procedure is effective for low density instances. Another contribution of this work is to reveal the similarities between the BGEP and the MCFP. We show that the BGEP and the MCFP have the same solution space. This fact leads to the proposal of two new exact approaches for the MCFP based on mathematical formulations for the BGEP. Both approaches use the grouping efficacy measure as the objective function. To the authors' knowledge, these are the first exact methods that employ such a measure to optimally solve instances of the MCFP. The first approach consists of iteratively running several calls to a parameterized version of the BGEP, and the second is a linearization of a new fractional-linear model for the MCFP. Computational experiments performed on instances of the MCFP found in the literature show that our exact methods for the MCFP are able to prove several previously unknown optima.
Submitted 11 December, 2013;
originally announced December 2013.
-
Offline and Online Incentive Mechanism Design for Smart-phone Crowd-sourcing
Authors:
Ashwin Subramanian,
G Sai Kanth,
Rahul Vaze
Abstract:
In this paper, we consider the problem of incentive mechanism design for smart-phone crowd-sourcing. Each user participating in crowd-sourcing submits a set of tasks it can accomplish and its corresponding bid. The platform then selects the users and their payments to maximize its utility while ensuring truthfulness, individual rationality, profitability, and polynomial algorithm complexity. Both the offline and the online scenarios are considered: in the offline case, all users submit their profiles simultaneously, while in the online case they do so sequentially, and the decision whether to accept or reject each user is made instantaneously with no revocation. The proposed algorithms for both the offline and the online case are shown to satisfy all four desired properties of an efficient auction. Through extensive simulation, the performance of the offline and the online algorithms is also compared.
Submitted 7 October, 2013;
originally announced October 2013.
-
Deterministic Constructions for Large Girth Protograph LDPC Codes
Authors:
Asit Kumar Pradhan,
Arunkumar Subramanian,
Andrew Thangaraj
Abstract:
The bit-error threshold of the standard ensemble of Low Density Parity Check (LDPC) codes is known to be close to capacity, if there is a non-zero fraction of degree-two bit nodes. However, the degree-two bit nodes preclude the possibility of a block-error threshold. Interestingly, LDPC codes constructed using protographs allow the possibility of having both degree-two bit nodes and a block-error threshold. In this paper, we analyze density evolution for protograph LDPC codes over the binary erasure channel and show that their bit-error probability decreases double exponentially with the number of iterations when the erasure probability is below the bit-error threshold and long chains of degree-two variable nodes are avoided in the protograph. We present deterministic constructions of such protograph LDPC codes with girth logarithmic in blocklength, resulting in an exponential fall in bit-error probability below the threshold. We provide optimized protographs, whose block-error thresholds are better than that of the standard ensemble with minimum bit-node degree three. These protograph LDPC codes are theoretically of great interest, and have applications, for instance, in coding with strong secrecy over wiretap channels.
Submitted 21 May, 2013; v1 submitted 26 January, 2013;
originally announced January 2013.
-
Topologies and Price of Stability of Complex Strategic Networks with Localized Payoffs : Analytical and Simulation Studies
Authors:
Rohith Dwarakanath Vallam,
C. A. Subramanian,
Ramasuri Narayanam,
Y. Narahari,
Srinath Narasimha
Abstract:
We analyze a network formation game in a strategic setting where payoffs of individuals depend only on their immediate neighbourhood. We call these localized payoffs. In this game, the payoff of each individual captures (1) the gain from immediate neighbors, (2) the bridging benefits, and (3) the cost to form links. This implies that the payoff of each individual can be computed using only its single-hop neighbourhood information. Based on this simple model of network formation, our study explores the structure of networks that form, satisfying one or both of the properties, namely, pairwise stability and efficiency. We analytically prove the pairwise stability of several interesting network structures, notably, the complete bi-partite network, complete equi-k-partite network, complete network and cycle network, under various configurations of the model. We validate and extend these results through extensive simulations. We characterize topologies of efficient networks by drawing upon classical results from extremal graph theory and discover that the Turan graph (or the complete equi-bi-partite network) is the unique efficient network under many configurations of parameters. We examine the tradeoffs between topologies of pairwise stable networks and efficient networks using the notion of price of stability, which is the ratio of the sum of payoffs of the players in an optimal pairwise stable network to that of an efficient network. Interestingly, we find that the price of stability is equal to 1 for almost all configurations of parameters in the proposed model; for the remaining configurations, we obtain a lower bound of 0.5 on the price of stability. This leads to another key insight of this paper: under mild conditions, efficient networks will form when strategic individuals choose to add or delete links based on only localized payoffs.
Submitted 30 December, 2011;
originally announced January 2012.
-
Strong Secrecy on the Binary Erasure Wiretap Channel Using Large-Girth LDPC Codes
Authors:
Arunkumar Subramanian,
Andrew Thangaraj,
Matthieu Bloch,
Steven W. McLaughlin
Abstract:
For an arbitrary degree distribution pair (DDP), we construct a sequence of low-density parity-check (LDPC) code ensembles with girth growing logarithmically in block-length using Ramanujan graphs. When the DDP has minimum left degree at least three, we show using density evolution analysis that the expected bit-error probability of these ensembles, when passed through a binary erasure channel with erasure probability $ε$, decays as $\mathcal{O}(\exp(-c_1 n^{c_2}))$ with the block-length $n$ for positive constants $c_1$ and $c_2$, as long as $ε$ is less than the erasure threshold $ε_\mathrm{th}$ of the DDP. This guarantees that the coset coding scheme using the dual sequence provides strong secrecy over the binary erasure wiretap channel for erasure probabilities greater than $1 - ε_\mathrm{th}$.
Submitted 22 February, 2011; v1 submitted 16 September, 2010;
originally announced September 2010.
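The role of the erasure threshold in the abstract above can be illustrated with the standard density-evolution recursion on the binary erasure channel. The sketch below assumes a regular $(l, r)$ ensemble with $l = 3$ and $r = 6$, which is only a special case of the arbitrary degree distribution pairs treated in the paper; the function name `density_evolution_bec` and the iteration count are hypothetical choices. The recursion is $x_{t+1} = ε\,(1 - (1 - x_t)^{r-1})^{l-1}$ with $x_0 = ε$.

```python
def density_evolution_bec(eps: float, l: int = 3, r: int = 6, iters: int = 2000) -> float:
    """Iterate density evolution for a regular (l, r) LDPC ensemble on the BEC.

    Returns the erasure probability of a variable-to-check message after
    `iters` iterations, starting from the channel erasure probability `eps`.
    Assumption: regular ensemble, tree-like (cycle-free) neighbourhoods.
    """
    x = eps
    for _ in range(iters):
        # A check-to-variable message is erased unless all r-1 other inputs are known;
        # a variable-to-check message is erased if the channel and all l-1 other inputs erase.
        x = eps * (1.0 - (1.0 - x) ** (r - 1)) ** (l - 1)
    return x


if __name__ == "__main__":
    for eps in (0.40, 0.45):
        print(eps, density_evolution_bec(eps))
```

For this regular ensemble the erasure probability is driven to zero at $ε = 0.40$ and stalls at a non-zero fixed point at $ε = 0.45$, consistent with a threshold lying between the two values.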
-
Strong Secrecy for Erasure Wiretap Channels
Authors:
Ananda T. Suresh,
Arunkumar Subramanian,
Andrew Thangaraj,
Matthieu Bloch,
Steven McLaughlin
Abstract:
We show that duals of certain low-density parity-check (LDPC) codes, when used in a standard coset coding scheme, provide strong secrecy over the binary erasure wiretap channel (BEWC). This result hinges on a stopping set analysis of ensembles of LDPC codes with block length $n$ and girth $\geq 2k$, for some $k \geq 2$. We show that if the minimum left degree of the ensemble is $l_\mathrm{min}$, the expected probability of block error is $\mathcal{O}(\frac{1}{n^{\lceil l_\mathrm{min} k /2 \rceil - k}})$ when the erasure probability $ε < ε_\mathrm{ef}$, where $ε_\mathrm{ef}$ depends on the degree distribution of the ensemble. As long as $l_\mathrm{min} > 2$ and $k > 2$, the dual of this LDPC code provides strong secrecy over a BEWC of erasure probability greater than $1 - ε_\mathrm{ef}$.
Submitted 30 April, 2010;
originally announced April 2010.
-
Suppression of multiferroic order in hexagonal YMn1-xInxO3 ceramics
Authors:
A. Dixit,
Andrew E. Smith,
M. A. Subramanian,
G. Lawes
Abstract:
We have investigated the effects of substituting In for Mn on the antiferromagnetic phase transition in YMnO3 using magnetic, dielectric, and specific heat measurements. We prepared a set of isostructural phase pure hexagonal YMn$_{1-x}$In$_{x}$O$_{3}$ samples having x=0 to x=0.9, which exhibit a systematic decrease of the antiferromagnetic ordering temperature with increasing In content. The multiferroic phase, which develops below TN, appears to be completely suppressed for x$\geq$0.5 in the temperature range investigated, which can be attributed solely to the dilution of magnetic interactions as the crystal structure remains hexagonal. Similar to previous reports, we find an enhancement of the magnetocapacitive coupling on dilution with non-magnetic ions.
Submitted 15 December, 2009;
originally announced December 2009.
-
Structure and morphology of hydroxylated nickel oxide (111) surfaces
Authors:
J. Ciston,
A. Subramanian,
D. M. Kienzle,
L. D. Marks
Abstract:
We report an experimental and theoretical analysis of the sqrt(3)x sqrt(3)-R30 and 2x2 reconstructions on the NiO (111) surface combining transmission electron microscopy, x-ray photoelectron spectroscopy, and reasonably accurate density functional calculations using the meta-GGA hybrid functional TPSSh. While the main focus here is on the surface structure, we also observe an unusual step morphology with terraces containing only even numbers of unit cells during annealing of the surfaces. The experimental data clearly show that the surfaces contain significant coverage of hydroxyl terminations, and the surface structures are essentially the same as those reported on the MgO (111) surface, implying an identical kinetically-limited water-driven structural transition pathway. The octapole structure can therefore be all but ruled out for single crystals of NiO annealed in or transported through humid air. The theoretical analysis indicates, as expected, that simple density functional theory methods for such strongly-correlated oxide surfaces are marginal, while better consideration of the metal d-electrons has a large effect, although it is still not perfect.
Submitted 13 August, 2009;
originally announced August 2009.
-
Magnetically tunable dielectric materials
Authors:
G. Lawes,
T. Kimura,
C. M. Varma,
M. A. Subramanian,
R. J. Cava,
A. P. Ramirez
Abstract:
The coupling between localized spins and phonons can lead to shifts in the dielectric constant of insulating materials at magnetic ordering transitions. Studies on isostructural SeCuO3 (ferromagnetic) and TeCuO3 (antiferromagnetic) illustrate how the q-dependent spin-spin correlation function couples to phonon frequencies leading to a shift in the dielectric constant. A model is discussed for this spin-phonon coupling. The magnetodielectric coupling in multiferroic materials can be very large at a ferroelectric transition temperature. This coupling is investigated in the recently identified multiferroic Ni3V2O8.
Submitted 13 April, 2009;
originally announced April 2009.
-
MDS codes on the erasure-erasure wiretap channel
Authors:
Arunkumar Subramanian,
Steven W. McLaughlin
Abstract:
This paper considers the problem of perfectly secure communication on a modified version of Wyner's wiretap channel II where both the main and wiretapper's channels have some erasures. A secret message is to be encoded into $n$ channel symbols and transmitted. The main channel is such that the legitimate receiver receives the transmitted codeword with exactly $n - ν$ erasures, where the positions of the erasures are random. Additionally, an eavesdropper (wire-tapper) is able to observe the transmitted codeword with $n - μ$ erasures in a similar fashion. This paper studies the maximum achievable information rate with perfect secrecy on this channel and gives a coding scheme using nested codes that achieves the secrecy capacity.
Submitted 18 February, 2009;
originally announced February 2009.
-
Charge Density Refinement of the Si (111) 7x7 Surface
Authors:
J. Ciston,
A. Subramanian,
I. K. Robinson,
L. D. Marks
Abstract:
We report an experimental refinement of the local charge density at the Si (111) 7x7 surface utilizing a combination of x-ray and high energy electron diffraction. By perturbing about a bond-centered pseudoatom model, we find experimentally that the adatoms are in an anti-bonding state with the atoms directly below. We are also able to experimentally refine a charge transfer of 0.26(4) e- from each adatom site to the underlying layers. These results are compared with a full-potential all-electron density functional theory (DFT) calculation.
Submitted 20 January, 2009;
originally announced January 2009.
-
Effect of oxygen concentration on the structural and magnetic properties of LaRh1/2Mn1/2O3 thin films
Authors:
W. C. Sheets,
A. E. Smith,
M. A. Subramanian,
W. Prellier
Abstract:
Epitaxial LaRh1/2Mn1/2O3 thin films have been grown on (001)-oriented LaAlO3 and SrTiO3 substrates using pulsed laser deposition. The optimized thin film samples are semiconducting and ferromagnetic with a Curie temperature close to 100 K, a coercive field of 1200 Oe, and a saturation magnetization of 1.7 μB per formula unit. The surface texture, structural, electrical, and magnetic properties of the LaRh1/2Mn1/2O3 films were examined as a function of the oxygen concentration during deposition. While an elevated oxygen concentration yields thin films with optimal magnetic properties, slightly lower oxygen concentrations result in films with improved texture and crystallinity.
Submitted 10 December, 2008;
originally announced December 2008.
-
Hydroxylated MgO (111) reconstructions: why the case for clean surfaces does not hold water
Authors:
J. Ciston,
A. Subramanian,
L. D. Marks
Abstract:
We report an experimental and theoretical analysis of the root(3)xroot(3)-R30 and 2x2 reconstructions on the MgO (111) surface combining transmission electron microscopy, x-ray photoelectron spectroscopy, and reasonably accurate density functional calculations using the meta-GGA functional TPSS. The experimental data clearly shows that the surfaces contain significant coverages of hydroxyl terminations, even after UHV annealing, and as such cannot be the structures which have been previously reported. For the 2x2 surfaces a relatively simple structural framework is detailed which fits all the experimental and theoretical data. For the root(3)xroot(3) there turn out to be two plausible structures and neither the experimental nor theoretical results can differentiate between the two within error. However, by examining the conditions under which the surface is formed we describe a kinetic route for the transformation between the different reconstructions that involves mobile hydroxyl groups and protons, and relatively immobile cations, which strongly suggests only one of the two root(3)xroot(3) structures.
Submitted 15 September, 2008;
originally announced September 2008.