Search | arXiv e-print repository

RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses

Authors: Sahil Girhepuje, Siva Sankar Sajeev, Purvam Jain, Arya Sikder, Adithya Rama Varma, Ryan George, Akshay Govind Srinivasan, Mahendra Kurup, Ashmit Sinha, Sudip Mondal

Abstract: Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receivin… ▽ More Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receiving the actual results from each individual call. EnChAnT, an open-source solution, leverages an LLM format enforcer, OpenChat 3.5 (an LLM), and ToolBench's API Retriever. RE-GAINS utilizes OpenAI models and embeddings with a specialized prompt based on the $\underline{R}$easoning vi$\underline{a}$ $\underline{P}$lanning $(RAP)$ framework. Both frameworks are low cost (0.01\$ per query). Our key contribution is enabling LLMs for tool invocation and chaining using modifiable, externally described tools. △ Less

Submitted 20 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

arXiv:2312.13451 [pdf, other]

Learning the Factors Controlling Mineralization for Geologic Carbon Sequestration

Authors: Aleksandra Pachalieva, Jeffrey D. Hyman, Daniel O'Malley, Hari Viswanathan, Gowri Srinivasan

Abstract: We perform a set of flow and reactive transport simulations within three-dimensional fracture networks to learn the factors controlling mineral reactions. CO$_2$ mineralization requires CO$_2$-laden water, dissolution of a mineral that then leads to precipitation of a CO$_2$-bearing mineral. Our discrete fracture networks (DFN) are partially filled with quartz that gradually dissolves until it rea… ▽ More We perform a set of flow and reactive transport simulations within three-dimensional fracture networks to learn the factors controlling mineral reactions. CO$_2$ mineralization requires CO$_2$-laden water, dissolution of a mineral that then leads to precipitation of a CO$_2$-bearing mineral. Our discrete fracture networks (DFN) are partially filled with quartz that gradually dissolves until it reaches a quasi-steady state. At the end of the simulation, we measure the quartz remaining in each fracture within the domain. We observe that a small backbone of fracture exists, where the quartz is fully dissolved which leads to increased flow and transport. However, depending on the DFN topology and the rate of dissolution, we observe a large variability of these changes, which indicates an interplay between the fracture network structure and the impact of geochemical dissolution. In this work, we developed a machine learning framework to extract the important features that support mineralization in the form of dissolution. In addition, we use structural and topological features of the fracture network to predict the remaining quartz volume in quasi-steady state conditions. As a first step to characterizing carbon mineralization, we study dissolution with this framework. We studied a variety of reaction and fracture parameters and their impact on the dissolution of quartz in fracture networks. We found that the dissolution reaction rate constant of quartz and the distance to the flowing backbone in the fracture network are the two most important features that control the amount of quartz left in the system. For the first time, we use a combination of a finite-volume reservoir model and graph-based approach to study reactive transport in a complex fracture network to determine the key features that control dissolution. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: 23 pages, 5 figures, 2 tables

arXiv:2308.03353 [pdf]

$\textit{In situ}$ electric-field control of ferromagnetic resonance in the low-loss organic-based ferrimagnet V[TCNE]$_{x\sim 2}$

Authors: Seth W. Kurfman, Andrew Franson, Piyush Shah, Yueguang Shi, Hil Fung Harry Cheung, Katherine E. Nygren, Mitchell Swyt, Kristen S. Buchanan, Gregory D. Fuchs, Michael E. Flatté, Gopalan Srinivasan, Michael Page, Ezekiel Johnston-Halperin

Abstract: We demonstrate indirect electric-field control of ferromagnetic resonance (FMR) in devices that integrate the low-loss, molecule-based, room-temperature ferrimagnet vanadium tetracyanoethylene (V[TCNE]$_{x \sim 2}$) mechanically coupled to PMN-PT piezoelectric transducers. Upon straining the V[TCNE]$_x$ films, the FMR frequency is tuned by more than 6 times the resonant linewidth with no change in… ▽ More We demonstrate indirect electric-field control of ferromagnetic resonance (FMR) in devices that integrate the low-loss, molecule-based, room-temperature ferrimagnet vanadium tetracyanoethylene (V[TCNE]$_{x \sim 2}$) mechanically coupled to PMN-PT piezoelectric transducers. Upon straining the V[TCNE]$_x$ films, the FMR frequency is tuned by more than 6 times the resonant linewidth with no change in Gilbert dam** for samples with $α= 6.5 \times 10^{-5}$. We show this tuning effect is due to a strain-dependent magnetic anisotropy in the films and find the magnetoelastic coefficient $|λ_S| \sim (1 - 4.4)$ ppm, backed by theoretical predictions from DFT calculations and magnetoelastic theory. Noting the rapidly expanding application space for strain-tuned FMR, we define a new metric for magnetostrictive materials, $\textit{magnetostrictive agility}$, given by the ratio of the magnetoelastic coefficient to the FMR linewidth. This agility allows for a direct comparison between magnetostrictive materials in terms of their comparative efficacy for magnetoelectric applications requiring ultra-low loss magnetic resonance modulated by strain. With this metric, we show V[TCNE]$_x$ is competitive with other magnetostrictive materials including YIG and Terfenol-D. This combination of ultra-narrow linewidth and magnetostriction in a system that can be directly integrated into functional devices without requiring heterogeneous integration in a thin-film geometry promises unprecedented functionality for electric-field tuned microwave devices ranging from low-power, compact filters and circulators to emerging applications in quantum information science and technology. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2305.03216 [pdf, other]

Near-realtime Facial Animation by Deep 3D Simulation Super-Resolution

Authors: Hyojoon Park, Sangeetha Grama Srinivasan, Matthew Cong, Doyub Kim, Byungsoo Kim, Jonathan Swartz, Ken Museth, Eftychios Sifakis

Abstract: We present a neural network-based simulation super-resolution framework that can efficiently and realistically enhance a facial performance produced by a low-cost, realtime physics-based simulation to a level of detail that closely approximates that of a reference-quality off-line simulator with much higher resolution (26x element count in our examples) and accurate physical modeling. Our approach… ▽ More We present a neural network-based simulation super-resolution framework that can efficiently and realistically enhance a facial performance produced by a low-cost, realtime physics-based simulation to a level of detail that closely approximates that of a reference-quality off-line simulator with much higher resolution (26x element count in our examples) and accurate physical modeling. Our approach is rooted in our ability to construct - via simulation - a training set of paired frames, from the low- and high-resolution simulators respectively, that are in semantic correspondence with each other. We use face animation as an exemplar of such a simulation domain, where creating this semantic congruence is achieved by simply dialing in the same muscle actuation controls and skeletal pose in the two simulators. Our proposed neural network super-resolution framework generalizes from this training set to unseen expressions, compensates for modeling discrepancies between the two simulations due to limited resolution or cost-cutting approximations in the real-time variant, and does not require any semantic descriptors or parameters to be provided as input, other than the result of the real-time simulation. We evaluate the efficacy of our pipeline on a variety of expressive performances and provide comparisons and ablation experiments for plausible variations and alternatives to our proposed scheme. △ Less

Submitted 9 August, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

arXiv:2210.08930 [pdf, other]

A modular quantum-classical framework for simulating chemical reaction pathways accurately

Authors: Nirmal M R, Shampa Sarkar, Manoj Nambiar, Sriram Goverapet Srinivasan

Abstract: A lot of progress has been made in recent times for simulating accurately the ground state energy of small molecules and their potential energy surface, using quantum-classical hybrid computing architecture. While these single point energy calculations are a significant milestone for quantum chemistry simulation on quantum hardware, a similarly important application is to trace accurately the reac… ▽ More A lot of progress has been made in recent times for simulating accurately the ground state energy of small molecules and their potential energy surface, using quantum-classical hybrid computing architecture. While these single point energy calculations are a significant milestone for quantum chemistry simulation on quantum hardware, a similarly important application is to trace accurately the reaction pathway of various chemical transformations. Such computations require accurate determination of the equilibrium or lowest energy molecular geometry, either by computing energy gradients with respect to the molecule's nuclear coordinates or perturbative distortion of the molecular configuration. In this work, we present a modular quantum-classical hybrid framework, to accurately simulate chemical reaction pathway of various kinds of molecular reactions. We demonstrate our framework by accurately tracing the isomerization pathway for small organic molecules. This framework can now be readily applied to study other 'active' molecules from the pharma and chemical industries. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: 9 pages, 8 figures

arXiv:2209.10736 [pdf, other]

doi 10.1145/3550454.3555429

Fluidic Topology Optimization with an Anisotropic Mixture Model

Authors: Yifei Li, Tao Du, Sangeetha Grama Srinivasan, Kui Wu, Bo Zhu, Eftychios Sifakis, Wojciech Matusik

Abstract: Fluidic devices are crucial components in many industrial applications involving fluid mechanics. Computational design of a high-performance fluidic system faces multifaceted challenges regarding its geometric representation and physical accuracy. We present a novel topology optimization method to design fluidic devices in a Stokes flow context. Our approach is featured by its capability in accomm… ▽ More Fluidic devices are crucial components in many industrial applications involving fluid mechanics. Computational design of a high-performance fluidic system faces multifaceted challenges regarding its geometric representation and physical accuracy. We present a novel topology optimization method to design fluidic devices in a Stokes flow context. Our approach is featured by its capability in accommodating a broad spectrum of boundary conditions at the solid-fluid interface. Our key contribution is an anisotropic and differentiable constitutive model that unifies the representation of different phases and boundary conditions in a Stokes model, enabling a topology optimization method that can synthesize novel structures with accurate boundary conditions from a background grid discretization. We demonstrate the efficacy of our approach by conducting several fluidic system design tasks with over four million design parameters. △ Less

Submitted 24 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: Accepted by SIGGRAPH Asia 2022. For low resolution paper see https://people.csail.mit.edu/liyifei/publication/anisotropic-stokes-fluidic-device/

Journal ref: ACM Transactions on Graphics (TOG), 2022

arXiv:2202.04137 [pdf, other]

Machine Learning in Heterogeneous Porous Materials

Authors: Marta D'Elia, Hang Deng, Cedric Fraces, Krishna Garikipati, Lori Graham-Brady, Amanda Howard, George Karniadakis, Vahid Keshavarzzadeh, Robert M. Kirby, Nathan Kutz, Chunhui Li, Xing Liu, Hannah Lu, Pania Newell, Daniel O'Malley, Masa Prodanovic, Gowri Srinivasan, Alexandre Tartakovsky, Daniel M. Tartakovsky, Hamdi Tchelepi, Bozo Vazic, Hari Viswanathan, Hongkyu Yoon, Piotr Zarzycki

Abstract: The "Workshop on Machine learning in heterogeneous porous materials" brought together international scientific communities of applied mathematics, porous media, and material sciences with experts in the areas of heterogeneous materials, machine learning (ML) and applied mathematics to identify how ML can advance materials research. Within the scope of ML and materials research, the goal of the wor… ▽ More The "Workshop on Machine learning in heterogeneous porous materials" brought together international scientific communities of applied mathematics, porous media, and material sciences with experts in the areas of heterogeneous materials, machine learning (ML) and applied mathematics to identify how ML can advance materials research. Within the scope of ML and materials research, the goal of the workshop was to discuss the state-of-the-art in each community, promote crosstalk and accelerate multi-disciplinary collaborative research, and identify challenges and opportunities. As the end result, four topic areas were identified: ML in predicting materials properties, and discovery and design of novel materials, ML in porous and fractured media and time-dependent phenomena, Multi-scale modeling in heterogeneous porous materials via ML, and Discovery of materials constitutive laws and new governing equations. This workshop was part of the AmeriMech Symposium series sponsored by the National Academies of Sciences, Engineering and Medicine and the U.S. National Committee on Theoretical and Applied Mechanics. △ Less

Submitted 4 February, 2022; originally announced February 2022.

Comments: The workshop link is: https://amerimech.mech.utah.edu

arXiv:2111.03971 [pdf, ps, other]

Towards noise robust trigger-word detection with contrastive learning pre-task for fast on-boarding of new trigger-words

Authors: Sivakumar Balasubramanian, Aditya Jajodia, Gowtham Srinivasan

Abstract: Trigger-word detection plays an important role as the entry point of user's communication with voice assistants. But supporting a particular word as a trigger-word involves huge amount of data collection, augmentation and labelling for that word. This makes supporting new trigger-words a tedious and time consuming process. To combat this, we explore the use of contrastive learning as a pre-trainin… ▽ More Trigger-word detection plays an important role as the entry point of user's communication with voice assistants. But supporting a particular word as a trigger-word involves huge amount of data collection, augmentation and labelling for that word. This makes supporting new trigger-words a tedious and time consuming process. To combat this, we explore the use of contrastive learning as a pre-training task that helps the detection model to generalize to different words and noise conditions. We explore supervised contrastive techniques and also propose a novel self-supervised training technique using chunked words from long sentence audios. We show that both supervised and the new self-supervised contrastive pre-training techniques have comparable results to a traditional classification pre-training on new trigger words with less data availability. △ Less

Submitted 27 July, 2022; v1 submitted 6 November, 2021; originally announced November 2021.

Comments: submitted to ICMLA

arXiv:2109.06440 [pdf, other]

Complexity-aware Adaptive Training and Inference for Edge-Cloud Distributed AI Systems

Authors: Yinghan Long, Indranil Chakraborty, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: The ubiquitous use of IoT and machine learning applications is creating large amounts of data that require accurate and real-time processing. Although edge-based smart data processing can be enabled by deploying pretrained models, the energy and memory constraints of edge devices necessitate distributed deep learning between the edge and the cloud for complex data. In this paper, we propose a dist… ▽ More The ubiquitous use of IoT and machine learning applications is creating large amounts of data that require accurate and real-time processing. Although edge-based smart data processing can be enabled by deploying pretrained models, the energy and memory constraints of edge devices necessitate distributed deep learning between the edge and the cloud for complex data. In this paper, we propose a distributed AI system to exploit both the edge and the cloud for training and inference. We propose a new architecture, MEANet, with a main block, an extension block, and an adaptive block for the edge. The inference process can terminate at either the main block, the extension block, or the cloud. The MEANet is trained to categorize inputs into easy/hard/complex classes. The main block identifies instances of easy/hard classes and classifies easy classes with high confidence. Only data with high probabilities of belonging to hard classes would be sent to the extension block for prediction. Further, only if the neural network at the edge shows low confidence in the prediction, the instance is considered complex and sent to the cloud for further processing. The training technique lends to the majority of inference on edge devices while going to the cloud only for a small set of complex jobs, as determined by the edge. The performance of the proposed system is evaluated via extensive experiments using modified models of ResNets and MobileNetV2 on CIFAR-100 and ImageNet datasets. The results show that the proposed distributed model has improved accuracy and energy consumption, indicating its capacity to adapt. △ Less

Submitted 14 September, 2021; originally announced September 2021.

Comments: 41st IEEE International Conference on Distributed Computing Systems, 2021

arXiv:2011.10227 [pdf, other]

StressNet: Deep Learning to Predict Stress With Fracture Propagation in Brittle Materials

Authors: Yinan Wang, Diane Oyen, Weihong, Guo, Anishi Mehta, Cory Braker Scott, Nishant Panda, M. Giselle Fernández-Godino, Gowri Srinivasan, Xiaowei Yue

Abstract: Catastrophic failure in brittle materials is often due to the rapid growth and coalescence of cracks aided by high internal stresses. Hence, accurate prediction of maximum internal stress is critical to predicting time to failure and improving the fracture resistance and reliability of materials. Existing high-fidelity methods, such as the Finite-Discrete Element Model (FDEM), are limited by their… ▽ More Catastrophic failure in brittle materials is often due to the rapid growth and coalescence of cracks aided by high internal stresses. Hence, accurate prediction of maximum internal stress is critical to predicting time to failure and improving the fracture resistance and reliability of materials. Existing high-fidelity methods, such as the Finite-Discrete Element Model (FDEM), are limited by their high computational cost. Therefore, to reduce computational cost while preserving accuracy, a novel deep learning model, "StressNet," is proposed to predict the entire sequence of maximum internal stress based on fracture propagation and the initial stress data. More specifically, the Temporal Independent Convolutional Neural Network (TI-CNN) is designed to capture the spatial features of fractures like fracture path and spall regions, and the Bidirectional Long Short-term Memory (Bi-LSTM) Network is adapted to capture the temporal features. By fusing these features, the evolution in time of the maximum internal stress can be accurately predicted. Moreover, an adaptive loss function is designed by dynamically integrating the Mean Squared Error (MSE) and the Mean Absolute Percentage Error (MAPE), to reflect the fluctuations in maximum internal stress. After training, the proposed model is able to compute accurate multi-step predictions of maximum internal stress in approximately 20 seconds, as compared to the FDEM run time of 4 hours, with an average MAPE of 2% relative to test data. △ Less

Submitted 20 November, 2020; originally announced November 2020.

Comments: 13 pages

ACM Class: J.2

arXiv:2010.15208 [pdf, other]

Identifying Entangled Physics Relationships through Sparse Matrix Decomposition to Inform Plasma Fusion Design

Authors: M. Giselle Fernández-Godino, Michael J. Grosskopf, Julia B. Nakhleh, Brandon M. Wilson, John Kline, Gowri Srinivasan

Abstract: A sustainable burn platform through inertial confinement fusion (ICF) has been an ongoing challenge for over 50 years. Mitigating engineering limitations and improving the current design involves an understanding of the complex coupling of physical processes. While sophisticated simulations codes are used to model ICF implosions, these tools contain necessary numerical approximation but miss physi… ▽ More A sustainable burn platform through inertial confinement fusion (ICF) has been an ongoing challenge for over 50 years. Mitigating engineering limitations and improving the current design involves an understanding of the complex coupling of physical processes. While sophisticated simulations codes are used to model ICF implosions, these tools contain necessary numerical approximation but miss physical processes that limit predictive capability. Identification of relationships between controllable design inputs to ICF experiments and measurable outcomes (e.g. yield, shape) from performed experiments can help guide the future design of experiments and development of simulation codes, to potentially improve the accuracy of the computational models used to simulate ICF experiments. We use sparse matrix decomposition methods to identify clusters of a few related design variables. Sparse principal component analysis (SPCA) identifies grou**s that are related to the physical origin of the variables (laser, hohlraum, and capsule). A variable importance analysis finds that in addition to variables highly correlated with neutron yield such as picket power and laser energy, variables that represent a dramatic change of the ICF design such as number of pulse steps are also very important. The obtained sparse components are then used to train a random forest (RF) surrogate for predicting total yield. The RF performance on the training and testing data compares with the performance of the RF surrogate trained using all design variables considered. This work is intended to inform design changes in future ICF experiments by augmenting the expert intuition and simulations results. △ Less

Submitted 28 October, 2020; originally announced October 2020.

Comments: 8 pages, 7 figures

Report number: LA-UR-20-28715

arXiv:2010.04254 [pdf, other]

doi 10.1109/TPS.2021.3090299

Exploring Sensitivity of ICF Outputs to Design Parameters in Experiments Using Machine Learning

Authors: Julia B. Nakhleh, M. Giselle Fernández-Godino, Michael J. Grosskopf, Brandon M. Wilson, John Kline, Gowri Srinivasan

Abstract: Building a sustainable burn platform in inertial confinement fusion (ICF) requires an understanding of the complex coupling of physical processes and the effects that key experimental design changes have on implosion performance. While simulation codes are used to model ICF implosions, incomplete physics and the need for approximations deteriorate their predictive capability. Identification of rel… ▽ More Building a sustainable burn platform in inertial confinement fusion (ICF) requires an understanding of the complex coupling of physical processes and the effects that key experimental design changes have on implosion performance. While simulation codes are used to model ICF implosions, incomplete physics and the need for approximations deteriorate their predictive capability. Identification of relationships between controllable design inputs and measurable outcomes can help guide the future design of experiments and development of simulation codes, which can potentially improve the accuracy of the computational models used to simulate ICF implosions. In this paper, we leverage developments in machine learning (ML) and methods for ML feature importance/sensitivity analysis to identify complex relationships in ways that are difficult to process using expert judgment alone. We present work using random forest (RF) regression for prediction of yield, velocity, and other experimental outcomes given a suite of design parameters, along with an assessment of important relationships and uncertainties in the prediction model. We show that RF models are capable of learning and predicting on ICF experimental data with high accuracy, and we extract feature importance metrics that provide insight into the physical significance of different controllable design inputs for various ICF design configurations. These results can be used to augment expert intuition and simulation results for optimal design of future ICF experiments. △ Less

Submitted 1 September, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

Comments: 10 pages, 9 figures. Published in IEEE Transactions on Plasma Science, July 2021 (see Journal Reference info)

Report number: LA-UR-20-27991

Journal ref: IEEE Transactions on Plasma Science, vol. 49, no. 7, pp. 2238-2246, July 2021

arXiv:2005.08237 [pdf, ps, other]

doi 10.1007/s12045-021-1136-x

Reflections on Euler's reflection formula and an additive analogue of Legendre's duplication formula

Authors: Ritesh Goenka, Gopala Krishna Srinivasan

Abstract: In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 184… ▽ More In this note, we look at some of the less explored aspects of the gamma function. We provide a new proof of Euler's reflection formula and discuss its significance in the theory of special functions. We also discuss a result of Landau concerning the determination of values of the gamma function using functional identities. We show that his result is sharp and extend it to complex arguments. In 1848, Oskar Schlömilch gave an interesting additive analogue of the duplication formula. We prove a generalized version of this formula using the theory of hypergeometric functions. △ Less

Submitted 17 May, 2020; originally announced May 2020.

MSC Class: 33B15 (Primary) 44A05; 33C05 (Secondary)

Journal ref: Resonance 26 (2021) 367-386

arXiv:2005.01807 [pdf, other]

Enabling Deep Spiking Neural Networks with Hybrid Conversion and Spike Timing Dependent Backpropagation

Authors: Nitin Rathi, Gopalakrishnan Srinivasan, Priyadarshini Panda, Kaushik Roy

Abstract: Spiking Neural Networks (SNNs) operate with asynchronous discrete events (or spikes) which can potentially lead to higher energy-efficiency in neuromorphic hardware implementations. Many works have shown that an SNN for inference can be formed by copying the weights from a trained Artificial Neural Network (ANN) and setting the firing threshold for each layer as the maximum input received in that… ▽ More Spiking Neural Networks (SNNs) operate with asynchronous discrete events (or spikes) which can potentially lead to higher energy-efficiency in neuromorphic hardware implementations. Many works have shown that an SNN for inference can be formed by copying the weights from a trained Artificial Neural Network (ANN) and setting the firing threshold for each layer as the maximum input received in that layer. These type of converted SNNs require a large number of time steps to achieve competitive accuracy which diminishes the energy savings. The number of time steps can be reduced by training SNNs with spike-based backpropagation from scratch, but that is computationally expensive and slow. To address these challenges, we present a computationally-efficient training technique for deep SNNs. We propose a hybrid training methodology: 1) take a converted SNN and use its weights and thresholds as an initialization step for spike-based backpropagation, and 2) perform incremental spike-timing dependent backpropagation (STDB) on this carefully initialized network to obtain an SNN that converges within few epochs and requires fewer time steps for input processing. STDB is performed with a novel surrogate gradient function defined using neuron's spike time. The proposed training methodology converges in less than 20 epochs of spike-based backpropagation for most standard image classification datasets, thereby greatly reducing the training complexity compared to training SNNs from scratch. We perform experiments on CIFAR-10, CIFAR-100, and ImageNet datasets for both VGG and ResNet architectures. We achieve top-1 accuracy of 65.19% for ImageNet dataset on SNN with 250 time steps, which is 10X faster compared to converted SNNs with similar accuracy. △ Less

Submitted 4 May, 2020; originally announced May 2020.

Comments: International Conference on Learning Representations (ICLR), 2020 https://openreview.net/forum?id=B1xSperKvH&noteId=B1xSperKvH

arXiv:2003.02800 [pdf, other]

Pruning Filters while Training for Efficiently Optimizing Deep Learning Networks

Authors: Sourjya Roy, Priyadarshini Panda, Gopalakrishnan Srinivasan, Anand Raghunathan

Abstract: Modern deep networks have millions to billions of parameters, which leads to high memory and energy requirements during training as well as during inference on resource-constrained edge devices. Consequently, pruning techniques have been proposed that remove less significant weights in deep networks, thereby reducing their memory and computational requirements. Pruning is usually performed after t… ▽ More Modern deep networks have millions to billions of parameters, which leads to high memory and energy requirements during training as well as during inference on resource-constrained edge devices. Consequently, pruning techniques have been proposed that remove less significant weights in deep networks, thereby reducing their memory and computational requirements. Pruning is usually performed after training the original network, and is followed by further retraining to compensate for the accuracy loss incurred during pruning. The prune-and-retrain procedure is repeated iteratively until an optimum tradeoff between accuracy and efficiency is reached. However, such iterative retraining adds to the overall training complexity of the network. In this work, we propose a dynamic pruning-while-training procedure, wherein we prune filters of the convolutional layers of a deep network during training itself, thereby precluding the need for separate retraining. We evaluate our dynamic pruning-while-training approach with three different pre-existing pruning strategies, viz. mean activation-based pruning, random pruning, and L1 normalization-based pruning. Our results for VGG-16 trained on CIFAR10 shows that L1 normalization provides the best performance among all the techniques explored in this work with less than 1% drop in accuracy after pruning 80% of the filters compared to the original network. We further evaluated the L1 normalization based pruning mechanism on CIFAR100. Results indicate that pruning while training yields a compressed network with almost no accuracy loss after pruning 50% of the filters compared to the original network and ~5% loss for high pruning rates (>80%). The proposed pruning methodology yields 41% reduction in the number of computations and memory accesses during training for CIFAR10, CIFAR100 and ImageNet compared to training with retraining for 10 epochs . △ Less

Submitted 5 March, 2020; originally announced March 2020.

arXiv:2003.01811 [pdf, other]

RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network

Authors: Bing Han, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: Spiking Neural Networks (SNNs) have recently attracted significant research interest as the third generation of artificial neural networks that can enable low-power event-driven data analytics. The best performing SNNs for image recognition tasks are obtained by converting a trained Analog Neural Network (ANN), consisting of Rectified Linear Units (ReLU), to SNN composed of integrate-and-fire neur… ▽ More Spiking Neural Networks (SNNs) have recently attracted significant research interest as the third generation of artificial neural networks that can enable low-power event-driven data analytics. The best performing SNNs for image recognition tasks are obtained by converting a trained Analog Neural Network (ANN), consisting of Rectified Linear Units (ReLU), to SNN composed of integrate-and-fire neurons with "proper" firing thresholds. The converted SNNs typically incur loss in accuracy compared to that provided by the original ANN and require sizable number of inference time-steps to achieve the best accuracy. We find that performance degradation in the converted SNN stems from using "hard reset" spiking neuron that is driven to fixed reset potential once its membrane potential exceeds the firing threshold, leading to information loss during SNN inference. We propose ANN-SNN conversion using "soft reset" spiking neuron model, referred to as Residual Membrane Potential (RMP) spiking neuron, which retains the "residual" membrane potential above threshold at the firing instants. We demonstrate near loss-less ANN-SNN conversion using RMP neurons for VGG-16, ResNet-20, and ResNet-34 SNNs on challenging datasets including CIFAR-10 (93.63% top-1), CIFAR-100 (70.93% top-1), and ImageNet (73.09% top-1 accuracy). Our results also show that RMP-SNN surpasses the best inference accuracy provided by the converted SNN with "hard reset" spiking neurons using 2-8 times fewer inference time-steps across network architectures and datasets. △ Less

Submitted 1 April, 2020; v1 submitted 25 February, 2020; originally announced March 2020.

Comments: to be published in CVPR'20

arXiv:2003.01250 [pdf, ps, other]

Explicitly Trained Spiking Sparsity in Spiking Neural Networks with Backpropagation

Authors: Jason M. Allred, Steven J. Spencer, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: Spiking Neural Networks (SNNs) are being explored for their potential energy efficiency resulting from sparse, event-driven computations. Many recent works have demonstrated effective backpropagation for deep Spiking Neural Networks (SNNs) by approximating gradients over discontinuous neuron spikes or firing events. A beneficial side-effect of these surrogate gradient spiking backpropagation algor… ▽ More Spiking Neural Networks (SNNs) are being explored for their potential energy efficiency resulting from sparse, event-driven computations. Many recent works have demonstrated effective backpropagation for deep Spiking Neural Networks (SNNs) by approximating gradients over discontinuous neuron spikes or firing events. A beneficial side-effect of these surrogate gradient spiking backpropagation algorithms is that the spikes, which trigger additional computations, may now themselves be directly considered in the gradient calculations. We propose an explicit inclusion of spike counts in the loss function, along with a traditional error loss, causing the backpropagation learning algorithms to optimize weight parameters for both accuracy and spiking sparsity. As supported by existing theory of over-parameterized neural networks, there are many solution states with effectively equivalent accuracy. As such, appropriate weighting of the two loss goals during training in this multi-objective optimization process can yield an improvement in spiking sparsity without a significant loss of accuracy. We additionally explore a simulated annealing-inspired loss weighting technique to increase the weighting for sparsity as training time increases. Our preliminary results on the Cifar-10 dataset show up to 70.1% reduction in spiking activity with iso-accuracy compared to an equivalent SNN trained only for accuracy and up to 73.3% reduction in spiking activity if allowed a trade-off of 1% reduction in classification accuracy. △ Less

Submitted 2 March, 2020; originally announced March 2020.

arXiv:2002.11163 [pdf, other]

sBSNN: Stochastic-Bits Enabled Binary Spiking Neural Network with On-Chip Learning for Energy Efficient Neuromorphic Computing at the Edge

Authors: Minsuk Koo, Gopalakrishnan Srinivasan, Yong Shim, Kaushik Roy

Abstract: In this work, we propose stochastic Binary Spiking Neural Network (sBSNN) composed of stochastic spiking neurons and binary synapses (stochastic only during training) that computes probabilistically with one-bit precision for power-efficient and memory-compressed neuromorphic computing. We present an energy-efficient implementation of the proposed sBSNN using 'stochastic bit' as the core computati… ▽ More In this work, we propose stochastic Binary Spiking Neural Network (sBSNN) composed of stochastic spiking neurons and binary synapses (stochastic only during training) that computes probabilistically with one-bit precision for power-efficient and memory-compressed neuromorphic computing. We present an energy-efficient implementation of the proposed sBSNN using 'stochastic bit' as the core computational primitive to realize the stochastic neurons and synapses, which are fabricated in 90nm CMOS process, to achieve efficient on-chip training and inference for image recognition tasks. The measured data shows that the 'stochastic bit' can be programmed to mimic spiking neurons, and stochastic Spike Timing Dependent Plasticity (or sSTDP) rule for training the binary synaptic weights without expensive random number generators. Our results indicate that the proposed sBSNN realization offers possibility of up to 32x neuronal and synaptic memory compression compared to full precision (32-bit) SNN and energy efficiency of 89.49 TOPS/Watt for two-layer fully-connected SNN. △ Less

Submitted 25 February, 2020; originally announced February 2020.

arXiv:2001.11328 [pdf, other]

doi 10.1016/j.commatsci.2020.109959

Accelerating High-Strain Continuum-Scale Brittle Fracture Simulations with Machine Learning

Authors: M. Giselle Fernández-Godino, Nishant Panda, Daniel O'Malley, Kevin Larkin, Abigail Hunter, Raphael T. Haftka, Gowri Srinivasan

Abstract: Failure in brittle materials under dynamic loading conditions is a result of the propagation and coalescence of microcracks. Simulating this mechanism at the continuum level is computationally expensive or, in some cases, intractable. The computational cost is due to the need for highly resolved computational meshes required to capture complex crack growth behavior, such as branching, turning, etc… ▽ More Failure in brittle materials under dynamic loading conditions is a result of the propagation and coalescence of microcracks. Simulating this mechanism at the continuum level is computationally expensive or, in some cases, intractable. The computational cost is due to the need for highly resolved computational meshes required to capture complex crack growth behavior, such as branching, turning, etc. Typically, continuum-scale models that account for brittle damage evolution homogenize the crack network in some way, which reduces the overall computational cost, but can also neglect key physics of the subgrid crack growth behavior, sacrificing accuracy for efficiency. We have developed an approach using machine learning that overcomes the current inability to represent micro-scale physics at the macro-scale. Our approach leverages damage and stress data from a high-fidelity model that explicitly resolves microcrack behavior to build an inexpensive machine learning emulator, which runs in seconds as opposed to the high-fidelity model, which takes hours. Once trained, the machine learning emulator is used to predict the evolution of crack length statistics. A continuum-scale constitutive model is then informed with these crack statistics, speeding up the workflow by four orders of magnitude. Both the machine learning model and the continuum-scale model are validated against a high-fidelity model and experimental data, respectively, showing excellent agreement. There are two key findings. The first is that we can reduce the dimensionality of the problem, establishing that the machine learning emulator only needs the length of the longest crack and one of the maximum stress components to capture the necessary physics. Another compelling finding is that the emulator can be trained in one experimental setting and transferred successfully to predict behavior in a different setting. △ Less

Submitted 12 May, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

Comments: Keywords: Computational Material Science, Machine Learning. 27 pages,13 figures, in review at COMMAT Elsevier journal

Report number: LA-UR-20-20148

Journal ref: Computational Materials Science, Elsevier, Volume 186, January 2021, p. 109959

arXiv:1912.13407 [pdf, other]

Probing magnon-magnon coupling in exchange coupled Y$_3$Fe$_5$O$_{12}$/Permalloy bilayers with magneto-optical effects

Authors: Yuzan Xiong, Yi Li, Mouhamad Hammami, Rao Bidthanapally, Joseph Sklenar, Xufeng Zhang, Hongwei Qu, Gopalan Srinivasan, John Pearson, Axel Hoffmann, Valentine Novosad, Wei Zhang

Abstract: We demonstrate the magnetically-induced transparency (MIT) effect in Y$_3$Fe$_5$O$_{12}$(YIG)/Permalloy(Py) coupled bilayers. The measurement is achieved via a heterodyne detection of the coupled magnetization dynamics using a single wavelength that probes the magneto-optical Kerr and Faraday effects of Py and YIG, respectively. Clear features of the MIT effect are evident from the deeply modulate… ▽ More We demonstrate the magnetically-induced transparency (MIT) effect in Y$_3$Fe$_5$O$_{12}$(YIG)/Permalloy(Py) coupled bilayers. The measurement is achieved via a heterodyne detection of the coupled magnetization dynamics using a single wavelength that probes the magneto-optical Kerr and Faraday effects of Py and YIG, respectively. Clear features of the MIT effect are evident from the deeply modulated ferromagnetic resonance of Py due to the perpendicular-standing-spin-wave of YIG. We develop a phenomenological model that nicely reproduces the experimental results including the induced amplitude and phase evolution caused by the magnon-magnon coupling. Our work offers a new route towards studying phase-resolved spin dynamics and hybrid magnonic systems. △ Less

Submitted 10 July, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

Comments: 16 pages, 3 figures

arXiv:1908.02073 [pdf]

Tunable magnetization in nanoscale LuFeO3: Role of morphology, ortho-hexa phase ratio and local structure

Authors: Smita Chaturvedi, Priyank Shyam, Mandar M. Shirolkar, Swathi Krishna, Bhavesh Sinha, Wolfgang Caliebe, Aleksandr Kalinko, Gopalan Srinivasan, Satishchandra Ogale

Abstract: We have observed enhancement and shift in the spin reorientation transition temperature as a consequence of coexistence of orthorhombic and hexagonal phases and higher aspect ratio in nanoscale LuFeO3. Nanoparticles and nanofibers of LuFeO3 are considered for this work. Nanoparticles have 75 % orthorhombic phase and 25 % hexagonal phase, while nanofibers have 23% orthorhombic phase and 77%-hexagon… ▽ More We have observed enhancement and shift in the spin reorientation transition temperature as a consequence of coexistence of orthorhombic and hexagonal phases and higher aspect ratio in nanoscale LuFeO3. Nanoparticles and nanofibers of LuFeO3 are considered for this work. Nanoparticles have 75 % orthorhombic phase and 25 % hexagonal phase, while nanofibers have 23% orthorhombic phase and 77%-hexagonal phase. Larger aspect ratio in case of nanofibers is seen to help strain-stabilize the hexagonal phase in the material. Magnetic measurements show significant difference in the magnetic behavior and spin reorientation temperature; 183K for the nanoparticle case and 150K for the case of nanofibers. Moreover, the ferromagnetic moment is two order of magnitude higher for nanofibers than that of nanoparticles, In hexagonal phase, frustration of triangular lattice, works against the long range ordering while magnetic anisotropy works in favor of the long range ordering, which contributes towards the enhanced and anomalous magnetic behavior in case of fibers. X -ray absorption near edge spectroscopy (XANES) at the Fe K-edge has been used to probe the symmetry driven dynamics of Fe 3d- 4p orbitals. It established that due to noncentrocymmetry of the Fe atom, the nanofibers have decreased 3d-4p orbital mixing and reduced crystal field splitting energy, which are also contributing factor for the enhanced magnetic behaviour. △ Less

Submitted 6 August, 2019; originally announced August 2019.

arXiv:1906.01695 [pdf, other]

Reinforcement Learning with Low-Complexity Liquid State Machines

Authors: Wachirawit Ponghiran, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: We propose reinforcement learning on simple networks consisting of random connections of spiking neurons (both recurrent and feed-forward) that can learn complex tasks with very little trainable parameters. Such sparse and randomly interconnected recurrent spiking networks exhibit highly non-linear dynamics that transform the inputs into rich high-dimensional representations based on past context.… ▽ More We propose reinforcement learning on simple networks consisting of random connections of spiking neurons (both recurrent and feed-forward) that can learn complex tasks with very little trainable parameters. Such sparse and randomly interconnected recurrent spiking networks exhibit highly non-linear dynamics that transform the inputs into rich high-dimensional representations based on past context. The random input representations can be efficiently interpreted by an output (or readout) layer with trainable parameters. Systematic initialization of the random connections and training of the readout layer using Q-learning algorithm enable such small random spiking networks to learn optimally and achieve the same learning efficiency as humans on complex reinforcement learning tasks like Atari games. The spike-based approach using small random recurrent networks provides a computationally efficient alternative to state-of-the-art deep reinforcement learning networks with several layers of trainable parameters. The low-complexity spiking networks can lead to improved energy efficiency in event-driven neuromorphic hardware for complex reinforcement learning tasks. △ Less

Submitted 4 June, 2019; originally announced June 2019.

Comments: 6 figures

arXiv:1904.07689 [pdf, ps, other]

Free groups, covering spaces and Artin's theorem

Authors: Gopala Krishna Srinivasan

Abstract: In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value. In this expository note we provide a proof of Artin's theorem which states that the commutator subgroup of a free group on two generators is not finitely generated. The proof employs the infinite grid as in two other proofs in the literature mentioned in the note but takes a somewhat different approach which seems to be of didactic value. △ Less

Submitted 14 May, 2019; v1 submitted 13 April, 2019; originally announced April 2019.

arXiv:1903.06379 [pdf, other]

doi 10.3389/fnins.2020.00119

Enabling Spike-based Backpropagation for Training Deep Neural Network Architectures

Authors: Chankyu Lee, Syed Shakib Sarwar, Priyadarshini Panda, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: Spiking Neural Networks (SNNs) have recently emerged as a prominent neural computing paradigm. However, the typical shallow SNN architectures have limited capacity for expressing complex representations while training deep SNNs using input spikes has not been successful so far. Diverse methods have been proposed to get around this issue such as converting off-the-shelf trained deep Artificial Neur… ▽ More Spiking Neural Networks (SNNs) have recently emerged as a prominent neural computing paradigm. However, the typical shallow SNN architectures have limited capacity for expressing complex representations while training deep SNNs using input spikes has not been successful so far. Diverse methods have been proposed to get around this issue such as converting off-the-shelf trained deep Artificial Neural Networks (ANNs) to SNNs. However, the ANN-SNN conversion scheme fails to capture the temporal dynamics of a spiking system. On the other hand, it is still a difficult problem to directly train deep SNNs using input spike events due to the discontinuous, non-differentiable nature of the spike generation function. To overcome this problem, we propose an approximate derivative method that accounts for the leaky behavior of LIF neurons. This method enables training deep convolutional SNNs directly (with input spike events) using spike-based backpropagation. Our experiments show the effectiveness of the proposed spike-based learning on deep networks (VGG and Residual architectures) by achieving the best classification accuracies in MNIST, SVHN and CIFAR-10 datasets compared to other SNNs trained with a spike-based learning. Moreover, we analyze sparse event-based computations to demonstrate the efficacy of the proposed SNN training method for inference operation in the spiking domain. △ Less

Submitted 24 March, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

Comments: Chankyu Lee and Syed Shakib Sarwar contributed equally to the work

Journal ref: Frontiers in Neuroscience, 14 (2020)

arXiv:1902.08029 [pdf, other]

Multilevel Graph Partitioning for Three-Dimensional Discrete Fracture Network Flow Simulations

Authors: Hayato Ushijima-Mwesigwa, Jeffrey D. Hyman, Aric Hagberg, Ilya Safro, Satish Karra, Carl W. Gable, Matthew R. Sweeney, Gowri Srinivasan

Abstract: We present a topology-based method for mesh-partitioning in three-dimensional discrete fracture network (DFN) simulations that take advantage of the intrinsic multi-level nature of a DFN. DFN models are used to simulate flow and transport through low-permeability fractured media in the subsurface by explicitly representing fractures as discrete entities. The governing equations for flow and transp… ▽ More We present a topology-based method for mesh-partitioning in three-dimensional discrete fracture network (DFN) simulations that take advantage of the intrinsic multi-level nature of a DFN. DFN models are used to simulate flow and transport through low-permeability fractured media in the subsurface by explicitly representing fractures as discrete entities. The governing equations for flow and transport are numerically integrated on computational meshes generated on the interconnected fracture networks. Modern high-fidelity DFN simulations require high-performance computing on multiple processors where performance and scalability depend partially on obtaining a high-quality partition of the mesh to balance work-loads and minimize communication across all processors. The discrete structure of a DFN naturally lends itself to various graph representations. We develop two applications of the multilevel graph partitioning algorithm to partition the mesh of a DFN. In the first, we project a partition of the graph based on the DFN topology onto the mesh of the DFN and in the second, this projection is used as the initial condition for further partitioning refinement of the mesh. We compare the performance of these methods with standard multi-level graph partitioning using graph-based metrics (cut, imbalance, partitioning time), computational-based metrics (FLOPS, iterations, solver time), and total run time. The DFN-based and the mesh-based partitioning methods are comparable in terms of the graph-based metrics, but the time required to obtain the partition is several orders of magnitude faster using the DFN-based partitions. In combination, these partitions are several orders of magnitude faster than the mesh-based partition. In turn, this hybrid method outperformed both of the other methods in terms of the total run time. △ Less

Submitted 1 April, 2021; v1 submitted 18 February, 2019; originally announced February 2019.

arXiv:1902.04161 [pdf, other]

ReStoCNet: Residual Stochastic Binary Convolutional Spiking Neural Network for Memory-Efficient Neuromorphic Computing

Authors: Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: In this work, we propose ReStoCNet, a residual stochastic multilayer convolutional Spiking Neural Network (SNN) composed of binary kernels, to reduce the synaptic memory footprint and enhance the computational efficiency of SNNs for complex pattern recognition tasks. ReStoCNet consists of an input layer followed by stacked convolutional layers for hierarchical input feature extraction, pooling lay… ▽ More In this work, we propose ReStoCNet, a residual stochastic multilayer convolutional Spiking Neural Network (SNN) composed of binary kernels, to reduce the synaptic memory footprint and enhance the computational efficiency of SNNs for complex pattern recognition tasks. ReStoCNet consists of an input layer followed by stacked convolutional layers for hierarchical input feature extraction, pooling layers for dimensionality reduction, and fully-connected layer for inference. In addition, we introduce residual connections between the stacked convolutional layers to improve the hierarchical feature learning capability of deep SNNs. We propose Spike Timing Dependent Plasticity (STDP) based probabilistic learning algorithm, referred to as Hybrid-STDP (HB-STDP), incorporating Hebbian and anti-Hebbian learning mechanisms, to train the binary kernels forming ReStoCNet in a layer-wise unsupervised manner. We demonstrate the efficacy of ReStoCNet and the presented HB-STDP based unsupervised training methodology on the MNIST and CIFAR-10 datasets. We show that residual connections enable the deeper convolutional layers to self-learn useful high-level input features and mitigate the accuracy loss observed in deep SNNs devoid of residual connections. The proposed ReStoCNet offers >20x kernel memory compression compared to full-precision (32-bit) SNN while yielding high enough classification accuracy on the chosen pattern recognition tasks. △ Less

Submitted 11 February, 2019; originally announced February 2019.

Comments: 27 pages, 11 figures, and 6 tables

arXiv:1901.01923 [pdf, other]

doi 10.1103/PhysRevApplied.11.034047

Simultaneous Optical and Electrical Spin-Torque Magnetometry with Stroboscopic Detection of Spin-Precession Phase

Authors: Yi Li, Hilal Saglam, Zhizhi Zhang, Rao Bidthanapally, Yuzan Xiong, John E. Pearson, Valentine Novosad, Hongwei Qu, Gopalan Srinivasan, Axel Hoffmann andand Wei Zhang

Abstract: Spin-based coherent information processing and encoding utilize the precession phase of spins in magnetic materials. However, the detection and manipulation of spin precession phases remain a major challenge for advanced spintronic functionalities. By using simultaneous electrical and optical detection, we demonstrate the direct measurement of the precession phase of Permalloy ferromagnetic resona… ▽ More Spin-based coherent information processing and encoding utilize the precession phase of spins in magnetic materials. However, the detection and manipulation of spin precession phases remain a major challenge for advanced spintronic functionalities. By using simultaneous electrical and optical detection, we demonstrate the direct measurement of the precession phase of Permalloy ferromagnetic resonance driven by the spin-orbit torques from adjacent heavy metals. The spin Hall angle of the heavy metals can be independently determined from concurrent electrical and optical signals. The stroboscopic optical detection also allows spatially measuring local spin-torque parameters and the induced ferromagnetic resonance with comprehensive amplitude and phase information. Our study offers a route towards future advanced characterizations of spin-torque oscillators, magnonic circuits, and tunnelling junctions, where measuring the current-induced spin dynamics of individual nanomagnets are required. △ Less

Submitted 7 January, 2019; originally announced January 2019.

Comments: 12 pages, 9 figures

Journal ref: Phys. Rev. Applied 11, 034047 (2019)

arXiv:1812.11023 [pdf, other]

doi 10.1073/pnas.1818529116

Branching of Hydraulic Cracks in Gas or Oil Shale with Closed Natural Fractures: How to Master Permeability

Authors: Saeed Rahimi-Agham, Viet-Tuan Chau, Huyn** Lee, Hoang Nguyen, Weixin Li, Satish Karra, Esteban Rougier, Hari Viswanathan, Gowri Srinivasan, Zdenek P. Bazant

Abstract: While the hydraulic fracturing technology, aka fracking (or fraccing, frac), has become highly developed and astonishingly successful, a consistent formulation of the associated fracture mechanics that would not conflict with some observations is still unavailable. It is attempted here. Classical fracture mechanics, as well as the current commercial softwares, predict vertical cracks to propagate… ▽ More While the hydraulic fracturing technology, aka fracking (or fraccing, frac), has become highly developed and astonishingly successful, a consistent formulation of the associated fracture mechanics that would not conflict with some observations is still unavailable. It is attempted here. Classical fracture mechanics, as well as the current commercial softwares, predict vertical cracks to propagate without branching from the perforations of the horizontal well casing, which are typically spaced at 10 m or more. However, to explain the gas production rate at the wellhead, the crack spacing would have to be only about 0.1 m, which would increase the overall gas permeability of shale mass about 10,000$\times$. This permeability increase has generally been attributed to a preexisting system of orthogonal natural cracks, whose spacing is about 0.1 m. But their average age is about 100 million years, and a recent analysis indicated that these cracks must have been completely closed by secondary creep of shale in less than a million years. Here it is considered that the tectonic events that produced the natural cracks in shale must have also created weak layers with nano- or micro-cracking damage. It is numerically demonstrated that a greatly enhanced permeability along the weak layers, with a greatly increased transverse Biot coefficient, must cause the fracking to engender lateral branching and the opening of hydraulic cracks along the weak layers, even if these cracks are initially almost closed. A finite element crack band model, based on recently developed anisotropic spherocylindrical microplane constitutive law, demonstrates these findings. △ Less

Submitted 5 December, 2018; originally announced December 2018.

Journal ref: Proceedings of the National Academy of Sciences Jan 2019, 116 (5) 1532-1537;

arXiv:1810.06118 [pdf, other]

doi 10.1016/j.commatsci.2019.02.046

Learning to fail: Predicting fracture evolution in brittle material models using recurrent graph convolutional neural networks

Authors: Max Schwarzer, Bryce Rogan, Yadong Ruan, Zhengming Song, Diana Y. Lee, Allon G. Percus, Viet T. Chau, Bryan A. Moore, Esteban Rougier, Hari S. Viswanathan, Gowri Srinivasan

Abstract: We propose a machine learning approach to address a key challenge in materials science: predicting how fractures propagate in brittle materials under stress, and how these materials ultimately fail. Our methods use deep learning and train on simulation data from high-fidelity models, emulating the results of these models while avoiding the overwhelming computational demands associated with running… ▽ More We propose a machine learning approach to address a key challenge in materials science: predicting how fractures propagate in brittle materials under stress, and how these materials ultimately fail. Our methods use deep learning and train on simulation data from high-fidelity models, emulating the results of these models while avoiding the overwhelming computational demands associated with running a statistically significant sample of simulations. We employ a graph convolutional network that recognizes features of the fracturing material and a recurrent neural network that models the evolution of these features, along with a novel form of data augmentation that compensates for the modest size of our training data. We simultaneously generate predictions for qualitatively distinct material properties. Results on fracture damage and length are within 3% of their simulated values, and results on time to material failure, which is notoriously difficult to predict even with high-fidelity models, are within approximately 15% of simulated values. Once trained, our neural networks generate predictions within seconds, rather than the hours needed to run a single simulation. △ Less

Submitted 15 March, 2019; v1 submitted 14 October, 2018; originally announced October 2018.

Report number: LA-UR-18-29693

Journal ref: Computational Materials Science 162, 322-332 (2019)

arXiv:1808.09627 [pdf, ps, other]

The Exterior Derivative - A direct approach

Authors: Gopala Krishna Srinivasan

Abstract: In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties. In this note we provide a direct approach to the most basic operator in this theory namely the exterior derivative. The crucial ingredient is a calculus lemma based on determinants. We maintain the view that in a first course at least this direct approach is preferable to the more abstract one based on characterization of the exterior derivative in terms of its properties. △ Less

Submitted 28 August, 2018; originally announced August 2018.

MSC Class: 58A10

arXiv:1807.11537 [pdf, other]

Estimating Failure in Brittle Materials using Graph Theory

Authors: M. K. Mudunuru, N. Panda, S. Karra, G. Srinivasan, V. T. Chau, E. Rougier, A. Hunter, H. S. Viswanathan

Abstract: In brittle fracture applications, failure paths, regions where the failure occurs and damage statistics, are some of the key quantities of interest (QoI). High-fidelity models for brittle failure that accurately predict these QoI exist but are highly computationally intensive, making them infeasible to incorporate in upscaling and uncertainty quantification frameworks. The goal of this paper is to… ▽ More In brittle fracture applications, failure paths, regions where the failure occurs and damage statistics, are some of the key quantities of interest (QoI). High-fidelity models for brittle failure that accurately predict these QoI exist but are highly computationally intensive, making them infeasible to incorporate in upscaling and uncertainty quantification frameworks. The goal of this paper is to provide a fast heuristic to reasonably estimate quantities such as failure path and damage in the process of brittle failure. Towards this goal, we first present a method to predict failure paths under tensile loading conditions and low-strain rates. The method uses a $k$-nearest neighbors algorithm built on fracture process zone theory, and identifies the set of all possible pre-existing cracks that are likely to join early to form a large crack. The method then identifies zone of failure and failure paths using weighted graphs algorithms. We compare these failure paths to those computed with a high-fidelity model called the Hybrid Optimization Software Simulation Suite (HOSS). A probabilistic evolution model for average damage in a system is also developed that is trained using 150 HOSS simulations and tested on 40 simulations. A non-parametric approach based on confidence intervals is used to determine the damage evolution over time along the dominant failure path. For upscaling, damage is the key QoI needed as an input by the continuum models. This needs to be informed accurately by the surrogate models for calculating effective modulii at continuum-scale. We show that for the proposed average damage evolution model, the prediction accuracy on the test data is more than 90\%. In terms of the computational time, the proposed models are $\approx \mathcal{O}(10^6)$ times faster compared to high-fidelity HOSS. △ Less

Submitted 30 July, 2018; originally announced July 2018.

Comments: 20 pages, 10 figures

arXiv:1807.00343 [pdf, other]

Xcel-RAM: Accelerating Binary Neural Networks in High-Throughput SRAM Compute Arrays

Authors: Amogh Agrawal, Akhilesh Jaiswal, Deboleena Roy, Bing Han, Gopalakrishnan Srinivasan, Aayush Ankit, Kaushik Roy

Abstract: Deep neural networks are a biologically-inspired class of algorithms that have recently demonstrated state-of-the-art accuracies involving large-scale classification and recognition tasks. Indeed, a major landmark that enables efficient hardware accelerators for deep networks is the recent advances from the machine learning community that have demonstrated aggressively scaled deep binary networks… ▽ More Deep neural networks are a biologically-inspired class of algorithms that have recently demonstrated state-of-the-art accuracies involving large-scale classification and recognition tasks. Indeed, a major landmark that enables efficient hardware accelerators for deep networks is the recent advances from the machine learning community that have demonstrated aggressively scaled deep binary networks with state-of-the-art accuracies. In this paper, we demonstrate how deep binary networks can be accelerated in modified von-Neumann machines by enabling binary convolutions within the SRAM array. In general, binary convolutions consist of bit-wise XNOR followed by a population-count (popcount). We present a charge sharing XNOR and popcount operation in 10 transistor SRAM cells. We have employed multiple circuit techniques including dual-read-worldines (Dual-RWL) along with a dual-stage ADC that overcomes the inaccuracies of a low precision ADC, to achieve a fairly accurate popcount. In addition, a key highlight of the present work is the fact that we propose sectioning of the SRAM array by adding switches onto the read-bitlines, thereby achieving improved parallelism. This is beneficial for deep networks, where the kernel size grows and requires to be stored in multiple sub-banks. As such, one needs to evaluate the partial popcount from multiple sub-banks and sum them up for achieving the final popcount. For n-sections per sub-array, we can perform n convolutions within one particular sub-bank, thereby improving overall system throughput as well as the energy efficiency. Our results at the array level show that the energy consumption and delay per-operation was 1.914pJ and 45ns, respectively. Moreover, an energy improvement of 2.5x, and a performance improvement of 4x was achieved by using the proposed sectioned-SRAM, compared to a non-sectioned SRAM design. △ Less

Submitted 21 October, 2018; v1 submitted 1 July, 2018; originally announced July 2018.

arXiv:1806.01949 [pdf, ps, other]

Reduced-Order Modeling through Machine Learning Approaches for Brittle Fracture Applications

Authors: A. Hunter, B. A. Moore, M. K. Mudunuru, V. T. Chau, R. L. Miller, R. B. Tchoua, C. Nyshadham, S. Karra, D. O. Malley, E. Rougier, H. S. Viswanathan, G. Srinivasan

Abstract: In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduc… ▽ More In this paper, five different approaches for reduced-order modeling of brittle fracture in geomaterials, specifically concrete, are presented and compared. Four of the five methods rely on machine learning (ML) algorithms to approximate important aspects of the brittle fracture problem. In addition to the ML algorithms, each method incorporates different physics-based assumptions in order to reduce the computational complexity while maintaining the physics as much as possible. This work specifically focuses on using the ML approaches to model a 2D concrete sample under low strain rate pure tensile loading conditions with 20 preexisting cracks present. A high-fidelity finite element-discrete element model is used to both produce a training dataset of 150 simulations and an additional 35 simulations for validation. Results from the ML approaches are directly compared against the results from the high-fidelity model. Strengths and weaknesses of each approach are discussed and the most important conclusion is that a combination of physics-informed and data-driven features are necessary for emulating the physics of crack propagation, interaction and coalescence. All of the models presented here have runtimes that are orders of magnitude faster than the original high-fidelity model and pave the path for develo** accurate reduced order models that could be used to inform larger length-scale models with important sub-scale physics that often cannot be accounted for due to computational cost. △ Less

Submitted 5 June, 2018; originally announced June 2018.

Comments: 25 pages, 8 figures

arXiv:1709.01797 [pdf]

ZnO/LSMO Nanocomposites for Energy Harvesting

Authors: Robert Kinner, Abdul-Majeed Azad, G Srinivasan, G Sreenivasulu, Menka Jain

Abstract: The composites of strontium-doped lanthanum manganite (LSMO) with zinc oxide (ZnO) are candidate materials for energy harvesting by virtue of their magnetic and piezoelectric characteristics. They could be used to harvest energy from stray sources, such as the vibrations and electromagnetic noise from transformers and compressors within electrical grid power stations to power small diagnostic sens… ▽ More The composites of strontium-doped lanthanum manganite (LSMO) with zinc oxide (ZnO) are candidate materials for energy harvesting by virtue of their magnetic and piezoelectric characteristics. They could be used to harvest energy from stray sources, such as the vibrations and electromagnetic noise from transformers and compressors within electrical grid power stations to power small diagnostic sensors, among other applications. The LSMO/ZnO nanocomposites were made by: (i) milling the two bulk powders and, (ii) a wet chemical process which resulted in core-shell structures. The electrical, piezoelectric, and magnetoelectric properties showed strong dependence on the fabrication method. Growth of ZnO nanopillars on the particulate core of LSMO surface appears to have improved the piezoelectric properties. Moreover, the chemical bath deposition process can be easily modified to incorporate dopants to augment these properties further. △ Less

Submitted 22 August, 2017; originally announced September 2017.

Journal ref: Smart Nanosystems in Engineering and Medicine Vol.2 pp3-17 (2012)

arXiv:1709.01794 [pdf]

doi 10.1109/ICSensT.2015.7438459

A pico-Tesla magnetic sensor with PZT bimorph and permanent magnet proof mass

Authors: G Srinivasan, G Sreenivasulu, Peng Qu, Hongwei Qu, Vladimir Petrov

Abstract: Ferromagnetic-ferroelectric composites have attracted interests in recent years for use as magnetic field sensors. The sensing is based on magneto-electric (ME) coupling between the electric and magnetic subsystems and is mediated by mechanical strain. Such sensors for AC magnetic fields require a bias magnetic field to achieve pT-sensitivity. Here we discuss measurements and theory for a novel pa… ▽ More Ferromagnetic-ferroelectric composites have attracted interests in recent years for use as magnetic field sensors. The sensing is based on magneto-electric (ME) coupling between the electric and magnetic subsystems and is mediated by mechanical strain. Such sensors for AC magnetic fields require a bias magnetic field to achieve pT-sensitivity. Here we discuss measurements and theory for a novel passive, AC magnetic sensor that does not require a bias magnetic field and is based on a PZT bimorph with a permanent magnet proof mass. Mechanical strain on the PZT bimorph in this case is produced by interaction between the applied AC magnetic field and remnant magnetization of the permanent magnet, resulting in an induced voltage across PZT. Our studies have been performed on sensors with a bimorph of oppositely poled PZT platelets and a NdFeB permanent magnet proof mass. Magnetic floor noise N on the order of 100 pT per sqrHz and 10 nT per sqrHz are measured at 1 Hz and 10 Hz, respectively. When the AC magnetic field is applied at the bending resonance of 40 Hz for the bimorph, the measured N 700 pT per sqrHz. We also discuss a theory for the magneto-electro-mechanical coupling at low frequency and bending resonance in the sensor and theoretical estimates of ME voltage coefficients are in very good agreement with the data. △ Less

Submitted 22 August, 2017; originally announced September 2017.

Journal ref: IEEE sensors, 2015

arXiv:1708.08556 [pdf, other]

doi 10.1103/PhysRevE.97.033304

Modeling flow and transport in fracture networks using graphs

Authors: S. Karra, D. O'Malley, J. D. Hyman, H. S. Viswanathan, G. Srinivasan

Abstract: Fractures form the main pathways for flow in the subsurface within low-permeability rock. For this reason, accurately predicting flow and transport in fractured systems is vital for improving the performance of subsurface applications. Fracture sizes in these systems can range from millimeters to kilometers. Although, modeling flow and transport using the discrete fracture network (DFN) approach i… ▽ More Fractures form the main pathways for flow in the subsurface within low-permeability rock. For this reason, accurately predicting flow and transport in fractured systems is vital for improving the performance of subsurface applications. Fracture sizes in these systems can range from millimeters to kilometers. Although, modeling flow and transport using the discrete fracture network (DFN) approach is known to be more accurate due to incorporation of the detailed fracture network structure over continuum-based methods, capturing the flow and transport in such a wide range of scales is still computationally intractable. Furthermore, if one has to quantify uncertainty, hundreds of realizations of these DFN models have to be run. To reduce the computational burden, we solve flow and transport on a graph representation of a DFN. We study the accuracy of the graph approach by comparing breakthrough times and tracer particle statistical data between the graph-based and the high-fidelity DFN approaches, for fracture networks with varying number of fractures and degree of heterogeneity. We show that the graph approach shows a consistent bias with up to an order of magnitude slower breakthrough when compared to the DFN approach. We show that this is due to graph algorithm's under-prediction of the pressure gradients across intersections on a given fracture, leading to slower tracer particle speeds between intersections and longer travel times. We present a bias correction methodology to the graph algorithm that reduces the discrepancy between the DFN and graph predictions. We show that with this bias correction, the graph algorithm predictions significantly improve and the results are very accurate. The good accuracy and the low computational cost, with $O(10^4)$ times lower times than the DFN, makes the graph algorithm, an ideal technique to incorporate in uncertainty quantification methods. △ Less

Submitted 20 February, 2018; v1 submitted 28 August, 2017; originally announced August 2017.

Comments: 11 pages, 10 figures, 2 tables

Journal ref: Phys. Rev. E 97, 033304 (2018)

arXiv:1708.05231 [pdf]

doi 10.3390/ma11010018

Multiferroic Core-Shell Nanofibers, Assembly in a Magnetic field and Studies on MagnetoElectric Interactions

Authors: G. Sreenivasulu, Jitao Zhang, Ru Zhang, M. Popov, V. M. Petrov, G. Srinivasan

Abstract: Ferromagnetic-ferroelectric nanocomposites are of interest for realizing strong strain mediated coupling between electric and magnetic subsystems due to high surface area-to-volume ratio. This report is on the synthesis of nickel ferrite (NFO) -barium titanate (BTO) core-shell nano-fibers, magnetic field assisted assembly into superstructures, and studies on magneto-electric (ME) interactions. Ele… ▽ More Ferromagnetic-ferroelectric nanocomposites are of interest for realizing strong strain mediated coupling between electric and magnetic subsystems due to high surface area-to-volume ratio. This report is on the synthesis of nickel ferrite (NFO) -barium titanate (BTO) core-shell nano-fibers, magnetic field assisted assembly into superstructures, and studies on magneto-electric (ME) interactions. Electrospinning techniques were used to prepare coaxial fibers of 0.5-1.5 micron in diameter. The core-shell structure of annealed fibers was confirmed by electron microscopy and scanning probe microscopy. The fibers were assembled into discs and films in a uniform magnetic field or a field gradient. Studies on ME coupling in the assembled films and discs were done by magnetic field H induced polarization, magneto-dielectric effects at low frequencies and at 16-24 GHz, and low frequency ME voltage coefficients (MEVC). We measured 2~ 2-7% change in remnant polarization and in the permittivity for H = 7 kOe, and a MEVC of 0.4 mV/cm Oe at 30 Hz. A model has been developed for low-frequency ME effects in an assembly of fibers and takes into account dipole-dipole interactions between the fibers and fiber discontinuity. Theoretical estimates for the low-frequency MEVC have been compared with the data. These results indicate strong ME coupling in superstructures of the core-shell fibers. △ Less

Submitted 17 August, 2017; originally announced August 2017.

Journal ref: Materials 2017, 11, 18

arXiv:1705.09866 [pdf, other]

doi 10.1007/s10596-018-9720-1

Machine learning for graph-based representations of three-dimensional discrete fracture networks

Authors: Manuel Valera, Zhengyang Guo, Priscilla Kelly, Sean Matz, Vito Adrian Cantu, Allon G. Percus, Jeffrey D. Hyman, Gowri Srinivasan, Hari S. Viswanathan

Abstract: Structural and topological information play a key role in modeling flow and transport through fractured rock in the subsurface. Discrete fracture network (DFN) computational suites such as dfnWorks are designed to simulate flow and transport in such porous media. Flow and transport calculations reveal that a small backbone of fractures exists, where most flow and transport occurs. Restricting the… ▽ More Structural and topological information play a key role in modeling flow and transport through fractured rock in the subsurface. Discrete fracture network (DFN) computational suites such as dfnWorks are designed to simulate flow and transport in such porous media. Flow and transport calculations reveal that a small backbone of fractures exists, where most flow and transport occurs. Restricting the flowing fracture network to this backbone provides a significant reduction in the network's effective size. However, the particle tracking simulations needed to determine the reduction are computationally intensive. Such methods may be impractical for large systems or for robust uncertainty quantification of fracture networks, where thousands of forward simulations are needed to bound system behavior. In this paper, we develop an alternative network reduction approach to characterizing transport in DFNs, by combining graph theoretical and machine learning methods. We consider a graph representation where nodes signify fractures and edges denote their intersections. Using random forest and support vector machines, we rapidly identify a subnetwork that captures the flow patterns of the full DFN, based primarily on node centrality features in the graph. Our supervised learning techniques train on particle-tracking backbone paths found by dfnWorks, but run in negligible time compared to those simulations. We find that our predictions can reduce the network to approximately 20% of its original size, while still generating breakthrough curves consistent with those of the original network. △ Less

Submitted 29 January, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

Comments: Computational Geosciences (2018)

Report number: LA-UR-17-24300

Journal ref: Computational Geosciences 22, 695-710 (2018)

arXiv:1703.03854 [pdf, other]

Convolutional Spike Timing Dependent Plasticity based Feature Learning in Spiking Neural Networks

Authors: Priyadarshini Panda, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: Brain-inspired learning models attempt to mimic the cortical architecture and computations performed in the neurons and synapses constituting the human brain to achieve its efficiency in cognitive tasks. In this work, we present convolutional spike timing dependent plasticity based feature learning with biologically plausible leaky-integrate-and-fire neurons in Spiking Neural Networks (SNNs). We u… ▽ More Brain-inspired learning models attempt to mimic the cortical architecture and computations performed in the neurons and synapses constituting the human brain to achieve its efficiency in cognitive tasks. In this work, we present convolutional spike timing dependent plasticity based feature learning with biologically plausible leaky-integrate-and-fire neurons in Spiking Neural Networks (SNNs). We use shared weight kernels that are trained to encode representative features underlying the input patterns thereby improving the sparsity as well as the robustness of the learning model. We demonstrate that the proposed unsupervised learning methodology learns several visual categories for object recognition with fewer number of examples and outperforms traditional fully-connected SNN architectures while yielding competitive accuracy. Additionally, we observe that the learning model performs out-of-set generalization further making the proposed biologically plausible framework a viable and efficient architecture for future neuromorphic applications. △ Less

Submitted 20 March, 2017; v1 submitted 10 March, 2017; originally announced March 2017.

Comments: 11 pages, 10 figures, Under Consideration in Scientific Reports

arXiv:1611.00021 [pdf, other]

doi 10.1142/S0218863516500429

Light dynamics in nonlinear trimers and twisted multicore fibers

Authors: Claudia Castro-Castro, Yannan Shen, Gowri Srinivasan, Alejandro B. Aceves, Panayotis G. Kevrekidis

Abstract: Novel photonic structures such as multi-core fibers and graphene based arrays present unique opportunities to manipulate and control the propagation of light. Here we discuss nonlinear dynamics for structures with a few (2 to 6) elements for which linear and nonlinear properties can be tuned. Specifically we show how nonlinearity, coupling, and parity-time PT symmetric gain/loss relate to existenc… ▽ More Novel photonic structures such as multi-core fibers and graphene based arrays present unique opportunities to manipulate and control the propagation of light. Here we discuss nonlinear dynamics for structures with a few (2 to 6) elements for which linear and nonlinear properties can be tuned. Specifically we show how nonlinearity, coupling, and parity-time PT symmetric gain/loss relate to existence, stability and in general, dynamical properties of nonlinear optical modes. The main emphasis of our presentation will be on systems with few degrees of freedom, most notably couplers, trimers and generalizations thereof to systems with 6 nodes. △ Less

Submitted 2 November, 2016; v1 submitted 31 October, 2016; originally announced November 2016.

Comments: 11 pages, 5 figures, Submitted to JNOPM

arXiv:1609.09158 [pdf, other]

doi 10.1109/TED.2017.2671353

Proposal for a Leaky-Integrate-Fire Spiking Neuron based on Magneto-Electric Switching of Ferro-magnets

Authors: Akhilesh Jaiswal, Sourjya Roy, Gopalakrishnan Srinivasan, Kaushik Roy

Abstract: The efficiency of the human brain in performing classification tasks has attracted considerable research interest in brain-inspired neuromorphic computing. Hardware implementations of a neuromorphic system aims to mimic the computations in the brain through interconnection of neurons and synaptic weights. A leaky-integrate-fire (LIF) spiking model is widely used to emulate the dynamics of neuronal… ▽ More The efficiency of the human brain in performing classification tasks has attracted considerable research interest in brain-inspired neuromorphic computing. Hardware implementations of a neuromorphic system aims to mimic the computations in the brain through interconnection of neurons and synaptic weights. A leaky-integrate-fire (LIF) spiking model is widely used to emulate the dynamics of neuronal action potentials. In this work, we propose a spin based LIF spiking neuron using the magneto-electric (ME) switching of ferro-magnets. The voltage across the ME oxide exhibits a typical leaky-integrate behavior, which in turn switches an underlying ferro-magnet. Due to the effect of thermal noise, the ferro-magnet exhibits probabilistic switching dynamics, which is reminiscent of the stochasticity exhibited by biological neurons. The energy-efficiency of the ME switching mechanism coupled with the intrinsic non-volatility of ferro-magnets result in lower energy consumption, when compared to a CMOS LIF neuron. A device to system-level simulation framework has been developed to investigate the feasibility of the proposed LIF neuron for a hand-written digit recognition problem △ Less

Submitted 28 September, 2016; originally announced September 2016.

arXiv:1603.07643 [pdf]

Giant Magnetoelectric coupling in Single Phase Pb(Zr0.20Ti0.80)0.70Pd0.30O3-δ Multiferroics

Authors: Shalini Kumari, Dhiren K. Pradhan, Nora Ortega, Kallol Pradhan, Christopher DeVreugd, Gopalan Srinivasan, Ashok Kumar, J. F. Scott, Ram S. Katiyar

Abstract: During the last fifteen years, multiferroic (MF) research communities have been searching for an alternative room temperature MF material with large magnetoelectric (ME) coupling for possible applications in high density electronic components, low heat dissipation memory and logic devices. We have studied Pb(Zr0.20Ti0.80)0.70Pd0.30O3-δ (PZTP30) system with an unusually large (30%) palladium occupa… ▽ More During the last fifteen years, multiferroic (MF) research communities have been searching for an alternative room temperature MF material with large magnetoelectric (ME) coupling for possible applications in high density electronic components, low heat dissipation memory and logic devices. We have studied Pb(Zr0.20Ti0.80)0.70Pd0.30O3-δ (PZTP30) system with an unusually large (30%) palladium occupancy in B site of PZT. This material exhibited a giant ME coupling coefficient ~0.36 mV/cm.Oe. Interestingly, this is the first time any room temperature single phase compound that showed ME trends, and magnitude similar to those in the well established mechanical strain-mediated ferroelectric and ferromagnetic composites; the latter ones are already in the commercial stage as nT/pT magnetic field sensors due to their large ME values. The presence of Pd in PZTP30 has been confirmed by XPS and XRF studies and assigned with related binding energies of Pd+2 and Pd+4 ions as 336.37 eV, 342.9 eV, and 337.53 eV, 343.43 eV, respectively, which may be the origin of room temperature magnetism in Pd substituted PZT ceramics. A sharp first order ferroelectric phase transition was observed at ~569 K (+/-5 K) that is confirmed from dielectric, Raman, and thermal analysis. Both ferromagnetic and ferroelectric orderings with large ME coupling were found above room temperature, a significant step forward in the development of single phase ME material with enhanced functionalities. △ Less

Submitted 24 March, 2016; originally announced March 2016.

Comments: 37 pages, 9 figures

arXiv:1602.08556 [pdf, other]

Significance Driven Hybrid 8T-6T SRAM for Energy-Efficient Synaptic Storage in Artificial Neural Networks

Authors: Gopalakrishnan Srinivasan, Parami Wijesinghe, Syed Shakib Sarwar, Akhilesh Jaiswal, Kaushik Roy

Abstract: Multilayered artificial neural networks (ANN) have found widespread utility in classification and recognition applications. The scale and complexity of such networks together with the inadequacies of general purpose computing platforms have led to a significant interest in the development of efficient hardware implementations. In this work, we focus on designing energy efficient on-chip storage fo… ▽ More Multilayered artificial neural networks (ANN) have found widespread utility in classification and recognition applications. The scale and complexity of such networks together with the inadequacies of general purpose computing platforms have led to a significant interest in the development of efficient hardware implementations. In this work, we focus on designing energy efficient on-chip storage for the synaptic weights. In order to minimize the power consumption of typical digital CMOS implementations of such large-scale networks, the digital neurons could be operated reliably at scaled voltages by reducing the clock frequency. On the contrary, the on-chip synaptic storage designed using a conventional 6T SRAM is susceptible to bitcell failures at reduced voltages. However, the intrinsic error resiliency of NNs to small synaptic weight perturbations enables us to scale the operating voltage of the 6TSRAM. Our analysis on a widely used digit recognition dataset indicates that the voltage can be scaled by 200mV from the nominal operating voltage (950mV) for practically no loss (less than 0.5%) in accuracy (22nm predictive technology). Scaling beyond that causes substantial performance degradation owing to increased probability of failures in the MSBs of the synaptic weights. We, therefore propose a significance driven hybrid 8T-6T SRAM, wherein the sensitive MSBs are stored in 8T bitcells that are robust at scaled voltages due to decoupled read and write paths. In an effort to further minimize the area penalty, we present a synaptic-sensitivity driven hybrid memory architecture consisting of multiple 8T-6T SRAM banks. Our circuit to system-level simulation framework shows that the proposed synaptic-sensitivity driven architecture provides a 30.91% reduction in the memory access power with a 10.41% area overhead, for less than 1% loss in the classification accuracy. △ Less

Submitted 27 February, 2016; originally announced February 2016.

Comments: Accepted in Design, Automation and Test in Europe 2016 conference (DATE-2016)

Journal ref: In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2016, pp. 151-156

arXiv:1510.08426 [pdf, other]

doi 10.1088/1751-8113/49/29/295205

Existence, Stability and Dynamics of Discrete Solitary Waves in a Binary Waveguide Array

Authors: Y. Shen, P. G. Kevrekidis, G. Srinivasan, A. B. Aceves

Abstract: Recent work has explored binary waveguide arrays in the long-wavelength, near-continuum limit, here we examine the opposite limit, namely the vicinity of the so-called anti-continuum limit. We provide a systematic discussion of states involving one, two and three excited waveguides, and provide comparisons that illustrate how the stability of these states differ from the monoatomic limit of a sing… ▽ More Recent work has explored binary waveguide arrays in the long-wavelength, near-continuum limit, here we examine the opposite limit, namely the vicinity of the so-called anti-continuum limit. We provide a systematic discussion of states involving one, two and three excited waveguides, and provide comparisons that illustrate how the stability of these states differ from the monoatomic limit of a single type of waveguide. We do so by develo** a general theory which systematically tracks down the key eigenvalues of the linearized system. When we find the states to be unstable, we explore their dynamical evolution through direct numerical simulations. The latter typically illustrate, for the parameter values considered herein, the persistence of localized dynamics and the emergence for the duration of our simulations of robust quasi-periodic states for two excited sites. As the number of excited nodes increase, the unstable dynamics feature less regular oscillations of the solution's amplitude. △ Less

Submitted 28 October, 2015; originally announced October 2015.

arXiv:1303.2258 [pdf, ps, other]

doi 10.1103/PhysRevB.87.104114

Thermal properties of fluorinated graphene

Authors: S. K. Singh, S. Goverapet Srinivasan, M. Neek-Amal, S. Costamagna, Adri C. T. van Duin, F. M. Peeters

Abstract: Large scale atomistic simulations using the reactive force field approach (ReaxFF) are implemented to investigate the thermomechanical properties of fluorinated graphene (FG). A new set of parameters for the reactive force field potential (ReaxFF) optimized to reproduce key quantum mechanical properties of relevant carbon-fluor cluster systems are presented. Molecular dynamics (MD) simulations are… ▽ More Large scale atomistic simulations using the reactive force field approach (ReaxFF) are implemented to investigate the thermomechanical properties of fluorinated graphene (FG). A new set of parameters for the reactive force field potential (ReaxFF) optimized to reproduce key quantum mechanical properties of relevant carbon-fluor cluster systems are presented. Molecular dynamics (MD) simulations are used to investigate the thermal rippling behavior of FG and its mechanical properties and compare them with graphene (GE), graphane (GA) and a sheet of BN. The mean square value of the height fluctuations $< h^2>$ and the height-height correlation function $H(q)$ for different system sizes and temperatures show that FG is an un-rippled system in contrast to the thermal rippling behavior of graphene (GE). The effective Young's modulus of a flake of fluorinated graphene is obtained to be 273 N/m and 250 N/m for a flake of FG under uniaxial strain along arm-chair and zig-zag direction, respectively. △ Less

Submitted 9 March, 2013; originally announced March 2013.

Comments: To appear in Phys. Rev. B

Journal ref: Physical Review B 87 (10), 104114 (2013)

arXiv:1208.6079 [pdf, ps, other]

A unified approach to the integrals of Mellin--Barnes--Hecke type

Authors: Gopala Krishna Srinivasan

Abstract: In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms. Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel fu… ▽ More In this paper we provide a unified approach to a family of integrals of Mellin--Barnes type using distribution theory and Fourier transforms. Interesting features arise in many of the cases which call for the application of pull-backs of distributions via smooth submersive maps defined by Hörmander. We derive by this method the integrals of Hecke and Sonine relating to various types of Bessel functions which have found applications in analytic and algebraic number theory. △ Less

Submitted 30 August, 2012; originally announced August 2012.

Comments: The paper has been accepted for publication in Expositiones Mathematicae

arXiv:1207.3411 [pdf]

doi 10.1103/PhysRevB.86.214405

Magnetoelectric Interactions in Layered Composites of Piezoelectric Quartz and Magnetostrictive Alloys

Authors: G. Sreenivasulu, V. M. Petrov, L. Y. Fetisov, Y. K. Fetisov, G. Srinivasan

Abstract: Mechanical strain mediated magnetoelectric effects are studied in bilayers and trilayers of piezoelectric quartz and magnetostrictive permendur (P), an alloy of Fe-Co-V. It is shown that the magneto-electric voltage coefficient (MEVC), proportional to the ratio of the piezoelectric coupling coefficient to the permittivity, is higher in quart-based composites than for traditional ferroelectrics bas… ▽ More Mechanical strain mediated magnetoelectric effects are studied in bilayers and trilayers of piezoelectric quartz and magnetostrictive permendur (P), an alloy of Fe-Co-V. It is shown that the magneto-electric voltage coefficient (MEVC), proportional to the ratio of the piezoelectric coupling coefficient to the permittivity, is higher in quart-based composites than for traditional ferroelectrics based ME composites. In bilayers of X-cut single crystal quartz and permendur, the MEVC varies from 1.5 V/cm Oe at 20 Hz to ~ 185 V/cm Oe at bending resonance or electromechanical resonance corresponding to longitudinal acoustic modes. In symmetric quartz-P trilayers, the MEVC ~ 4.8 V/cm Oe at 20 Hz and ~ 175 V/cm Oe at longitudinal acoustic resonance. A model for low-frequency and resonance ME effects is provided for theoretical estimates of MEVC and calculated MEVC are in general agreement with measured values. Magneto-electric composites with quartz have the desired characteristics such as the absence of ferroelectric hysteresis and pyroelectric losses and could potentially replace ferroelectrics in composite-based magnetic sensors, transducers and high frequency devices. △ Less

Submitted 24 November, 2012; v1 submitted 14 July, 2012; originally announced July 2012.

arXiv:1206.5122 [pdf, ps, other]

On a remarkable formula of Ramanujan

Authors: Debraj Chakrabarti, Gopala Krishna Srinivasan

Abstract: A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given. The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis. A simple proof of Ramanujan's formula for the Fourier transform of the square of the modulus of the Gamma function restricted to a vertical line in the right half-plane is given. The result is extended to vertical lines in the left half-plane by solving an inhomogeneous ODE. We then use it to calculate the jump across the imaginary axis. △ Less

Submitted 22 June, 2012; originally announced June 2012.

Comments: To appear in Archiv der Mathematik

MSC Class: 33B15

arXiv:1202.4363 [pdf]

doi 10.1002/pssa.201228154

In-plane Dielectric and Magnetoelectric Studies of BiFeO3

Authors: Ashok Kumar, J. F. Scott, R. Martinez, G. Srinivasan, R. S. Katiyar

Abstract: In-plane temperature dependent dielectric behavior of BiFeO3 (BFO) as-grown thin films show diffuse but prominent phase transitions near 450 (+/-10) K and 550 K with dielectric loss temperature dependences that suggest skin layer effects. The 450 K anomalies are near the "transition" first reported by Polomska et al. [Phys. Stat. Sol. 23, 567 (1974)]. The 550 K anomalies coincide with the surface… ▽ More In-plane temperature dependent dielectric behavior of BiFeO3 (BFO) as-grown thin films show diffuse but prominent phase transitions near 450 (+/-10) K and 550 K with dielectric loss temperature dependences that suggest skin layer effects. The 450 K anomalies are near the "transition" first reported by Polomska et al. [Phys. Stat. Sol. 23, 567 (1974)]. The 550 K anomalies coincide with the surface phase transition recently reported [Xavi et al. PRL 106, 236101 (2011)]. In addition, anomalies are found at low temperatures: After several experimental cycles the dielectric loss shows a clear relaxor-like phase transition near what was previously suggested to be a spin reorientation transition (SRT) temperature (~ 201 K) for frequencies 1 kHz < f < 1MHz which follow a nonlinear Vogel-Fulcher (V-F) relation; an additional sharp anomaly is observed near ~180 K at frequencies below 1 kHz. As emphasized recently by Cowley et al. [Adv. Phys. 60, 229 (2011)], skin effects are expected for all relaxor ferroelectrics. Using the interdigital electrodes, experimental data and a theoretical model for in-plane longitudinal and transverse direct magnetoelectric (ME) coefficient are presented. △ Less

Submitted 20 February, 2012; originally announced February 2012.

Comments: 14 pages, 4 figures

arXiv:0711.1528 [pdf]

doi 10.1063/1.2884529

Structural, elastic and electronic properties of Fe3C from first-principles

Authors: Chao Jiang, S. G. Srinivasan, A. Caro, S. A. Maloy

Abstract: Using first-principles calculations within the generalized gradient approximation, we predicted the lattice parameters, elastic constants, vibrational properties, and electronic structure of cementite (Fe3C). Its nine single-crystal elastic constants were obtained by computing total energies or stresses as a function of applied strain. Furthermore, six of them were determined from the initial sl… ▽ More Using first-principles calculations within the generalized gradient approximation, we predicted the lattice parameters, elastic constants, vibrational properties, and electronic structure of cementite (Fe3C). Its nine single-crystal elastic constants were obtained by computing total energies or stresses as a function of applied strain. Furthermore, six of them were determined from the initial slopes of the calculated longitudinal and transverse acoustic phonon branches along the [100], [010] and [001] directions. The three methods agree well with each other, the calculated polycrystalline elastic moduli are also in good overall agreement with experiments. Our calculations indicate that Fe3C is mechanically stable. The experimentally observed high elastic anisotropy of Fe3C is also confirmed by our study. Based on electronic density of states and charge density distribution, the chemical bonding in Fe3C was analyzed and was found to exhibit a complex mixture of metallic, covalent, and ionic characters. △ Less

Submitted 9 November, 2007; originally announced November 2007.

Showing 1–50 of 76 results for author: Srinivasan, G