-
Exploring Geometrical Properties of Chaotic Systems Through an Analysis of the Rulkov Neuron Maps
Authors:
Brandon B. Le,
Nivika A. Gandhi
Abstract:
While extensive research has been conducted on chaos emerging from a dynamical system's temporal dynamics, the research documented in this paper examines extreme sensitivity to initial conditions in discrete-time dynamical systems from a geometrical perspective. The heart of this paper focuses on two simple neuron maps developed by Nikolai F. Rulkov in the early 2000s and the complex geometrical structures that emerge from them. Beginning with a conversational introduction to the geometry of chaos, this paper integrates mathematics, physics, neurobiology, computational modeling, and electrochemistry to present original research that provides a novel perspective on how types of geometrical sensitivity to initial conditions appear in discrete-time neuron systems.
This paper was developed in the Thomas Jefferson High School for Science and Technology Quantum Lab as part of a senior research project.
Submitted 12 June, 2024;
originally announced June 2024.
-
MathDivide: Improved mathematical reasoning by large language models
Authors:
Saksham Sahai Srivastava,
Ashutosh Gandhi
Abstract:
Large language models have been proven to be capable of handling complex linguistic and cognitive tasks. Their usage has therefore been extended to tasks requiring logical reasoning ability, such as mathematics. In this paper, we propose a prompting technique called MathDivide that breaks down the mathematical problem into simpler subproblems. Each of the subproblems is formulated as an algebraic expression whose value is evaluated by the Python code generated by the LLM for the corresponding algebraic expression. The values fed to the Python code are the numerical values provided in the problem statement. The solutions for the subproblems are composed together to obtain the final answer for the problem statement. Finally, the final answer is compared to the correct answer. If the final answer matches the correct answer, it is produced as output; otherwise, a refinement prompt is fed to the LLM. We experiment with this prompting technique on both closed-source and open-source LLMs using the GSM8K dataset. The results demonstrate that MathDivide significantly outperformed the leading prompting technique, Math-prompter.
Submitted 12 May, 2024;
originally announced May 2024.
-
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review
Authors:
Huizi Yu,
Lizhou Fan,
Lingyao Li,
Jiayan Zhou,
Zihui Ma,
Lu Xian,
Wenyue Hua,
Sijia He,
Mingyu Jin,
Yongfeng Zhang,
Ashvin Gandhi,
Xin Ma
Abstract:
Large Language Models (LLMs) have rapidly become important tools in Biomedical and Health Informatics (BHI), enabling new ways to analyze data, treat patients, and conduct research. This bibliometric review aims to provide a panoramic view of how LLMs have been used in BHI by examining research articles and collaboration networks from 2022 to 2023. It further explores how LLMs can improve Natural Language Processing (NLP) applications in various BHI areas like medical diagnosis, patient engagement, electronic health record management, and personalized medicine. To do this, our bibliometric review identifies key trends, maps out research networks, and highlights major developments in this fast-moving field. Lastly, it discusses the ethical concerns and practical challenges of using LLMs in BHI, such as data privacy and reliable medical recommendations. Looking ahead, we consider how LLMs could further transform biomedical research as well as healthcare delivery and patient outcomes. This bibliometric review serves as a resource for stakeholders in healthcare, including researchers, clinicians, and policymakers, to understand the current state and future potential of LLMs in BHI.
Submitted 23 April, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Eigenmode Decomposition Method for Full-Wave Modeling of Microring Resonators
Authors:
Yuriy Akimov,
Aswin Alexander Eapen,
Shiyang Zhu,
Doris K. T. Ng,
Nanxi Li,
Woon Leng Loh,
Lennon Y. T. Lee,
Alagappan Gandhi,
Aravind P. Anthur
Abstract:
We develop a theoretical predictive model for an all-pass ring resonator that enables the most complete description of linear coupling regimes. The model is based on eigenmode decomposition of Maxwell's equations with full account of the confined and leaky modes, as opposed to the existing phenomenological methods restricted to the confined modes only. This model enables quantitative description of all-pass ring resonators and provides insights into the physics underlying microring-waveguide coupling. We experimentally validate the model using transmission measurements in the linear regime of aluminium nitride resonators. The developed model is then used to explore the field enhancement in microrings crucial for nonlinear photonic applications.
Submitted 6 February, 2024;
originally announced February 2024.
-
Annotated Hands for Generative Models
Authors:
Yue Yang,
Atith N Gandhi,
Greg Turk
Abstract:
Generative models such as GANs and diffusion models have demonstrated impressive image generation capabilities. Despite these successes, these systems are surprisingly poor at creating images with hands. We propose a novel training framework for generative models that substantially improves the ability of such systems to create hand images. Our approach is to augment the training images with three additional channels that provide annotations to hands in the image. These annotations provide additional structure that coaxes the generative model to produce higher-quality hand images. We demonstrate this approach on two different generative models: a generative adversarial network and a diffusion model. We demonstrate our method both on a new synthetic dataset of hand images and on real photographs that contain hands. We measure the improved quality of the generated hands through higher confidence in finger joint identification using an off-the-shelf hand detector.
Submitted 26 January, 2024;
originally announced January 2024.
-
Catastrophic Interference is Mitigated in Naturalistic Power-Law Learning Environments
Authors:
Atith Gandhi,
Raj Sanjay Shah,
Vijay Marupudi,
Sashank Varma
Abstract:
Neural networks often suffer from catastrophic interference (CI): performance on previously learned tasks drops off significantly when learning a new task. This contrasts strongly with humans, who can sequentially learn new tasks without appreciably forgetting previous tasks. Prior work has explored various techniques for mitigating CI such as regularization, rehearsal, generative replay, and distillation methods. The current work takes a different approach, one guided by cognitive science research showing that in naturalistic environments, the probability of encountering a task decreases as a power-law of the time since it was last performed. We argue that a realistic evaluation of techniques for the mitigation of CI should be performed in simulated naturalistic learning environments. Thus, we evaluate the extent of mitigation of CI when training simple rehearsal-based methods in power-law environments similar to the ones humans face. Our work explores this novel rehearsal-based approach for a domain-incremental task: learning permutations in the MNIST task. We compare our rehearsal environment with other baselines to show its efficacy in promoting continual learning. Additionally, we investigate whether this environment shows forward facilitation, i.e., faster learning of later tasks. Next, we explore the robustness of our learning environment to the number of tasks, model size, and amount of data rehearsed after each task. Notably, our results show that the performance is comparable or superior to that of models trained using popular regularization methods and also to rehearsals in non-power-law environments. The benefits of this training paradigm include simplicity and the lack of a need for extra neural circuitry. In addition, because our method is orthogonal to other methods, future research can combine training in power-law environments with other continual learning mechanisms.
Submitted 22 January, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
From Augmentation to Decomposition: A New Look at CUPED in 2023
Authors:
Alex Deng,
Luke Hagar,
Nathaniel Stevens,
Tatiana Xifara,
Lo-Hua Yuan,
Amit Gandhi
Abstract:
Ten years ago, CUPED (Controlled Experiments Utilizing Pre-Experiment Data) mainstreamed the idea of variance reduction leveraging pre-experiment covariates. Since its introduction, it has been implemented, extended, and modernized by major online experimentation platforms. Many researchers and practitioners often interpret CUPED as a regression adjustment. In this article, we clarify its similarities and differences to regression adjustment and present CUPED as a more general augmentation framework which is closer to the spirit of the 2013 paper. We show that the augmentation view naturally leads to cleaner developments of variance reduction beyond simple average metrics, including ratio metrics and percentile metrics. Moreover, the augmentation view can go beyond using pre-experiment data and leverage in-experiment data, leading to significantly larger variance reduction. We further introduce metric decomposition using approximate null augmentation (ANA) as a mental model for in-experiment variance reduction. We study it under both a Bayesian framework and a frequentist optimal proxy metric framework. Metric decomposition arises naturally in conversion funnels, so this work has broad applicability.
Submitted 5 December, 2023;
originally announced December 2023.
-
Verifiable Sustainability in Data Centers
Authors:
Syed Rafiul Hussain,
Patrick McDaniel,
Anshul Gandhi,
Kanad Ghose,
Kartik Gopalan,
Dongyoon Lee,
Yu David Liu,
Zhenhua Liu,
Shuai Mu,
Erez Zadok
Abstract:
Data centers have significant energy needs, both embodied and operational, affecting sustainability adversely. The current techniques and tools for collecting, aggregating, and reporting verifiable sustainability data are vulnerable to cyberattacks and misuse, requiring new security and privacy-preserving solutions. This paper outlines security challenges and research directions for addressing these pressing requirements.
Submitted 12 January, 2024; v1 submitted 22 July, 2023;
originally announced July 2023.
-
Natural Language Commanding via Program Synthesis
Authors:
Apurva Gandhi,
Thong Q. Nguyen,
Huitian Jiao,
Robert Steen,
Ameya Bhatawdekar
Abstract:
We present Semantic Interpreter, a natural language-friendly AI system for productivity software such as Microsoft Office that leverages large language models (LLMs) to execute user intent across application features. While LLMs are excellent at understanding user intent expressed as natural language, they are not sufficient for fulfilling application-specific user intent that requires more than text-to-text transformations. We therefore introduce the Office Domain Specific Language (ODSL), a concise, high-level language specialized for performing actions in and interacting with entities in Office applications. Semantic Interpreter leverages an Analysis-Retrieval prompt construction method with LLMs for program synthesis, translating natural language user utterances to ODSL programs that can be transpiled to application APIs and then executed. We focus our discussion primarily on a research exploration for Microsoft PowerPoint.
Submitted 6 June, 2023;
originally announced June 2023.
-
A Comparative Evaluation of Visual Summarization Techniques for Event Sequences
Authors:
Kazi Tasnim Zinat,
Jinhua Yang,
Arjun Gandhi,
Nistha Mitra,
Zhicheng Liu
Abstract:
Real-world event sequences are often complex and heterogeneous, making it difficult to create meaningful visualizations using simple data aggregation and visual encoding techniques. Consequently, visualization researchers have developed numerous visual summarization techniques to generate concise overviews of sequential data. These techniques vary widely in terms of summary structures and contents, and currently there is a knowledge gap in understanding the effectiveness of these techniques. In this work, we present the design and results of an insight-based crowdsourcing experiment evaluating three existing visual summarization techniques: CoreFlow, SentenTree, and Sequence Synopsis. We compare the visual summaries generated by these techniques across three tasks, on six datasets, at six levels of granularity. We analyze the effects of these variables on summary quality as rated by participants and completion time of the experiment tasks. Our analysis shows that Sequence Synopsis produces the highest-quality visual summaries for all three tasks, but understanding Sequence Synopsis results also takes the longest time. We also find that the participants evaluate visual summary quality based on two aspects: content and interpretability. We discuss the implications of our findings on developing and evaluating new visual summarization techniques.
Submitted 4 June, 2023;
originally announced June 2023.
-
Observation of terahertz second harmonic generation from Dirac surface states in the topological insulator Bi$_2$Se$_3$
Authors:
Jonathan Stensberg,
Xingyue Han,
Zhuoliang Ni,
Xiong Yao,
Xiaoyu Yuan,
Debarghya Mallick,
Akshat Gandhi,
Seongshik Oh,
Liang Wu
Abstract:
We report the observation of second harmonic generation with high conversion efficiency $\sim 0.005\%$ in the terahertz regime from thin films of the topological insulator Bi$_2$Se$_3$ that exhibit the linear photogalvanic effect, measured via time-domain terahertz spectroscopy and terahertz emission, respectively. As neither phenomenon is observable from topologically trivial In-doped Bi$_2$Se$_3$, and since no enhancement is observed when subject to band bending, the efficient thickness-independent nonlinear responses are attributable to the Dirac fermions of topological surface states of Bi$_2$Se$_3$. This observation of intrinsic terahertz second harmonic generation in an equilibrium system unlocks the full suite of both even and odd harmonic orders in the terahertz regime and opens new pathways to probing quantum geometry via intraband nonlinear processes. We hope our work will motivate the theoretical development of a full treatment of second harmonic generation for probing the quantum geometry in various inversion-breaking topological and twisted materials.
Submitted 7 June, 2024; v1 submitted 12 January, 2023;
originally announced January 2023.
-
SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content
Authors:
Apurva Gandhi,
Ryan Serrao,
Biyi Fang,
Gilbert Antonius,
Jenna Hong,
Tra My Nguyen,
Sheng Yi,
Ehi Nosakhare,
Irene Shaffer,
Soundararajan Srinivasan,
Vivek Gupta
Abstract:
We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.
Submitted 17 November, 2022; v1 submitted 8 November, 2022;
originally announced November 2022.
-
The Tensor Data Platform: Towards an AI-centric Database System
Authors:
Apurva Gandhi,
Yuki Asada,
Victor Fu,
Advitya Gemawat,
Lihao Zhang,
Rathijit Sen,
Carlo Curino,
Jesús Camacho-Rodríguez,
Matteo Interlandi
Abstract:
Database engines have historically absorbed many of the innovations in data processing, adding features to process graph data, XML, object oriented, and text among many others. In this paper, we make the case that it is time to do the same for AI -- but with a twist! While existing approaches have tried to achieve this by integrating databases with external ML tools, in this paper we claim that achieving a truly AI-centric database requires moving the DBMS engine, at its core, from a relational to a tensor abstraction. This allows us to: (1) support multi-modal data processing such as images, videos, audio, text as well as relational; (2) leverage the wellspring of innovation in HW and runtimes for tensor computation; and (3) exploit automatic differentiation to enable a novel class of "trainable" queries that can learn to perform a task.
To support the above scenarios, we introduce TDP: a system that builds upon our prior work mapping relational queries to tensors. Thanks to a tighter integration with the tensor runtime, TDP is able to provide a broader coverage of new emerging scenarios requiring access to multi-modal data and automatic differentiation.
Submitted 4 November, 2022;
originally announced November 2022.
-
Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem
Authors:
Yuki Asada,
Victor Fu,
Apurva Gandhi,
Advitya Gemawat,
Lihao Zhang,
Dong He,
Vivek Gupta,
Ehi Nosakhare,
Dalitso Banda,
Rathijit Sen,
Matteo Interlandi
Abstract:
We demonstrate Tensor Query Processor (TQP): a query processor that automatically compiles relational operators into tensor programs. By leveraging tensor runtimes such as PyTorch, TQP is able to: (1) integrate with ML tools (e.g., Pandas for data ingestion, Tensorboard for visualization); (2) target different hardware (e.g., CPU, GPU) and software (e.g., browser) backends; and (3) end-to-end accelerate queries containing both relational and ML operators. TQP is generic enough to support the TPC-H benchmark, and it provides performance that is comparable to, and often better than, that of specialized CPU and GPU query processors.
Submitted 9 September, 2022;
originally announced September 2022.
-
Statistical analysis of proton induced reactions to generate recommended data for the production of medical radio-isotopes
Authors:
Sourav Mondal,
A. Gandhi,
Rebecca Pachuau
Abstract:
Radio-isotopes produced via proton-induced reactions hold special significance for nuclear medicine, the astrophysical p-process, and theragnostic and diagnostic processes. $^{76}$Br, $^{80m}$Br and $^{61}$Cu are positron emitters and are useful in functional studies via Positron Emission Tomography (PET), whereas $^{77}$Br has potential for application in Single Photon Emission Computed Tomography (SPECT), which involves the electron capture process. PET and SPECT are widely applied in medical physics, diagnostics, therapy and nuclear medicine. $^{99m}$Tc and $^{64}$Cu are two popular radionuclides that play an important role in nuclear medicine and are currently used in bio-medical physics, bone scans, modern imaging, blood pool leveling, oncology and the diagnosis of copper-related diseases. This paper focuses on the generation of recommended nuclear reaction cross sections for the production of some useful medical radio-isotopes using experimental datasets obtained from the EXFOR database and simulated datasets from the nuclear reaction model codes TALYS-1.95 and EMPIRE-3.1.1. A 95\% confidence interval has been implemented to ensure confidence and precision.
Submitted 19 July, 2022;
originally announced July 2022.
-
Measurement of alpha-induced reaction cross-sections for $^{nat}$Zn with detailed covariance analysis
Authors:
Mahesh Choudhary,
Aman Sharma,
Namrata Singh,
A. Gandhi,
S. Dasgupta,
J. Datta,
K. Katovsky,
A. Kumar
Abstract:
The production cross-sections of $^{68}$Ge, $^{69}$Ge, $^{65}$Zn and $^{67}$Ga radioisotopes from alpha-induced nuclear reactions with $^{nat}$Zn have been measured using the stacked foil activation technique followed by off-line $γ$-ray spectroscopy in the incident alpha energy range 14-37 MeV. The obtained nuclear reaction cross-sections are compared with previous experimental data available in the EXFOR data library, evaluated nuclear data from TENDL-2019, and theoretical results calculated using the TALYS nuclear reaction code. We have also performed a detailed uncertainty analysis for these nuclear reactions, and their respective correlation metrics are presented. Since $α$-induced reactions are important in nuclear medicine and in developing nuclear reaction codes, the necessary corrections related to the coincidence summing factor and the geometric factor have been considered during the data analysis in the present study.
Submitted 25 February, 2023; v1 submitted 7 June, 2022;
originally announced June 2022.
-
A First-principles study on ABBr3 (A = Cs, Rb, K, Na; B = Ge, Sn) halide perovskites for photovoltaic applications
Authors:
Dibyajyoti Saikia,
Mahfooz Alam,
Jayanta Bera,
Atanu Betal,
Appala Naidu Gandhi,
Satyajit Sahu
Abstract:
In recent years, halide perovskite-based solar cells have received intensive attention and have demonstrated power conversion efficiency as high as 25.8%. Owing to the toxicity of Pb and the instability of organic elements, all-inorganic lead-free perovskites (ILPs) have been extensively studied to achieve comparable or greater photovoltaic performance. In order to develop ILPs as an alternative for solar cell applications, we performed first-principles calculations of ABBr3 perovskites (A = Cs, Rb, K, and Na; B = Sn and Ge). Structural, electronic, and optical properties were systematically studied to probe their potential for photovoltaic applications. All these ILPs exhibited a direct bandgap in the range of 1.10 to 1.97 eV, highly beneficial for absorbing solar energy. Furthermore, these ILPs demonstrated significant optical absorption (over $10^5$ cm$^{-1}$) in the whole UV-Vis spectrum. These results will be helpful for designing highly efficient lead-free perovskite solar cells.
Submitted 3 May, 2022;
originally announced May 2022.
-
SHOP: A Deep Learning Based Pipeline for near Real-Time Detection of Small Handheld Objects Present in Blurry Video
Authors:
Abhinav Ganguly,
Amar C Gandhi,
Sylvia E,
Jeffrey D Chang,
Ian M Hudson
Abstract:
While prior works have investigated and developed computational models capable of object detection, models still struggle to reliably interpret images with motion blur and small objects. Moreover, none of these models are specifically designed for handheld object detection. In this work, we present SHOP (Small Handheld Object Pipeline), a pipeline that reliably and efficiently interprets blurry images containing handheld objects. The specific models used in each stage of the pipeline are flexible and can be changed based on performance requirements. First, images are deblurred and then run through a pose detection system where areas-of-interest are proposed around the hands of any people present. Next, object detection is performed on the images by a single-stage object detector. Finally, the proposed areas-of-interest are used to filter out low confidence detections. Testing on a handheld subset of Microsoft Common Objects in Context (MS COCO) demonstrates that this 3 stage process results in a 70 percent decrease in false positives while only reducing true positives by 17 percent in its strongest configuration. We also present a subset of MS COCO consisting solely of handheld objects that can be used to continue the development of handheld object detection methods. https://github.com/spider-sense/SHOP
Submitted 29 March, 2022;
originally announced March 2022.
-
Measurement of alpha induced reaction cross-sections on $^{nat}$Mo with detailed covariance analysis
Authors:
Mahesh Choudhary,
A. Gandhi,
Aman Sharma,
Namrata Singh,
S. Dasgupta,
J. Datta,
A. Kumar
Abstract:
In the present study we have measured the excitation functions for the nuclear reactions $^{100}$Mo($α$,n)$^{103}$Ru, $^{nat}$Mo($α$,x)$^{97}$Ru, $^{nat}$Mo($α$,x)$^{95}$Ru, $^{nat}$Mo($α$,x)$^{96g}$Tc, $^{nat}$Mo($α$,x)$^{95g}$Tc and $^{nat}$Mo($α$,x)$^{94g}$Tc in the energy range 11-32 MeV. We have used the stacked foil activation technique followed by off-line gamma-ray spectroscopy to measure the excitation functions. In this study we have also documented a detailed uncertainty analysis for these nuclear reactions, and their corresponding covariance matrices are presented. The excitation functions are compared with the available experimental data from the EXFOR data library and the theoretical predictions from the TALYS nuclear reaction code. The present measurements are found to be consistent with the available experimental data.
Submitted 5 January, 2022;
originally announced January 2022.
-
Neural-guided, Bidirectional Program Search for Abstraction and Reasoning
Authors:
Simon Alford,
Anshula Gandhi,
Akshay Rangamani,
Andrzej Banburski,
Tony Wang,
Sylee Dandekar,
John Chin,
Tomaso Poggio,
Peter Chin
Abstract:
One of the challenges facing artificial intelligence research today is designing systems capable of utilizing systematic reasoning to generalize to new tasks. The Abstraction and Reasoning Corpus (ARC) measures such a capability through a set of visual reasoning tasks. In this paper we report incremental progress on ARC and lay the foundations for two approaches to abstraction and reasoning not based in brute-force search. We first apply an existing program synthesis system called DreamCoder to create symbolic abstractions out of tasks solved so far, and show how it enables solving of progressively more challenging ARC tasks. Second, we design a reasoning algorithm motivated by the way humans approach ARC. Our algorithm constructs a search graph and reasons over this graph structure to discover task solutions. More specifically, we extend existing execution-guided program synthesis approaches with deductive reasoning based on function inverse semantics to enable a neural-guided bidirectional search algorithm. We demonstrate the effectiveness of the algorithm on three domains: ARC, 24-Game tasks, and a 'double-and-add' arithmetic puzzle.
Submitted 26 October, 2021; v1 submitted 21 October, 2021;
originally announced October 2021.
-
City-wide modeling of Vehicle-to-Grid Economics to Understand Effects of Battery Performance
Authors:
Heta A. Gandhi,
Andrew D. White
Abstract:
Vehicle-to-grid (V2G) is a promising approach to solve the problem of grid-level intermittent supply and demand mismatch caused by renewable energy resources, because it uses the existing resource of electric vehicle (EV) batteries as the energy storage medium. EV battery design, together with an impetus on profitability for participating EV owners, is pivotal for V2G success. To better understand which battery device parameters are most important for V2G adoption, we model the economics of the V2G process under realistic conditions. Most previous studies that perform V2G economic analysis assume ideal driving conditions, use linear battery degradation models, or only consider V2G for ancillary services. Our model accounts for realistic battery degradation, empirical charging efficiencies, randomness in commute behavior, and historical hourly electricity prices in six cities in the United States. We model user behavior with Bayesian optimization to provide a best-case scenario for V2G. Across all cities, we find that charging rate and efficiency are the most important factors that determine EV users' profits. Surprisingly, EV battery cost and thus degradation due to cycling has little effect. These findings should help focus research on figures of merit that better reflect real usage of batteries in a V2G economy.
Submitted 12 August, 2021;
originally announced August 2021.
-
Iterative Symbolic Regression for Learning Transport Equations
Authors:
Mehrad Ansari,
Heta A. Gandhi,
David G. Foster,
Andrew D. White
Abstract:
Computational fluid dynamics (CFD) analysis is widely used in engineering. Although CFD calculations are accurate, the computational cost associated with complex systems makes it difficult to obtain empirical equations between system variables. Here we combine active learning (AL) and symbolic regression (SR) to get a symbolic equation for system variables from CFD simulations. Gaussian process regression-based AL allows for automated selection of variables by selecting the most instructive points from the available range of possible parameters. The results from these experiments are then passed to SR to find empirical symbolic equations for CFD models. This approach is scalable and applicable for any desired number of CFD design parameters. To demonstrate its effectiveness, we use this method with two model systems. We recover an empirical equation for the pressure drop in a bent pipe and a new equation for predicting backflow in a heart valve under aortic insufficiency.
Submitted 16 March, 2022; v1 submitted 6 August, 2021;
originally announced August 2021.
-
Mission-level Robustness with Rapidly-deployed, Autonomous Aerial Vehicles by Carnegie Mellon Team Tartan at MBZIRC 2020
Authors:
Anish Bhattacharya,
Akshit Gandhi,
Lukas Merkle,
Rohan Tiwari,
Karun Warrior,
Stanley Winata,
Andrew Saba,
Kevin Zhang,
Oliver Kroemer,
Sebastian Scherer
Abstract:
For robotic systems to succeed in high risk, real-world situations, they have to be quickly deployable and robust to environmental changes, under-performing hardware, and mission subtask failures. These robots are often designed to consider a single sequence of mission events, with complex algorithms lowering individual subtask failure rates under some critical constraints. Our approach utilizes common techniques in vision and control, and encodes robustness into mission structure through outcome monitoring and recovery strategies. In addition, our system infrastructure enables rapid deployment and requires no central communication. This report also includes lessons in rapid field robotic development and testing. We developed and evaluated our systems through real-robot experiments at an outdoor test site in Pittsburgh, Pennsylvania, USA, as well as in the 2020 Mohamed Bin Zayed International Robotics Challenge. All competition trials were completed in fully autonomous mode without RTK-GPS. Our system placed fourth in Challenge 2 and seventh in the Grand Challenge, with notable achievements such as popping five balloons (Challenge 1), successfully picking and placing a block (Challenge 2), and dispensing the most water onto an outdoor, real fire with an autonomous UAV (Challenge 3).
Submitted 13 September, 2022; v1 submitted 3 July, 2021;
originally announced July 2021.
-
Graph Neural Network Based Coarse-Grained Mapping Prediction
Authors:
Zhiheng Li,
Geemi P. Wellawatte,
Maghesree Chakraborty,
Heta A. Gandhi,
Chenliang Xu,
Andrew D. White
Abstract:
The selection of coarse-grained (CG) mapping operators is a critical step for CG molecular dynamics (MD) simulation. What is optimal for this choice is still an open question, and there is a need for theory. The current state-of-the-art method is manual selection of mapping operators by experts. In this work, we demonstrate an automated approach by viewing this problem as supervised learning where we seek to reproduce the mapping operators produced by experts. We present a graph neural network based CG mapping predictor called DEEP SUPERVISED GRAPH PARTITIONING MODEL (DSGPM) that treats mapping operators as a graph segmentation problem. DSGPM is trained on a novel dataset, Human-annotated Mappings (HAM), consisting of 1,206 molecules with expert annotated mapping operators. HAM can be used to facilitate further research in this area. Our model uses a novel metric learning objective to produce high-quality atomic features that are used in spectral clustering. The results show that the DSGPM outperforms state-of-the-art methods in the field of graph segmentation. Finally, we find that predicted CG mapping operators indeed result in good CG MD models when used in simulation.
Submitted 19 August, 2021; v1 submitted 24 June, 2020;
originally announced July 2020.
-
Integrated Single Photon Emitters
Authors:
Junyi Lee,
Victor Leong,
Dmitry Kalashnikov,
Jibo Dai,
Alagappan Gandhi,
Leonid Krivitsky
Abstract:
The realization of scalable systems for quantum information processing and networking is of utmost importance to the quantum information community. However, building such systems is difficult because of challenges in achieving all the necessary functionalities on a unified platform while maintaining stringent performance requirements of the individual elements. A promising approach which addresses this challenge is based on the consolidation of experimental and theoretical capabilities in quantum physics and integrated photonics. Integrated quantum photonics devices allow efficient control and read-out of quantum information while being scalable and cost effective. Here we review recent developments in solid-state single photon emitters coupled with various integrated photonic structures, which form a critical component of future scalable quantum devices. Our work contributes to the further development and realization of quantum networking protocols and quantum logic on a scalable and fabrication-friendly platform.
Submitted 28 July, 2020; v1 submitted 22 May, 2020;
originally announced May 2020.
-
Adversarial Perturbations Fool Deepfake Detectors
Authors:
Apurva Gandhi,
Shomik Jain
Abstract:
This work uses adversarial perturbations to enhance deepfake images and fool common deepfake detectors. We created adversarial perturbations using the Fast Gradient Sign Method and the Carlini and Wagner L2 norm attack in both blackbox and whitebox settings. Detectors achieved over 95% accuracy on unperturbed deepfakes, but less than 27% accuracy on perturbed deepfakes. We also explore two improvements to deepfake detectors: (i) Lipschitz regularization, and (ii) Deep Image Prior (DIP). Lipschitz regularization constrains the gradient of the detector with respect to the input in order to increase robustness to input perturbations. The DIP defense removes perturbations using generative convolutional neural networks in an unsupervised manner. Regularization improved the detection of perturbed deepfakes on average, including a 10% accuracy boost in the blackbox case. The DIP defense achieved 95% accuracy on perturbed deepfakes that fooled the original detector, while retaining 98% accuracy in other cases on a 100 image subsample.
Submitted 15 May, 2020; v1 submitted 23 March, 2020;
originally announced March 2020.
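The Fast Gradient Sign Method named in the abstract above perturbs an input by a small step in the direction of the sign of the loss gradient with respect to that input. The sketch below illustrates this in the whitebox setting on a toy logistic-regression "detector"; the model, weights, and all function names are hypothetical stand-ins, not the detectors or code used in the paper.

```python
import math
from typing import List


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights: List[float], bias: float, x: List[float]) -> float:
    """Toy detector: probability that input x is a deepfake (label 1)."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def sign(value: float) -> int:
    return (value > 0) - (value < 0)


def fgsm(weights: List[float], bias: float, x: List[float],
         label: int, epsilon: float) -> List[float]:
    """Return x + epsilon * sign(dLoss/dx) for binary cross-entropy loss.

    For logistic regression the input gradient has the closed form
    (p - label) * w, so no automatic differentiation is needed here.
    """
    p = predict(weights, bias, x)
    grad = [(p - label) * w for w in weights]
    return [xi + epsilon * sign(g) for xi, g in zip(x, grad)]


# Hypothetical two-feature detector and a "deepfake" input (true label 1).
weights, bias = [2.0, -1.0], 0.0
x = [1.0, 0.5]
x_adv = fgsm(weights, bias, x, label=1, epsilon=0.5)

score_before = predict(weights, bias, x)
score_after = predict(weights, bias, x_adv)
```

Because the step increases the loss for the true label, `score_after` is lower than `score_before`: the perturbed input looks less like a deepfake to this toy detector. On real images the same update is applied per pixel with a much smaller `epsilon`.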
-
Road Accidents in the UK (Analysis and Visualization)
Authors:
Anjul Tyagi,
Ayush Kumar,
Anshul Gandhi,
Klaus Mueller
Abstract:
Analysis of road accidents is crucial to understand the factors involved and their impact. Accidents usually involve multiple variables such as time, weather conditions, and age of the driver, and hence the data are challenging to analyze. To solve this problem, we first use Multiple Correspondence Analysis (MCA) to filter out the largest number of variables that can be visualized effectively in two dimensions, and then study the correlations among these variables in a two-dimensional scatter plot. For the other variables, for which MCA cannot capture ample variance in the projected dimensions, we use hypothesis testing and time series analysis.
Submitted 29 July, 2019;
originally announced August 2019.
-
Measurement of the response of a liquid scintillation detector to monoenergetic electrons and neutrons
Authors:
P. C. Rout,
A. Gandhi,
T. Basak,
R. G. Thomas,
C. Ghosh,
A. Mitra,
G. Mishra,
S. P. Behera,
R. Kujur,
E. T. Mirgule,
B. K. Nayak,
A. Saxena,
Suresh Kumar,
V. M. Datar
Abstract:
The response of the liquid scintillator (EJ-301, equivalent to NE-213) to monoenergetic electrons has been measured using Compton-scattered $γ$-ray tagging for various radioactive $γ$-ray sources. The measured electron response is found to be linear up to $\sim$4~MeVee and the resolution of the liquid scintillator at 1~MeVee is observed to be $\sim$~11\%. The pulse shape discrimination and pulse height response of the liquid scintillator for neutrons have been measured using the $^7$Li(p,n$_1$)$^7$Be*(0.429 MeV) reaction. A nonlinear response to monoenergetic neutrons is observed for the liquid scintillator at E$_n$=5.3, 9.0 and 12.7 MeV. The measured response of the liquid scintillator for electrons and neutrons has been compared with Geant4 simulation.
Submitted 16 May, 2017;
originally announced May 2017.
-
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion
Authors:
Ankit Gandhi,
Arjun Sharma,
Arijit Biswas,
Om Deshmukh
Abstract:
Data generated from real-world events are usually temporal and contain multimodal information such as audio, visual, depth, and sensor data, which must be intelligently combined for classification tasks. In this paper, we propose a novel generalized deep neural network architecture where temporal streams from multiple modalities are combined. There are a total of M+1 components (where M is the number of modalities) in the proposed network. The first component is a novel temporally hybrid Recurrent Neural Network (RNN) that exploits the complementary nature of the multimodal temporal information by allowing the network to learn both modality-specific temporal dynamics as well as the dynamics in a multimodal feature space. M additional components are added to the network, which extract discriminative but non-temporal cues from each modality. Finally, the predictions from all of these components are linearly combined using a set of automatically learned weights. We perform exhaustive experiments on three different datasets spanning four modalities. The proposed network is relatively 3.5%, 5.7% and 2% better than the best-performing temporal multimodal baseline for the UCF-101, CCV and Multimodal Gesture datasets, respectively.
Submitted 17 September, 2016;
originally announced September 2016.
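The last fusion step described in the GeThR-Net abstract above, a linear combination of the predictions from the M+1 components, can be illustrated with a short sketch. The function name and the example numbers are hypothetical, and the weights here are fixed by hand purely for illustration, whereas the paper states they are learned automatically.

```python
from typing import List


def combine_predictions(component_scores: List[List[float]],
                        weights: List[float]) -> List[float]:
    """Linearly combine per-class scores from several components.

    component_scores: one list of class scores per component (M+1 lists).
    weights: one weight per component (hand-set here; learned in the paper).
    """
    if len(component_scores) != len(weights):
        raise ValueError("need exactly one weight per component")
    num_classes = len(component_scores[0])
    fused = [0.0] * num_classes
    for scores, weight in zip(component_scores, weights):
        for class_index, score in enumerate(scores):
            fused[class_index] += weight * score
    return fused


# Hypothetical case with M = 2 modalities: one temporal hybrid component
# plus two non-temporal, modality-specific components, over two classes.
scores = [
    [0.6, 0.4],  # temporally hybrid RNN component
    [0.2, 0.8],  # non-temporal component for modality 1
    [0.5, 0.5],  # non-temporal component for modality 2
]
weights = [0.5, 0.25, 0.25]
fused = combine_predictions(scores, weights)
predicted_class = max(range(len(fused)), key=fused.__getitem__)
```

With these illustrative numbers the fused scores favor the second class even though the temporal component alone prefers the first, which is the kind of correction a late-fusion stage is meant to allow.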
-
Weakly Supervised Learning of Heterogeneous Concepts in Videos
Authors:
Sohil Shah,
Kuldeep Kulkarni,
Arijit Biswas,
Ankit Gandhi,
Om Deshmukh,
Larry Davis
Abstract:
Typical textual descriptions that accompany online videos are 'weak': i.e., they mention the main concepts in the video but not their corresponding spatio-temporal locations. The concepts in the description are typically heterogeneous (e.g., objects, persons, actions). Certain location constraints on these concepts can also be inferred from the description. The goal of this paper is to present a generalization of the Indian Buffet Process (IBP) that can (a) systematically incorporate heterogeneous concepts in an integrated framework, and (b) enforce location constraints, for efficient classification and localization of the concepts in the videos. Finally, we develop posterior inference for the proposed formulation using mean-field variational approximation. Comparative evaluations on the Casablanca and the A2D datasets show that the proposed approach significantly outperforms other state-of-the-art techniques: 24% relative improvement for pairwise concept classification in the Casablanca dataset and 9% relative improvement for localization in the A2D dataset as compared to the most competitive baseline.
Submitted 12 July, 2016;
originally announced July 2016.
-
Directly created electrostatic micro-domains on hydroxyapatite: probing with a Kelvin Force probe and a protein
Authors:
Tomas Plecenik,
Sylvain Robin,
Maros Gregor,
Martin Truchly,
Sidney Lang,
Abbasi Gandhi,
Miroslav Zahoran,
Fathima Laffir,
Tewfik Soulimane,
Melinda Vargova,
Gustav Plesch,
Peter Kus,
Andrej Plecenik,
S. A. M. Tofail
Abstract:
Micro-domains of modified surface potential (SP) were created on hydroxyapatite (HAp) films by direct patterning with a mid-energy focused electron beam, typically available as a microprobe of scanning electron microscopes. The SP distribution of these patterns has been studied on the sub-micrometer scale by Kelvin Probe Force Microscopy as well as by lysozyme adsorption. Since lysozyme is positively charged at physiological pH, it allows us to track positively and negatively charged areas of the SP patterns. The distribution of the adsorbed proteins over the domains was in good agreement with the observed SP patterns.
Submitted 2 January, 2013;
originally announced January 2013.
-
Enhancing Curriculum Acceptance among Students with E-learning 2.0
Authors:
Kamaljit I. Lakhtaria,
Paresh Patel,
Ankita Gandhi
Abstract:
E-learning enhanced by communication and interaction is becoming increasingly accepted, and this puts Web 2.0 at the center of the new educational technologies. E-Learning 2.0 emerges as an innovative method of online learning through its incorporation of Web 2.0 tools. For any academic study, the curriculum provides an overview of the entire learning area and of the content of the subject. Many institutions make student interaction a priority of their online curriculum design. It has been shown that interaction has a great effect on students' involvement in learning and their acceptance of the curriculum. Students accept a curriculum that is designed by the teacher, whereas an E-learning 2.0 enabled curriculum management system allows students to be involved in learning activities. It works as a stimulus and increases their dedication to the curriculum. When an institute adopts E-Learning 2.0 as its Learning Management System (LMS), it also provides social networking services and direct, transparent interaction between students and teachers. This view of E-Learning 2.0 shifts the focus from the LMS to the students, equipping them with the means to become ever more autonomous and allowing them to make use of these means in solving problems on their own initiative. Curriculum usage will empower student involvement and enhance the spread of E-learning 2.0. This paper analyzes the implementation of E-learning 2.0 for curriculum management and discusses opportunities and challenges for curriculum over Web 2.0.
Submitted 15 April, 2010;
originally announced April 2010.