Showing 1–21 of 21 results for author: Munikoti, S

  1. arXiv:2407.07321

    cs.CL

    RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension

    Authors: Hung Phan, Anurag Acharya, Sarthak Chaturvedi, Shivam Sharma, Mike Parker, Dan Nally, Ali Jannesari, Karl Pazdernik, Mahantesh Halappanavar, Sai Munikoti, Sameera Horawalavithana

    Abstract: Large Language Models (LLMs) have been applied to many research problems across various domains. One of the applications of LLMs is providing question-answering systems that cater to users from different fields. The effectiveness of LLM-based question-answering systems has already been established at an acceptable level for users posing questions in popular and public domains such as trivia and li…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages

  2. arXiv:2406.05496

    cs.CL

    Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities

    Authors: Sai Munikoti, Ian Stewart, Sameera Horawalavithana, Henry Kvinge, Tegan Emerson, Sandra E Thompson, Karl Pazdernik

    Abstract: Multimodal models are expected to be a critical component to future advances in artificial intelligence. This field is starting to grow rapidly with a surge of new design elements motivated by the success of foundation models in natural language processing (NLP) and vision. It is widely hoped that further extending the foundation models to multiple modalities (e.g., text, image, video, sensor, tim…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures, 5 tables

  3. arXiv:2311.12289

    cs.CL cs.AI

    ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science

    Authors: Sai Munikoti, Anurag Acharya, Sridevi Wagle, Sameera Horawalavithana

    Abstract: Large language models record impressive performance on many natural language processing tasks. However, their knowledge capacity is limited to the pretraining corpus. Retrieval augmentation offers an effective solution by retrieving context from external knowledge sources to complement the language model. However, existing retrieval augmentation techniques ignore the structural relationships betwe…

    Submitted 20 November, 2023; originally announced November 2023.

    ACM Class: I.2.7

  4. arXiv:2311.09358

    cs.CL cs.AI

    Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science

    Authors: Sridevi Wagle, Sai Munikoti, Anurag Acharya, Sara Smith, Sameera Horawalavithana

    Abstract: Large language models (LLMs) have shown remarkable achievements in natural language processing tasks, producing high-quality outputs. However, LLMs still exhibit limitations, including the generation of factually incorrect information. In safety-critical applications, it is important to assess the confidence of LLM-generated content to make informed decisions. Retrieval Augmented Language Models (…

    Submitted 15 November, 2023; originally announced November 2023.

    ACM Class: I.2.7

  5. arXiv:2311.04348

    cs.CL cs.AI

    Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning

    Authors: Sai Munikoti, Anurag Acharya, Sridevi Wagle, Sameera Horawalavithana

    Abstract: Despite the dramatic progress in Large Language Model (LLM) development, LLMs often provide seemingly plausible but not factual information, often referred to as hallucinations. Retrieval-augmented LLMs provide a non-parametric approach to solve these issues by retrieving relevant information from external data sources and augment the training process. These models help to trace evidence from an e…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 5 pages

    ACM Class: I.2.7

  6. arXiv:2310.10920

    cs.CL cs.AI

    NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain

    Authors: Anurag Acharya, Sai Munikoti, Aaron Hellinger, Sara Smith, Sridevi Wagle, Sameera Horawalavithana

    Abstract: As LLMs have become increasingly popular, they have been used in almost every field. But as the application for LLMs expands from generic fields to narrow, focused science domains, there exists an ever-increasing gap in ways to evaluate their efficacy in those fields. For the benchmarks that do exist, a lot of them focus on questions that don't require proper understanding of the subject in questi…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages

    ACM Class: I.2.7

  7. arXiv:2307.01139

    cs.CV cs.AI cs.CL cs.LG

    SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions

    Authors: Sameera Horawalavithana, Sai Munikoti, Ian Stewart, Henry Kvinge

    Abstract: Instruction finetuning is a popular paradigm to align large language models (LLM) with human intent. Despite its popularity, this idea is less explored in improving the LLMs to align existing foundation models with scientific disciplines, concepts and goals. In this work, we present SciTune as a tuning framework to improve the ability of LLMs to follow scientific multimodal instructions. To test o…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Preprint. Work in progress

  8. arXiv:2306.01189

    cs.LG

    A General Framework for Uncertainty Quantification via Neural SDE-RNN

    Authors: Shweta Dahale, Sai Munikoti, Balasubramaniam Natarajan

    Abstract: Uncertainty quantification is a critical yet unsolved challenge for deep learning, especially for the time series imputation with irregularly sampled measurements. To tackle this problem, we propose a novel framework based on the principles of recurrent neural networks and neural stochastic differential equations for reconciling irregularly sampled measurements. We impute measurements at any arbit…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 7 pages, 3 figures

  9. arXiv:2305.19871

    cs.LG

    There is more to graphs than meets the eye: Learning universal features with self-supervision

    Authors: Laya Das, Sai Munikoti, Mahantesh Halappanavar

    Abstract: We study the problem of learning universal features across multiple graphs through self-supervision. Graph self supervised learning has been shown to facilitate representation learning, and produce competitive models compared to supervised baselines. However, existing methods of self-supervision learn features from one graph, and thus, produce models that are specialized to a particular graph. We…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.11939, arXiv:2301.13287, arXiv:2305.12686, arXiv:2305.02299

  10. arXiv:2211.03882

    eess.SY

    Latent Neural ODE for Integrating Multi-timescale measurements in Smart Distribution Grids

    Authors: Shweta Dahale, Sai Munikoti, Balasubramaniam Natarajan, Rui Yang

    Abstract: Under a smart grid paradigm, there has been an increase in sensor installations to enhance situational awareness. The measurements from these sensors can be leveraged for real-time monitoring, control, and protection. However, these measurements are typically irregularly sampled. These measurements may also be intermittent due to communication bandwidth limitations. To tackle this problem, this pa…

    Submitted 7 November, 2022; originally announced November 2022.

  11. arXiv:2206.07922

    cs.LG

    Challenges and Opportunities in Deep Reinforcement Learning with Graph Neural Networks: A Comprehensive review of Algorithms and Applications

    Authors: Sai Munikoti, Deepesh Agarwal, Laya Das, Mahantesh Halappanavar, Balasubramaniam Natarajan

    Abstract: Deep reinforcement learning (DRL) has empowered a variety of artificial intelligence fields, including pattern recognition, robotics, recommendation-systems, and gaming. Similarly, graph neural networks (GNN) have also demonstrated their superior performance in supervised learning for graph-structured data. In recent times, the fusion of GNN with DRL for graph-structured environments has attracted…

    Submitted 7 November, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 20 pages, 3 figures, 2 tables

  12. arXiv:2205.14834

    cs.LG cs.SI

    GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization

    Authors: Sai Munikoti, Balasubramaniam Natarajan, Mahantesh Halappanavar

    Abstract: Influence maximization (IM) is a combinatorial problem of identifying a subset of nodes called the seed nodes in a network (graph), which when activated, provide a maximal spread of influence in the network for a given diffusion model and a budget for seed set size. IM has numerous applications such as viral marketing, epidemic control, sensor placement and other network-related tasks. However, th…

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: 11 pages, 6 figures

  13. arXiv:2205.09968

    cs.LG stat.ML

    A General Framework for quantifying Aleatoric and Epistemic uncertainty in Graph Neural Networks

    Authors: Sai Munikoti, Deepesh Agarwal, Laya Das, Balasubramaniam Natarajan

    Abstract: Graph Neural Networks (GNN) provide a powerful framework that elegantly integrates Graph theory with Machine learning for modeling and analysis of networked data. We consider the problem of quantifying the uncertainty in predictions of GNN stemming from modeling errors and measurement uncertainty. We consider aleatoric uncertainty in the form of probabilistic links and noise in feature vector of n…

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 10 pages, 1 figure, 6 Tables

  14. arXiv:2106.01587

    eess.SY

    An Information Theoretic approach to identify Dominant Voltage Influencers for Unbalanced Distribution Systems

    Authors: Sai Munikoti, Mohammad Abujubbeh, Kumarsinh Jhala, Balasubramaniam Natarajan

    Abstract: Smart distribution grid with multiple renewable energy sources can experience random voltage fluctuations due to variable generation, which may result in voltage violations. Traditional voltage control algorithms are inadequate to handle fast voltage variations. Therefore, new dynamic control methods are being developed that can significantly benefit from the knowledge of dominant voltage influenc…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 8 pages, 6 tables

  15. Bayesian Graph Neural Network for Fast identification of critical nodes in Uncertain Complex Networks

    Authors: Sai Munikoti, Laya Das, Balasubramaniam Natarajan

    Abstract: In the quest to improve efficiency, interdependence and complexity are becoming defining characteristics of modern complex networks representing engineered and natural systems. Graph theory is a widely used framework for modeling such complex networks and to evaluate their robustness to disruptions. Particularly, identification of critical nodes/links in a graph can facilitate the enhancement of g…

    Submitted 17 May, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: 6 pages, 2 figures, 3 Tables

  16. Scalable Graph Neural Network-based framework for identifying critical nodes and links in Complex Networks

    Authors: Sai Munikoti, Laya Das, Balasubramaniam Natarajan

    Abstract: Identifying critical nodes and links in graphs is a crucial task. These nodes/links typically represent critical elements/communication links that play a key role in a system's performance. However, a majority of the methods available in the literature on the identification of critical nodes/links are based on an iterative approach that explores each node/link of a graph at a time, repeating for a…

    Submitted 10 May, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: 29 pages, single column, 3 figures

  17. Spatio-Temporal Probabilistic Voltage Sensitivity Analysis - A Novel Framework for Hosting Capacity Analysis

    Authors: Sai Munikoti, Mohammad Abujubbeh, Kumarsinh Jhala, Balasubramaniam Natarajan

    Abstract: Smart grids are envisioned to accommodate high penetration of distributed photovoltaic (PV) generation, which may cause adverse grid impacts in terms of voltage violations. Therefore, PV Hosting capacity (HC) is being used as a planning tool to determine the maximum PV installation capacity that causes the first voltage violation and above which would require infrastructure upgrades. Traditional m…

    Submitted 8 February, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: 8 pages, 2 figures, discussion added

  18. Probabilistic Voltage Sensitivity Analysis (PVSA) to Quantify Impact of High PV Penetration on Unbalanced Distribution System

    Authors: Sai Munikoti, Balasubramaniam Natarajan, Kumarsinh Jhala, Kexing Lai

    Abstract: From an operational and planning perspective, it is important to quantify the impact of increasing penetration of photovoltaics on the distribution system. Most existing impact assessment studies are scenario-based where derived results are scenario specific and not generalizable. Moreover, stochasticity in the temporal behavior of spatially distributed PVs requires a large number of scenarios tha…

    Submitted 12 September, 2020; originally announced September 2020.

    Comments: 10 pages, 9 figures, Preprint under review in IEEE Transactions on Power Systems

  19. Analytical Voltage Sensitivity Analysis for Unbalanced Power Distribution System

    Authors: Sai Munikoti, Kumarsinh Jhala, Kexing Lai, Balasubramaniam Natarajan

    Abstract: Large scale integration of distributed energy resources and electric vehicles in a transactive energy environment present new challenges in terms of voltage stability and fluctuations in a power distribution system. The impact of different level of DER/EV penetration on the voltages across the network is typically quantified through voltage sensitivity analyses. Existing methods of voltage sensiti…

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: 5 pages, 3 figures, Preprint accepted in IEEE PES GM 2020 Conference

  20. Robustness Assessment of Hetero-Functional Graph Theory Based Model of Interdependent Urban Utility Networks

    Authors: Sai Munikoti, Kexing Lai, Balasubramaniam Natarajan

    Abstract: The increasing urban population imposes a substantial and growing burden on the supporting infrastructure, such as electricity, water, heating, natural gas, road transportation, etc. This paper presents a Hetero-functional graph theory (HFGT) based modeling framework for these integrated infrastructures followed by an analysis of network robustness. The supporting infrastructures along with the in…

    Submitted 26 August, 2020; originally announced August 2020.

  21. arXiv:2008.10814

    eess.SY

    Probabilistic Voltage Sensitivity based Preemptive Voltage Monitoring in Unbalanced Distribution Networks

    Authors: Mohammad Abujubbeh, Sai Munikoti, Balasubramaniam Natarajan

    Abstract: With increasing penetration of renewable energy and active consumers, control and management of power distribution networks has become challenging. Renewable energy sources can cause random voltage fluctuations as their output power depends on weather conditions. Conventional voltage control schemes such as tap changers and capacitor banks lack the foresight required to quickly alleviate voltage v…

    Submitted 25 August, 2020; originally announced August 2020.