-
A Survey of using Large Language Models for Generating Infrastructure as Code
Authors:
Kalahasti Ganesh Srivatsa,
Sabyasachi Mukhopadhyay,
Ganesh Katrapati,
Manish Shrivastava
Abstract:
Infrastructure as Code (IaC) is a revolutionary approach which has gained significant prominence in the Industry. IaC manages and provisions IT infrastructure using machine-readable code by enabling automation, consistency across the environments, reproducibility, version control, error reduction and enhancement in scalability. However, IaC orchestration is often a painstaking effort which require…
▽ More
Infrastructure as Code (IaC) is a revolutionary approach which has gained significant prominence in the Industry. IaC manages and provisions IT infrastructure using machine-readable code by enabling automation, consistency across the environments, reproducibility, version control, error reduction and enhancement in scalability. However, IaC orchestration is often a painstaking effort which requires specialised skills as well as a lot of manual effort. Automation of IaC is a necessity in the present conditions of the Industry and in this survey, we study the feasibility of applying Large Language Models (LLM) to address this problem. LLMs are large neural network-based models which have demonstrated significant language processing abilities and shown to be capable of following a range of instructions within a broad scope. Recently, they have also been adapted for code understanding and generation tasks successfully, which makes them a promising choice for the automatic generation of IaC configurations. In this survey, we delve into the details of IaC, usage of IaC in different platforms, their challenges, LLMs in terms of code-generation aspects and the importance of LLMs in IaC along with our own experiments. Finally, we conclude by presenting the challenges in this area and highlighting the scope for future research.
△ Less
Submitted 29 March, 2024;
originally announced April 2024.
-
On the Robustness of Explanations of Deep Neural Network Models: A Survey
Authors:
Amlan Jyoti,
Karthik Balaji Ganesh,
Manoj Gayala,
Nandita Lakshmi Tunuguntla,
Sandesh Kamath,
Vineeth N Balasubramanian
Abstract:
Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions of these models. Recent years have also seen concerted efforts that have shown how such explanations can…
▽ More
Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains, many methods have been proposed to explain the decisions of these models. Recent years have also seen concerted efforts that have shown how such explanations can be distorted (attacked) by minor input perturbations. While there have been many surveys that review explainability methods themselves, there has been no effort hitherto to assimilate the different methods and metrics proposed to study the robustness of explanations of DNN models. In this work, we present a comprehensive survey of methods that study, understand, attack, and defend explanations of DNN models. We also present a detailed review of different metrics used to evaluate explanation methods, as well as describe attributional attack and defense methods. We conclude with lessons and take-aways for the community towards ensuring robust explanations of DNN model predictions.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
MIST: A Novel Training Strategy for Low-latency Scalable Neural Net Decoders
Authors:
Kumar Yashashwi,
Deepak Anand,
Sibi Raj B Pillai,
Prasanna Chaporkar,
K Ganesh
Abstract:
In this paper, we propose a low latency, robust and scalable neural net based decoder for convolutional and low-density parity-check (LPDC) coding schemes. The proposed decoders are demonstrated to have bit error rate (BER) and block error rate (BLER) performances at par with the state-of-the-art neural net based decoders while achieving more than 8 times higher decoding speed. The enhanced decodi…
▽ More
In this paper, we propose a low latency, robust and scalable neural net based decoder for convolutional and low-density parity-check (LPDC) coding schemes. The proposed decoders are demonstrated to have bit error rate (BER) and block error rate (BLER) performances at par with the state-of-the-art neural net based decoders while achieving more than 8 times higher decoding speed. The enhanced decoding speed is due to the use of convolutional neural network (CNN) as opposed to recurrent neural network (RNN) used in the best known neural net based decoders. This contradicts existing doctrine that only RNN based decoders can provide a performance close to the optimal ones. The key ingredient to our approach is a novel Mixed-SNR Independent Samples based Training (MIST), which allows for training of CNN with only 1\% of possible datawords, even for block length as high as 1000. The proposed decoder is robust as, once trained, the same decoder can be used for a wide range of SNR values. Finally, in the presence of channel outages, the proposed decoders outperform the best known decoders, {\it viz.} unquantized Viterbi decoder for convolutional code, and belief propagation for LDPC. This gives the CNN decoder a significant advantage in 5G millimeter wave systems, where channel outages are prevalent.
△ Less
Submitted 22 May, 2019;
originally announced May 2019.
-
Implementation of a Real Time Passenger Information System
Authors:
K. Ganesh,
M. Thrivikraman,
Joy Kuri,
Haresh Dagale,
G. Sudhakar,
Sugata Sanyal
Abstract:
Intelligent Transportation Systems (ITS) are gaining recognition in develo** countries like India. This paper describes the various components of our prototype implementation of a Real-time Passenger Information System (RTPIS) for a public transport system like a fleet of buses. Vehicle-mounted units, bus station units and a server located at the transport company premises comprise the system. T…
▽ More
Intelligent Transportation Systems (ITS) are gaining recognition in develo** countries like India. This paper describes the various components of our prototype implementation of a Real-time Passenger Information System (RTPIS) for a public transport system like a fleet of buses. Vehicle-mounted units, bus station units and a server located at the transport company premises comprise the system. The vehicle unit reports the current position of the vehicle to a central server periodically via General Packet Radio Service (GPRS). An Estimated Time of Arrival (ETA) algorithm running on the server predicts the arrival times of buses at their stops based on real-time observations of the buses' current Global Positioning System (GPS) coordinates. This information is displayed and announced to passengers at stops using station units, which periodically fetch the required ETA from the server via GPRS. Novel features of our prototype include: (a) a route creator utility which automatically creates new routes from scratch when a bus is driven along the new route, and (b) voice tagging of stops and points of interest along any route. Besides, the prototype provides: (i) web-based applications for passengers, providing useful information like a snapshot of present bus locations on the streets, and (ii) web-based analysis tools for the transport authority, providing information useful for fleet management, like number of trips undertaken by a specific bus. The prototype has been demonstrated in a campus environment, with four-wheelers and two-wheelers emulating buses. The automatic real-time passenger information system has the potential of making the public transport system an attractive alternative for city-dwellers, thereby contributing to fewer private vehicles on the road, leading to lower congestion levels and less pollution.
△ Less
Submitted 3 June, 2012;
originally announced June 2012.