Search | arXiv e-print repository

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras, et al. (1 additional author not shown)

Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (APIs) to complete complex tasks. These tasks together are termed function calling. Endowing LLMs with function calling abilities leads to a myriad of advantages, such as access to current and domain-specific information in databases and knowledge sources, and the ability to outsource tasks that can be reliably performed by tools, e.g., a Python interpreter or calculator. While there has been significant progress in function calling with LLMs, there is still a dearth of open models that perform on par with proprietary LLMs like GPT, Claude, and Gemini. Therefore, in this work, we introduce the GRANITE-20B-FUNCTIONCALLING model under an Apache 2.0 license. The model is trained using a multi-task training approach on seven fundamental tasks encompassed in function calling, those being Nested Function Calling, Function Chaining, Parallel Functions, Function Name Detection, Parameter-Value Pair Detection, Next-Best Function, and Response Generation. We present a comprehensive evaluation on multiple out-of-domain datasets comparing GRANITE-20B-FUNCTIONCALLING to more than 15 other best proprietary and open models. GRANITE-20B-FUNCTIONCALLING provides the best performance among all open models on the Berkeley Function Calling Leaderboard and fourth overall. As a result of the diverse tasks and datasets used for training our model, we show that GRANITE-20B-FUNCTIONCALLING has better generalizability on multiple tasks in seven different evaluation datasets.

Submitted 27 June, 2024; originally announced July 2024.

arXiv:2310.15298

TaskDiff: A Similarity Metric for Task-Oriented Conversations

Authors: Ankita Bhaumik, Praveen Venkateswaran, Yara Rizk, Vatche Isahagian

Abstract: The popularity of conversational digital assistants has resulted in the availability of large amounts of conversational data which can be utilized for improved user experience and personalized response generation. Building these assistants using popular large language models like ChatGPT also requires additional emphasis on prompt engineering and evaluation methods. Textual similarity metrics are a key ingredient for such analysis and evaluations. While many similarity metrics have been proposed in the literature, they have not proven effective for task-oriented conversations as they do not take advantage of unique conversational features. To address this gap, we present TaskDiff, a novel conversational similarity metric that utilizes different dialogue components (utterances, intents, and slots) and their distributions to compute similarity. Extensive experimental evaluation of TaskDiff on a benchmark dataset demonstrates its superior performance and improved robustness over other related approaches.

Submitted 25 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: Accepted to the main conference at EMNLP 2023

arXiv:2309.04863

Design of a Low-Power High-Gain Bio-Medical Operational Amplifier in 65nm Technology using gm/ID Methodology

Authors: Ayan Biswas, Supriya Dhabal, Palaniandavar Venkateswaran

Abstract: Operational Amplifiers (Op-Amps) play a crucial role in the field of biomedical engineering, as they enable signal amplification and processing in various medical devices. With the increasing demand for portable and low-power biomedical devices, designing Op-Amps specifically tailored for such applications is essential. In response to this need, a low-power high-gain Op-Amp designed for biomedical applications using TSMC 65nm technology has been proposed. This Op-Amp incorporates a two-stage Miller-compensated topology, which is well-known for its superior performance in gain, gain-bandwidth product, and power consumption. The proposed Op-Amp contributes to the field of biomedical engineering by offering a tailored solution that enhances signal processing capabilities, enables accurate data acquisition, and improves overall efficiency in healthcare systems. The design methodology and simulation results presented in this paper provide insights into the performance and potential impact of the Op-Amp in advancing biomedical devices and systems.

Submitted 9 September, 2023; originally announced September 2023.

arXiv:2309.04861

Exploring Music Genre Classification: Algorithm Analysis and Deployment Architecture

Authors: Ayan Biswas, Supriya Dhabal, Palaniandavar Venkateswaran

Abstract: Music genre classification has become increasingly critical with the advent of various streaming applications. Nowadays, we find it impossible to imagine using the artist's name and song title to search for music in a sophisticated music app. It is always difficult to classify music correctly because the information linked to music, such as region, artist, album, or non-album, is so variable. This paper presents a study on music genre classification using a combination of Digital Signal Processing (DSP) and Deep Learning (DL) techniques. A novel algorithm is proposed that utilizes both DSP and DL methods to extract relevant features from audio signals and classify them into various genres. The algorithm was tested on the GTZAN dataset and achieved high accuracy. An end-to-end deployment architecture is also proposed for integration into music-related applications. The performance of the algorithm is analyzed and future directions for improvement are discussed. The proposed DSP and DL-based music genre classification algorithm and deployment architecture demonstrate a promising approach for music genre classification.

Submitted 14 September, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

arXiv:2212.02851

DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning

Authors: Praveen Venkateswaran, Evelyn Duesterwald, Vatche Isahagian

Abstract: Dialogue State Tracking (DST), a key component of task-oriented conversation systems, represents user intentions by determining the values of pre-defined slots in an ongoing dialogue. Existing approaches use hand-crafted templates and additional slot information to fine-tune and prompt large pre-trained language models and elicit slot values from the dialogue context. Significant manual effort and domain knowledge are required to design effective prompts, limiting the generalizability of these approaches to new domains and tasks. In this work, we propose DiSTRICT, a generalizable in-context tuning approach for DST that retrieves highly relevant training examples for a given dialogue to fine-tune the model without any hand-crafted templates. Experiments with the MultiWOZ benchmark datasets show that DiSTRICT outperforms existing approaches in various zero-shot and few-shot settings using a much smaller model, thereby providing an important advantage for real-world deployments that often have limited resource availability.

Submitted 21 October, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

arXiv:2211.01914

FedGen: Generalizable Federated Learning for Sequential Data

Authors: Praveen Venkateswaran, Vatche Isahagian, Vinod Muthusamy, Nalini Venkatasubramanian

Abstract: Existing federated learning models that follow the standard risk minimization paradigm of machine learning often fail to generalize in the presence of spurious correlations in the training data. In many real-world distributed settings, spurious correlations exist due to biases and data sampling issues on distributed devices or clients that can erroneously influence models. Current generalization approaches are designed for centralized training and attempt to identify features that have an invariant causal relationship with the target, thereby reducing the effect of spurious features. However, such invariant risk minimization approaches rely on a priori knowledge of training data distributions, which is hard to obtain in many applications. In this work, we present a generalizable federated learning framework called FedGen, which allows clients to identify and distinguish between spurious and invariant features in a collaborative manner without prior knowledge of training distributions. We evaluate our approach on real-world datasets from different domains and show that FedGen results in models that achieve significantly better generalization and can outperform the accuracy of current federated learning approaches by over 24%.

Submitted 30 May, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

arXiv:2210.14739

A Case for Business Process-Specific Foundation Models

Authors: Yara Rizk, Praveen Venkateswaran, Vatche Isahagian, Vinod Muthusamy

Abstract: The inception of large language models has helped advance state-of-the-art performance on numerous natural language tasks. This has also opened the door for the development of foundation models for other domains and data modalities such as images, code, and music. In this paper, we argue that business process data representations have unique characteristics that warrant the development of a new class of foundation models to handle tasks like process mining, optimization, and decision making. These models should also tackle the unique challenges of applying AI to business processes, which include data scarcity, multi-modal representations, domain-specific terminology, and privacy concerns.

Submitted 30 November, 2022; v1 submitted 26 October, 2022; originally announced October 2022.

arXiv:2209.12726

An Improved PMOS-Based Low Dropout Regulator Design for Large Loads

Authors: Arijit Saha, Ayan Biswas, Supriya Dhabal, Palaniandavar Venkateswaran

Abstract: A stable low dropout (LDO) voltage regulator topology is presented in this paper. LDOs are linear voltage regulators that do not produce ripples in the DC voltage. Despite the close proximity of the supply input voltage to the output, this regulator will maintain the desired output voltage. Based on a detailed comparison between NMOS and PMOS-based LDOs, we decided to opt for a PMOS design because it does not require an additional charge pump as compared to NMOS. A demonstration of how Miller capacitance enhances overall design stability is also presented here. Multiple pass elements are arranged in parallel in order to increase the current carrying capacity of the pass network.

Submitted 1 May, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

arXiv:2203.00418

doi 10.23919/EUSIPCO55093.2022.9909940

Recovery of Missing Sensor Data by Reconstructing Time-varying Graph Signals

Authors: Anindya Mondal, Mayukhmali Das, Aditi Chatterjee, Palaniandavar Venkateswaran

Abstract: Wireless sensor networks are among the most promising technologies of the current era because of their small size, lower cost, and ease of deployment. With the increasing number of wireless sensors, the probability of generating missing data also rises. This incomplete data could lead to disastrous consequences if used for decision-making. There is rich literature dealing with this problem. However, most approaches show performance degradation when a sizable amount of data is lost. Inspired by the emerging field of graph signal processing, this paper performs a new study of a Sobolev reconstruction algorithm in wireless sensor networks. Experimental comparisons on several publicly available datasets demonstrate that the algorithm surpasses multiple state-of-the-art techniques by a maximum margin of 54%. We further show that this algorithm consistently retrieves the missing data even during massive data loss situations.

Submitted 23 December, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: Five pages, two figures, 2022 30th European Signal Processing Conference (EUSIPCO). Published version available at: https://ieeexplore.ieee.org/document/9909940

Showing 1–9 of 9 results for author: Venkateswaran, P