Search | arXiv e-print repository

Do Multimodal Foundation Models Understand Enterprise Workflows? A Benchmark for Business Process Management Tasks

Authors: Michael Wornow, Avanika Narayan, Ben Viggiano, Ishan S. Khare, Tathagat Verma, Tibor Thompson, Miguel Angel Fuentes Hernandez, Sudharsan Sundar, Chloe Trujillo, Krrish Chawla, Rongfei Lu, Justin Shen, Divya Nagaraj, Joshua Martinez, Vardhan Agrawal, Althea Hudson, Nigam H. Shah, Christopher Re

Abstract: Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task - full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This f… ▽ More Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task - full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This focus on automation ignores the reality of how most BPM tools are applied today - simply documenting the relevant workflow takes 60% of the time of the typical process optimization project. To address this gap we present WONDERBREAD, the first benchmark for evaluating multimodal FMs on BPM tasks beyond automation. Our contributions are: (1) a dataset containing 2928 documented workflow demonstrations; (2) 6 novel BPM tasks sourced from real-world applications ranging from workflow documentation to knowledge transfer to process improvement; and (3) an automated evaluation harness. Our benchmark shows that while state-of-the-art FMs can automatically generate documentation (e.g. recalling 88% of the steps taken in a video demonstration of a workflow), they struggle to re-apply that knowledge towards finer-grained validation of workflow completion (F1 < 0.3). We hope WONDERBREAD encourages the development of more "human-centered" AI tooling for enterprise applications and furthers the exploration of multimodal FMs for the broader universe of BPM tasks. We publish our dataset and experiments here: https://github.com/HazyResearch/wonderbread △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.04803 [pdf, other]

Embracing Nonlinearity and Geometry: A dimensional analysis guided design of shock absorbing materials

Authors: Abhishek Gupta, Komal Chawla, Ramathasan Thevamaran

Abstract: Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their str… ▽ More Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their stress-strain response. Contrary to this belief, we report that foams with a nonlinear stress-strain response can be useful to achieve simultaneously thin and lightweight protective pads. We introduce a new framework for the thickness or volume-constrained design of compact and lightweight protective foams while ensuring the desired structural integrity and mechanical performance. Our streamlined dimensional analysis approach provides geometric constraints on the dimensionless thickness and cross-sectional area of a protective foam with a given stress-strain response to limit the acceleration and compressive strain within desired critical limits. We also identify optimal mechanical properties that will result in the most compact and lightest protective foam layer for absorbing a given kinetic energy of impact. Guided by this design framework, we achieve optimal protective properties in hierarchically architected vertically aligned carbon nanotube (VACNT) foams, enabling next generation protective applications in extreme environments. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2402.13550 [pdf, other]

Are LLMs Effective Negotiators? Systematic Evaluation of the Multifaceted Capabilities of LLMs in Negotiation Dialogues

Authors: Deuksin Kwon, Emily Weiss, Tara Kulshrestha, Kushal Chawla, Gale M. Lucas, Jonathan Gratch

Abstract: A successful negotiation demands a deep comprehension of the conversation context, Theory-of-Mind (ToM) skills to infer the partner's motives, as well as strategic reasoning and effective communication, making it challenging for automated systems. Given the remarkable performance of LLMs across a variety of NLP tasks, in this work, we aim to understand how LLMs can advance different aspects of neg… ▽ More A successful negotiation demands a deep comprehension of the conversation context, Theory-of-Mind (ToM) skills to infer the partner's motives, as well as strategic reasoning and effective communication, making it challenging for automated systems. Given the remarkable performance of LLMs across a variety of NLP tasks, in this work, we aim to understand how LLMs can advance different aspects of negotiation research, ranging from designing dialogue systems to providing pedagogical feedback and scaling up data collection practices. To this end, we devise a methodology to analyze the multifaceted capabilities of LLMs across diverse dialogue scenarios covering all the time stages of a typical negotiation interaction. Our analysis adds to the increasing evidence for the superiority of GPT-4 across various tasks while also providing insights into specific tasks that remain difficult for LLMs. For instance, the models correlate poorly with human players when making subjective assessments about the negotiation dialogues and often struggle to generate responses that are contextually appropriate as well as strategically advantageous. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.02077 [pdf, other]

Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue

Authors: Kushal Chawla, Hannah Rashkin, Gaurav Singh Tomar, David Reitter

Abstract: Knowledge-grounded dialogue generation is a challenging task because it requires satisfying two fundamental yet often competing constraints: being responsive in a manner that is specific to what the conversation partner has said while also being attributable to an underlying source document. In this work, we bring this trade-off between these two objectives (specificity and attribution) to light a… ▽ More Knowledge-grounded dialogue generation is a challenging task because it requires satisfying two fundamental yet often competing constraints: being responsive in a manner that is specific to what the conversation partner has said while also being attributable to an underlying source document. In this work, we bring this trade-off between these two objectives (specificity and attribution) to light and ask the question: Can explicit content planning before the response generation help the model to address this challenge? To answer this question, we design a framework called PLEDGE, which allows us to experiment with various plan variables explored in prior work, supporting both metric-agnostic and metric-aware approaches. While content planning shows promise, our results on whether it can actually help to navigate this trade-off are mixed -- planning mechanisms that are metric-aware (use automatic metrics during training) are better at automatic evaluations but underperform in human judgment compared to metric-agnostic mechanisms. We discuss how this may be caused by over-fitting to automatic metrics and the need for future work to better calibrate these metrics towards human judgment. We hope the observations from our analysis will inform future work that aims to apply content planning in this context. △ Less

Submitted 3 February, 2024; originally announced February 2024.

Comments: Accepted at EACL 2024 Main Conference (Long)

arXiv:2310.18506 [pdf, other]

Random walks on groups and superlinear divergent geodesics

Authors: Kunal Chawla, Inhyeok Choi, Vivian He, Kasra Rafi

Abstract: In this paper, we study random walks on groups that contain superlinear divergent geodesics, in the line of thoughts of Goldsborough-Sisto. The existence of a superlinear divergent geodesic is a quasi-isometry invariant which allows us to execute Gouëzel's pivoting technique. We develop the theory of superlinear divergence and establish a central limit theorem for random walks on these groups. In this paper, we study random walks on groups that contain superlinear divergent geodesics, in the line of thoughts of Goldsborough-Sisto. The existence of a superlinear divergent geodesic is a quasi-isometry invariant which allows us to execute Gouëzel's pivoting technique. We develop the theory of superlinear divergence and establish a central limit theorem for random walks on these groups. △ Less

Submitted 4 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

Comments: Correction on references and background and minor edits

MSC Class: 60G50; 20F65; 20F69

arXiv:2310.14404 [pdf, other]

Be Selfish, But Wisely: Investigating the Impact of Agent Personality in Mixed-Motive Human-Agent Interactions

Authors: Kushal Chawla, Ian Wu, Yu Rong, Gale M. Lucas, Jonathan Gratch

Abstract: A natural way to design a negotiation dialogue system is via self-play RL: train an agent that learns to maximize its performance by interacting with a simulated user that has been designed to imitate human-human dialogue data. Although this procedure has been adopted in prior work, we find that it results in a fundamentally flawed system that fails to learn the value of compromise in a negotiatio… ▽ More A natural way to design a negotiation dialogue system is via self-play RL: train an agent that learns to maximize its performance by interacting with a simulated user that has been designed to imitate human-human dialogue data. Although this procedure has been adopted in prior work, we find that it results in a fundamentally flawed system that fails to learn the value of compromise in a negotiation, which can often lead to no agreements (i.e., the partner walking away without a deal), ultimately hurting the model's overall performance. We investigate this observation in the context of the DealOrNoDeal task, a multi-issue negotiation over books, hats, and balls. Grounded in negotiation theory from Economics, we modify the training procedure in two novel ways to design agents with diverse personalities and analyze their performance with human partners. We find that although both techniques show promise, a selfish agent, which maximizes its own performance while also avoiding walkaways, performs superior to other variants by implicitly learning to generate value for both itself and the negotiation partner. We discuss the implications of our findings for what it means to be a successful negotiation dialogue system and how these systems should be designed in the future. △ Less

Submitted 22 October, 2023; originally announced October 2023.

Comments: Accepted at EMNLP 2023 (Main)

arXiv:2308.01877 [pdf, ps, other]

Genericity of contracting geodesics in groups

Authors: Kunal Chawla, Inhyeok Choi, Giulio Tiozzo

Abstract: Let G be a finitely generated group and Cay(G, S) be the Cayley graph of G with respect to a finite generating set S. We characterize the Gromov hyperbolicity of G in terms of the genericity of contracting elements in Cay(G, S). Let G be a finitely generated group and Cay(G, S) be the Cayley graph of G with respect to a finite generating set S. We characterize the Gromov hyperbolicity of G in terms of the genericity of contracting elements in Cay(G, S). △ Less

Submitted 3 August, 2023; originally announced August 2023.

Comments: 20 pages

MSC Class: 20F67; 20F69

arXiv:2211.00083 [pdf, other]

WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain

Authors: Raj Sanjay Shah, Kunal Chawla, Dheeraj Eidnani, Agam Shah, Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang

Abstract: Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without completely leveraging the richness of the financial data. We propose a novel domain specific Financial LANGuage model (FLANG) which uses financial keywords and phrases… ▽ More Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without completely leveraging the richness of the financial data. We propose a novel domain specific Financial LANGuage model (FLANG) which uses financial keywords and phrases for better masking, together with span boundary objective and in-filing objective. Additionally, the evaluation benchmarks in the field have been limited. To this end, we contribute the Financial Language Understanding Evaluation (FLUE), an open-source comprehensive suite of benchmarks for the financial domain. These include new benchmarks across 5 NLP tasks in financial domain as well as common benchmarks used in the previous research. Experiments on these benchmarks suggest that our model outperforms those in prior literature on a variety of NLP tasks. Our models, code and benchmark data are publicly available on Github and Huggingface. △ Less

Submitted 31 October, 2022; originally announced November 2022.

arXiv:2210.05664 [pdf, other]

Social Influence Dialogue Systems: A Survey of Datasets and Models For Social Influence Tasks

Authors: Kushal Chawla, Weiyan Shi, **gwen Zhang, Gale Lucas, Zhou Yu, Jonathan Gratch

Abstract: Dialogue systems capable of social influence such as persuasion, negotiation, and therapy, are essential for extending the use of technology to numerous realistic scenarios. However, existing research primarily focuses on either task-oriented or open-domain scenarios, a categorization that has been inadequate for capturing influence skills systematically. There exists no formal definition or categ… ▽ More Dialogue systems capable of social influence such as persuasion, negotiation, and therapy, are essential for extending the use of technology to numerous realistic scenarios. However, existing research primarily focuses on either task-oriented or open-domain scenarios, a categorization that has been inadequate for capturing influence skills systematically. There exists no formal definition or category for dialogue systems with these skills and data-driven efforts in this direction are highly limited. In this work, we formally define and introduce the category of social influence dialogue systems that influence users' cognitive and emotional responses, leading to changes in thoughts, opinions, and behaviors through natural conversations. We present a survey of various tasks, datasets, and methods, compiling the progress across seven diverse domains. We discuss the commonalities and differences between the examined systems, identify limitations, and recommend future directions. This study serves as a comprehensive reference for social influence dialogue systems to inspire more dedicated research and discussion in this emerging area. △ Less

Submitted 24 January, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

Comments: Accepted at EACL 2023

arXiv:2209.02114 [pdf, other]

The Poisson boundary of hyperbolic groups without moment conditions

Authors: Kunal Chawla, Behrang Forghani, Joshua Frisch, Giulio Tiozzo

Abstract: We prove that the Poisson boundary of a random walk with finite entropy on a non-elementary hyperbolic group can be identified with its hyperbolic boundary, without assuming any moment condition on the measure. We also extend our method to groups with an action by isometries on a hyperbolic metric space containing a WPD element; this applies to a large class of non-hyperbolic groups such as relati… ▽ More We prove that the Poisson boundary of a random walk with finite entropy on a non-elementary hyperbolic group can be identified with its hyperbolic boundary, without assuming any moment condition on the measure. We also extend our method to groups with an action by isometries on a hyperbolic metric space containing a WPD element; this applies to a large class of non-hyperbolic groups such as relatively hyperbolic groups, map** class groups, and groups acting on CAT(0) spaces. △ Less

Submitted 29 November, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

Comments: 23 pages. Changes in the new version: we included Theorem 1.2, which now applies to all groups with a WPD element, and its applications in Corollary 1.3. The proof is in Section 5. For the sake of exposition, we also added Section 3.1, explaining the concept of pivots for free groups

MSC Class: 60G50; 20F67

arXiv:2207.03023 [pdf, other]

Superior mechanical properties by exploiting size-effects and multiscale interactions in hierarchically architected foams

Authors: Komal Chawla, Abhishek Gupta, Abhijeet S. Bhardwaj, Ramathasan Thevamaran

Abstract: Protective applications in extreme environments demand thermally stable materials with superior modulus, strength, and specific energy absorption (SEA) at lightweight. However, these properties typically have a trade-off. Hierarchically architected materials--such as the architected vertically aligned carbon nanotube (VACNT) foams--offer the potential to overcome these trade-offs to achieve synerg… ▽ More Protective applications in extreme environments demand thermally stable materials with superior modulus, strength, and specific energy absorption (SEA) at lightweight. However, these properties typically have a trade-off. Hierarchically architected materials--such as the architected vertically aligned carbon nanotube (VACNT) foams--offer the potential to overcome these trade-offs to achieve synergistic enhancement in mechanical properties. Here, we adopt a full-factorial design of experiments (DOE) approach to optimize multitier design parameters to achieve synergistic enhancement in SEA, strength, and modulus at lightweight in VACNT foams with mesoscale cylindrical architecture. We exploit the size effects from geometrically-confined synthesis and the highly interactive morphology of CNTs to enable higher-order design parameter interactions that intriguingly break the diameter-to-thickness (D/t)-dependent scaling laws found in common tubular architected materials. We show that exploiting complementary hierarchical mechanisms in architected material design can lead to unprecedented synergistic enhancement of mechanical properties and performance desirable for extreme protective applications. △ Less

Submitted 6 July, 2022; originally announced July 2022.

Comments: 17 Pages, 6 Figures

arXiv:2205.00344 [pdf, other]

Opponent Modeling in Negotiation Dialogues by Related Data Adaptation

Authors: Kushal Chawla, Gale M. Lucas, Jonathan May, Jonathan Gratch

Abstract: Opponent modeling is the task of inferring another party's mental state within the context of social interactions. In a multi-issue negotiation, it involves inferring the relative importance that the opponent assigns to each issue under discussion, which is crucial for finding high-value deals. A practical model for this task needs to infer these priorities of the opponent on the fly based on part… ▽ More Opponent modeling is the task of inferring another party's mental state within the context of social interactions. In a multi-issue negotiation, it involves inferring the relative importance that the opponent assigns to each issue under discussion, which is crucial for finding high-value deals. A practical model for this task needs to infer these priorities of the opponent on the fly based on partial dialogues as input, without needing additional annotations for training. In this work, we propose a ranker for identifying these priorities from negotiation dialogues. The model takes in a partial dialogue as input and predicts the priority order of the opponent. We further devise ways to adapt related data sources for this task to provide more explicit supervision for incorporating the opponent's preferences and offers, as a proxy to relying on granular utterance-level annotations. We show the utility of our proposed approach through extensive experiments based on two dialogue datasets. We find that the proposed data adaptations lead to strong performance in zero-shot and few-shot scenarios. Moreover, they allow the model to perform better than baselines while accessing fewer utterances from the opponent. We release our code to support future work in this direction. △ Less

Submitted 3 May, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

Comments: Appearing at Findings of NAACL 2022

arXiv:2204.06141 [pdf, other]

Effective Drift Estimates for Random Walks on Graph Products

Authors: Kunal Chawla

Abstract: We find uniform lower bounds on the drift for a large family of random walks on graph products, of the form $ \mathbb{P} (|Z_{n}| \leq κn) \leq e ^{-κn} $ for $ κ> 0 $. This includes the simple random walk for a right-angled Artin group with a sparse defining graph. This is done by extending an argument of Gouëzel, along with the combinatorial notion of a piling introduced by Crisp, Godelle, and W… ▽ More We find uniform lower bounds on the drift for a large family of random walks on graph products, of the form $ \mathbb{P} (|Z_{n}| \leq κn) \leq e ^{-κn} $ for $ κ> 0 $. This includes the simple random walk for a right-angled Artin group with a sparse defining graph. This is done by extending an argument of Gouëzel, along with the combinatorial notion of a piling introduced by Crisp, Godelle, and Wiest. We do not use any moment conditions, instead considering random walks which alternate between one measure uniformly distributed on vertex groups, and another measure over which we make almost no assumptions. △ Less

Submitted 12 April, 2022; originally announced April 2022.

Comments: 2 figures

arXiv:2107.13165 [pdf, other]

Towards Emotion-Aware Agents For Negotiation Dialogues

Authors: Kushal Chawla, Rene Clever, Jaysa Ramirez, Gale Lucas, Jonathan Gratch

Abstract: Negotiation is a complex social interaction that encapsulates emotional encounters in human decision-making. Virtual agents that can negotiate with humans are useful in pedagogy and conversational AI. To advance the development of such agents, we explore the prediction of two important subjective goals in a negotiation - outcome satisfaction and partner perception. Specifically, we analyze the ext… ▽ More Negotiation is a complex social interaction that encapsulates emotional encounters in human decision-making. Virtual agents that can negotiate with humans are useful in pedagogy and conversational AI. To advance the development of such agents, we explore the prediction of two important subjective goals in a negotiation - outcome satisfaction and partner perception. Specifically, we analyze the extent to which emotion attributes extracted from the negotiation help in the prediction, above and beyond the individual difference variables. We focus on a recent dataset in chat-based negotiations, grounded in a realistic cam** scenario. We study three degrees of emotion dimensions - emoticons, lexical, and contextual by leveraging affective lexicons and a state-of-the-art deep learning architecture. Our insights will be helpful in designing adaptive negotiation agents that interact through realistic communication interfaces. △ Less

Submitted 28 July, 2021; originally announced July 2021.

Comments: Accepted at 9th International Conference on Affective Computing & Intelligent Interaction (ACII 2021)

arXiv:2103.15721 [pdf, other]

CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems

Authors: Kushal Chawla, Jaysa Ramirez, Rene Clever, Gale Lucas, Jonathan May, Jonathan Gratch

Abstract: Automated systems that negotiate with humans have broad applications in pedagogy and conversational AI. To advance the development of practical negotiation systems, we present CaSiNo: a novel corpus of over a thousand negotiation dialogues in English. Participants take the role of campsite neighbors and negotiate for food, water, and firewood packages for their upcoming trip. Our design results in… ▽ More Automated systems that negotiate with humans have broad applications in pedagogy and conversational AI. To advance the development of practical negotiation systems, we present CaSiNo: a novel corpus of over a thousand negotiation dialogues in English. Participants take the role of campsite neighbors and negotiate for food, water, and firewood packages for their upcoming trip. Our design results in diverse and linguistically rich negotiations while maintaining a tractable, closed-domain environment. Inspired by the literature in human-human negotiations, we annotate persuasion strategies and perform correlation analysis to understand how the dialogue behaviors are associated with the negotiation performance. We further propose and evaluate a multi-task framework to recognize these strategies in a given utterance. We find that multi-task learning substantially improves the performance for all strategy labels, especially for the ones that are the most skewed. We release the dataset, annotations, and the code to propel future work in human-machine negotiations: https://github.com/kushalchawla/CaSiNo △ Less

Submitted 28 April, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

Comments: Accepted at NAACL 2021

arXiv:2010.05090 [pdf, other]

Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information Maximization

Authors: Kunal Chawla, Diyi Yang

Abstract: Formality style transfer is the task of converting informal sentences to grammatically-correct formal sentences, which can be used to improve performance of many downstream NLP tasks. In this work, we propose a semi-supervised formality style transfer model that utilizes a language model-based discriminator to maximize the likelihood of the output sentence being formal, which allows us to use maxi… ▽ More Formality style transfer is the task of converting informal sentences to grammatically-correct formal sentences, which can be used to improve performance of many downstream NLP tasks. In this work, we propose a semi-supervised formality style transfer model that utilizes a language model-based discriminator to maximize the likelihood of the output sentence being formal, which allows us to use maximization of token-level conditional probabilities for training. We further propose to maximize mutual information between source and target styles as our training objective instead of maximizing the regular likelihood that often leads to repetitive and trivial generated responses. Experiments showed that our model outperformed previous state-of-the-art baselines significantly in terms of both automated metrics and human judgement. We further generalized our model to unsupervised text style transfer task, and achieved significant improvements on two benchmark sentiment style transfer datasets. △ Less

Submitted 10 October, 2020; originally announced October 2020.

Comments: EMNLP 2020 Findings

arXiv:2009.06781 [pdf, ps, other]

Pilot: Winner of the Human-Agent Negotiation Challenge at IJCAI 2020

Authors: Kushal Chawla, Gale Lucas

Abstract: This document describes our agent Pilot, winner of the Human-Agent Negotiation Challenge at ANAC, IJCAI 2020. Pilot is a virtual human that participates in a sequence of three negotiations with a human partner. Our system is based on the Interactive Arbitration Guide Online (IAGO) negotiation framework. We leverage prior Affective Computing and Psychology research in negotiations to guide various… ▽ More This document describes our agent Pilot, winner of the Human-Agent Negotiation Challenge at ANAC, IJCAI 2020. Pilot is a virtual human that participates in a sequence of three negotiations with a human partner. Our system is based on the Interactive Arbitration Guide Online (IAGO) negotiation framework. We leverage prior Affective Computing and Psychology research in negotiations to guide various key principles that define the behavior and personality of our agent. △ Less

Submitted 18 January, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

Comments: Winner at ANAC, IJCAI 2020

ACM Class: I.2

arXiv:2009.03849 [pdf, other]

LynyrdSkynyrd at WNUT-2020 Task 2: Semi-Supervised Learning for Identification of Informative COVID-19 English Tweets

Authors: Abhilasha Sancheti, Kushal Chawla, Gaurav Verma

Abstract: We describe our system for WNUT-2020 shared task on the identification of informative COVID-19 English tweets. Our system is an ensemble of various machine learning methods, leveraging both traditional feature-based classifiers as well as recent advances in pre-trained language models that help in capturing the syntactic, semantic, and contextual features from the tweets. We further employ pseudo-… ▽ More We describe our system for WNUT-2020 shared task on the identification of informative COVID-19 English tweets. Our system is an ensemble of various machine learning methods, leveraging both traditional feature-based classifiers as well as recent advances in pre-trained language models that help in capturing the syntactic, semantic, and contextual features from the tweets. We further employ pseudo-labelling to incorporate the unlabelled Twitter data released on the pandemic. Our best performing model achieves an F1-score of 0.9179 on the provided validation set and 0.8805 on the blind test-set. △ Less

Submitted 8 September, 2020; originally announced September 2020.

ACM Class: I.2.7

arXiv:2004.02363 [pdf, ps, other]

Exploring Early Prediction of Buyer-Seller Negotiation Outcomes

Authors: Kushal Chawla, Gale Lucas, Jonathan May, Jonathan Gratch

Abstract: Agents that negotiate with humans find broad applications in pedagogy and conversational AI. Most efforts in human-agent negotiations rely on restrictive menu-driven interfaces for communication. To advance the research in language-based negotiation systems, we explore a novel task of early prediction of buyer-seller negotiation outcomes, by varying the fraction of utterances that the model can ac… ▽ More Agents that negotiate with humans find broad applications in pedagogy and conversational AI. Most efforts in human-agent negotiations rely on restrictive menu-driven interfaces for communication. To advance the research in language-based negotiation systems, we explore a novel task of early prediction of buyer-seller negotiation outcomes, by varying the fraction of utterances that the model can access. We explore the feasibility of early prediction by using traditional feature-based methods, as well as by incorporating the non-linguistic task context into a pretrained language model using sentence templates. We further quantify the extent to which linguistic features help in making better predictions apart from the task-specific price information. Finally, probing the pretrained model helps us to identify specific features, such as trust and agreement, that contribute to the prediction performance. △ Less

Submitted 25 February, 2021; v1 submitted 5 April, 2020; originally announced April 2020.

ACM Class: I.2.7

arXiv:1912.08492 [pdf, other]

Generating summaries tailored to target characteristics

Authors: Kushal Chawla, Hrituraj Singh, Arijit Pramanik, Mithlesh Kumar, Balaji Vasan Srinivasan

Abstract: Recently, research efforts have gained pace to cater to varied user preferences while generating text summaries. While there have been attempts to incorporate a few handpicked characteristics such as length or entities, a holistic view around these preferences is missing and crucial insights on why certain characteristics should be incorporated in a specific manner are absent. With this objective,… ▽ More Recently, research efforts have gained pace to cater to varied user preferences while generating text summaries. While there have been attempts to incorporate a few handpicked characteristics such as length or entities, a holistic view around these preferences is missing and crucial insights on why certain characteristics should be incorporated in a specific manner are absent. With this objective, we provide a categorization around these characteristics relevant to the task of text summarization: one, focusing on what content needs to be generated and second, focusing on the stylistic aspects of the output summaries. We use our insights to provide guidelines on appropriate methods to incorporate various classes characteristics in sequence-to-sequence summarization framework. Our experiments with incorporating topics, readability and simplicity indicate the viability of the proposed prescriptions △ Less

Submitted 18 December, 2019; originally announced December 2019.

Comments: Appeared in CiCLing 2019

arXiv:1901.11492 [pdf]

Improving generation quality of pointer networks via guided attention

Authors: Kushal Chawla, Kundan Krishna, Balaji Vasan Srinivasan

Abstract: Pointer generator networks have been used successfully for abstractive summarization. Along with the capability to generate novel words, it also allows the model to copy from the input text to handle out-of-vocabulary words. In this paper, we point out two key shortcomings of the summaries generated with this framework via manual inspection, statistical analysis and human evaluation. The first sho… ▽ More Pointer generator networks have been used successfully for abstractive summarization. Along with the capability to generate novel words, it also allows the model to copy from the input text to handle out-of-vocabulary words. In this paper, we point out two key shortcomings of the summaries generated with this framework via manual inspection, statistical analysis and human evaluation. The first shortcoming is the extractive nature of the generated summaries, since the network eventually learns to copy from the input article most of the times, affecting the abstractive nature of the generated summaries. The second shortcoming is the factual inaccuracies in the generated text despite grammatical correctness. Our analysis indicates that this arises due to incorrect attention transition between different parts of the article. We propose an initial attempt towards addressing both these shortcomings by externally appending traditional linguistic information parsed from the input text, thereby teaching networks on the structure of the underlying text. Results indicate feasibility and potential of such additional cues for improved generation. △ Less

Submitted 20 January, 2019; originally announced January 2019.

Comments: In AAAI-19 Workshop on Network Interpretability for Deep Learning

arXiv:1901.02412 [pdf, other]

Forecasting Granular Audience Size for Online Advertising

Authors: Ritwik Sinha, Dhruv Singal, Pranav Maneriker, Kushal Chawla, Yash Shrivastava, Deepak Pai, Atanu R Sinha

Abstract: Orchestration of campaigns for online display advertising requires marketers to forecast audience size at the granularity of specific attributes of web traffic, characterized by the categorical nature of all attributes (e.g. {US, Chrome, Mobile}). With each attribute taking many values, the very large attribute combination set makes estimating audience size for any specific attribute combination c… ▽ More Orchestration of campaigns for online display advertising requires marketers to forecast audience size at the granularity of specific attributes of web traffic, characterized by the categorical nature of all attributes (e.g. {US, Chrome, Mobile}). With each attribute taking many values, the very large attribute combination set makes estimating audience size for any specific attribute combination challenging. We modify Eclat, a frequent itemset mining (FIM) algorithm, to accommodate categorical variables. For consequent frequent and infrequent itemsets, we then provide forecasts using time series analysis with conditional probabilities to aid approximation. An extensive simulation, based on typical characteristics of audience data, is built to stress test our modified-FIM approach. In two real datasets, comparison with baselines including neural network models, shows that our method lowers computation time of FIM for categorical data. On hold out samples we show that the proposed forecasting method outperforms these baselines. △ Less

Submitted 8 January, 2019; originally announced January 2019.

Comments: Published at AdKDD & TargetAd 2018

arXiv:1808.10703 [pdf]

PythonRobotics: a Python code collection of robotics algorithms

Authors: Atsushi Sakai, Daniel Ingram, Joseph Dinius, Karan Chawla, Antonin Raffin, Alexis Paques

Abstract: This paper describes an Open Source Software (OSS) project: PythonRobotics. This is a collection of robotics algorithms implemented in the Python programming language. The focus of the project is on autonomous navigation, and the goal is for beginners in robotics to understand the basic ideas behind each algorithm. In this project, the algorithms which are practical and widely used in both academi… ▽ More This paper describes an Open Source Software (OSS) project: PythonRobotics. This is a collection of robotics algorithms implemented in the Python programming language. The focus of the project is on autonomous navigation, and the goal is for beginners in robotics to understand the basic ideas behind each algorithm. In this project, the algorithms which are practical and widely used in both academia and industry are selected. Each sample code is written in Python3 and only depends on some standard modules for readability and ease of use. It includes intuitive animations to understand the behavior of the simulation. △ Less

Submitted 7 September, 2018; v1 submitted 31 August, 2018; originally announced August 2018.

Comments: 8pages, 6 figures, this is the code repository: https://github.com/AtsushiSakai/PythonRobotics

arXiv:1805.07966 [pdf, other]

Aff2Vec: Affect--Enriched Distributional Word Representations

Authors: Sopan Khosla, Niyati Chhaya, Kushal Chawla

Abstract: Human communication includes information, opinions, and reactions. Reactions are often captured by the affective-messages in written as well as verbal communications. While there has been work in affect modeling and to some extent affective content generation, the area of affective word distributions in not well studied. Synsets and lexica capture semantic relationships across words. These models… ▽ More Human communication includes information, opinions, and reactions. Reactions are often captured by the affective-messages in written as well as verbal communications. While there has been work in affect modeling and to some extent affective content generation, the area of affective word distributions in not well studied. Synsets and lexica capture semantic relationships across words. These models however lack in encoding affective or emotional word interpretations. Our proposed model, Aff2Vec provides a method for enriched word embeddings that are representative of affective interpretations of words. Aff2Vec outperforms the state--of--the--art in intrinsic word-similarity tasks. Further, the use of Aff2Vec representations outperforms baseline embeddings in downstream natural language understanding tasks including sentiment analysis, personality detection, and frustration prediction. △ Less

Submitted 21 May, 2018; originally announced May 2018.

arXiv:1804.00382 [pdf, ps, other]

Attention-based Ensemble for Deep Metric Learning

Authors: Wonsik Kim, Bhavya Goyal, Kunal Chawla, Jungmin Lee, Keunjoo Kwon

Abstract: Deep metric learning aims to learn an embedding function, modeled as deep neural network. This embedding function usually puts semantically similar images close while dissimilar images far from each other in the learned embedding space. Recently, ensemble has been applied to deep metric learning to yield state-of-the-art results. As one important aspect of ensemble, the learners should be diverse… ▽ More Deep metric learning aims to learn an embedding function, modeled as deep neural network. This embedding function usually puts semantically similar images close while dissimilar images far from each other in the learned embedding space. Recently, ensemble has been applied to deep metric learning to yield state-of-the-art results. As one important aspect of ensemble, the learners should be diverse in their feature embeddings. To this end, we propose an attention-based ensemble, which uses multiple attention masks, so that each learner can attend to different parts of the object. We also propose a divergence loss, which encourages diversity among the learners. The proposed method is applied to the standard benchmarks of deep metric learning and experimental results show that it outperforms the state-of-the-art methods by a significant margin on image retrieval tasks. △ Less

Submitted 31 August, 2018; v1 submitted 1 April, 2018; originally announced April 2018.

Comments: ECCV 2018 camera-ready

arXiv:1709.00659 [pdf, other]

Investigating how well contextual features are captured by bi-directional recurrent neural network models

Authors: Kushal Chawla, Sunil Kumar Sahu, Ashish Anand

Abstract: Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using an only distributional representation of words have been successfully applied for several NLP tasks. Such models learn features automatically and avoid explicit feature engineering. Across several domains, neural models… ▽ More Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using an only distributional representation of words have been successfully applied for several NLP tasks. Such models learn features automatically and avoid explicit feature engineering. Across several domains, neural models become a natural choice specifically when limited characteristics of data are known. However, this flexibility comes at the cost of interpretability. In this paper, we define three different methods to investigate ability of bi-directional recurrent neural networks (RNNs) in capturing contextual features. In particular, we analyze RNNs for sequence tagging tasks. We perform a comprehensive analysis on general as well as biomedical domain datasets. Our experiments focus on important contextual words as features, which can easily be extended to analyze various other feature types. We also investigate positional effects of context words and show how the developed methods can be used for error analysis. △ Less

Submitted 29 November, 2017; v1 submitted 3 September, 2017; originally announced September 2017.

Comments: Camera ready version of ICON-2017

arXiv:1202.0617 [pdf]

Classification of Flames in Computer Mediated Communications

Authors: Nitin, Ankush Bansal, Siddhartha Mahadev Sharma, Kapil Kumar, Anuj Aggarwal, Sheenu Goyal, Kanika Choudhary, Kunal Chawla, Kunal Jain, Manav Bhasin

Abstract: Computer Mediated Communication (CMC) has brought about a revolution in the way the world communicates with each other. With the increasing number of people, interacting through the internet and the rise of new platforms and technologies has brought together the people from different social, cultural and geographical backgrounds to present their thoughts, ideas and opinions on topics of their inte… ▽ More Computer Mediated Communication (CMC) has brought about a revolution in the way the world communicates with each other. With the increasing number of people, interacting through the internet and the rise of new platforms and technologies has brought together the people from different social, cultural and geographical backgrounds to present their thoughts, ideas and opinions on topics of their interest. CMC has, in some cases, gave users more freedom to express themselves as compared to Face-to-face communication. This has also led to rise in the use of hostile and aggressive language and terminologies uninhibitedly. Since such use of language is detrimental to the discussion process and affects the audience and individuals negatively, efforts are being taken to control them. The research sees the need to understand the concept of flaming and hence attempts to classify them in order to give a better understanding of it. The classification is done on the basis of type of flame content being presented and the Style in which they are presented. △ Less

Submitted 17 February, 2012; v1 submitted 3 February, 2012; originally announced February 2012.

Comments: 6 pages, 4 figures

Report number: pxc3872505

Journal ref: International Journal of Computer Applications (0975-8887), Volume 14 - No.6, February 2011

arXiv:cs/0508081 [pdf]

ZEUS - A Domain-Oriented Fact Comparison Based Authentication Protocol

Authors: Kirti Chawla

Abstract: In this paper, facts existing in different domains are explored, which are comparable by their end result. Properties of various domains and the facts that are part of such a unit are also presented, examples of comparison and methods of usage as means of zero-knowledge protocols are given, finally a zero-knowledge protocol based on afore-mentioned concept is given. In this paper, facts existing in different domains are explored, which are comparable by their end result. Properties of various domains and the facts that are part of such a unit are also presented, examples of comparison and methods of usage as means of zero-knowledge protocols are given, finally a zero-knowledge protocol based on afore-mentioned concept is given. △ Less

Submitted 18 August, 2005; originally announced August 2005.

Comments: 5 Pages

arXiv:cs/0508080 [pdf]

A 3D RGB Axis-based Color-oriented Cryptography

Authors: Kirti Chawla

Abstract: In this document, a formal approach to encrypt, decrypt, transmit and receive information using colors is explored. A piece of information consists of set of symbols with a definite property imposed on the generating set. The symbols are usually encoded using ascii scheme. A linear to 3d transformation is presented. The change of axis from traditional xyz to rgb is highlighted and its effect are… ▽ More In this document, a formal approach to encrypt, decrypt, transmit and receive information using colors is explored. A piece of information consists of set of symbols with a definite property imposed on the generating set. The symbols are usually encoded using ascii scheme. A linear to 3d transformation is presented. The change of axis from traditional xyz to rgb is highlighted and its effect are studied. A point in this new axis is then represented as a unique color and a vector or matrix is associated with it, making it amenable to standard vector or matrix operations. A formal notion on hybrid cryptography is introduced as the algorithm lies on the boundary of symmetric and asymmetric cryptography. No discussion is complete, without mentioning reference to communication aspects of secure information in a channel. Transmission scheme pertaining to light as carrier is introduced and studied. Key-exchanges do not come under the scope of current frame of document. △ Less

Submitted 18 August, 2005; originally announced August 2005.

Comments: 16 Pages, 12 figures

Showing 1–29 of 29 results for author: Chawla, K