Search | arXiv e-print repository

Dynamic Inhomogeneous Quantum Resource Scheduling with Reinforcement Learning

Authors: Linsen Li, Pratyush Anand, Kaiming He, Dirk Englund

Abstract: A central challenge in quantum information science and technology is achieving real-time estimation and feedforward control of quantum systems. This challenge is compounded by the inherent inhomogeneity of quantum resources, such as qubit properties and controls, and their intrinsically probabilistic nature. This leads to stochastic challenges in error detection and probabilistic outcomes in proce… ▽ More A central challenge in quantum information science and technology is achieving real-time estimation and feedforward control of quantum systems. This challenge is compounded by the inherent inhomogeneity of quantum resources, such as qubit properties and controls, and their intrinsically probabilistic nature. This leads to stochastic challenges in error detection and probabilistic outcomes in processes such as heralded remote entanglement. Given these complexities, optimizing the construction of quantum resource states is an NP-hard problem. In this paper, we address the quantum resource scheduling issue by formulating the problem and simulating it within a digitized environment, allowing the exploration and development of agent-based optimization strategies. We employ reinforcement learning agents within this probabilistic setting and introduce a new framework utilizing a Transformer model that emphasizes self-attention mechanisms for pairs of qubits. This approach facilitates dynamic scheduling by providing real-time, next-step guidance. Our method significantly improves the performance of quantum systems, achieving more than a 3$\times$ improvement over rule-based agents, and establishes an innovative framework that improves the joint design of physical and control systems for quantum applications in communication, networking, and computing. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2306.11207 [pdf, other]

Quilt-1M: One Million Image-Text Pairs for Histopathology

Authors: Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda Shapiro

Abstract: Recent accelerations in multi-modal applications have been made possible with the plethora of image and text data available online. However, the scarcity of analogous data in the medical field, specifically in histopathology, has slowed comparable progress. To enable similar representation learning for histopathology, we turn to YouTube, an untapped resource of videos, offering $1,087$ hours of va… ▽ More Recent accelerations in multi-modal applications have been made possible with the plethora of image and text data available online. However, the scarcity of analogous data in the medical field, specifically in histopathology, has slowed comparable progress. To enable similar representation learning for histopathology, we turn to YouTube, an untapped resource of videos, offering $1,087$ hours of valuable educational histopathology videos from expert clinicians. From YouTube, we curate QUILT: a large-scale vision-language dataset consisting of $802, 144$ image and text pairs. QUILT was automatically curated using a mixture of models, including large language models, handcrafted algorithms, human knowledge databases, and automatic speech recognition. In comparison, the most comprehensive datasets curated for histopathology amass only around $200$K samples. We combine QUILT with datasets from other sources, including Twitter, research papers, and the internet in general, to create an even larger dataset: QUILT-1M, with $1$M paired image-text samples, marking it as the largest vision-language histopathology dataset to date. We demonstrate the value of QUILT-1M by fine-tuning a pre-trained CLIP model. Our model outperforms state-of-the-art models on both zero-shot and linear probing tasks for classifying new histopathology images across $13$ diverse patch-level datasets of $8$ different sub-pathologies and cross-modal retrieval tasks. △ Less

Submitted 27 October, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

arXiv:2305.15336 [pdf, other]

From Text to MITRE Techniques: Exploring the Malicious Use of Large Language Models for Generating Cyber Attack Payloads

Authors: P. V. Sai Charan, Hrushikesh Chunduri, P. Mohan Anand, Sandeep K Shukla

Abstract: This research article critically examines the potential risks and implications arising from the malicious utilization of large language models(LLM), focusing specifically on ChatGPT and Google's Bard. Although these large language models have numerous beneficial applications, the misuse of this technology by cybercriminals for creating offensive payloads and tools is a significant concern. In this… ▽ More This research article critically examines the potential risks and implications arising from the malicious utilization of large language models(LLM), focusing specifically on ChatGPT and Google's Bard. Although these large language models have numerous beneficial applications, the misuse of this technology by cybercriminals for creating offensive payloads and tools is a significant concern. In this study, we systematically generated implementable code for the top-10 MITRE Techniques prevalent in 2022, utilizing ChatGPT, and conduct a comparative analysis of its performance with Google's Bard. Our experimentation reveals that ChatGPT has the potential to enable attackers to accelerate the operation of more targeted and sophisticated attacks. Additionally, the technology provides amateur attackers with more capabilities to perform a wide range of attacks and empowers script kiddies to develop customized tools that contribute to the acceleration of cybercrime. Furthermore, LLMs significantly benefits malware authors, particularly ransomware gangs, in generating sophisticated variants of wiper and ransomware attacks with ease. On a positive note, our study also highlights how offensive security researchers and pentesters can make use of LLMs to simulate realistic attack scenarios, identify potential vulnerabilities, and better protect organizations. Overall, we conclude by emphasizing the need for increased vigilance in mitigating the risks associated with LLMs. This includes implementing robust security measures, increasing awareness and education around the potential risks of this technology, and collaborating with security experts to stay ahead of emerging threats. △ Less

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2303.08863 [pdf, other]

Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels

Authors: Jan Oscar Cross-Zamirski, Praveen Anand, Guy Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb

Abstract: Image-to-image reconstruction problems with free or inexpensive metadata in the form of class labels appear often in biological and medical image domains. Existing text-guided or style-transfer image-to-image approaches do not translate to datasets where additional information is provided as discrete classes. We introduce and implement a model which combines image-to-image and class-guided denoisi… ▽ More Image-to-image reconstruction problems with free or inexpensive metadata in the form of class labels appear often in biological and medical image domains. Existing text-guided or style-transfer image-to-image approaches do not translate to datasets where additional information is provided as discrete classes. We introduce and implement a model which combines image-to-image and class-guided denoising diffusion probabilistic models. We train our model on a real-world dataset of microscopy images used for drug discovery, with and without incorporating metadata labels. By exploring the properties of image-to-image diffusion with relevant labels, we show that class-guided image-to-image diffusion can improve the meaningful content of the reconstructed images and outperform the unguided model in useful downstream tasks. △ Less

Submitted 29 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

arXiv:2110.08588 [pdf]

Preproduction Deploys: Cloud-Native Integration Testing

Authors: Jeremy J. Carroll, Pankaj Anand, David Guo

Abstract: The microservice architecture for cloud-based systems is extended to not only require each loosely coupled component to be independently deployable, but also to provide independent routing for each component. This supports canary deployments, green/blue deployments and roll-back. Both ad hoc and system integration test traffic can be directed to components before they are released to production tr… ▽ More The microservice architecture for cloud-based systems is extended to not only require each loosely coupled component to be independently deployable, but also to provide independent routing for each component. This supports canary deployments, green/blue deployments and roll-back. Both ad hoc and system integration test traffic can be directed to components before they are released to production traffic. Front-end code is included in this architecture by using server-side rendering of JS bundles. Environments for integration testing are created with preproduction deploys side by side with production deploys using appropriate levels of isolation. After a successful integration test run, preproduction components are known to work with production precisely as it is. For isolation, test traffic uses staging databases that are copied daily from the production databases, omitting sensitive data. Safety and security concerns are dealt with in a targeted fashion, not monolithically. This architecture scales well with organization size; is more effective for integration testing; and is better aligned with agile business practices than traditional approaches. △ Less

Submitted 16 October, 2021; originally announced October 2021.

Comments: 8 pages, 1 figure, submitted to IEEE CloudSummit 2021

arXiv:2106.01109 [pdf, other]

Improvement over Pinball Loss Support Vector Machine

Authors: Pritam Anand, Reshma Rastogi, Suresh Chandra

Abstract: Recently, there have been several papers that discuss the extension of the Pinball loss Support Vector Machine (Pin-SVM) model, originally proposed by Huang et al.,[1][2]. Pin-SVM classifier deals with the pinball loss function, which has been defined in terms of the parameter $τ$. The parameter $τ$ can take values in $[ -1,1]$. The existing Pin-SVM model requires to solve the same optimization pr… ▽ More Recently, there have been several papers that discuss the extension of the Pinball loss Support Vector Machine (Pin-SVM) model, originally proposed by Huang et al.,[1][2]. Pin-SVM classifier deals with the pinball loss function, which has been defined in terms of the parameter $τ$. The parameter $τ$ can take values in $[ -1,1]$. The existing Pin-SVM model requires to solve the same optimization problem for all values of $τ$ in $[ -1,1]$. In this paper, we improve the existing Pin-SVM model for the binary classification task. At first, we note that there is major difficulty in Pin-SVM model (Huang et al. [1]) for $ -1 \leq τ< 0$. Specifically, we show that the Pin-SVM model requires the solution of different optimization problem for $ -1 \leq τ< 0$. We further propose a unified model termed as Unified Pin-SVM which results in a QPP valid for all $-1\leq τ\leq 1$ and hence more convenient to use. The proposed Unified Pin-SVM model can obtain a significant improvement in accuracy over the existing Pin-SVM model which has also been empirically justified by extensive numerical experiments with real-world datasets. △ Less

Submitted 2 June, 2021; originally announced June 2021.

Comments: The numerical results presented in this paper can be regenerated by the code available at https://github.com/ltpritamanand/UnifiedPinSVM/ . We hope that our this work will let the researchers to use the correct formulation of Pin-SVM model in future and improve the predictions across different domain of technologies

arXiv:2102.11058 [pdf, other]

Anyone GAN Sing

Authors: Shreeviknesh Sankaran, Sukavanan Nanjundan, G. Paavai Anand

Abstract: The problem of audio synthesis has been increasingly solved using deep neural networks. With the introduction of Generative Adversarial Networks (GAN), another efficient and adjective path has opened up to solve this problem. In this paper, we present a method to synthesize the singing voice of a person using a Convolutional Long Short-term Memory (ConvLSTM) based GAN optimized using the Wasserste… ▽ More The problem of audio synthesis has been increasingly solved using deep neural networks. With the introduction of Generative Adversarial Networks (GAN), another efficient and adjective path has opened up to solve this problem. In this paper, we present a method to synthesize the singing voice of a person using a Convolutional Long Short-term Memory (ConvLSTM) based GAN optimized using the Wasserstein loss function. Our work is inspired by WGANSing by Chandna et al. Our model inputs consecutive frame-wise linguistic and frequency features, along with singer identity and outputs vocoder features. We train the model on a dataset of 48 English songs sung and spoken by 12 non-professional singers. For inference, sequential blocks are concatenated using an overlap-add procedure. We test the model using the Mel-Cepstral Distance metric and a subjective listening test with 18 participants. △ Less

Submitted 22 February, 2021; originally announced February 2021.

Comments: 5 pages, 8 figures

Journal ref: International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol.7, Issue 5, page no. 25-29, May-2020

arXiv:2102.04849 [pdf, other]

Learning a powerful SVM using piece-wise linear loss functions

Authors: Pritam Anand

Abstract: In this paper, we have considered general k-piece-wise linear convex loss functions in SVM model for measuring the empirical risk. The resulting k-Piece-wise Linear loss Support Vector Machine (k-PL-SVM) model is an adaptive SVM model which can learn a suitable piece-wise linear loss function according to nature of the given training set. The k-PL-SVM models are general SVM models and existing pop… ▽ More In this paper, we have considered general k-piece-wise linear convex loss functions in SVM model for measuring the empirical risk. The resulting k-Piece-wise Linear loss Support Vector Machine (k-PL-SVM) model is an adaptive SVM model which can learn a suitable piece-wise linear loss function according to nature of the given training set. The k-PL-SVM models are general SVM models and existing popular SVM models, like C-SVM, LS-SVM and Pin-SVM models, are their particular cases. We have performed the extensive numerical experiments with k-PL-SVM models for k = 2 and 3 and shown that they are improvement over existing SVM models. △ Less

Submitted 9 February, 2021; originally announced February 2021.

Comments: 9 pages

arXiv:2009.02459 [pdf, other]

Bio-inspired Structure Identification in Language Embeddings

Authors: Hongwei, Zhou, Oskar Elek, Pranav Anand, Angus G. Forbes

Abstract: Word embeddings are a popular way to improve downstream performances in contemporary language modeling. However, the underlying geometric structure of the embedding space is not well understood. We present a series of explorations using bio-inspired methodology to traverse and visualize word embeddings, demonstrating evidence of discernible structure. Moreover, our model also produces word similar… ▽ More Word embeddings are a popular way to improve downstream performances in contemporary language modeling. However, the underlying geometric structure of the embedding space is not well understood. We present a series of explorations using bio-inspired methodology to traverse and visualize word embeddings, demonstrating evidence of discernible structure. Moreover, our model also produces word similarity rankings that are plausible yet very different from common similarity metrics, mainly cosine similarity and Euclidean distance. We show that our bio-inspired model can be used to investigate how different word embedding techniques result in different semantic outputs, which can emphasize or obscure particular interpretations in textual data. △ Less

Submitted 15 September, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

Comments: 7 pages, 8 figures, 2 tables, Visualisation for the Digital Humanities 2020. Comments: Fixed white spaces in abstract

arXiv:2004.09338 [pdf]

Augmented Curation of Unstructured Clinical Notes from a Massive EHR System Reveals Specific Phenotypic Signature of Impending COVID-19 Diagnosis

Authors: FNU Shweta, Karthik Murugadoss, Samir Awasthi, AJ Venkatakrishnan, Arjun Puranik, Martin Kang, Brian W. Pickering, John C. O'Horo, Philippe R. Bauer, Raymund R. Razonable, Paschalis Vergidis, Zelalem Temesgen, Stacey Rizza, Maryam Mahmood, Walter R. Wilson, Douglas Challener, Praveen Anand, Matt Liebers, Zainab Doctor, Eli Silvert, Hugo Solomon, Tyler Wagner, Gregory J. Gores, Amy W. Williams, John Halamka , et al. (2 additional authors not shown)

Abstract: Understanding the temporal dynamics of COVID-19 patient phenotypes is necessary to derive fine-grained resolution of pathophysiology. Here we use state-of-the-art deep neural networks over an institution-wide machine intelligence platform for the augmented curation of 15.8 million clinical notes from 30,494 patients subjected to COVID-19 PCR diagnostic testing. By contrasting the Electronic Health… ▽ More Understanding the temporal dynamics of COVID-19 patient phenotypes is necessary to derive fine-grained resolution of pathophysiology. Here we use state-of-the-art deep neural networks over an institution-wide machine intelligence platform for the augmented curation of 15.8 million clinical notes from 30,494 patients subjected to COVID-19 PCR diagnostic testing. By contrasting the Electronic Health Record (EHR)-derived clinical phenotypes of COVID-19-positive (COVIDpos, n=635) versus COVID-19-negative (COVIDneg, n=29,859) patients over each day of the week preceding the PCR testing date, we identify anosmia/dysgeusia (37.4-fold), myalgia/arthralgia (2.6-fold), diarrhea (2.2-fold), fever/chills (2.1-fold), respiratory difficulty (1.9-fold), and cough (1.8-fold) as significantly amplified in COVIDpos over COVIDneg patients. The specific combination of cough and diarrhea has a 3.2-fold amplification in COVIDpos patients during the week prior to PCR testing, and along with anosmia/dysgeusia, constitutes the earliest EHR-derived signature of COVID-19 (4-7 days prior to typical PCR testing date). This study introduces an Augmented Intelligence platform for the real-time synthesis of institutional knowledge captured in EHRs. The platform holds tremendous potential for scaling up curation throughput, with minimal need for retraining underlying neural networks, thus promising EHR-powered early diagnosis for a broad spectrum of diseases. △ Less

Submitted 28 April, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

arXiv:2001.09134 [pdf, other]

Touchless Ty** Using Head Movement-based Gestures

Authors: Shivam Rustagi, Aakash Garg, Pranay Raj Anand, Rajesh Kumar, Yaman Kumar, Rajiv Ratn Shah

Abstract: In this paper, we propose a novel touchless ty** interface that makes use of an on-screen QWERTY keyboard and a smartphone camera. The keyboard was divided into nine color-coded clusters. The user moved their head toward clusters, which contained the letters that they wanted to type. A front-facing smartphone camera recorded the head movements. A bidirectional GRU based model which used pre-trai… ▽ More In this paper, we propose a novel touchless ty** interface that makes use of an on-screen QWERTY keyboard and a smartphone camera. The keyboard was divided into nine color-coded clusters. The user moved their head toward clusters, which contained the letters that they wanted to type. A front-facing smartphone camera recorded the head movements. A bidirectional GRU based model which used pre-trained embedding rich in head pose features was employed to translate the recordings into cluster sequences. The model achieved an accuracy of 96.78% and 86.81% under intra- and inter-user scenarios, respectively, over a dataset of 2234 video sequences collected from 22 users. △ Less

Submitted 10 October, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

Comments: *The two lead authors contributed equally. More details are available at https://sites.google.com/iiitd.ac.in/touchless-ty**/home

ACM Class: I.2.7

Journal ref: The Sixth IEEE International Conference on Multimedia Big Data, August 2020

arXiv:2001.03463 [pdf, other]

Compressive sensing based privacy for fall detection

Authors: Ronak Gupta, Prashant Anand, Santanu Chaudhury, Brejesh Lall, Sanjay Singh

Abstract: Fall detection holds immense importance in the field of healthcare, where timely detection allows for instant medical assistance. In this context, we propose a 3D ConvNet architecture which consists of 3D Inception modules for fall detection. The proposed architecture is a custom version of Inflated 3D (I3D) architecture, that takes compressed measurements of video sequence as spatio-temporal inpu… ▽ More Fall detection holds immense importance in the field of healthcare, where timely detection allows for instant medical assistance. In this context, we propose a 3D ConvNet architecture which consists of 3D Inception modules for fall detection. The proposed architecture is a custom version of Inflated 3D (I3D) architecture, that takes compressed measurements of video sequence as spatio-temporal input, obtained from compressive sensing framework, rather than video sequence as input, as in the case of I3D convolutional neural network. This is adopted since privacy raises a huge concern for patients being monitored through these RGB cameras. The proposed framework for fall detection is flexible enough with respect to a wide variety of measurement matrices. Ten action classes randomly selected from Kinetics-400 with no fall examples, are employed to train our 3D ConvNet post compressive sensing with different types of sensing matrices on the original video clips. Our results show that 3D ConvNet performance remains unchanged with different sensing matrices. Also, the performance obtained with Kinetics pre-trained 3D ConvNet on compressively sensed fall videos from benchmark datasets is better than the state-of-the-art techniques. △ Less

Submitted 10 January, 2020; originally announced January 2020.

Comments: accepted in NCVPRIPG 2019

arXiv:1912.00643 [pdf]

Identifying the number of clusters for K-Means: A hypersphere density based approach

Authors: Sukavanan Nanjundan, Shreeviknesh Sankaran, C. R. Arjun, G. Paavai Anand

Abstract: Application of K-Means algorithm is restricted by the fact that the number of clusters should be known beforehand. Previously suggested methods to solve this problem are either ad hoc or require parametric assumptions and complicated calculations. The proposed method aims to solve this conundrum by considering cluster hypersphere density as the factor to determine the number of clusters in the giv… ▽ More Application of K-Means algorithm is restricted by the fact that the number of clusters should be known beforehand. Previously suggested methods to solve this problem are either ad hoc or require parametric assumptions and complicated calculations. The proposed method aims to solve this conundrum by considering cluster hypersphere density as the factor to determine the number of clusters in the given dataset. The density is calculated by assuming a hypersphere around the cluster centroid for n-different number of clusters. The calculated values are plotted against their corresponding number of clusters and then the optimum number of clusters is obtained after assaying the elbow region of the graph. The method is simple, easy to comprehend, and provides robust and reliable results. △ Less

Submitted 4 December, 2019; v1 submitted 2 December, 2019; originally announced December 2019.

Comments: 5 pages, 13 figures, International Conference on Computers, Communication and Signal Processing - 2019

arXiv:1910.09168 [pdf, other]

A $ν$- support vector quantile regression model with automatic accuracy control

Authors: Pritam Anand, Reshma Rastogi, Suresh Chandra

Abstract: This paper proposes a novel '$ν$-support vector quantile regression' ($ν$-SVQR) model for the quantile estimation. It can facilitate the automatic control over accuracy by creating a suitable asymmetric $ε$-insensitive zone according to the variance present in data. The proposed $ν$-SVQR model uses the $ν$ fraction of training data points for the estimation of the quantiles. In the $ν$-SVQR model,… ▽ More This paper proposes a novel '$ν$-support vector quantile regression' ($ν$-SVQR) model for the quantile estimation. It can facilitate the automatic control over accuracy by creating a suitable asymmetric $ε$-insensitive zone according to the variance present in data. The proposed $ν$-SVQR model uses the $ν$ fraction of training data points for the estimation of the quantiles. In the $ν$-SVQR model, training points asymptotically appear above and below of the asymmetric $ε$-insensitive tube in the ratio of $1-τ$ and $τ$. Further, there are other interesting properties of the proposed $ν$-SVQR model, which we have briefly described in this paper. These properties have been empirically verified using the artificial and real world dataset also. △ Less

Submitted 21 October, 2019; originally announced October 2019.

arXiv:1908.06923 [pdf, other]

A new asymmetric $ε$-insensitive pinball loss function based support vector quantile regression model

Authors: Pritam Anand, Reshma Rastogi, Suresh Chandra

Abstract: In this paper, we propose a novel asymmetric $ε$-insensitive pinball loss function for quantile estimation. There exists some pinball loss functions which attempt to incorporate the $ε$-insensitive zone approach in it but, they fail to extend the $ε$-insensitive approach for quantile estimation in true sense. The proposed asymmetric $ε$-insensitive pinball loss function can make an asymmetric $ε$-… ▽ More In this paper, we propose a novel asymmetric $ε$-insensitive pinball loss function for quantile estimation. There exists some pinball loss functions which attempt to incorporate the $ε$-insensitive zone approach in it but, they fail to extend the $ε$-insensitive approach for quantile estimation in true sense. The proposed asymmetric $ε$-insensitive pinball loss function can make an asymmetric $ε$- insensitive zone of fixed width around the data and divide it using $τ$ value for the estimation of the $τ$th quantile. The use of the proposed asymmetric $ε$-insensitive pinball loss function in Support Vector Quantile Regression (SVQR) model improves its prediction ability significantly. It also brings the sparsity back in SVQR model. Further, the numerical results obtained by several experiments carried on artificial and real world datasets empirically show the efficacy of the proposed `$ε$-Support Vector Quantile Regression' ($ε$-SVQR) model over other existing SVQR models. △ Less

Submitted 19 August, 2019; originally announced August 2019.

arXiv:1904.12331 [pdf, other]

Support Vector Regression via a Combined Reward Cum Penalty Loss Function

Authors: Pritam Anand, Reshma Rastogi, Suresh Chandra

Abstract: In this paper, we introduce a novel combined reward cum penalty loss function to handle the regression problem. The proposed combined reward cum penalty loss function penalizes the data points which lie outside the $ε$-tube of the regressor and also assigns reward for the data points which lie inside of the $ε$-tube of the regressor. The combined reward cum penalty loss function based regression (… ▽ More In this paper, we introduce a novel combined reward cum penalty loss function to handle the regression problem. The proposed combined reward cum penalty loss function penalizes the data points which lie outside the $ε$-tube of the regressor and also assigns reward for the data points which lie inside of the $ε$-tube of the regressor. The combined reward cum penalty loss function based regression (RP-$ε$-SVR) model has several interesting properties which are investigated in this paper and are also supported with the experimental results. △ Less

Submitted 3 May, 2020; v1 submitted 28 April, 2019; originally announced April 2019.

Comments: For any assistance , reader can contact on email with Pritam Anand. Email id - [email protected]. The valuable opinion/comments on the work are welcomed. Looking for collaboration especially for speeding up the solution of optimization problems

arXiv:1904.08775 [pdf, other]

Few Shot Speaker Recognition using Deep Neural Networks

Authors: Prashant Anand, Ajeet Kumar Singh, Siddharth Srivastava, Brejesh Lall

Abstract: The recent advances in deep learning are mostly driven by availability of large amount of training data. However, availability of such data is not always possible for specific tasks such as speaker recognition where collection of large amount of data is not possible in practical scenarios. Therefore, in this paper, we propose to identify speakers by learning from only a few training examples. To a… ▽ More The recent advances in deep learning are mostly driven by availability of large amount of training data. However, availability of such data is not always possible for specific tasks such as speaker recognition where collection of large amount of data is not possible in practical scenarios. Therefore, in this paper, we propose to identify speakers by learning from only a few training examples. To achieve this, we use a deep neural network with prototypical loss where the input to the network is a spectrogram. For output, we project the class feature vectors into a common embedding space, followed by classification. Further, we show the effectiveness of capsule net in a few shot learning setting. To this end, we utilize an auto-encoder to learn generalized feature embeddings from class-specific embeddings obtained from capsule network. We provide exhaustive experiments on publicly available datasets and competitive baselines, demonstrating the superiority and generalization ability of the proposed few shot learning pipelines. △ Less

Submitted 17 April, 2019; originally announced April 2019.

arXiv:1902.06024 [pdf, other]

CruzAffect at AffCon 2019 Shared Task: A feature-rich approach to characterize happiness

Authors: Jiaqi Wu, Ryan Compton, Geetanjali Rakshit, Marilyn Walker, Pranav Anand, Steve Whittaker

Abstract: We present our system, CruzAffect, for the CL-Aff Shared Task 2019. CruzAffect consists of several types of robust and efficient models for affective classification tasks. We utilize both traditional classifiers, such as XGBoosted Forest, as well as a deep learning Convolutional Neural Networks (CNN) classifier. We explore rich feature sets such as syntactic features, emotional features, and profi… ▽ More We present our system, CruzAffect, for the CL-Aff Shared Task 2019. CruzAffect consists of several types of robust and efficient models for affective classification tasks. We utilize both traditional classifiers, such as XGBoosted Forest, as well as a deep learning Convolutional Neural Networks (CNN) classifier. We explore rich feature sets such as syntactic features, emotional features, and profile features, and utilize several sentiment lexicons, to discover essential indicators of social involvement and control that a subject might exercise in their happy moments, as described in textual snippets from the HappyDB database. The data comes with a labeled set (10K), and a larger unlabeled set (70K). We therefore use supervised methods on the 10K dataset, and a bootstrapped semi-supervised approach for the 70K. We evaluate these models for binary classification of agency and social labels (Task 1), as well as multi-class prediction for concepts labels (Task 2). We obtain promising results on the held-out data, suggesting that the proposed feature sets effectively represent the data for affective classification tasks. We also build concepts models that discover general themes recurring in happy moments. Our results indicate that generic characteristics are shared between the classes of agency, social and concepts, suggesting it should be possible to build general models for affective classification tasks. △ Less

Submitted 15 February, 2019; originally announced February 2019.

Comments: Workshop on Affective Content Analysis (AffCon) 2019, Workshop of Association for the Advancement of Artificial Intelligence (AAAI) 2019, Hawaii, USA January 2019

arXiv:1711.00092 [pdf, ps, other]

Summarizing Dialogic Arguments from Social Media

Authors: Amita Misra, Shereen Oraby, Shubhangi Tandon, Sharath TS, Pranav Anand, Marilyn Walker

Abstract: Online argumentative dialog is a rich source of information on popular beliefs and opinions that could be useful to companies as well as governmental or public policy agencies. Compact, easy to read, summaries of these dialogues would thus be highly valuable. A priori, it is not even clear what form such a summary should take. Previous work on summarization has primarily focused on summarizing wri… ▽ More Online argumentative dialog is a rich source of information on popular beliefs and opinions that could be useful to companies as well as governmental or public policy agencies. Compact, easy to read, summaries of these dialogues would thus be highly valuable. A priori, it is not even clear what form such a summary should take. Previous work on summarization has primarily focused on summarizing written texts, where the notion of an abstract of the text is well defined. We collect gold standard training data consisting of five human summaries for each of 161 dialogues on the topics of Gay Marriage, Gun Control and Abortion. We present several different computational models aimed at identifying segments of the dialogues whose content should be used for the summary, using linguistic features and Word2vec features with both SVMs and Bidirectional LSTMs. We show that we can identify the most important arguments by using the dialog context with a best F-measure of 0.74 for gun control, 0.71 for gay marriage, and 0.67 for abortion. △ Less

Submitted 31 October, 2017; originally announced November 2017.

Comments: Proceedings of the 21th Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2017)

arXiv:1709.00662 [pdf, other]

doi 10.3115/v1/n15-1046

Using Summarization to Discover Argument Facets in Online Ideological Dialog

Authors: Amita Misra, Pranav Anand, Jean E Fox Tree, Marilyn Walker

Abstract: More and more of the information available on the web is dialogic, and a significant portion of it takes place in online forum conversations about current social and political topics. We aim to develop tools to summarize what these conversations are about. What are the CENTRAL PROPOSITIONS associated with different stances on an issue, what are the abstract objects under discussion that are centra… ▽ More More and more of the information available on the web is dialogic, and a significant portion of it takes place in online forum conversations about current social and political topics. We aim to develop tools to summarize what these conversations are about. What are the CENTRAL PROPOSITIONS associated with different stances on an issue, what are the abstract objects under discussion that are central to a speaker's argument? How can we recognize that two CENTRAL PROPOSITIONS realize the same FACET of the argument? We hypothesize that the CENTRAL PROPOSITIONS are exactly those arguments that people find most salient, and use human summarization as a probe for discovering them. We describe our corpus of human summaries of opinionated dialogs, then show how we can identify similar repeated arguments, and group them into FACETS across many discussions of a topic. We define a new task, ARGUMENT FACET SIMILARITY (AFS), and show that we can predict AFS with a .54 correlation score, versus an ngram system baseline of .39 and a semantic textual similarity system baseline of .45. △ Less

Submitted 3 September, 2017; originally announced September 2017.

Comments: @inproceedings{Misra2015UsingST,title={Using Summarization to Discover Argument Facets in Online Idealogical Dialog},author={Amita Misra and Pranav Anand and Jean E. Fox Tree and Marilyn A. Walker},booktitle={HLT-NAACL},year={2015}}

arXiv:1709.00094 [pdf, other]

Linguistic Reflexes of Well-Being and Happiness in Echo

Authors: Jiaqi Wu, Marilyn Walker, Pranav Anand, Steve Whittaker

Abstract: Different theories posit different sources for feelings of well-being and happiness. Appraisal theory grounds our emotional responses in our goals and desires and their fulfillment, or lack of fulfillment. Self Determination theory posits that the basis for well-being rests on our assessment of our competence, autonomy, and social connection. And surveys that measure happiness empirically note tha… ▽ More Different theories posit different sources for feelings of well-being and happiness. Appraisal theory grounds our emotional responses in our goals and desires and their fulfillment, or lack of fulfillment. Self Determination theory posits that the basis for well-being rests on our assessment of our competence, autonomy, and social connection. And surveys that measure happiness empirically note that people require their basic needs to be met for food and shelter, but beyond that tend to be happiest when socializing, eating or having sex. We analyze a corpus of private microblogs from a well-being application called ECHO, where users label each written post about daily events with a happiness score between 1 and 9. Our goal is to ground the linguistic descriptions of events that users experience in theories of well-being and happiness, and then examine the extent to which different theoretical accounts can explain the variance in the happiness scores. We show that recurrent event types, such as OBLIGATION and INCOMPETENCE, which affect people's feelings of well-being are not captured in current lexical or semantic resources. △ Less

Submitted 31 August, 2017; originally announced September 2017.

Comments: 11 pages, 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA 2017)

arXiv:1708.09789 [pdf, ps, other]

Learning Lexico-Functional Patterns for First-Person Affect

Authors: Lena Reed, Jiaqi Wu, Shereen Oraby, Pranav Anand, Marilyn Walker

Abstract: Informal first-person narratives are a unique resource for computational models of everyday events and people's affective reactions to them. People blogging about their day tend not to explicitly say I am happy. Instead they describe situations from which other humans can readily infer their affective reactions. However current sentiment dictionaries are missing much of the information needed to m… ▽ More Informal first-person narratives are a unique resource for computational models of everyday events and people's affective reactions to them. People blogging about their day tend not to explicitly say I am happy. Instead they describe situations from which other humans can readily infer their affective reactions. However current sentiment dictionaries are missing much of the information needed to make similar inferences. We build on recent work that models affect in terms of lexical predicate functions and affect on the predicate's arguments. We present a method to learn proxies for these functions from first-person narratives. We construct a novel fine-grained test set, and show that the patterns we learn improve our ability to predict first-person affective reactions to everyday events, from a Stanford sentiment baseline of .67F to .75F. △ Less

Submitted 31 August, 2017; originally announced August 2017.

Comments: 7 pages, Association for Computational Linguistics (ACL) 2017

arXiv:1708.09085 [pdf, other]

Argument Strength is in the Eye of the Beholder: Audience Effects in Persuasion

Authors: Stephanie M. Lukin, Pranav Anand, Marilyn Walker, Steve Whittaker

Abstract: Americans spend about a third of their time online, with many participating in online conversations on social and political issues. We hypothesize that social media arguments on such issues may be more engaging and persuasive than traditional media summaries, and that particular types of people may be more or less convinced by particular styles of argument, e.g. emotional arguments may resonate wi… ▽ More Americans spend about a third of their time online, with many participating in online conversations on social and political issues. We hypothesize that social media arguments on such issues may be more engaging and persuasive than traditional media summaries, and that particular types of people may be more or less convinced by particular styles of argument, e.g. emotional arguments may resonate with some personalities while factual arguments resonate with others. We report a set of experiments testing at large scale how audience variables interact with argument style to affect the persuasiveness of an argument, an under-researched topic within natural language processing. We show that belief change is affected by personality factors, with conscientious, open and agreeable people being more convinced by emotional arguments. △ Less

Submitted 29 August, 2017; originally announced August 2017.

Comments: European Chapter of the Association for Computational Linguistics (EACL 2017)

arXiv:1708.09040 [pdf, ps, other]

Modelling Protagonist Goals and Desires in First-Person Narrative

Authors: Elahe Rahimtoroghi, Jiaqi Wu, Ruimin Wang, Pranav Anand, Marilyn A Walker

Abstract: Many genres of natural language text are narratively structured, a testament to our predilection for organizing our experiences as narratives. There is broad consensus that understanding a narrative requires identifying and tracking the goals and desires of the characters and their narrative outcomes. However, to date, there has been limited work on computational models for this problem. We introd… ▽ More Many genres of natural language text are narratively structured, a testament to our predilection for organizing our experiences as narratives. There is broad consensus that understanding a narrative requires identifying and tracking the goals and desires of the characters and their narrative outcomes. However, to date, there has been limited work on computational models for this problem. We introduce a new dataset, DesireDB, which includes gold-standard labels for identifying statements of desire, textual evidence for desire fulfillment, and annotations for whether the stated desire is fulfilled given the evidence in the narrative context. We report experiments on tracking desire fulfillment using different methods, and show that LSTM Skip-Thought model achieves F-measure of 0.7 on our corpus. △ Less

Submitted 29 August, 2017; originally announced August 2017.

Comments: 10 pages, 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2017)

arXiv:cs/0107006 [pdf, ps, other]

Looking Under the Hood : Tools for Diagnosing your Question Answering Engine

Authors: Eric Breck, Marc Light, Gideon S. Mann, Ellen Riloff, Brianne Brown Pranav Anand, Mats Rooth, Michael Thelen

Abstract: In this paper we analyze two question answering tasks : the TREC-8 question answering task and a set of reading comprehension exams. First, we show that Q/A systems perform better when there are multiple answer opportunities per question. Next, we analyze common approaches to two subproblems: term overlap for answer sentence identification, and answer ty** for short answer extraction. We prese… ▽ More In this paper we analyze two question answering tasks : the TREC-8 question answering task and a set of reading comprehension exams. First, we show that Q/A systems perform better when there are multiple answer opportunities per question. Next, we analyze common approaches to two subproblems: term overlap for answer sentence identification, and answer ty** for short answer extraction. We present general tools for analyzing the strengths and limitations of techniques for these subproblems. Our results quantify the limitations of both term overlap and answer ty** to distinguish between competing answer candidates. △ Less

Submitted 3 July, 2001; originally announced July 2001.

Comments: Revision of paper appearing in the Proceedings of the Workshop on Open-Domain Question Answering

ACM Class: I.2.7

Showing 1–25 of 25 results for author: Anand, P