Search | arXiv e-print repository

Set-based State Estimation with Probabilistic Consistency Guarantee under Epistemic Uncertainty

Authors: Shen Li, Theodoros Stouraitis, Michael Gienger, Sethu Vijayakumar, Julie A. Shah

Abstract: Consistent state estimation is challenging, especially under the epistemic uncertainties arising from learned (nonlinear) dynamic and observation models. In this work, we propose a set-based estimation algorithm, named Gaussian Process-Zonotopic Kalman Filter (GP-ZKF), that produces zonotopic state estimates while respecting both the epistemic uncertainties in the learned models and aleatoric unce… ▽ More Consistent state estimation is challenging, especially under the epistemic uncertainties arising from learned (nonlinear) dynamic and observation models. In this work, we propose a set-based estimation algorithm, named Gaussian Process-Zonotopic Kalman Filter (GP-ZKF), that produces zonotopic state estimates while respecting both the epistemic uncertainties in the learned models and aleatoric uncertainties. Our method guarantees probabilistic consistency, in the sense that the true states are bounded by sets (zonotopes) across all time steps, with high probability. We formally relate GP-ZKF with the corresponding stochastic approach, GP-EKF, in the case of learned (nonlinear) models. In particular, when linearization errors and aleatoric uncertainties are omitted and epistemic uncertainties are simplified, GP-ZKF reduces to GP-EKF. We empirically demonstrate our method's efficacy in both a simulated pendulum domain and a real-world robot-assisted dressing domain, where GP-ZKF produced more consistent and less conservative set-based estimates than all baseline stochastic methods. △ Less

Submitted 25 February, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

Comments: Published at IEEE Robotics and Automation Letters, 2022. Video: https://www.youtube.com/watch?v=CvIPJlALaFU Copyright: 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any media, including reprinting/republishing for any purposes, creating new works, for resale or redistribution, or reuse of any copyrighted component of this work

arXiv:2110.07707 [pdf, ps, other]

On the parametrized Tate construction

Authors: J. D. Quigley, Jay Shah

Abstract: We introduce and study a genuine equivariant refinement of the Tate construction associated to an extension $\widehat{G}$ of a finite group $G$ by a compact Lie group $K$, which we call the parametrized Tate construction $(-)^{t_G K}$. Our main theorem establishes the coincidence of three conceptually distinct approaches to its construction when $K$ is also finite: one via recollement theory for t… ▽ More We introduce and study a genuine equivariant refinement of the Tate construction associated to an extension $\widehat{G}$ of a finite group $G$ by a compact Lie group $K$, which we call the parametrized Tate construction $(-)^{t_G K}$. Our main theorem establishes the coincidence of three conceptually distinct approaches to its construction when $K$ is also finite: one via recollement theory for the $K$-free $\widehat{G}$-family, another via parametrized ambidexterity for $G$-local systems, and the last via parametrized assembly maps. We also show that $(-)^{t_G K}$ uniquely admits the structure of a lax $G$-symmetric monoidal functor, thereby refining a theorem of Nikolaus and Scholze. Along the way, we apply a theorem of the second author to reprove a result of Ayala--Mazel-Gee--Rozenblyum on reconstructing a genuine $G$-spectrum from its geometric fixed points; our method of proof further yields a formula for the geometric fixed points of an $\mathcal{F}$-complete $G$-spectrum for any $G$-family $\mathcal{F}$. △ Less

Submitted 6 January, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

Comments: Revision and expansion of sections 3-5 of arXiv:1909.03920. 66 pages. v2: minor edits

MSC Class: 55P91

arXiv:2110.07550 [pdf, other]

The Irrationality of Neural Rationale Models

Authors: Yiming Zheng, Serena Booth, Julie Shah, Yilun Zhou

Abstract: Neural rationale models are popular for interpretable predictions of NLP tasks. In these, a selector extracts segments of the input text, called rationales, and passes these segments to a classifier for prediction. Since the rationale is the only information accessible to the classifier, it is plausibly defined as the explanation. Is such a characterization unconditionally correct? In this paper,… ▽ More Neural rationale models are popular for interpretable predictions of NLP tasks. In these, a selector extracts segments of the input text, called rationales, and passes these segments to a classifier for prediction. Since the rationale is the only information accessible to the classifier, it is plausibly defined as the explanation. Is such a characterization unconditionally correct? In this paper, we argue to the contrary, with both philosophical perspectives and empirical evidence suggesting that rationale models are, perhaps, less rational and interpretable than expected. We call for more rigorous and comprehensive evaluations of these models to ensure desired properties of interpretability are indeed achieved. The code can be found at https://github.com/yimingz89/Neural-Rationale-Analysis. △ Less

Submitted 23 July, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

Comments: NAACL Workshop on Trustworthy Natural Language Processing (TrustNLP) 2022

arXiv:2110.06567 [pdf, ps, other]

Recollements and stratification

Authors: Jay Shah

Abstract: We develop various aspects of the theory of recollements of $\infty$-categories, including a symmetric monoidal refinement of the theory. Our main result establishes a formula for the gluing functor of a recollement on the right-lax limit of a locally cocartesian fibration determined by a sieve-cosieve decomposition of the base. As an application, we prove a reconstruction theorem for sheaves in a… ▽ More We develop various aspects of the theory of recollements of $\infty$-categories, including a symmetric monoidal refinement of the theory. Our main result establishes a formula for the gluing functor of a recollement on the right-lax limit of a locally cocartesian fibration determined by a sieve-cosieve decomposition of the base. As an application, we prove a reconstruction theorem for sheaves in an $\infty$-topos stratified over a finite poset $P$ in the sense of Barwick-Glasman-Haine. Combining our theorem with methods from the work of Ayala-Mazel-Gee-Rozenblyum, we then prove a conjecture of Barwick-Glasman-Haine that asserts an equivalence between the $\infty$-category of $P$-stratified $\infty$-topoi and that of toposic locally cocartesian fibrations over $P^{\mathrm{op}}$. △ Less

Submitted 12 January, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

Comments: Revision and expansion of sections 1 and 2 of arXiv:1909.03920. 47 pages. v2: minor changes

MSC Class: 18N60; 18N70

arXiv:2110.04192 [pdf, other]

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Authors: Lindsay Sanneman, Julie Shah

Abstract: Explainable AI techniques that describe agent reward functions can enhance human-robot collaboration in a variety of settings. One context where human understanding of agent reward functions is particularly beneficial is in the value alignment setting. In the value alignment context, an agent aims to infer a human's reward function through interaction so that it can assist the human with their tas… ▽ More Explainable AI techniques that describe agent reward functions can enhance human-robot collaboration in a variety of settings. One context where human understanding of agent reward functions is particularly beneficial is in the value alignment setting. In the value alignment context, an agent aims to infer a human's reward function through interaction so that it can assist the human with their tasks. If the human can understand where gaps exist in the agent's reward understanding, they will be able to teach more efficiently and effectively, leading to quicker human-agent team performance improvements. In order to support human collaborators in the value alignment setting and similar contexts, it is first important to understand the effectiveness of different reward explanation techniques in a variety of domains. In this paper, we introduce a categorization of information modalities for reward explanation techniques, suggest a suite of assessment techniques for human reward understanding, and introduce four axes of domain complexity. We then propose an experiment to study the relative efficacy of a broad set of reward explanation techniques covering multiple modalities of information in a set of domains of varying complexity. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

Report number: AIHRI/2021/40

arXiv:2109.11954 [pdf, ps, other]

Parametrized higher category theory II: Universal constructions

Authors: Jay Shah

Abstract: We develop parametrized generalizations of a number of fundamental concepts in the theory of $\infty$-categories, including factorization systems, free fibrations, exponentiable fibrations, relative colimits and relative Kan extensions, filtered and sifted diagrams, and the universal constructions Ind and $P^Σ$. We develop parametrized generalizations of a number of fundamental concepts in the theory of $\infty$-categories, including factorization systems, free fibrations, exponentiable fibrations, relative colimits and relative Kan extensions, filtered and sifted diagrams, and the universal constructions Ind and $P^Σ$. △ Less

Submitted 8 January, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

Comments: 42 pages. v2: added reference to Hinich's work on promonoidal Day convolution

MSC Class: 18N60; 18N70

arXiv:2108.07163 [pdf]

Causal Impact Of European Union Emission Trading Scheme On Firm Behaviour And Economic Performance: A Study Of German Manufacturing Firms

Authors: Nitish Gupta, Jay Shah, Satwik Gupta, Ruchir Kaul

Abstract: In this paper, we estimate the causal impact (i.e. Average Treatment Effect, ATT) of the EU ETS on GHG emissions and firm competitiveness (primarily measured by employment, turnover, and exports levels) by combining a difference-in-differences approach with semi-parametric matching techniques and estimators an to investigate the effect of the EU ETS on the economic performance of these German manu… ▽ More In this paper, we estimate the causal impact (i.e. Average Treatment Effect, ATT) of the EU ETS on GHG emissions and firm competitiveness (primarily measured by employment, turnover, and exports levels) by combining a difference-in-differences approach with semi-parametric matching techniques and estimators an to investigate the effect of the EU ETS on the economic performance of these German manufacturing firms using a Stochastic Production Frontier model. △ Less

Submitted 16 August, 2021; originally announced August 2021.

Comments: 19 pages

arXiv:2108.07116 [pdf]

Study Of German Manufacturing Firms: Causal Impact Of European Union Emission Trading Scheme On Firm Behaviour And Economic Performance

Authors: Nitish Gupta, Ruchir Kaul, Satwik Gupta, Jay Shah

Abstract: The results based on the nonparametric nearest neighbor matching suggest a statistically significant positive effect of the EU ETS on the economic performance of the regulated firms during Phase I of the EU ETS. A year-by-year analysis shows that the effect was only significant during the first year of Phase I. The EU ETS, therefore, had a particularly strong effect when it was introduced. It is i… ▽ More The results based on the nonparametric nearest neighbor matching suggest a statistically significant positive effect of the EU ETS on the economic performance of the regulated firms during Phase I of the EU ETS. A year-by-year analysis shows that the effect was only significant during the first year of Phase I. The EU ETS, therefore, had a particularly strong effect when it was introduced. It is important to note that the EU ETS does not homogeneously affect firms in the manufacturing sector. We found a significant positive impact of EU ETS on the economic performance of regulated firms in the paper industry. △ Less

Submitted 16 August, 2021; originally announced August 2021.

Comments: 23 pages

arXiv:2108.01828 [pdf, other]

Emergent Discrete Communication in Semantic Spaces

Authors: Mycal Tucker, Huao Li, Siddharth Agrawal, Dana Hughes, Katia Sycara, Michael Lewis, Julie Shah

Abstract: Neural agents trained in reinforcement learning settings can learn to communicate among themselves via discrete tokens, accomplishing as a team what agents would be unable to do alone. However, the current standard of using one-hot vectors as discrete communication tokens prevents agents from acquiring more desirable aspects of communication such as zero-shot understanding. Inspired by word embedd… ▽ More Neural agents trained in reinforcement learning settings can learn to communicate among themselves via discrete tokens, accomplishing as a team what agents would be unable to do alone. However, the current standard of using one-hot vectors as discrete communication tokens prevents agents from acquiring more desirable aspects of communication such as zero-shot understanding. Inspired by word embedding techniques from natural language processing, we propose neural agent architectures that enables them to communicate via discrete tokens derived from a learned, continuous space. We show in a decision theoretic framework that our technique optimizes communication over a wide range of scenarios, whereas one-hot tokens are only optimal under restrictive assumptions. In self-play experiments, we validate that our trained agents learn to cluster tokens in semantically-meaningful ways, allowing them communicate in noisy environments where other techniques fail. Lastly, we demonstrate both that agents using our method can effectively respond to novel human communication and that humans can understand unlabeled emergent agent communication, outperforming the use of one-hot communication. △ Less

Submitted 4 November, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

arXiv:2107.02912 [pdf, other]

Supervised Bayesian Specification Inference from Demonstrations

Authors: Ankit Shah, Pritish Kamath, Shen Li, Patrick Craven, Kevin Landers, Kevin Oden, Julie Shah

Abstract: When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task. Prior research into learning from demonstrations (LfD) has failed to capture this notion of the acceptability of a task's execution; meanwhile, temporal logics provide a flexible language for expressing task specific… ▽ More When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task. Prior research into learning from demonstrations (LfD) has failed to capture this notion of the acceptability of a task's execution; meanwhile, temporal logics provide a flexible language for expressing task specifications. Inspired by this, we present Bayesian specification inference, a probabilistic model for inferring task specification as a temporal logic formula. We incorporate methods from probabilistic programming to define our priors, along with a domain-independent likelihood function to enable sampling-based inference. We demonstrate the efficacy of our model for inferring specifications, with over 90% similarity observed between the inferred specification and the ground truth, both within a synthetic domain and during a real-world table setting task. △ Less

Submitted 6 July, 2021; originally announced July 2021.

arXiv:2104.14403 [pdf, other]

Do Feature Attribution Methods Correctly Attribute Features?

Authors: Yilun Zhou, Serena Booth, Marco Tulio Ribeiro, Julie Shah

Abstract: Feature attribution methods are popular in interpretable machine learning. These methods compute the attribution of each input feature to represent its importance, but there is no consensus on the definition of "attribution", leading to many competing methods with little systematic evaluation, complicated in particular by the lack of ground truth attribution. To address this, we propose a dataset… ▽ More Feature attribution methods are popular in interpretable machine learning. These methods compute the attribution of each input feature to represent its importance, but there is no consensus on the definition of "attribution", leading to many competing methods with little systematic evaluation, complicated in particular by the lack of ground truth attribution. To address this, we propose a dataset modification procedure to induce such ground truth. Using this procedure, we evaluate three common methods: saliency maps, rationales, and attentions. We identify several deficiencies and add new perspectives to the growing body of evidence questioning the correctness and reliability of these methods applied on datasets in the wild. We further discuss possible avenues for remedy and recommend new attribution methods to be tested against ground truth before deployment. The code is available at https://github.com/YilunZhou/feature-attribution-evaluation △ Less

Submitted 15 December, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

Comments: AAAI 2022. Video summary at https://www.youtube.com/watch?v=kAodFw6jvvo

arXiv:2104.08668 [pdf, other]

Generating Related Work

Authors: Darsh J Shah, Regina Barzilay

Abstract: Communicating new research ideas involves highlighting similarities and differences with past work. Authors write fluent, often long sections to survey the distinction of a new paper with related work. In this work we model generating related work sections while being cognisant of the motivation behind citing papers. Our content planning model generates a tree of cited papers before a surface real… ▽ More Communicating new research ideas involves highlighting similarities and differences with past work. Authors write fluent, often long sections to survey the distinction of a new paper with related work. In this work we model generating related work sections while being cognisant of the motivation behind citing papers. Our content planning model generates a tree of cited papers before a surface realization model lexicalizes this skeleton. Our model outperforms several strong state-of-the-art summarization and multi-document summarization models on generating related work on an ACL Anthology (AA) based dataset which we contribute. △ Less

Submitted 17 April, 2021; originally announced April 2021.

arXiv:2104.03465 [pdf, other]

Nutribullets Hybrid: Multi-document Health Summarization

Authors: Darsh J Shah, Lili Yu, Tao Lei, Regina Barzilay

Abstract: We present a method for generating comparative summaries that highlights similarities and contradictions in input documents. The key challenge in creating such summaries is the lack of large parallel training data required for training typical summarization systems. To this end, we introduce a hybrid generation approach inspired by traditional concept-to-text systems. To enable accurate comparison… ▽ More We present a method for generating comparative summaries that highlights similarities and contradictions in input documents. The key challenge in creating such summaries is the lack of large parallel training data required for training typical summarization systems. To this end, we introduce a hybrid generation approach inspired by traditional concept-to-text systems. To enable accurate comparison between different sources, the model first learns to extract pertinent relations from input documents. The content planning component uses deterministic operators to aggregate these relations after identifying a subset for inclusion into a summary. The surface realization component lexicalizes this information using a text-infilling language model. By separately modeling content selection and realization, we can effectively train them with limited annotations. We implemented and tested the model in the domain of nutrition and health -- rife with inconsistencies. Compared to conventional methods, our framework leads to more faithful, relevant and aggregation-sensitive summarization -- while being equally fluent. △ Less

Submitted 7 April, 2021; originally announced April 2021.

Comments: NAACL 2021 Camera Ready

arXiv:2104.01511 [pdf, other]

Late fusion of machine learning models using passively captured interpersonal social interactions and motion from smartphones predicts decompensation in heart failure

Authors: Ayse S. Cakmak, Samuel Densen, Gabriel Najarro, Pratik Rout, Christopher J. Rozell, Omer T. Inan, Amit J. Shah, Gari D. Clifford

Abstract: Objective: Worldwide, heart failure (HF) is a major cause of morbidity and mortality and one of the leading causes of hospitalization. Early detection of HF symptoms and pro-active management may reduce adverse events. Approach: Twenty-eight participants were monitored using a smartphone app after discharge from hospitals, and each clinical event during the enrollment (N=110 clinical events) was r… ▽ More Objective: Worldwide, heart failure (HF) is a major cause of morbidity and mortality and one of the leading causes of hospitalization. Early detection of HF symptoms and pro-active management may reduce adverse events. Approach: Twenty-eight participants were monitored using a smartphone app after discharge from hospitals, and each clinical event during the enrollment (N=110 clinical events) was recorded. Motion, social, location, and clinical survey data collected via the smartphone-based monitoring system were used to develop and validate an algorithm for predicting or classifying HF decompensation events (hospitalizations or clinic visit) versus clinic monitoring visits in which they were determined to be compensated or stable. Models based on single modality as well as early and late fusion approaches combining patient-reported outcomes and passive smartphone data were evaluated. Results: The highest AUCPr for classifying decompensation with a late fusion approach was 0.80 using leave one subject out cross-validation. Significance: Passively collected data from smartphones, especially when combined with weekly patient-reported outcomes, may reflect behavioral and physiological changes due to HF and thus could enable prediction of HF decompensation. △ Less

Submitted 3 April, 2021; originally announced April 2021.

arXiv:2103.15785 [pdf, ps, other]

Dualizable objects in stratified categories and the 1-dimensional bordism hypothesis for recollements

Authors: Grigory Kondyrev, Aaron Mazel-Gee, Jay Shah

Abstract: Given a monoidal $\infty$-category $C$ equipped with a monoidal recollement, we give a simple criterion for an object in $C$ to be dualizable in terms of the dualizability of each of its factors and a projection formula relating them. Predicated on this, we then characterize dualizability in any monoidally stratified $\infty$-category in terms of stratumwise dualizability and a projection formula… ▽ More Given a monoidal $\infty$-category $C$ equipped with a monoidal recollement, we give a simple criterion for an object in $C$ to be dualizable in terms of the dualizability of each of its factors and a projection formula relating them. Predicated on this, we then characterize dualizability in any monoidally stratified $\infty$-category in terms of stratumwise dualizability and a projection formula for the links. Using our criterion, we prove a 1-dimensional bordism hypothesis for symmetric monoidal recollements. Namely, we provide an algebraic enhancement of the 1-dimensional framed bordism $\infty$-category that corepresents dualizable objects in symmetric monoidal recollements. We also give a number of examples and applications of our criterion drawn from algebra and homotopy theory, including equivariant and cyclotomic spectra and a multiplicative form of the Thom isomorphism. △ Less

Submitted 29 March, 2021; originally announced March 2021.

Comments: 61 pages, comments welcome!

MSC Class: 18N70

arXiv:2103.15171 [pdf, other]

A Bayesian Approach to Identifying Representational Errors

Authors: Ramya Ramakrishnan, Vaibhav Unhelkar, Ece Kamar, Julie Shah

Abstract: Trained AI systems and expert decision makers can make errors that are often difficult to identify and understand. Determining the root cause for these errors can improve future decisions. This work presents Generative Error Model (GEM), a generative model for inferring representational errors based on observations of an actor's behavior (either simulated agent, robot, or human). The model conside… ▽ More Trained AI systems and expert decision makers can make errors that are often difficult to identify and understand. Determining the root cause for these errors can improve future decisions. This work presents Generative Error Model (GEM), a generative model for inferring representational errors based on observations of an actor's behavior (either simulated agent, robot, or human). The model considers two sources of error: those that occur due to representational limitations -- "blind spots" -- and non-representational errors, such as those caused by noise in execution or systematic errors present in the actor's policy. Disambiguating these two error types allows for targeted refinement of the actor's policy (i.e., representational errors require perceptual augmentation, while other errors can be reduced through methods such as improved training or attention support). We present a Bayesian inference algorithm for GEM and evaluate its utility in recovering representational errors on multiple domains. Results show that our approach can recover blind spots of both reinforcement learning agents as well as human users. △ Less

Submitted 28 March, 2021; originally announced March 2021.

arXiv:2103.14464 [pdf, other]

Reactive Task and Motion Planning under Temporal Logic Specifications

Authors: Shen Li, Daehyung Park, Yoonchang Sung, Julie A. Shah, Nicholas Roy

Abstract: We present a task-and-motion planning (TAMP) algorithm robust against a human operator's cooperative or adversarial interventions. Interventions often invalidate the current plan and require replanning on the fly. Replanning can be computationally expensive and often interrupts seamless task execution. We introduce a dynamically reconfigurable planning methodology with behavior tree-based control… ▽ More We present a task-and-motion planning (TAMP) algorithm robust against a human operator's cooperative or adversarial interventions. Interventions often invalidate the current plan and require replanning on the fly. Replanning can be computationally expensive and often interrupts seamless task execution. We introduce a dynamically reconfigurable planning methodology with behavior tree-based control strategies toward reactive TAMP, which takes the advantage of previous plans and incremental graph search during temporal logic-based reactive synthesis. Our algorithm also shows efficient recovery functionalities that minimize the number of replanning steps. Finally, our algorithm produces a robust, efficient, and complete TAMP solution. Our experimental results show the algorithm results in superior manipulation performance in both simulated and real-world tasks. △ Less

Submitted 26 March, 2021; originally announced March 2021.

Comments: 7 pages, 6 figures, Published in IEEE International Conference on Robotics and Automation (ICRA), 2021

arXiv:2103.11921 [pdf, other]

Nutri-bullets: Summarizing Health Studies by Composing Segments

Authors: Darsh J Shah, Lili Yu, Tao Lei, Regina Barzilay

Abstract: We introduce \emph{Nutri-bullets}, a multi-document summarization task for health and nutrition. First, we present two datasets of food and health summaries from multiple scientific studies. Furthermore, we propose a novel \emph{extract-compose} model to solve the problem in the regime of limited parallel data. We explicitly select key spans from several abstracts using a policy network, followed… ▽ More We introduce \emph{Nutri-bullets}, a multi-document summarization task for health and nutrition. First, we present two datasets of food and health summaries from multiple scientific studies. Furthermore, we propose a novel \emph{extract-compose} model to solve the problem in the regime of limited parallel data. We explicitly select key spans from several abstracts using a policy network, followed by composing the selected spans to present a summary via a task specific language model. Compared to state-of-the-art methods, our approach leads to more faithful, relevant and diverse summarization -- properties imperative to this application. For instance, on the BreastCancer dataset our approach gets a more than 50\% improvement on relevance and faithfulness.\footnote{Our code and data is available at \url{https://github.com/darsh10/Nutribullets.}} △ Less

Submitted 22 March, 2021; originally announced March 2021.

Comments: 12 pages

Journal ref: AAAI 2021 Camera Ready

arXiv:2102.08507 [pdf, other]

doi 10.1109/CogSIMA51574.2021.9475925

Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

Authors: Sangwon Seo, Lauren R. Kennedy-Metz, Marco A. Zenati, Julie A. Shah, Roger D. Dias, Vaibhav V. Unhelkar

Abstract: Shared mental models are critical to team success; however, in practice, team members may have misaligned models due to a variety of factors. In safety-critical domains (e.g., aviation, healthcare), lack of shared mental models can lead to preventable errors and harm. Towards the goal of mitigating such preventable errors, here, we present a Bayesian approach to infer misalignment in team members'… ▽ More Shared mental models are critical to team success; however, in practice, team members may have misaligned models due to a variety of factors. In safety-critical domains (e.g., aviation, healthcare), lack of shared mental models can lead to preventable errors and harm. Towards the goal of mitigating such preventable errors, here, we present a Bayesian approach to infer misalignment in team members' mental models during complex healthcare task execution. As an exemplary application, we demonstrate our approach using two simulated team-based scenarios, derived from actual teamwork in cardiac surgery. In these simulated experiments, our approach inferred model misalignment with over 75% recall, thereby providing a building block for enabling computer-assisted interventions to augment human cognition in the operating room and improve teamwork. △ Less

Submitted 16 February, 2021; originally announced February 2021.

Comments: Submitted to the 2021 IEEE Conference on Cognitive and Computational Aspects of Situation Management (CogSIMA)

MSC Class: 68T37; 62F15 (Primary) 90C40; 62M05; 62P10; 91C99 (Secondary) ACM Class: I.2.m; G.3; J.3

arXiv:2101.00387 [pdf, other]

What all do audio transformer models hear? Probing Acoustic Representations for Language Delivery and its Structure

Authors: Jui Shah, Yaman Kumar Singla, Changyou Chen, Rajiv Ratn Shah

Abstract: In recent times, BERT based transformer models have become an inseparable part of the 'tech stack' of text processing models. Similar progress is being observed in the speech domain with a multitude of models observing state-of-the-art results by using audio transformer models to encode speech. This begs the question of what are these audio transformer models learning. Moreover, although the stand… ▽ More In recent times, BERT based transformer models have become an inseparable part of the 'tech stack' of text processing models. Similar progress is being observed in the speech domain with a multitude of models observing state-of-the-art results by using audio transformer models to encode speech. This begs the question of what are these audio transformer models learning. Moreover, although the standard methodology is to choose the last layer embedding for any downstream task, but is it the optimal choice? We try to answer these questions for the two recent audio transformer models, Mockingjay and wave2vec2.0. We compare them on a comprehensive set of language delivery and structure features including audio, fluency and pronunciation features. Additionally, we probe the audio models' understanding of textual surface, syntax, and semantic features and compare them to BERT. We do this over exhaustive settings for native, non-native, synthetic, read and spontaneous speech datasets △ Less

Submitted 12 July, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

arXiv:2012.13615 [pdf, other]

RoCUS: Robot Controller Understanding via Sampling

Authors: Yilun Zhou, Serena Booth, Nadia Figueroa, Julie Shah

Abstract: As robots are deployed in complex situations, engineers and end users must develop a holistic understanding of their behaviors, capabilities, and limitations. Some behaviors are directly optimized by the objective function. They often include success rate, completion time or energy consumption. Other behaviors -- e.g., collision avoidance, trajectory smoothness or motion legibility -- are typicall… ▽ More As robots are deployed in complex situations, engineers and end users must develop a holistic understanding of their behaviors, capabilities, and limitations. Some behaviors are directly optimized by the objective function. They often include success rate, completion time or energy consumption. Other behaviors -- e.g., collision avoidance, trajectory smoothness or motion legibility -- are typically emergent but equally important for safe and trustworthy deployment. Designing an objective which optimizes every aspect of robot behavior is hard. In this paper, we advocate for systematic analysis of a wide array of behaviors for holistic understanding of robot controllers and, to this end, propose a framework, RoCUS, which uses Bayesian posterior sampling to find situations where the robot controller exhibits user-specified behaviors, such as highly jerky motions. We use RoCUS to analyze three controller classes (deep learning models, rapidly exploring random trees and dynamical system formulations) on two domains (2D navigation and a 7 degree-of-freedom arm reaching), and uncover insights to further our understanding of these controllers and ultimately improve their designs. △ Less

Submitted 14 October, 2021; v1 submitted 25 December, 2020; originally announced December 2020.

Comments: CoRL 2021. The project website is at https://yilunzhou.github.io/RoCUS/

arXiv:2012.00603 [pdf, ps, other]

Fourier Analysis and the closed form for the Zeta Function at even positive integers

Authors: Jibran Iqbal Shah

Abstract: Using a summation identity obtained for the Fourier coefficients of $x^{2k}$, we derive a closed form expression for the zeta function at even positive integers, using a technique similar to one in an existing proof by Aladdi and Defant[1], but in a simpler and shorter way. Using a summation identity obtained for the Fourier coefficients of $x^{2k}$, we derive a closed form expression for the zeta function at even positive integers, using a technique similar to one in an existing proof by Aladdi and Defant[1], but in a simpler and shorter way. △ Less

Submitted 3 December, 2020; v1 submitted 29 November, 2020; originally announced December 2020.

Comments: 6 pages, no figures. Comments welcome

MSC Class: 11R42(Primary); 11B68 (Secondary)

arXiv:2011.10898 [pdf, other]

Experimental Assessment of Human-Robot Teaming for Multi-Step Remote Manipulation with Expert Operators

Authors: Claudia Pérez-D'Arpino, Rebecca P. Khurshid, Julie A. Shah

Abstract: Remote robot manipulation with human control enables applications where safety and environmental constraints are adverse to humans (e.g. underwater, space robotics and disaster response) or the complexity of the task demands human-level cognition and dexterity (e.g. robotic surgery and manufacturing). These systems typically use direct teleoperation at the motion level, and are usually limited to… ▽ More Remote robot manipulation with human control enables applications where safety and environmental constraints are adverse to humans (e.g. underwater, space robotics and disaster response) or the complexity of the task demands human-level cognition and dexterity (e.g. robotic surgery and manufacturing). These systems typically use direct teleoperation at the motion level, and are usually limited to low-DOF arms and 2D perception. Improving dexterity and situational awareness demands new interaction and planning workflows. We explore the use of human-robot teaming through teleautonomy with assisted planning for remote control of a dual-arm dexterous robot for multi-step manipulation tasks, and conduct a within-subjects experimental assessment (n=12 expert users) to compare it with other methods, resulting in the following four conditions: (A) Direct teleoperation with imitation controller + 2D perception, (B) Condition A + 3D perception, (C) Teleautonomy interface teleoperation + 2D & 3D perception, (D) Condition C + assisted planning. The results indicate that this approach (D) achieves task times comparable with direct teleoperation (A,B) while improving a number of other objective and subjective metrics, including re-grasps, collisions, and TLX workload metrics. When compared to a similar interface but removing the assisted planning (C), D reduces the task time and removes a significant interaction with the level of expertise of the operator, resulting in a performance equalizer across users. △ Less

Submitted 21 November, 2020; originally announced November 2020.

Comments: For accompanying video, see https://sites.google.com/view/teleautonomy/

arXiv:2011.00249 [pdf, ps, other]

doi 10.1103/PhysRevB.103.195118

Renormalization group study of systems with quadratic band touching

Authors: Jeet Shah, Subroto Mukerjee

Abstract: Lifshitz transitions in two 2D systems with a single quadratic band touching point as the chemical potential is varied have been studied here. The effects of interactions have been studied using the renormalization group (RG) and it is found that at the transition a repulsive interaction is marginally relevant and an attractive interaction is marginally irrelevant. We corroborate the results obtai… ▽ More Lifshitz transitions in two 2D systems with a single quadratic band touching point as the chemical potential is varied have been studied here. The effects of interactions have been studied using the renormalization group (RG) and it is found that at the transition a repulsive interaction is marginally relevant and an attractive interaction is marginally irrelevant. We corroborate the results obtained from the RG calculation by studying a microscopic model whose ground state and Green's functions can be obtained exactly. We find that away from the transition, the system displays an instability towards forming and excitonic condensate. △ Less

Submitted 31 October, 2020; originally announced November 2020.

Comments: 12 pages, 6 figures

Journal ref: Phys. Rev. B 103, 195118 (2021)

arXiv:2010.14537 [pdf]

The State of Industrial Robotics: Emerging Technologies, Challenges, and Key Research Directions

Authors: Lindsay Sanneman, Christopher Fourie, Julie A. Shah

Abstract: Robotics and related technologies are central to the ongoing digitization and advancement of manufacturing. In recent years, a variety of strategic initiatives around the world including "Industry 4.0", introduced in Germany in 2011 have aimed to improve and connect manufacturing technologies in order to optimize production processes. In this work, we study the changing technological landscape of… ▽ More Robotics and related technologies are central to the ongoing digitization and advancement of manufacturing. In recent years, a variety of strategic initiatives around the world including "Industry 4.0", introduced in Germany in 2011 have aimed to improve and connect manufacturing technologies in order to optimize production processes. In this work, we study the changing technological landscape of robotics and "internet-of-things" (IoT)-based connective technologies over the last 7-10 years in the wake of Industry 4.0. We interviewed key players within the European robotics ecosystem, including robotics manufacturers and integrators, original equipment manufacturers (OEMs), and applied industrial research institutions and synthesize our findings in this paper. We first detail the state-of-the-art robotics and IoT technologies we observed and that the companies discussed during our interviews. We then describe the processes the companies follow when deciding whether and how to integrate new technologies, the challenges they face when integrating these technologies, and some immediate future technological avenues they are exploring in robotics and IoT. Finally, based on our findings, we highlight key research directions for the robotics community that can enable improved capabilities in the context of manufacturing. △ Less

Submitted 27 October, 2020; originally announced October 2020.

arXiv:2010.00607 [pdf, other]

doi 10.21468/SciPostPhys.10.6.148

Gauging the Kitaev chain

Authors: Umberto Borla, Ruben Verresen, Jeet Shah, Sergej Moroz

Abstract: We gauge the fermion parity symmetry of the Kitaev chain. While the bulk of the model becomes an Ising chain of gauge-invariant spins in a tilted field, near the boundaries the global fermion parity symmetry survives gauging, leading to local gauge-invariant Majorana operators. In the absence of vortices, the Higgs phase exhibits fermionic symmetry-protected topological (SPT) order distinct from t… ▽ More We gauge the fermion parity symmetry of the Kitaev chain. While the bulk of the model becomes an Ising chain of gauge-invariant spins in a tilted field, near the boundaries the global fermion parity symmetry survives gauging, leading to local gauge-invariant Majorana operators. In the absence of vortices, the Higgs phase exhibits fermionic symmetry-protected topological (SPT) order distinct from the Kitaev chain. Moreover, the deconfined phase can be stable even in the presence of vortices. We also undertake a comprehensive study of a gently gauged model which interpolates between the ordinary and gauged Kitaev chains. This showcases rich quantum criticality and illuminates the topological nature of the Higgs phase. Even in the absence of superconducting terms, gauging leads to an SPT phase which is intrinsically gapless due to an emergent anomaly. △ Less

Submitted 19 May, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

Comments: V2. 6 figures in the main text, 2 figures in the appendix

Journal ref: SciPost Phys. 10, 148 (2021)

arXiv:2009.03468 [pdf]

Quad-Core RSA Processor with Countermeasure Against Power Analysis Attacks

Authors: Javad Bagherzadeh, Vishishtha Bothra, Disha Gujar, Sugandha Gupta, **al Shah

Abstract: Rivest-Shamir-Adleman (RSA) cryptosystem uses modular multiplication for encryption and decryption. So, performance of RSA can be drastically improved by optimizing modular multiplication. This paper proposes a new parallel, high-radix Montgomery multiplier for 1024 bits multi-core RSA processor. Each computation step operates in radix 4. The computation speed is increased by more than 4 times. We… ▽ More Rivest-Shamir-Adleman (RSA) cryptosystem uses modular multiplication for encryption and decryption. So, performance of RSA can be drastically improved by optimizing modular multiplication. This paper proposes a new parallel, high-radix Montgomery multiplier for 1024 bits multi-core RSA processor. Each computation step operates in radix 4. The computation speed is increased by more than 4 times. We also implement a True Random Number Generator based resilience block to protect the coprocessor against power attacks. △ Less

Submitted 7 September, 2020; originally announced September 2020.

arXiv:2008.07788 [pdf, other]

CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion

Authors: Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil

Abstract: Recently, Generative Adversarial Networks (GAN)-based methods have shown remarkable performance for the Voice Conversion and WHiSPer-to-normal SPeeCH (WHSP2SPCH) conversion. One of the key challenges in WHSP2SPCH conversion is the prediction of fundamental frequency (F0). Recently, authors have proposed state-of-the-art method Cycle-Consistent Generative Adversarial Networks (CycleGAN) for WHSP2SP… ▽ More Recently, Generative Adversarial Networks (GAN)-based methods have shown remarkable performance for the Voice Conversion and WHiSPer-to-normal SPeeCH (WHSP2SPCH) conversion. One of the key challenges in WHSP2SPCH conversion is the prediction of fundamental frequency (F0). Recently, authors have proposed state-of-the-art method Cycle-Consistent Generative Adversarial Networks (CycleGAN) for WHSP2SPCH conversion. The CycleGAN-based method uses two different models, one for Mel Cepstral Coefficients (MCC) map**, and another for F0 prediction, where F0 is highly dependent on the pre-trained model of MCC map**. This leads to additional non-linear noise in predicted F0. To suppress this noise, we propose Cycle-in-Cycle GAN (i.e., CinC-GAN). It is specially designed to increase the effectiveness in F0 prediction without losing the accuracy of MCC map**. We evaluated the proposed method on a non-parallel setting and analyzed on speaker-specific, and gender-specific tasks. The objective and subjective tests show that CinC-GAN significantly outperforms the CycleGAN. In addition, we analyze the CycleGAN and CinC-GAN for unseen speakers and the results show the clear superiority of CinC-GAN. △ Less

Submitted 18 August, 2020; originally announced August 2020.

Comments: Accepted in 28th European Signal Processing Conference (EUSIPCO), 2020

arXiv:2005.05940 [pdf]

Trust Considerations for Explainable Robots: A Human Factors Perspective

Authors: Lindsay Sanneman, Julie A. Shah

Abstract: Recent advances in artificial intelligence (AI) and robotics have drawn attention to the need for AI systems and robots to be understandable to human users. The explainable AI (XAI) and explainable robots literature aims to enhance human understanding and human-robot team performance by providing users with necessary information about AI and robot behavior. Simultaneously, the human factors litera… ▽ More Recent advances in artificial intelligence (AI) and robotics have drawn attention to the need for AI systems and robots to be understandable to human users. The explainable AI (XAI) and explainable robots literature aims to enhance human understanding and human-robot team performance by providing users with necessary information about AI and robot behavior. Simultaneously, the human factors literature has long addressed important considerations that contribute to human performance, including human trust in autonomous systems. In this paper, drawing from the human factors literature, we discuss three important trust-related considerations for the design of explainable robot systems: the bases of trust, trust calibration, and trust specificity. We further detail existing and potential metrics for assessing trust in robotic systems based on explanations provided by explainable robots. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: Presented at the 2020 Workshop on Assessing, Explaining, and Conveying Robot Proficiency for Human-Robot Teaming

Report number: RobotProficiency/2020/05

arXiv:2004.14190 [pdf]

Giant Nonreciprocity of Surface Acoustic Waves enabled by the Magnetoelastic Interaction

Authors: Piyush J. Shah, Derek A. Bas, Ivan Lisenkov, Alexei Matyushov, Nianxiang Sun, Michael R. Page

Abstract: Nonreciprocity, the defining characteristic of isolators, circulators and a wealth of other applications in radio/microwave communications technologies, is in general difficult to achieve as most physical systems incorporate symmetries that prevent the effect. In particular, acoustic waves are an important medium for information transport, but they are inherently symmetric in time. In this work, w… ▽ More Nonreciprocity, the defining characteristic of isolators, circulators and a wealth of other applications in radio/microwave communications technologies, is in general difficult to achieve as most physical systems incorporate symmetries that prevent the effect. In particular, acoustic waves are an important medium for information transport, but they are inherently symmetric in time. In this work, we report giant nonreciprocity in the transmission of surface acoustic waves (SAWs) on lithium niobate substrate coated with ferromagnet/insulator/ferromagnet (FeGaB/Al2O3/FeGaB) multilayer structure. We exploit this novel structure with a unique asymmetric band diagram, and expand on magnetoelastic coupling theory to show how the magnetic bands couple with acoustic waves only in a single direction. We measure 48.4 dB (ratio of 1:100,000) isolation which outperforms current state of the art microwave isolator devices in a novel acoustic wave system that facilitates unprecedented size, weight, and power reduction. Additionally, these results offer a promising platform to study nonreciprocal SAW devices. △ Less

Submitted 29 April, 2020; originally announced April 2020.

Comments: 16 pages, 7 figures

arXiv:2004.04931 [pdf]

doi 10.1016/j.cmpb.2020.105581

CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images

Authors: Asif Iqbal Khan, Junaid Latief Shah, Mudasir Bhat

Abstract: Background and Objective The novel Coronavirus also called COVID-19 originated in Wuhan, China in December 2019 and has now spread across the world. It has so far infected around 1.8 million people and claimed approximately 114,698 lives overall. As the number of cases are rapidly increasing, most of the countries are facing shortage of testing kits and resources. The limited quantity of testing… ▽ More Background and Objective The novel Coronavirus also called COVID-19 originated in Wuhan, China in December 2019 and has now spread across the world. It has so far infected around 1.8 million people and claimed approximately 114,698 lives overall. As the number of cases are rapidly increasing, most of the countries are facing shortage of testing kits and resources. The limited quantity of testing kits and increasing number of daily cases encouraged us to come up with a Deep Learning model that can aid radiologists and clinicians in detecting COVID-19 cases using chest X-rays. Methods In this study, we propose CoroNet, a Deep Convolutional Neural Network model to automatically detect COVID-19 infection from chest X-ray images. The proposed model is based on Xception architecture pre-trained on ImageNet dataset and trained end-to-end on a dataset prepared by collecting COVID-19 and other chest pneumonia X-ray images from two different publically available databases. Results and Conclusion CoroNet has been trained and tested on the prepared dataset and the experimental results show that our proposed model achieved an overall accuracy of 89.6%, and more importantly the precision and recall rate for COVID-19 cases are 93% and 98.2% for 4-class cases (COVID vs Pneumonia bacterial vs pneumonia viral vs normal). For 3-class classification (COVID vs Pneumonia vs normal), the proposed model produced a classification accuracy of 95%. The preliminary results of this study look promising which can be further improved as more training data becomes available. Overall, the proposed model substantially advances the current radiology based methodology and during COVID-19 pandemic, it can be very helpful tool for clinical practitioners and radiologists to aid them in diagnosis, quantification and follow-up of COVID-19 cases. △ Less

Submitted 12 June, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

Comments: 9 pages, 8 Figures and 8 Tables

Journal ref: Computer Methods and Programs in Biomedicine 196C (2020) 105581

arXiv:2003.02232 [pdf, other]

Interactive Robot Training for Non-Markov Tasks

Authors: Ankit Shah, Samir Wadhwania, Julie Shah

Abstract: Defining sound and complete specifications for robots using formal languages is challenging, while learning formal specifications directly from demonstrations can lead to over-constrained task policies. In this paper, we propose a Bayesian interactive robot training framework that allows the robot to learn from both demonstrations provided by a teacher, and that teacher's assessments of the robot'… ▽ More Defining sound and complete specifications for robots using formal languages is challenging, while learning formal specifications directly from demonstrations can lead to over-constrained task policies. In this paper, we propose a Bayesian interactive robot training framework that allows the robot to learn from both demonstrations provided by a teacher, and that teacher's assessments of the robot's task executions. We also present an active learning approach -- inspired by uncertainty sampling -- to identify the task execution with the most uncertain degree of acceptability. Through a simulated experiment, we demonstrate that our active learning approach identifies a teacher's intended task specification with an equivalent or greater similarity when compared to an approach that learns purely from demonstrations. Finally, we demonstrate the efficacy of our approach in a real-world setting through a user-study based on teaching a robot to set a dinner table. △ Less

Submitted 28 November, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

arXiv:2002.11416 [pdf, ps, other]

Analytical Equations based Prediction Approach for PM2.5 using Artificial Neural Network

Authors: Jalpa Shah, Biswajit Mishra

Abstract: Particulate matter pollution is one of the deadliest types of air pollution worldwide due to its significant impacts on the global environment and human health. Particulate Matter (PM2.5) is one of the important particulate pollutants to measure the Air Quality Index (AQI). The conventional instruments used by the air quality monitoring stations to monitor PM2.5 are costly, bulkier, time-consuming… ▽ More Particulate matter pollution is one of the deadliest types of air pollution worldwide due to its significant impacts on the global environment and human health. Particulate Matter (PM2.5) is one of the important particulate pollutants to measure the Air Quality Index (AQI). The conventional instruments used by the air quality monitoring stations to monitor PM2.5 are costly, bulkier, time-consuming, and power-hungry. Furthermore, due to limited data availability and non-scalability, these stations cannot provide high spatial and temporal resolution in real-time. To overcome the disadvantages of existing methodology this article presents analytical equations based prediction approach for PM2.5 using an Artificial Neural Network (ANN). Since the derived analytical equations for the prediction can be computed using a Wireless Sensor Node (WSN) or low-cost processing tool, it demonstrates the usefulness of the proposed approach. Moreover, the study related to correlation among the PM2.5 and other pollutants is performed to select the appropriate predictors. The large authenticate data set of Central Pollution Control Board (CPCB) online station, India is used for the proposed approach. The RMSE and coefficient of determination (R2) obtained for the proposed prediction approach using eight predictors are 1.7973 ug/m3 and 0.9986 respectively. While the proposed approach results show RMSE of 7.5372 ug/m3 and R2 of 0.9708 using three predictors. Therefore, the results demonstrate that the proposed approach is one of the promising approaches for monitoring PM2.5 without power-hungry gas sensors and bulkier analyzers. △ Less

Submitted 26 February, 2020; originally announced February 2020.

arXiv:2002.10248 [pdf, other]

Bayes-TrEx: a Bayesian Sampling Approach to Model Transparency by Example

Authors: Serena Booth, Yilun Zhou, Ankit Shah, Julie Shah

Abstract: Post-hoc explanation methods are gaining popularity for interpreting, understanding, and debugging neural networks. Most analyses using such methods explain decisions in response to inputs drawn from the test set. However, the test set may have few examples that trigger some model behaviors, such as high-confidence failures or ambiguous classifications. To address these challenges, we introduce a… ▽ More Post-hoc explanation methods are gaining popularity for interpreting, understanding, and debugging neural networks. Most analyses using such methods explain decisions in response to inputs drawn from the test set. However, the test set may have few examples that trigger some model behaviors, such as high-confidence failures or ambiguous classifications. To address these challenges, we introduce a flexible model inspection framework: Bayes-TrEx. Given a data distribution, Bayes-TrEx finds in-distribution examples with a specified prediction confidence. We demonstrate several use cases of Bayes-TrEx, including revealing highly confident (mis)classifications, visualizing class boundaries via ambiguous examples, understanding novel-class extrapolation behavior, and exposing neural network overconfidence. We use Bayes-TrEx to study classifiers trained on CLEVR, MNIST, and Fashion-MNIST, and we show that this framework enables more flexible holistic model analysis than just inspecting the test set. Code is available at https://github.com/serenabooth/Bayes-TrEx. △ Less

Submitted 16 December, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: Accepted at AAAI 2021

arXiv:2001.05994 [pdf, other]

Adversarially Guided Self-Play for Adopting Social Conventions

Authors: Mycal Tucker, Yilun Zhou, Julie Shah

Abstract: Robotic agents must adopt existing social conventions in order to be effective teammates. These social conventions, such as driving on the right or left side of the road, are arbitrary choices among optimal policies, but all agents on a successful team must use the same convention. Prior work has identified a method of combining self-play with paired input-output data gathered from existing agents… ▽ More Robotic agents must adopt existing social conventions in order to be effective teammates. These social conventions, such as driving on the right or left side of the road, are arbitrary choices among optimal policies, but all agents on a successful team must use the same convention. Prior work has identified a method of combining self-play with paired input-output data gathered from existing agents in order to learn their social convention without interacting with them. We build upon this work by introducing a technique called Adversarial Self-Play (ASP) that uses adversarial training to shape the space of possible learned policies and substantially improves learning efficiency. ASP only requires the addition of unpaired data: a dataset of outputs produced by the social convention without associated inputs. Theoretical analysis reveals how ASP shapes the policy space and the circumstances (when behaviors are clustered or exhibit some other structure) under which it offers the greatest benefits. Empirical results across three domains confirm ASP's advantages: it produces models that more closely match the desired social convention when given as few as two paired datapoints. △ Less

Submitted 7 October, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

Comments: 9 pages, 8 figures

arXiv:2001.03076 [pdf, other]

Sampling Prediction-Matching Examples in Neural Networks: A Probabilistic Programming Approach

Authors: Serena Booth, Ankit Shah, Yilun Zhou, Julie Shah

Abstract: Though neural network models demonstrate impressive performance, we do not understand exactly how these black-box models make individual predictions. This drawback has led to substantial research devoted to understand these models in areas such as robustness, interpretability, and generalization ability. In this paper, we consider the problem of exploring the prediction level sets of a classifier… ▽ More Though neural network models demonstrate impressive performance, we do not understand exactly how these black-box models make individual predictions. This drawback has led to substantial research devoted to understand these models in areas such as robustness, interpretability, and generalization ability. In this paper, we consider the problem of exploring the prediction level sets of a classifier using probabilistic programming. We define a prediction level set to be the set of examples for which the predictor has the same specified prediction confidence with respect to some arbitrary data distribution. Notably, our sampling-based method does not require the classifier to be differentiable, making it compatible with arbitrary classifiers. As a specific instantiation, if we take the classifier to be a neural network and the data distribution to be that of the training data, we can obtain examples that will result in specified predictions by the neural network. We demonstrate this technique with experiments on a synthetic dataset and MNIST. Such level sets in classification may facilitate human understanding of classification behaviors. △ Less

Submitted 9 January, 2020; originally announced January 2020.

Comments: AAAI 2020 Workshop on Statistical Relational AI (StarAI 2020)

arXiv:1912.11557 [pdf, ps, other]

Scheiderer motives and equivariant higher topos theory

Authors: Elden Elmanto, Jay Shah

Abstract: We give an algebro-geometric interpretation of $C_2$-equivariant stable homotopy theory by means of the $b$-topology introduced by Claus Scheiderer in his study of $2$-torsion phenomena in étale cohomology. To accomplish this, we first revisit and extend work of Scheiderer on equivariant topos theory by functorially associating to a $\infty$-topos $\mathscr{X}$ with $G$-action a presentable stable… ▽ More We give an algebro-geometric interpretation of $C_2$-equivariant stable homotopy theory by means of the $b$-topology introduced by Claus Scheiderer in his study of $2$-torsion phenomena in étale cohomology. To accomplish this, we first revisit and extend work of Scheiderer on equivariant topos theory by functorially associating to a $\infty$-topos $\mathscr{X}$ with $G$-action a presentable stable $\infty$-category $\mathrm{Sp}^G(\mathscr{X})$, which recovers the $\infty$-category $\mathrm{Sp}^G$ of genuine $G$-spectra when $\mathscr{X}$ is the terminal $G$-$\infty$-topos. Given a scheme $X$ with $1/2 \in \mathcal{O}_X$, our construction then specializes to produce an $\infty$-category $\mathrm{Sp}^{C_2}_b(X)$ of "$b$-sheaves with transfers" as $b$-sheaves of spectra on the small étale site of $X$ equipped with certain transfers along the extension $X[i] \rightarrow X$; if $X$ is the spectrum of a real closed field, then $\mathrm{Sp}^{C_2}_b(X)$ recovers $\mathrm{Sp}^{C_2}$. On a large class of schemes, we prove that, after $p$-completion, our construction assembles into a premotivic functor satisfying the full six functors formalism. We then introduce the $b$-variant $\mathrm{SH}_b(X)$ of the $\infty$-category $\mathrm{SH}(X)$ of motivic spectra over $X$ (in the sense of Morel-Voevodsky), and produce a natural equivalence of $\infty$-categories $\mathrm{SH}_b(X)^{\wedge}_p \simeq \mathrm{Sp}^{C_2}_b(X)^{\wedge}_p$ through amalgamating the étale and real étale motivic rigidity theorems of Tom Bachmann. This involves a purely algebro-geometric construction of the $C_2$-Tate construction, which may be of independent interest. Finally, as applications, we deduce a "$b$-rigidity" theorem, use the Segal conjecture to show étale descent of the $2$-complete $b$-motivic sphere spectrum, and construct a parametrized version of the $C_2$-Betti realization functor of Heller-Ormsby. △ Less

Submitted 8 February, 2021; v1 submitted 24 December, 2019; originally announced December 2019.

Comments: 79 pages, minor revision, to appear in Advances in Mathematics

MSC Class: 14F42; 14P10; 55P91

arXiv:1911.07515 [pdf]

Automated Human Claustrum Segmentation using Deep Learning Technologies

Authors: Ahmed Awad Albishri, Syed Jawad Hussain Shah, Anthony Schmiedler, Seung Suk Kang, Yugyung Lee

Abstract: In recent years, Deep Learning (DL) has shown promising results in conducting AI tasks such as computer vision and image segmentation. Specifically, Convolutional Neural Network (CNN) models in DL have been applied to prevention,detection, and diagnosis in predictive medicine. Image segmentation plays a significant role in disease detection and prevention.However, there are enormous challenges in… ▽ More In recent years, Deep Learning (DL) has shown promising results in conducting AI tasks such as computer vision and image segmentation. Specifically, Convolutional Neural Network (CNN) models in DL have been applied to prevention,detection, and diagnosis in predictive medicine. Image segmentation plays a significant role in disease detection and prevention.However, there are enormous challenges in performing DL-based automatic segmentation due to the nature of medical images such as heterogeneous modalities and formats, insufficient labeled training data, and the high-class imbalance in the labeled data. Furthermore, automating segmentation of medical images,like magnetic resonance images (MRI), becomes a challenging task. The need for automated segmentation or annotation is what motivates our work. In this paper, we propose a fully automated approach that aims to segment the human claustrum for analytical purposes. We applied a U-Net CNN model to segment the claustrum (Cl) from a MRI dataset. With this approach, we have achieved an average Dice per case score of 0.72 for Cl segmentation, with K=5 for cross-validation. The expert in the medical domain also evaluates these results. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: 6 pages, 4 figures

arXiv:1910.10274 [pdf, other]

Capturing Greater Context for Question Generation

Authors: Luu Anh Tuan, Darsh J Shah, Regina Barzilay

Abstract: Automatic question generation can benefit many applications ranging from dialogue systems to reading comprehension. While questions are often asked with respect to long documents, there are many challenges with modeling such long documents. Many existing techniques generate questions by effectively looking at one sentence at a time, leading to questions that are easy and not reflective of the huma… ▽ More Automatic question generation can benefit many applications ranging from dialogue systems to reading comprehension. While questions are often asked with respect to long documents, there are many challenges with modeling such long documents. Many existing techniques generate questions by effectively looking at one sentence at a time, leading to questions that are easy and not reflective of the human process of question generation. Our goal is to incorporate interactions across multiple sentences to generate realistic questions for long documents. In order to link a broad document context to the target answer, we represent the relevant context via a multi-stage attention mechanism, which forms the foundation of a sequence to sequence model. We outperform state-of-the-art methods on question generation on three question-answering datasets -- SQuAD, MS MARCO and NewsQA. △ Less

Submitted 22 October, 2019; originally announced October 2019.

arXiv:1909.13838 [pdf, other]

Automatic Fact-guided Sentence Modification

Authors: Darsh J Shah, Tal Schuster, Regina Barzilay

Abstract: Online encyclopediae like Wikipedia contain large amounts of text that need frequent corrections and updates. The new information may contradict existing content in encyclopediae. In this paper, we focus on rewriting such dynamically changing articles. This is a challenging constrained generation task, as the output must be consistent with the new information and fit into the rest of the existing… ▽ More Online encyclopediae like Wikipedia contain large amounts of text that need frequent corrections and updates. The new information may contradict existing content in encyclopediae. In this paper, we focus on rewriting such dynamically changing articles. This is a challenging constrained generation task, as the output must be consistent with the new information and fit into the rest of the existing document. To this end, we propose a two-step solution: (1) We identify and remove the contradicting components in a target text for a given claim, using a neutralizing stance model; (2) We expand the remaining text to be consistent with the given claim, using a novel two-encoder sequence-to-sequence model with copy attention. Applied to a Wikipedia fact update dataset, our method successfully generates updated sentences for new claims, achieving the highest SARI score. Furthermore, we demonstrate that generating synthetic data through such rewritten sentences can successfully augment the FEVER fact-checking training dataset, leading to a relative error reduction of 13%. △ Less

Submitted 2 December, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

Comments: AAAI 2020

arXiv:1909.09008 [pdf]

doi 10.1021/acs.jpclett.0c00123

Formation of a Te-Ag Honeycomb Alloy: A New Type of Two-Dimensional Material

Authors: J. Shah, H. M. Sohail, R. I. G. Uhrberg, W. Wang

Abstract: Inspired by the unique properties of graphene, the focus in the literature is now on investigations of various two-dimensional (2D) materials with the aim to explore their properties for future applications. The group IV analogues of graphene, i.e., silicene, germanene and stanene have been intensively studied in recent years. However, their semi-metallic band structures hamper their use in electr… ▽ More Inspired by the unique properties of graphene, the focus in the literature is now on investigations of various two-dimensional (2D) materials with the aim to explore their properties for future applications. The group IV analogues of graphene, i.e., silicene, germanene and stanene have been intensively studied in recent years. However, their semi-metallic band structures hamper their use in electronic applications. Hence, the synthesis of 2D materials with band gaps of various sizes has attracted a large interest. Here, we report a successful preparation of a 2D Te-Ag binary alloy with a honeycomb structure. Angle-resolved photoelectron spectroscopy (ARPES) in combination with first-principles calculations using density functional theory (DFT) confirmed the formation of this binary alloy. The semiconducting property is verified by the ARPES data and a direct gap of ~0.7 eV is predicted by the DFT calculations. △ Less

Submitted 19 September, 2019; originally announced September 2019.

Comments: 7 pages, 3 figures and 2 figures in supplementary

Journal ref: J. Phys. Chem. Lett. 11, 1609 (2020)

arXiv:1909.06414 [pdf, other]

Learning Household Task Knowledge from WikiHow Descriptions

Authors: Yilun Zhou, Julie A. Shah, Steven Schockaert

Abstract: Commonsense procedural knowledge is important for AI agents and robots that operate in a human environment. While previous attempts at constructing procedural knowledge are mostly rule- and template-based, recent advances in deep learning provide the possibility of acquiring such knowledge directly from natural language sources. As a first step in this direction, we propose a model to learn embedd… ▽ More Commonsense procedural knowledge is important for AI agents and robots that operate in a human environment. While previous attempts at constructing procedural knowledge are mostly rule- and template-based, recent advances in deep learning provide the possibility of acquiring such knowledge directly from natural language sources. As a first step in this direction, we propose a model to learn embeddings for tasks, as well as the individual steps that need to be taken to solve them, based on WikiHow articles. We learn these embeddings such that they are predictive of both step relevance and step ordering. We also experiment with the use of integer programming for inferring consistent global step orderings from noisy pairwise predictions. △ Less

Submitted 13 September, 2019; originally announced September 2019.

Comments: IJCAI 2019 Workshop on Semantic Deep Learning

arXiv:1909.05232 [pdf, other]

On Memory Mechanism in Multi-Agent Reinforcement Learning

Authors: Yilun Zhou, Derrik E. Asher, Nicholas R. Waytowich, Julie A. Shah

Abstract: Multi-agent reinforcement learning (MARL) extends (single-agent) reinforcement learning (RL) by introducing additional agents and (potentially) partial observability of the environment. Consequently, algorithms for solving MARL problems incorporate various extensions beyond traditional RL methods, such as a learned communication protocol between cooperative agents that enables exchange of private… ▽ More Multi-agent reinforcement learning (MARL) extends (single-agent) reinforcement learning (RL) by introducing additional agents and (potentially) partial observability of the environment. Consequently, algorithms for solving MARL problems incorporate various extensions beyond traditional RL methods, such as a learned communication protocol between cooperative agents that enables exchange of private information or adaptive modeling of opponents in competitive settings. One popular algorithmic construct is a memory mechanism such that an agent's decisions can depend not only upon the current state but also upon the history of observed states and actions. In this paper, we study how a memory mechanism can be useful in environments with different properties, such as observability, internality and presence of a communication channel. Using both prior work and new experiments, we show that a memory mechanism is helpful when learning agents need to model other agents and/or when communication is constrained in some way; however we must to be cautious of agents achieving effective memoryfulness through other means. △ Less

Submitted 11 September, 2019; originally announced September 2019.

arXiv:1909.03920 [pdf, ps, other]

On the parametrized Tate construction and two theories of real $p$-cyclotomic spectra

Authors: J. D. Quigley, Jay Shah

Abstract: We give a new formula for $p$-typical real topological cyclic homology that refines the fiber sequence formula discovered by Nikolaus and Scholze for $p$-typical topological cyclic homology to one involving genuine $C_2$-spectra. To accomplish this, we give a new definition of the $\infty$-category of real $p$-cyclotomic spectra that replaces the usage of genuinely equivariant dihedral spectra wit… ▽ More We give a new formula for $p$-typical real topological cyclic homology that refines the fiber sequence formula discovered by Nikolaus and Scholze for $p$-typical topological cyclic homology to one involving genuine $C_2$-spectra. To accomplish this, we give a new definition of the $\infty$-category of real $p$-cyclotomic spectra that replaces the usage of genuinely equivariant dihedral spectra with the parametrized Tate construction $(-)^{t_{C_2} μ_p}$ associated to the dihedral group $D_{2p} = μ_p \rtimes C_2$. We then define a $p$-typical and $\infty$-categorical version of Høgenhaven's $O(2)$-orthogonal cyclotomic spectra, construct a forgetful functor relating the two theories, and show that this functor restricts to an equivalence between full subcategories of appropriately bounded below objects. △ Less

Submitted 9 September, 2019; originally announced September 2019.

Comments: 110 pages

MSC Class: 19D55; 55P42; 55P43; 55P91; 18D05; 16E40; 13D03

arXiv:1908.09805 [pdf, other]

The Limitations of Stylometry for Detecting Machine-Generated Fake News

Authors: Tal Schuster, Roei Schuster, Darsh J Shah, Regina Barzilay

Abstract: Recent developments in neural language models (LMs) have raised concerns about their potential misuse for automatically spreading misinformation. In light of these concerns, several studies have proposed to detect machine-generated fake news by capturing their stylistic differences from human-written text. These approaches, broadly termed stylometry, have found success in source attribution and mi… ▽ More Recent developments in neural language models (LMs) have raised concerns about their potential misuse for automatically spreading misinformation. In light of these concerns, several studies have proposed to detect machine-generated fake news by capturing their stylistic differences from human-written text. These approaches, broadly termed stylometry, have found success in source attribution and misinformation detection in human-written texts. However, in this work, we show that stylometry is limited against machine-generated misinformation. While humans speak differently when trying to deceive, LMs generate stylistically consistent text, regardless of underlying motive. Thus, though stylometry can successfully prevent impersonation by identifying text provenance, it fails to distinguish legitimate LM applications from those that introduce false information. We create two benchmarks demonstrating the stylistic similarity between malicious and legitimate uses of LMs, employed in auto-completion and editing-assistance settings. Our findings highlight the need for non-stylometry approaches in detecting machine-generated misinformation, and open up the discussion on the desired evaluation benchmarks. △ Less

Submitted 20 February, 2020; v1 submitted 26 August, 2019; originally announced August 2019.

Comments: Accepted for Computational Linguistics journal (squib). Previously posted with title "Are We Safe Yet? The Limitations of Distributional Features for Fake News Detection"

arXiv:1908.08378 [pdf, other]

doi 10.2140/akt.2020.5.411

C_2-equivariant stable homotopy from real motivic stable homotopy

Authors: Mark Behrens, Jay Shah

Abstract: We give a method for computing the C_2-equivariant homotopy groups of the Betti realization of a p-complete cellular motivic spectrum over R in terms of its motivic homotopy groups. More generally, we show that Betti realization presents the C_2-equivariant p-complete stable homotopy category as a localization of the p-complete cellular real motivic stable homotopy category. We give a method for computing the C_2-equivariant homotopy groups of the Betti realization of a p-complete cellular motivic spectrum over R in terms of its motivic homotopy groups. More generally, we show that Betti realization presents the C_2-equivariant p-complete stable homotopy category as a localization of the p-complete cellular real motivic stable homotopy category. △ Less

Submitted 13 April, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

Comments: 49 pages, 4 figures. v3: revisions based on comments from referee

Journal ref: Ann. K-Th. 5 (2020) 411-464

arXiv:1908.05267 [pdf, other]

Towards Debiasing Fact Verification Models

Authors: Tal Schuster, Darsh J Shah, Yun Jie Serene Yeo, Daniel Filizzola, Enrico Santus, Regina Barzilay

Abstract: Fact verification requires validating a claim in the context of evidence. We show, however, that in the popular FEVER dataset this might not necessarily be the case. Claim-only classifiers perform competitively with top evidence-aware models. In this paper, we investigate the cause of this phenomenon, identifying strong cues for predicting labels solely based on the claim, without considering any… ▽ More Fact verification requires validating a claim in the context of evidence. We show, however, that in the popular FEVER dataset this might not necessarily be the case. Claim-only classifiers perform competitively with top evidence-aware models. In this paper, we investigate the cause of this phenomenon, identifying strong cues for predicting labels solely based on the claim, without considering any evidence. We create an evaluation set that avoids those idiosyncrasies. The performance of FEVER-trained models significantly drops when evaluated on this test set. Therefore, we introduce a regularization method which alleviates the effect of bias in the training data, obtaining improvements on the newly created test set. This work is a step towards a more sound evaluation of reasoning capabilities in fact verification models. △ Less

Submitted 30 August, 2019; v1 submitted 14 August, 2019; originally announced August 2019.

Comments: EMNLP IJCNLP 2019

arXiv:1906.06870 [pdf, other]

Robust Zero-Shot Cross-Domain Slot Filling with Example Values

Authors: Darsh J Shah, Raghav Gupta, Amir A Fayazi, Dilek Hakkani-Tur

Abstract: Task-oriented dialog systems increasingly rely on deep learning-based slot filling models, usually needing extensive labeled training data for target domains. Often, however, little to no target domain training data may be available, or the training and target domain schemas may be misaligned, as is common for web forms on similar websites. Prior zero-shot slot filling models use slot descriptions… ▽ More Task-oriented dialog systems increasingly rely on deep learning-based slot filling models, usually needing extensive labeled training data for target domains. Often, however, little to no target domain training data may be available, or the training and target domain schemas may be misaligned, as is common for web forms on similar websites. Prior zero-shot slot filling models use slot descriptions to learn concepts, but are not robust to misaligned schemas. We propose utilizing both the slot description and a small number of examples of slot values, which may be easily available, to learn semantic representations of slots which are transferable across domains and robust to misaligned schemas. Our approach outperforms state-of-the-art models on two multi-domain datasets, especially in the low-data setting. △ Less

Submitted 17 June, 2019; originally announced June 2019.

Comments: To appear in ACL 2019

arXiv:1906.03218 [pdf, other]

doi 10.1109/LRA.2020.2977217

Planning With Uncertain Specifications (PUnS)

Authors: Ankit Shah, Shen Li, Julie Shah

Abstract: Reward engineering is crucial to high performance in reinforcement learning systems. Prior research into reward design has largely focused on Markovian functions representing the reward. While there has been research into expressing non-Markov rewards as linear temporal logic (LTL) formulas, this has focused on task specifications directly defined by the user. However, in many real-world applicati… ▽ More Reward engineering is crucial to high performance in reinforcement learning systems. Prior research into reward design has largely focused on Markovian functions representing the reward. While there has been research into expressing non-Markov rewards as linear temporal logic (LTL) formulas, this has focused on task specifications directly defined by the user. However, in many real-world applications, task specifications are ambiguous, and can only be expressed as a belief over LTL formulas. In this paper, we introduce planning with uncertain specifications (PUnS), a novel formulation that addresses the challenge posed by non-Markovian specifications expressed as beliefs over LTL formulas. We present four criteria that capture the semantics of satisfying a belief over specifications for different applications, and analyze the qualitative implications of these criteria within a synthetic domain. We demonstrate the existence of an equivalent Markov decision process (MDP) for any instance of PUnS. Finally, we demonstrate our approach on the real-world task of setting a dinner table automatically with a robot that inferred task specifications from human demonstrations. △ Less

Submitted 25 February, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

Comments: Accepted for publication by IEEE Robotics and Automation Letters. Accepted for presentation at the 2020 IEEE International Conference on Robotics and Automation

arXiv:1905.02827 [pdf, other]

Mesoscale Organization and Dynamics in Binary Ionic Liquid Mixtures

Authors: Tyler Cosby, Utkarsh Kapoor, **dal K. Shah, Joshua Sangoro

Abstract: The impact of mesoscale organization on dynamics and ion transport in binary ionic liquid mixtures is investigated by broadband dielectric spectroscopy, dynamic-mechanical spectroscopy, x-ray scattering, and molecular dynamics simulations. The mixtures are found to form distinct liquids with macroscopic properties that significantly deviate from weighted contributions of the neat components. For i… ▽ More The impact of mesoscale organization on dynamics and ion transport in binary ionic liquid mixtures is investigated by broadband dielectric spectroscopy, dynamic-mechanical spectroscopy, x-ray scattering, and molecular dynamics simulations. The mixtures are found to form distinct liquids with macroscopic properties that significantly deviate from weighted contributions of the neat components. For instance, it is shown that the mesoscale morphologies in ionic liquids can be tuned by mixing to enhance the static dielectric permittivity of the resulting liquid by as high as 100$\%$ relative to the neat ionic liquid components. This enhancement is attributed to the intricate role of interfacial dynamics associated with the changes in the mesoscopic aggregate morphologies in these systems. These results demonstrate the potential to design the physicochemical properties of ionic liquids through control of solvophobic aggregation △ Less

Submitted 7 May, 2019; originally announced May 2019.

Showing 51–100 of 151 results for author: Shah, J