-
Black Hole - Neutron Star mergers: using kilonovae to constrain the equation of state
Authors:
Lowri Wyn Prys Mathias,
Francesco Di Clemente,
Mattia Bulla,
Alessandro Drago
Abstract:
The merging of a binary system involving two neutron stars (NSs), or a black hole (BH) and a NS, often results in the emission of an electromagnetic (EM) transient. One component of this EM transient is the epic explosion known as a kilonova (KN). The characteristics of the KN emission can be used to probe the equation of state (EoS) of NS matter responsible for its formation. We predict KN light…
▽ More
The merging of a binary system involving two neutron stars (NSs), or a black hole (BH) and a NS, often results in the emission of an electromagnetic (EM) transient. One component of this EM transient is the epic explosion known as a kilonova (KN). The characteristics of the KN emission can be used to probe the equation of state (EoS) of NS matter responsible for its formation. We predict KN light curves from computationally simulated BH-NS mergers, by using the 3D radiative transfer code \texttt{POSSIS}. We investigate two EoSs spanning most of the allowed range of the mass-radius diagram. We also consider a soft EoS compatible with the observational data within the so-called 2-families scenario in which hadronic stars coexist with strange stars. Computed results show that the 2-families scenario, characterized by a soft EoS, should not produce a KN unless the mass of the binary components are small ($M_{\rm BH} \leq 6M_{\odot}$, $M_{\rm NS} \leq 1.4M_{\odot}$) and the BH is rapidly spinning ($χ_{\rm BH} \geq 0.3$). In contrast, a strong KN signal potentially observable from future surveys (e.g. VRO/LSST) is produced in the 1-family scenario for a wider region of the parameter space, and even for non-rotating BHs ($χ_{\rm BH} = 0$) when $M_{\rm BH} = 4M_{\odot}$ and $M_{\rm NS} = 1.2M_{\odot}$. We also provide a fit that allows for the calculation of the unbound mass from the observed KN magnitude, without running timely and costly radiative transfer simulations. Findings presented in this paper will be used to interpret light curves anticipated during the fourth observing run (O4), of the advanced LIGO, advanced Virgo and KAGRA interferometers and thus to constrain the EoS of NS matter.
△ Less
Submitted 19 December, 2023; v1 submitted 2 September, 2023;
originally announced September 2023.
-
Tag-Based Annotation for Avatar Face Creation
Authors:
An Ngo,
Daniel Phelps,
Derrick Lai,
Thanyared Wong,
Lucas Mathias,
Anish Shivamurthy,
Mustafa Ajmal,
Minghao Liu,
James Davis
Abstract:
Currently, digital avatars can be created manually using human images as reference. Systems such as Bitmoji are excellent producers of detailed avatar designs, with hundreds of choices for customization. A supervised learning model could be trained to generate avatars automatically, but the hundreds of possible options create difficulty in securing non-noisy data to train a model. As a solution, w…
▽ More
Currently, digital avatars can be created manually using human images as reference. Systems such as Bitmoji are excellent producers of detailed avatar designs, with hundreds of choices for customization. A supervised learning model could be trained to generate avatars automatically, but the hundreds of possible options create difficulty in securing non-noisy data to train a model. As a solution, we train a model to produce avatars from human images using tag-based annotations. This method provides better annotator agreement, leading to less noisy data and higher quality model predictions. Our contribution is an application of tag-based annotation to train a model for avatar face creation. We design tags for 3 different facial facial features offered by Bitmoji, and train a model using tag-based annotation to predict the nose.
△ Less
Submitted 24 August, 2023;
originally announced August 2023.
-
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Authors:
Aaron Mueller,
Kanika Narang,
Lambert Mathias,
Qifan Wang,
Hamed Firooz
Abstract:
Large language models show impressive results on few-shot NLP tasks. However, these models are memory and computation-intensive. Meta-training allows one to leverage smaller models for few-shot generalization in a domain-general and task-agnostic manner; however, these methods alone results in models that may not have sufficient parameterization or knowledge to adapt quickly to a large variety of…
▽ More
Large language models show impressive results on few-shot NLP tasks. However, these models are memory and computation-intensive. Meta-training allows one to leverage smaller models for few-shot generalization in a domain-general and task-agnostic manner; however, these methods alone results in models that may not have sufficient parameterization or knowledge to adapt quickly to a large variety of tasks. To overcome this issue, we propose meta-training with demonstration retrieval, where we use a dense passage retriever to retrieve semantically similar labeled demonstrations to each example for more varied supervision. By separating external knowledge from model parameters, we can use meta-training to train parameter-efficient models that generalize well on a larger variety of tasks. We construct a meta-training set from UnifiedQA and CrossFit, and propose a demonstration bank based on UnifiedQA tasks. To our knowledge, our work is the first to combine retrieval with meta-training, to use DPR models to retrieve demonstrations, and to leverage demonstrations from many tasks simultaneously, rather than randomly sampling demonstrations from the training set of the target task. Our approach outperforms a variety of targeted parameter-efficient and retrieval-augmented few-shot methods on QA, NLI, and text classification tasks (including SQuAD, QNLI, and TREC). Our approach can be meta-trained and fine-tuned quickly on a single GPU.
△ Less
Submitted 30 June, 2023;
originally announced July 2023.
-
TimelineQA: A Benchmark for Question Answering over Timelines
Authors:
Wang-Chiew Tan,
Jane Dwivedi-Yu,
Yuliang Li,
Lambert Mathias,
Marzieh Saeidi,
**g Nathan Yan,
Alon Y. Halevy
Abstract:
Lifelogs are descriptions of experiences that a person had during their life. Lifelogs are created by fusing data from the multitude of digital services, such as online photos, maps, shop** and content streaming services. Question answering over lifelogs can offer personal assistants a critical resource when they try to provide advice in context. However, obtaining answers to questions over life…
▽ More
Lifelogs are descriptions of experiences that a person had during their life. Lifelogs are created by fusing data from the multitude of digital services, such as online photos, maps, shop** and content streaming services. Question answering over lifelogs can offer personal assistants a critical resource when they try to provide advice in context. However, obtaining answers to questions over lifelogs is beyond the current state of the art of question answering techniques for a variety of reasons, the most pronounced of which is that lifelogs combine free text with some degree of structure such as temporal and geographical information.
We create and publicly release TimelineQA1, a benchmark for accelerating progress on querying lifelogs. TimelineQA generates lifelogs of imaginary people. The episodes in the lifelog range from major life episodes such as high school graduation to those that occur on a daily basis such as going for a run. We describe a set of experiments on TimelineQA with several state-of-the-art QA models. Our experiments reveal that for atomic queries, an extractive QA system significantly out-performs a state-of-the-art retrieval-augmented QA system. For multi-hop queries involving aggregates, we show that the best result is obtained with a state-of-the-art table QA technique, assuming the ground truth set of episodes for deriving the answer is available.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection
Authors:
Badr AlKhamissi,
Faisal Ladhak,
Srini Iyer,
Ves Stoyanov,
Zornitsa Kozareva,
Xian Li,
Pascale Fung,
Lambert Mathias,
Asli Celikyilmaz,
Mona Diab
Abstract:
Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts…
▽ More
Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts. In addition, we see that infusing knowledge from reasoning datasets (e.g. Atomic2020) improves the performance even further. Moreover, we observe that the trained models generalize to out-of-distribution datasets, showing the superiority of task decomposition and knowledge infusion compared to previously used methods. Concretely, our method outperforms the baseline by 17.83% absolute gain in the 16-shot case.
△ Less
Submitted 20 May, 2023; v1 submitted 25 May, 2022;
originally announced May 2022.
-
Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
Authors:
Suzanna Sia,
Anton Belyy,
Amjad Almahairi,
Madian Khabsa,
Luke Zettlemoyer,
Lambert Mathias
Abstract:
Evaluating an explanation's faithfulness is desired for many reasons such as trust, interpretability and diagnosing the sources of model's errors. In this work, which focuses on the NLI task, we introduce the methodology of Faithfulness-through-Counterfactuals, which first generates a counterfactual hypothesis based on the logical predicates expressed in the explanation, and then evaluates if the…
▽ More
Evaluating an explanation's faithfulness is desired for many reasons such as trust, interpretability and diagnosing the sources of model's errors. In this work, which focuses on the NLI task, we introduce the methodology of Faithfulness-through-Counterfactuals, which first generates a counterfactual hypothesis based on the logical predicates expressed in the explanation, and then evaluates if the model's prediction on the counterfactual is consistent with that expressed logic (i.e. if the new formula is \textit{logically satisfiable}). In contrast to existing approaches, this does not require any explanations for training a separate verification model. We first validate the efficacy of automatic counterfactual hypothesis generation, leveraging on the few-shot priming paradigm. Next, we show that our proposed metric distinguishes between human-model agreement and disagreement on new counterfactual input. In addition, we conduct a sensitivity analysis to validate that our metric is sensitive to unfaithful explanations.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
Policy Compliance Detection via Expression Tree Inference
Authors:
Neema Kotonya,
Andreas Vlachos,
Majid Yazdani,
Lambert Mathias,
Marzieh Saeidi
Abstract:
Policy Compliance Detection (PCD) is a task we encounter when reasoning over texts, e.g. legal frameworks. Previous work to address PCD relies heavily on modeling the task as a special case of Recognizing Textual Entailment. Entailment is applicable to the problem of PCD, however viewing the policy as a single proposition, as opposed to multiple interlinked propositions, yields poor performance an…
▽ More
Policy Compliance Detection (PCD) is a task we encounter when reasoning over texts, e.g. legal frameworks. Previous work to address PCD relies heavily on modeling the task as a special case of Recognizing Textual Entailment. Entailment is applicable to the problem of PCD, however viewing the policy as a single proposition, as opposed to multiple interlinked propositions, yields poor performance and lacks explainability. To address this challenge, more recent proposals for PCD have argued for decomposing policies into expression trees consisting of questions connected with logic operators. Question answering is used to obtain answers to these questions with respect to a scenario. Finally, the expression tree is evaluated in order to arrive at an overall solution. However, this work assumes expression trees are provided by experts, thus limiting its applicability to new policies. In this work, we learn how to infer expression trees automatically from policy texts. We ensure the validity of the inferred trees by introducing constrained decoding using a finite state automaton to ensure the generation of valid trees. We determine through automatic evaluation that 63% of the expression trees generated by our constrained generation model are logically equivalent to gold trees. Human evaluation shows that 88% of trees generated by our model are correct.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models
Authors:
Rabeeh Karimi Mahabadi,
Luke Zettlemoyer,
James Henderson,
Marzieh Saeidi,
Lambert Mathias,
Veselin Stoyanov,
Majid Yazdani
Abstract:
Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as…
▽ More
Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as 32 data points. PERFECT makes two key design choices: First, we show that manually engineered task prompts can be replaced with task-specific adapters that enable sample-efficient fine-tuning and reduce memory and storage costs by roughly factors of 5 and 100, respectively. Second, instead of using handcrafted verbalizers, we learn new multi-token label embeddings during fine-tuning, which are not tied to the model vocabulary and which allow us to avoid complex auto-regressive decoding. These embeddings are not only learnable from limited data but also enable nearly 100x faster training and inference. Experiments on a wide range of few-shot NLP tasks demonstrate that PERFECT, while being simple and efficient, also outperforms existing state-of-the-art few-shot learning methods. Our code is publicly available at https://github.com/facebookresearch/perfect.git.
△ Less
Submitted 25 April, 2022; v1 submitted 3 April, 2022;
originally announced April 2022.
-
UNIREX: A Unified Learning Framework for Language Model Rationale Extraction
Authors:
Aaron Chan,
Maziar Sanjabi,
Lambert Mathias,
Liang Tan,
Shaoliang Nie,
Xiaochang Peng,
Xiang Ren,
Hamed Firooz
Abstract:
An extractive rationale explains a language model's (LM's) prediction on a given task instance by highlighting the text inputs that most influenced the prediction. Ideally, rationale extraction should be faithful (reflective of LM's actual behavior) and plausible (convincing to humans), without compromising the LM's (i.e., task model's) task performance. Although attribution algorithms and select-…
▽ More
An extractive rationale explains a language model's (LM's) prediction on a given task instance by highlighting the text inputs that most influenced the prediction. Ideally, rationale extraction should be faithful (reflective of LM's actual behavior) and plausible (convincing to humans), without compromising the LM's (i.e., task model's) task performance. Although attribution algorithms and select-predict pipelines are commonly used in rationale extraction, they both rely on certain heuristics that hinder them from satisfying all three desiderata. In light of this, we propose UNIREX, a flexible learning framework that generalizes rationale extractor optimization as follows: (1) specify architecture for a learned rationale extractor; (2) select explainability objectives (i.e., faithfulness and plausibility criteria); and (3) jointly the train task model and rationale extractor on the task using the selected objectives. UNIREX enables replacing prior works' heuristic design choices with a generic learned rationale extractor in (1) and optimizing it for all three desiderata in (2)-(3). To facilitate comparison between methods with respect to multiple desiderata, we introduce the Normalized Relative Gain (NRG) metric. Across five text classification datasets, our best UNIREX configuration outperforms baselines by an average of 32.9% NRG. Plus, we find that UNIREX-trained rationale extractors can even generalize to unseen datasets and tasks.
△ Less
Submitted 26 February, 2023; v1 submitted 16 December, 2021;
originally announced December 2021.
-
UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning
Authors:
Yuning Mao,
Lambert Mathias,
Rui Hou,
Amjad Almahairi,
Hao Ma,
Jiawei Han,
Wen-tau Yih,
Madian Khabsa
Abstract:
Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited. However, different PELT methods may perform rather differently on the same task, making it nontrivial to select the most appropriate method for a specific task, especially considering the fast-…
▽ More
Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited. However, different PELT methods may perform rather differently on the same task, making it nontrivial to select the most appropriate method for a specific task, especially considering the fast-growing number of new PELT methods and tasks. In light of model diversity and the difficulty of model selection, we propose a unified framework, UniPELT, which incorporates different PELT methods as submodules and learns to activate the ones that best suit the current data or task setup via gating mechanism. On the GLUE benchmark, UniPELT consistently achieves 1~4% gains compared to the best individual PELT method that it incorporates and even outperforms fine-tuning under different setups. Moreover, UniPELT generally surpasses the upper bound that takes the best performance of all its submodules used individually on each task, indicating that a mixture of multiple PELT methods may be inherently more effective than single methods.
△ Less
Submitted 4 September, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Personalized Query Rewriting in Conversational AI Agents
Authors:
Alireza Roshan-Ghias,
Clint Solomon Mathialagan,
Pragaash Ponnusamy,
Lambert Mathias,
Chenlei Guo
Abstract:
Spoken language understanding (SLU) systems in conversational AI agents often experience errors in the form of misrecognitions by automatic speech recognition (ASR) or semantic gaps in natural language understanding (NLU). These errors easily translate to user frustrations, particularly so in recurrent events e.g. regularly toggling an appliance, calling a frequent contact, etc. In this work, we p…
▽ More
Spoken language understanding (SLU) systems in conversational AI agents often experience errors in the form of misrecognitions by automatic speech recognition (ASR) or semantic gaps in natural language understanding (NLU). These errors easily translate to user frustrations, particularly so in recurrent events e.g. regularly toggling an appliance, calling a frequent contact, etc. In this work, we propose a query rewriting approach by leveraging users' historically successful interactions as a form of memory. We present a neural retrieval model and a pointer-generator network with hierarchical attention and show that they perform significantly better at the query rewriting task with the aforementioned user memories than without. We also highlight how our approach with the proposed models leverages the structural and semantic diversity in ASR's output towards recovering users' intents.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Pre-Training for Query Rewriting in A Spoken Language Understanding System
Authors:
Zheng Chen,
Xing Fan,
Yuan Ling,
Lambert Mathias,
Chenlei Guo
Abstract:
Query rewriting (QR) is an increasingly important technique to reduce customer friction caused by errors in a spoken language understanding pipeline, where the errors originate from various sources such as speech recognition errors, language understanding errors or entity resolution errors. In this work, we first propose a neural-retrieval based approach for query rewriting. Then, inspired by the…
▽ More
Query rewriting (QR) is an increasingly important technique to reduce customer friction caused by errors in a spoken language understanding pipeline, where the errors originate from various sources such as speech recognition errors, language understanding errors or entity resolution errors. In this work, we first propose a neural-retrieval based approach for query rewriting. Then, inspired by the wide success of pre-trained contextual language embeddings, and also as a way to compensate for insufficient QR training data, we propose a language-modeling (LM) based approach to pre-train query embeddings on historical user conversation data with a voice assistant. In addition, we propose to use the NLU hypotheses generated by the language understanding system to augment the pre-training. Our experiments show pre-training provides rich prior information and help the QR task achieve strong performance. We also show joint pre-training with NLU hypotheses has further benefit. Finally, after pre-training, we find a small set of rewrite pairs is enough to fine-tune the QR model to outperform a strong baseline by full training on all QR training data.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.
-
Leveraging External Knowledge for Out-Of-Vocabulary Entity Labeling
Authors:
Adrian de Wynter,
Lambert Mathias
Abstract:
Dealing with previously unseen slots is a challenging problem in a real-world multi-domain dialogue state tracking task. Other approaches rely on predefined map**s to generate candidate slot keys, as well as their associated values. This, however, may fail when the key, the value, or both, are not seen during training. To address this problem we introduce a neural network that leverages external…
▽ More
Dealing with previously unseen slots is a challenging problem in a real-world multi-domain dialogue state tracking task. Other approaches rely on predefined map**s to generate candidate slot keys, as well as their associated values. This, however, may fail when the key, the value, or both, are not seen during training. To address this problem we introduce a neural network that leverages external knowledge bases (KBs) to better classify out-of-vocabulary slot keys and values. This network projects the slot into an attribute space derived from the KB, and, by leveraging similarities in this space, we propose candidate slot keys and values to the dialogue state tracker. We provide extensive experiments that demonstrate that our stratagem can improve upon a previous approach, which relies on predefined candidate map**s. In particular, we evaluate this approach by training a state-of-the-art model with candidates generated from our network, and obtained relative increases of 57.7% and 82.7% in F1 score and accuracy, respectively, for the aforementioned model, when compared to the current candidate generation strategy.
△ Less
Submitted 26 August, 2019;
originally announced August 2019.
-
Time Masking: Leveraging Temporal Information in Spoken Dialogue Systems
Authors:
Rylan Conway,
Lambert Mathias
Abstract:
In a spoken dialogue system, dialogue state tracker (DST) components track the state of the conversation by updating a distribution of values associated with each of the slots being tracked for the current user turn, using the interactions until then. Much of the previous work has relied on modeling the natural order of the conversation, using distance based offsets as an approximation of time. In…
▽ More
In a spoken dialogue system, dialogue state tracker (DST) components track the state of the conversation by updating a distribution of values associated with each of the slots being tracked for the current user turn, using the interactions until then. Much of the previous work has relied on modeling the natural order of the conversation, using distance based offsets as an approximation of time. In this work, we hypothesize that leveraging the wall-clock temporal difference between turns is crucial for finer-grained control of dialogue scenarios. We develop a novel approach that applies a {\it time mask}, based on the wall-clock time difference, to the associated slot embeddings and empirically demonstrate that our proposed approach outperforms existing approaches that leverage distance offsets, on both an internal benchmark dataset as well as DSTC2.
△ Less
Submitted 25 July, 2019;
originally announced July 2019.
-
Improving Long Distance Slot Carryover in Spoken Dialogue Systems
Authors:
Tongfei Chen,
Chetan Naik,
Hua He,
Pushpendre Rastogi,
Lambert Mathias
Abstract:
Tracking the state of the conversation is a central component in task-oriented spoken dialogue systems. One such approach for tracking the dialogue state is slot carryover, where a model makes a binary decision if a slot from the context is relevant to the current turn. Previous work on the slot carryover task used models that made independent decisions for each slot. A close analysis of the resul…
▽ More
Tracking the state of the conversation is a central component in task-oriented spoken dialogue systems. One such approach for tracking the dialogue state is slot carryover, where a model makes a binary decision if a slot from the context is relevant to the current turn. Previous work on the slot carryover task used models that made independent decisions for each slot. A close analysis of the results show that this approach results in poor performance over longer context dialogues. In this paper, we propose to jointly model the slots. We propose two neural network architectures, one based on pointer networks that incorporate slot ordering information, and the other based on transformer networks that uses self attention mechanism to model the slot interdependencies. Our experiments on an internal dialogue benchmark dataset and on the public DSTC2 dataset demonstrate that our proposed models are able to resolve longer distance slot references and are able to achieve competitive performance.
△ Less
Submitted 3 June, 2019;
originally announced June 2019.
-
Pre-distortion and Pre-equalization for Non-Linearities and Low-Pass Effect Mitigation in OFDM-VLC Systems
Authors:
Luis Carlos Mathias,
Jose Carlos Marinello Filho,
Taufik Abrao
Abstract:
The orthogonal frequency division multiplexing (OFDM) transmission has shown promise in applications of visible light communication (VLC). However, the variation of the nonlinearity of the optical power emitted by the high power light emitting diode (HPLED) as a function of current and temperature implies in drastic OFDM-VLC performance degradation. The first part of this work, experimentally conf…
▽ More
The orthogonal frequency division multiplexing (OFDM) transmission has shown promise in applications of visible light communication (VLC). However, the variation of the nonlinearity of the optical power emitted by the high power light emitting diode (HPLED) as a function of current and temperature implies in drastic OFDM-VLC performance degradation. The first part of this work, experimentally confirms and models this degradation due to temperature in a high power white HPLED. The higher attenuation at high frequencies, which is inherent to the HPLED and which is accentuated by the effect of the intrinsic capacitance of the photodiode, is another factor of degradation due to the reduction of the signal-to-noise ratio (SNR) at the receiver for such frequencies. For the mitigation of these effects, we propose a pre-distortion and digital pre-equalization scheme using a luminous feedback signal in the transmitter module. The system is modeled so that the operating points are mathematically deduced and evaluated by simulations and by an experimental setup. By allowing the linearization of the transmitted light signal and the maintenance of an average SNR in all OFDM subcarriers, the performance improvement is confirmed in comparison with other schemes, such as with non-predistortion, pre-distortion with fixed parameters, and simple post-equalization.
△ Less
Submitted 24 April, 2019;
originally announced April 2019.
-
A dataset for resolving referring expressions in spoken dialogue via contextual query rewrites (CQR)
Authors:
Michael Regan,
Pushpendre Rastogi,
Arpit Gupta,
Lambert Mathias
Abstract:
We present Contextual Query Rewrite (CQR) a dataset for multi-domain task-oriented spoken dialogue systems that is an extension of the Stanford dialog corpus (Eric et al., 2017a). While previous approaches have addressed the issue of diverse schemas by learning candidate transformations (Naik et al., 2018), we instead model the reference resolution task as a user query reformulation task, where th…
▽ More
We present Contextual Query Rewrite (CQR) a dataset for multi-domain task-oriented spoken dialogue systems that is an extension of the Stanford dialog corpus (Eric et al., 2017a). While previous approaches have addressed the issue of diverse schemas by learning candidate transformations (Naik et al., 2018), we instead model the reference resolution task as a user query reformulation task, where the dialog state is serialized into a natural language query that can be executed by the downstream spoken language understanding system. In this paper, we describe our methodology for creating the query reformulation extension to the dialog corpus, and present an initial set of experiments to establish a baseline for the CQR task. We have released the corpus to the public [1] to support further research in this area.
△ Less
Submitted 31 March, 2019; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Scaling Multi-Domain Dialogue State Tracking via Query Reformulation
Authors:
Pushpendre Rastogi,
Arpit Gupta,
Tongfei Chen,
Lambert Mathias
Abstract:
We present a novel approach to dialogue state tracking and referring expression resolution tasks. Successful contextual understanding of multi-turn spoken dialogues requires resolving referring expressions across turns and tracking the entities relevant to the conversation across turns. Tracking conversational state is particularly challenging in a multi-domain scenario when there exist multiple s…
▽ More
We present a novel approach to dialogue state tracking and referring expression resolution tasks. Successful contextual understanding of multi-turn spoken dialogues requires resolving referring expressions across turns and tracking the entities relevant to the conversation across turns. Tracking conversational state is particularly challenging in a multi-domain scenario when there exist multiple spoken language understanding (SLU) sub-systems, and each SLU sub-system operates on its domain-specific meaning representation. While previous approaches have addressed the disparate schema issue by learning candidate transformations of the meaning representation, in this paper, we instead model the reference resolution as a dialogue context-aware user query reformulation task -- the dialog state is serialized to a sequence of natural language tokens representing the conversation. We develop our model for query reformulation using a pointer-generator network and a novel multi-task learning setup. In our experiments, we show a significant improvement in absolute F1 on an internal as well as a, soon to be released, public benchmark respectively.
△ Less
Submitted 29 March, 2019; v1 submitted 12 March, 2019;
originally announced March 2019.
-
3-D Localization with Multiple LEDs Lamps in OFDM-VLC system
Authors:
Luis C. Mathias,
Leonimer F. de Melo,
Taufik Abrao
Abstract:
Visible light communication (VLC) based localization is a potential candidate for wide range indoor localization applications. In this paper, we propose a VLC architecture based on orthogonal frequency division multiplexing (OFDM) with multiple functionalities integrated in the same system, i.e., the 3- D receiver location, the control of the room illumination intensity, as well as the data transm…
▽ More
Visible light communication (VLC) based localization is a potential candidate for wide range indoor localization applications. In this paper, we propose a VLC architecture based on orthogonal frequency division multiplexing (OFDM) with multiple functionalities integrated in the same system, i.e., the 3- D receiver location, the control of the room illumination intensity, as well as the data transmission capability. Herein we propose an original methodology for LED power discrimination applying spatial optical OFDM (SO-OFDM) structure for position estimation. The hybrid locator initially makes a first estimate using a weighted angle-of-arrival (WAoA)-based locator which is then used as the starting point of the recursive estimator based on the strength of the received signal (RSS). Hence, the first stage is deployed to increase convergence probability, reducing the root-mean-square error (RMSE) and the number of iterations of the second stage. Also, a performance vs computational complexity comparative analysis is carried out with parameter variations of these estimators. The numerical results indicate a decade improvement in the RMSE for each two decades of decrement of power noise on the receiver photodiode. The best clip** factor is obtained through the analysis of locator accuracy and transmission capacity for each simulated system. Finally, the numerical results also demonstrate effectiveness, robustness, and efficiency of the proposed architecture.
△ Less
Submitted 22 December, 2018;
originally announced December 2018.
-
Cross-Lingual Approaches to Reference Resolution in Dialogue Systems
Authors:
Amr Sharaf,
Arpit Gupta,
Hancheng Ge,
Chetan Naik,
Lambert Mathias
Abstract:
In the slot-filling paradigm, where a user can refer back to slots in the context during the conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In this paper, we build on the context carryover system~\citep{Naik2018ContextualSC}, which provides a scalable multi-domain framework for resolving references. How…
▽ More
In the slot-filling paradigm, where a user can refer back to slots in the context during the conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In this paper, we build on the context carryover system~\citep{Naik2018ContextualSC}, which provides a scalable multi-domain framework for resolving references. However, scaling this approach across languages is not a trivial task, due to the large demand on acquisition of annotated data in the target language. Our main focus is on cross-lingual methods for reference resolution as a way to alleviate the need for annotated data in the target language. In the cross-lingual setup, we assume there is access to annotated resources as well as a well trained model in the source language and little to no annotated data in the target language. In this paper, we explore three different approaches for cross-lingual transfer \textemdash~\ delexicalization as data augmentation, multilingual embeddings and machine translation. We compare these approaches both on a low resource setting as well as a large resource setting. Our experiments show that multilingual embeddings and delexicalization via data augmentation have a significant impact in the low resource setting, but the gains diminish as the amount of available data in the target language increases. Furthermore, when combined with machine translation we can get performance very close to actual live data in the target language, with only 25\% of the data projected into the target language.
△ Less
Submitted 27 November, 2018;
originally announced November 2018.
-
Contextual Slot Carryover for Disparate Schemas
Authors:
Chetan Naik,
Arpit Gupta,
Hancheng Ge,
Lambert Mathias,
Ruhi Sarikaya
Abstract:
In the slot-filling paradigm, where a user can refer back to slots in the context during a conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In large-scale multi-domain systems, this presents two challenges - scaling to a very large and potentially unbounded set of slot values, and dealing with diverse sch…
▽ More
In the slot-filling paradigm, where a user can refer back to slots in the context during a conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In large-scale multi-domain systems, this presents two challenges - scaling to a very large and potentially unbounded set of slot values, and dealing with diverse schemas. We present a neural network architecture that addresses the slot value scalability challenge by reformulating the contextual interpretation as a decision to carryover a slot from a set of possible candidates. To deal with heterogenous schemas, we introduce a simple data-driven method for trans- forming the candidate slots. Our experiments show that our approach can scale to multiple domains and provides competitive results over a strong baseline.
△ Less
Submitted 5 June, 2018;
originally announced June 2018.
-
Transfer Learning for Neural Semantic Parsing
Authors:
Xing Fan,
Emilio Monti,
Lambert Mathias,
Markus Dreyer
Abstract:
The goal of semantic parsing is to map natural language to a machine interpretable meaning representation language (MRL). One of the constraints that limits full exploration of deep learning technologies for semantic parsing is the lack of sufficient annotation training data. In this paper, we propose using sequence-to-sequence in a multi-task setup for semantic parsing with a focus on transfer le…
▽ More
The goal of semantic parsing is to map natural language to a machine interpretable meaning representation language (MRL). One of the constraints that limits full exploration of deep learning technologies for semantic parsing is the lack of sufficient annotation training data. In this paper, we propose using sequence-to-sequence in a multi-task setup for semantic parsing with a focus on transfer learning. We explore three multi-task architectures for sequence-to-sequence modeling and compare their performance with an independently trained model. Our experiments show that the multi-task setup aids transfer learning from an auxiliary task with large labeled data to a target task with smaller labeled data. We see absolute accuracy gains ranging from 1.0% to 4.4% in our in- house data set, and we also see good gains ranging from 2.5% to 7.0% on the ATIS semantic parsing tasks with syntactic and semantic auxiliary tasks.
△ Less
Submitted 14 June, 2017;
originally announced June 2017.