Learning Manner of Execution from Partial Corrections
Authors:
Mattias Appelgren,
Alex Lascarides
Abstract:
Some actions must be executed in different ways depending on the context. For example, wiping away marker requires vigorous force while wiping away almonds requires more gentle force. In this paper we provide a model where an agent learns which manner of action execution to use in which context, drawing on evidence from trial and error and verbal corrections when it makes a mistake (e.g., "no, gently"). The learner starts out with a domain model that lacks the concepts denoted by the words in the teacher's feedback: both the words describing the context (e.g., marker) and adverbs like "gently". We show that through the semantics of coherence, our agent can perform the symbol grounding that is necessary for exploiting the teacher's feedback so as to solve its domain-level planning problem: to perform its actions in the current context in the right way.
Submitted 7 February, 2023;
originally announced February 2023.
Language Transfer for Early Warning of Epidemics from Social Media
Authors:
Mattias Appelgren,
Patrick Schrempf,
Matúš Falis,
Satoshi Ikeda,
Alison Q O'Neil
Abstract:
Statements on social media can be analysed to identify individuals who are experiencing red flag medical symptoms, allowing early detection of the spread of diseases such as influenza. Since disease does not respect cultural borders and may spread between populations speaking different languages, we would like to build multilingual models. However, the data required to train models for every language may be difficult, expensive and time-consuming to obtain, particularly for low-resource languages. Taking Japanese as our target language, we explore methods by which data in one language might be used to build models for a different language. We evaluate strategies of training on machine-translated data and of zero-shot transfer through the use of multilingual models. We find that the choice of source language affects performance, with Chinese-Japanese being a better language pair than English-Japanese. Training on machine-translated data shows promise, especially when used in conjunction with a small amount of target-language data.
Submitted 10 October, 2019;
originally announced October 2019.