Skip to main content

Showing 1–6 of 6 results for author: Assael, Y M

.
  1. arXiv:1801.09466  [pdf, other

    cs.AI math.OC

    Using deep Q-learning to understand the tax evasion behavior of risk-averse firms

    Authors: Nikolaos D. Goumagias, Dimitrios Hristu-Varsakelis, Yannis M. Assael

    Abstract: Designing tax policies that are effective in curbing tax evasion and maximize state revenues requires a rigorous understanding of taxpayer behavior. This work explores the problem of determining the strategy a self-interested, risk-averse tax entity is expected to follow, as it "navigates" - in the context of a Markov Decision Process - a government-controlled tax environment that includes random… ▽ More

    Submitted 29 January, 2018; originally announced January 2018.

    Comments: Preprint - accepted for publication in Expert Systems with Applications

  2. arXiv:1711.02448  [pdf, other

    q-bio.NC cs.NE stat.ML

    Cortical microcircuits as gated-recurrent neural networks

    Authors: Rui Ponte Costa, Yannis M. Assael, Brendan Shillingford, Nando de Freitas, Tim P. Vogels

    Abstract: Cortical circuits exhibit intricate recurrent architectures that are remarkably similar across different brain areas. Such stereotyped structure suggests the existence of common computational principles. However, such principles have remained largely elusive. Inspired by gated-memory networks, namely long short-term memory networks (LSTMs), we introduce a recurrent neural network in which informat… ▽ More

    Submitted 3 January, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: To appear in Advances in Neural Information Processing Systems 30 (NIPS 2017). 13 pages, 2 figures (and 1 supp. figure)

  3. arXiv:1611.01599  [pdf, other

    cs.LG cs.CL cs.CV

    LipNet: End-to-End Sentence-level Lipreading

    Authors: Yannis M. Assael, Brendan Shillingford, Shimon Whiteson, Nando de Freitas

    Abstract: Lipreading is the task of decoding text from the movement of a speaker's mouth. Traditional approaches separated the problem into two stages: designing or learning visual features, and prediction. More recent deep lipreading approaches are end-to-end trainable (Wand et al., 2016; Chung & Zisserman, 2016a). However, existing work on models trained end-to-end perform only word classification, rather… ▽ More

    Submitted 16 December, 2016; v1 submitted 5 November, 2016; originally announced November 2016.

  4. arXiv:1610.02707  [pdf, other

    cs.AI

    Multi-Objective Deep Reinforcement Learning

    Authors: Hossam Mossalam, Yannis M. Assael, Diederik M. Roijers, Shimon Whiteson

    Abstract: We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs, DOL computes the convex coverage set containing all potential optimal solutions of the convex combinations of the objectives. To our knowledge, this is the first… ▽ More

    Submitted 9 October, 2016; originally announced October 2016.

  5. arXiv:1605.06676  [pdf, other

    cs.AI cs.LG cs.MA

    Learning to Communicate with Deep Multi-Agent Reinforcement Learning

    Authors: Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson

    Abstract: We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate end-to-end learning of protocols in complex environments inspired by communicati… ▽ More

    Submitted 24 May, 2016; v1 submitted 21 May, 2016; originally announced May 2016.

  6. arXiv:1602.02672  [pdf, other

    cs.AI cs.LG

    Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks

    Authors: Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson

    Abstract: We propose deep distributed recurrent Q-networks (DDRQN), which enable teams of agents to learn to solve communication-based coordination tasks. In these tasks, the agents are not given any pre-designed communication protocol. Therefore, in order to successfully communicate, they must first automatically develop and agree upon their own communication protocol. We present empirical results on two m… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.