Showing 1–6 of 6 results for author: Kahembwe, E

  1. arXiv:2110.01963  [pdf, other]

    cs.CY

    Multimodal datasets: misogyny, pornography, and malignant stereotypes

    Authors: Abeba Birhane, Vinay Uday Prabhu, Emmanuel Kahembwe

    Abstract: We have now entered the era of trillion parameter machine learning models trained on billion-sized datasets scraped from the internet. The rise of these gargantuan datasets has given rise to formidable bodies of critical work that has called for caution while generating these large datasets. These address concerns surrounding the dubious curation practices used to generate these datasets, the sord…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 33 pages

  2. arXiv:1912.08860  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Lower Dimensional Kernels for Video Discriminators

    Authors: Emmanuel Kahembwe, Subramanian Ramamoorthy

    Abstract: This work presents an analysis of the discriminators used in Generative Adversarial Networks (GANs) for Video. We show that unconstrained video discriminator architectures induce a loss surface with high curvature which make optimisation difficult. We also show that this curvature becomes more extreme as the maximal kernel dimension of video discriminators increases. With these observations in han…

    Submitted 18 December, 2019; originally announced December 2019.

    Journal ref: Neural Networks 132 (2020) 506-520

  3. arXiv:1904.08378  [pdf, other]

    cs.LG cs.NE stat.ML

    Dynamic Evaluation of Transformer Language Models

    Authors: Ben Krause, Emmanuel Kahembwe, Iain Murray, Steve Renals

    Abstract: This research note combines two methods that have recently improved the state of the art in language modeling: Transformers and dynamic evaluation. Transformers use stacked layers of self-attention that allow them to capture long range dependencies in sequential data. Dynamic evaluation fits models to the recent sequence history, allowing them to assign higher probabilities to re-occurring sequent…

    Submitted 17 April, 2019; originally announced April 2019.

  4. arXiv:1809.06641  [pdf, other]

    cs.CL cs.AI

    Talking to myself: self-dialogues as data for conversational agents

    Authors: Joachim Fainberg, Ben Krause, Mihai Dobre, Marco Damonte, Emmanuel Kahembwe, Daniel Duma, Bonnie Webber, Federico Fancellu

    Abstract: Conversational agents are gaining popularity with the increasing ubiquity of smart devices. However, training agents in a data driven manner is challenging due to a lack of suitable corpora. This paper presents a novel method for gathering topical, unstructured conversational data in an efficient way: self-dialogues through crowd-sourcing. Alongside this paper, we include a corpus of 3.6 million w…

    Submitted 19 September, 2018; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: 5 pages, 5 pages appendix, 2 figures

  5. arXiv:1709.09816  [pdf, other]

    cs.CL cs.AI

    Edina: Building an Open Domain Socialbot with Self-dialogues

    Authors: Ben Krause, Marco Damonte, Mihai Dobre, Daniel Duma, Joachim Fainberg, Federico Fancellu, Emmanuel Kahembwe, Jianpeng Cheng, Bonnie Webber

    Abstract: We present Edina, the University of Edinburgh's social bot for the Amazon Alexa Prize competition. Edina is a conversational agent whose responses utilize data harvested from Amazon Mechanical Turk (AMT) through an innovative new technique we call self-dialogues. These are conversations in which a single AMT Worker plays both participants in a dialogue. Such dialogues are surprisingly natural, eff…

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: 10 pages; submitted to the 1st Proceedings of the Alexa Prize

  6. arXiv:1709.07432  [pdf, other]

    cs.NE cs.CL

    Dynamic Evaluation of Neural Sequence Models

    Authors: Ben Krause, Emmanuel Kahembwe, Iain Murray, Steve Renals

    Abstract: We present methodology for using dynamic evaluation to improve neural sequence models. Models are adapted to recent history via a gradient descent based mechanism, causing them to assign higher probabilities to re-occurring sequential patterns. Dynamic evaluation outperforms existing adaptation approaches in our comparisons. Dynamic evaluation improves the state-of-the-art word-level perplexities…

    Submitted 25 October, 2017; v1 submitted 21 September, 2017; originally announced September 2017.