Showing 1–17 of 17 results for author: Cisse, M

Searching in archive cs.
  1. Social media in the Global South: A Network Dataset of the Malian Twittersphere

    Authors: Daniel Thilo Schroeder, Mirjam de Bruijn, Luca Bruls, Mulatu Alemayehu Moges, Samba Dialimpa Badji, Noëmie Fritz, Modibo Galy Cisse, Johannes Langguth, Bruce Mutsvairo, Kristin Skare Orgeret

    Abstract: With the expansion of mobile communications infrastructure, social media usage in the Global South is surging. Compared to the Global North, populations of the Global South have had less prior experience with social media from stationary computers and wired Internet. Many countries are experiencing violent conflicts that have a profound effect on their societies. As a result, social networks devel…

    Submitted 24 October, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 12 pages, 4 figures

    Journal ref: Journal of Data Mining & Digital Humanities, 2023 (November 3, 2023) jdmdh:11246

  2. arXiv:2111.11828  [pdf, other]

    cs.LG cs.CV

    Variance Reduction in Deep Learning: More Momentum is All You Need

    Authors: Lionel Tondji, Sergii Kashubin, Moustapha Cisse

    Abstract: Variance reduction (VR) techniques have contributed significantly to accelerating learning with massive datasets in the smooth and strongly convex setting (Schmidt et al., 2017; Johnson & Zhang, 2013; Roux et al., 2012). However, such techniques have not yet met the same success in the realm of large-scale deep learning due to various factors such as the use of data augmentation or regularization…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 23 pages, 8 figures

  3. arXiv:2107.12283  [pdf, other]

    cs.CV

    Continental-Scale Building Detection from High Resolution Satellite Imagery

    Authors: Wojciech Sirko, Sergii Kashubin, Marvin Ritter, Abigail Annkah, Yasser Salah Eddine Bouchareb, Yann Dauphin, Daniel Keysers, Maxim Neumann, Moustapha Cisse, John Quinn

    Abstract: Identifying the locations and footprints of buildings is vital for many practical and scientific purposes. Such information can be particularly useful in developing regions where alternative data sources may be scarce. In this work, we describe a model training pipeline for detecting buildings across the entire continent of Africa, using 50 cm satellite imagery. Starting with the U-Net model, wide…

    Submitted 29 July, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

  4. arXiv:2006.13485  [pdf, other]

    cs.LG stat.ML

    Fairness with Overlapping Groups

    Authors: Forest Yang, Moustapha Cisse, Sanmi Koyejo

    Abstract: In algorithmically fair prediction problems, a standard goal is to ensure the equality of fairness metrics across multiple overlapping groups simultaneously. We reconsider this standard fair classification problem using a probabilistic population analysis, which, in turn, reveals the Bayes-optimal classifier. Our approach unifies a variety of existing group-fair classification methods and enables…

    Submitted 24 June, 2020; originally announced June 2020.

  5. arXiv:2006.06049  [pdf, other]

    cs.LG stat.ML

    On Mixup Regularization

    Authors: Luigi Carratino, Moustapha Cissé, Rodolphe Jenatton, Jean-Philippe Vert

    Abstract: Mixup is a data augmentation technique that creates new examples as convex combinations of training points and labels. This simple technique has been shown empirically to improve the accuracy of many state-of-the-art models in different settings and applications, but the reasons behind this empirical success remain poorly understood. In this paper we take a substantial step in explaining the theoretica…

    Submitted 17 October, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

  6. arXiv:2005.02827  [pdf]

    cs.CL

    Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique

    Authors: El hadji M. Fall, El hadji M. Nguer, Bao Diop Sokhna, Mouhamadou Khoule, Mathieu Mangeot, Mame T. Cisse

    Abstract: The national languages of Senegal, like those of West African countries in general, are written with two alphabets: the Latin alphabet, which draws its strength from official decrees, and the completed Arabic script (Ajami), widespread and well integrated, which has little institutional support. This digraphia has created two worlds that ignore each other. Indeed, Ajami writing is generally used daily by popula…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: in French. TAlaf TALN 2016

  7. arXiv:1802.04633  [pdf, ps, other]

    cs.LG

    Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring

    Authors: Yossi Adi, Carsten Baum, Moustapha Cisse, Benny Pinkas, Joseph Keshet

    Abstract: Deep Neural Networks have recently gained lots of success after enabling several breakthroughs in notoriously challenging problems. Training these networks is computationally expensive and requires vast amounts of training data. Selling such pre-trained models can, therefore, be a lucrative business model. Unfortunately, once the models are sold they can be easily copied and redistributed. To avoi…

    Submitted 11 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

  8. arXiv:1801.03339  [pdf, other]

    cs.LG cs.CL

    Fooling End-to-end Speaker Verification by Adversarial Examples

    Authors: Felix Kreuk, Yossi Adi, Moustapha Cisse, Joseph Keshet

    Abstract: Automatic speaker verification systems are increasingly used as the primary means to authenticate customers. Recently, it has been proposed to train speaker verification systems using end-to-end deep neural models. In this paper, we show that such systems are vulnerable to adversarial example attacks. Adversarial examples are generated by adding a peculiar noise to original speaker examples, in suc…

    Submitted 16 February, 2018; v1 submitted 10 January, 2018; originally announced January 2018.

  9. arXiv:1711.11443  [pdf, other]

    cs.LG cs.AI cs.CV cs.CY stat.ML

    ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering Biases

    Authors: Pierre Stock, Moustapha Cisse

    Abstract: ConvNets and ImageNet have driven the recent success of deep learning for image classification. However, the marked slowdown in performance improvement combined with the lack of robustness of neural networks to adversarial examples and their tendency to exhibit undesirable biases question the reliability of these methods. This work investigates these questions from the perspective of the end-user…

    Submitted 20 July, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: ECCV 2018 camera-ready

  10. arXiv:1711.02604  [pdf, other]

    cs.LG cs.CL

    Unbounded cache model for online language modeling with open vocabulary

    Authors: Edouard Grave, Moustapha Cisse, Armand Joulin

    Abstract: Recently, continuous cache models were proposed as extensions to recurrent neural network language models, to adapt their predictions to local changes in the data distribution. These models only capture the local context, of up to a few thousand tokens. In this paper, we propose an extension of continuous cache models, which can scale to larger contexts. In particular, we use a large scale non-pa…

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: Accepted to NIPS 2017

  11. arXiv:1711.00117  [pdf, other]

    cs.CV

    Countering Adversarial Images using Input Transformations

    Authors: Chuan Guo, Mayank Rana, Moustapha Cisse, Laurens van der Maaten

    Abstract: This paper investigates strategies that defend against adversarial-example attacks on image-classification systems by transforming the inputs before feeding them to the system. Specifically, we study applying image transformations such as bit-depth reduction, JPEG compression, total variance minimization, and image quilting before feeding the image to a convolutional network classifier. Our experi…

    Submitted 25 January, 2018; v1 submitted 31 October, 2017; originally announced November 2017.

    Comments: 12 pages, 6 figures, submitted to ICLR 2018

  12. arXiv:1710.09412  [pdf, other]

    cs.LG stat.ML

    mixup: Beyond Empirical Risk Minimization

    Authors: Hongyi Zhang, Moustapha Cisse, Yann N. Dauphin, David Lopez-Paz

    Abstract: Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear…

    Submitted 27 April, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: ICLR camera ready version. Changes vs V1: fix repo URL; add ablation studies; add mixup + dropout etc

  13. arXiv:1707.05373  [pdf, other]

    stat.ML cs.AI cs.CR cs.CV cs.LG

    Houdini: Fooling Deep Structured Prediction Models

    Authors: Moustapha Cisse, Yossi Adi, Natalia Neverova, Joseph Keshet

    Abstract: Generating adversarial examples is a critical step for evaluating and improving the robustness of learning machines. So far, most existing methods only work for classification and are not designed to alter the true performance measure of the problem at hand. We introduce a novel flexible approach named Houdini for generating adversarial examples specifically tailored for the final performance meas…

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: 12 pages, 8 figures, under review

  14. arXiv:1705.10142  [pdf, other]

    cs.LG

    Kronecker Recurrent Units

    Authors: Cijo Jose, Moustapha Cisse, Francois Fleuret

    Abstract: Our work addresses two important issues with recurrent neural networks: (1) they are over-parameterized, and (2) the recurrence matrix is ill-conditioned. The former increases the sample complexity of learning and the training time. The latter causes the vanishing and exploding gradient problem. We present a flexible recurrent neural network model called Kronecker Recurrent Units (KRU). KRU achiev…

    Submitted 31 December, 2017; v1 submitted 29 May, 2017; originally announced May 2017.

  15. arXiv:1704.08847  [pdf, other]

    stat.ML cs.AI cs.CR cs.LG

    Parseval Networks: Improving Robustness to Adversarial Examples

    Authors: Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier

    Abstract: We introduce Parseval networks, a form of deep neural networks in which the Lipschitz constant of linear, convolutional and aggregation layers is constrained to be smaller than 1. Parseval networks are empirically and theoretically motivated by an analysis of the robustness of the predictions made by deep neural networks when their input is subject to an adversarial perturbation. The most importan…

    Submitted 1 May, 2017; v1 submitted 28 April, 2017; originally announced April 2017.

    Comments: submitted

  16. arXiv:1609.04309  [pdf, other]

    cs.CL cs.LG

    Efficient softmax approximation for GPUs

    Authors: Edouard Grave, Armand Joulin, Moustapha Cissé, David Grangier, Hervé Jégou

    Abstract: We propose an approximate strategy to efficiently train neural network based language models over very large vocabularies. Our approach, called adaptive softmax, circumvents the linear dependency on the vocabulary size by exploiting the unbalanced word distribution to form clusters that explicitly minimize the expectation of computation time. Our approach further reduces the computational time by…

    Submitted 19 June, 2017; v1 submitted 14 September, 2016; originally announced September 2016.

    Comments: Accepted to ICML 2017

  17. arXiv:1009.3589  [pdf, other]

    cs.LG cs.CV cs.NE

    Deep Self-Taught Learning for Handwritten Character Recognition

    Authors: Frédéric Bastien, Yoshua Bengio, Arnaud Bergeron, Nicolas Boulanger-Lewandowski, Thomas Breuel, Youssouf Chherawala, Moustapha Cisse, Myriam Côté, Dumitru Erhan, Jeremy Eustache, Xavier Glorot, Xavier Muller, Sylvain Pannetier Lebeuf, Razvan Pascanu, Salah Rifai, Francois Savard, Guillaume Sicard

    Abstract: Recent theoretical and empirical work in statistical machine learning has demonstrated the importance of learning algorithms for deep architectures, i.e., function classes obtained by composing multiple non-linear transformations. Self-taught learning (exploiting unlabeled examples or examples from other distributions) has already been applied to deep learners, but mostly to show the advantage of…

    Submitted 18 September, 2010; originally announced September 2010.

    Report number: 1353, Dept. IRO, U. Montreal; MSC Class: 68T05; ACM Class: I.2.6