Skip to main content

Showing 1–7 of 7 results for author: Alsharif, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2005.01864  [pdf, other

    cs.CV

    Streaming Object Detection for 3-D Point Clouds

    Authors: Wei Han, Zhengdong Zhang, Benjamin Caine, Brandon Yang, Christoph Sprunk, Ouais Alsharif, Jiquan Ngiam, Vijay Vasudevan, Jonathon Shlens, Zhifeng Chen

    Abstract: Autonomous vehicles operate in a dynamic environment, where the speed with which a vehicle can perceive and react impacts the safety and efficacy of the system. LiDAR provides a prominent sensory modality that informs many existing perceptual systems including object detection, segmentation, motion estimation, and action recognition. The latency for perceptual systems based on point cloud data can… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  2. arXiv:1908.11069  [pdf, other

    cs.CV

    StarNet: Targeted Computation for Object Detection in Point Clouds

    Authors: Jiquan Ngiam, Benjamin Caine, Wei Han, Brandon Yang, Yuning Chai, Pei Sun, Yin Zhou, Xi Yi, Ouais Alsharif, Patrick Nguyen, Zhifeng Chen, Jonathon Shlens, Vijay Vasudevan

    Abstract: Detecting objects from LiDAR point clouds is an important component of self-driving car technology as LiDAR provides high resolution spatial information. Previous work on point-cloud 3D object detection has re-purposed convolutional approaches from traditional camera imagery. In this work, we present an object detection system called StarNet designed specifically to take advantage of the sparse an… ▽ More

    Submitted 2 December, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

  3. arXiv:1603.08042  [pdf, other

    cs.CL cs.LG cs.NE

    On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition

    Authors: Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw

    Abstract: We study the problem of compressing recurrent neural networks (RNNs). In particular, we focus on the compression of RNN acoustic models, which are motivated by the goal of building compact and accurate speech recognition systems which can be run efficiently on mobile devices. In this work, we present a technique for general recurrent model compression that jointly compresses both recurrent and non… ▽ More

    Submitted 2 May, 2016; v1 submitted 25 March, 2016; originally announced March 2016.

    Comments: Accepted in ICASSP 2016

  4. arXiv:1603.03185  [pdf, other

    cs.CL cs.LG cs.SD

    Personalized Speech recognition on mobile devices

    Authors: Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Francoise Beaufays, Carolina Parada

    Abstract: We describe a large vocabulary speech recognition system that is accurate, has low latency, and yet has a small enough memory and computational footprint to run faster than real-time on a Nexus 5 Android smartphone. We employ a quantized Long Short-Term Memory (LSTM) acoustic model trained with connectionist temporal classification (CTC) to directly predict phoneme targets, and further reduce its… ▽ More

    Submitted 11 March, 2016; v1 submitted 10 March, 2016; originally announced March 2016.

  5. arXiv:1412.4864  [pdf, other

    stat.ML cs.LG cs.NE

    Learning with Pseudo-Ensembles

    Authors: Philip Bachman, Ouais Alsharif, Doina Precup

    Abstract: We formalize the notion of a pseudo-ensemble, a (possibly infinite) collection of child models spawned from a parent model by perturbing it according to some noise process. E.g., dropout (Hinton et. al, 2012) in a deep neural network trains a pseudo-ensemble of child subnetworks generated by randomly masking nodes in the parent network. We present a novel regularizer based on making the behavior o… ▽ More

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: To appear in Advances in Neural Information Processing Systems 27 (NIPS 2014), Advances in Neural Information Processing Systems 27, Dec. 2014

  6. arXiv:1404.4108  [pdf, other

    cs.LG

    Representation as a Service

    Authors: Ouais Alsharif, Philip Bachman, Joelle Pineau

    Abstract: Consider a Machine Learning Service Provider (MLSP) designed to rapidly create highly accurate learners for a never-ending stream of new tasks. The challenge is to produce task-specific learners that can be trained from few labeled samples, even if tasks are not uniquely identified, and the number of tasks and input dimensionality are large. In this paper, we argue that the MLSP should exploit kno… ▽ More

    Submitted 9 July, 2014; v1 submitted 24 February, 2014; originally announced April 2014.

    Comments: 8 pages

  7. arXiv:1310.1811  [pdf, other

    cs.CV

    End-to-End Text Recognition with Hybrid HMM Maxout Models

    Authors: Ouais Alsharif, Joelle Pineau

    Abstract: The problem of detecting and recognizing text in natural scenes has proved to be more challenging than its counterpart in documents, with most of the previous work focusing on a single part of the problem. In this work, we propose new solutions to the character and word recognition problems and then show how to combine these solutions in an end-to-end text-recognition system. We do so by leveragin… ▽ More

    Submitted 7 October, 2013; originally announced October 2013.

    Comments: 9 pages, 7 figures