Skip to main content

Showing 1–13 of 13 results for author: Baluja, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.00056  [pdf, other

    cs.CV cs.AI cs.LG

    Diversity and Diffusion: Observations on Synthetic Image Distributions with Stable Diffusion

    Authors: David Marwood, Shumeet Baluja, Yair Alon

    Abstract: Recent progress in text-to-image (TTI) systems, such as StableDiffusion, Imagen, and DALL-E 2, have made it possible to create realistic images with simple text prompts. It is tempting to use these systems to eliminate the manual task of obtaining natural images for training a new machine learning classifier. However, in all of the experiments performed to date, classifiers trained solely with syn… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  2. arXiv:2105.01768  [pdf, other

    cs.CV cs.GR cs.LG

    Texture for Colors: Natural Representations of Colors Using Variable Bit-Depth Textures

    Authors: Shumeet Baluja

    Abstract: Numerous methods have been proposed to transform color and grayscale images to their single bit-per-pixel binary counterparts. Commonly, the goal is to enhance specific attributes of the original image to make it more amenable for analysis. However, when the resulting binarized image is intended for human viewing, aesthetics must also be considered. Binarization techniques, such as half-toning, st… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  3. arXiv:1906.04798  [pdf, other

    cs.LG stat.ML

    Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free Inference

    Authors: Michele Covell, David Marwood, Shumeet Baluja, Nick Johnston

    Abstract: In this work, we propose to quantize all parts of standard classification networks and replace the activation-weight--multiply step with a simple table-based lookup. This approach results in networks that are free of floating-point operations and free of multiplications, suitable for direct FPGA and ASIC implementations. It also provides us with two simple measures of per-layer and network-wide co… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

  4. arXiv:1812.02831  [pdf, other

    cs.CV cs.NE

    Neural Image Decompression: Learning to Render Better Image Previews

    Authors: Shumeet Baluja, Dave Marwood, Nick Johnston, Michele Covell

    Abstract: A rapidly increasing portion of Internet traffic is dominated by requests from mobile devices with limited- and metered-bandwidth constraints. To satisfy these requests, it has become standard practice for websites to transmit small and extremely compressed image previews as part of the initial page-load process. Recent work, based on an adaptive triangulation of the target image, has shown the ab… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

  5. arXiv:1809.09244  [pdf, other

    cs.LG stat.ML

    No Multiplication? No Floating Point? No Problem! Training Networks for Efficient Inference

    Authors: Shumeet Baluja, David Marwood, Michele Covell, Nick Johnston

    Abstract: For successful deployment of deep neural networks on highly--resource-constrained devices (hearing aids, earbuds, wearables), we must simplify the types of operations and the memory/power resources used during inference. Completely avoiding inference-time floating-point operations is one of the simplest ways to design networks for these highly-constrained environments. By discretizing both our in-… ▽ More

    Submitted 28 September, 2018; v1 submitted 24 September, 2018; originally announced September 2018.

  6. arXiv:1809.02257  [pdf, other

    cs.CV

    Representing Images in 200 Bytes: Compression via Triangulation

    Authors: David Marwood, Pascal Massimino, Michele Covell, Shumeet Baluja

    Abstract: A rapidly increasing portion of internet traffic is dominated by requests from mobile devices with limited and metered bandwidth constraints. To satisfy these requests, it has become standard practice for websites to transmit small and extremely compressed image previews as part of the initial page load process to improve responsiveness. Increasing thumbnail compression beyond the capabilities of… ▽ More

    Submitted 20 September, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: IEEE ICIP 2018

  7. arXiv:1801.05156  [pdf, other

    cs.NE cs.AI cs.CV

    Empirical Explorations in Training Networks with Discrete Activations

    Authors: Shumeet Baluja

    Abstract: We present extensive experiments training and testing hidden units in deep networks that emit only a predefined, static, number of discretized values. These units provide benefits in real-world deployment in systems in which memory and/or computation may be limited. Additionally, they are particularly well suited for use in large recurrent network models that require the maintenance of large amoun… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  8. arXiv:1703.09387  [pdf, other

    cs.NE cs.AI cs.CV

    Adversarial Transformation Networks: Learning to Generate Adversarial Examples

    Authors: Shumeet Baluja, Ian Fischer

    Abstract: Multiple different approaches of generating adversarial examples have been proposed to attack deep neural networks. These approaches involve either directly computing gradients with respect to the image pixels, or directly solving an optimization on the image pixels. In this work, we present a fundamentally new method for generating adversarial examples that is fast to execute and provides excepti… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

  9. arXiv:1703.07394  [pdf, other

    cs.NE cs.AI cs.LG

    Deep Learning for Explicitly Modeling Optimization Landscapes

    Authors: Shumeet Baluja

    Abstract: In all but the most trivial optimization problems, the structure of the solutions exhibit complex interdependencies between the input parameters. Decades of research with stochastic search techniques has shown the benefit of explicitly modeling the interactions between sets of parameters and the overall quality of the solutions discovered. We demonstrate a novel method, based on learning deep netw… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

  10. arXiv:1702.01205  [pdf, other

    cs.AI cs.LG eess.SY

    Traffic Lights with Auction-Based Controllers: Algorithms and Real-World Data

    Authors: Shumeet Baluja, Michele Covell, Rahul Sukthankar

    Abstract: Real-time optimization of traffic flow addresses important practical problems: reducing a driver's wasted time, improving city-wide efficiency, reducing gas emissions and improving air quality. Much of the current research in traffic-light optimization relies on extending the capabilities of traffic lights to either communicate with each other or communicate with vehicles. However, before such cap… ▽ More

    Submitted 3 February, 2017; originally announced February 2017.

  11. arXiv:1603.04000  [pdf, other

    cs.CV cs.LG cs.NE

    Learning Typographic Style

    Authors: Shumeet Baluja

    Abstract: Typography is a ubiquitous art form that affects our understanding, perception, and trust in what we read. Thousands of different font-faces have been created with enormous variations in the characters. In this paper, we learn the style of a font by analyzing a small subset of only four letters. From these four letters, we learn two tasks. The first is a discrimination task: given the four letters… ▽ More

    Submitted 13 March, 2016; originally announced March 2016.

  12. arXiv:1512.00517  [pdf, other

    cs.CV

    Labeling the Features Not the Samples: Efficient Video Classification with Minimal Supervision

    Authors: Marius Leordeanu, Alexandra Radu, Shumeet Baluja, Rahul Sukthankar

    Abstract: Feature selection is essential for effective visual recognition. We propose an efficient joint classifier learning and feature selection method that discovers sparse, compact representations of input features from a vast sea of candidates, with an almost unsupervised formulation. Our method requires only the following knowledge, which we call the \emph{feature sign}---whether or not a particular f… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.

    Comments: arXiv admin note: text overlap with arXiv:1411.7714

  13. arXiv:1511.06085  [pdf, other

    cs.CV cs.LG cs.NE

    Variable Rate Image Compression with Recurrent Neural Networks

    Authors: George Toderici, Sean M. O'Malley, Sung ** Hwang, Damien Vincent, David Minnen, Shumeet Baluja, Michele Covell, Rahul Sukthankar

    Abstract: A large fraction of Internet traffic is now driven by requests from mobile devices with relatively small screens and often stringent bandwidth requirements. Due to these factors, it has become the norm for modern graphics-heavy websites to transmit low-resolution, low-bytecount image previews (thumbnails) as part of the initial page load process to improve apparent page responsiveness. Increasing… ▽ More

    Submitted 1 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Under review as a conference paper at ICLR 2016