Skip to main content

Showing 1–4 of 4 results for author: Karayev, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2103.06450  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Full Page Handwriting Recognition via Image to Sequence Extraction

    Authors: Sumeet S. Singh, Sergey Karayev

    Abstract: We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on Image to Sequence architecture, it can extract text present in an image and then sequence it correctly without imposing any constraints regarding orientation, layout and size of text and non-tex… ▽ More

    Submitted 26 June, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Appeared in ICDAR 2021

  2. arXiv:1408.5093  [pdf, other

    cs.CV cs.LG cs.NE

    Caffe: Convolutional Architecture for Fast Feature Embedding

    Authors: Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, Trevor Darrell

    Abstract: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models. The framework is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures. Caffe fits i… ▽ More

    Submitted 20 June, 2014; originally announced August 2014.

    Comments: Tech report for the Caffe software at http://github.com/BVLC/Caffe/

  3. arXiv:1404.1869  [pdf, other

    cs.CV

    DenseNet: Implementing Efficient ConvNet Descriptor Pyramids

    Authors: Forrest Iandola, Matt Moskewicz, Sergey Karayev, Ross Girshick, Trevor Darrell, Kurt Keutzer

    Abstract: Convolutional Neural Networks (CNNs) can provide accurate object classification. They can be extended to perform object detection by iterating over dense or selected proposed object regions. However, the runtime of such detectors scales as the total number and/or area of regions to examine per image, and training such detectors may be prohibitively slow. However, for some CNN classifier topologies… ▽ More

    Submitted 7 April, 2014; originally announced April 2014.

  4. Recognizing Image Style

    Authors: Sergey Karayev, Matthew Trentacoste, Helen Han, Aseem Agarwala, Trevor Darrell, Aaron Hertzmann, Holger Winnemoeller

    Abstract: The style of an image plays a significant role in how it is viewed, but style has received little attention in computer vision research. We describe an approach to predicting style of images, and perform a thorough evaluation of different image features for these tasks. We find that features learned in a multi-layer network generally perform best -- even when trained with object class (not style)… ▽ More

    Submitted 23 July, 2014; v1 submitted 14 November, 2013; originally announced November 2013.

    Journal ref: Proc. British Machine Vision Conference (BMVC) 2014