Skip to main content

Showing 1–9 of 9 results for author: Sankaran, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:1706.03824  [pdf, other

    cs.CL

    Attention-based Vocabulary Selection for NMT Decoding

    Authors: Baskaran Sankaran, Markus Freitag, Yaser Al-Onaizan

    Abstract: Neural Machine Translation (NMT) models usually use large target vocabulary sizes to capture most of the words in the target language. The vocabulary size is a big factor when decoding new sentences as the final softmax layer normalizes over all possible target words. To address this problem, it is widely common to restrict the target vocabulary with candidate lists based on the source sentence. U… ▽ More

    Submitted 12 June, 2017; originally announced June 2017.

    Comments: Submitted to Second Conference on Machine Translation (WMT-17); 7 pages

  2. arXiv:1702.01802  [pdf, ps, other

    cs.CL

    Ensemble Distillation for Neural Machine Translation

    Authors: Markus Freitag, Yaser Al-Onaizan, Baskaran Sankaran

    Abstract: Knowledge distillation describes a method for training a student network to perform better by learning from a stronger teacher network. Translating a sentence with an Neural Machine Translation (NMT) engine is time expensive and having a smaller model speeds up this process. We demonstrate how to transfer the translation quality of an ensemble and an oracle BLEU teacher network into a single NMT s… ▽ More

    Submitted 7 August, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

  3. arXiv:1608.02927  [pdf, other

    cs.CL

    Temporal Attention Model for Neural Machine Translation

    Authors: Baskaran Sankaran, Haitao Mi, Yaser Al-Onaizan, Abe Ittycheriah

    Abstract: Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues as has been observed in recent research. We propose a novel mechanism to address some of these limitations and improve the NMT attention. Specifically, our approach memorizes the alignments temporally (within each sentence) and modulates the attention with the accumulated temporal memory, as the decoder… ▽ More

    Submitted 9 August, 2016; originally announced August 2016.

    Comments: 8 pages

  4. arXiv:1606.04164  [pdf, ps, other

    cs.CL

    Zero-Resource Translation with Multi-Lingual Neural Machine Translation

    Authors: Orhan Firat, Baskaran Sankaran, Yaser Al-Onaizan, Fatos T. Yarman Vural, Kyunghyun Cho

    Abstract: In this paper, we propose a novel finetuning algorithm for the recently introduced multi-way, mulitlingual neural machine translate that enables zero-resource machine translation. When used together with novel many-to-one translation strategies, we empirically show that this finetuning algorithm allows the multi-way, multilingual model to translate a zero-resource language pair (1) as well as a si… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

  5. arXiv:1605.03148  [pdf, other

    cs.CL

    Coverage Embedding Models for Neural Machine Translation

    Authors: Haitao Mi, Baskaran Sankaran, Zhiguo Wang, Abe Ittycheriah

    Abstract: In this paper, we enhance the attention-based neural machine translation (NMT) by adding explicit coverage embedding models to alleviate issues of repeating and drop** translations in NMT. For each source word, our model starts with a full coverage embedding vector to track the coverage status, and then keeps updating it with neural networks as the translation goes. Experiments on the large-scal… ▽ More

    Submitted 29 August, 2016; v1 submitted 10 May, 2016; originally announced May 2016.

    Comments: 6 pages; In Proceddings of EMNLP 2016

  6. Interactive Perception: Leveraging Action in Perception and Perception in Action

    Authors: Jeannette Bohg, Karol Hausman, Bharath Sankaran, Oliver Brock, Danica Kragic, Stefan Schaal, Gaurav Sukhatme

    Abstract: Recent approaches in robotics follow the insight that perception is facilitated by interaction with the environment. These approaches are subsumed under the term of Interactive Perception (IP). It provides the following benefits: (i) interaction with the environment creates a rich sensory signal that would otherwise not be present and (ii) knowledge of the regularity in the combined space of senso… ▽ More

    Submitted 5 December, 2017; v1 submitted 13 April, 2016; originally announced April 2016.

    Comments: Equal contribution by first three authors

    Journal ref: IEEE Transactions on Robotics 33 (2017) 1273-1291

  7. arXiv:1505.01576  [pdf, other

    cs.LG

    Learning and Optimization with Submodular Functions

    Authors: Bharath Sankaran, Marjan Ghazvininejad, Xinran He, David Kale, Liron Cohen

    Abstract: In many naturally occurring optimization problems one needs to ensure that the definition of the optimization problem lends itself to solutions that are tractable to compute. In cases where exact solutions cannot be computed tractably, it is beneficial to have strong guarantees on the tractable approximate solutions. In order operate under these criterion most optimization problems are cast under… ▽ More

    Submitted 7 May, 2015; originally announced May 2015.

    Comments: Tech Report - USC Computer Science CS-599, Convex and Combinatorial Optimization

  8. arXiv:1503.06375  [pdf, other

    cs.RO

    Policy Learning with Hypothesis based Local Action Selection

    Authors: Bharath Sankaran, Jeannette Bohg, Nathan Ratliff, Stefan Schaal

    Abstract: For robots to be able to manipulate in unknown and unstructured environments the robot should be capable of operating under partial observability of the environment. Object occlusions and unmodeled environments are some of the factors that result in partial observability. A common scenario where this is encountered is manipulation in clutter. In the case that the robot needs to locate an object of… ▽ More

    Submitted 8 May, 2015; v1 submitted 21 March, 2015; originally announced March 2015.

    Comments: RLDM abstract

  9. arXiv:1309.5401  [pdf, other

    cs.RO cs.CV eess.SY

    Nonmyopic View Planning for Active Object Detection

    Authors: Nikolay Atanasov, Bharath Sankaran, Jerome Le Ny, George J. Pappas, Kostas Daniilidis

    Abstract: One of the central problems in computer vision is the detection of semantically important objects and the estimation of their pose. Most of the work in object detection has been based on single image processing and its performance is limited by occlusions and ambiguity in appearance and geometry. This paper proposes an active approach to object detection by controlling the point of view of a mobil… ▽ More

    Submitted 20 September, 2013; originally announced September 2013.

    Comments: 12 pages (two-column); 7 figures; 2 tables; Manuscript submitted to the IEEE Transactions on Robotics (TRO)