Skip to main content

Showing 51–67 of 67 results for author: Misra, I

.
  1. arXiv:1906.02659  [pdf, other

    cs.CV cs.LG

    Does Object Recognition Work for Everyone?

    Authors: Terrance DeVries, Ishan Misra, Changhan Wang, Laurens van der Maaten

    Abstract: The paper analyzes the accuracy of publicly available object-recognition systems on a geographically diverse dataset. This dataset contains household items and was designed to have a more representative geographical coverage than commonly used image datasets in object recognition. We find that the systems perform relatively poorly on household items that commonly occur in countries with a low hous… ▽ More

    Submitted 18 June, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  2. arXiv:1905.01235  [pdf, other

    cs.CV cs.AI cs.LG

    Scaling and Benchmarking Self-Supervised Visual Representation Learning

    Authors: Priya Goyal, Dhruv Mahajan, Abhinav Gupta, Ishan Misra

    Abstract: Self-supervised learning aims to learn representations from the data itself without explicit manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning - the ability to scale to large amount of data because self-supervision requires no manual labels. In this work, we revisit this principle and scale two popular self-supervised approaches to 100 million images. We show… ▽ More

    Submitted 6 June, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

  3. arXiv:1901.06595  [pdf, other

    cs.CV cs.AI cs.CL

    Evaluating Text-to-Image Matching using Binary Image Selection (BISON)

    Authors: Hexiang Hu, Ishan Misra, Laurens van der Maaten

    Abstract: Providing systems the ability to relate linguistic and visual content is one of the hallmarks of computer vision. Tasks such as text-based image retrieval and image captioning were designed to test this ability but come with evaluation measures that have a high variance or are difficult to interpret. We study an alternative task for systems that match text and images: given a text query, the syste… ▽ More

    Submitted 5 April, 2019; v1 submitted 19 January, 2019; originally announced January 2019.

  4. arXiv:1712.01238  [pdf, other

    cs.CV cs.CL cs.LG

    Learning by Asking Questions

    Authors: Ishan Misra, Ross Girshick, Rob Fergus, Martial Hebert, Abhinav Gupta, Laurens van der Maaten

    Abstract: We introduce an interactive learning framework for the development and testing of intelligent visual systems, called learning-by-asking (LBA). We explore LBA in context of the Visual Question Answering (VQA) task. LBA differs from standard VQA training in that most questions are not observed during training time, and the learner must ask questions it wants answers to. Thus, LBA more closely mimics… ▽ More

    Submitted 4 December, 2017; originally announced December 2017.

  5. arXiv:1708.01642  [pdf, other

    cs.CV

    Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

    Authors: Debidatta Dwibedi, Ishan Misra, Martial Hebert

    Abstract: A major impediment in rapidly deploying object detection models for instance detection is the lack of large annotated datasets. For example, finding a large labeled dataset containing instances in a particular kitchen is unlikely. Each new environment with new instances requires expensive data collection and annotation. In this paper, we propose a simple approach to generate large annotated instan… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

    Comments: To appear in ICCV 2017

  6. arXiv:1604.03968  [pdf, other

    cs.CL cs.AI cs.CV

    Visual Storytelling

    Authors: Ting-Hao, Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

    Abstract: We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The first release of this dataset, SIND v.1, includes 81,743 unique photos in 20,211 sequences, aligned to both descriptive (caption) and story language. We establish several strong baselines for the storytelling task, and motivate an automatic metric to benc… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: to appear in NAACL 2016

  7. arXiv:1604.03539  [pdf, other

    cs.CV cs.LG

    Cross-stitch Networks for Multi-task Learning

    Authors: Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert

    Abstract: Multi-task learning in Convolutional Networks has displayed remarkable success in the field of recognition. This success can be largely attributed to learning shared representations from multiple supervisory tasks. However, existing multi-task approaches rely on enumerating multiple network architectures specific to the tasks at hand, that do not generalize. In this paper, we propose a principled… ▽ More

    Submitted 12 April, 2016; originally announced April 2016.

    Comments: To appear in CVPR 2016 (Spotlight)

  8. arXiv:1603.08561  [pdf, other

    cs.CV cs.AI cs.LG

    Shuffle and Learn: Unsupervised Learning using Temporal Order Verification

    Authors: Ishan Misra, C. Lawrence Zitnick, Martial Hebert

    Abstract: In this paper, we present an approach for learning a visual representation from the raw spatiotemporal signals in videos. Our representation is learned without supervision from semantic labels. We formulate our method as an unsupervised sequential verification task, i.e., we determine whether a sequence of frames from a video is in the correct temporal order. With this simple task and no semantic… ▽ More

    Submitted 26 July, 2016; v1 submitted 28 March, 2016; originally announced March 2016.

    Comments: Accepted at ECCV 2016

  9. arXiv:1603.06059  [pdf, other

    cs.CL cs.AI cs.CV

    Generating Natural Questions About an Image

    Authors: Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, Xiaodong He, Lucy Vanderwende

    Abstract: There has been an explosion of work in the vision & language community during the past few years from image captioning to video transcription, and answering questions about images. These tasks have focused on literal descriptions of the image. To move beyond the literal, we choose to explore how questions about an image are often directed at commonsense inference and the abstract events evoked by… ▽ More

    Submitted 8 June, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics

  10. arXiv:1512.06974  [pdf, other

    cs.CV

    Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels

    Authors: Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross Girshick

    Abstract: When human annotators are given a choice about what to label in an image, they apply their own subjective judgments on what to ignore and what to mention. We refer to these noisy "human-centric" annotations as exhibiting human reporting bias. Examples of such annotations include image tags and keywords found on photo sharing sites, or in datasets containing image captions. In this paper, we use th… ▽ More

    Submitted 12 April, 2016; v1 submitted 22 December, 2015; originally announced December 2015.

    Comments: To appear in CVPR 2016

  11. arXiv:1505.05769  [pdf, other

    cs.CV

    Watch and Learn: Semi-Supervised Learning of Object Detectors from Videos

    Authors: Ishan Misra, Abhinav Shrivastava, Martial Hebert

    Abstract: We present a semi-supervised approach that localizes multiple unknown object instances in long videos. We start with a handful of labeled boxes and iteratively learn and label hundreds of thousands of object instances. We propose criteria for reliable object detection and tracking for constraining the semi-supervised learning process and minimizing semantic drift. Our approach does not assume exha… ▽ More

    Submitted 21 May, 2015; originally announced May 2015.

    Comments: To appear in CVPR 2015

  12. arXiv:1303.2171  [pdf, ps, other

    cs.DC

    CPU and/or GPU: Revisiting the GPU Vs. CPU Myth

    Authors: Kishore Kothapalli, Dip Sankar Banerjee, P. J. Narayanan, Surinder Sood, Aman Kumar Bahl, Shashank Sharma, Shrenik Lad, Krishna Kumar Singh, Kiran Matam, Sivaramakrishna Bharadwaj, Rohit Nigam, Parikshit Sakurikar, Aditya Deshpande, Ishan Misra, Siddharth Choudhary, Shubham Gupta

    Abstract: Parallel computing using accelerators has gained widespread research attention in the past few years. In particular, using GPUs for general purpose computing has brought forth several success stories with respect to time taken, cost, power, and other metrics. However, accelerator based computing has signifi- cantly relegated the role of CPUs in computation. As CPUs evolve and also offer matching c… ▽ More

    Submitted 9 March, 2013; originally announced March 2013.

    Comments: 20 pages

  13. Load Balancing with Reduced Unnecessary Handoff in Energy Efficient Macro/Femto-cell based BWA Networks

    Authors: Prasun Chowdhury, Anindita Kundu, Iti Saha Misra, Salil K Sanyal

    Abstract: The hierarchical macro/femto cell based BWA networks are observed to be quite promising for mobile operators as it improves their network coverage and capacity at the outskirt of the macro cell. However, this new technology introduces increased number of macro/femto handoff and wastage of electrical energy which in turn may affect the system performance. Users moving with high velocity or undergoi… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: 14 pages

    Journal ref: International Journal of Wireless & Mobile Networks (IJWMN) Vol. 4, No. 3, June 2012

  14. arXiv:1204.1614  [pdf

    cs.NI

    Cross Layer QoS Support Architecture with Integrated CAC and Scheduling Algorithms for WiMAX BWA Networks

    Authors: Prasun Chowdhury, Iti Saha Misra, Salil K Sanyal

    Abstract: In this paper, a new technique for cross layer design, based on present Eb/N0 (bit energy per noise density) ratio of the connections and target values of the Quality of Service (QoS) information parameters from MAC layer, is proposed to dynamically select the Modulation and Coding Scheme (MCS) at the PHY layer for WiMAX Broadband Wireless Access (BWA) networks. The QoS information parameter inclu… ▽ More

    Submitted 7 April, 2012; originally announced April 2012.

    Comments: 17 pages, (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 3, No. 1, 2012. arXiv admin note: text overlap with arXiv:1012.2518 and arXiv:1110.0147 by other authors

    Journal ref: (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 3, No. 1, 2012

  15. arXiv:1111.4036  [pdf

    cs.NI

    VoIP Call Optimization in Diverse Network Scenarios Using Learning Based State-Space Search Technique

    Authors: Tamal Chakraborty, Atri Mukhopadhyay, Iti Saha Misra, Salil Kumar Sanyal

    Abstract: A VoIP based call has stringent QoS requirements with respect to delay, jitter, loss, MOS and R-Factor. Various QoS mechanisms implemented to satisfy these requirements must be adaptive under diverse network scenarios and applied in proper sequence, otherwise they may conflict with each other. The objective of this paper is to address the problem of adaptive QoS maintenance and sequential executio… ▽ More

    Submitted 17 November, 2011; originally announced November 2011.

    Journal ref: International Journal of Wireless & Mobile Networks (IJWMN) Vol. 3, No. 5, 2011, 211-228

  16. A Fair and Efficient Packet Scheduling Scheme for IEEE 802.16 Broadband Wireless Access Systems

    Authors: Prasun Chowdhury, Iti Saha Misra

    Abstract: This paper proposes a fair and efficient QoS scheduling scheme for IEEE 802.16 BWA systems that satisfies both throughput and delay guarantee to various real and non-real time applications. The proposed QoS scheduling scheme is compared with an existing QoS scheduling scheme proposed in literature in recent past. Simulation results show that the proposed scheduling scheme can provide a tight QoS g… ▽ More

    Submitted 30 September, 2010; originally announced September 2010.

    Journal ref: International Journal of Ad hoc, Sensor & Ubiquitous Computing ( IJASUC ),2010

  17. arXiv:1003.1478  [pdf

    cs.NI

    Use of Service Curve for Resource Reservation in Wired-cum-Wireless Scenario

    Authors: Nitul Dutta, Iti Saha Misra

    Abstract: In a network, arrival process is converted into departure process through network elements. The departure process suffer propagation delay in the link, processing delay at the network elements like router and data loss due to buffer overflow or congestion. For providing guaranteed service resources need to be reserved before conversation takes place. To reserve such resources estimation of them ar… ▽ More

    Submitted 7 March, 2010; originally announced March 2010.

    Comments: Pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS February 2010, ISSN 1947 5500, http://sites.google.com/site/ijcsis/