Skip to main content

Showing 1–6 of 6 results for author: Kaeser-Chen, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.00763  [pdf, other

    cs.LG cs.AI cs.CL

    Collaborating with language models for embodied reasoning

    Authors: Ishita Dasgupta, Christine Kaeser-Chen, Kenneth Marino, Arun Ahuja, Sheila Babayan, Felix Hill, Rob Fergus

    Abstract: Reasoning in a complex and ambiguous environment is a key goal for Reinforcement Learning (RL) agents. While some sophisticated RL agents can successfully solve difficult tasks, they require a large amount of training data and often struggle to generalize to new unseen environments and new tasks. On the other hand, Large Scale Language Models (LSLMs) have exhibited strong reasoning ability and the… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Presented at NeurIPS 2022 Language and Reinforcement Learning Workshop (best paper) and NeurIPS 2022 Foundation Models for Decision Making Workshop. 4 pages main; 14 pages total (including references and appendix); 3 figures

  2. arXiv:2211.00177  [pdf, other

    cs.LG cs.IR cs.SI

    Learning to Navigate Wikipedia by Taking Random Walks

    Authors: Manzil Zaheer, Kenneth Marino, Will Grathwohl, John Schultz, Wendy Shang, Sheila Babayan, Arun Ahuja, Ishita Dasgupta, Christine Kaeser-Chen, Rob Fergus

    Abstract: A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this pa… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Journal ref: NeurIPS 2022

  3. arXiv:1909.04101  [pdf, other

    cs.CL cs.CV

    Neural Naturalist: Generating Fine-Grained Image Comparisons

    Authors: Maxwell Forbes, Christine Kaeser-Chen, Piyush Sharma, Serge Belongie

    Abstract: We introduce the new Birds-to-Words dataset of 41k sentences describing fine-grained differences between photographs of birds. The language collected is highly detailed, while remaining understandable to the everyday observer (e.g., "heart-shaped face," "squat body"). Paragraph-length descriptions naturally adapt to varying levels of taxonomic and visual distance---drawn from a novel stratified sa… ▽ More

    Submitted 13 November, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Published at EMNLP 2019

  4. arXiv:1906.00901  [pdf, other

    cs.CV

    The iMet Collection 2019 Challenge Dataset

    Authors: Chenyang Zhang, Christine Kaeser-Chen, Grace Vesom, Jennie Choi, Maria Kessler, Serge Belongie

    Abstract: Existing computer vision technologies in artwork recognition focus mainly on instance retrieval or coarse-grained attribute classification. In this work, we present a novel dataset for fine-grained artwork attribute recognition. The images in the dataset are professional photographs of classic artworks from the Metropolitan Museum of Art, and annotations are curated and verified by world-class mus… ▽ More

    Submitted 3 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 3 pages, 4 figures

  5. arXiv:1804.05870  [pdf, other

    cs.CV

    Egocentric 6-DoF Tracking of Small Handheld Objects

    Authors: Rohit Pandey, Pavel Pidlypenskyi, Shuoran Yang, Christine Kaeser-Chen

    Abstract: Virtual and augmented reality technologies have seen significant growth in the past few years. A key component of such systems is the ability to track the pose of head mounted displays and controllers in 3D space. We tackle the problem of efficient 6-DoF tracking of a handheld controller from egocentric camera perspectives. We collected the HMD Controller dataset which consist of over 540,000 ster… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

  6. arXiv:1712.04961  [pdf, other

    cs.CV

    Real-time Egocentric Gesture Recognition on Mobile Head Mounted Displays

    Authors: Rohit Pandey, Marie White, Pavel Pidlypenskyi, Xue Wang, Christine Kaeser-Chen

    Abstract: Mobile virtual reality (VR) head mounted displays (HMD) have become popular among consumers in recent years. In this work, we demonstrate real-time egocentric hand gesture detection and localization on mobile HMDs. Our main contributions are: 1) A novel mixed-reality data collection tool to automatic annotate bounding boxes and gesture labels; 2) The largest-to-date egocentric hand gesture and bou… ▽ More

    Submitted 13 December, 2017; originally announced December 2017.

    Comments: Extended Abstract NIPS 2017 Machine Learning on the Phone and other Consumer Devices Workshop