Skip to main content

Showing 1–13 of 13 results for author: Hollerer, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08973  [pdf, other

    cs.CV cs.AI cs.CL

    OCTO+: A Suite for Automatic Open-Vocabulary Object Placement in Mixed Reality

    Authors: Aditya Sharma, Luke Yoffe, Tobias Höllerer

    Abstract: One key challenge in Augmented Reality is the placement of virtual content in natural locations. Most existing automated techniques can only work with a closed-vocabulary, fixed set of objects. In this paper, we introduce and evaluate several methods for automatic object placement using recent advances in open-vocabulary vision-language models. Through a multifaceted evaluation, we identify a new… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIXVR)

  2. arXiv:2312.12815  [pdf, other

    cs.CV cs.AI cs.CL

    OCTOPUS: Open-vocabulary Content Tracking and Object Placement Using Semantic Understanding in Mixed Reality

    Authors: Luke Yoffe, Aditya Sharma, Tobias Höllerer

    Abstract: One key challenge in augmented reality is the placement of virtual content in natural locations. Existing automated techniques are only able to work with a closed-vocabulary, fixed set of objects. In this paper, we introduce a new open-vocabulary method for object placement. Our eight-stage pipeline leverages recent advances in segmentation models, vision-language models, and LLMs to place any vir… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2023

  3. arXiv:2204.10356  [pdf, other

    cs.CV astro-ph.IM cs.HC

    Interactive Segmentation and Visualization for Tiny Objects in Multi-megapixel Images

    Authors: Chengyuan Xu, Boning Dong, Noah Stier, Curtis McCully, D. Andrew Howell, Pradeep Sen, Tobias Höllerer

    Abstract: We introduce an interactive image segmentation and visualization framework for identifying, inspecting, and editing tiny objects (just a few pixels wide) in large multi-megapixel high-dynamic-range (HDR) images. Detecting cosmic rays (CRs) in astronomical observations is a cumbersome workflow that requires multiple tools, so we developed an interactive toolkit that unifies model inference, HDR ima… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 6 pages, 4 figures. Accepted by CVPR 2022 Demo Program

    ACM Class: I.4

  4. arXiv:2112.00236  [pdf, other

    cs.CV cs.LG

    VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion

    Authors: Noah Stier, Alexander Rich, Pradeep Sen, Tobias Höllerer

    Abstract: Recent volumetric 3D reconstruction methods can produce very accurate results, with plausible geometry even for unobserved surfaces. However, they face an undesirable trade-off when it comes to multi-view fusion. They can fuse all available view information by global averaging, thus losing fine detail, or they can heuristically cluster views for local fusion, thus restricting their ability to cons… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 3DV 2021

  5. arXiv:2112.00202  [pdf, other

    cs.CV

    3DVNet: Multi-View Depth Prediction and Volumetric Refinement

    Authors: Alexander Rich, Noah Stier, Pradeep Sen, Tobias Höllerer

    Abstract: We present 3DVNet, a novel multi-view stereo (MVS) depth-prediction method that combines the advantages of previous depth-based and volumetric MVS approaches. Our key idea is the use of a 3D scene-modeling network that iteratively updates a set of coarse depth predictions, resulting in highly accurate predictions which agree on the underlying scene geometry. Unlike existing depth-prediction techni… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 10 pages, 6 figures, 3 tables. Accepted to 3DV 2021

  6. arXiv:2111.11992  [pdf, ps, other

    cs.CV cs.LG

    Sparse Fusion for Multimodal Transformers

    Authors: Yi Ding, Alex Rich, Mason Wang, Noah Stier, Matthew Turk, Pradeep Sen, Tobias Höllerer

    Abstract: Multimodal classification is a core task in human-centric machine learning. We observe that information is highly complementary across modalities, thus unimodal information can be drastically sparsified prior to multimodal fusion without loss of accuracy. To this end, we present Sparse Fusion Transformers (SFT), a novel multimodal fusion method for transformers that performs comparably to existing… ▽ More

    Submitted 24 November, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: 11 pages, 4 figures, 5 tables, Yi Ding and Alex Rich contributed equally

  7. arXiv:2107.02965  [pdf, other

    cs.HC

    Telelife: The Future of Remote Living

    Authors: Jason Orlosky, Misha Sra, Kenan Bektaş, Huaishu Peng, Jeeeun Kim, Nataliya Kos'myna, Tobias Hollerer, Anthony Steed, Kiyoshi Kiyokawa, Kaan Akşit

    Abstract: In recent years, everyday activities such as work and socialization have steadily shifted to more remote and virtual settings. With the COVID-19 pandemic, the switch from physical to virtual has been accelerated, which has substantially affected various aspects of our lives, including business, education, commerce, healthcare, and personal life. This rapid and large-scale switch from in-person to… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  8. arXiv:2103.02130  [pdf, other

    cs.CV

    Augmentation Strategies for Learning with Noisy Labels

    Authors: Kento Nishi, Yi Ding, Alex Rich, Tobias Höllerer

    Abstract: Imperfect labels are ubiquitous in real-world datasets. Several recent successful methods for training deep neural networks (DNNs) robust to label noise have used two primary techniques: filtering samples based on loss during a warm-up phase to curate an initial set of cleanly labeled samples, and using the output of a network as a pseudo-label for subsequent loss calculations. In this paper, we e… ▽ More

    Submitted 1 April, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  9. ARbis Pictus: A Study of Language Learning with Augmented Reality

    Authors: Adam Ibrahim, Brandon Huynh, Jonathan Downey, Tobias Höllerer, Dorothy Chun, John O'Donovan

    Abstract: This paper describes "ARbis Pictus" --a novel system for immersive language learning through dynamic labeling of real-world objects in augmented reality. We describe a within-subjects lab-based study (N=52) that explores the effect of our system on participants learning nouns in an unfamiliar foreign language, compared to a traditional flashcard-based approach. Our results show that the immersive… ▽ More

    Submitted 17 June, 2019; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: TVCG version

    Journal ref: IEEE Transactions on Visualization and Computer Graphics ( Volume: 24 , Issue: 11 , Nov. 2018 )

  10. Automated Assistants to Identify and Prompt Action on Visual News Bias

    Authors: Vishwajeet Narwal, Mohamed Hashim Salih, Jose Angel Lopez, Angel Ortega, John O'Donovan, Tobias Höllerer, Saiph Savage

    Abstract: Bias is a common problem in today's media, appearing frequently in text and in visual imagery. Users on social media websites such as Twitter need better methods for identifying bias. Additionally, activists --those who are motivated to effect change related to some topic, need better methods to identify and counteract bias that is contrary to their mission. With both of these use cases in mind, i… ▽ More

    Submitted 10 March, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Comments: 6 pages, 6 figures, (Accepted) CHI 17 Extended Abstracts, May 06-11, 2017, Denver, CO, USA

    ACM Class: K.4.2

  11. arXiv:1607.03949  [pdf, other

    cs.CV

    Large Scale SfM with the Distributed Camera Model

    Authors: Chris Sweeney, Victor Fragoso, Tobias Hollerer, Matthew Turk

    Abstract: We introduce the distributed camera model, a novel model for Structure-from-Motion (SfM). This model describes image observations in terms of light rays with ray origins and directions rather than pixels. As such, the proposed model is capable of describing a single camera or multiple cameras simultaneously as the collection of all light rays observed. We show how the distributed camera model is a… ▽ More

    Submitted 30 November, 2016; v1 submitted 13 July, 2016; originally announced July 2016.

    Comments: Published at 2016 3DV Conference

  12. Botivist: Calling Volunteers to Action Using Online Bots

    Authors: Saiph Savage, Andres Monroy-Hernandez, Tobias Hollerer

    Abstract: To help activists call new volunteers to action, we present Botivist: a platform that uses Twitter bots to find potential volunteers and request contributions. By leveraging different Twitter accounts, Botivist employs different strategies to encourage participation. We explore how people respond to bots calling them to action using a test case about corruption in Latin America. Our results show t… ▽ More

    Submitted 20 September, 2015; originally announced September 2015.

    Comments: 9 pages, 3 figures, CSCW'16

    ACM Class: H.5.2

  13. arXiv:1509.01095  [pdf, other

    cs.SI cs.CY

    Tag Me Maybe: Perceptions of Public Targeted Sharing on Facebook

    Authors: Saiph Savage, Andres Monroy-Hernandez, Kasturi Bhattacharjee, Tobias Hollerer

    Abstract: Social network sites allow users to publicly tag people in their posts. These tagged posts allow users to share to both the general public and a targeted audience, dynamically assembled via notifications that alert the people mentioned. We investigate people's perceptions of this mixed sharing mode through a qualitative study with 120 participants. We found that individuals like this sharing modal… ▽ More

    Submitted 3 September, 2015; originally announced September 2015.

    Comments: 5 pages, one figure, Hypertext 2016

    ACM Class: H.5.3