Showing 1–3 of 3 results for author: Blume, A

  1. arXiv:2402.02352  [pdf, other]

    cs.CV

    Region-Based Representations Revisited

    Authors: Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem

    Abstract: We investigate whether region-based representations are effective for recognition. Regions were once a mainstay in recognition approaches, but pixel and patch-based features are now used almost exclusively. We show that recent class-agnostic segmenters like SAM can be effectively combined with strong unsupervised representations like DINOv2 and used for a wide variety of tasks, including semantic…

    Submitted 9 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: CVPR 2024 Camera Ready; website: https://regionreps.web.illinois.edu/

  2. arXiv:2305.14647  [pdf, other]

    cs.CL

    Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation

    Authors: Qi Zeng, Mankeerat Sidhu, Ansel Blume, Hou Pong Chan, Lu Wang, Heng Ji

    Abstract: Opinions in scientific research papers can be divergent, leading to controversies among reviewers. However, most existing datasets for opinion summarization are centered around product reviews and assume that the analyzed opinions are non-controversial, failing to account for the variability seen in other contexts such as academic papers, political debates, or social media discussions. To address…

    Submitted 15 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: IJCAI 2024 AI4Research Workshop

  3. arXiv:2305.10683  [pdf, other]

    cs.CV cs.CL

    Paxion: Patching Action Knowledge in Video-Language Foundation Models

    Authors: Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji

    Abstract: Action knowledge involves the understanding of textual, visual, and temporal aspects of actions. We introduce the Action Dynamics Benchmark (ActionBench) containing two carefully designed probing tasks: Action Antonym and Video Reversal, which targets multimodal alignment capabilities and temporal understanding skills of the model, respectively. Despite recent video-language models' (VidLM) impres…

    Submitted 21 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 spotlight