Showing 1–5 of 5 results for author: Shapira, L

  1. arXiv:2312.12468  [pdf, other]

    cs.CV

    MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

    Authors: Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu, Zhipeng Fan, Yuchao Gu, Wenliang Zhao, Lior Shapira, Xiaohui Xie

    Abstract: Recent advances in generative AI have significantly enhanced image and video editing, particularly in the context of text prompt control. State-of-the-art approaches predominantly rely on diffusion models to accomplish these tasks. However, the computational demands of diffusion-based methods are substantial, often necessitating large-scale paired datasets for training, and therefore challenging t…

    Submitted 2 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  2. arXiv:2104.12727  [pdf, other]

    cs.CV

    2.5D Visual Relationship Detection

    Authors: Yu-Chuan Su, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay, Cho-Jui Hsieh, Lior Shapira, Radu Soricut, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang, Boqing Gong

    Abstract: Visual 2.5D perception involves understanding the semantics and geometry of a scene through reasoning about object relationships with respect to the viewer in an environment. However, existing works in visual recognition primarily focus on the semantics. To bridge this gap, we study 2.5D visual relationship detection (2.5VRD), in which the goal is to jointly detect objects and predict their relati…

    Submitted 26 April, 2021; originally announced April 2021.

  3. arXiv:2104.07608  [pdf, other]

    cs.CV

    Camera View Adjustment Prediction for Improving Image Composition

    Authors: Yu-Chuan Su, Raviteja Vemulapalli, Ben Weiss, Chun-Te Chu, Philip Andrew Mansfield, Lior Shapira, Colvin Pitts

    Abstract: Image composition plays an important role in the quality of a photo. However, not every camera user possesses the knowledge and expertise required for capturing well-composed photos. While post-capture cropping can improve the composition sometimes, it does not work in many common scenarios in which the photographer needs to adjust the camera view to capture the best shot. To address this issue, w…

    Submitted 15 April, 2021; originally announced April 2021.

  4. arXiv:2012.06985  [pdf, other]

    cs.CV cs.AI cs.LG

    Contrastive Learning for Label-Efficient Semantic Segmentation

    Authors: Xiangyun Zhao, Raviteja Vemulapalli, Philip Mansfield, Boqing Gong, Bradley Green, Lior Shapira, Ying Wu

    Abstract: Collecting labeled data for the task of semantic segmentation is expensive and time-consuming, as it requires dense pixel-level annotations. While recent Convolutional Neural Network (CNN) based semantic segmentation approaches have achieved impressive results by using large amounts of labeled training data, their performance drops significantly as the amount of labeled data decreases. This happen…

    Submitted 18 August, 2021; v1 submitted 13 December, 2020; originally announced December 2020.

    Comments: International Conference on Computer Vision (ICCV), 2021

  5. arXiv:1512.01515  [pdf, other]

    cs.CV

    ASIST: Automatic Semantically Invariant Scene Transformation

    Authors: Or Litany, Tal Remez, Daniel Freedman, Lior Shapira, Alex Bronstein, Ran Gal

    Abstract: We present ASIST, a technique for transforming point clouds by replacing objects with their semantically equivalent counterparts. Transformations of this kind have applications in virtual reality, repair of fused scans, and robotics. ASIST is based on a unified formulation of semantic labeling and object replacement; both result from minimizing a single objective. We present numerical tools for th…

    Submitted 4 December, 2015; originally announced December 2015.