Skip to main content

Showing 1–3 of 3 results for author: Rainwater, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.06842  [pdf, other

    cs.CV

    AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

    Authors: Kashu Yamazaki, Taisei Hanyu, Minh Tran, Adrian de Luis, Roy McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Jackson Cothren, Ngan Le

    Abstract: Aerial Image Segmentation is a top-down perspective semantic segmentation and has several challenging characteristics such as strong imbalance in the foreground-background distribution, complex background, intra-class heterogeneity, inter-class homogeneity, and tiny objects. To handle these problems, we inherit the advantages of Transformers and propose AerialFormer, which unifies Transformers at… ▽ More

    Submitted 1 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: under review

  2. arXiv:2206.12972  [pdf, other

    cs.CV

    VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

    Authors: Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

    Abstract: In this paper, we leverage the human perceiving process, that involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos. We propose vision-language (VL) features consisting of two modalities, i.e., (i) vision modality to capture global visual content of the entire scene and (ii) language modality to extract scene elements description of both human a… ▽ More

    Submitted 6 August, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: accepted by The 29th IEEE International Conference on Image Processing (IEEE ICIP) 2022

  3. arXiv:2108.03267  [pdf, other

    cs.CV

    BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu

    Abstract: Semantic segmentation aims to predict pixel-level labels. It has become a popular task in various computer vision applications. While fully supervised segmentation methods have achieved high accuracy on large-scale vision datasets, they are unable to generalize on a new test environment or a new domain well. In this work, we first introduce a new Un-aligned Domain Score to measure the efficiency o… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021