Skip to main content

Showing 1–8 of 8 results for author: Santos-Villalobos, H

.
  1. arXiv:2308.12962  [pdf, other

    cs.CV

    Motion-Guided Masking for Spatiotemporal Representation Learning

    Authors: David Fan, Jue Wang, Shuai Liao, Yi Zhu, Vimal Bhat, Hector Santos-Villalobos, Rohith MV, Xinyu Li

    Abstract: Several recent works have directly extended the image masked autoencoder (MAE) with random masking into video domain, achieving promising results. However, unlike images, both spatial and temporal information are important for video understanding. This suggests that the random masking strategy that is inherited from the image MAE is less effective for video MAE. This motivates the design of a nove… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  2. arXiv:2308.11185  [pdf, other

    cs.CV

    MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation

    Authors: Najmeh Sadoughi, Xinyu Li, Avijit Vajpayee, David Fan, Bing Shuai, Hector Santos-Villalobos, Vimal Bhat, Rohith MV

    Abstract: Previous research has studied the task of segmenting cinematic videos into scenes and into narrative acts. However, these studies have overlooked the essential task of multimodal alignment and fusion for effectively and efficiently processing long-form videos (>60min). In this paper, we introduce Multimodal alignmEnt aGgregation and distillAtion (MEGA) for cinematic long-video segmentation. MEGA t… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 accepted

  3. arXiv:2211.15859  [pdf, other

    eess.IV physics.med-ph

    Model-based Reconstruction for Multi-Frequency Collimated Beam Ultrasound Systems

    Authors: Abdulrahman M. Alanazi, Singanallur Venkatakrishnan, Hector Santos-Villalobos, Gregery T. Buzzard, Charles Bouman

    Abstract: Collimated beam ultrasound systems are a technology for imaging inside multi-layered structures such as geothermal wells. These systems work by using a collimated narrow-band ultrasound transmitter that can penetrate through multiple layers of heterogeneous material. A series of measurements can then be made at multiple transmit frequencies. However, commonly used reconstruction algorithms such as… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  4. arXiv:2211.01917  [pdf, other

    cs.CV cs.AI cs.LG

    Expanding Accurate Person Recognition to New Altitudes and Ranges: The BRIAR Dataset

    Authors: David Cornett III, Joel Brogan, Nell Barber, Deniz Aykac, Seth Baird, Nick Burchfield, Carl Dukes, Andrew Duncan, Regina Ferrell, Jim Goddard, Gavin Jager, Matt Larson, Bart Murphy, Christi Johnson, Ian Shelley, Nisha Srinivas, Brandon Stockwell, Leanne Thompson, Matt Yohe, Robert Zhang, Scott Dolvin, Hector J. Santos-Villalobos, David S. Bolme

    Abstract: Face recognition technology has advanced significantly in recent years due largely to the availability of large and increasingly complex training datasets for use in deep learning models. These datasets, however, typically comprise images scraped from news sites or social media platforms and, therefore, have limited utility in more advanced security, forensics, and military applications. These app… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  5. arXiv:2202.09703  [pdf, other

    eess.IV eess.SP

    Model-Based Reconstruction for Collimated Beam Ultrasound Systems

    Authors: Abdulrahman Alanazi, Singanallur Venkatakrishnan, Hector Santos-Villalobos, Gregery Buzzard, Charles Bouman

    Abstract: Collimated beam ultrasound systems are a novel technology for imaging inside multi-layered structures such as geothermal wells. Such systems include a transmitter and multiple receivers to capture reflected signals. Common algorithms for ultrasound reconstruction use delay-and-sum (DAS) approaches; these have low computational complexity but produce inaccurate images in the presence of complex str… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: ICASSP 2022

  6. arXiv:2002.12257  [pdf, other

    cs.CV cs.LG eess.IV

    The Mertens Unrolled Network (MU-Net): A High Dynamic Range Fusion Neural Network for Through the Windshield Driver Recognition

    Authors: Max Ruby, David S. Bolme, Joel Brogan, David Cornett III, Baldemar Delgado, Gavin Jager, Christi Johnson, Jose Martinez-Mendoza, Hector Santos-Villalobos, Nisha Srinivas

    Abstract: Face recognition of vehicle occupants through windshields in unconstrained environments poses a number of unique challenges ranging from glare, poor illumination, driver pose and motion blur. In this paper, we further develop the hardware and software components of a custom vehicle imaging system to better overcome these challenges. After the build out of a physical prototype system that performs… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted to SPEI Autonomous Systems: Sensors, Processing and Security for Vehicles & Infrastructure 2020

  7. arXiv:1808.03336  [pdf, other

    eess.IV

    Model-Based Iterative Reconstruction for One-Sided Ultrasonic Non-Destructive Evaluation

    Authors: Hani Almansouri, Singanallur Venkatakrishnan, Charles Bouman, Hector Santos-Villalobos

    Abstract: One-sided ultrasonic non-destructive evaluation (UNDE) is extensively used to characterize structures that need to be inspected and maintained from defects and flaws that could affect the performance of power plants, such as nuclear power plants. Most UNDE systems send acoustic pulses into the structure of interest, measure the received waveform and use an algorithm to reconstruct the quantity of… ▽ More

    Submitted 9 August, 2018; originally announced August 2018.

  8. arXiv:1807.01224  [pdf, other

    eess.IV eess.SP

    Deep neural networks for non-linear model-based ultrasound reconstruction

    Authors: Hani Almansouri, S. V. Venkatakrishnan, Gregery T. Buzzard, Charles A. Bouman, Hector Santos-Villalobos

    Abstract: Ultrasound reflection tomography is widely used to image large complex specimens that are only accessible from a single side, such as well systems and nuclear power plant containment walls. Typical methods for inverting the measurement rely on delay-and-sum algorithms that rapidly produce reconstructions but with significant artifacts. Recently, model-based reconstruction approaches using a linear… ▽ More

    Submitted 28 September, 2018; v1 submitted 3 July, 2018; originally announced July 2018.