Showing 1–15 of 15 results for author: Hamid, R

  1. arXiv:2304.13166  [pdf, other]

    cs.CV

    LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization

    Authors: Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid

    Abstract: We present a simple yet effective self-supervised pre-training method for image harmonization which can leverage large-scale unannotated image datasets. To achieve this goal, we first generate pre-training data online with our Label-Efficient Masked Region Transform (LEMaRT) pipeline. Given an image, LEMaRT generates a foreground mask and then applies a set of transformations to perturb various vi…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR'23, 19 pages

  2. arXiv:2303.14526  [pdf, other]

    cs.CV

    Selective Structured State-Spaces for Long-Form Video Understanding

    Authors: Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid

    Abstract: Effective modeling of complex spatiotemporal dependencies in long-form videos remains an open problem. The recently proposed Structured State-Space Sequence (S4) model with its linear complexity offers a promising direction in this space. However, we demonstrate that treating all image-tokens equally as done by S4 model can adversely affect its efficiency and accuracy. To address this limitation,…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  3. arXiv:2206.08429  [pdf, other]

    cs.CV

    Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

    Authors: Xiang Hao, Jingxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid

    Abstract: To help customers make better-informed viewing choices, video-streaming services try to moderate their content and provide more visibility into which portions of their movies and TV episodes contain age-appropriate material (e.g., nudity, sex, violence, or drug-use). Supervised models to localize these sensitive activities require large amounts of clip-level labeled data which is hard to obtain, w…

    Submitted 16 June, 2022; originally announced June 2022.

  4. arXiv:2204.04588  [pdf, other]

    cs.CV cs.LG

    Robust Cross-Modal Representation Learning with Progressive Self-Distillation

    Authors: Alex Andonian, Shixing Chen, Raffay Hamid

    Abstract: The learning objective of vision-language approach of CLIP does not effectively account for the noisy many-to-many correspondences found in web-harvested image captioning datasets, which contributes to its compute and data inefficiency. To address this challenge, we introduce a novel training framework based on cross-modal contrastive learning that uses progressive self-distillation and soft image…

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  5. arXiv:2204.02509  [pdf, other]

    cs.CV

    Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows

    Authors: Sheng Liu, Xiaohan Nie, Raffay Hamid

    Abstract: Existing approaches for Structure from Motion (SfM) produce impressive 3-D reconstruction results especially when using imagery captured with large parallax. However, to create engaging video-content in movies and TV shows, the amount by which a camera can be moved while filming a particular shot is often limited. The resulting small-motion parallax between video frames makes standard geometry-bas…

    Submitted 5 April, 2022; originally announced April 2022.

  6. arXiv:2202.10650  [pdf, other]

    cs.CV

    Movies2Scenes: Using Movie Metadata to Learn Scene Representation

    Authors: Shixing Chen, Chun-Hao Liu, Xiang Hao, Xiaohan Nie, Maxim Arap, Raffay Hamid

    Abstract: Understanding scenes in movies is crucial for a variety of applications such as video moderation, search, and recommendation. However, labeling individual scenes is a time-consuming process. In contrast, movie level metadata (e.g., genre, synopsis, etc.) regularly gets produced as part of the film production process, and is therefore significantly more commonly available. In this work, we propose…

    Submitted 29 March, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to CVPR 2023

  7. arXiv:2105.08506  [pdf, other]

    eess.IV cs.CV cs.LG

    COVID-19 Detection in Computed Tomography Images with 2D and 3D Approaches

    Authors: Sara Atito Ali Ahmed, Mehmet Can Yavuz, Mehmet Umut Sen, Fatih Gulsen, Onur Tutar, Bora Korkmazer, Cesur Samanci, Sabri Sirolu, Rauf Hamid, Ali Ergun Eryurekli, Toghrul Mammadov, Berrin Yanikoglu

    Abstract: Detecting COVID-19 in computed tomography (CT) or radiography images has been proposed as a supplement to the definitive RT-PCR test. We present a deep learning ensemble for detecting COVID-19 infection, combining slice-based (2D) and volume-based (3D) approaches. The 2D system detects the infection on each CT slice independently, combining them to obtain the patient-level decision via different m…

    Submitted 20 May, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

  8. arXiv:2104.13537  [pdf, other]

    cs.CV

    Shot Contrastive Self-Supervised Learning for Scene Boundary Detection

    Authors: Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid

    Abstract: Scenes play a crucial role in breaking the storyline of movies and TV episodes into semantically cohesive parts. However, given their complex temporal structure, finding scene boundaries can be a challenging task requiring large amounts of labeled training data. To address this challenge, we present a self-supervised shot contrastive learning approach (ShotCoL) to learn a shot representation that…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  9. arXiv:1906.11495  [pdf, other]

    physics.atom-ph

    Guidelines for developing optical clocks with $10^{-18}$ fractional frequency uncertainty

    Authors: Moustafa Abdel-Hafiz, Piotr Ablewski, Ali Al-Masoudi, Héctor Álvarez Martínez, Petr Balling, Geoffrey Barwood, Erik Benkler, Marcin Bober, Mateusz Borkowski, William Bowden, Roman Ciuryło, Hubert Cybulski, Alexandre Didier, Miroslav Doležal, Sören Dörscher, Stephan Falke, Rachel M. Godun, Ramiz Hamid, Ian R. Hill, Richard Hobson, Nils Huntemann, Yann Le Coq, Rodolphe Le Targat, Thomas Legero, Thomas Lindvall, et al. (20 additional authors not shown)

    Abstract: There has been tremendous progress in the performance of optical frequency standards since the first proposals to carry out precision spectroscopy on trapped, single ions in the 1970s. The estimated fractional frequency uncertainty of today's leading optical standards is currently in the $10^{-18}$ range, approximately two orders of magnitude better than that of the best caesium primary frequency…

    Submitted 13 August, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 130 pages, Table 5.3 corrected in v2

  10. Tailored design of mode-locking dynamics for low-noise frequency comb generation

    Authors: Çağrı Şenel, Ramiz Hamid, Cihangir Erdoğan, Mehmet Çelik, Fatih Ömer Ilday

    Abstract: We report on a mode-locked laser design using Yb-doped fiber lasers for low-noise frequency-comb generation. The frequency comb covers the spectral range from $700$ to $1400$ nm. Although this range is more practical for many measurements than that produced by the more commonly used Er-fiber lasers, it has been addressed in only a handful of reports, mainly due to the difficulty of generating a fu…

    Submitted 3 May, 2019; originally announced May 2019.

    Journal ref: Phys. Rev. Applied, vol. 10, no. 2, pp. 024027, 2018

  11. arXiv:1404.5351  [pdf, other]

    cs.CV

    Fast Approximate Matching of Cell-Phone Videos for Robust Background Subtraction

    Authors: Raffay Hamid, Atish Das Sarma, Dennis DeCoste, Neel Sundaresan

    Abstract: We identify a novel instance of the background subtraction problem that focuses on extracting near-field foreground objects captured using handheld cameras. Given two user-generated videos of a scene, one with and the other without the foreground object(s), our goal is to efficiently generate an output video with only the foreground object(s) present in it. We cast this challenge as a spatio-tempo…

    Submitted 21 April, 2014; originally announced April 2014.

  12. arXiv:1404.0466  [pdf, other]

    cs.LG math.NA

    piCholesky: Polynomial Interpolation of Multiple Cholesky Factors for Efficient Approximate Cross-Validation

    Authors: Da Kuang, Alex Gittens, Raffay Hamid

    Abstract: The dominant cost in solving least-square problems using Newton's method is often that of factorizing the Hessian matrix over multiple values of the regularization parameter ($λ$). We propose an efficient way to interpolate the Cholesky factors of the Hessian matrix computed over a small set of $λ$ values. This approximation enables us to optimally minimize the hold-out error while incurring only…

    Submitted 10 June, 2015; v1 submitted 2 April, 2014; originally announced April 2014.

  13. arXiv:1312.4626  [pdf, other]

    stat.ML cs.LG

    Compact Random Feature Maps

    Authors: Raffay Hamid, Ying Xiao, Alex Gittens, Dennis DeCoste

    Abstract: Kernel approximation using randomized feature maps has recently gained a lot of interest. In this work, we identify that previous approaches for polynomial kernel approximation create maps that are rank deficient, and therefore do not utilize the capacity of the projected feature space effectively. To address this challenge, we propose compact random feature maps (CRAFTMaps) to approximate polynom…

    Submitted 16 December, 2013; originally announced December 2013.

    Comments: 9 pages

  14. arXiv:1212.3834  [pdf]

    physics.atom-ph physics.optics

    Coherent Population Trapping resonances on lower atomic levels of Doppler broadened optical lines

    Authors: Ersoy Sahin, Gonul Ozen, Ramiz Hamid, Mehmet Celik, Azad Ch. Izmailov

    Abstract: We have detected and analysed narrow high-contrast coherent population trapping (CPT) resonances, which are induced in absorption of the weak probe light beam by the counterpropagating two-frequency pumping radiation. Our experimental investigations have been carried out on example of nonclosed three level Lambda systems formed by spectral components of the Doppler broadened D2 line of cesium atom…

    Submitted 16 December, 2012; originally announced December 2012.

  15. arXiv:1201.5250  [pdf]

    physics.atom-ph quant-ph

    High contrast resonances of the coherent population trapping on sublevels of the ground atomic term

    Authors: Ersoy Sahin, Ramiz Hamid, Cengiz Birlikseven, Gönül Özen, Azad Ch. Izmailov

    Abstract: We have detected and analyzed narrow, high contrast coherent population trapping resonances, which appear in transmission of the probe monochromatic light beam under action of the counterpropagating two-frequency laser radiation, on example of the nonclosed three level Λ-system formed by spectral components of the Doppler broadened D2 line of cesium atoms (in the cell with the rarefied Cs vapor).…

    Submitted 25 January, 2012; originally announced January 2012.

    Comments: 4