Skip to main content

Showing 1–22 of 22 results for author: Rameau, F

.
  1. arXiv:2406.18898  [pdf, other

    cs.CV cs.AI

    360 in the Wild: Dataset for Depth Prediction and View Synthesis

    Authors: Kibaek Park, Francois Rameau, Jaesik Park, In So Kweon

    Abstract: The large abundance of perspective camera datasets facilitated the emergence of novel learning-based strategies for various tasks, such as camera localization, single image depth estimation, or view synthesis. However, panoramic or omnidirectional image datasets, including essential information, such as pose and depth, are mostly made with synthetic scenes. In this work, we introduce a large scale… ▽ More

    Submitted 4 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2312.15242  [pdf, other

    cs.CV

    CaLDiff: Camera Localization in NeRF via Pose Diffusion

    Authors: Rashik Shrestha, Bishad Koju, Abhigyan Bhusal, Danda Pani Paudel, François Rameau

    Abstract: With the widespread use of NeRF-based implicit 3D representation, the need for camera localization in the same representation becomes manifestly apparent. Doing so not only simplifies the localization process -- by avoiding an outside-the-NeRF-based localization -- but also has the potential to offer the benefit of enhanced localization. This paper studies the problem of localizing cameras in NeRF… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  3. arXiv:2306.06211  [pdf, other

    cs.CV

    A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering

    Authors: Chaoning Zhang, Fachrina Dewi Puspitasari, Sheng Zheng, Chenghao Li, Yu Qiao, Taegoo Kang, Xinru Shan, Chenshuang Zhang, Caiyan Qin, Francois Rameau, Lik-Hang Lee, Sung-Ho Bae, Choong Seon Hong

    Abstract: Segment anything model (SAM) developed by Meta AI Research has recently attracted significant attention. Trained on a large segmentation dataset of over 1 billion masks, SAM is capable of segmenting any object on a certain image. In the original SAM work, the authors turned to zero-short transfer tasks (like edge detection) for evaluating the performance of SAM. Recently, numerous works have attem… ▽ More

    Submitted 3 July, 2023; v1 submitted 12 May, 2023; originally announced June 2023.

    Comments: First survey on Segment Anything Model (SAM), work under progress

  4. arXiv:2305.10947  [pdf, other

    cs.LG cs.AI cs.CV cs.PF

    Comparative Study: Standalone IEEE 16-bit Floating-Point for Image Classification

    Authors: Juyoung Yun, Byungkon Kang, Francois Rameau, Zhoulai Fu

    Abstract: Reducing the number of bits needed to encode the weights and activations of neural networks is highly desirable as it speeds up their training and inference time while reducing memory consumption. It is unsurprising that considerable attention has been drawn to develo** neural networks that employ lower-precision computation. This includes IEEE 16-bit, Google bfloat16, 8-bit, 4-bit floating-poin… ▽ More

    Submitted 25 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  5. arXiv:2305.06131  [pdf, other

    cs.CV

    Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era

    Authors: Chenghao Li, Chaoning Zhang, Atish Waghwase, Lik-Hang Lee, Francois Rameau, Yang Yang, Sung-Ho Bae, Choong Seon Hong

    Abstract: Generative AI (AIGC, a.k.a. AI generated content) has made significant progress in recent years, with text-guided content generation being the most practical as it facilitates interaction between human instructions and AIGC. Due to advancements in text-to-image and 3D modeling technologies (like NeRF), text-to-3D has emerged as a nascent yet highly active research field. Our work conducts the firs… ▽ More

    Submitted 10 June, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

  6. arXiv:2301.04470  [pdf, other

    cs.CV

    InstaGraM: Instance-level Graph Modeling for Vectorized HD Map Learning

    Authors: Juyeb Shin, Francois Rameau, Hyeonjun Jeong, Dongsuk Kum

    Abstract: Inferring traffic object such as lane information is of foremost importance for deployment of autonomous driving. Previous approaches focus on offline construction of HD map inferred with GPS localization, which is insufficient for globally scalable autonomous driving. To alleviate these issues, we propose online HD map learning framework that detects HD map elements from onboard sensor observatio… ▽ More

    Submitted 22 June, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Workshop on Vision-Centric Autonomous Driving (VCAD) at Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  7. arXiv:2208.12300  [pdf, other

    cs.CV

    A Deep Perceptual Measure for Lens and Camera Calibration

    Authors: Yannick Hold-Geoffroy, Dominique Piché-Meunier, Kalyan Sunkavalli, Jean-Charles Bazin, François Rameau, Jean-François Lalonde

    Abstract: Image editing and compositing have become ubiquitous in entertainment, from digital art to AR and VR experiences. To produce beautiful composites, the camera needs to be geometrically calibrated, which can be tedious and requires a physical calibration target. In place of the traditional multi-image calibration process, we propose to infer the camera calibration parameters such as pitch, roll, fie… ▽ More

    Submitted 26 July, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: 12 pages, 12 figures, project page (including live demo) available at https://lvsn.github.io/deepcalib. arXiv admin note: text overlap with arXiv:1712.01259

  8. arXiv:2206.00181  [pdf, other

    cs.CV

    Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active Selection

    Authors: Fei Pan, Francois Rameau, Junsik Kim, In So Kweon

    Abstract: Training models dedicated to semantic segmentation requires a large amount of pixel-wise annotated data. Due to their costly nature, these annotations might not be available for the task at hand. To alleviate this problem, unsupervised domain adaptation approaches aim at aligning the feature distributions between the labeled source and the unlabeled target data. While these strategies lead to noti… ▽ More

    Submitted 4 June, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

  9. arXiv:2203.12848  [pdf, other

    cs.CV

    Keypoints Tracking via Transformer Networks

    Authors: Oleksii Nasypanyi, Francois Rameau

    Abstract: In this thesis, we propose a pioneering work on sparse keypoints tracking across images using transformer networks. While deep learning-based keypoints matching have been widely investigated using graph neural networks - and more recently transformer networks, they remain relatively too slow to operate in real-time and are particularly sensitive to the poor repeatability of the keypoints detectors… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  10. arXiv:2111.11704  [pdf, other

    cs.CV

    Deep Point Cloud Reconstruction

    Authors: Jaesung Choe, Byeongin Joung, Francois Rameau, Jaesik Park, In So Kweon

    Abstract: Point cloud obtained from 3D scanning is often sparse, noisy, and irregular. To cope with these issues, recent studies have been separately conducted to densify, denoise, and complete inaccurate point cloud. In this paper, we advocate that jointly solving these tasks leads to significant improvement for point cloud reconstruction. To this end, we propose a deep point cloud reconstruction network c… ▽ More

    Submitted 15 March, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: ICLR 2022 accepted

  11. arXiv:2111.11187  [pdf, other

    cs.CV

    PointMixer: MLP-Mixer for Point Cloud Understanding

    Authors: Jaesung Choe, Chunghyun Park, Francois Rameau, Jaesik Park, In So Kweon

    Abstract: MLP-Mixer has newly appeared as a new challenger against the realm of CNNs and transformer. Despite its simplicity compared to transformer, the concept of channel-mixing MLPs and token-mixing MLPs achieves noticeable performance in visual recognition tasks. Unlike images, point clouds are inherently sparse, unordered and irregular, which limits the direct use of MLP-Mixer for point cloud understan… ▽ More

    Submitted 20 July, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted to ECCV 2022

  12. arXiv:2110.06853  [pdf, other

    cs.CV cs.LG cs.RO

    Attentive and Contrastive Learning for Joint Depth and Motion Field Estimation

    Authors: Seokju Lee, Francois Rameau, Fei Pan, In So Kweon

    Abstract: Estimating the motion of the camera together with the 3D structure of the scene from a monocular vision system is a complex task that often relies on the so-called scene rigidity assumption. When observing a dynamic environment, this assumption is violated which leads to an ambiguity between the ego-motion of the camera and the motion of the objects. To solve this problem, we present a self-superv… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: ICCV 2021

  13. arXiv:2108.08623  [pdf, other

    cs.CV

    VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction

    Authors: Jaesung Choe, Sunghoon Im, Francois Rameau, Minjun Kang, In So Kweon

    Abstract: To reconstruct a 3D scene from a set of calibrated views, traditional multi-view stereo techniques rely on two distinct stages: local depth maps computation and global depth maps fusion. Recent studies concentrate on deep neural architectures for depth estimation by using conventional depth fusion method or direct 3D reconstruction network by regressing Truncated Signed Distance Function (TSDF). I… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 Accepted

  14. arXiv:2104.09134  [pdf, other

    cs.CV

    Restoration of Video Frames from a Single Blurred Image with Motion Understanding

    Authors: Dawit Mureja Argaw, Junsik Kim, Francois Rameau, Chaoning Zhang, In So Kweon

    Abstract: We propose a novel framework to generate clean video frames from a single motion-blurred image. While a broad range of literature focuses on recovering a single image from a blurred image, in this work, we tackle a more challenging task i.e. video restoration from a blurred image. We formulate video restoration from a single blurred image as an inverse problem by setting clean image sequence and t… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPRW, NTIRE 2021

  15. arXiv:2103.12498  [pdf, other

    cs.CV

    Stereo Object Matching Network

    Authors: Jaesung Choe, Kyungdon Joo, Francois Rameau, In So Kweon

    Abstract: This paper presents a stereo object matching method that exploits both 2D contextual information from images as well as 3D object-level information. Unlike existing stereo matching methods that exclusively focus on the pixel-level correspondence between stereo images within a volumetric space (i.e., cost volume), we exploit this volumetric structure in a different manner. The cost volume explicitl… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted at ICRA 2021

  16. arXiv:2103.02996  [pdf, other

    cs.CV

    Optical Flow Estimation from a Single Motion-blurred Image

    Authors: Dawit Mureja Argaw, Junsik Kim, Francois Rameau, Jae Won Cho, In So Kweon

    Abstract: In most of computer vision applications, motion blur is regarded as an undesirable artifact. However, it has been shown that motion blur in an image may have practical interests in fundamental computer vision problems. In this work, we propose a novel framework to estimate optical flow from a single motion-blurred image in an end-to-end manner. We design our network with transformer networks to le… ▽ More

    Submitted 10 March, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted to AAAI 2021

  17. arXiv:2103.02984  [pdf, other

    cs.CV

    Motion-blurred Video Interpolation and Extrapolation

    Authors: Dawit Mureja Argaw, Junsik Kim, Francois Rameau, In So Kweon

    Abstract: Abrupt motion of camera or objects in a scene result in a blurry video, and therefore recovering high quality video requires two types of enhancements: visual enhancement and temporal upsampling. A broad range of research attempted to recover clean frames from blurred image sequences or temporally upsample frames by interpolation, yet there are very limited studies handling both problems jointly.… ▽ More

    Submitted 10 March, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted to AAAI 2021

  18. arXiv:2010.12496  [pdf, other

    cs.CV

    ResNet or DenseNet? Introducing Dense Shortcuts to ResNet

    Authors: Chaoning Zhang, Philipp Benz, Dawit Mureja Argaw, Seokju Lee, Junsik Kim, Francois Rameau, Jean-Charles Bazin, In So Kweon

    Abstract: ResNet or DenseNet? Nowadays, most deep learning based approaches are implemented with seminal backbone networks, among them the two arguably most famous ones are ResNet and DenseNet. Despite their competitive performance and overwhelming popularity, inherent drawbacks exist for both of them. For ResNet, the identity shortcut that stabilizes training also limits its representation capacity, while… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to WACV2021 first round

  19. arXiv:2004.07703  [pdf, other

    cs.CV cs.LG cs.RO

    Unsupervised Intra-domain Adaptation for Semantic Segmentation through Self-Supervision

    Authors: Fei Pan, Inkyu Shin, Francois Rameau, Seokju Lee, In So Kweon

    Abstract: Convolutional neural network-based approaches have achieved remarkable progress in semantic segmentation. However, these approaches heavily rely on annotated data which are labor intensive. To cope with this limitation, automatically annotated data generated from graphic engines are used to train segmentation models. However, the models trained from synthetic data are difficult to transfer to real… ▽ More

    Submitted 15 July, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020 as an Oral Presentation. Code is available at https://github.com/feipan664/IntraDA

  20. arXiv:1907.12646  [pdf, other

    cs.CV cs.RO

    Camera Exposure Control for Robust Robot Vision with Noise-Aware Image Quality Assessment

    Authors: Ukcheol Shin, **sun Park, Gyumin Shim, Francois Rameau, In So Kweon

    Abstract: In this paper, we propose a noise-aware exposure control algorithm for robust robot vision. Our method aims to capture the best-exposed image which can boost the performance of various computer vision and robotics tasks. For this purpose, we carefully design an image quality metric which captures complementary quality attributes and ensures light-weight computation. Specifically, our metric consis… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 8 pages,6 figures, accepted in IROS2019

  21. arXiv:1710.07434  [pdf, other

    cs.CV

    Light-weight place recognition and loop detection using road markings

    Authors: Oleksandr Bailo, Francois Rameau, In So Kweon

    Abstract: In this paper, we propose an efficient algorithm for robust place recognition and loop detection using camera information only. Our pipeline purely relies on spatial localization and semantic information of road markings. The creation of the database of road markings sequences is performed online, which makes the method applicable for real-time loop closure for visual SLAM techniques. Furthermore,… ▽ More

    Submitted 20 October, 2017; originally announced October 2017.

  22. arXiv:1708.05137  [pdf, other

    cs.CV

    Pixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks

    Authors: Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, Seunghak Shin, In So Kweon

    Abstract: We propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spati… ▽ More

    Submitted 17 August, 2017; originally announced August 2017.

    Comments: To appear on ICCV 2017