Showing 1–28 of 28 results for author: Loy, C C

Searching in archive eess.
  1. arXiv:2405.04867  [pdf, other]

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng, et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  2. arXiv:2403.18811  [pdf, other]

    cs.CV cs.GR cs.SD eess.AS

    Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

    Authors: Li Siyao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

    Abstract: We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm. Unlike existing solo or group dance generation tasks, a duet dance scenario entails a heightened degree of interaction between t…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  3. arXiv:2306.04236  [pdf, other]

    cs.CV eess.IV

    Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

    Abstract: Artificial lights commonly leave strong lens flare artifacts on the images captured at night, degrading both the visual quality and performance of vision algorithms. Existing flare removal approaches mainly focus on removing daytime flares and fail in nighttime cases. Nighttime flare removal is challenging due to the unique luminance and spectrum of artificial lights, as well as the diverse patter…

    Submitted 7 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Extension of arXiv:2210.06570; Project page at https://ykdai.github.io/projects/Flare7K

  4. arXiv:2305.13770  [pdf, other]

    cs.CV eess.IV

    MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2023/

  5. arXiv:2304.10551  [pdf, other]

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Remosaic Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.08471, arXiv:2209.07060, arXiv:2209.07530, arXiv:2304.10089

  6. arXiv:2304.10089  [pdf, other]

    eess.IV cs.CV

    MIPI 2023 Challenge on RGBW Fusion: Methods and Results

    Authors: Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for an in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imag…

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. Website: https://mipi-challenge.org/MIPI2023/. arXiv admin note: substantial text overlap with arXiv:2209.07530, arXiv:2209.08471, arXiv:2209.07060

  7. arXiv:2304.06019  [pdf, other]

    cs.CV eess.IV

    Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera

    Authors: Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Jinwei Gu, Chen Change Loy

    Abstract: Due to the difficulty in collecting large-scale and perfectly aligned paired training data for Under-Display Camera (UDC) image restoration, previous methods resort to monitor-based image systems or simulation-based methods, sacrificing the realness of the data and introducing domain gaps. In this work, we revisit the classic stereo setup for training data collection -- capturing two images of the…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023

  8. arXiv:2303.15046  [pdf, other]

    cs.CV eess.IV

    Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior

    Authors: Yuekun Dai, Yihang Luo, Shangchen Zhou, Chongyi Li, Chen Change Loy

    Abstract: Reflective flare is a phenomenon that occurs when light reflects inside lenses, causing bright spots or a "ghosting effect" in photos, which can impact their quality. Eliminating reflective flare is highly desirable but challenging. Many existing methods rely on manually designed features to detect these bright spots, but they often fail to identify reflective flares created by various types of li…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: CVPR2023 (Highlight)

  9. arXiv:2210.06570  [pdf, other]

    cs.CV eess.IV

    Flare7K: A Phenomenological Nighttime Flare Removal Dataset

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

    Abstract: Artificial lights commonly leave strong lens flare artifacts on images captured at night. Nighttime flare not only affects the visual quality but also degrades the performance of vision algorithms. Existing flare removal methods mainly focus on removing daytime flares and fail in nighttime. Nighttime flare removal is challenging because of the unique luminance and spectrum of artificial lights and…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Camera-ready version for NeurIPS 2022 Track Datasets and Benchmarks

  10. arXiv:2209.08471  [pdf, other]

    cs.CV eess.IV

    MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060, arXiv:2209.07530, arXiv:2209.07057

  11. arXiv:2209.07530  [pdf, other]

    eess.IV cs.CV

    MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 27 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--RGBW Sensor Fusion Challenge Report. MIPI workshop website: http://mipi-challenge.org/. arXiv admin note: substantial text overlap with arXiv:2209.07060

  12. arXiv:2209.07060  [pdf, other]

    eess.IV cs.CV

    MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report

    Authors: Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Quad-Bayer Re-mosaic Challenge Report. MIPI workshop website: http://mipi-challenge.org/

  13. arXiv:2209.07052  [pdf, other]

    eess.IV cs.CV

    MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results

    Authors: Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 23 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Under-display Camera Image Restoration Challenge Report. MIPI workshop website: http://mipi-challenge.org/

  14. arXiv:2203.13055  [pdf, other]

    cs.SD cs.CV eess.AS

    Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory

    Authors: Li Siyao, Weijiang Yu, Tianpei Gu, Chunze Lin, Quan Wang, Chen Qian, Chen Change Loy, Ziwei Liu

    Abstract: Driving 3D characters to dance following a piece of music is highly challenging due to the spatial constraints applied to poses by choreography norms. In addition, the generated dance sequence also needs to maintain temporal coherency with different music genres. To tackle these challenges, we propose a novel music-to-dance framework, Bailando, with two powerful components: 1) a choreographic memo…

    Submitted 24 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022. Code and video link: https://github.com/lisiyao21/Bailando/

  15. arXiv:2202.03373  [pdf, other]

    eess.IV cs.CV

    LEDNet: Joint Low-light Enhancement and Deblurring in the Dark

    Authors: Shangchen Zhou, Chongyi Li, Chen Change Loy

    Abstract: Night photography typically suffers from both low light and blurring issues due to the dim environment and the common use of long exposure. While existing light enhancement and deblurring methods could deal with each problem individually, a cascade of such methods cannot work harmoniously to cope well with joint degradation of visibility and textures. Training an end-to-end network is also infeasi…

    Submitted 30 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: ECCV2022

  16. arXiv:2110.04562  [pdf, other]

    cs.CV eess.IV

    Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning

    Authors: Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong

    Abstract: Video colorization is a challenging and highly ill-posed problem. Although recent years have witnessed remarkable progress in single image colorization, there is relatively less research effort on video colorization and existing methods always suffer from severe flickering artifacts (temporal inconsistency) or unsatisfying colorization performance. We address this problem from a new perspective, b…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 13 pages, 10 figures

  17. arXiv:2109.04760  [pdf, other]

    eess.IV cs.CV

    ReconfigISP: Reconfigurable Camera Image Processing Pipeline

    Authors: Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu

    Abstract: Image Signal Processor (ISP) is a crucial component in digital cameras that transforms sensor signals into images for us to perceive and understand. Existing ISP designs always adopt a fixed architecture, e.g., several sequential modules connected in a rigid order. Such a fixed ISP architecture may be suboptimal for real-world applications, where camera sensors, scenes and tasks are diverse. In th…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: ICCV 2021

  18. arXiv:2106.01863  [pdf, other]

    cs.CV cs.LG eess.IV

    Robust Reference-based Super-Resolution via C2-Matching

    Authors: Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu

    Abstract: Reference-based Super-Resolution (Ref-SR) has recently emerged as a promising paradigm to enhance a low-resolution (LR) input image by introducing an additional high-resolution (HR) reference image. Existing Ref-SR methods mostly rely on implicit correspondence matching to borrow HR textures from reference images to compensate for the information loss in input images. However, performing local tra…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: To appear in CVPR2021. The source code is available at https://github.com/yumingj/C2-Matching

  19. arXiv:2104.11116  [pdf, other]

    cs.CV cs.LG cs.MM cs.SD eess.AS eess.IV

    Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

    Authors: Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu

    Abstract: While accurate lip synchronization has been achieved for arbitrary-subject audio-driven talking face generation, the problem of how to efficiently drive the head pose remains. Previous methods rely on pre-estimated structural information such as landmarks and 3D parameters, aiming to generate personalized rhythmic movements. However, the inaccuracy of such estimated information under extreme condi…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021. Code and models are available at https://github.com/Hangz-nju-cuhk/Talking-Face_PC-AVS

  20. arXiv:2104.10781  [pdf, other]

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay, et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at…

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  21. arXiv:2104.09556  [pdf, other]

    cs.CV eess.IV

    Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network

    Authors: Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu

    Abstract: Recent development of Under-Display Camera (UDC) systems provides a true bezel-less and notch-free viewing experience on smartphones (and TV, laptops, tablets), while allowing images to be captured from the selfie camera embedded underneath. In a typical UDC system, the microstructure of the semi-transparent organic light-emitting diode (OLED) pixel array attenuates and diffracts the incident ligh…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 camera-ready version

  22. arXiv:2012.12821  [pdf, other]

    cs.CV cs.LG eess.IV

    Focal Frequency Loss for Image Reconstruction and Synthesis

    Authors: Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy

    Abstract: Image reconstruction and synthesis have witnessed remarkable progress thanks to the development of generative models. Nonetheless, gaps could still exist between the real and generated images, especially in the frequency domain. In this study, we show that narrowing gaps in the frequency domain can ameliorate image reconstruction and synthesis quality further. We propose a novel focal frequency lo…

    Submitted 23 August, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: ICCV 2021. GitHub: https://github.com/EndlessSora/focal-frequency-loss Project page: https://www.mmlab-ntu.com/project/ffl/index.html

  23. arXiv:2009.13240  [pdf, other]

    cs.CV cs.LG eess.IV

    Texture Memory-Augmented Deep Patch-Based Image Inpainting

    Authors: Rui Xu, Minghao Guo, Jiaqi Wang, Xiaoxiao Li, Bolei Zhou, Chen Change Loy

    Abstract: Patch-based methods and deep networks have been employed to tackle image inpainting problem, with their own strengths and weaknesses. Patch-based methods are capable of restoring a missing region with high-quality texture through searching nearest neighbor patches from the unmasked regions. However, these methods bring problematic contents when recovering large missing regions. Deep networks, on t…

    Submitted 4 November, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Published on TIP. Project Page: https://nbei.github.io/tmad.html

  24. arXiv:2007.12072  [pdf, other]

    cs.CV cs.LG eess.IV

    TSIT: A Simple and Versatile Framework for Image-to-Image Translation

    Authors: Liming Jiang, Changxu Zhang, Mingyang Huang, Chunxiao Liu, Jianping Shi, Chen Change Loy

    Abstract: We introduce a simple and versatile framework for image-to-image translation. We unearth the importance of normalization layers, and provide a carefully designed two-stream generative model with newly proposed feature transformations in a coarse-to-fine fashion. This allows multi-scale semantic structure information and style representation to be effectively captured and fused by the network, perm…

    Submitted 25 July, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: ECCV 2020 (Spotlight). Table 2 is updated. GitHub: https://github.com/EndlessSora/TSIT

  25. arXiv:2003.13659  [pdf, other]

    eess.IV cs.CV

    Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation

    Authors: Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo

    Abstract: Learning a good image prior is a long-term goal for image restoration and manipulation. While existing methods like deep image prior (DIP) capture low-level image statistics, there are still gaps toward an image prior that captures rich image semantics including color, spatial coherence, textures, and high-level concepts. This work presents an effective way to exploit the image prior captured by a…

    Submitted 20 July, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted to ECCV2020 as oral. 1) Precise GAN-inversion by discriminator-guided generator finetuning. 2) A versatile way for high-quality image restoration and manipulation. Code: https://github.com/XingangPan/deep-generative-prior

  26. arXiv:2002.05512  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Real or Not Real, that is the Question

    Authors: Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, Dahua Lin

    Abstract: While generative adversarial networks (GAN) have been widely adopted in various topics, in this paper we generalize the standard GAN to a new perspective by treating realness as a random variable that can be estimated from multiple angles. In this generalized framework, referred to as RealnessGAN, the discriminator outputs a distribution as the measure of realness. While RealnessGAN shares similar…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: ICLR2020 spotlight. 1) train GAN by maximizing kl-divergence. 2) train non-progressive GAN (DCGAN) architecture at 1024*1024 resolution

  27. arXiv:1908.03251  [pdf, other]

    cs.CV eess.IV

    One-shot Face Reenactment

    Authors: Yunxuan Zhang, Siwei Zhang, Yue He, Cheng Li, Chen Change Loy, Ziwei Liu

    Abstract: To enable realistic shape (e.g. pose and expression) transfer, existing face reenactment methods rely on a set of target faces for learning subject-specific traits. However, in real-world scenario end-users often only have one target face at hand, rendering existing methods inapplicable. In this work, we bridge this gap by proposing a novel one-shot face reenactment learning framework. Our key ins…

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: To appear in BMVC 2019 as a spotlight presentation. Code and models are available at: https://github.com/bj80heyue/One_Shot_Face_Reenactment

  28. arXiv:1906.07155  [pdf, other]

    cs.CV cs.LG eess.IV

    MMDetection: Open MMLab Detection Toolbox and Benchmark

    Authors: Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

    Abstract: We present MMDetection, an object detection toolbox that contains a rich set of object detection and instance segmentation methods as well as related components and modules. The toolbox started from a codebase of MMDet team who won the detection track of COCO Challenge 2018. It gradually evolves into a unified platform that covers many popular detection methods and contemporary modules. It not onl…

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Technical report of MMDetection. 11 pages