Showing 1–14 of 14 results for author: Pham, T X

  1. arXiv:2402.01516  [pdf, other]

    cs.CV

    Cross-view Masked Diffusion Transformers for Person Image Synthesis

    Authors: Trung X. Pham, Zhang Kang, Chang D. Yoo

    Abstract: We present X-MDPT ($\underline{Cross}$-view $\underline{M}$asked $\underline{D}$iffusion $\underline{P}$rediction $\underline{T}$ransformers), a novel diffusion model designed for pose-guided human image generation. X-MDPT distinguishes itself by employing masked diffusion transformers that operate on latent patches, a departure from the commonly-used Unet structures in existing works. The model c…

    Submitted 3 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  2. arXiv:2311.18508  [pdf, other]

    eess.IV cs.CV

    DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

    Authors: Axi Niu, Kang Zhang, Joshua Tian Jin Tee, Trung X. Pham, Jinqiu Sun, Chang D. Yoo, In So Kweon, Yanning Zhang

    Abstract: It is well known the adversarial optimization of GAN-based image super-resolution (SR) methods makes the preceding SR model generate unpleasant and undesirable artifacts, leading to large distortion. We attribute the cause of such distortions to the poor calibration of the discriminator, which hampers its ability to provide meaningful feedback to the generator for learning high-quality images. To…

    Submitted 30 November, 2023; originally announced November 2023.

  3. arXiv:2305.18547  [pdf, other]

    cs.CV

    Learning from Multi-Perception Features for Real-Word Image Super-resolution

    Authors: Axi Niu, Kang Zhang, Trung X. Pham, Pei Wang, Jinqiu Sun, In So Kweon, Yanning Zhang

    Abstract: Currently, there are two popular approaches for addressing real-world image super-resolution problems: degradation-estimation-based and blind-based methods. However, degradation-estimation-based methods may be inaccurate in estimating the degradation, making them less applicable to real-world LR images. On the other hand, blind-based methods are often limited by their fixed single perception infor…

    Submitted 26 May, 2023; originally announced May 2023.

  4. arXiv:2302.12831  [pdf, other]

    eess.IV cs.CV

    CDPMSR: Conditional Diffusion Probabilistic Models for Single Image Super-Resolution

    Authors: Axi Niu, Kang Zhang, Trung X. Pham, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang

    Abstract: Diffusion probabilistic models (DPM) have been widely adopted in image-to-image translation to generate high-quality images. Prior attempts at applying the DPM to image super-resolution (SR) have shown that iteratively refining a pure Gaussian noise with a conditional image using a U-Net trained on denoising at various-level noises can help obtain a satisfied high-resolution image for the low-reso…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 4 pages, 4 figures

  5. arXiv:2211.09861  [pdf, other]

    cs.CV

    Self-Supervised Visual Representation Learning via Residual Momentum

    Authors: Trung X. Pham, Axi Niu, Zhang Kang, Sultan Rizky Madjid, Ji Woo Hong, Daehyeok Kim, Joshua Tian Jin Tee, Chang D. Yoo

    Abstract: Self-supervised learning (SSL) approaches have shown promising capabilities in learning the representation from unlabeled data. Amongst them, momentum-based frameworks have attracted significant attention. Despite being a great success, these momentum-based SSL frameworks suffer from a large gap in representation between the online encoder (student) and the momentum encoder (teacher), which hinder…

    Submitted 21 November, 2022; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 18 pages, 16 figures

  6. arXiv:2210.08282  [pdf, other]

    cs.CV

    LAD: A Hybrid Deep Learning System for Benign Paroxysmal Positional Vertigo Disorders Diagnostic

    Authors: Trung Xuan Pham, Jin Woong Choi, Rusty John Lloyd Mina, Thanh Nguyen, Sultan Rizky Madjid, Chang Dong Yoo

    Abstract: Herein, we introduce "Look and Diagnose" (LAD), a hybrid deep learning-based system that aims to support doctors in the medical field in diagnosing effectively the Benign Paroxysmal Positional Vertigo (BPPV) disorder. Given the body postures of the patient in the Dix-Hallpike and lateral head turns test, the visual information of both eyes is captured and fed into LAD for analyzing and classifying…

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted to IEEE Access 2022, 13 pages, 14 figures

  7. arXiv:2206.02193  [pdf, other]

    gr-qc math-ph math.AP math.DG

    Peeling for tensorial wave equations on Schwarzschild spacetime

    Authors: Truong Xuan Pham

    Abstract: In this paper, we establish the asymptotic behaviour along outgoing and incoming radial geodesics, i.e., the peeling property for the tensorial Fackrell-Ipser and spin $\pm 1$ Teukolsky equations on Schwarzschild spacetime. Our method combines a conformal compactification with vector field techniques to prove the two-side estimates of the energies of tensorial fields through the future and past nu…

    Submitted 25 September, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: 22 pages, Reviews in Mathematical Physics, 2023. arXiv admin note: text overlap with arXiv:2006.02888

    Journal ref: Reviews in Mathematical Physics, 2023

  8. arXiv:2203.17248  [pdf, other]

    cs.LG cs.AI

    Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo

    Authors: Chaoning Zhang, Kang Zhang, Trung X. Pham, Axi Niu, Zhinan Qiao, Chang D. Yoo, In So Kweon

    Abstract: Contrastive learning (CL) is widely known to require many negative samples, 65536 in MoCo for instance, for which the performance of a dictionary-free framework is often inferior because the negative sample size (NSS) is limited by its mini-batch size (MBS). To decouple the NSS from the MBS, a dynamic dictionary has been adopted in a large volume of CL frameworks, among which arguably the most pop…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR2022

  9. arXiv:2203.16262  [pdf, other]

    cs.LG cs.AI

    How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning

    Authors: Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Trung X. Pham, Chang D. Yoo, In So Kweon

    Abstract: To avoid collapse in self-supervised learning (SSL), a contrastive loss is widely used but often requires a large number of negative samples. Without negative samples yet achieving competitive performance, a recent work has attracted significant attention for providing a minimalist simple Siamese (SimSiam) method to avoid collapse. However, the reason for how it avoids collapse without negative sa…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: accepted on ICLR 2022

  10. arXiv:2108.00475  [pdf, other]

    cs.CV eess.IV

    Self-supervised Learning with Local Attention-Aware Feature

    Authors: Trung X. Pham, Rusty John Lloyd Mina, Dias Issa, Chang D. Yoo

    Abstract: In this work, we propose a novel methodology for self-supervised learning for generating global and local attention-aware visual features. Our approach is based on training a model to differentiate between specific image transformations of an input sample and the patched images. Utilizing this approach, the proposed method is able to outperform the previous best competitor by 1.03% on the Tiny-Ima…

    Submitted 1 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4 figures

  11. arXiv:2107.02969  [pdf, ps, other]

    gr-qc math-ph math.AP math.DG

    Peeling of Dirac fields on Kerr spacetimes

    Authors: Truong Xuan Pham

    Abstract: In a recent paper with J.-P. Nicolas [J.-P. Nicolas and P.T. Xuan, Annales Henri Poincare 2019], we studied the peeling for scalar fields on Kerr metrics. The present work extends these results to Dirac fields on the same geometrical background. We follow the approach initiated by L.J. Mason and J.-P. Nicolas [L. Mason and J.-P. Nicolas, J.Inst.Math.Jussieu 2009; L. Mason and J.-P. Nicolas, J.Geom…

    Submitted 25 September, 2023; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 29 pages, Journal of mathematical physics, 2020

  12. arXiv:1909.06720  [pdf, other]

    cs.CV

    Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution

    Authors: Thang Vu, Hyunjun Jang, Trung X. Pham, Chang D. Yoo

    Abstract: This paper considers an architecture referred to as Cascade Region Proposal Network (Cascade RPN) for improving the region-proposal quality and detection performance by \textit{systematically} addressing the limitation of the conventional RPN that \textit{heuristically defines} the anchors and \textit{aligns} the features to the anchors. First, instead of using multiple anchors with predefined sca…

    Submitted 4 December, 2019; v1 submitted 14 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2019 (spotlight)

  13. arXiv:1810.01641  [pdf, other]

    cs.CV

    PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report

    Authors: Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc Van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, et al. (23 additional authors not shown)

    Abstract: This paper reviews the first challenge on efficient perceptual image enhancement with the focus on deploying deep learning models on smartphones. The challenge consisted of two tracks. In the first one, participants were solving the classical image super-resolution problem with a bicubic downscaling factor of 4. The second track was aimed at real-world photo enhancement, and the goal was to map lo…

    Submitted 3 October, 2018; originally announced October 2018.

  14. arXiv:1801.08996  [pdf, ps, other]

    gr-qc math-ph math.AP

    Peeling on Kerr spacetime: linear and non linear scalar fields

    Authors: Jean-Philippe Nicolas, Truong Xuan Pham

    Abstract: We study the peeling on Kerr spacetime for fields satisfying conformally invariant linear and nonlinear scalar wave equations. We follow an approach initiated by L.J. Mason and the first author for the Schwarzschild metric, based on a Penrose compactification and energy estimates. This approach provides a definition of the peeling at all orders in terms of Sobolev regularity near ${\mathscr I}$ in…

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: 51 pages

    MSC Class: 35L05; 35Q75; 83C57