Skip to main content

Showing 1–20 of 20 results for author: Sun, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.11313  [pdf, other

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Ya**g Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo , et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  2. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  3. arXiv:2403.08247  [pdf, other

    eess.IV cs.CV

    A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT

    Authors: Hongyang Zhu, Xin Lu, Yanwei Qin, Xinran Yu, Tianjiao Sun, Yunsong Zhao

    Abstract: Ring artifacts in computed tomography images, arising from the undesirable responses of detector units, significantly degrade image quality and diagnostic reliability. To address this challenge, we propose a dual-domain regularization model to effectively remove ring artifacts, while maintaining the integrity of the original CT image. The proposed model corrects the vertical stripe artifacts on th… ▽ More

    Submitted 14 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  4. arXiv:2312.13752  [pdf

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Wei** Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, **yu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis , et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intric… ▽ More

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  5. arXiv:2306.09164  [pdf

    cs.NI eess.SP

    Network Architecture Design toward Convergence of Mobile Applications and Networks

    Authors: Shuangfeng Han, Zhiming Liu, Tao Sun, Xiaoyun Wang

    Abstract: With the quick proliferation of extended reality (XR) services, the mobile communications networks are faced with gigantic challenges to meet the diversified and challenging service requirements. A tight coordination or even convergence of applications and mobile networks is highly motivated. In this paper, a multi-domain (e.g. application layer, transport layer, the core network, radio access net… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 7 pages, 5 figures, IEEE communications magazine, under review

  6. arXiv:2210.06368  [pdf, other

    cs.SD cs.AI eess.AS

    Individualized Conditioning and Negative Distances for Speaker Separation

    Authors: Tao Sun, Nidal Abuhajar, Shuyu Gong, Zhewei Wang, Charles D. Smith, Xianhui Wang, Li Xu, Jundong Liu

    Abstract: Speaker separation aims to extract multiple voices from a mixed signal. In this paper, we propose two speaker-aware designs to improve the existing speaker separation solutions. The first model is a speaker conditioning network that integrates speech samples to generate individualized speaker conditions, which then provide informed guidance for a separation module to produce well-separated outputs… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to ICMLA 2022

  7. arXiv:2204.14057  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

    Authors: Boqing Zhu, Kele Xu, Changjian Wang, Zheng Qin, Tao Sun, Huaimin Wang, Yuxing Peng

    Abstract: We present an approach to learn voice-face representations from the talking face videos, without any identity labels. Previous works employ cross-modal instance discrimination tasks to establish the correlation of voice and face. These methods neglect the semantic content of different videos, introducing false-negative pairs as training noise. Furthermore, the positive pairs are constructed based… ▽ More

    Submitted 26 May, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: 8 pages, 4 figures. Accepted by IJCAI-2022

  8. arXiv:2109.13521  [pdf, other

    cs.LG cs.AI eess.SP

    A multi-stage semi-supervised improved deep embedded clustering method for bearing fault diagnosis under the situation of insufficient labeled samples

    Authors: Tongda Sun, Gang Yu

    Abstract: Although data-driven fault diagnosis methods have been widely applied, massive labeled data are required for model training. However, a difficulty of implementing this in real industries hinders the application of these methods. Hence, an effective diagnostic approach that can work well in such situation is urgently needed.In this study, a multi-stage semi-supervised improved deep embedded cluster… ▽ More

    Submitted 23 November, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: 24 pages, 15 figures and 59 references

  9. arXiv:2107.01762  [pdf

    eess.SY

    Energy Management Strategy for Unmanned Tracked Vehicles Based on Local Speed Planning

    Authors: Tianxing Sun, Shaohang Xu, Zirui Li, Yingqi Tan, Huiyan Chen

    Abstract: The hybrid electric system has good potential for unmanned tracked vehicles due to its excellent power and economy. Due to unmanned tracked vehicles have no traditional driving devices, and the driving cycle is uncertain, it brings new challenges to conventional energy management strategies. This paper proposes a novel energy management strategy for unmanned tracked vehicles based on local speed p… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

  10. arXiv:2106.15410  [pdf

    physics.ins-det eess.IV

    Improvements in Micro-CT Method for Characterizing X-ray Monocapillary Optics

    Authors: Zhao Wang, Kai Pan, Shuang Zhang, Zhuxuan Duo, Zhiguo Liu, Tianxi Sun

    Abstract: Accurate characterization of the inner surface of X-ray monocapillary optics (XMCO) is of great significance in X-ray optics research. Compared with other characterization methods, the micro computed tomography (micro-CT) method has its unique advantages but also has some disadvantages, such as a long scanning time, long image reconstruction time, and inconvenient scanning process. In this paper,… ▽ More

    Submitted 15 September, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

  11. arXiv:2008.10239  [pdf, other

    eess.SY math.OC

    Managing connected and automated vehicles with flexible routing at "lane-allocation-free'' intersections

    Authors: Wan**g Ma, Ruochen Hao, Chunhui Yu, Tuo Sun, Bart van Arem

    Abstract: Trajectory planning and coordination for connected and automated vehicles (CAVs) have been studied at isolated ``signal-free'' intersections and in ``signal-free'' corridors under the fully CAV environment in the literature. Most of the existing studies are based on the definition of approaching and exit lanes. The route a vehicle takes to pass through an intersection is determined from its moveme… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: 31 pages, 5 figures, for simulation video, see https://magic.tongji.edu.cn/en/index.php?catid=41

  12. arXiv:2008.06988  [pdf

    physics.soc-ph eess.SY

    Power and the Pandemic: Exploring Global Changes in Electricity Demand During COVID-19

    Authors: Elizabeth Buechler, Siobhan Powell, Tao Sun, Chad Zanocco, Nicolas Astier, Jose Bolorinos, June Flora, Hilary Boudet, Ram Rajagopal

    Abstract: Understanding how efforts to limit exposure to COVID-19 have altered electricity demand provides insights not only into how dramatic restrictions shape electricity demand but also about future electricity use in a post-COVID-19 world. We develop a unified modeling framework to quantify and compare electricity usage changes in 58 countries and regions around the world from January-May 2020. We find… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

  13. arXiv:2001.00605  [pdf, other

    cs.LG cs.RO eess.SY

    Zero-Shot Reinforcement Learning with Deep Attention Convolutional Neural Networks

    Authors: Sahika Genc, Sunil Mallya, Sravan Bodapati, Tao Sun, Yunzhe Tao

    Abstract: Simulation-to-simulation and simulation-to-real world transfer of neural network models have been a difficult problem. To close the reality gap, prior methods to simulation-to-real world transfer focused on domain adaptation, decoupling perception and dynamics and solving each problem separately, and randomization of agent parameters and environment conditions to expose the learning agent to a var… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

  14. arXiv:1910.05253  [pdf, other

    cs.LG cs.GR eess.IV

    Adversarial Colorization Of Icons Based On Structure And Color Conditions

    Authors: Tsai-Ho Sun, Chien-Hsun Lai, Sai-Keung Wong, Yu-Shuen Wang

    Abstract: We present a system to help designers create icons that are widely used in banners, signboards, billboards, homepages, and mobile apps. Designers are tasked with drawing contours, whereas our system colorizes contours in different styles. This goal is achieved by training a dual conditional generative adversarial network (GAN) on our collected icon dataset. One condition requires the generated ima… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

  15. arXiv:1907.12945  [pdf, other

    math.OC cs.CV cs.MM eess.IV

    Inertial nonconvex alternating minimizations for the image deblurring

    Authors: Tao Sun, Roberto Barrio, Marcos Rodriguez, Hao Jiang

    Abstract: In image processing, Total Variation (TV) regularization models are commonly used to recover blurred images. One of the most efficient and popular methods to solve the convex TV problem is the Alternating Direction Method of Multipliers (ADMM) algorithm, recently extended using the inertial proximal point method. Although all the classical studies focus on only a convex formulation, recent article… ▽ More

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: Transactions on Image Processing

  16. arXiv:1907.11956  [pdf, other

    cs.SD cs.LG eess.AS

    Dilated FCN: Listening Longer to Hear Better

    Authors: Shuyu Gong, Zhewei Wang, Tao Sun, Yuanhang Zhang, Charles D. Smith, Li Xu, Jundong Liu

    Abstract: Deep neural network solutions have emerged as a new and powerful paradigm for speech enhancement (SE). The capabilities to capture long context and extract multi-scale patterns are crucial to design effective SE networks. Such capabilities, however, are often in conflict with the goal of maintaining compact networks to ensure good system generalization. In this paper, we explore dilation operation… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: 5 pages; will appear in WASPAA conference

  17. arXiv:1907.04536  [pdf

    cs.LG cs.SD eess.AS stat.ML

    Multi-layer Attention Mechanism for Speech Keyword Recognition

    Authors: Ruisen Luo, Tianran Sun, Chen Wang, Miao Du, Zuodong Tang, Kai Zhou, Xiaofeng Gong, Xiaomei Yang

    Abstract: As an important part of speech recognition technology, automatic speech keyword recognition has been intensively studied in recent years. Such technology becomes especially pivotal under situations with limited infrastructures and computational resources, such as voice command recognition in vehicles and robot interaction. At present, the mainstream methods in automatic speech keyword recognition… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  18. arXiv:1906.00732  [pdf, other

    eess.SP

    Cloud Storage for Multi-Service Battery Operation (Extended Version)

    Authors: Mohammad Rasouli, Tao Sun, Camille Pache, Patrick Panciatici, Jean Maeght, Ramesh Johari, Ram Rajagopal

    Abstract: We study a cloud storage operator who provides shared storage service for electricity end-users using the residual part of a multi-service grid-scale battery primarily used for high priority grid services. We design an optimal product offering, pricing and customer portfolio. A framework and solution approach for assessing and operating such multi-service battery operations with stochastic service… ▽ More

    Submitted 13 August, 2021; v1 submitted 17 May, 2019; originally announced June 2019.

  19. arXiv:1905.00824  [pdf, other

    cs.GR cs.CV eess.IV

    Single Image Portrait Relighting

    Authors: Tiancheng Sun, Jonathan T. Barron, Yun-Ta Tsai, Zexiang Xu, Xueming Yu, Graham Fyffe, Christoph Rhemann, Jay Busch, Paul Debevec, Ravi Ramamoorthi

    Abstract: Lighting plays a central role in conveying the essence and depth of the subject in a portrait photograph. Professional photographers will carefully control the lighting in their studio to manipulate the appearance of their subject, while consumer photographers are usually constrained to the illumination of their environment. Though prior works have explored techniques for relighting an image, thei… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: SIGGRAPH 2019 Technical Paper accepted

    Journal ref: ACM Transactions on Graphics (SIGGRAPH 2019) 38 (4)

  20. arXiv:1902.04062  [pdf, other

    math.OC cs.CV eess.IV

    Iteratively reweighted penalty alternating minimization methods with continuation for image deblurring

    Authors: Tao Sun, Dongsheng Li, Hao Jiang, Zhe Quan

    Abstract: In this paper, we consider a class of nonconvex problems with linear constraints appearing frequently in the area of image processing. We solve this problem by the penalty method and propose the iteratively reweighted alternating minimization algorithm. To speed up the algorithm, we also apply the continuation strategy to the penalty parameter. A convergence result is proved for the algorithm. Com… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.