Skip to main content

Showing 1–50 of 50 results for author: Mei, K

.
  1. arXiv:2406.08266  [pdf, other

    eess.AS cs.SD

    Refining Self-Supervised Learnt Speech Representation using Brain Activations

    Authors: Hengyu Li, Kangdi Mei, Zhaoci Liu, Yang Ai, Li** Chen, Jie Zhang, Zhenhua Ling

    Abstract: It was shown in literature that speech representations extracted by self-supervised pre-trained models exhibit similarities with brain activations of human for speech perception and fine-tuning speech representation models on downstream tasks can further improve the similarity. However, it still remains unclear if this similarity can be used to optimize the pre-trained speech models. In this work,… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: accpeted by Interspeech2024

  2. arXiv:2405.06907  [pdf, other

    cs.CL cs.AI cs.LG cs.PL

    AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents

    Authors: Shuyuan Xu, Zelong Li, Kai Mei, Yongfeng Zhang

    Abstract: Since their inception, programming languages have trended towards greater readability and lower barriers for programmers. Following this trend, natural language can be a promising type of programming language that provides great flexibility and usability and helps towards the democracy of programming. However, the inherent vagueness, ambiguity, and verbosity of natural language pose significant ch… ▽ More

    Submitted 21 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: 12 pages, 6 figures, comments and suggestions are welcome

  3. arXiv:2404.07066  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

    Authors: Mingyu **, Qinkai Yu, **gyuan Huang, Qingcheng Zeng, Zhenting Wang, Wenyue Hua, Haiyan Zhao, Kai Mei, Yanda Meng, Kaize Ding, Fan Yang, Mengnan Du, Yongfeng Zhang

    Abstract: Large language models (LLMs) have shown remarkable performances across a wide range of tasks. However, the mechanisms by which these models encode tasks of varying complexities remain poorly understood. In this paper, we explore the hypothesis that LLMs process concepts of varying complexities in different layers, introducing the idea of "Concept Depth" to suggest that more complex concepts are ty… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 12 pages

  4. arXiv:2404.01367  [pdf, other

    cs.CV cs.LG

    Bigger is not Always Better: Scaling Properties of Latent Diffusion Models

    Authors: Kangfu Mei, Zhengzhong Tu, Mauricio Delbracio, Hossein Talebi, Vishal M. Patel, Peyman Milanfar

    Abstract: We study the scaling properties of latent diffusion models (LDMs) with an emphasis on their sampling efficiency. While improved network architecture and inference algorithms have shown to effectively boost sampling efficiency of diffusion models, the role of model size -- a critical determinant of sampling efficiency -- has not been thoroughly examined. Through empirical analysis of established te… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  5. arXiv:2403.16971  [pdf, other

    cs.OS cs.AI cs.CL

    AIOS: LLM Agent Operating System

    Authors: Kai Mei, Zelong Li, Shuyuan Xu, Ruosong Ye, Yingqiang Ge, Yongfeng Zhang

    Abstract: The integration and deployment of large language model (LLM)-based intelligent agents have been fraught with challenges that compromise their efficiency and efficacy. Among these issues are sub-optimal scheduling and resource allocation of agent requests over the LLM, the difficulties in maintaining context during interactions between agent and LLM, and the complexities inherent in integrating het… ▽ More

    Submitted 25 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 14 pages, 5 figures, 5 tables; comments and suggestions are appreciated

  6. arXiv:2402.13184  [pdf, other

    cs.CL

    What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents

    Authors: Mingyu **, Beichen Wang, Zhaoqian Xue, Suiyuan Zhu, Wenyue Hua, Hua Tang, Kai Mei, Mengnan Du, Yongfeng Zhang

    Abstract: In this study, we introduce "CosmoAgent," an innovative artificial intelligence framework utilizing Large Language Models (LLMs) to simulate complex interactions between human and extraterrestrial civilizations, with a special emphasis on Stephen Hawking's cautionary advice about not sending radio signals haphazardly into the universe. The goal is to assess the feasibility of peaceful coexistence… ▽ More

    Submitted 20 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  7. Careless Whisper: Speech-to-Text Hallucination Harms

    Authors: Allison Koenecke, Anna Seo Gyeong Choi, Katelyn X. Mei, Hilke Schellmann, Mona Sloane

    Abstract: Speech-to-text services aim to transcribe input audio as accurately as possible. They increasingly play a role in everyday life, for example in personal voice assistants or in customer-company interactions. We evaluate Open AI's Whisper, a state-of-the-art automated speech recognition service outperforming industry competitors, as of 2023. While many of Whisper's transcriptions were highly accurat… ▽ More

    Submitted 2 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  8. arXiv:2312.02156  [pdf, other

    cs.CV cs.AI

    Latent Feature-Guided Diffusion Models for Shadow Removal

    Authors: Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel

    Abstract: Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to gradually refine the details of shadow regions during the diffusion process. Our method improves this process by conditioning on a learned latent feature space that… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: project page see https://kfmei.page/shadow-diffusion/index.html

  9. arXiv:2311.17227  [pdf, other

    cs.AI cs.CL cs.CY

    War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars

    Authors: Wenyue Hua, Lizhou Fan, Lingyao Li, Kai Mei, Jianchao Ji, Yingqiang Ge, Libby Hemphill, Yongfeng Zhang

    Abstract: Can we avoid wars at the crossroads of history? This question has been pursued by individuals, scholars, policymakers, and organizations throughout human history. In this research, we attempt to answer the question based on the recent advances of Artificial Intelligence (AI) and Large Language Models (LLMs). We propose \textbf{WarAgent}, an LLM-powered multi-agent AI system, to simulate the partic… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 47 pages, 9 figures, 5 tables

  10. arXiv:2310.17488  [pdf, other

    cs.IR cs.CL

    LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation

    Authors: Kai Mei, Yongfeng Zhang

    Abstract: This paper presents LightLM, a lightweight Transformer-based language model for generative recommendation. While Transformer-based generative modeling has gained importance in various AI sub-fields such as NLP and vision, generative recommendation is still in its infancy due to its unique demand on personalized generative modeling. Existing works on generative recommendation often use NLP-oriented… ▽ More

    Submitted 29 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

  11. arXiv:2310.01407  [pdf, other

    cs.CV cs.AI cs.LG

    CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

    Authors: Kangfu Mei, Mauricio Delbracio, Hossein Talebi, Zhengzhong Tu, Vishal M. Patel, Peyman Milanfar

    Abstract: Large generative diffusion models have revolutionized text-to-image generation and offer immense potential for conditional generation tasks such as image enhancement, restoration, editing, and compositing. However, their widespread adoption is hindered by the high computational cost, which limits their real-time application. To address this challenge, we introduce a novel method dubbed CoDi, that… ▽ More

    Submitted 17 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  12. Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks

    Authors: Katelyn X. Mei, Sonia Fereidooni, Aylin Caliskan

    Abstract: The rapid deployment of artificial intelligence (AI) models demands a thorough investigation of biases and risks inherent in these models to understand their impact on individuals and society. This study extends the focus of bias evaluation in extant work by examining bias against social stigmas on a large scale. It focuses on 93 stigmatized groups in the United States, including a wide range of c… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 20 pages,12 figures,2 tables; ACM FAccT 2023

    ACM Class: K.4; I.2.7; I.2.0

  13. arXiv:2305.17826  [pdf, other

    cs.CL cs.CR

    NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models

    Authors: Kai Mei, Zheng Li, Zhenting Wang, Yang Zhang, Shiqing Ma

    Abstract: Prompt-based learning is vulnerable to backdoor attacks. Existing backdoor attacks against prompt-based models consider injecting backdoors into the entire embedding layers or word embedding vectors. Such attacks can be easily affected by retraining on downstream tasks and with different prompting strategies, limiting the transferability of backdoor attacks. In this work, we propose transferable b… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  14. arXiv:2305.14674  [pdf, other

    cs.CV

    T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities

    Authors: Kangfu Mei, Mo Zhou, Vishal M. Patel

    Abstract: Diffusion Probabilistic Field (DPF) models the distribution of continuous functions defined over metric spaces. While DPF shows great potential for unifying data generation of various modalities including images, videos, and 3D geometry, it does not scale to a higher data resolution. This can be attributed to the ``scaling property'', where it is difficult for the model to capture local structures… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: for project page, see https://t1-diffusion-model.github.io

  15. arXiv:2304.05959  [pdf, other

    cs.RO cs.AI

    UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment

    Authors: Xuyang Li, Jianwu Fang, Kai Du, Kuizhi Mei, Jianru Xue

    Abstract: This paper focuses on the continuous control of the unmanned aerial vehicle (UAV) based on a deep reinforcement learning method for a large-scale 3D complex environment. The purpose is to make the UAV reach any target point from a certain starting point, and the flying height and speed are variable during navigation. In this work, we propose a deep reinforcement learning (DRL)-based method combine… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: accepted in CCC2023

  16. arXiv:2304.04370  [pdf, other

    cs.AI cs.CL cs.LG

    OpenAGI: When LLM Meets Domain Experts

    Authors: Yingqiang Ge, Wenyue Hua, Kai Mei, Jianchao Ji, Juntao Tan, Shuyuan Xu, Zelong Li, Yongfeng Zhang

    Abstract: Human Intelligence (HI) excels at combining basic skills to solve complex tasks. This capability is vital for Artificial Intelligence (AI) and should be embedded in comprehensive AI Agents, enabling them to harness expert models for complex task-solving towards Artificial General Intelligence (AGI). Large Language Models (LLMs) show promising learning and reasoning abilities, and can effectively u… ▽ More

    Submitted 3 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: In NeurIPS 2023

  17. arXiv:2304.02786  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    UNICORN: A Unified Backdoor Trigger Inversion Framework

    Authors: Zhenting Wang, Kai Mei, Juan Zhai, Shiqing Ma

    Abstract: The backdoor attack, where the adversary uses inputs stamped with triggers (e.g., a patch) to activate pre-planted malicious behaviors, is a severe threat to Deep Neural Network (DNN) models. Trigger inversion is an effective way of identifying backdoor models and understanding embedded adversarial behaviors. A challenge of trigger inversion is that there are many ways of constructing the trigger.… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  18. arXiv:2302.06992  [pdf, other

    cs.CV

    Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation

    Authors: Chuang Zhu, Kebin Liu, Wenqi Tang, Ke Mei, Jiaqi Zou, Tiejun Huang

    Abstract: The divergence between labeled training data and unlabeled testing data is a significant challenge for recent deep learning models. Unsupervised domain adaptation (UDA) attempts to solve such problem. Recent works show that self-training is a powerful approach to UDA. However, existing methods have difficulty in balancing the scalability and performance. In this paper, we propose a hard-aware inst… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2008.12197

  19. arXiv:2212.07352  [pdf, other

    cs.CV

    Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors

    Authors: Kangfu Mei, Nithin Gopalakrishnan Nair, Vishal M. Patel

    Abstract: Conditional diffusion probabilistic models can model the distribution of natural images and can generate diverse and realistic samples based on given conditions. However, oftentimes their results can be unrealistic with observable color shifts and textures. We believe that this issue results from the divergence between the probabilistic distribution learned by the model and the distribution of nat… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  20. arXiv:2212.00235  [pdf, other

    cs.CV

    VIDM: Video Implicit Diffusion Models

    Authors: Kangfu Mei, Vishal M. Patel

    Abstract: Diffusion models have emerged as a powerful generative method for synthesizing high-quality and diverse set of images. In this paper, we propose a video generation method based on diffusion models, where the effects of motion are modeled in an implicit condition manner, i.e. one can sample plausible video motions according to the latent feature of frames. We improve the quality of the generated vi… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: AAAI2023 https://kfmei.page/vidm/

  21. arXiv:2210.15127  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Rethinking the Reverse-engineering of Trojan Triggers

    Authors: Zhenting Wang, Kai Mei, Hailun Ding, Juan Zhai, Shiqing Ma

    Abstract: Deep Neural Networks are vulnerable to Trojan (or backdoor) attacks. Reverse-engineering methods can reconstruct the trigger and thus identify affected models. Existing reverse-engineering methods only consider input space constraints, e.g., trigger size in the input space. Expressly, they assume the triggers are static patterns in the input space and fail to detect models with feature space trigg… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  22. arXiv:2208.11284  [pdf, other

    cs.CV

    AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models

    Authors: Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

    Abstract: Although many long-range imaging systems are designed to support extended vision applications, a natural obstacle to their operation is degradation due to atmospheric turbulence. Atmospheric turbulence causes significant degradation to image quality by introducing blur and geometric distortion. In recent years, various deep learning-based single image atmospheric turbulence mitigation methods, inc… ▽ More

    Submitted 20 September, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted to IEEE WACV 2023

  23. arXiv:2207.09302  [pdf, other

    cs.CV

    Deep Semantic Statistics Matching (D2SM) Denoising Network

    Authors: Kangfu Mei, Vishal M. Patel, Rui Huang

    Abstract: The ultimate aim of image restoration like denoising is to find an exact correlation between the noisy and clear image domains. But the optimization of end-to-end denoising learning like pixel-wise losses is performed in a sample-to-sample manner, which ignores the intrinsic correlation of images, especially semantics. In this paper, we introduce the Deep Semantic Statistics Matching (D2SM) Denois… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: ECCV2022, for Project Page, see https://kfmei.page/d2sm/

  24. arXiv:2204.08974  [pdf, other

    cs.CV eess.IV

    A comparison of different atmospheric turbulence simulation methods for image restoration

    Authors: Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

    Abstract: Atmospheric turbulence deteriorates the quality of images captured by long-range imaging systems by introducing blur and geometric distortions to the captured scene. This leads to a drastic drop in performance when computer vision algorithms like object/face recognition and detection are performed on these images. In recent years, various deep learning-based atmospheric turbulence mitigation metho… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

  25. arXiv:2204.03057  [pdf, other

    cs.CV

    Thermal to Visible Image Synthesis under Atmospheric Turbulence

    Authors: Kangfu Mei, Yiqun Mei, Vishal M. Patel

    Abstract: In many practical applications of long-range imaging such as biometrics and surveillance, thermal imagining modalities are often used to capture images in low-light and nighttime conditions. However, such imaging systems often suffer from atmospheric turbulence, which introduces severe blur and deformation artifacts to the captured images. Such an issue is unavoidable in long-range imaging and sig… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 4 pages, 3 figures

  26. arXiv:2202.09954  [pdf, other

    eess.SP cs.IT cs.LG

    Theoretical Analysis of Deep Neural Networks in Physical Layer Communication

    Authors: Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

    Abstract: Recently, deep neural network (DNN)-based physical layer communication techniques have attracted considerable interest. Although their potential to enhance communication systems and superb performance have been validated by simulation experiments, little attention has been paid to the theoretical analysis. Specifically, most studies in the physical layer have tended to focus on the application of… ▽ More

    Submitted 26 August, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: 15 pages, 13 figures, has been accepted for publication in IEEE Transactions on Communications. arXiv admin note: substantial text overlap with arXiv:2106.01124

    Journal ref: IEEE Transactions on Communications, 2022

  27. LTT-GAN: Looking Through Turbulence by Inverting GANs

    Authors: Kangfu Mei, Vishal M. Patel

    Abstract: In many applications of long-range imaging, we are faced with a scenario where a person appearing in the captured imagery is often degraded by atmospheric turbulence. However, restoring such degraded images for face verification is difficult since the degradation causes images to be geometrically distorted and blurry. To mitigate the turbulence effect, in this paper, we propose the first turbulenc… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: Project Page: https://kfmei.page/LTT-GAN/

  28. Mobile App Crowdsourced Test Report Consistency Detection via Deep Image-and-Text Fusion Understanding

    Authors: Shengcheng Yu, Chunrong Fang, Quanjun Zhang, Zhihao Cao, Yexiao Yun, Zhenfei Cao, Kai Mei, Zhenyu Chen

    Abstract: Crowdsourced testing, as a distinct testing paradigm, has attracted much attention in software testing, especially in mobile application (app) testing field. Compared with in-house testing, crowdsourced testing shows superiority with the diverse testing environments when faced with the mobile testing fragmentation problem. However, crowdsourced testing also encounters the low-quality test report p… ▽ More

    Submitted 12 June, 2023; v1 submitted 16 August, 2021; originally announced August 2021.

  29. A Low Complexity Learning-based Channel Estimation for OFDM Systems with Online Training

    Authors: Kai Mei, Jun Liu, Xiaoying Zhang, Kuo Cao, Nandana Rajatheva, Jibo Wei

    Abstract: In this paper, we devise a highly efficient machine learning-based channel estimation for orthogonal frequency division multiplexing (OFDM) systems, in which the training of the estimator is performed online. A simple learning module is employed for the proposed learning-based estimator. The training process is thus much faster and the required training data is reduced significantly. Besides, a tr… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: 12 pages, 12 figures. To appear in IEEE Transactions on Communications

  30. arXiv:2106.01124  [pdf, other

    eess.SP cs.IT cs.LG

    Opening the Black Box of Deep Neural Networks in Physical Layer Communication

    Authors: Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

    Abstract: Deep Neural Network (DNN)-based physical layer techniques are attracting considerable interest due to their potential to enhance communication systems. However, most studies in the physical layer have tended to focus on the application of DNN models to wireless communication problems but not to theoretically understand how does a DNN work in a communication system. In this paper, we aim to quantit… ▽ More

    Submitted 18 February, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: 6 pages, 5 figures, to be presented in the IEEE Wireless Communications and Networking Conference (WCNC) 2022 Workshop on Machine Learning for Communications: Future Large Scale MIMO and AI-Native Air-Interface

  31. arXiv:2104.00848  [pdf, other

    cs.CV

    SDAN: Squared Deformable Alignment Network for Learning Misaligned Optical Zoom

    Authors: Kangfu Mei, Shenglong Ye, Rui Huang

    Abstract: Deep Neural Network (DNN) based super-resolution algorithms have greatly improved the quality of the generated images. However, these algorithms often yield significant artifacts when dealing with real-world super-resolution problems due to the difficulty in learning misaligned optical zoom. In this paper, we introduce a Squared Deformable Alignment Network (SDAN) to address this issue. Our networ… ▽ More

    Submitted 25 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: ICME21. Code is available at https://github.com/MKFMIKU/SDAN

  32. arXiv:2103.05930  [pdf, other

    cs.CV

    AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

    Authors: Qi Song, Kangfu Mei, Rui Huang

    Abstract: Two factors have proven to be very important to the performance of semantic segmentation models: global context and multi-level semantics. However, generating features that capture both factors always leads to high computational complexity, which is problematic in real-time scenarios. In this paper, we propose a new model, called Attention-Augmented Network (AttaNet), to capture both global contex… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: AAAI 2021

  33. arXiv:2101.01479  [pdf, other

    cs.CV

    Scale-Aware Network with Regional and Semantic Attentions for Crowd Counting under Cluttered Background

    Authors: Qiaosi Yi, Yunxing Liu, Aiwen Jiang, Juncheng Li, Kangfu Mei, Mingwen Wang

    Abstract: Crowd counting is an important task that shown great application value in public safety-related fields, which has attracted increasing attention in recent years. In the current research, the accuracy of counting numbers and crowd density estimation are the main concerns. Although the emergence of deep learning has greatly promoted the development of this field, crowd counting under cluttered backg… ▽ More

    Submitted 7 January, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

  34. arXiv:2008.13084  [pdf, other

    eess.IV cs.CV

    MDCN: Multi-scale Dense Cross Network for Image Super-Resolution

    Authors: Juncheng Li, Faming Fang, Jiaqian Li, Kangfu Mei, Guixu Zhang

    Abstract: Convolutional neural networks have been proven to be of great benefit for single-image super-resolution (SISR). However, previous works do not make full use of multi-scale features and ignore the inter-scale correlation between different upsampling factors, resulting in sub-optimal performance. Instead of blindly increasing the depth of the network, we are committed to mining image features and le… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 15 pages, 15 figures

  35. arXiv:2008.12197  [pdf, other

    cs.CV

    Instance Adaptive Self-Training for Unsupervised Domain Adaptation

    Authors: Ke Mei, Chuang Zhu, Jiaqi Zou, Shanghang Zhang

    Abstract: The divergence between labeled training data and unlabeled testing data is a significant challenge for recent deep learning models. Unsupervised domain adaptation (UDA) attempts to solve such a problem. Recent works show that self-training is a powerful approach to UDA. However, existing methods have difficulty in balancing scalability and performance. In this paper, we propose an instance adaptiv… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  36. arXiv:2008.10480  [pdf, other

    cs.CV

    3rd Place Solution to "Google Landmark Retrieval 2020"

    Authors: Ke Mei, Lei li, **chang Xu, Yanhua Cheng, Yugeng Lin

    Abstract: Image retrieval is a fundamental problem in computer vision. This paper presents our 3rd place detailed solution to the Google Landmark Retrieval 2020 challenge. We focus on the exploration of data cleaning and models with metric learning. We use a data cleaning strategy based on embedding clustering. Besides, we employ a data augmentation method called Corner-Cutmix, which improves the model's ab… ▽ More

    Submitted 24 August, 2020; v1 submitted 24 August, 2020; originally announced August 2020.

  37. Fine Timing and Frequency Synchronization for MIMO-OFDM: An Extreme Learning Approach

    Authors: Jun Liu, Kai Mei, Xiaochen Zhang, Des McLernon, Dongtang Ma, Jibo Wei, Syed Ali Raza Zaidi

    Abstract: Multiple-input multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) is a key technology component in the evolution towards cognitive radio (CR) in next-generation communication in which the accuracy of timing and frequency synchronization significantly impacts the overall system performance. In this paper, we propose a novel scheme leveraging extreme learning machine (ELM) to ach… ▽ More

    Submitted 1 June, 2022; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: 13 pages, 12 figures, has been accepted for publication in IEEE Transactions on Cognitive Communications and Networking

    Journal ref: IEEE Transactions on Cognitive Communications and Networking, 2021

  38. arXiv:2006.15954  [pdf, other

    eess.IV cs.CV

    Multi-level colonoscopy malignant tissue detection with adversarial CAC-UNet

    Authors: Chuang Zhu, Ke Mei, Ting Peng, Yihao Luo, Jun Liu, Ying Wang, Mulan **

    Abstract: The automatic and objective medical diagnostic model can be valuable to achieve early cancer detection, and thus reducing the mortality rate. In this paper, we propose a highly efficient multi-level malignant tissue detection through the designed adversarial CAC-UNet. A patch-level model with a pre-prediction strategy and a malignancy area guided label smoothing is adopted to remove the negative W… ▽ More

    Submitted 30 June, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: accepted by Neurocomputing; winner of the MICCAI DigestPath 2019 challenge on colonoscopy tissue segmentation and classification task

  39. arXiv:2006.13511  [pdf, other

    cs.CV

    Disentangle Perceptual Learning through Online Contrastive Learning

    Authors: Kangfu Mei, Yao Lu, Qiaosi Yi, Haoyu Wu, Juncheng Li, Rui Huang

    Abstract: Pursuing realistic results according to human visual perception is the central concern in the image transformation tasks. Perceptual learning approaches like perceptual loss are empirically powerful for such tasks but they usually rely on the pre-trained classification network to provide features, which are not necessarily optimal in terms of visual perception of image transformation. In this pape… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 12 pages, 8 figures

  40. arXiv:2004.13875  [pdf, other

    cs.IT eess.SP

    6G White Paper on Machine Learning in Wireless Communication Networks

    Authors: Samad Ali, Walid Saad, Nandana Rajatheva, Kapseok Chang, Daniel Steinbach, Benjamin Sliwa, Christian Wietfeld, Kai Mei, Hamid Shiri, Hans-Jürgen Zepernick, Thi My Chinh Chu, Ijaz Ahmad, Jyrki Huusko, Jaakko Suutala, Shubhangi Bhadauria, Vimal Bhatia, Rangeet Mitra, Saidhiraj Amuru, Robert Abbas, Baohua Shao, Michele Capobianco, Guanghui Yu, Maelick Claes, Teemu Karvonen, Mingzhe Chen , et al. (2 additional authors not shown)

    Abstract: The focus of this white paper is on machine learning (ML) in wireless communications. 6G wireless communication networks will be the backbone of the digital transformation of societies by providing ubiquitous, reliable, and near-instant wireless connectivity for humans and machines. Recent advances in ML research has led enable a wide range of novel technologies such as self-driving vehicles and v… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  41. arXiv:2002.08587  [pdf, other

    eess.IV cs.CV

    Cross-stained Segmentation from Renal Biopsy Images Using Multi-level Adversarial Learning

    Authors: Ke Mei, Chuang Zhu, Lei Jiang, Jun Liu, Yuanyuan Qiao

    Abstract: Segmentation from renal pathological images is a key step in automatic analyzing the renal histological characteristics. However, the performance of models varies significantly in different types of stained datasets due to the appearance variations. In this paper, we design a robust and flexible model for cross-stained segmentation. It is a novel multi-level deep adversarial network architecture t… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted by ICASSP2020

  42. Enhanced LMMSE Estimation Capable of Selecting Parameters

    Authors: Kai Mei, Jun Liu, Xiaoran Liu, Jun Xiong, Xiaoying Zhang, Jibo Wei

    Abstract: In the linear minimum mean square error (LMMSE) estimation for orthogonal frequency division multiplexing (OFDM) systems, the problem about the determination of the algorithm's parameters, especially those related with channel frequency response (CFR) correlation, has not been readily solved yet. Although many approaches have been proposed to determine the statistic parameters, it is hard to choos… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

  43. arXiv:1911.08098  [pdf, other

    eess.IV cs.CV

    HighEr-Resolution Network for Image Demosaicing and Enhancing

    Authors: Kangfu Mei, Juncheng Li, Jiajie Zhang, Haoyu Wu, Jie Li, Rui Huang

    Abstract: Neural-networks based image restoration methods tend to use low-resolution image patches for training. Although higher-resolution image patches can provide more global information, state-of-the-art methods cannot utilize them due to their huge GPU memory usage, as well as the instable training process. However, plenty of studies have shown that global information is crucial for image restoration t… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted in ICCV 2019 Workshop (AIM2019 Raw to RGB Challenge Winner)

  44. Performance Analysis on Machine Learning-Based Channel Estimation

    Authors: Kai Mei, Jun Liu, Xiaochen Zhang, Nandana Rajatheva, Jibo Wei

    Abstract: Recently, machine learning-based channel estimation has attracted much attention. The performance of machine learning-based estimation has been validated by simulation experiments. However, little attention has been paid to the theoretical performance analysis. In this paper, we investigate the mean square error (MSE) performance of machine learning-based estimation. Hypothesis testing is employed… ▽ More

    Submitted 14 July, 2021; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: 11 pages, 10 figures. To appear in IEEE Transactions on Communications

  45. arXiv:1811.09346  [pdf, other

    eess.SP cs.LG

    Deep Neural Network Aided Scenario Identification in Wireless Multi-path Fading Channels

    Authors: Jun Liu, Kai Mei, Dongtang Ma, Jibo Wei

    Abstract: This letter illustrates our preliminary works in deep nerual network (DNN) for wireless communication scenario identification in wireless multi-path fading channels. In this letter, six kinds of channel scenarios referring to COST 207 channel model have been performed. 100% identification accuracy has been observed given signal-to-noise (SNR) over 20dB whereas a 88.4% average accuracy has been obt… ▽ More

    Submitted 22 November, 2018; originally announced November 2018.

    Comments: Draft of a four-page letter with 8 figures

  46. arXiv:1811.07445  [pdf, other

    eess.SP cs.IT

    High-precision timing and frequency synchronization method for MIMO-OFDM systems in double-selective channels

    Authors: Jun Liu, Kai Mei, Xiaochen Zhang, Xiaoying Zhang, Dongtang Ma, Jibo Wei

    Abstract: In this letter, a novel synchronization method for MIMO-OFDM systems is proposed. The new approach has an accurate estimate of both symbol timing and large frequency offest. Simulation results show the excellent robustness of our method in double-selective channel even if the strongest multipath component arrives behind the first path.

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: 2 pages letter with 4 figures

  47. Phase asymmetry ultrasound despeckling with fractional anisotropic diffusion and total variation

    Authors: Kunqiang Mei, Bin Hu, Baowei Fei, Binjie Qin

    Abstract: We propose an ultrasound speckle filtering method for not only preserving various edge features but also filtering tissue-dependent complex speckle noises in ultrasound images. The key idea is to detect these various edges using a phase congruence-based edge significance measure called phase asymmetry (PAS), which is invariant to the intensity amplitude of edges and takes 0 in non-edge smooth regi… ▽ More

    Submitted 9 February, 2021; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: 12

    Journal ref: IEEE Transaction on Image Processing, 2020

  48. arXiv:1810.02283  [pdf, other

    cs.CV

    Progressive Feature Fusion Network for Realistic Image Dehazing

    Authors: Kangfu Mei, Aiwen Jiang, Juncheng Li, Mingwen Wang

    Abstract: Single image dehazing is a challenging ill-posed restoration problem. Various prior-based and learning-based methods have been proposed. Most of them follow a classic atmospheric scattering model which is an elegant simplified physical model based on the assumption of single-scattering and homogeneous atmospheric medium. The formulation of haze in realistic environment is more complicated. In this… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: 14 pages, 7 figures, 1 tables, accepted by ACCV2018

  49. arXiv:1810.01831  [pdf, other

    cs.CV

    An Effective Single-Image Super-Resolution Model Using Squeeze-and-Excitation Networks

    Authors: Kangfu Mei, Aiwen Jiang, Juncheng Li, Jihua Ye, Mingwen Wang

    Abstract: Recent works on single-image super-resolution are concentrated on improving performance through enhancing spatial encoding between convolutional layers. In this paper, we focus on modeling the correlations between channels of convolutional features. We present an effective deep residual network based on squeeze-and-excitation blocks (SEBlock) to reconstruct high-resolution (HR) image from low-reso… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 12 pages, accepted by ICONIP2018

  50. Higgs Physics at the CLIC Electron-Positron Linear Collider

    Authors: H. Abramowicz, A. Abusleme, K. Afanaciev, N. Alipour Tehrani, C. Balázs, Y. Benhammou, M. Benoit, B. Bilki, J. -J. Blaising, M. J. Boland, M. Boronat, O. Borysov, I. Božović-Jelisavčić, M. Buckland, S. Bugiel, P. N. Burrows, T. K. Charles, W. Daniluk, D. Dannheim, R. Dasgupta, M. Demarteau, M. A. Díaz Gutierrez, G. Eigen, K. Elsener, U. Felzmann , et al. (99 additional authors not shown)

    Abstract: The Compact Linear Collider (CLIC) is an option for a future e+e- collider operating at centre-of-mass energies up to 3 TeV, providing sensitivity to a wide range of new physics phenomena and precision physics measurements at the energy frontier. This paper is the first comprehensive presentation of the Higgs physics reach of CLIC operating at three energy stages: sqrt(s) = 350 GeV, 1.4 TeV and 3… ▽ More

    Submitted 5 June, 2017; v1 submitted 26 August, 2016; originally announced August 2016.

    Comments: 42 pages, 29 figures, accepted for publication in the European Physical Journal C

    Report number: CLICdp-Pub-2016-001

    Journal ref: Eur. Phys. J. C 77, 475 (2017)