Skip to main content

Showing 1–50 of 180 results for author: Hou, Q

.
  1. arXiv:2407.00021  [pdf, other

    cs.CV cs.GR eess.IV

    Neural Graphics Texture Compression Supporting Random Acces

    Authors: Farzad Farhadzadeh, Qiqi Hou, Hoang Le, Amir Said, Randall Rauwendaal, Alex Bourd, Fatih Porikli

    Abstract: Advances in rendering have led to tremendous growth in texture assets, including resolution, complexity, and novel textures components, but this growth in data volume has not been matched by advances in its compression. Meanwhile Neural Image Compression (NIC) has advanced significantly and shown promising results, but the proposed methods cannot be directly adapted to neural texture compression.… ▽ More

    Submitted 6 May, 2024; originally announced July 2024.

    Comments: ECCV submission

  2. arXiv:2406.15819  [pdf, other

    cs.LG cs.IT cs.NI eess.SP

    Automatic AI Model Selection for Wireless Systems: Online Learning via Digital Twinning

    Authors: Qiushuo Hou, Matteo Zecchin, Sangwoo Park, Yunlong Cai, Guanding Yu, Kaushik Chowdhury, Osvaldo Simeone

    Abstract: In modern wireless network architectures, such as O-RAN, artificial intelligence (AI)-based applications are deployed at intelligent controllers to carry out functionalities like scheduling or power control. The AI "apps" are selected on the basis of contextual information such as network conditions, topology, traffic statistics, and design goals. The map** between context and AI model parameter… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: submitted for a journal publication

  3. arXiv:2406.06858  [pdf, other

    cs.LG cs.DC

    FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion

    Authors: Li-Wen Chang, Wenlei Bao, Qi Hou, Chengquan Jiang, Ningxin Zheng, Yinmin Zhong, Xuanrun Zhang, Zuquan Song, Ziheng Jiang, Haibin Lin, Xin **, Xin Liu

    Abstract: Large deep learning models have demonstrated strong ability to solve many tasks across a wide range of applications. Those large models typically require training and inference to be distributed. Tensor parallelism is a common technique partitioning computation of an operation or layer across devices to overcome the memory capacity limitation of a single processor, and/or to accelerate computation… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.00670  [pdf, other

    cs.CV

    Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

    Authors: Yunheng Li, ZhongYu Li, Quansheng Zeng, Qibin Hou, Ming-Ming Cheng

    Abstract: Pre-trained vision-language models, e.g., CLIP, have been successfully applied to zero-shot semantic segmentation. Existing CLIP-based approaches primarily utilize visual features from the last layer to align with text embeddings, while they neglect the crucial information in intermediate layers that contain rich object details. However, we find that directly aggregating the multi-level visual fea… ▽ More

    Submitted 6 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  5. arXiv:2405.08021  [pdf, other

    cs.SD eess.AS

    Diff-ETS: Learning a Diffusion Probabilistic Model for Electromyography-to-Speech Conversion

    Authors: Zhao Ren, Kevin Scheck, Qinhan Hou, Stefano van Gogh, Michael Wand, Tanja Schultz

    Abstract: Electromyography-to-Speech (ETS) conversion has demonstrated its potential for silent speech interfaces by generating audible speech from Electromyography (EMG) signals during silent articulations. ETS models usually consist of an EMG encoder which converts EMG signals to acoustic speech features, and a vocoder which then synthesises the speech signals. Due to an inadequate amount of available dat… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted by EMBC 2024

  6. arXiv:2405.07469  [pdf, other

    quant-ph physics.optics

    Phase coding semi-quantum key distribution system based on the Single-state protocol

    Authors: Qincheng Hou, Siying Huang, Naida Mo, **dong Wang, Zhengjun Wei, Yafei Yu, Tianming Zhao, Zhiming Zhang

    Abstract: Semi-quantum key distribution (SQKD) allows sharing random keys between a quantum user and a classical user. However, implementing classical user operations is challenging, posing a hurdle to achieving the Single-state protocol. By using the "selective modulation" method, the feasibility of SQKD is verified in principle. The proposal of the selective modulation method enables the realization of ot… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  7. arXiv:2405.01434  [pdf, other

    cs.CV

    StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

    Authors: Yupeng Zhou, Daquan Zhou, Ming-Ming Cheng, Jiashi Feng, Qibin Hou

    Abstract: For recent diffusion-based generative models, maintaining consistent content across a series of generated images, especially those containing subjects and complex details, presents a significant challenge. In this paper, we propose a new way of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  8. 3D Gaussian Splatting with Deferred Reflection

    Authors: Keyang Ye, Qiming Hou, Kun Zhou

    Abstract: The advent of neural and Gaussian-based radiance field methods have achieved great success in the field of novel view synthesis. However, specular reflection remains non-trivial, as the high frequency radiance field is notoriously difficult to fit stably and accurately. We present a deferred shading method to effectively render specular reflection with Gaussian splatting. The key challenge comes f… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  9. arXiv:2404.11860  [pdf, ps, other

    quant-ph

    Active robustness against the detuning-error for Rydberg quantum gates

    Authors: Qing-Ling Hou, Han Wang, **g Qian

    Abstract: Error suppression to the experimental imperfections is a central challenge for useful quantum computing. Recent studies have shown the advantages of using single-modulated pulses based on optimal control which can realize high-fidelity two-qubit gates in neutral-atom arrays. However, typical optimization only minimizes the ideal gate error in the absence of any decay, which allows the gate to be p… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures

  10. arXiv:2404.11100  [pdf, other

    cs.CV cs.LG

    Synthesizing Realistic Data for Table Recognition

    Authors: Qiyu Hou, Jun Wang, Meixuan Qiao, Lujun Tian

    Abstract: To overcome the limitations and challenges of current automatic table data annotation methods and random table data synthesis approaches, we propose a novel method for synthesizing annotation data specifically designed for table recognition. This method utilizes the structure and content of existing complex tables, facilitating the efficient creation of tables that closely replicate the authentic… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: ICDAR 2024

  11. arXiv:2404.04887  [pdf, other

    cs.CV

    A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images

    Authors: Qingshan Hou, Shuai Cheng, Peng Cao, **zhu Yang, Xiaoli Liu, Osmar R. Zaiane, Yih Chung Tham

    Abstract: Representation learning offers a conduit to elucidate distinctive features within the latent space and interpret the deep models. However, the randomness of lesion distribution and the complexity of low-quality factors in medical images pose great challenges for models to extract key lesion features. Disease diagnosis methods guided by contrastive learning (CL) have shown significant advantages in… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  12. arXiv:2403.17879  [pdf, other

    cs.CV eess.IV

    Low-Latency Neural Stereo Streaming

    Authors: Qiqi Hou, Farzad Farhadzadeh, Amir Said, Guillaume Sautiere, Hoang Le

    Abstract: The rise of new video modalities like virtual reality or autonomous driving has increased the demand for efficient multi-view video compression methods, both in terms of rate-distortion (R-D) performance and in terms of delay and runtime. While most recent stereo video compression approaches have shown promising performance, they compress left and right views sequentially, leading to poor parallel… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  13. arXiv:2403.17749  [pdf, other

    cs.CV

    Multi-Task Dense Prediction via Mixture of Low-Rank Experts

    Authors: Yuqi Yang, Peng-Tao Jiang, Qibin Hou, Hao Zhang, **wei Chen, Bo Li

    Abstract: Previous multi-task dense prediction methods based on the Mixture of Experts (MoE) have received great performance but they neglect the importance of explicitly modeling the global relations among all tasks. In this paper, we present a novel decoder-focused method for multi-task dense prediction, called Mixture-of-Low-Rank-Experts (MLoRE). To model the global task relationships, MLoRE adds a gener… ▽ More

    Submitted 27 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  14. arXiv:2403.11735  [pdf, other

    cs.CV cs.LG

    LSKNet: A Foundation Lightweight Backbone for Remote Sensing

    Authors: Yuxuan Li, Xiang Li, Yimian Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang

    Abstract: Remote sensing images pose distinct challenges for downstream tasks due to their inherent complexity. While a considerable amount of research has been dedicated to remote sensing classification, object detection and semantic segmentation, most of these studies have overlooked the valuable prior knowledge embedded within remote sensing scenarios. Such prior knowledge can be useful because remote se… ▽ More

    Submitted 23 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.09030

  15. arXiv:2403.06534  [pdf, other

    cs.CV cs.AI cs.CE cs.LG

    SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection

    Authors: Yuxuan Li, Xiang Li, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang

    Abstract: Synthetic Aperture Radar (SAR) object detection has gained significant attention recently due to its irreplaceable all-weather imaging capabilities. However, this research field suffers from both limited public datasets (mostly comprising <2K images with only mono-category objects) and inaccessible source code. To tackle these challenges, we establish a new benchmark dataset and an open-source met… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 22 Pages, 10 Figures, 9 Tables

  16. arXiv:2402.17403  [pdf, other

    cs.CV

    Sora Generates Videos with Stunning Geometrical Consistency

    Authors: Xuanyi Li, Daquan Zhou, Chenxu Zhang, Shaodong Wei, Qibin Hou, Ming-Ming Cheng

    Abstract: The recently developed Sora model [1] has exhibited remarkable capabilities in video generation, sparking intense discussions regarding its ability to simulate real-world phenomena. Despite its growing popularity, there is a lack of established metrics to evaluate its fidelity to real-world physics quantitatively. In this paper, we introduce a new benchmark that assesses the quality of the generat… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures

  17. arXiv:2402.15627  [pdf, other

    cs.LG cs.DC

    MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

    Authors: Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao , et al. (7 additional authors not shown)

    Abstract: We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs. Training LLMs at this scale brings unprecedented challenges to training efficiency and stability. We take a full-stack approach that co-designs the algorithmic and system components across model bl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2402.09270  [pdf, other

    cs.CV

    Fast Window-Based Event Denoising with Spatiotemporal Correlation Enhancement

    Authors: Huachen Fang, **jian Wu, Qibin Hou, Weisheng Dong, Guangming Shi

    Abstract: Previous deep learning-based event denoising methods mostly suffer from poor interpretability and difficulty in real-time processing due to their complex architecture designs. In this paper, we propose window-based event denoising, which simultaneously deals with a stack of events while existing element-based denoising focuses on one event each time. Besides, we give the theoretical analysis based… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  19. arXiv:2402.05375  [pdf, other

    cs.CV

    Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models

    Authors: Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

    Abstract: The success of recent text-to-image diffusion models is largely due to their capacity to be guided by a complex text prompt, which enables users to precisely describe the desired content. However, these models struggle to effectively suppress the generation of undesired content, which is explicitly requested to be omitted from the generated image in the prompt. In this paper, we analyze how to man… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. Our code is available in https://github.com/sen-mao/SuppressEOT

  20. arXiv:2401.11387  [pdf, ps, other

    math.CO

    Rational Solutions to the First Order Difference Equations in the Bivariate Difference Field

    Authors: Qing-Hu Hou, Yarong Wei

    Abstract: Inspired by Karr's algorithm, we consider the summations involving a sequence satisfying a recurrence of order two. The structure of such summations provides an algebraic framework for solving the difference equations of form $aσ(g)+bg=f$ in the bivariate difference field $(\mathbb{F}(α, β), σ)$, where $a, b,f\in\mathbb{F}(α,β)\setminus\{0\}$ are known binary functions of $α$, $β$, and $α$, $β$ ar… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  21. arXiv:2312.13613  [pdf, ps, other

    math.CO math.NT

    Reduction on the congruences of partial sums of P-recursive sequences

    Authors: Qing-Hu Hou, Na Li

    Abstract: Hou and Liu developed a telesco** method to prove the congruence of partial sums of P-recursive sequences. We release the requirement on the telescoper and utilize the congruence of the sequence. With this approach, we are able to confirm a conjecture of Sun and find a new congruence on the central trinomial coefficient.

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 11 pages

    MSC Class: 33F10; 11A07; 05A19; 11B65

  22. arXiv:2312.08866  [pdf, other

    eess.IV cs.CV

    MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention

    Authors: Hao Shao, Quansheng Zeng, Qibin Hou, Jufeng Yang

    Abstract: Efficiently capturing multi-scale information and building long-range dependencies among pixels are essential for medical image segmentation because of the various sizes and shapes of the lesion regions or organs. In this paper, we present Multi-scale Cross-axis Attention (MCA) to solve the above challenging issues based on the efficient axial attention. Instead of simply connecting axial attentio… ▽ More

    Submitted 19 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  23. arXiv:2312.08735  [pdf, other

    cs.CV

    Polyper: Boundary Sensitive Polyp Segmentation

    Authors: Hao Shao, Yang Zhang, Qibin Hou

    Abstract: We present a new boundary sensitive framework for polyp segmentation, called Polyper. Our method is motivated by a clinical approach that seasoned medical practitioners often leverage the inherent features of interior polyp regions to tackle blurred boundaries.Inspired by this, we propose explicitly leveraging polyp regions to bolster the model's boundary discrimination capability while minimizing… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  24. arXiv:2312.05830  [pdf, other

    cs.CV

    A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation

    Authors: Yunheng Li, Zhongyu Li, Shanghua Gao, Qilong Wang, Qibin Hou, Ming-Ming Cheng

    Abstract: Effectively modeling discriminative spatio-temporal information is essential for segmenting activities in long action sequences. However, we observe that existing methods are limited in weak spatio-temporal modeling capability due to two forms of decoupled modeling: (i) cascaded interaction couples spatial and temporal modeling, which over-smooths motion modeling over the long sequence, and (ii) j… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  25. arXiv:2312.04248  [pdf, other

    cs.CV

    TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes

    Authors: Xuying Zhang, Bo-Wen Yin, Yuming Chen, Zheng Lin, Yunheng Li, Qibin Hou, Ming-Ming Cheng

    Abstract: Recent progress in the text-driven 3D stylization of a single object has been considerably promoted by CLIP-based methods. However, the stylization of multi-object 3D scenes is still impeded in that the image-text pairs used for pre-training CLIP mostly consist of an object. Meanwhile, the local details of multiple objects may be susceptible to omission due to the existing supervision manner prima… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  26. arXiv:2311.06772  [pdf, other

    cs.CV cs.AI

    ChatAnything: Facetime Chat with LLM-Enhanced Personas

    Authors: Yilin Zhao, Xinbin Yuan, Shanghua Gao, Zhijie Lin, Qibin Hou, Jiashi Feng, Daquan Zhou

    Abstract: In this technical report, we target generating anthropomorphized personas for LLM-based characters in an online manner, including visual appearance, personality and tones, with only text descriptions. To achieve this, we first leverage the in-context learning capability of LLMs for personality generation by carefully designing a set of system prompts. We then propose two novel concepts: the mixtur… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  27. arXiv:2310.19234  [pdf, ps, other

    math.CO

    Log-behavior of the root sequences of P-recursive sequences

    Authors: Qing-hu Hou, Zhongjie Li

    Abstract: In recent years, Sun has proposed numerous conjectures regarding the log-concavity of root sequences $\{\sqrt[n]{a_n}}_{n\geqslant 1}$. We establish criteria for the asymptotic log-concavity of $\{\sqrt[n]{a_n}}_{n\geqslant 1}$ and the asymptotic ratio log-convexity of $\{\sqrt[n]{a_n}}_{n\geqslant 1}$ for $P$-recursive sequences $\{\sqrt[n]{a_n}}_{n\geqslant{0}}$. Additionally, by the aid of symb… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 15 pages

    ACM Class: G.2.1

  28. arXiv:2310.13235  [pdf, other

    cs.GR cs.CV

    Auxiliary Features-Guided Super Resolution for Monte Carlo Rendering

    Authors: Qiqi Hou, Feng Liu

    Abstract: This paper investigates super resolution to reduce the number of pixels to render and thus speed up Monte Carlo rendering algorithms. While great progress has been made to super resolution technologies, it is essentially an ill-posed problem and cannot recover high-frequency details in renderings. To address this problem, we exploit high-resolution auxiliary features to guide super resolution of l… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by CGF

    Journal ref: Computer Graphics Forum 2023

  29. arXiv:2310.13215  [pdf, other

    cs.CV

    Zone Evaluation: Revealing Spatial Bias in Object Detection

    Authors: Zhaohui Zheng, Yuming Chen, Qibin Hou, Xiang Li, ** Wang, Ming-Ming Cheng

    Abstract: A fundamental limitation of object detectors is that they suffer from "spatial bias", and in particular perform less satisfactorily when detecting objects near image borders. For a long time, there has been a lack of effective ways to measure and identify spatial bias, and little is known about where it comes from and what degree it is. To this end, we present a new zone evaluation protocol, exten… ▽ More

    Submitted 1 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE TPAMI

  30. arXiv:2310.03699  [pdf, other

    math.CO

    Taylor coefficients and series involving harmonic numbers

    Authors: Qing-Hu Hou, Zhi-Wei Sun

    Abstract: During 2022--2023 Z.-W. Sun posed many conjectures on infinite series with summands involving generalized harmonic numbers. Motivated by this, we deduce $31$ series identities involving harmonic numbers, three of which were previously conjectured by the second author. For example, we obtain that \[ \sum_{k=1}^{\infty} \frac{(-1)^k}{k^2{2k \choose k}{3k \choose k}} \big( \frac{7 k-2}{2 k-1} H_{k-1}… ▽ More

    Submitted 26 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: add some new series

    MSC Class: Primary 05A19; 11B65; Secondary 33B15

  31. arXiv:2309.09668  [pdf, other

    cs.CV

    DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation

    Authors: Bowen Yin, Xuying Zhang, Zhongyu Li, Li Liu, Ming-Ming Cheng, Qibin Hou

    Abstract: We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks. DFormer has two new key innovations: 1) Unlike previous works that encode RGB-D information with RGB pretrained backbone, we pretrain the backbone using image-depth pairs from ImageNet-1K, and hence the DFormer is endowed with the capacity to encode RGB-D representations; 2)… ▽ More

    Submitted 7 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by ICLR 2024

  32. arXiv:2309.04399  [pdf, other

    cs.CV

    MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask

    Authors: Yupeng Zhou, Daquan Zhou, Zuo-Liang Zhu, Yaxing Wang, Qibin Hou, Jiashi Feng

    Abstract: Recent advancements in diffusion models have showcased their impressive capacity to generate visually striking images. Nevertheless, ensuring a close match between the generated image and the given prompt remains a persistent challenge. In this work, we identify that a crucial factor leading to the text-image mismatch issue is the inadequate cross-modality relation learning between the prompt and… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  33. arXiv:2308.05778  [pdf

    cond-mat.supr-con cond-mat.str-el

    Current percolation model for the special resistivity behavior observed in Cu-doped Apatite

    Authors: Qiang Hou, Wei Wei, Xin Zhou, Xinyue Wang, Yue Sun, ZhiXiang Shi

    Abstract: Since the initial report of the potential occurrence of room-temperature superconductivity under normal pressure [arXiv: 2307.12008], there has been significant interest in the field of condensed matter physics regarding Cu-doped Apatite (Pb10-xCux(PO4)6O). In this study, we performed temperature-dependent resistivity measurements on the synthesized Pb10-xCux(PO4)6O samples. The structure of the s… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: This paper represents a continuation of our previous study [arXiv:2308.01192], now offering a more comprehensive analysis of the collected data

    Journal ref: Matter 6, 4408-4418 (2023)

  34. arXiv:2308.05480  [pdf, other

    cs.CV

    YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection

    Authors: Yuming Chen, Xinbin Yuan, Ruiqi Wu, Jiabao Wang, Qibin Hou, Ming-Ming Cheng

    Abstract: We aim at providing the object detection community with an efficient and performant object detector, termed YOLO-MS. The core design is based on a series of investigations on how convolutions with different kernel sizes affect the detection performance of objects at different scales. The outcome is a new strategy that can strongly enhance multi-scale feature representations of real-time object det… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  35. arXiv:2308.01192  [pdf

    cond-mat.supr-con

    Observation of zero resistance above 100$^\circ$ K in Pb$_{10-x}$Cu$_x$(PO$_4$)$_6$O

    Authors: Qiang Hou, Wei Wei, Xin Zhou, Yue Sun, Zhixiang Shi

    Abstract: Room-temperature superconductivity has always been regarded as the ultimate goal in the fields of solid-state physics and materials science, with its realization holding revolutionary significance, capable of triggering significant changes in energy transmission and storage. However, achieving it poses various challenges. Recent research revealed that material Pb$_{10-x}$Cu$_x$(PO$_4$)$_6$O displa… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 7 pages, 3 figures

    Journal ref: Matter 6, 4408-4418 (2023)

  36. arXiv:2307.02176  [pdf, other

    q-bio.BM

    Molecular Dynamics

    Authors: Halima Mouhib, Juami H. M. van Gils, Jose Gavaldá-Garciá, Qingzhen Hou, Ali May, Arriën Symon Rauh, Jocelyne Vreede, Sanne Abeln, K. Anton Feenstra

    Abstract: While many good textbooks are available on Protein Structure, Molecular Simulations, Thermodynamics and Bioinformatics methods in general, there is no good introductory level book for the field of Structural Bioinformatics. This book aims to give an introduction into Structural Bioinformatics, which is where the previous topics meet to explore three dimensional protein structures through computati… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: editorial responsability: Halima Mouhib, Sanne Abeln, K. Anton Feenstra. This chapter is part of the book "Introduction to Protein Structural Bioinformatics". The Preface arXiv:1801.09442 contains links to all the (published) chapters. The update adds available arxiv hyperlinks for the chapters

  37. arXiv:2307.02173  [pdf, other

    q-bio.BM

    Function Prediction

    Authors: Bas Stringer, Annika Jacobsen, Qingzhen Hou, Hans de Ferrante, Olga Ivanova, Katharina Waury, Jose Gavaldá-Garciá, Sanne Abeln, K. Anton Feenstra

    Abstract: While many good textbooks are available on Protein Structure, Molecular Simulations, Thermodynamics and Bioinformatics methods in general, there is no good introductory level book for the field of Structural Bioinformatics. This book aims to give an introduction into Structural Bioinformatics, which is where the previous topics meet to explore three dimensional protein structures through computati… ▽ More

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: editorial responsability: K. Anton Feenstra, Sanne Abeln. This chapter is part of the book "Introduction to Protein Structural Bioinformatics". The Preface arXiv:1801.09442 contains links to all the (published) chapters. The update adds available arxiv hyperlinks for the chapters

  38. arXiv:2306.13277  [pdf, ps, other

    eess.SP

    Meta-Gating Framework for Fast and Continuous Resource Optimization in Dynamic Wireless Environments

    Authors: Qiushuo Hou, Mengyuan Lee, Guanding Yu, Yunlong Cai

    Abstract: With the great success of deep learning (DL) in image classification, speech recognition, and other fields, more and more studies have applied various neural networks (NNs) to wireless resource allocation. Generally speaking, these artificial intelligent (AI) models are trained under some special learning hypotheses, especially that the statistics of the training data are static during the trainin… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: accepted by IEEE TCOM

  39. arXiv:2306.11369  [pdf, other

    cs.CV

    CrossKD: Cross-Head Knowledge Distillation for Object Detection

    Authors: Jiabao Wang, Yuming Chen, Zhaohui Zheng, Xiang Li, Ming-Ming Cheng, Qibin Hou

    Abstract: Knowledge Distillation (KD) has been validated as an effective model compression technique for learning compact object detectors. Existing state-of-the-art KD methods for object detection are mostly based on feature imitation. In this paper, we present a general and effective prediction mimicking distillation scheme, called CrossKD, which delivers the intermediate features of the student's detecti… ▽ More

    Submitted 15 April, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  40. arXiv:2306.07532  [pdf, other

    cs.CV

    Referring Camouflaged Object Detection

    Authors: Xuying Zhang, Bowen Yin, Zheng Lin, Qibin Hou, Deng-** Fan, Ming-Ming Cheng

    Abstract: We consider the problem of referring camouflaged object detection (Ref-COD), a new task that aims to segment specified camouflaged objects based on a small set of referring images with salient target objects. We first assemble a large-scale dataset, called R2C7K, which consists of 7K images covering 64 object categories in real-world scenarios. Then, we develop a simple but strong dual-branch fram… ▽ More

    Submitted 11 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  41. arXiv:2306.04300  [pdf, other

    cs.CV

    CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

    Authors: Boyuan Sun, Yuqi Yang, Le Zhang, Ming-Ming Cheng, Qibin Hou

    Abstract: This paper presents a simple but performant semi-supervised semantic segmentation approach, called CorrMatch. Previous approaches mostly employ complicated training strategies to leverage unlabeled data but overlook the role of correlation maps in modeling the relationships between pairs of locations. We observe that the correlation maps not only enable clustering pixels of the same category easil… ▽ More

    Submitted 10 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  42. arXiv:2305.15248  [pdf, other

    cs.CV

    Delving Deeper into Data Scaling in Masked Image Modeling

    Authors: Cheng-Ze Lu, Xiaojie **, Qibin Hou, Jun Hao Liew, Ming-Ming Cheng, Jiashi Feng

    Abstract: Understanding whether self-supervised learning methods can scale with unlimited data is crucial for training large-scale models. In this work, we conduct an empirical study on the scaling capability of masked image modeling (MIM) methods (e.g., MAE) for visual recognition. Unlike most previous works that depend on the widely-used ImageNet dataset, which is manually curated and object-centric, we t… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  43. arXiv:2305.00498  [pdf, ps, other

    math.NT

    Ramanujan-inspired series for $1/π$ involving harmonic numbers

    Authors: Qinghu Hou, Haihong He, Xiaoxia Wang

    Abstract: By applying the derivative operator to the known identities from hypergeometric series or WZ pairs, we obtain seven series associated with harmonic numbers. Specifically, six of them are Ramanujan-like formulas for $1/π$ and the remaining onecontains harmonic numbers of order $2$. As conclusions, Sun's five conjectural series are proved.

    Submitted 8 July, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: 11 pages

  44. arXiv:2305.00371  [pdf, other

    nucl-th astro-ph.SR

    New 26P(p,γ)27S thermonuclear reaction rate and its astrophysical implication in rp-process

    Authors: S. Q. Hou, J. B. Liu, T. C. L. Trueman, J. G. Li, M. Pignatari, C. Bertulani, X. X. Xu

    Abstract: Accurate nuclear reaction rates for 26P(p,γ)27S are pivotal for a comprehensive understanding of rp-process nucleosynthesis path in the region of proton-rich sulfur and phosphorus isotopes. However, large uncertainties still exist in the current rate of 26P(p,γ)27S because of the lack of the nuclear mass and the energy level structure information of 27S. We reevaluate this reaction rate using the… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  45. arXiv:2304.13240  [pdf, other

    cs.CV cs.LG

    Structure Diagram Recognition in Financial Announcements

    Authors: Meixuan Qiao, Jun Wang, Junfu Xiang, Qiyu Hou, Ruixuan Li

    Abstract: Accurately extracting structured data from structure diagrams in financial announcements is of great practical importance for building financial knowledge graphs and further improving the efficiency of various financial applications. First, we proposed a new method for recognizing structure diagrams in financial announcements, which can better detect and extract different types of connecting lines… ▽ More

    Submitted 1 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: ICDAR2023

  46. arXiv:2304.09790  [pdf, other

    cs.CV

    AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

    Authors: Zhen Li, Zuo-Liang Zhu, Ling-Hao Han, Qibin Hou, Chun-Le Guo, Ming-Ming Cheng

    Abstract: We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for video frame interpolation. It is based on two essential designs. First, we build bidirectional correlation volumes for all pairs of pixels, and use the predicted bilateral flows to retrieve correlations for updating both flows and the interpolated content feature. Second, we derive multiple groups of fine-grained flo… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR2023

  47. arXiv:2304.03582  [pdf, other

    astro-ph.GA astro-ph.SR

    Reaction kinetics of CN + toluene and its implication on the productions of aromatic nitriles in the Taurus molecular cloud and Titan's atmosphere

    Authors: Mengqi Wu, Xiaoqing Wu, Qifeng Hou, Jiangbin Huang, Dongfeng Zhao, Feng Zhang

    Abstract: Reactions between cyano radical and aromatic hydrocarbons are believed to be important pathways for the formation of aromatic nitriles in the interstellar medium (ISM) including those identified in the Taurus molecular cloud (TMC-1). Aromatic nitriles might participate in the formation of polycyclic aromatic nitrogen containing hydrocarbons (PANHs) in Titan's atmosphere. Here, ab initio kinetics s… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  48. arXiv:2303.15649  [pdf, other

    cs.CV

    StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

    Authors: Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

    Abstract: A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images. They either finetune the model, or invert the image in the latent space of the pretrained model. However, they suffer from two problems: (1) Unsatisfying results for selected regions, and unexpected changes in nonselected regions. (2) They require careful text pro… ▽ More

    Submitted 20 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  49. arXiv:2303.09735  [pdf, other

    cs.CV

    SRFormer: Permuted Self-Attention for Single Image Super-Resolution

    Authors: Yupeng Zhou, Zhen Li, Chun-Le Guo, Song Bai, Ming-Ming Cheng, Qibin Hou

    Abstract: Previous works have shown that increasing the window size for Transformer-based image super-resolution models (e.g., SwinIR) can significantly improve the model performance but the computation overhead is also considerable. In this paper, we present SRFormer, a simple but novel method that can enjoy the benefit of large window self-attention but introduces even less computational burden. The core… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  50. arXiv:2303.09030  [pdf, other

    cs.CV

    Large Selective Kernel Network for Remote Sensing Object Detection

    Authors: Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang, Xiang Li

    Abstract: Recent research on remote sensing object detection has largely focused on improving the representation of oriented bounding boxes but has overlooked the unique prior knowledge presented in remote sensing scenarios. Such prior knowledge can be useful because tiny remote sensing objects may be mistakenly detected without referencing a sufficiently long-range context, and the long-range context requi… ▽ More

    Submitted 19 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Preprint, under review