Skip to main content

Showing 201–250 of 752 results for author: Hou, J

.
  1. arXiv:2211.14552  [pdf, other

    cs.CV

    Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images

    Authors: Junlin Hou, Jilan Xu, Fan Xiao, Rui-Wei Zhao, Yuejie Zhang, Haidong Zou, Lina Lu, Wenwen Xue, Rui Feng

    Abstract: Automatic diabetic retinopathy (DR) grading based on fundus photography has been widely explored to benefit the routine screening and early treatment. Existing researches generally focus on single-field fundus images, which have limited field of view for precise eye examinations. In clinical applications, ophthalmologists adopt two-field fundus photography as the dominating tool, where the informa… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: BIBM 2022

  2. arXiv:2211.12294  [pdf, other

    cs.CV cs.CR

    PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples

    Authors: Shengshan Hu, Junwei Zhang, Wei Liu, Junhui Hou, Minghui Li, Leo Yu Zhang, Hai **, Lichao Sun

    Abstract: Point cloud completion, as the upstream procedure of 3D recognition and segmentation, has become an essential part of many tasks such as navigation and scene understanding. While various point cloud completion models have demonstrated their powerful capabilities, their robustness against adversarial attacks, which have been proven to be fatally malicious towards deep neural networks, remains unkno… ▽ More

    Submitted 1 December, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted by the 37th AAAI Conference on Artificial Intelligence (AAAI-23)

  3. arXiv:2211.10829  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Depositing boron on Cu(111): Borophene or boride?

    Authors: Xiao-Ji Weng, Jie Bai, **gyu Hou, Yi Zhu, Li Wang, Penghui Li, Anmin Nie, Bo Xu, Xiang-Feng Zhou, Yongjun Tian

    Abstract: Large-area single-crystal surface structures were successfully prepared on Cu(111) substrate with boron deposition, which is critical for prospective applications. However, the proposed borophene structures do not match the scanning tunneling microscopy (STM) results very well, while the proposed copper boride is at odds with the traditional knowledge that ordered copper-rich borides normally do n… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 15 pages, 4 figures

  4. arXiv:2211.10627  [pdf, other

    cs.LG cs.AI cs.MM

    EGRC-Net: Embedding-induced Graph Refinement Clustering Network

    Authors: Zhihao Peng, Hui Liu, Yuheng Jia, Junhui Hou

    Abstract: Existing graph clustering networks heavily rely on a predefined yet fixed graph, which can lead to failures when the initial graph fails to accurately capture the data topology structure of the embedding space. In order to address this issue, we propose a novel clustering network called Embedding-Induced Graph Refinement Clustering Network (EGRC-Net), which effectively utilizes the learned embeddi… ▽ More

    Submitted 14 November, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: This paper has been accepted by IEEE Transactions on Image Processing

  5. arXiv:2211.10294  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Pressure-induced superconductivity in PdTeI with quasi-one-dimensional PdTe chains

    Authors: Yi Zhao, Jun Hou, Yang Fu, Cuiying Pei, Jian** Sun, Qi Wang, Lingling Gao, Weizheng Cao, Changhua Li, Shihao Zhu, Mingxin Zhang, Yulin Chen, Hechang Lei, **guang Cheng, Yanpeng Qi

    Abstract: The quasi-one-dimensional material PdTeI exhibits unusual electronic transport properties at ambient pressure. Here, we systematically investigate both the structural and electronic responses of PdTeI to external pressure, through a combination of electrical transport, synchrotron x-ray diffraction (XRD), and Raman spectroscopy measurements. The charge density wave (CDW) order in PdTeI is fragile… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 18 pages, 6 figures

  6. arXiv:2211.10253  [pdf, other

    cs.CV

    Delving into Transformer for Incremental Semantic Segmentation

    Authors: Zekai Xu, Mingyi Zhang, Jiayue Hou, Xing Gong, Chuan Wen, Chengjie Wang, Junge Zhang

    Abstract: Incremental semantic segmentation(ISS) is an emerging task where old model is updated by incrementally adding new classes. At present, methods based on convolutional neural networks are dominant in ISS. However, studies have shown that such methods have difficulty in learning new tasks while maintaining good performance on old ones (catastrophic forgetting). In contrast, a Transformer based method… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  7. arXiv:2211.07913  [pdf, ps, other

    math.CO

    Extremal graphs for the suspension of edge-critical graphs

    Authors: Jianfeng Hou, Heng Li, Qinghou Zeng

    Abstract: The Turán number of a graph $H$, $\text{ex}(n,H)$, is the maximum number of edges in an $n$-vertex graph that does not contain $H$ as a subgraph. For a vertex $v$ and a multi-set $\mathcal{F}$ of graphs, the suspension $\mathcal{F}+v$ of $\mathcal{F}$ is the graph obtained by connecting the vertex $v$ to all vertices of $F$ for each $F\in \mathcal{F}$. For two integers $k\ge1$ and $r\ge2$, let… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  8. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, **gang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, **woo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  9. arXiv:2211.04894  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

    Authors: Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, **gwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, measuring the perception of distortions; and the aesthetic perspective, which relates to preference and recommendation on conte… ▽ More

    Submitted 7 March, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  10. arXiv:2211.04098  [pdf, other

    eess.SY cs.SC

    Abstraction-Based Verification of Approximate Pre-Opacity for Control Systems

    Authors: Junyao Hou, Siyuan Liu, Xiang Yin, Majid Zamani

    Abstract: In this paper, we consider the problem of verifying pre-opacity for discrete-time control systems. Pre-opacity is an important information-flow security property that secures the intention of a system to execute some secret behaviors in the future. Existing works on pre-opacity only consider non-metric discrete systems, where it is assumed that intruders can distinguish different output behaviors… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Discrete Event Systems, Opacity, Formal Abstractions

  11. arXiv:2211.02838  [pdf, ps, other

    math.CO

    Two stability theorems for $\mathcal{K}_{\ell + 1}^{r}$-saturated hypergraphs

    Authors: Jianfeng Hou, Heng Li, Caihong Yang, Qinghou Zeng, Yixiao Zhang

    Abstract: An $\mathcal{F}$-saturated $r$-graph is a maximal $r$-graph not containing any member of $\mathcal{F}$ as a subgraph. Let $\mathcal{K}_{\ell + 1}^{r}$ be the collection of all $r$-graphs $F$ with at most $\binom{\ell+1}{2}$ edges such that for some $\left(\ell+1\right)$-set $S$ every pair $\{u, v\} \subset S$ is covered by an edge in $F$. Our first result shows that for each $\ell \geq r \geq 2$ e… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

  12. arXiv:2211.02419  [pdf, other

    eess.IV cs.CV cs.LG

    High-Resolution Boundary Detection for Medical Image Segmentation with Piece-Wise Two-Sample T-Test Augmented Loss

    Authors: Yucong Lin, **hua Su, Yuhang Li, Yuhao Wei, Hanchao Yan, Saining Zhang, Jiaan Luo, Danni Ai, Hong Song, **gfan Fan, Tianyu Fu, Deqiang Xiao, Feifei Wang, Jue Hou, Jian Yang

    Abstract: Deep learning methods have contributed substantially to the rapid advancement of medical image segmentation, the quality of which relies on the suitable design of loss functions. Popular loss functions, including the cross-entropy and dice losses, often fall short of boundary detection, thereby limiting high-resolution downstream applications such as automated diagnoses and procedures. We develope… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  13. arXiv:2211.01601  [pdf

    math.OC eess.SY

    A Fast Solution Method for Large-scale Unit Commitment Based on Lagrangian Relaxation and Dynamic Programming

    Authors: Jiangwei Hou, Qiaozhu Zhai, Yuzhou Zhou, Xiaohong Guan

    Abstract: The unit commitment problem (UC) is crucial for the operation and market mechanism of power systems. With the development of modern electricity, the scale of power systems is expanding, and solving the UC problem is also becoming more and more difficult. To this end, this paper proposes a new fast solution method based on Lagrangian relaxation and dynamic program-ming. Firstly, the UC solution is… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 10 pages, journal paper, transactions

  14. arXiv:2211.00723  [pdf, other

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: A Forward Modeling Approach To Analyzing Galaxy Clustering

    Authors: ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Bruno Régaldo-Saint Blancard, Muntazir M. Abidi

    Abstract: We present the first-ever cosmological constraints from a simulation-based inference (SBI) analysis of galaxy clustering from the new ${\rm S{\scriptsize IM}BIG}$ forward modeling framework. ${\rm S{\scriptsize IM}BIG}$ leverages the predictive power of high-fidelity simulations and provides an inference framework that can extract cosmological information on small non-linear scales, inaccessible w… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 9 pages, 5 figures

  15. ${\rm S{\scriptsize IM}BIG}$: Mock Challenge for a Forward Modeling Approach to Galaxy Clustering

    Authors: ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Bruno Régaldo-Saint Blancard, Muntazir M. Abidi

    Abstract: Simulation-Based Inference of Galaxies (${\rm S{\scriptsize IM}BIG}$) is a forward modeling framework for analyzing galaxy clustering using simulation-based inference. In this work, we present the ${\rm S{\scriptsize IM}BIG}$ forward model, which is designed to match the observed SDSS-III BOSS CMASS galaxy sample. The forward model is based on high-resolution ${\rm Q{\scriptsize UIJOTE}}$ $N$-body… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 28 pages, 6 figures

  16. arXiv:2210.17456  [pdf, other

    eess.AS cs.SD

    Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings

    Authors: I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou

    Abstract: AV-HuBERT, a multi-modal self-supervised learning model, has been shown to be effective for categorical problems such as automatic speech recognition and lip-reading. This suggests that useful audio-visual speech representations can be obtained via utilizing multi-modal self-supervised embeddings. Nevertheless, it is unclear if such representations can be generalized to solve real-world multi-moda… ▽ More

    Submitted 31 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: ICASSP AMHAT 2023

  17. arXiv:2210.16743  [pdf, other

    eess.AS cs.SD

    WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit

    Authors: Jie Wang, Menglong Xu, **gyong Hou, Binbin Zhang, Xiao-Lei Zhang, Lei Xie, Fu** Pan

    Abstract: Keyword spotting (KWS) enables speech-based user interaction and gradually becomes an indispensable component of smart devices. Recently, end-to-end (E2E) methods have become the most popular approach for on-device KWS tasks. However, there is still a gap between the research and deployment of E2E KWS methods. In this paper, we introduce WeKws, a production-quality, easy-to-build, and convenient-t… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  18. Cosmological Information in Skew Spectra of Biased Tracers in Redshift Space

    Authors: Jiamin Hou, Azadeh Moradinezhad Dizgah, ChangHoon Hahn, Elena Massara

    Abstract: Extracting the non-Gaussian information encoded in the higher-order clustering statistics of the large-scale structure is key to fully realizing the potential of upcoming galaxy surveys. We investigate the information content of the redshift-space {\it weighted skew spectra} of biased tracers as efficient estimators for 3-point clustering statistics. The skew spectra are constructed by correlating… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 43 pages, 25 figures

  19. Effects of spatial dimensionality and band tilting on the longitudinal optical conductivities in Dirac bands

    Authors: Jian-Tong Hou, Chang-Xu Yan, Chao-Yang Tan, Zhi-Qiang Li, Peng Wang, Hong Guo, Hao-Ran Chang

    Abstract: We report a unified theory based on linear response, for analyzing the longitudinal optical conductivity (LOC) of materials with tilted Dirac cones. Depending on the tilt parameter $t$, the Dirac electrons have four phases: untilted, type-I, type-II, and type-III; the Dirac dispersion can be isotropic or anisotropic; the spatial dimension of the material can be one-, two-, or three-dimensions (1D,… ▽ More

    Submitted 24 November, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 22 pages, 7 figures

    Journal ref: Phys. Rev. B 108, 035407 (2023)

  20. arXiv:2210.10402  [pdf

    astro-ph.SR astro-ph.IM physics.space-ph

    Solar Ring Mission: Building a Panorama of the Sun and Inner-heliosphere

    Authors: Yuming Wang, Xianyong Bai, Changyong Chen, Linjie Chen, Xin Cheng, Lei Deng, Linhua Deng, Yuanyong Deng, Li Feng, Tingyu Gou, **gnan Guo, Yang Guo, Xinjun Hao, Jiansen He, Junfeng Hou, Huang Jiangjiang, Zhenghua Huang, Haisheng Ji, Chaowei Jiang, Jie Jiang, Chunlan **, Xiaolei Li, Yiren Li, Jiajia Liu, Kai Liu , et al. (29 additional authors not shown)

    Abstract: Solar Ring (SOR) is a proposed space science mission to monitor and study the Sun and inner heliosphere from a full 360° perspective in the ecliptic plane. It will deploy three 120°-separated spacecraft on the 1-AU orbit. The first spacecraft, S1, locates 30° upstream of the Earth, the second, S2, 90° downstream, and the third, S3, completes the configuration. This design with necessary science in… ▽ More

    Submitted 23 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 41 pages, 6 figures, 1 table, to be published in Advances in Space Research

  21. arXiv:2210.10206  [pdf, other

    astro-ph.IM astro-ph.CO

    SARABANDE: 3/4 Point Correlation Functions with Fast Fourier Transforms

    Authors: James Sunseri, Zachary Slepian, Stephen Portillo, Jiamin Hou, Sule Kahraman, Douglas P. Finkbeiner

    Abstract: We present a new $\texttt{python}$ package SARABANDE for measuring 3 & 4 Point Correlation Functions (3/4 PCFs) in $\mathcal{O}(N_{\rm g} \log N_{\rm g})$ time using Fast Fourier Transforms (FFTs), with $N_{\rm g}$ the number of grid points used for the FFT. SARABANDE can measure both projected and full 3 and 4 PCFs on gridded 2D and 3D datasets. The general technique is to generate suitable angul… ▽ More

    Submitted 25 October, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 16 Pages, 8 Figures, 8 Algorithms, 1 code package

  22. arXiv:2210.07749   

    eess.AS cs.SD

    LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

    Authors: Yan Jia, Mi Hong, **gyu Hou, Kailong Ren, Sifan Ma, ** Wang, Fangzhen Peng, Yinglin Ji, Lin Yang, Junjie Wang

    Abstract: This paper describes LeVoice automatic speech recognition systems to track2 of intelligent cockpit speech recognition challenge 2022. Track2 is a speech recognition task without limits on the scope of model size. Our main points include deep learning based speech enhancement, text-to-speech based speech generation, training data augmentation via various techniques and speech recognition model fusi… ▽ More

    Submitted 16 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: There are experimental errors

  23. Beyond $Λ$CDM constraints from the full shape clustering measurements from BOSS and eBOSS

    Authors: Agne Semenaite, Ariel G. Sánchez, Andrea Pezzotta, Jiamin Hou, Alexander Eggemeier, Martin Crocce, Cheng Zhao, Joel R. Brownstein, Graziano Rossi, Donald P. Schneider

    Abstract: We analyse the full shape of anisotropic clustering measurements from the extended Baryon Oscillation Spectroscopic survey (eBOSS) quasar sample together with the combined galaxy sample from the Baryon Oscillation Spectroscopic Survey (BOSS). We obtain constraints on the cosmological parameters independent of the Hubble parameter $h$ for the extensions of the $Λ$CDM models, focusing on cosmologies… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures, submitted to MNRAS

  24. arXiv:2210.05357  [pdf, other

    cs.CV cs.AI cs.MM

    Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment

    Authors: Haoning Wu, Chaofeng Chen, Liang Liao, **gwen Hou, Wenxiu Sun, Qiong Yan, **wei Gu, Weisi Lin

    Abstract: The increased resolution of real-world videos presents a dilemma between efficiency and accuracy for deep Video Quality Assessment (VQA). On the one hand, kee** the original resolution will lead to unacceptable computational costs. On the other hand, existing practices, such as resizing and crop**, will change the quality of original videos due to the loss of details and contents, and are ther… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  25. Multi-messenger characterization of Mrk 501 during historically low X-ray and $γ$-ray activity

    Authors: MAGIC collaboration, H. Abe, S. Abe, V. A. Acciari, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, M. Bernardos, A. Berti, J. Besenrieder , et al. (300 additional authors not shown)

    Abstract: We study the broadband emission of Mrk 501 using multi-wavelength observations from 2017 to 2020 performed with a multitude of instruments, involving, among others, MAGIC, Fermi-LAT, NuSTAR, Swift, GASP-WEBT, and OVRO. Mrk 501 showed an extremely low broadband activity, which may help to unravel its baseline emission. Nonetheless, significant flux variations are detected at all wavebands, with the… ▽ More

    Submitted 5 March, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 55 pages, 30 figures, 14 tables, accepted by APJS. Corresponding authors are L. Heckmann, D. Paneque, S. Gasparyan, M. Cerruti, and N. Sahakyan

    Journal ref: ApJS 266 37 (2023)

  26. arXiv:2210.02030  [pdf, other

    cs.CV

    Point Cloud Recognition with Position-to-Structure Attention Transformers

    Authors: Zheng Ding, James Hou, Zhuowen Tu

    Abstract: In this paper, we present Position-to-Structure Attention Transformers (PS-Former), a Transformer-based algorithm for 3D point cloud recognition. PS-Former deals with the challenge in 3D point cloud representation where points are not positioned in a fixed grid structure and have limited feature description (only 3D coordinates ($x, y, z$) for scattered points). Existing Transformer-based architec… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  27. arXiv:2210.00515  [pdf, other

    eess.IV cs.CV

    Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images

    Authors: Junlin Hou, Fan Xiao, Jilan Xu, Yuejie Zhang, Haidong Zou, Rui Feng

    Abstract: The ultra-wide optical coherence tomography angiography (OCTA) has become an important imaging modality in diabetic retinopathy (DR) diagnosis. However, there are few researches focusing on automatic DR analysis using ultra-wide OCTA. In this paper, we present novel and practical deep-learning solutions based on ultra-wide OCTA for the Diabetic Retinopathy Analysis Challenge (DRAC). In the segment… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  28. arXiv:2209.15241  [pdf, other

    cond-mat.supr-con physics.comp-ph

    Helium-bearing superconductor at high pressure

    Authors: **gyu Hou, Xiao Dong, Artem R. Oganov, Xiao-Ji Weng, Chun-Mei Hao, Guochun Yang, Hui-Tian Wang, Xiang-Feng Zhou, Yongjun Tian

    Abstract: Helium (He) is the most inert noble gas at ambient conditions. It adopts a hexagonal close packed structure (P63/mmc) and remains in the insulating phase up to 32 TPa. In contrast, lithium (Li) is one of the most reactive metals at zero pressure, while its cubic high-pressure phase (Fd-3m) is a weak metallic electride above 475 GPa. Strikingly, a stable compound of Li5He2 (R-3m) was formed by mixi… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 5 pages, 3 figures

  29. arXiv:2209.13252  [pdf, other

    cs.CV

    RIGA: Rotation-Invariant and Globally-Aware Descriptors for Point Cloud Registration

    Authors: Hao Yu, Ji Hou, Zheng Qin, Mahdi Saleh, Ivan Shugurov, Kai Wang, Benjamin Busam, Slobodan Ilic

    Abstract: Successful point cloud registration relies on accurate correspondences established upon powerful descriptors. However, existing neural descriptors either leverage a rotation-variant backbone whose performance declines under large rotations, or encode local geometry that is less distinctive. To address this issue, we introduce RIGA to learn descriptors that are Rotation-Invariant by design and Glob… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  30. arXiv:2209.08515  [pdf, other

    astro-ph.GA

    The Chocolate Chip Cookie Model: Dust Geometry of Milky-Way like Disk Galaxies

    Authors: Jiafeng Lu, Shiyin Shen, Fang-Ting Yuan, Zhengyi Shao, **liang Hou, Xianzhong Zheng

    Abstract: We present a new two-component dust geometry model, the \textit{Chocolate Chip Cookie} model, where the clumpy nebular regions are embedded in a diffuse stellar/ISM disk, like chocolate chips in a cookie. By approximating the binomial distribution of the clumpy nebular regions with a continuous Gaussian distribution and omitting the dust scattering effect, our model solves the dust attenuation pro… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 27 pages, 11 figures, 1 table

  31. arXiv:2209.05013  [pdf, other

    cs.CV

    Learning A Locally Unified 3D Point Cloud for View Synthesis

    Authors: Meng You, Mantang Guo, Xianqiang Lyu, Hui Liu, Junhui Hou

    Abstract: In this paper, we explore the problem of 3D point cloud representation-based view synthesis from a set of sparse source views. To tackle this challenging problem, we propose a new deep learning-based view synthesis paradigm that learns a locally unified 3D point cloud from source views. Specifically, we first construct sub-point clouds by projecting source views to 3D space based on their depth ma… ▽ More

    Submitted 30 September, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted to TIP

  32. arXiv:2208.12419  [pdf, other

    cs.CV

    Arbitrary Shape Text Detection via Segmentation with Probability Maps

    Authors: Shi-Xue Zhang, Xiaobin Zhu, Lei Chen, Jie-Bo Hou, Xu-Cheng Yin

    Abstract: Arbitrary shape text detection is a challenging task due to the significantly varied sizes and aspect ratios, arbitrary orientations or shapes, inaccurate annotations, etc. Due to the scalability of pixel-level prediction, segmentation-based methods can adapt to various shape texts and hence attracted considerable attention recently. However, accurate pixel-level annotations of texts are formidabl… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted by TPAMI 2022. arXiv admin note: text overlap with arXiv:1812.01393 by other authors

  33. arXiv:2208.07137  [pdf, other

    cs.CV

    An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

    Authors: Xinzhu Ma, Yuan Meng, Yinmin Zhang, Lei Bai, Jun Hou, Shuai Yi, Wanli Ouyang

    Abstract: Image-based 3D detection is an indispensable component of the perception system for autonomous driving. However, it still suffers from the unsatisfying performance, one of the main reasons for which is the limited training data. Unfortunately, annotating the objects in the 3D space is extremely time/resource-consuming, which makes it hard to extend the training set arbitrarily. In this work, we fo… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: tech report

  34. $T\bar{T}$ flow as characteristic flows

    Authors: Jue Hou

    Abstract: We show that method of characteristics provides a powerful new point of view on $T\bar{T}$-and related deformations. Previously, the method of characteristics has been applied to $T\bar{T}$-deformation mainly to solve Burgers' equation, which governs the deformation of the \emph{quantum} spectrum. In the current work, we study \emph{classical} deformed quantities using this method and show that… ▽ More

    Submitted 26 January, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: 38 pages, 2 figures, references updated

  35. arXiv:2208.04053  [pdf, other

    math.OC

    Distributed Momentum-based Frank-Wolfe Algorithm for Stochastic Optimization

    Authors: Jie Hou, Xianlin Zeng, Gang Wang, Jian Sun, Jie Chen

    Abstract: This paper considers distributed stochastic optimization, in which a number of agents cooperate to optimize a global objective function through local computations and information exchanges with neighbors over a network. Stochastic optimization problems are usually tackled by variants of projected stochastic gradient descent. However, projecting a point onto a feasible set is often expensive. The F… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: 15 pages, 11 figures, 2 tables

  36. arXiv:2208.03054  [pdf, other

    cs.CL cs.AI cs.LG

    Global Pointer: Novel Efficient Span-based Approach for Named Entity Recognition

    Authors: Jianlin Su, Ahmed Murtadha, Shengfeng Pan, **g Hou, Jun Sun, Wanwei Huang, Bo Wen, Yunfeng Liu

    Abstract: Named entity recognition (NER) task aims at identifying entities from a piece of text that belong to predefined semantic types such as person, location, organization, etc. The state-of-the-art solutions for flat entities NER commonly suffer from capturing the fine-grained semantic information in underlying texts. The existing span-based approaches overcome this limitation, but the computation time… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  37. arXiv:2208.00794  [pdf, ps, other

    math.CO

    Generating non-jumps from a known one

    Authors: Jianfeng Hou, Heng Li, Caihong Yang, Yixiao Zhang

    Abstract: Let $r\ge 2$ be an integer. The real number $α\in [0,1]$ is a jump for $r$ if there exists a constant $c > 0$ such that for any $ε>0$ and any integer $m \geq r$, there exists an integer $n_0(ε, m)$ satisfying any $r$-uniform graph with $n\ge n_0(ε, m)$ vertices and density at least $α+ε$ contains a subgraph with $m$ vertices and density at least $α+c$. A result of Erdős, Stone and Simonovits impli… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  38. CorrI2P: Deep Image-to-Point Cloud Registration via Dense Correspondence

    Authors: Siyu Ren, Yiming Zeng, Junhui Hou, Xiaodong Chen

    Abstract: Motivated by the intuition that the critical step of localizing a 2D image in the corresponding 3D point cloud is establishing 2D-3D correspondence between them, we propose the first feature-based dense correspondence framework for addressing the image-to-point cloud registration problem, dubbed CorrI2P, which consists of three modules, i.e., feature embedding, symmetric overlap** region detecti… ▽ More

    Submitted 20 September, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE TCSVT

  39. arXiv:2207.04266  [pdf, other

    eess.IV cs.CV

    Rank-Enhanced Low-Dimensional Convolution Set for Hyperspectral Image Denoising

    Authors: **hui Hou, Zhiyu Zhu, Hui Liu, Junhui Hou

    Abstract: This paper tackles the challenging problem of hyperspectral (HS) image denoising. Unlike existing deep learning-based methods usually adopting complicated network architectures or empirically stacking off-the-shelf modules to pursue performance improvement, we focus on the efficient and effective feature extraction manner for capturing the high-dimensional characteristics of HS images. To be speci… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

    Comments: 10 pages, 8 figures

  40. arXiv:2207.03128  [pdf, other

    cs.CV

    PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition

    Authors: Qijian Zhang, Junhui Hou, Yue Qian

    Abstract: As two fundamental representation modalities of 3D objects, 3D point clouds and multi-view 2D images record shape information from different domains of geometric structures and visual appearances. In the current deep learning era, remarkable progress in processing such two data modalities has been achieved through respectively customizing compatible 3D and 2D network architectures. However, unlike… ▽ More

    Submitted 15 June, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to TMM

  41. arXiv:2207.03105  [pdf

    q-bio.TO cs.CV eess.IV physics.med-ph

    Uncertainty-Aware Self-supervised Neural Network for Liver $T_{1ρ}$ Map** with Relaxation Constraint

    Authors: Chaoxing Huang, Yurui Qian, Simon Chun Ho Yu, Jian Hou, Baiyan Jiang, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: $T_{1ρ}$ map** is a promising quantitative MRI technique for the non-invasive assessment of tissue properties. Learning-based approaches can map $T_{1ρ}$ from a reduced number of $T_{1ρ}$ weighted images, but requires significant amounts of high quality training data. Moreover, existing methods do not provide the confidence level of the $T_{1ρ}… ▽ More

    Submitted 25 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Provisionally accepted by Physics in Medicine and Biology

  42. arXiv:2207.02595  [pdf, other

    cs.CV cs.MM

    FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

    Authors: Haoning Wu, Chaofeng Chen, **gwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: Current deep video quality assessment (VQA) methods are usually with high computational costs when evaluating high-resolution videos. This cost hinders them from learning better video-quality-related representations via end-to-end training. Existing approaches typically consider naive sampling to reduce the computational cost, such as resizing and crop**. However, they obviously corrupt quality-… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Will appear on ECCV 2022. 14 Pages

    Journal ref: Proceedings of the European Conference on Computer Vision (ECCV) 2022

  43. arXiv:2207.02466  [pdf, other

    cs.CV

    GLENet: Boosting 3D Object Detectors with Generative Label Uncertainty Estimation

    Authors: Yifan Zhang, Qijian Zhang, Zhiyu Zhu, Junhui Hou, Yixuan Yuan

    Abstract: The inherent ambiguity in ground-truth annotations of 3D bounding boxes, caused by occlusions, signal missing, or manual annotation errors, can confuse deep 3D object detectors during training, thus deteriorating detection accuracy. However, existing methods overlook such issues to some extent and treat the labels as deterministic. In this paper, we formulate the label uncertainty problem as the d… ▽ More

    Submitted 2 June, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

  44. arXiv:2207.01909  [pdf, other

    cs.CV cs.LG eess.IV

    StyleFlow For Content-Fixed Image to Image Translation

    Authors: Weichen Fan, **ghuan Chen, Jiabin Ma, Jun Hou, Shuai Yi

    Abstract: Image-to-image (I2I) translation is a challenging topic in computer vision. We divide this problem into three tasks: strongly constrained translation, normally constrained translation, and weakly constrained translation. The constraint here indicates the extent to which the content or semantic information in the original image is preserved. Although previous approaches have achieved good performan… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  45. arXiv:2207.01758  [pdf, other

    eess.IV cs.CV

    FDVTS's Solution for 2nd COV19D Competition on COVID-19 Detection and Severity Analysis

    Authors: Junlin Hou, Jilan Xu, Rui Feng, Yuejie Zhang

    Abstract: This paper presents our solution for the 2nd COVID-19 Competition, occurring in the framework of the AIMIA Workshop in the European Conference on Computer Vision (ECCV 2022). In our approach, we employ an effective 3D Contrastive Mixup Classification network for COVID-19 diagnosis on chest CT images, which is composed of contrastive representation learning and mixup classification. For the COVID-1… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  46. arXiv:2206.10157  [pdf, other

    cs.CV

    Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning

    Authors: Shuaicheng Li, Feng Zhang, Kunlin Yang, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi

    Abstract: Video highlight detection is a crucial yet challenging problem that aims to identify the interesting moments in untrimmed videos. The key to this task lies in effective video representations that jointly pursue two goals, \textit{i.e.}, cross-modal representation learning and fine-grained feature discrimination. In this paper, these two challenges are tackled by not only enriching intra-modality a… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  47. arXiv:2206.10095  [pdf, other

    cs.CV

    Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation

    Authors: Shuaicheng Li, Feng Zhang, Rui-Wei Zhao, Rui Feng, Kunlin Yang, Lingbo Liu, Jun Hou

    Abstract: It has been found that temporal action proposal generation, which aims to discover the temporal action instances within the range of the start and end frames in the untrimmed videos, can largely benefit from proper temporal and semantic context exploitation. The latest efforts were dedicated to considering the temporal context and similarity-based semantic contexts through self-attention modules.… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  48. arXiv:2206.09853  [pdf, other

    cs.CV cs.MM

    DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

    Authors: Haoning Wu, Chaofeng Chen, Liang Liao, **gwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The temporal relationships between frames and their influences on video quality assessment (VQA) are still under-studied in existing works. These relationships lead to two important types of effects for video quality. Firstly, some temporal variations (such as shaking, flicker, and abrupt scene transitions) are causing temporal distortions and lead to extra quality degradations, while other variat… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  49. arXiv:2206.06067  [pdf, other

    cs.CV

    Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation

    Authors: Zengyu Qiu, Xinzhu Ma, Kunlin Yang, Chunya Liu, Jun Hou, Shuai Yi, Wanli Ouyang

    Abstract: Knowledge distillation (KD) has shown very promising capabilities in transferring learning representations from large models (teachers) to small models (students). However, as the capacity gap between students and teachers becomes larger, existing KD methods fail to achieve better results. Our work shows that the `prior knowledge' is vital to KD, especially when applying large teachers. Particular… ▽ More

    Submitted 23 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: ICLR'23 accepted

  50. arXiv:2206.04904  [pdf, other

    astro-ph.GA astro-ph.SR

    New insights into the structure of open clusters in the Gaia era

    Authors: **g Zhong, Li Chen, Yueyue Jiang, Songmei Qin, **liang Hou

    Abstract: With the help of Gaia data, it is noted that in addition to the core components, there are low-density outer halo components in the extended region of open clusters. To study the extended structure beyond the core radius of the cluster ($\sim$ 10 pc), based on Gaia EDR3 data, taking up to 50 pc as the searching radius, we use the pyUPMASK algorithm to re-determine the member stars of the open clus… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 17 pages, 6 figures. Accepted for publication in AJ