Skip to main content

Showing 201–250 of 610 results for author: Ma, K

.
  1. arXiv:2210.12338  [pdf, other

    cs.CL

    Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge

    Authors: Kaixin Ma, Hao Cheng, Xiaodong Liu, Eric Nyberg, Jianfeng Gao

    Abstract: We propose a novel open-domain question answering (ODQA) framework for answering single/multi-hop questions across heterogeneous knowledge sources. The key novelty of our method is the introduction of the intermediary modules into the current retriever-reader pipeline. Unlike previous methods that solely rely on the retriever for gathering all evidence in isolation, our intermediary performs a cha… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  2. arXiv:2210.03904  [pdf, other

    cs.CV eess.IV

    LW-ISP: A Lightweight Model with ISP and Deep Learning

    Authors: Hongyang Chen, Kaisheng Ma

    Abstract: The deep learning (DL)-based methods of low-level tasks have many advantages over the traditional camera in terms of hardware prospects, error accumulation and imaging effects. Recently, the application of deep learning to replace the image signal processing (ISP) pipeline has appeared one after another; however, there is still a long way to go towards real landing. In this paper, we show the poss… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 16 PAGES, ACCEPTED AS A CONFERENCE PAPER AT: BMVC 2022

  3. arXiv:2210.03659  [pdf, other

    cs.CV cs.AI

    Spatio-temporal Tendency Reasoning for Human Body Pose and Shape Estimation from Videos

    Authors: Boyang Zhang, Su** Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin

    Abstract: In this paper, we present a spatio-temporal tendency reasoning (STR) network for recovering human body pose and shape from videos. Previous approaches have focused on how to extend 3D human datasets and temporal-based learning to promote accuracy and temporal smoothing. Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal… ▽ More

    Submitted 9 October, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted by BMVC2022

  4. arXiv:2210.02257  [pdf, other

    cs.CR cs.CV cs.MM

    Hiding Images in Deep Probabilistic Models

    Authors: Haoyu Chen, Linqi Song, Zhenxing Qian, Xinpeng Zhang, Kede Ma

    Abstract: Data hiding with deep neural networks (DNNs) has experienced impressive successes in recent years. A prevailing scheme is to train an autoencoder, consisting of an encoding network to embed (or transform) secret messages in (or into) a carrier, and a decoding network to extract the hidden messages. This scheme may suffer from several limitations regarding practicability, security, and embedding ca… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  5. arXiv:2210.02245  [pdf, other

    eess.SP eess.IV

    Channel Modeling for UAV-to-Ground Communications with Posture Variation and Fuselage Scattering Effect

    Authors: Boyu Hua, Haoran Ni, Qiuming Zhu, Cheng-Xiang Wang, Tongtong Zhou, Kai Mao, Junwei Bao, Xiaofei Zhang

    Abstract: Unmanned aerial vehicle (UAV)-to-ground (U2G) channel models play a pivotal role for reliable communications between UAV and ground terminal. This paper proposes a three-dimensional (3D) non-stationary hybrid model including both large-scale and small-scale fading for U2G multiple-input-multiple-output (MIMO) channels. Distinctive channel characteristics under U2G scenarios, i.e., 3D trajectory an… ▽ More

    Submitted 13 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  6. arXiv:2210.00933  [pdf, other

    cs.CV eess.IV

    Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

    Authors: Weixia Zhang, Dingquan Li, Xiongkuo Min, Guangtao Zhai, Guodong Guo, Xiaokang Yang, Kede Ma

    Abstract: No-reference image quality assessment (NR-IQA) aims to quantify how humans perceive visual distortions of digital images without access to their undistorted references. NR-IQA models are extensively studied in computational vision, and are widely used for performance evaluation and perceptual optimization of man-made vision systems. Here we make one of the first attempts to examine the perceptual… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  7. arXiv:2209.11119  [pdf, ps, other

    cond-mat.mes-hall cond-mat.str-el quant-ph

    Anyon condensation, topological quantum information scrambling, and Andreev-like reflection of non-Abelian anyons in quantum Hall interfaces

    Authors: Ken K. W. Ma

    Abstract: Quantum information scrambling is the spread of local information into correlation throughout the entire quantum many-body system. This concept has become a central topic in different contexts. In this work, we restate the connection between anyon condensation and topological quantum information scrambling in quantum Hall interfaces. We consider the interface between the Abelian Halperin-330 state… ▽ More

    Submitted 10 October, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: References on the possible Andreev-like reflection of electrons in interacting one-dimensional wires are added

  8. arXiv:2209.10819  [pdf, other

    astro-ph.GA astro-ph.HE

    Structure in the Magnetic Field of the Milky Way Disk and Halo traced by Faraday Rotation

    Authors: John M. Dickey, Jennifer West, Alec J. M. Thomson, T. L. Landecker, A. Bracco, E. Carretti, J. L. Han, A. S. Hill, Y. K. Ma, S. A. Mao, A. Ordog, Jo-Anne C. Brown, K. A. Douglas, A. Erceg, V. Jelic, R. Kothes, M. Wolleben

    Abstract: Magnetic fields in the ionized medium of the disk and halo of the Milky Way impose Faraday rotation on linearly polarized radio emission. We compare two surveys map** the Galactic Faraday rotation, one showing the rotation measures of extragalactic sources seen through the Galaxy (from Hutschenreuter et al 2022), and one showing the Faraday depth of the diffuse Galactic synchrotron emission from… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 37 pages, 26 figures, Ap. J. accepted

  9. arXiv:2209.09965  [pdf, other

    cs.GR cs.LG

    FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks

    Authors: David Bauer, Qi Wu, Kwan-Liu Ma

    Abstract: Volume data is found in many important scientific and engineering applications. Rendering this data for visualization at high quality and interactive rates for demanding applications such as virtual reality is still not easily achievable even using professional-grade hardware. We introduce FoVolNet -- a method to significantly increase the performance of volume data visualization. We develop a cos… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: To appear at IEEE VIS 2022 and later TVCG

  10. Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation

    Authors: Jiawei Liang, Siyuan Liang, Aishan Liu, Ke Ma, **gzhi Li, Xiaochun Cao

    Abstract: Knowledge Distillation (KD) for object detection aims to train a compact detector by transferring knowledge from a teacher model. Since the teacher model perceives data in a way different from humans, existing KD methods only distill knowledge that is consistent with labels annotated by human expert while neglecting knowledge that is not consistent with human perception, which results in insuffici… ▽ More

    Submitted 21 February, 2024; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: ACMMM 2023 Oral

  11. arXiv:2209.08800  [pdf, ps, other

    eess.SP

    A Realistic 3D Non-Stationary Channel Model for UAV-to-Vehicle Communications Incorporating Fuselage Posture

    Authors: Boyu Hua, Tongtong Zhou, Qiuming Zhu, Kai Mao, Junwei Bao, Weizhi Zhong, Naeem Ahmed

    Abstract: Considering the unmanned aerial vehicle (UAV) three-dimensional (3D) posture, a novel 3D non-stationary geometry-based stochastic model (GBSM) is proposed for multiple-input multiple-output (MIMO) UAV-to-vehicle (U2V) channels. It consists of a line-of-sight (LoS) and non-line-of-sight (NLoS) components. The factor of fuselage posture is considered by introducing a time-variant 3D posture matrix.… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 12 pages, 8 figures, CNCOM

  12. arXiv:2209.05742  [pdf, other

    cs.LG cs.CR cs.GT stat.ML

    A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation Is the Fixed Point of Adversarial Game

    Authors: Ke Ma, Qianqian Xu, **shan Zeng, Guorong Li, Xiaochun Cao, Qingming Huang

    Abstract: Rank aggregation with pairwise comparisons has shown promising results in elections, sports competitions, recommendations, and information retrieval. However, little attention has been paid to the security issue of such algorithms, in contrast to numerous research work on the computational and statistical characteristics. Driven by huge profits, the potential adversary has strong motivation and in… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 33 pages, https://github.com/alphaprime/Target_Attack_Rank_Aggregation

    Journal ref: Early Access by TPAMI 2022 (https://ieeexplore.ieee.org/document/9830042)

  13. arXiv:2208.12848  [pdf, other

    cs.CL

    Coalescing Global and Local Information for Procedural Text Understanding

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Eric Nyberg, Alessandro Oltramari

    Abstract: Procedural text understanding is a challenging language reasoning task that requires models to track entity states across the development of a narrative. A complete procedural understanding solution should combine three core aspects: local and global views of the inputs, and global view of outputs. Prior methods considered a subset of these aspects, resulting in either low precision or low recall.… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: COLING 2022

  14. arXiv:2208.12462  [pdf, other

    cs.CV

    Seg4Reg+: Consistency Learning between Spine Segmentation and Cobb Angle Regression

    Authors: Yi Lin, Luyan Liu, Kai Ma, Yefeng Zheng

    Abstract: Automated methods for Cobb angle estimation are of high demand for scoliosis assessment. Existing methods typically calculate the Cobb angle from landmark estimation, or simply combine the low-level task (e.g., landmark detection and spine segmentation) with the Cobb angle regression task, without fully exploring the benefits from each other. In this study, we propose a novel multi-task framework,… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: Accepted by MICCAI 2021

  15. arXiv:2208.07908  [pdf, other

    cond-mat.mes-hall cond-mat.str-el

    Fractional quantum Hall effect at the filling factor $ν=5/2$

    Authors: Ken K. W. Ma, Michael R. Peterson, V. W. Scarola, Kun Yang

    Abstract: The fractional quantum Hall (FQH) effect at the filling factor $ν=5/2$ was discovered in GaAs heterostructures more than 35 years ago. Various topological orders have been proposed as possible candidates to describe this FQH state. Some of them possess non-Abelian anyon excitations, an entirely new type of quasiparticle with fascinating properties. If observed, non-Abelian anyons could offer funda… ▽ More

    Submitted 29 September, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: Updated version; A chapter for Encyclopedia of Condensed Matter Physics, 2nd edition (Elsevier)

    Journal ref: Encyclopedia of Condensed Matter Physics, 2nd edition (2023)

  16. arXiv:2208.06970  [pdf, other

    stat.ME cs.HC

    Level Set Restricted Voronoi Tessellation for Large scale Spatial Statistical Analysis

    Authors: Tyson Neuroth, Martin Rieth, Konduri Aditya, Myoungkyu Lee, Jacqueline H Chen, Kwan-Liu Ma

    Abstract: Spatial statistical analysis of multivariate volumetric data can be challenging due to scale, complexity, and occlusion. Advances in topological segmentation, feature extraction, and statistical summarization have helped overcome the challenges. This work introduces a new spatial statistical decomposition method based on level sets, connected components, and a novel variation of the restricted cen… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

  17. arXiv:2207.14769  [pdf, other

    cs.CV

    Image Quality Assessment: Integrating Model-Centric and Data-Centric Approaches

    Authors: Peibei Cao, Dingquan Li, Kede Ma

    Abstract: Learning-based image quality assessment (IQA) has made remarkable progress in the past decade, but nearly all consider the two key components -- model and data -- in isolation. Specifically, model-centric IQA focuses on develo** ``better'' objective quality methods on fixed and extensively reused datasets, with a great danger of overfitting. Data-centric IQA involves conducting psychophysical ex… ▽ More

    Submitted 8 December, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

  18. arXiv:2207.13688  [pdf, ps, other

    cond-mat.stat-mech quant-ph

    Eigenstate thermalization and disappearance of quantum many-body scar states in interacting fermion systems

    Authors: Ken K. W. Ma, A. Volya, Kun Yang

    Abstract: The recent discovery of quantum many-body scar states has revealed the possibility of having states with low entanglement that violate the eigenstate thermalization hypothesis in nonintegrable systems. Such states with low entanglement entropy are rare but naturally exist in the integrable system of free fermions. Here, we demonstrate analytically that these atypical states would be always elimina… ▽ More

    Submitted 2 January, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted version by PRB

    Journal ref: Phys. Rev. B 106, 214313 (2022)

  19. arXiv:2207.11620  [pdf, other

    cs.GR cs.LG

    Interactive Volume Visualization via Multi-Resolution Hash Encoding based Neural Representation

    Authors: Qi Wu, David Bauer, Michael J. Doyle, Kwan-Liu Ma

    Abstract: Neural networks have shown great potential in compressing volume data for visualization. However, due to the high cost of training and inference, such volumetric neural representations have thus far only been applied to offline data processing and non-interactive rendering. In this paper, we demonstrate that by simultaneously leveraging modern GPU tensor cores, a native CUDA neural network framewo… ▽ More

    Submitted 29 June, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: There is a supplementary video for this manuscript, which can be accessed via this link: https://drive.google.com/file/d/17wSgIm_VsoeGhfyZwMpOnCYy2Mj3ydGv/view?usp=sharing

  20. arXiv:2207.10232  [pdf, other

    math.OC

    Optimal, centralized dynamic curbside parking space zoning

    Authors: Nawaf Nazir, Shushman Choudhury, Stephen Zoepf, Ke Ma, Chase Dowling

    Abstract: In this paper we formulate a dynamic mixed integer program for optimally zoning curbside parking spaces subject to transportation policy-inspired constraints and regularization terms. First, we illustrate how given some objective of curb zoning valuation as a function of zone type (e.g., paid parking or bus stop), dynamically rezoning involves unrolling this optimization program over a fixed time… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  21. arXiv:2207.09689  [pdf, other

    cs.CV

    Uncertainty Inspired Underwater Image Enhancement

    Authors: Zhenqi Fu, Wu Wang, Yue Huang, Xinghao Ding, Kai-Kuang Ma

    Abstract: A main challenge faced in the deep learning-based Underwater Image Enhancement (UIE) is that the ground truth high-quality image is unavailable. Most of the existing methods first generate approximate reference maps and then train an enhancement network with certainty. This kind of method fails to handle the ambiguity of the reference map. In this paper, we resolve UIE into distribution estimation… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  22. arXiv:2207.09312  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for COVID-19 Screening With Chest Radiography

    Authors: Kai Ma, Pengcheng Xi, Karim Habashy, Ashkan Ebadi, Stéphane Tremblay, Alexander Wong

    Abstract: Building AI models with trustworthiness is important especially in regulated areas such as healthcare. In tackling COVID-19, previous work uses convolutional neural networks as the backbone architecture, which has shown to be prone to over-caution and overconfidence in making decisions, rendering them less trustworthy -- a crucial flaw in the context of medical imaging. In this study, we propose a… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted to 39th International Conference on Machine Learning, Workshop on Healthcare AI and COVID-19

  23. arXiv:2207.08859  [pdf, other

    cs.CV

    Prior-Guided Adversarial Initialization for Fast Adversarial Training

    Authors: Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao

    Abstract: Fast adversarial training (FAT) effectively improves the efficiency of standard adversarial training (SAT). However, initial FAT encounters catastrophic overfitting, i.e.,the robust accuracy against adversarial attacks suddenly and dramatically decreases. Though several FAT variants spare no effort to prevent overfitting, they sacrifice much calculation cost. In this paper, we explore the differen… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

    Journal ref: ECCV 2022

  24. arXiv:2207.08549  [pdf, other

    cs.CV

    Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation

    Authors: Xinyu Shi, Dong Wei, Yu Zhang, Donghuan Lu, Munan Ning, Jiashun Chen, Kai Ma, Yefeng Zheng

    Abstract: Research into Few-shot Semantic Segmentation (FSS) has attracted great attention, with the goal to segment target objects in a query image given only a few annotated support images of the target class. A key to this challenging task is to fully utilize the information in the support images by exploiting fine-grained correlations between the query and support images. However, most existing approach… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  25. Gigapixel Whole-Slide Images Classification using Locally Supervised Learning

    Authors: **gwei Zhang, Xin Zhang, Ke Ma, Rajarsi Gupta, Joel Saltz, Maria Vakalopoulou, Dimitris Samaras

    Abstract: Histopathology whole slide images (WSIs) play a very important role in clinical studies and serve as the gold standard for many cancer diagnoses. However, generating automatic tools for processing WSIs is challenging due to their enormous sizes. Currently, to deal with this issue, conventional methods rely on a multiple instance learning (MIL) strategy to process a WSI at patch level. Although eff… ▽ More

    Submitted 26 September, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: Accepted to MICCAI 2022 Oral

    Journal ref: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2022

  26. arXiv:2207.06618  [pdf, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Anisotropic, two-dimensional, disordered Wigner solid

    Authors: Md. S. Hossain, M. K. Ma, K. A. Villegas-Rosales, Y. J. Chung, L. N. Pfeiffer, K. W. West, K. W. Baldwin, M. Shayegan

    Abstract: The interplay between the Fermi sea anisotropy, electron-electron interaction, and localization phenomena can give rise to exotic many-body phases. An exciting example is an anisotropic two-dimensional (2D) Wigner solid (WS), where electrons form an ordered array with an anisotropic lattice structure. Such a state has eluded experiments up to now as its realization is extremely demanding: First, a… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Journal ref: Phys. Rev. Lett. 129, 036601 (2022)

  27. arXiv:2207.05306  [pdf, other

    cs.CV cs.AI

    Contrastive Deep Supervision

    Authors: Linfeng Zhang, Xin Chen, Junbo Zhang, Runpei Dong, Kaisheng Ma

    Abstract: The success of deep learning is usually accompanied by the growth in neural network depth. However, the traditional training method only supervises the neural network at its last layer and propagates the supervision layer-by-layer, which leads to hardship in optimizing the intermediate layers. Recently, deep supervision has been proposed to add auxiliary classifiers to the intermediate layers of d… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted in ECCV2022

  28. arXiv:2206.13891  [pdf, other

    cs.LG stat.ML

    Feature Learning for Nonlinear Dimensionality Reduction toward Maximal Extraction of Hidden Patterns

    Authors: Takanori Fujiwara, Yun-Hsin Kuo, Anders Ynnerman, Kwan-Liu Ma

    Abstract: Dimensionality reduction (DR) plays a vital role in the visual analysis of high-dimensional data. One main aim of DR is to reveal hidden patterns that lie on intrinsic low-dimensional manifolds. However, DR often overlooks important patterns when the manifolds are distorted or masked by certain influential data attributes. This paper presents a feature learning framework, FEALM, designed to genera… ▽ More

    Submitted 24 February, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted by PacificVis 2023. The previous preprint version was titled "Feature Learning for Dimensionality Reduction toward Maximal Extraction of Hidden Patterns" (arxiv:2206.13891v2)

  29. arXiv:2206.13170  [pdf, other

    cs.LG cs.AI

    Measuring and Improving the Use of Graph Information in Graph Neural Networks

    Authors: Yifan Hou, Jian Zhang, James Cheng, Kaili Ma, Richard T. B. Ma, Hongzhi Chen, Ming-Chang Yang

    Abstract: Graph neural networks (GNNs) have been widely used for representation learning on graph data. However, there is limited understanding on how much performance GNNs actually gain from graph data. This paper introduces a context-surrounding GNN framework and proposes two smoothness metrics to measure the quantity and quality of information obtained from graph data. A new GNN model, called CS-GNN, is… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: This paper has been published in ICLR 2020. Code and Dataset can be found here: https://github.com/yifan-h/CS-GNN

  30. arXiv:2206.09146  [pdf, other

    eess.IV cs.AI cs.CV

    A Perceptually Optimized and Self-Calibrated Tone Map** Operator

    Authors: Peibei Cao, Chenyang Le, Yuming Fang, Kede Ma

    Abstract: With the increasing popularity and accessibility of high dynamic range (HDR) photography, tone map** operators (TMOs) for dynamic range compression are practically demanding. In this paper, we develop a two-stage neural network-based TMO that is self-calibrated and perceptually optimized. In Stage one, motivated by the physiology of the early stages of the human visual system, we first decompose… ▽ More

    Submitted 25 August, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 15 pages,17 figures

  31. arXiv:2206.08751  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment of Virtual Reality Videos in the Wild

    Authors: Wen Wen, Mu Li, Yiru Yao, Xiangjie Sui, Yabin Zhang, Long Lan, Yuming Fang, Kede Ma

    Abstract: Investigating how people perceive virtual reality (VR) videos in the wild (i.e., those captured by everyday users) is a crucial and challenging task in VR-related applications due to complex authentic distortions localized in space and time. Existing panoramic video databases only consider synthetic distortions, assume fixed viewing conditions, and are limited in size. To overcome these shortcomin… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  32. arXiv:2206.07766  [pdf, other

    cs.LG stat.ML

    Pareto Invariant Risk Minimization: Towards Mitigating the Optimization Dilemma in Out-of-Distribution Generalization

    Authors: Yongqiang Chen, Kaiwen Zhou, Yatao Bian, Binghui Xie, Bingzhe Wu, Yonggang Zhang, Kaili Ma, Han Yang, Peilin Zhao, Bo Han, James Cheng

    Abstract: Recently, there has been a growing surge of interest in enabling machine learning systems to generalize well to Out-of-Distribution (OOD) data. Most efforts are devoted to advancing optimization objectives that regularize models to capture the underlying invariance; however, there often are compromises in the optimization process of these OOD objectives: i) Many OOD objectives have to be relaxed a… ▽ More

    Submitted 2 March, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: ICLR 2023, 50 pages, 58 figures

  33. YOLoC: DeploY Large-Scale Neural Network by ROM-based Computing-in-Memory using ResiduaL Branch on a Chip

    Authors: Yiming Chen, Guodong Yin, Zhanhong Tan, Mingyen Lee, Zekun Yang, Yongpan Liu, Huazhong Yang, Kaisheng Ma, Xueqing Li

    Abstract: Computing-in-memory (CiM) is a promising technique to achieve high energy efficiency in data-intensive matrix-vector multiplication (MVM) by relieving the memory bottleneck. Unfortunately, due to the limited SRAM capacity, existing SRAM-based CiM needs to reload the weights from DRAM in large-scale networks. This undesired fact weakens the energy efficiency significantly. This work, for the first… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 6 pages, 14 figures. to be published in DAC 2022

    Journal ref: Design Automation Conference 2022

  34. arXiv:2206.00227  [pdf, other

    cs.CV

    Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded Views

    Authors: Junbo Zhang, Kaisheng Ma

    Abstract: A data augmentation module is utilized in contrastive learning to transform the given data example into two views, which is considered essential and irreplaceable. However, the predetermined composition of multiple data augmentations brings two drawbacks. First, the artificial choice of augmentation types brings specific representational invariances to the model, which have different degrees of po… ▽ More

    Submitted 21 August, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR 2022

    Journal ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  35. arXiv:2205.13489  [pdf, other

    cs.CV cs.GR eess.IV

    Measuring Perceptual Color Differences of Smartphone Photographs

    Authors: Zhihua Wang, Keshuo Xu, Yang Yang, Jianlei Dong, Shuhang Gu, Lihao Xu, Yuming Fang, Kede Ma

    Abstract: Measuring perceptual color differences (CDs) is of great importance in modern smartphone photography. Despite the long history, most CD measures have been constrained by psychophysical data of homogeneous color patches or a limited number of simplistic natural photographic images. It is thus questionable whether existing CD measures generalize in the age of smartphone photography characterized by… ▽ More

    Submitted 31 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 10 figures, 8 tables, 14 pages

  36. arXiv:2205.12451  [pdf, other

    cs.CV

    Region-aware Knowledge Distillation for Efficient Image-to-Image Translation

    Authors: Linfeng Zhang, Xin Chen, Runpei Dong, Kaisheng Ma

    Abstract: Recent progress in image-to-image translation has witnessed the success of generative adversarial networks (GANs). However, GANs usually contain a huge number of parameters, which lead to intolerant memory and computation consumption and limit their deployment on edge devices. To address this issue, knowledge distillation is proposed to transfer the knowledge from a cumbersome teacher model to an… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  37. arXiv:2205.11098  [pdf, other

    cs.CV cs.LG

    PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection

    Authors: Linfeng Zhang, Runpei Dong, Hung-Shuo Tai, Kaisheng Ma

    Abstract: The remarkable breakthroughs in point cloud representation learning have boosted their usage in real-world applications such as self-driving cars and virtual reality. However, these applications usually have an urgent requirement for not only accurate but also efficient 3D object detection. Recently, knowledge distillation has been proposed as an effective model compression technique, which transf… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  38. arXiv:2205.10661  [pdf, other

    cs.CL cs.AI

    An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

    Authors: Jiarui Zhang, Filip Ilievski, Kaixin Ma, Jonathan Francis, Alessandro Oltramari

    Abstract: Self-supervision based on the information extracted from large knowledge graphs has been shown to improve the generalization of language models, in zero-shot evaluation on various downstream language reasoning tasks. Since these improvements are reported in aggregate, however, little is known about (i) how to select the appropriate knowledge for solid performance across tasks, (ii) how to combine… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  39. Mono-$γ$ Production of a Vector Dark Matter at Future $e^+e^-$ Collider

    Authors: Kai Ma

    Abstract: Associated production of a dark particle and a photon, represented as a mono-$γ$ event, is a promising channel to probe particle contents and dynamics in the dark sector. In this paper we study properties of the mono-$γ$ production of a vector dark matter at future $e^+e^-$ colliders. The photon-like and Pauli operators, as well as triple gauge bosons interactions involving the dark matter, are co… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 6 captioned figures and 1 table, 25 pages

  40. arXiv:2204.13892  [pdf, other

    cs.CV

    SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation

    Authors: Chang Shu, Ziming Chen, Lei Chen, Kuan Ma, Minghui Wang, Haibing Ren

    Abstract: Since context modeling is critical for estimating depth from a single image, researchers put tremendous effort into obtaining global context. Many global manipulations are designed for traditional CNN-based architectures to overcome the locality of convolutions. Attention mechanisms or transformers originally designed for capturing long-range dependencies might be a better choice, but usually comp… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 7 pages, 5 figures

  41. Learning Shape Priors by Pairwise Comparison for Robust Semantic Segmentation

    Authors: Cong Xie, Hualuo Liu, Shilei Cao, Dong Wei, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Semantic segmentation is important in medical image analysis. Inspired by the strong ability of traditional image analysis techniques in capturing shape priors and inter-subject similarity, many deep learning (DL) models have been recently proposed to exploit such prior information and achieved robust performance. However, these two types of important prior information are usually studied separate… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: IEEE ISBI 2021

  42. arXiv:2204.10090  [pdf, other

    eess.IV cs.CV

    Learn from Unpaired Data for Image Restoration: A Variational Bayes Approach

    Authors: Dihan Zheng, Xiaowen Zhang, Kaisheng Ma, Chenglong Bao

    Abstract: Collecting paired training data is difficult in practice, but the unpaired samples broadly exist. Current approaches aim at generating synthesized training data from unpaired samples by exploring the relationship between the corrupted and clean data. This work proposes LUD-VAE, a deep generative method to learn the joint probability density function from data sampled from marginal distributions. O… ▽ More

    Submitted 11 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  43. arXiv:2204.06951  [pdf, other

    cs.CV

    Unsupervised Deep Learning Meets Chan-Vese Model

    Authors: Dihan Zheng, Chenglong Bao, Zuoqiang Shi, Haibin Ling, Kaisheng Ma

    Abstract: The Chan-Vese (CV) model is a classic region-based method in image segmentation. However, its piecewise constant assumption does not always hold for practical applications. Many improvements have been proposed but the issue is still far from well solved. In this work, we propose an unsupervised image segmentation approach that integrates the CV model with deep neural networks, which significantly… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  44. arXiv:2204.06187  [pdf, other

    cs.CV

    Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

    Authors: Xiyu Wang, Yuecong Xu, Kezhi Mao, Jianfei Yang

    Abstract: Assuming the source label space subsumes the target one, Partial Video Domain Adaptation (PVDA) is a more general and practical scenario for cross-domain video classification problems. The key challenge of PVDA is to mitigate the negative transfer caused by the source-only outlier classes. To tackle this challenge, a crucial step is to aggregate target predictions to assign class weights by up-wei… ▽ More

    Submitted 11 July, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted by ACM Multimedia (ACMMM) 2022, update to camera-ready version. 8 pages of text, 5 figures, 2 tables

  45. arXiv:2204.04088  [pdf, other

    eess.SY

    Stochastic Gradient-based Fast Distributed Multi-Energy Management for an Industrial Park with Temporally-Coupled Constraints

    Authors: Dafeng Zhu, Bo Yang, Chengbin Ma, Zhaojian Wang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Contemporary industrial parks are challenged by the growing concerns about high cost and low efficiency of energy supply. Moreover, in the case of uncertain supply/demand, how to mobilize delay-tolerant elastic loads and compensate real-time inelastic loads to match multi-energy generation/storage and minimize energy cost is a key issue. Since energy management is hardly to be implemented offline… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by Applied Energy

  46. arXiv:2203.16092  [pdf, other

    cs.CV

    Global Tracking via Ensemble of Local Trackers

    Authors: Zikun Zhou, Jianqiu Chen, Wenjie Pei, Kaige Mao, Hongpeng Wang, Zhenyu He

    Abstract: The crux of long-term tracking lies in the difficulty of tracking the target with discontinuous moving caused by out-of-view or occlusion. Existing long-term tracking methods follow two typical strategies. The first strategy employs a local tracker to perform smooth tracking and uses another re-detector to detect the target when the target is lost. While it can exploit the temporal context like hi… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: 10 pages; 6 figures; accepted to CVPR2022

  47. arXiv:2203.12268  [pdf, other

    cs.AR

    Chiplet Actuary: A Quantitative Cost Model and Multi-Chiplet Architecture Exploration

    Authors: Yinxiao Feng, Kaisheng Ma

    Abstract: Multi-chip integration is widely recognized as the extension of Moore's Law. Cost-saving is a frequently mentioned advantage, but previous works rarely present quantitative demonstrations on the cost superiority of multi-chip integration over monolithic SoC. In this paper, we build a quantitative cost model and put forward an analytical method for multi-chip systems based on three typical multi-ch… ▽ More

    Submitted 9 April, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted by and presented at DAC 2022

  48. Domain Adaptation Meets Zero-Shot Learning: An Annotation-Efficient Approach to Multi-Modality Medical Image Segmentation

    Authors: Cheng Bian, Chenglang Yuan, Kai Ma, Shuang Yu, Dong Wei, Yefeng Zheng

    Abstract: Due to the lack of properly annotated medical data, exploring the generalization capability of the deep model is becoming a public concern. Zero-shot learning (ZSL) has emerged in recent years to equip the deep model with the ability to recognize unseen classes. However, existing studies mainly focus on natural images, which utilize linguistic models to extract auxiliary information for ZSL. It is… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: IEEE TMI

  49. arXiv:2203.07859  [pdf, other

    physics.ins-det nucl-ex

    Construction and commissioning of the collinear laser spectroscopy system at BRIF

    Authors: S. J. Wang, X. F. Yang, S. W. Bai, Y. C. Liu, P. Zhang, Y. S. Liu, H. R. Hu, H. W. Li, B. Tang, B. Q. Cui, C. Y. He, X. Ma, Q. T. Li, J. H. Chen, K. Ma, L. S. Yang, Z. Y. Hu, W. L. Pu, Y. Chen, Y. F. Guo, Z. Y. Du, Z. Yan, F. L. Liu, H. R. Wang, G. Q. Yang , et al. (2 additional authors not shown)

    Abstract: We have constructed a collinear laser spectroscopy (CLS) system installed at the Bei**g Radioactive Ion-beam Facility (BRIF), aiming to investigate the nuclear properties of unstable nuclei. The first on-line commissioning experiment of this system was performed using the continuous stable ($^{39}$K) and unstable ($^{38}$K) ion beams produced by im**ing a 100-MeV proton beam on a CaO target. Hy… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

  50. arXiv:2203.07659  [pdf

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.