Skip to main content

Showing 1–50 of 75 results for author: Ying, H

.
  1. arXiv:2406.16620  [pdf, other

    cs.CV cs.CL

    OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

    Authors: Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee

    Abstract: Recent advancements in Large Language Models (LLMs) have expanded their capabilities to multimodal contexts, including comprehensive video understanding. However, processing extensive videos such as 24-hour CCTV footage or full-length films presents significant challenges due to the vast data and processing demands. Traditional methods, like extracting key frames or converting frames to text, ofte… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.03847  [pdf, other

    cs.CL

    Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

    Authors: Huaiyuan Ying, Zijian Wu, Yihan Geng, Jiayu Wang, Dahua Lin, Kai Chen

    Abstract: Large language models have demonstrated impressive capabilities across various natural language processing tasks, especially in solving mathematical problems. However, large language models are not good at math theorem proving using formal languages like Lean. A significant challenge in this area is the scarcity of training data available in these formal languages. To address this issue, we propos… ▽ More

    Submitted 7 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.18800  [pdf

    cs.CV

    Face processing emerges from object-trained convolutional neural networks

    Authors: Zhenhua Zhao, Ji Chen, Zhicheng Lin, Haojiang Ying

    Abstract: Whether face processing depends on unique, domain-specific neurocognitive mechanisms or domain-general object recognition mechanisms has long been debated. Directly testing these competing hypotheses in humans has proven challenging due to extensive exposure to both faces and objects. Here, we systematically test these hypotheses by capitalizing on recent progress in convolutional neural networks… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 31 pages, 5 Figures

  4. Fair Evaluation of Federated Learning Algorithms for Automated Breast Density Classification: The Results of the 2022 ACR-NCI-NVIDIA Federated Learning Challenge

    Authors: Kendall Schmidt, Benjamin Bearce, Ken Chang, Laura Coombs, Keyvan Farahani, Marawan Elbatele, Kaouther Mouhebe, Robert Marti, Ruipeng Zhang, Yao Zhang, Yanfeng Wang, Yaojun Hu, Haochao Ying, Yuyang Xu, Conrad Testagrose, Mutlu Demirer, Vikash Gupta, Ünal Akünal, Markus Bujotzek, Klaus H. Maier-Hein, Yi Qin, Xiaomeng Li, Jayashree Kalpathy-Cramer, Holger R. Roth

    Abstract: The correct interpretation of breast density is important in the assessment of breast cancer risk. AI has been shown capable of accurately predicting breast density, however, due to the differences in imaging characteristics across mammography systems, models built using data from one system do not generalize well to other systems. Though federated learning (FL) has emerged as a way to improve the… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

    Journal ref: Medical Image Analysis Volume 95, July 2024, 103206

  5. arXiv:2404.19246  [pdf

    cs.CR cs.AR

    Logistic Map Pseudo Random Number Generator in FPGA

    Authors: Mateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo

    Abstract: This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display dr… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  6. arXiv:2404.11171  [pdf, other

    cs.LG cs.AI eess.SP

    Personalized Heart Disease Detection via ECG Digital Twin Generation

    Authors: Yaojun Hu, **tai Chen, Lianting Hu, Dantong Li, Jiahuan Yan, Haochao Ying, Huiying Liang, Jian Wu

    Abstract: Heart diseases rank among the leading causes of global mortality, demonstrating a crucial need for early diagnosis and intervention. Most traditional electrocardiogram (ECG) based automated diagnosis methods are trained at population level, neglecting the customization of personalized ECGs to enhance individual healthcare management. A potential solution to address this limitation is to employ dig… ▽ More

    Submitted 11 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2403.19124  [pdf, other

    cs.CV

    PoCo: A Self-Supervised Approach via Polar Transformation Based Progressive Contrastive Learning for Ophthalmic Disease Diagnosis

    Authors: **hong Wang, Tingting Chen, **tai Chen, Yixuan Wu, Yuyang Xu, Danny Chen, Haochao Ying, Jian Wu

    Abstract: Automatic ophthalmic disease diagnosis on fundus images is important in clinical practice. However, due to complex fundus textures and limited annotated data, develo** an effective automatic method for this problem is still challenging. In this paper, we present a self-supervised method via polar transformation based progressive contrastive learning, called PoCo, for ophthalmic disease diagnosis… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  8. arXiv:2403.17297  [pdf, other

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.15876  [pdf, other

    cs.CV cs.AI

    Cognitive resilience: Unraveling the proficiency of image-captioning models to interpret masked visual content

    Authors: Zhicheng Du, Zhaotian Xie, Huazhang Ying, Likun Zhang, Peiwu Qin

    Abstract: This study explores the ability of Image Captioning (IC) models to decode masked visual content sourced from diverse datasets. Our findings reveal the IC model's capability to generate captions from masked images, closely resembling the original content. Notably, even in the presence of masks, the model adeptly crafts descriptive textual information that goes beyond what is observable in the origi… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted as tiny paper in ICLR 2024

  10. arXiv:2402.17246  [pdf, other

    eess.IV cs.CV cs.LG

    SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging

    Authors: Meng Lou, Hanning Ying, Xiaoqing Liu, Hong-Yu Zhou, Yuqing Zhang, Yizhou Yu

    Abstract: Automated classification of liver lesions in multi-phase CT and MR scans is of clinical significance but challenging. This study proposes a novel Siamese Dual-Resolution Transformer (SDR-Former) framework, specifically designed for liver lesion classification in 3D multi-phase CT and MR imaging with varying phase counts. The proposed SDR-Former utilizes a streamlined Siamese Neural Network (SNN) t… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 7 figures

  11. arXiv:2402.11177  [pdf, other

    cs.CL cs.IR

    A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction

    Authors: Huaiyuan Ying, Sheng Yu

    Abstract: Electronic health records (EHRs) hold significant value for research and applications. As a new way of information extraction, question answering (QA) can extract more flexible information than conventional methods and is more accessible to clinical researchers, but its progress is impeded by the scarcity of annotated data. In this paper, we propose a novel approach that automatically generates tr… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.06332  [pdf, other

    cs.CL

    InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

    Authors: Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin

    Abstract: The math abilities of large language models can represent their abstract reasoning ability. In this paper, we introduce and open-source our math reasoning LLMs InternLM-Math which is continue pre-trained from InternLM2. We unify chain-of-thought reasoning, reward modeling, formal reasoning, data augmentation, and code interpreter in a unified seq2seq format and supervise our model to be a versatil… ▽ More

    Submitted 24 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  13. arXiv:2402.02334  [pdf, other

    cs.LG cs.AI

    Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning

    Authors: Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin

    Abstract: Until recently, the question of the effective inductive bias of deep models on tabular data has remained unanswered. This paper investigates the hypothesis that arithmetic feature interaction is necessary for deep tabular learning. To test this point, we create a synthetic tabular dataset with a mild feature interaction assumption and examine a modified transformer architecture enabling arithmetic… ▽ More

    Submitted 19 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 figures, to be published to AAAI2024

    ACM Class: I.2.4

  14. arXiv:2312.08036  [pdf

    cs.CL

    CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs

    Authors: Huaiyuan Ying, Zhengyun Zhao, Yang Zhao, Sihang Zeng, Sheng Yu

    Abstract: Objective: Biomedical Knowledge Graphs play a pivotal role in various biomedical research domains. Concurrently, term clustering emerges as a crucial step in constructing these knowledge graphs, aiming to identify synonymous terms. Due to a lack of knowledge, previous contrastive learning models trained with Unified Medical Language System (UMLS) synonyms struggle at clustering difficult terms and… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  15. arXiv:2312.06171  [pdf, other

    cs.CV cs.MM

    Jointly Explicit and Implicit Cross-Modal Interaction Network for Anterior Chamber Inflammation Diagnosis

    Authors: Qian Shao, Ye Dai, Haochao Ying, Kan Xu, **hong Wang, Wei Chi, Jian Wu

    Abstract: Uveitis demands the precise diagnosis of anterior chamber inflammation (ACI) for optimal treatment. However, current diagnostic methods only rely on a limited single-modal disease perspective, which leads to poor performance. In this paper, we investigate a promising yet challenging way to fuse multimodal data for ACI diagnosis. Notably, existing fusion paradigms focus on empowering implicit modal… ▽ More

    Submitted 19 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  16. arXiv:2312.03023  [pdf, ps, other

    hep-lat

    A study of topological quantities of lattice QCD by a modified DCGAN frame

    Authors: Lin Gao, He** Ying, Jianbo Zhang

    Abstract: A modified deep convolutional generative adversarial network (M-DCGAN) frame is proposed to study the N-dimensional (ND) topological quantities in lattice QCD based on the Monte Carlo (MC) simulations. We construct a new scaling structure including fully connected layers to support the generation of high-quality high-dimensional images for the M-DCGAN. Our results show that the M-DCGAN scheme of t… ▽ More

    Submitted 17 February, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  17. arXiv:2311.13234  [pdf, other

    cs.CV cs.AI

    TSegFormer: 3D Tooth Segmentation in Intraoral Scans with Geometry Guided Transformer

    Authors: Huimin Xiong, Kunle Li, Kaiyuan Tan, Yang Feng, Joey Tianyi Zhou, ** Hao, Haochao Ying, Jian Wu, Zuozhu Liu

    Abstract: Optical Intraoral Scanners (IOS) are widely used in digital dentistry to provide detailed 3D information of dental crowns and the gingiva. Accurate 3D tooth segmentation in IOSs is critical for various dental applications, while previous methods are error-prone at complicated boundaries and exhibit unsatisfactory results across patients. In this paper, we propose TSegFormer which captures both loc… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: MICCAI 2023, STAR(Student Travel) award. 11 pages, 3 figures, 5 tables. arXiv admin note: text overlap with arXiv:2210.16627

  18. arXiv:2311.11666  [pdf, other

    cs.CV

    OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

    Authors: Haiyang Ying, Yixuan Yin, **zhi Zhang, Fan Wang, Tao Yu, Ruqi Huang, Lu Fang

    Abstract: Towards holistic understanding of 3D scenes, a general 3D segmentation method is needed that can segment diverse objects without restrictions on object quantity or categories, while also reflecting the inherent hierarchical structure. To achieve this, we propose OmniSeg3D, an omniversal segmentation method aims for segmenting anything in 3D all at once. The key insight is to lift multi-view incons… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  19. Study of topological quantities of lattice QCD with a modified Wasserstein generative adversarial network

    Authors: Lin Gao, He** Ying, Jianbo Zhang

    Abstract: We propose a modified Wasserstein generative adversarial network (M-WGAN) to study the distribution of the topological charge in lattice QCD based on Monte Carlo simulations. We construct new generator and discriminator in M-WGAN to support the generation of high-quality distribution. Our results show that the M-WGAN scheme of machine learning should be helpful for us to calculate efficiently the… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  20. arXiv:2311.09757  [pdf, other

    cs.CV cs.AI

    UFPS: A unified framework for partially-annotated federated segmentation in heterogeneous data distribution

    Authors: Le Jiang, Li Yan Ma, Tie Yong Zeng, Shi Hui Ying

    Abstract: Partially supervised segmentation is a label-saving method based on datasets with fractional classes labeled and intersectant. However, it is still far from landing on real-world medical applications due to privacy concerns and data heterogeneity. As a remedy without privacy leakage, federated partially supervised segmentation (FPSS) is formulated in this work. The main challenges for FPSS are cla… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  21. arXiv:2310.13674  [pdf, other

    cs.CV cs.LG q-bio.NC

    Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network

    Authors: Haojiang Ying, Yi-Fan Li, Yiyang Chen

    Abstract: Convolutional neural network (CNN), as an important model in artificial intelligence, has been widely used and studied in different disciplines. The computational mechanisms of CNNs are still not fully revealed due to the their complex nature. In this study, we focused on 4 extensively studied CNNs (AlexNet, VGG11, VGG13, and VGG16) which has been analyzed as human-like models by neuroscientists w… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 24 pages, 6 figures

  22. arXiv:2310.10958  [pdf

    cs.LG cs.CV

    Enhancing Deep Neural Network Training Efficiency and Performance through Linear Prediction

    Authors: Hejie Ying, Mengmeng Song, Yaohong Tang, Shungen Xiao, Zimin Xiao

    Abstract: Deep neural networks (DNN) have achieved remarkable success in various fields, including computer vision and natural language processing. However, training an effective DNN model still poses challenges. This paper aims to propose a method to optimize the training effectiveness of DNN, with the goal of improving model performance. Firstly, based on the observation that the DNN parameters change in… ▽ More

    Submitted 2 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  23. arXiv:2309.17190  [pdf, other

    cs.CV cs.AI

    PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis

    Authors: Haiyang Ying, Baowei Jiang, **zhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang

    Abstract: This paper proposes a method for fast scene radiance field reconstruction with strong novel view synthesis performance and convenient scene editing functionality. The key idea is to fully utilize semantic parsing and primitive extraction for constraining and accelerating the radiance field reconstruction process. To fulfill this goal, a primitive-aware hybrid rendering strategy was proposed to enj… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023; Project page: https://oceanying.github.io/PARF/

  24. arXiv:2309.13235  [pdf, other

    cs.CV

    M$^3$CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders

    Authors: Qibo Qiu, Honghui Yang, Wenxiao Wang, Shun Zhang, Haiming Gao, Haochao Ying, Wei Hua, Xiaofei He

    Abstract: Masked point modeling has become a promising scheme of self-supervised pre-training for point clouds. Existing methods reconstruct either the original points or related features as the objective of pre-training. However, considering the diversity of downstream tasks, it is necessary for the model to have both low- and high-level representation modeling capabilities to capture geometric details and… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  25. arXiv:2307.08348  [pdf, other

    cs.CV

    Adaptive Local Basis Functions for Shape Completion

    Authors: Hui Ying, Tianjia Shao, He Wang, Yin Yang, Kun Zhou

    Abstract: In this paper, we focus on the task of 3D shape completion from partial point clouds using deep implicit functions. Existing methods seek to use voxelized basis functions or the ones from a certain family of functions (e.g., Gaussians), which leads to high computational costs or limited shape expressivity. On the contrary, our method employs adaptive local basis functions, which are learned end-to… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: In SIGGRAPH 2023

  26. arXiv:2305.04213  [pdf, other

    cs.CV

    Robust Image Ordinal Regression with Controllable Image Generation

    Authors: Yi Cheng, Haochao Ying, Renjun Hu, **hong Wang, Wenhao Zheng, Xiao Zhang, Danny Chen, Jian Wu

    Abstract: Image ordinal regression has been mainly studied along the line of exploiting the order of categories. However, the issues of class imbalance and category overlap that are very common in ordinal regression were largely overlooked. As a result, the performance on minority categories is often unsatisfactory. In this paper, we propose a novel framework called CIG based on controllable image generatio… ▽ More

    Submitted 21 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 8 pages, 12 figures, to be published in IJCAI2023

  27. arXiv:2305.00274  [pdf

    cond-mat.mtrl-sci

    Evolution of medium-range order and its correlation with magnetic nanodomains in Fe-Dy-B-Nb bulk metallic glasses

    Authors: Jiacheng Ge, Yao Gu, Zhongzhen Yao, Sinan Liu, Huiqiang Ying, Chenyu Lu, Zhenduo Wu, Yang Ren, Jun-ichi Suzuki, Zhenhua Xie, Yubin Ke, He Zhu, Song Tang, Xun-Li Wang, Si Lan

    Abstract: Fe-based metallic glasses are promising functional materials for advanced magnetism and sensor fields. Tailoring magnetic performance in amorphous materials requires a thorough knowledge of the correlation between structural disorder and magnetic order, which remains ambiguous. Two practical difficulties remain: the first is directly observing subtle magnetic structural changes on multiple scales,… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: number of pages is 31 and number of figures is 14, including the Supplementary Material

  28. arXiv:2304.11672  [pdf

    cs.SE

    CBIM: A Graph-based Approach to Enhance Interoperability Using Semantic Enrichment

    Authors: Zijian Wang, Huaquan Ying, Rafael Sacks, André Borrmann

    Abstract: Interoperability remains a challenge in the construction industry. In this study, we propose a semantic enrichment approach to construct BIM knowledge graphs from pure building object geometries and demonstrate its potential to support BIM interoperability. Our approach involves machine learning and rule-based methods for object classification, relationship determination (e.g., hosting and adjacen… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  29. arXiv:2303.15116  [pdf

    cs.CL

    An ontology-aided, natural language-based approach for multi-constraint BIM model querying

    Authors: Mengtian Yin, Llewellyn Tang, Chris Webster, Shen Xu, Xiongyi Li, Huaquan Ying

    Abstract: Being able to efficiently retrieve the required building information is critical for construction project stakeholders to carry out their engineering and management activities. Natural language interface (NLI) systems are emerging as a time and cost-effective way to query Building Information Models (BIMs). However, the existing methods cannot logically combine different constraints to perform fin… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  30. FraudAuditor: A Visual Analytics Approach for Collusive Fraud in Health Insurance

    Authors: Jiehui Zhou, Xumeng Wang, Jie Wang, Hui Ye, Huanliang Wang, Zihan Zhou, Dongming Han, Haochao Ying, Jian Wu, Wei Chen

    Abstract: Collusive fraud, in which multiple fraudsters collude to defraud health insurance funds, threatens the operation of the healthcare system. However, existing statistical and machine learning-based methods have limited ability to detect fraud in the scenario of health insurance due to the high similarity of fraudulent behaviors to normal medical visits and the lack of labeled data. To ensure the acc… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 12 pages, 7 figures

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2023

  31. arXiv:2212.14254  [pdf

    q-bio.NC

    The Markovian and Memoryless Properties of Visual System: Evidence from Serial Face Processing

    Authors: Jun-Ming Yu, Haojiang Ying

    Abstract: The visual system can be viewed and studied as an information processing system. If so, then the visual system should follow specific fundamental properties: either a memory or a memoryless system. Previous studies in serial dependence in vision found that the perception of the current stimulus is positively determined by the previous one. However, we are not entirely sure whether this phenomenon… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  32. arXiv:2212.05794  [pdf, other

    eess.IV cs.CV

    CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

    Authors: **hong Wang, **gwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu

    Abstract: Surgery is the only viable treatment for cataract patients with visual acuity (VA) impairment. Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed. Unfortunately, due to complicated fundus conditions, determining postoperative VA remains difficult for med… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, accepted for publication in BIBM

  33. arXiv:2211.06614  [pdf, other

    cs.LG cs.AI

    Robust Training of Graph Neural Networks via Noise Governance

    Authors: Siyi Qian, Haochao Ying, Renjun Hu, **gbo Zhou, **tai Chen, Danny Z. Chen, Jian Wu

    Abstract: Graph Neural Networks (GNNs) have become widely-used models for semi-supervised learning. However, the robustness of GNNs in the presence of label noise remains a largely under-explored problem. In this paper, we consider an important yet challenging scenario where labels on nodes of graphs are not only noisy but also scarce. In this scenario, the performance of GNNs is prone to degrade due to lab… ▽ More

    Submitted 25 February, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 9 pages, accepted to WSDM 2023 Research Track

  34. DPVisCreator: Incorporating Pattern Constraints to Privacy-preserving Visualizations via Differential Privacy

    Authors: Jiehui Zhou, Xumeng Wang, Jason K. Wong, Huanliang Wang, Zhongwei Wang, Xiaoyu Yang, Xiaoran Yan, Haozhe Feng, Huamin Qu, Haochao Ying, Wei Chen

    Abstract: Data privacy is an essential issue in publishing data visualizations. However, it is challenging to represent multiple data patterns in privacy-preserving visualizations. The prior approaches target specific chart types or perform an anonymization model uniformly without considering the importance of data patterns in visualizations. In this paper, we propose a visual analytics approach that facili… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: 9 pages, 5 figures

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, vol. 29, no. 1, pp. 809-819, Jan. 2023

  35. arXiv:2207.10670  [pdf, other

    cs.LG cs.AI eess.SP

    ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases

    Authors: **tai Chen, Kuanlun Liao, Kun Wei, Haochao Ying, Danny Z. Chen, Jian Wu

    Abstract: Electrocardiogram (ECG) is a widely used non-invasive diagnostic tool for heart diseases. Many studies have devised ECG analysis models (e.g., classifiers) to assist diagnosis. As an upstream task, researches have built generative models to synthesize ECG data, which are beneficial to providing training samples, privacy protection, and annotation reduction. However, previous generative methods for… ▽ More

    Submitted 29 May, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Journal ref: In International Conference on Machine Learning, 3360--3370, (2022), PMLR

  36. arXiv:2207.08962  [pdf, ps, other

    math.CO math.AC math.NT

    $p$-numerical semigroups with $p$-symmetric properties

    Authors: Takao Komatsu, Haotian Ying

    Abstract: The so-called Frobenius number in the famous linear Diophantine problem of Frobenius is the largest integer such that the linear equation $a_1 x_1+\cdots+a_k x_k=n$ ($a_1,\dots,a_k$ are given positive integers with $\gcd(a_1,\dots,a_k)=1$) does not have a non-negative integer solution $(x_1,\dots,x_k)$. The generalized Frobenius number (called the $p$-Frobenius number) is the largest integer such… ▽ More

    Submitted 18 June, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Journal of Algebra and its Applications (2024)

    MSC Class: 20M14; 11D07; 20M05; 05A15; 11B25

  37. arXiv:2206.13052  [pdf, ps, other

    math.NT math.CO

    The $p$-numerical semigroup of the triple of arithmetic progressions

    Authors: Takao Komatsu, Haotian Ying

    Abstract: For given positive integers $a_1,a_2,\dots,a_k$ with $\gcd(a_1,a_2,\dots,a_k)=1$, the denumerant $d(n)=d(n;a_1,a_2,\dots,a_k)$ is the number of nonnegative solutions $(x_1,x_2,\dots,x_k)$ of the linear equation $a_1 x_1+a_2 x_2+\dots+a_k x_k=n$ for a positive integer $n$. For a given nonnegative integer $p$, let $S_p=S_p(a_1,a_2,\dots,a_k)$ be the set of all nonnegative integers $n$'s such that… ▽ More

    Submitted 27 June, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Symmetry Vol.15 (2023)

  38. arXiv:2206.08281  [pdf, ps, other

    physics.optics cond-mat.mes-hall

    Strain modulation of photocurrent in Weyl semimetal TaIrTe4

    Authors: Ying Ding, XinRu Wang, LieHong Liao, XinYu Chen, JiaYan Zhang, YueYue Wang, Hao Ying, Yuan Li

    Abstract: We study the effect of the strain on the energy bands of TaIrTe4 sheet and the photocurrent in the Cu-TaIrTe4-Cu heterojunction by using the quantum transport simulations. It is found that the Weyl points can be completely broken with increasing of the strain along z dirction. One can obtain a large photocurrent in the Cu-TaIrTe4-Cu heterojunction in the absence of the strain. While the photocurre… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 5 pages, 7 figures

  39. arXiv:2206.05660  [pdf, ps, other

    math.CO math.NT

    The $p$-Frobenius and $p$-Sylvester numbers for Fibonacci and Lucas triplets

    Authors: Takao Komatsu, Haotian Ying

    Abstract: In this paper we study a certain kind of generalized linear Diophantine problem of Frobenius. Let $a_1,a_2,\dots,a_l$ be positive integers such that their greatest common divisor is one. For a nonnegative integer $p$, denote the $p$-Frobenius number by $g_p(a_1,a_2,\dots,a_l)$, which is the largest integer that can be represented at most $p$ ways by a linear combination with nonnegative integer co… ▽ More

    Submitted 3 December, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Mathematical Biosciences and Engineering

    MSC Class: 11D07; 05A15; 05A17; 05A19; 11B68; 11D04; 11P81; 20M14

  40. arXiv:2204.00194  [pdf

    cond-mat.mtrl-sci

    Direct synthesis of single-crystal bilayer graphene on various dielectric substrates

    Authors: ** Chen, Xianqin Xing, Wenyu Liu, Zhanjie Lu, Hao Ying, Le Huang, Zhiyong Zhang, Shunqing Wu, Zhihai Cheng, Shanshan Chen

    Abstract: In this work, a novel method to grow high-quality and large bilayer graphene (BLG) directly on various dielectric substrates was demonstrated. Large area single-crystal monolayer graphene was applied as a seeding layer to facilitate the homo-epitaxial synthesis of single crystal BLG directly on insulating substrates. The Cu nano-powders (Cu NP) with nanostructure and high surface-area were used as… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  41. arXiv:2203.09975  [pdf

    cs.CL cs.LG

    BIOS: An Algorithmically Generated Biomedical Knowledge Graph

    Authors: Sheng Yu, Zheng Yuan, Jun Xia, Shengxuan Luo, Huaiyuan Ying, Sihang Zeng, **gyi Ren, Hongyi Yuan, Zhengyun Zhao, Yucong Lin, Keming Lu, **g Wang, Yutao Xie, Heung-Yeung Shum

    Abstract: Biomedical knowledge graphs (BioMedKGs) are essential infrastructures for biomedical and healthcare big data and artificial intelligence (AI), facilitating natural language processing, model development, and data exchange. For decades, these knowledge graphs have been developed via expert curation; however, this method can no longer keep up with today's AI development, and a transition to algorith… ▽ More

    Submitted 24 April, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

  42. arXiv:2108.07975  [pdf, other

    cs.CV

    Unsupervised Image Generation with Infinite Generative Adversarial Networks

    Authors: Hui Ying, He Wang, Tianjia Shao, Yin Yang, Kun Zhou

    Abstract: Image generation has been heavily investigated in computer vision, where one core research challenge is to generate images from arbitrarily complex distributions with little supervision. Generative Adversarial Networks (GANs) as an implicit approach have achieved great successes in this direction and therefore been employed widely. However, GANs are known to suffer from issues such as mode collaps… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 18 pages, 11 figures

  43. arXiv:2104.08588  [pdf

    cs.CL

    Sentence Alignment with Parallel Documents Facilitates Biomedical Machine Translation

    Authors: Shengxuan Luo, Huaiyuan Ying, Jiao Li, Sheng Yu

    Abstract: Objective: Today's neural machine translation (NMT) can achieve near human-level translation quality and greatly facilitates international communications, but the lack of parallel corpora poses a key problem to the development of translation systems for highly specialized domains, such as biomedicine. This work presents an unsupervised algorithm for deriving parallel corpora from document-level tr… ▽ More

    Submitted 7 February, 2022; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: 16 pages, 5 figures

  44. Compacting Deep Neural Networks for Internet of Things: Methods and Applications

    Authors: Ke Zhang, Hanbo Ying, Hong-Ning Dai, Lin Li, Yuangyuang Peng, Keyi Guo, Hongfang Yu

    Abstract: Deep Neural Networks (DNNs) have shown great success in completing complex tasks. However, DNNs inevitably bring high computational cost and storage consumption due to the complexity of hierarchical structures, thereby hindering their wide deployment in Internet-of-Things (IoT) devices, which have limited computational capability and storage capacity. Therefore, it is a necessity to investigate th… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 25 pages, 11 figures

    MSC Class: 68T07 ACM Class: I.2.6; C.2

    Journal ref: IEEE Internet of Things Journal, 2021

  45. Biomedical Question Answering: A Survey of Approaches and Challenges

    Authors: Qiao **, Zheng Yuan, Guangzhi Xiong, Qianlan Yu, Huaiyuan Ying, Chuanqi Tan, Mosha Chen, Songfang Huang, Xiaozhong Liu, Sheng Yu

    Abstract: Automatic Question Answering (QA) has been successfully applied in various domains such as search engines and chatbots. Biomedical QA (BQA), as an emerging QA task, enables innovative applications to effectively perceive, access and understand complex biomedical knowledge. There have been tremendous developments of BQA in the past two decades, which we classify into 5 distinctive approaches: class… ▽ More

    Submitted 8 September, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: In submission to ACM Computing Surveys

  46. arXiv:2101.02969  [pdf, other

    cs.IR cs.DB cs.LG

    Spatial Object Recommendation with Hints: When Spatial Granularity Matters

    Authors: Hui Luo, **gbo Zhou, Zhifeng Bao, Shuangli Li, J. Shane Culpepper, Haochao Ying, Hao Liu, Hui Xiong

    Abstract: Existing spatial object recommendation algorithms generally treat objects identically when ranking them. However, spatial objects often cover different levels of spatial granularity and thereby are heterogeneous. For example, one user may prefer to be recommended a region (say Manhattan), while another user might prefer a venue (say a restaurant). Even for the same user, preferences can change at… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Journal ref: SIGIR Conference (2020) 781-790

  47. arXiv:2003.04873  [pdf, other

    stat.CO cs.DS math.OC math.PR stat.ML

    Moving Target Monte Carlo

    Authors: Haoyun Ying, Keheng Mao, Klaus Mosegaard

    Abstract: The Markov Chain Monte Carlo (MCMC) methods are popular when considering sampling from a high-dimensional random variable $\mathbf{x}$ with possibly unnormalised probability density $p$ and observed data $\mathbf{d}$. However, MCMC requires evaluating the posterior distribution $p(\mathbf{x}|\mathbf{d})$ of the proposed candidate $\mathbf{x}$ at each iteration when constructing the acceptance rate… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

  48. arXiv:1912.01954  [pdf, other

    cs.CV

    EmbedMask: Embedding Coupling for One-stage Instance Segmentation

    Authors: Hui Ying, Zhao** Huang, Shu Liu, Tianjia Shao, Kun Zhou

    Abstract: Current instance segmentation methods can be categorized into segmentation-based methods that segment first then do clustering, and proposal-based methods that detect first then predict masks for each instance proposal using repooling. In this work, we propose a one-stage method, named EmbedMask, that unifies both methods by taking advantages of them. Like proposal-based methods, EmbedMask builds… ▽ More

    Submitted 5 December, 2019; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: Code is available at github.com/yinghdb/EmbedMask

  49. arXiv:1811.07234  [pdf, other

    cs.SE cs.CL cs.LG

    Improving Automatic Source Code Summarization via Deep Reinforcement Learning

    Authors: Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu

    Abstract: Code summarization provides a high level natural language description of the function performed by code, as it can benefit the software maintenance, code categorization and retrieval. To the best of our knowledge, most state-of-the-art approaches follow an encoder-decoder framework which encodes the code into a hidden space and then decode it into natural language space, suffering from two major d… ▽ More

    Submitted 17 November, 2018; originally announced November 2018.

  50. arXiv:1809.07672  [pdf, other

    cond-mat.mes-hall

    Probing the Magnetodynamics of Magnetic Tunnel Junctions with the Aid of SiGe HBTs

    Authors: Jason Dark, Hanbin Ying, Grant Nunn, John D. Cressler, Dragomir Davidovic

    Abstract: High impedance (about 1 Megaohm) magnetic tunnel junctions (MTJs) are used to observe and record the magnetodynamics of the nanomagnets that form the junctions themselves. To counteract the bandwidth limitations caused by the high impedance of the junction and the parasitic capacitance intrinsic to any cryogenic system, silicon-germanium heterojunction bipolar transistors (SiGe HBTs) are used as c… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: 9 pages, 8 figures