Skip to main content

Showing 1–50 of 77 results for author: Tian, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03064  [pdf, other

    cs.LG cs.IR

    Path-Specific Causal Reasoning for Fairness-aware Cognitive Diagnosis

    Authors: Dacao Zhang, Kun Zhang, Le Wu, Mi Tian, Richang Hong, Meng Wang

    Abstract: Cognitive Diagnosis~(CD), which leverages students and exercise data to predict students' proficiency levels on different knowledge concepts, is one of fundamental components in Intelligent Education. Due to the scarcity of student-exercise interaction data, most existing methods focus on making the best use of available data, such as exercise content and student information~(e.g., educational con… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accpeted by KDD'2024

  2. arXiv:2404.10595  [pdf, other

    cs.CV

    Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

    Authors: Kai Chen, Yanze Li, Wenhua Zhang, Yanxin Liu, Pengxiang Li, Ruiyuan Gao, Lanqing Hong, Meng Tian, Xinhai Zhao, Zhenguo Li, Dit-Yan Yeung, Huchuan Lu, Xu Jia

    Abstract: Large Vision-Language Models (LVLMs) have received widespread attention in advancing the interpretable self-driving. Existing evaluations of LVLMs primarily focus on the multi-faceted capabilities in natural circumstances, lacking automated and quantifiable assessment for self-driving, let alone the severe road corner cases. In this paper, we propose CODA-LM, the very first benchmark for the autom… ▽ More

    Submitted 26 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Project Page: https://coda-dataset.github.io/coda-lm/

  3. arXiv:2404.10515  [pdf, other

    cs.NE

    An Enhanced Differential Grou** Method for Large-Scale Overlap** Problems

    Authors: Maojiang Tian, Mingke Chen, Wei Du, Yang Tang, Yaochu **

    Abstract: Large-scale overlap** problems are prevalent in practical engineering applications, and the optimization challenge is significantly amplified due to the existence of shared variables. Decomposition-based cooperative coevolution (CC) algorithms have demonstrated promising performance in addressing large-scale overlap** problems. However, current CC frameworks designed for overlap** problems r… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  4. arXiv:2403.01192  [pdf, other

    math.OC cs.LG cs.NE

    A Composite Decomposition Method for Large-Scale Global Optimization

    Authors: Maojiang Tian, Minyang Chen, Wei Du, Yang Tang, Yaochu **, Gary G. Yen

    Abstract: Cooperative co-evolution (CC) algorithms, based on the divide-and-conquer strategy, have emerged as the predominant approach to solving large-scale global optimization (LSGO) problems. The efficiency and accuracy of the grou** stage significantly impact the performance of the optimization process. While the general separability grou** (GSG) method has overcome the limitation of previous differ… ▽ More

    Submitted 8 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  5. arXiv:2403.00261  [pdf, other

    cs.CV

    Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification

    Authors: Jiahao Hong, Jialong Zuo, Chuchu Han, Ruochen Zheng, Ming Tian, Changxin Gao, Nong Sang

    Abstract: Recent unsupervised person re-identification (re-ID) methods achieve high performance by leveraging fine-grained local context. These methods are referred to as part-based methods. However, most part-based methods obtain local contexts through horizontal division, which suffer from misalignment due to various human poses. Additionally, the misalignment of semantic information in part features rest… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  6. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  7. arXiv:2402.17194  [pdf

    q-fin.TR cs.CE q-fin.PM

    The Random Forest Model for Analyzing and Forecasting the US Stock Market in the Context of Smart Finance

    Authors: Jiajian Zheng, Duan Xin, Qishuo Cheng, Miao Tian, Le Yang

    Abstract: The stock market is a crucial component of the financial market, playing a vital role in wealth accumulation for investors, financing costs for listed companies, and the stable development of the national macroeconomy. Significant fluctuations in the stock market can damage the interests of stock investors and cause an imbalance in the industrial structure, which can interfere with the macro level… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  8. arXiv:2402.17191  [pdf

    cs.CR cs.AI cs.LG

    AI-Driven Anonymization: Protecting Personal Data Privacy While Leveraging Machine Learning

    Authors: Le Yang, Miao Tian, Duan Xin, Qishuo Cheng, Jiajian Zheng

    Abstract: The development of artificial intelligence has significantly transformed people's lives. However, it has also posed a significant threat to privacy and security, with numerous instances of personal information being exposed online and reports of criminal attacks and theft. Consequently, the need to achieve intelligent protection of personal information through machine learning algorithms has becom… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 9 pages, 6 figures

  9. arXiv:2402.15994  [pdf

    q-fin.CP cs.CE cs.LG

    Optimizing Portfolio Management and Risk Assessment in Digital Assets Using Deep Learning for Predictive Analysis

    Authors: Qishuo Cheng, Le Yang, Jiajian Zheng, Miao Tian, Duan Xin

    Abstract: Portfolio management issues have been extensively studied in the field of artificial intelligence in recent years, but existing deep learning-based quantitative trading methods have some areas where they could be improved. First of all, the prediction mode of stocks is singular; often, only one trading expert is trained by a model, and the trading decision is solely based on the prediction results… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  10. arXiv:2312.17263  [pdf, other

    cs.CL

    TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

    Authors: Rui Song, Fausto Giunchiglia, Yingji Li, Mingjie Tian, Hao Xu

    Abstract: Cross-domain text classification aims to transfer models from label-rich source domains to label-poor target domains, giving it a wide range of practical applications. Many approaches promote cross-domain generalization by capturing domain-invariant features. However, these methods rely on unlabeled samples provided by the target domains, which renders the model ineffective when the target domain… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI-2024

  11. arXiv:2312.06614  [pdf, other

    cs.CV

    AttenScribble: Attentive Similarity Learning for Scribble-Supervised Medical Image Segmentation

    Authors: Mu Tian, Qinzhu Yang, Yi Gao

    Abstract: The success of deep networks in medical image segmentation relies heavily on massive labeled training data. However, acquiring dense annotations is a time-consuming process. Weakly-supervised methods normally employ less expensive forms of supervision, among which scribbles started to gain popularity lately thanks to its flexibility. However, due to lack of shape and boundary information, it is ex… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 11 pages, 3 figures, a modified version was submitted to Computerized Medical Imaging and Graphics and is under review

  12. arXiv:2312.06072  [pdf, other

    cs.CV

    A dynamic interactive learning framework for automated 3D medical image segmentation

    Authors: Mu Tian, Xiaohui Chen, Yi Gao

    Abstract: Many deep learning based automated medical image segmentation systems, in reality, face difficulties in deployment due to the cost of massive data annotation and high latency in model iteration. We propose a dynamic interactive learning framework that addresses these challenges by integrating interactive segmentation into end-to-end weak supervised learning with streaming tasks. We develop novel r… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 24 pages, 8 figures, under review

  13. arXiv:2311.17088  [pdf, other

    cs.CV

    Unsupervised Multimodal Deepfake Detection Using Intra- and Cross-Modal Inconsistencies

    Authors: Mulin Tian, Mahyar Khayatkhoei, Joe Mathai, Wael AbdAlmageed

    Abstract: Deepfake videos present an increasing threat to society with potentially negative impact on criminal justice, democracy, and personal safety and privacy. Meanwhile, detecting deepfakes, at scale, remains a very challenging task that often requires labeled training data from existing deepfake generation methods. Further, even the most accurate supervised deepfake detection methods do not generalize… ▽ More

    Submitted 20 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 11 pages, 3 figures, 3 tables

  14. arXiv:2311.13015  [pdf, other

    cs.LG cs.CY

    Fast and Interpretable Mortality Risk Scores for Critical Care Patients

    Authors: Chloe Qinyu Zhu, Muhang Tian, Lesia Semenova, Jiachang Liu, Jack Xu, Joseph Scarpa, Cynthia Rudin

    Abstract: Prediction of mortality in intensive care unit (ICU) patients is an important task in critical care medicine. Prior work in creating mortality risk models falls into two major categories: domain-expert-created scoring systems, and black box machine learning (ML) models. Both of these have disadvantages: black box models are unacceptable for use in hospitals, whereas manual creation of models (incl… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  15. arXiv:2310.17190  [pdf, other

    cs.CV eess.IV

    Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Map**

    Authors: Feng Zhang, Ming Tian, Zhiqiang Li, Bin Xu, Qingbo Lu, Changxin Gao, Nong Sang

    Abstract: Tone map** aims to convert high dynamic range (HDR) images to low dynamic range (LDR) representations, a critical task in the camera imaging pipeline. In recent years, 3-Dimensional LookUp Table (3D LUT) based methods have gained attention due to their ability to strike a favorable balance between enhancement performance and computational efficiency. However, these methods often fail to deliver… ▽ More

    Submitted 3 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 12 pages, 6 figures, accepted by NeurlPS 2023

  16. arXiv:2310.15290  [pdf, other

    cs.LG

    Reliable Generation of EHR Time Series via Diffusion Models

    Authors: Muhang Tian, Bernie Chen, Allan Guo, Shiyi Jiang, Anru R. Zhang

    Abstract: Electronic Health Records (EHRs) are rich sources of patient-level data, including laboratory tests, medications, and diagnoses, offering valuable resources for medical data analysis. However, concerns about privacy often restrict access to EHRs, hindering downstream analysis. Researchers have explored various methods for generating privacy-preserving EHR data. In this study, we introduce a new me… ▽ More

    Submitted 21 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

  17. arXiv:2310.14821  [pdf, other

    cs.DC cs.CR

    Mysticeti: Reaching the Limits of Latency with Uncertified DAGs

    Authors: Kushal Babel, Andrey Chursin, George Danezis, Anastasios Kichidis, Lefteris Kokoris-Kogias, Arun Koshy, Alberto Sonnino, Mingwei Tian

    Abstract: We introduce Mysticeti-C the first DAG-based Byzantine consensus protocol to achieve the lower bounds of latency of 3 message rounds. Since Mysticeti-C is built over DAGs it also achieves high resource efficiency and censorship resistance. Mysticeti-C achieves this latency improvement by avoiding explicit certification of the DAG blocks and by proposing a novel commit rule such that every block ca… ▽ More

    Submitted 30 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  18. arXiv:2310.00052  [pdf, other

    astro-ph.IM cs.AI gr-qc

    AI ensemble for signal detection of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers

    Authors: Minyang Tian, E. A. Huerta, Huihuo Zheng

    Abstract: We introduce spatiotemporal-graph models that concurrently process data from the twin advanced LIGO detectors and the advanced Virgo detector. We trained these AI classifiers with 2.4 million IMRPhenomXPHM waveforms that describe quasi-circular, spinning, non-precessing binary black hole mergers with component masses $m_{\{1,2\}}\in[3M_\odot, 50 M_\odot]$, and individual spins… ▽ More

    Submitted 4 December, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 4 pages, 2 figures, 1 table; v2: 5 pages, 2 figures, 1 table, accepted to NeurIPS 2023 workshop on Machine Learning and the Physical Sciences

    MSC Class: 68T01; 68T35; 83C35; 83C57

  19. arXiv:2307.02019  [pdf

    cs.CV cs.AI

    Generative Adversarial Networks for Dental Patient Identity Protection in Orthodontic Educational Imaging

    Authors: Mingchuan Tian, Wilson Weixun Lu, Kelvin Weng Chiong Foong, Eugene Loh

    Abstract: Objectives: This research introduces a novel area-preserving Generative Adversarial Networks (GAN) inversion technique for effectively de-identifying dental patient images. This innovative method addresses privacy concerns while preserving key dental features, thereby generating valuable resources for dental education and research. Methods: We enhanced the existing GAN Inversion methodology to m… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  20. arXiv:2306.15914  [pdf, other

    cs.CV

    The 2nd Place Solution for 2023 Waymo Open Sim Agents Challenge

    Authors: Cheng Qian, Di Xiu, Minghao Tian

    Abstract: In this technical report, we present the 2nd place solution of 2023 Waymo Open Sim Agents Challenge (WOSAC)[4]. We propose a simple yet effective autoregressive method for simulating multi-agent behaviors, which is built upon a well-known multimodal motion forecasting framework called Motion Transformer (MTR)[5] with postprocessing algorithms applied. Our submission named MTR+++ achieves 0.4697 on… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  21. arXiv:2306.15728  [pdf, other

    astro-ph.IM cs.AI gr-qc

    Physics-inspired spatiotemporal-graph AI ensemble for the detection of higher order wave mode signals of spinning binary black hole mergers

    Authors: Minyang Tian, E. A. Huerta, Huihuo Zheng, Prayush Kumar

    Abstract: We present a new class of AI models for the detection of quasi-circular, spinning, non-precessing binary black hole mergers whose waveforms include the higher order gravitational wave modes $(l, |m|)=\{(2, 2), (2, 1), (3, 3), (3, 2), (4, 4)\}$, and mode mixing effects in the $l = 3, |m| = 2$ harmonics. These AI models combine hybrid dilated convolution neural networks to accurately model both shor… ▽ More

    Submitted 18 June, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 14 pages, 6 figures, and 3 tables

    MSC Class: 68T01; 68T35; 83C35; 83C57

    Journal ref: Mach. Learn.: Sci. Technol. 5 (2024) 025056

  22. arXiv:2304.07238  [pdf, other

    physics.soc-ph cs.SI

    Robustness of community structure under edge addition

    Authors: Moyi Tian, Pablo Moriano

    Abstract: Communities often represent key structural and functional clusters in networks. To preserve such communities, it is important to understand their robustness under network perturbations. Previous work in community robustness analysis has focused on studying changes in the community structure as a response of edge rewiring and node or edge removal. However, the impact of increasing connectivity on t… ▽ More

    Submitted 1 November, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 17 pages, 30 figures

    Journal ref: Phys. Rev. E 108 (2023) 054302

  23. arXiv:2302.14350  [pdf, other

    cs.CV

    Knowledge Augmented Relation Inference for Group Activity Recognition

    Authors: Xianglong Lang, Zhuming Wang, Zun Li, Meng Tian, Ge Shi, Lifang Wu, Liang Wang

    Abstract: Most existing group activity recognition methods construct spatial-temporal relations merely based on visual representation. Some methods introduce extra knowledge, such as action labels, to build semantic relations and use them to refine the visual presentation. However, the knowledge they explored just stay at the semantic-level, which is insufficient for pursing notable accuracy. In this paper,… ▽ More

    Submitted 1 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

  24. arXiv:2212.01382  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    Welfare and Fairness in Multi-objective Reinforcement Learning

    Authors: Zimeng Fan, Nianli Peng, Muhang Tian, Brandon Fain

    Abstract: We study fair multi-objective reinforcement learning in which an agent must learn a policy that simultaneously achieves high reward on multiple dimensions of a vector-valued reward. Motivated by the fair resource allocation literature, we model this as an expected welfare maximization problem, for some nonlinear fair welfare function of the vector of long-term cumulative rewards. One canonical exa… ▽ More

    Submitted 12 November, 2023; v1 submitted 29 November, 2022; originally announced December 2022.

  25. arXiv:2211.10805  [pdf, other

    stat.ML cs.LG math.ST

    On the Pointwise Behavior of Recursive Partitioning and Its Implications for Heterogeneous Causal Effect Estimation

    Authors: Matias D. Cattaneo, Jason M. Klusowski, Peter M. Tian

    Abstract: Decision tree learning is increasingly being used for pointwise inference. Important applications include causal heterogenous treatment effects and dynamic policy decisions, as well as conditional quantile regression and design of experiments, where tree estimation and inference is conducted at specific values of the covariates. In this paper, we call into question the use of decision trees (train… ▽ More

    Submitted 6 February, 2024; v1 submitted 19 November, 2022; originally announced November 2022.

  26. What Do Children and Parents Want and Perceive in Conversational Agents? Towards Transparent, Trustworthy, Democratized Agents

    Authors: Jessica Van Brummelen, Maura Kelleher, Mingyan Claire Tian, Nghi Hoang Nguyen

    Abstract: Historically, researchers have focused on analyzing WEIRD, adult perspectives on technology. This means we may not have technology developed appropriately for children and those from non-WEIRD countries. In this paper, we analyze children and parents from various countries' perspectives on an emerging technology: conversational agents. We aim to better understand participants' trust of agents, par… ▽ More

    Submitted 20 January, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 18 pages, 9 figures, submitted to IDC 2023, for associated appendix: https://gist.github.com/jessvb/fa1d4c75910106d730d194ffd4d725d3

  27. arXiv:2209.05063  [pdf, other

    cs.HC

    Learning Affects Trust: Design Recommendations and Concepts for Teaching Children -- and Nearly Anyone -- about Conversational Agents

    Authors: Jessica Van Brummelen, Mingyan Claire Tian, Maura Kelleher, Nghi Hoang Nguyen

    Abstract: Research has shown that human-agent relationships form in similar ways to human-human relationships. Since children do not have the same critical analysis skills as adults (and may over-trust technology, for example), this relationship-formation is concerning. Nonetheless, little research investigates children's perceptions of conversational agents in-depth, and even less investigates how educatio… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 9 pages, 11 figures, submitted to EAAI at AAAI 2023, for associated appendix: https://gist.github.com/jessvb/e35bc0daf859c30f73008a1ad1b37824

  28. arXiv:2208.11353  [pdf, ps, other

    cs.CV

    Research on Mask Wearing Detection of Natural Population Based on Improved YOLOv4

    Authors: Xuecheng Wu, Mengmeng Tian, Lanhang Zhai

    Abstract: Recently, the domestic COVID-19 epidemic situation has been serious, but in some public places, some people do not wear masks or wear masks incorrectly, which requires the relevant staff to instantly remind and supervise them to wear masks correctly. However, in the face of such important and complicated work, it is necessary to carry out automated mask wearing detection in public places. This pap… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 4 pages, 1 figures

  29. arXiv:2208.11346  [pdf, ps, other

    cs.CV

    ICANet: A Method of Short Video Emotion Recognition Driven by Multimodal Data

    Authors: Xuecheng Wu, Mengmeng Tian, Lanhang Zhai

    Abstract: With the fast development of artificial intelligence and short videos, emotion recognition in short videos has become one of the most important research topics in human-computer interaction. At present, most emotion recognition methods still stay in a single modality. However, in daily life, human beings will usually disguise their real emotions, which leads to the problem that the accuracy of sin… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 4 pages, 5 figures

  30. arXiv:2206.05488  [pdf

    cs.CV cs.AI

    Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventional

    Authors: Mingchuan Tian, Guangway Teng, Yipeng Bao

    Abstract: This work aims to explore a convolution-free base classifier that can be used to widen the variations of the conventional ensemble classifier. Specifically, we propose Vision Transformers as base classifiers to combine with CNNs for a unique ensemble solution in Kaggle kinship recognition. In this paper, we verify our proposed idea by implementing and optimizing variants of the Vision Transformer… ▽ More

    Submitted 11 June, 2022; originally announced June 2022.

  31. arXiv:2205.10323  [pdf

    eess.SP cs.IT

    Low power communication signal enhancement method of Internet of things based on nonlocal mean denoising

    Authors: Mingchuan Tian, Jizheng Liu

    Abstract: In order to improve the transmission effect of low-power communication signal of Internet of things and compress the enhancement time of low-power communication signal, this paper designs a low-power communication signal enhancement method of Internet of things based on nonlocal mean denoising. Firstly, the residual of one-dimensional communication layer is pre processed by convolution core to obt… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

  32. arXiv:2203.03498  [pdf, other

    cs.CV

    Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation

    Authors: Meng Tian, Gim Hee Lee

    Abstract: State-of-the-art approaches for 6D object pose estimation require large amounts of labeled data to train the deep networks. However, the acquisition of 6D object pose annotations is tedious and labor-intensive in large quantity. To alleviate this problem, we propose a weakly supervised 6D object pose estimation approach based on 2D keypoint detection. Our method trains only on image pairs with kno… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  33. SODA: Site Object Detection dAtaset for Deep Learning in Construction

    Authors: Rui Duan, Hui Deng, Mao Tian, Yichuan Deng, Jiarui Lin

    Abstract: Computer vision-based deep learning object detection algorithms have been developed sufficiently powerful to support the ability to recognize various objects. Although there are currently general datasets for object detection, there is still a lack of large-scale, open-source dataset for the construction industry, which limits the developments of object detection algorithms as they tend to be data… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: Automation in Construction, 2022

  34. arXiv:2202.06548  [pdf, other

    eess.IV cs.LG

    A resource-efficient deep learning framework for low-dose brain PET image reconstruction and analysis

    Authors: Yu Fu, Shunjie Dong, Yi Liao, Le Xue, Yuanfan Xu, Feng Li, Qianqian Yang, Tianbai Yu, Mei Tian, Cheng Zhuo

    Abstract: 18F-fluorodeoxyglucose (18F-FDG) Positron Emission Tomography (PET) imaging usually needs a full-dose radioactive tracer to obtain satisfactory diagnostic results, which raises concerns about the potential health risks of radiation exposure, especially for pediatric patients. Reconstructing the low-dose PET (L-PET) images to the high-quality full-dose PET (F-PET) ones is an effective way that both… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  35. arXiv:2201.11133  [pdf, other

    gr-qc astro-ph.IM cs.AI cs.DC cs.LG

    Inference-optimized AI and high performance computing for gravitational wave detection at scale

    Authors: Pranshu Chaturvedi, Asad Khan, Minyang Tian, E. A. Huerta, Huihuo Zheng

    Abstract: We introduce an ensemble of artificial intelligence models for gravitational wave detection that we trained in the Summit supercomputer using 32 nodes, equivalent to 192 NVIDIA V100 GPUs, within 2 hours. Once fully trained, we optimized these models for accelerated inference using NVIDIA TensorRT. We deployed our inference-optimized AI ensemble in the ThetaGPU supercomputer at Argonne Leadership C… ▽ More

    Submitted 17 February, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: 19 pages, 8 figures; v2. Accepted to Frontiers in Artificial Intelligence, Special Issue: Efficient AI in Particle Physics and Astrophysics

    MSC Class: 68T10; 85-08; 83C35; 83C57 ACM Class: I.2

    Journal ref: Front. Artif. Intell. 5:828672 (2022)

  36. arXiv:2201.04019  [pdf, other

    cs.CV cs.AI

    Pyramid Fusion Transformer for Semantic Segmentation

    Authors: Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li

    Abstract: The recently proposed MaskFormer gives a refreshed perspective on the task of semantic segmentation: it shifts from the popular pixel-level classification paradigm to a mask-level classification method. In essence, it generates paired probabilities and masks corresponding to category segments and combines them during inference for the segmentation maps. In our study, we find that per-mask classifi… ▽ More

    Submitted 30 May, 2023; v1 submitted 11 January, 2022; originally announced January 2022.

  37. arXiv:2110.13261  [pdf, other

    quant-ph cs.DS

    SWAP Test for an Arbitrary Number of Quantum States

    Authors: Xavier Gitiaux, Ian Morris, Maria Emelianenko, Mingzhen Tian

    Abstract: We develop a recursive algorithm to generalize the quantum SWAP test for an arbitrary number $m$ of quantum states requiring $O(m)$ controlled-swap (CSWAP) gates and $O(\log m)$ ancillary qubits. We construct a quantum circuit able to simultaneously measure overlaps of $m$ arbitrary pure states. Our construction relies on a pairing unitary that generates a superposition state where every pair of i… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  38. arXiv:2109.02303  [pdf, other

    cs.CV

    Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

    Authors: Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li

    Abstract: 3D human shape and pose estimation is the essential task for human motion analysis, which is widely used in many 3D applications. However, existing methods cannot simultaneously capture the relations at multiple levels, including spatial-temporal level and human joint level. Therefore they fail to make accurate predictions in some hard scenarios when there is cluttered background, occlusion, or ex… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  39. arXiv:2108.08505  [pdf, other

    eess.IV cs.CV

    Blindly Assess Quality of In-the-Wild Videos via Quality-aware Pre-training and Motion Perception

    Authors: Bowen Li, Weixia Zhang, Meng Tian, Guangtao Zhai, Xianpei Wang

    Abstract: Perceptual quality assessment of the videos acquired in the wilds is of vital importance for quality assurance of video services. The inaccessibility of reference videos with pristine quality and the complexity of authentic distortions pose great challenges for this kind of blind video quality assessment (BVQA) task. Although model-based transfer learning is an effective and efficient paradigm for… ▽ More

    Submitted 5 April, 2022; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to IEEE TCSVT

  40. arXiv:2108.05568  [pdf, other

    cs.DC cs.LG

    A Contract Theory based Incentive Mechanism for Federated Learning

    Authors: Mengmeng Tian, Yuxin Chen, Yuan Liu, Zehui Xiong, Cyril Leung, Chunyan Miao

    Abstract: Federated learning (FL) serves as a data privacy-preserved machine learning paradigm, and realizes the collaborative model trained by distributed clients. To accomplish an FL task, the task publisher needs to pay financial incentives to the FL server and FL server offloads the task to the contributing FL clients. It is challenging to design proper incentives for the FL clients due to the fact that… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 7 pages, 2 figures, International Workshop on Federated and Transfer Learning for Data Sparsity and Confidentiality in Conjunction with IJCAI 2021 (FTL-IJCAI'21), Best Student Paper Award

  41. arXiv:2107.00222  [pdf, other

    cs.CV cs.RO

    Deep auxiliary learning for visual localization using colorization task

    Authors: Mi Tian, Qiong Nie, Hao Shen, Xiahua Xia

    Abstract: Visual localization is one of the most important components for robotics and autonomous driving. Recently, inspiring results have been shown with CNN-based methods which provide a direct formulation to end-to-end regress 6-DoF absolute pose. Additional information like geometric or semantic constraints is generally introduced to improve performance. Especially, the latter can aggregate high-level… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

  42. arXiv:2105.11422  [pdf, other

    cs.CV cs.AI

    Multi-Level Attentive Convoluntional Neural Network for Crowd Counting

    Authors: Mengxiao Tian, Hao Guo, Chengjiang Long

    Abstract: Recently the crowd counting has received more and more attention. Especially the technology of high-density environment has become an important research content, and the relevant methods for the existence of extremely dense crowd are not optimal. In this paper, we propose a multi-level attentive Convolutional Neural Network (MLAttnCNN) for crowd counting. We extract high-level contextual informati… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

  43. arXiv:2104.13881  [pdf, ps, other

    stat.ML cs.LG math.ST

    Large Scale Prediction with Decision Trees

    Authors: Jason M. Klusowski, Peter M. Tian

    Abstract: This paper shows that decision trees constructed with Classification and Regression Trees (CART) and C4.5 methodology are consistent for regression and classification tasks, even when the number of predictor variables grows sub-exponentially with the sample size, under natural 0-norm and 1-norm sparsity constraints. The theory applies to a wide range of models, including (ordinary or logistic) add… ▽ More

    Submitted 13 November, 2023; v1 submitted 28 April, 2021; originally announced April 2021.

  44. arXiv:2102.11099  [pdf, other

    eess.IV cs.CV

    RCoNet: Deformable Mutual Information Maximization and High-order Uncertainty-aware Learning for Robust COVID-19 Detection

    Authors: Shunjie Dong, Qianqian Yang, Yu Fu, Mei Tian, Cheng Zhuo

    Abstract: The novel 2019 Coronavirus (COVID-19) infection has spread world widely and is currently a major healthcare challenge around the world. Chest Computed Tomography (CT) and X-ray images have been well recognized to be two effective techniques for clinical COVID-19 disease diagnoses. Due to faster imaging time and considerably lower cost than CT, detecting COVID-19 in chest X-ray (CXR) images is pref… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  45. arXiv:2012.15419  [pdf, other

    cs.CL cs.LG

    An Experimental Evaluation of Transformer-based Language Models in the Biomedical Domain

    Authors: Paul Grouchy, Shobhit Jain, Michael Liu, Kuhan Wang, Max Tian, Nidhi Arora, Hillary Ngai, Faiza Khan Khattak, Elham Dolatabadi, Sedef Akinli Kocak

    Abstract: With the growing amount of text in health data, there have been rapid advances in large pre-trained models that can be applied to a wide variety of biomedical tasks with minimal task-specific modifications. Emphasizing the cost of these models, which renders technical replication challenging, this paper summarizes experiments conducted in replicating BioBERT and further pre-training and careful fi… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

  46. arXiv:2012.08545  [pdf, other

    gr-qc astro-ph.IM cs.AI cs.DC

    Accelerated, Scalable and Reproducible AI-driven Gravitational Wave Detection

    Authors: E. A. Huerta, Asad Khan, Xiaobo Huang, Minyang Tian, Maksim Levental, Ryan Chard, Wei Wei, Maeve Heflin, Daniel S. Katz, Volodymyr Kindratenko, Dawei Mu, Ben Blaiszik, Ian Foster

    Abstract: The development of reusable artificial intelligence (AI) models for wider use and rigorous validation by the community promises to unlock new opportunities in multi-messenger astrophysics. Here we develop a workflow that connects the Data and Learning Hub for Science, a repository for publishing AI models, with the Hardware Accelerated Learning (HAL) cluster, using funcX as a universal distributed… ▽ More

    Submitted 9 July, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 17 pages, 5 figures; v2: 12 pages, 6 figures. Accepted to Nature Astronomy. See also the Behind the Paper blog in Nature Astronomy "https://astronomycommunity.nature.com/posts/from-disruption-to-sustained-innovation-artificial-intelligence-for-gravitational-wave-astrophysics"

    MSC Class: 68T01; 68T35; 83C35; 83C57

    Journal ref: Nat Astron 5, 1062-1068 (2021)

  47. arXiv:2012.05352  [pdf

    eess.SY cs.LG

    Electric Vehicle Battery Remaining Charging Time Estimation Considering Charging Accuracy and Charging Profile Prediction

    Authors: Junzhe Shi, Min Tian, Sangwoo Han, Tung-Yan Wu, Yifan Tang

    Abstract: Electric vehicles (EVs) have been growing rapidly in popularity in recent years and have become a future trend. It is an important aspect of user experience to know the Remaining Charging Time (RCT) of an EV with confidence. However, it is difficult to find an algorithm that accurately estimates the RCT for vehicles in the current EV market. The maximum RCT estimation error of the Tesla Model X ca… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

  48. arXiv:2011.02683  [pdf, other

    stat.ML cs.LG

    Nonparametric Variable Screening with Optimal Decision Stumps

    Authors: Jason M. Klusowski, Peter M. Tian

    Abstract: Decision trees and their ensembles are endowed with a rich set of diagnostic tools for ranking and screening variables in a predictive model. Despite the widespread use of tree based variable importance measures, pinning down their theoretical properties has been challenging and therefore largely unexplored. To address this gap between theory and practice, we derive finite sample performance guara… ▽ More

    Submitted 10 December, 2020; v1 submitted 5 November, 2020; originally announced November 2020.

  49. arXiv:2011.00569  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation

    Authors: Jia-Hong Huang, Chao-Han Huck Yang, Fangyu Liu, Meng Tian, Yi-Chieh Liu, Ting-Wei Wu, I-Hung Lin, Kang Wang, Hiromasa Morikawa, Hernghua Chang, Jesper Tegner, Marcel Worring

    Abstract: In this work, we propose an AI-based method that intends to improve the conventional retinal disease treatment procedure and help ophthalmologists increase diagnosis efficiency and accuracy. The proposed method is composed of a deep neural networks-based (DNN-based) module, including a retinal disease identifier and clinical description generator, and a DNN visual explanation module. To train and… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted to IEEE WACV 2021

  50. arXiv:2007.09798  [pdf, other

    cs.IR

    Counterfactual Learning to Rank using Heterogeneous Treatment Effect Estimation

    Authors: Mucun Tian, Chun Guo, Vito Ostuni, Zhen Zhu

    Abstract: Learning-to-Rank (LTR) models trained from implicit feedback (e.g. clicks) suffer from inherent biases. A well-known one is the position bias -- documents in top positions are more likely to receive clicks due in part to their position advantages. To unbiasedly learn to rank, existing counterfactual frameworks first estimate the propensity (probability) of missing clicks with intervention data fro… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 9 pages; to be published in SIGIR eCom'20

    ACM Class: H.3.3