Skip to main content

Showing 101–150 of 321 results for author: Huo, Y

.
  1. arXiv:2304.04155  [pdf, other

    eess.IV cs.CV

    Segment Anything Model (SAM) for Digital Pathology: Assess Zero-shot Segmentation on Whole Slide Imaging

    Authors: Ruining Deng, Can Cui, Quan Liu, Tianyuan Yao, Lucas W. Remedios, Shunxing Bao, Bennett A. Landman, Lee E. Wheless, Lori A. Coburn, Keith T. Wilson, Yaohong Wang, Shilin Zhao, Agnes B. Fogo, Haichun Yang, Yucheng Tang, Yuankai Huo

    Abstract: The segment anything model (SAM) was released as a foundation model for image segmentation. The promptable segmentation model was trained by over 1 billion masks on 11M licensed and privacy-respecting images. The model supports zero-shot image segmentation with various segmentation prompts (e.g., points, boxes, masks). It makes the SAM attractive for medical image analysis, especially for digital… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  2. arXiv:2304.03760  [pdf, other

    eess.IV cs.CV

    Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior

    Authors: Kaiwen Xu, Aravind R. Krishnan, Thomas Z. Li, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Anatomically consistent field-of-view (FOV) completion to recover truncated body sections has important applications in quantitative analyses of computed tomography (CT) with limited FOV. Existing solution based on conditional generative models relies on the fidelity of synthetic truncation patterns at training phase, which poses limitations for the generalizability of the method to potential unkn… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Submitted to MIDL 2023, short paper track

  3. Microstructure and mechanical properties of mechanically-alloyed CoCrFeNi high-entropy alloys using low ball-to-powder ratio

    Authors: A. Olejarz, W. Y. Huo, M. Zielinski, R. Diduszko, E. Wyszkowska, A. Kosinska, D. Kalita, I. Jozwik, M. Chmielewski, F. Fang, L. Kurpaska

    Abstract: High-entropy alloys are extensively studied due to their very promising properties. However manufacturing methods currently used to prepare HEAs are complicated, costly, and likely non-industrially scalable processes. This limits their evolution and poses questions regarding the material's applicability in the future. Considering the abovementioned point, we developed a novel methodology for effic… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: Volume 938, 25 March 2023, 168196

  4. arXiv:2304.00216  [pdf, other

    eess.IV cs.CV cs.LG

    Cross-scale Multi-instance Learning for Pathological Image Diagnosis

    Authors: Ruining Deng, Can Cui, Lucas W. Remedios, Shunxing Bao, R. Michael Womick, Sophie Chiron, Jia Li, Joseph T. Roland, Ken S. Lau, Qi Liu, Keith T. Wilson, Yaohong Wang, Lori A. Coburn, Bennett A. Landman, Yuankai Huo

    Abstract: Analyzing high resolution whole slide images (WSIs) with regard to information across multiple scales poses a significant challenge in digital pathology. Multi-instance learning (MIL) is a common solution for working with high resolution images by classifying bags of objects (i.e. sets of smaller image patches). However, such processing is typically performed at a single scale (e.g., 20x magnifica… ▽ More

    Submitted 16 February, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

  5. arXiv:2303.16376  [pdf, other

    cs.LG

    A Unified Learning Model for Estimating Fiber Orientation Distribution Functions on Heterogeneous Multi-shell Diffusion-weighted MRI

    Authors: Tianyuan Yao, Nancy Newlin, Praitayini Kanakaraj, Vishwesh nath, Leon Y Cai, Karthik Ramadass, Kurt Schilling, Bennett A. Landman, Yuankai Huo

    Abstract: Diffusion-weighted (DW) MRI measures the direction and scale of the local diffusion process in every voxel through its spectrum in q-space, typically acquired in one or more shells. Recent developments in micro-structure imaging and multi-tissue decomposition have sparked renewed attention to the radial b-value dependence of the signal. Applications in tissue classification and micro-architecture… ▽ More

    Submitted 29 January, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  6. arXiv:2303.10674  [pdf

    cs.LG cs.AI

    URM4DMU: an user represention model for darknet markets users

    Authors: Hongmeng Liu, Jiapeng Zhao, Yixuan Huo, Yuyan Wang, Chun Liao, Liyan Shen, Shiyao Cui, **qiao Shi

    Abstract: Darknet markets provide a large platform for trading illicit goods and services due to their anonymity. Learning an invariant representation of each user based on their posts on different markets makes it easy to aggregate user information across different platforms, which helps identify anonymous users. Traditional user representation methods mainly rely on modeling the text information of posts… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

    Comments: 9pages

    MSC Class: 62 (Primary); 54 (Secondary) ACM Class: I.2.7

  7. arXiv:2303.07634  [pdf, other

    cs.CV cs.AI cs.GR

    I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

    Authors: **gsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

    Abstract: In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs). Our holistic neural SDF-based framework jointly recovers the underlying shapes, incident radiance and materials from multi-view images. We introduce a novel bubble loss for fine-grained small objects and error-gu… ▽ More

    Submitted 29 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023, project page: https://**gsenzhu.github.io/i2-sdf

  8. arXiv:2303.05785  [pdf, other

    eess.IV cs.CV cs.LG

    Scaling Up 3D Kernels with Bayesian Frequency Re-parameterization for Medical Image Segmentation

    Authors: Ho Hin Lee, Quan Liu, Shunxing Bao, Qi Yang, Xin Yu, Leon Y. Cai, Thomas Li, Yuankai Huo, Xenofon Koutsoukos, Bennett A. Landman

    Abstract: With the inspiration of vision transformers, the concept of depth-wise convolution revisits to provide a large Effective Receptive Field (ERF) using Large Kernel (LK) sizes for medical image segmentation. However, the segmentation performance might be saturated and even degraded as the kernel sizes scaled up (e.g., $21\times 21\times 21$) in a Convolutional Neural Network (CNN). We hypothesize tha… ▽ More

    Submitted 5 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted to MICCAI 2023 (top 13.6%), both codes and pretrained models are available at: https://github.com/MASILab/RepUX-Net

  9. arXiv:2302.06605  [pdf, other

    cs.CV cs.CL

    UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling

    Authors: Haoyu Lu, Yuqi Huo, Guoxing Yang, Zhiwu Lu, Wei Zhan, Masayoshi Tomizuka, Mingyu Ding

    Abstract: Large-scale vision-language pre-trained models have shown promising transferability to various downstream tasks. As the size of these foundation models and the number of downstream tasks grow, the standard full fine-tuning paradigm becomes unsustainable due to heavy computational and storage costs. This paper proposes UniAdapter, which unifies unimodal and multimodal adapters for parameter-efficie… ▽ More

    Submitted 21 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

  10. On the Schrödinger-Poisson system with $(p,q)$-Laplacian

    Authors: Yueqiang Song, Yuanyuan Huo, Dušan D. Repovš

    Abstract: We study a class of Schrödinger-Poisson systems with $(p,q)$-Laplacian. Using fixed point theory, we obtain a new existence result for nontrivial solutions. The main novelty of the paper is the combination of a double phase operator and the nonlocal term. Our results generalize some known results.

    Submitted 1 February, 2023; originally announced February 2023.

    MSC Class: 35J47; 35J60; 35R11

    Journal ref: Appl. Math. Lett. 141 (2023), art. 108595, 6 pp

  11. arXiv:2302.00133  [pdf, ps, other

    cs.DS

    Sublinear Approximation Schemes for Scheduling Precedence Graphs of Bounded Depth

    Authors: Bin Fu, Yumei Huo, Hairong Zhao

    Abstract: We study the classical scheduling problem on parallel machines %with precedence constraints where the precedence graph has the bounded depth $h$. Our goal is to minimize the maximum completion time. We focus on develo** approximation algorithms that use only sublinear space or sublinear time. We develop the first one-pass streaming approximation schemes using sublinear space when all jobs' proce… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

  12. arXiv:2301.01703  [pdf, other

    cs.IT eess.SP

    Technology Trends for Massive MIMO towards 6G

    Authors: Yiming Huo, Xingqin Lin, Boya Di, Hongliang Zhang, Francisco Javier Lorca Hernando, Ahmet Serdar Tan, Shahid Mumtaz, Özlem Tuğfe Demir, Kun Chen-Hu

    Abstract: At the dawn of the next-generation wireless systems and networks, massive multiple-input multiple-output (MIMO) has been envisioned as one of the enabling technologies. With the continued success of being applied in the 5G and beyond, the massive MIMO technology has demonstrated its advantageousness, integrability, and extendibility. Moreover, several evolutionary features and revolutionizing tren… ▽ More

    Submitted 5 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 7 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  13. Selective conformal inference with false coverage-statement rate control

    Authors: Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou

    Abstract: Conformal inference is a popular tool for constructing prediction intervals (PI). We consider here the scenario of post-selection/selective conformal inference, that is PIs are reported only for individuals selected from an unlabeled test data. To account for multiplicity, we develop a general split conformal framework to construct selective PIs with the false coverage-statement rate (FCR) control… ▽ More

    Submitted 12 March, 2024; v1 submitted 2 January, 2023; originally announced January 2023.

  14. Eliminating temporal correlation in quantum-dot entangled photon source by quantum interference

    Authors: Run-Ze Liu, Yu-Kun Qiao, Han-Sen Zhong, Zhen-Xuan Ge, Hui Wang, Tung-Hsun Chung, Chao-Yang Lu, Yong-Heng Huo, Jian-Wei Pan

    Abstract: Semiconductor quantum dots, as promising solid-state platform, have exhibited deterministic photon pair generation with high polarization entanglement f\textcompwordmark idelity for quantum information applications. However, due to temporal correlation from inherently cascaded emission, photon indistinguishability is limited, which restricts their potential scalability to multi-photon experiments.… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

  15. Experimental quantum computational chemistry with optimised unitary coupled cluster ansatz

    Authors: Shaojun Guo, **zhao Sun, Haoran Qian, Ming Gong, Yukun Zhang, Fusheng Chen, Yangsen Ye, Yulin Wu, Sirui Cao, Kun Liu, Chen Zha, Chong Ying, Qingling Zhu, He-Liang Huang, Youwei Zhao, Shaowei Li, Shiyu Wang, Jiale Yu, Dao** Fan, Dachao Wu, Hong Su, Hui Deng, Hao Rong, Yuan Li, Kaili Zhang , et al. (13 additional authors not shown)

    Abstract: Quantum computational chemistry has emerged as an important application of quantum computing. Hybrid quantum-classical computing methods, such as variational quantum eigensolvers (VQE), have been designed as promising solutions to quantum chemistry problems, yet challenges due to theoretical complexity and experimental imperfections hinder progress in achieving reliable and accurate results. Exper… ▽ More

    Submitted 17 June, 2024; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 11 pages, 4 figures in the main text, and 29 pages supplementary materials with 17 figures

  16. arXiv:2212.00059  [pdf, other

    eess.IV cs.CV

    Single Slice Thigh CT Muscle Group Segmentation with Domain Adaptation and Self-Training

    Authors: Qi Yang, Xin Yu, Ho Hin Lee, Leon Y. Cai, Kaiwen Xu, Shunxing Bao, Yuankai Huo, Ann Zenobia Moore, Sokratis Makrogiannis, Luigi Ferrucci, Bennett A. Landman

    Abstract: Objective: Thigh muscle group segmentation is important for assessment of muscle anatomy, metabolic disease and aging. Many efforts have been put into quantifying muscle tissues with magnetic resonance (MR) imaging including manual annotation of individual muscles. However, leveraging publicly available annotations in MR images to achieve muscle group segmentation on single slice computed tomograp… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  17. arXiv:2211.05436  [pdf, other

    cond-mat.mtrl-sci cond-mat.dis-nn cond-mat.mes-hall cond-mat.stat-mech

    Dynamic nanoindentation and short-range order in equiatomic NiCoCr medium entropy alloy lead to novel density wave ordering

    Authors: A. Naghdi, F. J. Dominguez-Gutierrez, W. Y. Huo, K. Karimi, S. Papanikolaou

    Abstract: Chemical short-range order (CSRO) is believed to be a key contributor to the exceptional properties of multicomponent alloys. However, direct validation and confirmation of CSRO has been highly elusive in most compounds. Recent studies for equiatomic NiCoCr alloys have shown that thermal treatments (i.e., annealing/aging) may facilitate and manipulate CSRO. In this work, by using molecular simulat… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures

  18. arXiv:2211.03017  [pdf, other

    cs.CV cs.AI cs.GR

    Learning-based Inverse Rendering of Complex Indoor Scenes with Differentiable Monte Carlo Raytracing

    Authors: **gsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Jiaxiang Zheng, Rui Tang, Hujun Bao, Rui Wang

    Abstract: Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem. This work presents an end-to-end, learning-based inverse rendering framework incorporating differentiable Monte Carlo raytracing with importance sampling. The framework takes a single image as input to jointly recover the underlying geometry, spa… ▽ More

    Submitted 23 November, 2022; v1 submitted 5 November, 2022; originally announced November 2022.

  19. arXiv:2211.01254  [pdf, other

    cs.CV

    CircleSnake: Instance Segmentation with Circle Representation

    Authors: Ethan H. Nguyen, Haichun Yang, Zuhayr Asad, Ruining Deng, Agnes B. Fogo, Yuankai Huo

    Abstract: Circle representation has recently been introduced as a medical imaging optimized representation for more effective instance object detection on ball-shaped medical objects. With its superior performance on instance detection, it is appealing to extend the circle representation to instance medical object segmentation. In this work, we propose CircleSnake, a simple end-to-end circle contour deforma… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Machine Learning in Medical Imaging Workshop for 2022 MICCAI

  20. arXiv:2210.16705  [pdf, other

    eess.SP

    Distributed Swarm Learning for Internet of Things at the Edge: Where Artificial Intelligence Meets Biological Intelligence

    Authors: Yue Wang, Zhi Tian, Xin Fan, Yan Huo, Cameron Nowzari, Kai Zeng

    Abstract: With the proliferation of versatile Internet of Things (IoT) services, smart IoT devices are increasingly deployed at the edge of wireless networks to perform collaborative machine learning tasks using locally collected data, giving rise to the edge learning paradigm. Due to device restrictions and resource constraints, edge learning among massive IoT devices faces major technical challenges cause… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  21. arXiv:2210.09245  [pdf, other

    cs.RO cs.AI

    Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

    Authors: Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

    Abstract: 3D grasp synthesis generates gras** poses given an input object. Existing works tackle the problem by learning a direct map** from objects to the distributions of gras** poses. However, because the physical contact is sensitive to small changes in pose, the high-nonlinear map** between 3D object representation to valid poses is considerably non-smooth, leading to poor generation efficiency… ▽ More

    Submitted 6 May, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted at IJCAI 2023

  22. arXiv:2210.08652  [pdf, other

    cs.CV cs.LG

    Adaptive Contrastive Learning with Dynamic Correlation for Multi-Phase Organ Segmentation

    Authors: Ho Hin Lee, Yucheng Tang, Han Liu, Yubo Fan, Leon Y. Cai, Qi Yang, Xin Yu, Shunxing Bao, Yuankai Huo, Bennett A. Landman

    Abstract: Recent studies have demonstrated the superior performance of introducing ``scan-wise" contrast labels into contrastive learning for multi-organ segmentation on multi-phase computed tomography (CT). However, such scan-wise labels are limited: (1) a coarse classification, which could not capture the fine-grained ``organ-wise" contrast variations across all organs; (2) the label (i.e., contrast phase… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: 11 pages

  23. arXiv:2210.07006  [pdf, other

    cs.LG cs.AI

    Sustainable Online Reinforcement Learning for Auto-bidding

    Authors: Zhiyu Mou, Yusen Huo, Rongquan Bai, Mingzhou Xie, Chuan Yu, Jian Xu, Bo Zheng

    Abstract: Recently, auto-bidding technique has become an essential tool to increase the revenue of advertisers. Facing the complex and ever-changing bidding environments in the real-world advertising system (RAS), state-of-the-art auto-bidding policies usually leverage reinforcement learning (RL) algorithms to generate real-time bids on behalf of the advertisers. Due to safety concerns, it was believed that… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  24. Distributed Reconfigurable Intelligent Surfaces for Energy Efficient Indoor Terahertz Wireless Communications

    Authors: Yiming Huo, Xiaodai Dong, Nuwan Ferdinand

    Abstract: With the fifth-generation (5G) networks widely commercialized and fast deployed, the sixth-generation (6G) wireless communication is envisioned to provide competitive quality of service (QoS) in multiple aspects to global users. The critical and underlying research of the 6G is, firstly, highly dependent on the precise modeling and characterization of the wireless propagation when the spectrum is… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 15 Pages, 9 Figures, 2 Tables. To appear in the IEEE Internet of Things Journal

  25. arXiv:2210.06246  [pdf, other

    cs.CL

    CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm

    Authors: Hongming Zhang, Yintong Huo, Yanai Elazar, Yangqiu Song, Yoav Goldberg, Dan Roth

    Abstract: Recently, the community has achieved substantial progress on many commonsense reasoning benchmarks. However, it is still unclear what is learned from the training process: the knowledge, inference capability, or both? We argue that due to the large scale of commonsense knowledge, it is infeasible to annotate a large enough training set for each task to cover all commonsense for learning. Thus we s… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  26. ImmFusion: Robust mmWave-RGB Fusion for 3D Human Body Reconstruction in All Weather Conditions

    Authors: Anjun Chen, Xiangyu Wang, Kun Shi, Shaohao Zhu, Bin Fang, Yingfeng Chen, Jiming Chen, Yuchi Huo, Qi Ye

    Abstract: 3D human reconstruction from RGB images achieves decent results in good weather conditions but degrades dramatically in rough weather. Complementary, mmWave radars have been employed to reconstruct 3D human joints and meshes in rough weather. However, combining RGB and mmWave signals for robust all-weather 3D human reconstruction is still an open challenge, given the sparse nature of mmWave and th… ▽ More

    Submitted 20 September, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA2023, Project Page: https://chen3110.github.io/ImmFusion/index.html

  27. arXiv:2210.00223  [pdf, other

    cs.CV

    Contour-Aware Equipotential Learning for Semantic Segmentation

    Authors: Xu Yin, Dongbo Min, Yuchi Huo, Sung-Eui Yoon

    Abstract: With increasing demands for high-quality semantic segmentation in the industry, hard-distinguishing semantic boundaries have posed a significant threat to existing solutions. Inspired by real-life experience, i.e., combining varied observations contributes to higher visual recognition confidence, we present the equipotential learning (EPL) method. This novel module transfers the predicted/ground-t… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

  28. arXiv:2209.15076  [pdf, other

    cs.CV cs.LG

    3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation

    Authors: Ho Hin Lee, Shunxing Bao, Yuankai Huo, Bennett A. Landman

    Abstract: The recent 3D medical ViTs (e.g., SwinUNETR) achieve the state-of-the-art performances on several 3D volumetric data benchmarks, including 3D medical image segmentation. Hierarchical transformers (e.g., Swin Transformers) reintroduced several ConvNet priors and further enhanced the practical viability of adapting volumetric segmentation in 3D medical datasets. The effectiveness of hybrid approache… ▽ More

    Submitted 1 March, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to ICLR 2023

  29. Reducing Positional Variance in Cross-sectional Abdominal CT Slices with Deep Conditional Generative Models

    Authors: Xin Yu, Qi Yang, Yucheng Tang, Riqiang Gao, Shunxing Bao, LeonY. Cai, Ho Hin Lee, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D low-dose single-slice abdominal computed tomography (CT) slice enables direct measurements of body composition, which are critical to quantitatively characterizing health relationships on aging. However, longitudinal analysis of body composition changes using 2D abdominal slices is challenging due to positional variance between longitudinal slices acquired in different years. To reduce the posi… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 11 pages, 4 figures

    Journal ref: Medical Image Computing and Computer Assisted Intervention MICCAI 2022, Cham, 2022, pp202,212

  30. arXiv:2209.14378  [pdf, other

    eess.IV cs.CV

    UNesT: Local Spatial Representation Learning with Hierarchical Transformer for Efficient Medical Segmentation

    Authors: Xin Yu, Qi Yang, Yinchi Zhou, Leon Y. Cai, Riqiang Gao, Ho Hin Lee, Thomas Li, Shunxing Bao, Zhoubing Xu, Thomas A. Lasko, Richard G. Abramson, Zizhao Zhang, Yuankai Huo, Bennett A. Landman, Yucheng Tang

    Abstract: Transformer-based models, capable of learning better global dependencies, have recently demonstrated exceptional representation learning capabilities in computer vision and medical image analysis. Transformer reformats the image into separate patches and realizes global communication via the self-attention mechanism. However, positional information between patches is hard to preserve in such 1D se… ▽ More

    Submitted 7 September, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 19 pages, 17 figures. arXiv admin note: text overlap with arXiv:2203.02430

  31. arXiv:2209.11388  [pdf, other

    cs.CV cs.AI cs.MM

    LGDN: Language-Guided Denoising Network for Video-Language Modeling

    Authors: Haoyu Lu, Mingyu Ding, Nanyi Fei, Yuqi Huo, Zhiwu Lu

    Abstract: Video-language modeling has attracted much attention with the rapid growth of web videos. Most existing methods assume that the video frames and text description are semantically correlated, and focus on video-language modeling at video level. However, this hypothesis often fails for two reasons: (1) With the rich semantics of video contents, it is difficult to cover all frames with a single video… ▽ More

    Submitted 5 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted by NeurIPS2022

  32. arXiv:2208.14357  [pdf, other

    cs.CV

    Compound Figure Separation of Biomedical Images: Mining Large Datasets for Self-supervised Learning

    Authors: Tianyuan Yao, Chang Qu, Jun Long, Quan Liu, Ruining Deng, Yuanhan Tian, Jiachen Xu, Aadarsh Jha, Zuhayr Asad, Shunxing Bao, Mengyang Zhao, Agnes B. Fogo, Bennett A. Landman, Haichun Yang, Catie Chang, Yuankai Huo

    Abstract: With the rapid development of self-supervised learning (e.g., contrastive learning), the importance of having large-scale images (even without annotations) for training a more generalizable AI model has been widely recognized in medical image analysis. However, collecting large-scale task-specific unannotated data at scale can be challenging for individual labs. Existing online resources, such as… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/papers/2022:025.html. arXiv admin note: substantial text overlap with arXiv:2107.08650

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  33. arXiv:2208.07322  [pdf, other

    cs.CV cs.AI

    Cross-scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images

    Authors: Ruining Deng, Can Cui, Lucas W. Remedios, Shunxing Bao, R. Michael Womick, Sophie Chiron, Jia Li, Joseph T. Roland, Ken S. Lau, Qi Liu, Keith T. Wilson, Yaohong Wang, Lori A. Coburn, Bennett A. Landman, Yuankai Huo

    Abstract: Multi-instance learning (MIL) is widely used in the computer-aided interpretation of pathological Whole Slide Images (WSIs) to solve the lack of pixel-wise or patch-wise annotations. Often, this approach directly applies "natural image driven" MIL algorithms which overlook the multi-scale (i.e. pyramidal) nature of WSIs. Off-the-shelf MIL algorithms are typically deployed on a single-scale of WSIs… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

  34. arXiv:2208.05578  [pdf, other

    eess.SP

    CB-DSL: Communication-efficient and Byzantine-robust Distributed Swarm Learning on Non-i.i.d. Data

    Authors: Xin Fan, Yue Wang, Yan Huo, Zhi Tian

    Abstract: The valuable data collected by IoT devices in edge networks together with the resurgence of ML stimulate the latest trend of edge AI. However, recent FL methods face major challenges including communication bottleneck, data heterogeneity and security concerns in edge IoT scenarios, especially when being adopted for distributed learning among massive IoT devices equipped with limited data and trans… ▽ More

    Submitted 20 October, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: update theoretical and simulation results

  35. arXiv:2207.06551  [pdf, other

    eess.IV cs.CV

    Body Composition Assessment with Limited Field-of-view Computed Tomography: A Semantic Image Extension Perspective

    Authors: Kaiwen Xu, Thomas Li, Mirza S. Khan, Riqiang Gao, Sanja L. Antic, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Field-of-view (FOV) tissue truncation beyond the lungs is common in routine lung screening computed tomography (CT). This poses limitations for opportunistic CT- based body composition (BC) assessment as key anatomical structures are missing. Traditionally, extending the FOV of CT is considered as a CT reconstruction problem using limited data. However, this approach relies on the projection domai… ▽ More

    Submitted 15 April, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Updated with additional evaluation and clarification

  36. arXiv:2207.02651  [pdf

    cond-mat.soft

    How should the contact angle of a noncircular wetting boundary be described?

    Authors: Jianhui Zhang, Xiaosheng Chen, Zhenzhen Gui, Zhenlin Chen, Mingdong Ma, Yuxuan Huo, Weirong Zhang, Fan Zhang, Xiaosi Zhou, Xi Huang

    Abstract: For over 200 years, wettability has made significant contributions to understanding the properties of objects, advancing technological progress. Theoretical model of the contact angle (CA) for evaluating wettability has constantly been modified to address relevant emerging issues. However, these existing models disregard the difference in the CA along the contact line and use a single-point CA to… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

  37. arXiv:2207.00151  [pdf, other

    cs.NI cs.ET eess.SP eess.SY

    Space Broadband Access: The Race Has Just Begun

    Authors: Yiming Huo

    Abstract: Recent years have witnessed an exponential growth of the commercial space industry, including rocket launch, satellite network deployment, private space travel, and even extraterrestrial colonization. Several trends are predicted in this unprecedented transition to an era of space-enabled broadband access.

    Submitted 6 May, 2022; originally announced July 2022.

    Comments: 8 pages, 3 figures, 2 tables. Accepted by IEEE Magazine (https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=2)

  38. arXiv:2206.13632  [pdf, other

    eess.IV cs.CV

    Omni-Seg: A Scale-aware Dynamic Network for Renal Pathological Image Segmentation

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jun Long, Zuhayr Asad, R. Michael Womick, Zheyu Zhu, Agnes B. Fogo, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Comprehensive semantic segmentation on renal pathological images is challenging due to the heterogeneous scales of the objects. For example, on a whole slide image (WSI), the cross-sectional areas of glomeruli can be 64 times larger than that of the peritubular capillaries, making it impractical to segment both objects on the same patch, at the same scale. To handle this scaling issue, prior studi… ▽ More

    Submitted 18 January, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

  39. arXiv:2206.00123  [pdf, other

    cs.CV

    Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image Mining

    Authors: Tianyuan Yao, Yuzhe Lu, Jun Long, Aadarsh Jha, Zheyu Zhu, Zuhayr Asad, Haichun Yang, Agnes B. Fogo, Yuankai Huo

    Abstract: The quantitative detection, segmentation, and characterization of glomeruli from high-resolution whole slide imaging (WSI) play essential roles in the computer-assisted diagnosis and scientific research in digital renal pathology. Historically, such comprehensive quantification requires extensive programming skills in order to be able to handle heterogeneous and customized computational tools. To… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  40. arXiv:2205.08567  [pdf, other

    astro-ph.IM astro-ph.EP cs.CR eess.SP

    Internet of Spacecraft for Multi-planetary Defense and Prosperity

    Authors: Yiming Huo

    Abstract: Recent years have seen unprecedentedly fast-growing prosperity in the commercial space industry. Several privately funded aerospace manufacturers, such as Space Exploration Technologies Corporation (SpaceX) and Blue Origin have innovated what we used to know about this capital-intense industry and gradually reshaped the future of human civilization. As private spaceflight and multi-planetary immig… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: 28 pages, 19 figures, submitted to a journal as an invited paper

  41. arXiv:2205.05898  [pdf

    eess.IV cs.CV cs.LG

    Pseudo-Label Guided Multi-Contrast Generalization for Non-Contrast Organ-Aware Segmentation

    Authors: Ho Hin Lee, Yucheng Tang, Riqiang Gao, Qi Yang, Xin Yu, Shunxing Bao, James G. Terry, J. Jeffrey Carr, Yuankai Huo, Bennett A. Landman

    Abstract: Non-contrast computed tomography (NCCT) is commonly acquired for lung cancer screening, assessment of general abdominal pain or suspected renal stones, trauma evaluation, and many other indications. However, the absence of contrast limits distinguishing organ in-between boundaries. In this paper, we propose a novel unsupervised approach that leverages pairwise contrast-enhanced CT (CECT) context t… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

  42. arXiv:2205.03050  [pdf, other

    physics.comp-ph

    Mechanisms of strength and hardening in austenitic stainless 310S steel: Nanoindentation experiments and multiscale modeling

    Authors: F. J. Domínguez-Gutíerrez, K. Mulewska, A. Ustrzycka, R. Alvarez-Donado, A. Kosínska, W. Y. Huo, L. Kurpaska, I. Jozwik, S. Papanikolaou, M. Alava

    Abstract: Austenitic stainless steels with low carbon have exceptional mechanical properties and are capable to reduce embrittlement, due to high chromium and nickel alloying, thus they are very attractive for efficient energy production in extreme environments. It is key to perform nanomechanical investigations of the role of chromium and the form of the particular alloy composition that give rise to the e… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  43. arXiv:2204.07441  [pdf, other

    cs.CV cs.CL cs.IR

    COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

    Authors: Haoyu Lu, Nanyi Fei, Yuqi Huo, Yizhao Gao, Zhiwu Lu, Ji-Rong Wen

    Abstract: Large-scale single-stream pre-training has shown dramatic performance in image-text retrieval. Regrettably, it faces low inference efficiency due to heavy attention layers. Recently, two-stream methods like CLIP and ALIGN with high inference efficiency have also shown promising performance, however, they only consider instance-level alignment between the two streams (thus there is still room for i… ▽ More

    Submitted 20 May, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022

  44. arXiv:2204.05575  [pdf, other

    cs.CV cs.AI

    DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection

    Authors: Haibao Yu, Yizhen Luo, Mao Shu, Yiyi Huo, Zebang Yang, Yifeng Shi, Zhenglong Guo, Hanyu Li, Xing Hu, Jirui Yuan, Zaiqing Nie

    Abstract: Autonomous driving faces great safety challenges for a lack of global perspective and the limitation of long-range perception capabilities. It has been widely agreed that vehicle-infrastructure cooperation is required to achieve Level 5 autonomy. However, there is still NO dataset from real scenarios available for computer vision researchers to work on vehicle-infrastructure cooperation-related pr… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: CVPR2022

  45. arXiv:2204.03237  [pdf, other

    cs.GR

    Rule-based Procedural Tree Modeling Approach

    Authors: Yinhui Yang, Rui Wang, Yuchi Huo

    Abstract: In some entertainment and virtual reality applications, it is necessary to model and draw the real world realistically, so as to improve the fidelity of natural scenes and make users have a better sense of immersion. However, due to the morphological structure of trees The complexity and variety present many challenges for photorealistic modeling and rendering of trees. This paper reviews the prog… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  46. arXiv:2204.01976  [pdf, other

    cs.DS

    Streaming Approximation Scheme for Minimizing Total Completion Time on Parallel Machines Subject to Varying Processing Capacity

    Authors: Bin Fu, Yumei Huo, Hairong Zhao

    Abstract: We study the problem of minimizing total completion time on parallel machines subject to varying processing capacity. In this paper, we develop an approximation scheme for the problem under the data stream model where the input data is massive and cannot fit into memory and thus can only be scanned for a few passes. Our algorithm can compute the approximate value of the optimal total completion ti… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

  47. arXiv:2204.01970  [pdf, other

    cs.DS

    Streaming Algorithms for Multitasking Scheduling with Shared Processing

    Authors: Bin Fu, Yumei Huo, Hairong Zhao

    Abstract: In this paper, we design the first streaming algorithms for the problem of multitasking scheduling on parallel machines with shared processing. In one pass, our streaming approximation schemes can provide an approximate value of the optimal makespan. If the jobs can be read in two passes, the algorithm can find the schedule with the approximate value. This work not only provides an algorithmic big… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  48. arXiv:2204.01859  [pdf, other

    cs.DS

    Multitasking Scheduling with Shared Processing

    Authors: Bin Fu, Yumei Huo, Hairong Zhao

    Abstract: Recently, the problem of multitasking scheduling has attracted a lot of attention in the service industries where workers frequently perform multiple tasks by switching from one task to another. Hall, Leung and Li (Discrete Applied Mathematics 2016) proposed a shared processing multitasking scheduling model which allows a team to continue to work on the primary tasks while processing the routinely… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  49. arXiv:2203.15588  [pdf

    cs.LG cs.AI cs.CV

    Deep Multi-modal Fusion of Image and Non-image Data in Disease Diagnosis and Prognosis: A Review

    Authors: Can Cui, Haichun Yang, Yaohong Wang, Shilin Zhao, Zuhayr Asad, Lori A. Coburn, Keith T. Wilson, Bennett A. Landman, Yuankai Huo

    Abstract: The rapid development of diagnostic technologies in healthcare is leading to higher requirements for physicians to handle and integrate the heterogeneous, yet complementary data that are produced during routine practice. For instance, the personalized diagnosis and treatment planning for a single cancer patient relies on the various images (e.g., radiological, pathological, and camera images) and… ▽ More

    Submitted 26 January, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  50. arXiv:2203.14441  [pdf, other

    cs.CV cs.GR

    An Interactive Image-based Modeling System

    Authors: Zhi He, Rui Wang, Wei Hua, Yuchi Huo

    Abstract: This paper propose a interactive 3D modeling method and corresponding system based on single or multiple uncalibrated images. The main feature of this method is that, according to the modeling habits of ordinary people, the 3D model of the target is reconstructed from coarse to fine images. On the basis of determining the approximate shape, the user adds or modify projection constraints and spatia… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.