Showing 1–17 of 17 results for author: Pfister, H

  1. arXiv:2406.16935

    eess.SP cs.AI

    Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex

    Authors: Spandan Madan, Will Xiao, Mingran Cao, Hanspeter Pfister, Margaret Livingstone, Gabriel Kreiman

    Abstract: We characterized the generalization capabilities of DNN-based encoding models when predicting neuronal responses from the visual cortex. We collected MacaqueITBench, a large-scale dataset of neural population responses from the macaque inferior temporal (IT) cortex to over 300,000 images, comprising 8,233 unique natural images presented to seven monkeys over 109 sessions. Using …

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.14435

    cs.CV eess.IV

    FreSeg: Frenet-Frame-based Part Segmentation for 3D Curvilinear Structures

    Authors: Shixuan Gu, Jason Ken Adhinarta, Mikhail Bessmeltsev, Jiancheng Yang, Jessica Zhang, Daniel Berger, Jeff W. Lichtman, Hanspeter Pfister, Donglai Wei

    Abstract: Part segmentation is a crucial task for 3D curvilinear structures like neuron dendrites and blood vessels, enabling the analysis of dendritic spines and aneurysms with scientific and clinical significance. However, their diversely winded morphology poses a generalization challenge to existing deep learning methods, which leads to labor-intensive manual correction. In this work, we propose FreSeg,…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures

  3. arXiv:2402.09372

    eess.IV cs.AI cs.CV

    Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge

    Authors: Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, Pengfei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni

    Abstract: Rib fractures are a common and potentially severe injury that can be challenging and labor-intensive to detect in CT scans. While there have been efforts to address this field, the lack of large-scale annotated datasets and evaluation benchmarks has hindered the development and validation of deep learning algorithms. To address this issue, the RibFrac Challenge was introduced, providing a benchmar…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Challenge paper for MICCAI RibFrac Challenge (https://ribfrac.grand-challenge.org/)

  4. arXiv:2309.10724

    cs.CV cs.AI cs.MM cs.SD eess.AS

    Sound Source Localization is All about Cross-Modal Alignment

    Authors: Arda Senocak, Hyeonggon Ryu, Junsik Kim, Tae-Hyun Oh, Hanspeter Pfister, Joon Son Chung

    Abstract: Humans can easily perceive the direction of sound sources in a visual scene, termed sound source localization. Recent studies on learning-based sound source localization have mainly explored the problem from a localization perspective. However, prior arts and existing benchmarks do not account for a more important aspect of the problem, cross-modal semantic understanding, which is essential for ge…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  5. arXiv:2212.10431

    cs.CV cs.LG cs.MM eess.IV

    QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity

    Authors: Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister

    Abstract: The mechanism of existing style transfer algorithms is by minimizing a hybrid loss function to push the generated image toward high similarities in both content and style. However, this type of approach cannot guarantee visual fidelity, i.e., the generated artworks should be indistinguishable from real ones. In this paper, we devise a new style transfer framework called QuantArt for high visual-fi…

    Submitted 5 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to CVPR 2023. Code is available at https://github.com/siyuhuang/QuantArt

  6. arXiv:2210.09309

    eess.IV cs.CV cs.LG

    RibSeg v2: A Large-scale Benchmark for Rib Labeling and Anatomical Centerline Extraction

    Authors: Liang Jin, Shixuan Gu, Donglai Wei, Jason Ken Adhinarta, Kaiming Kuang, Yongjie Jessica Zhang, Hanspeter Pfister, Bingbing Ni, Jiancheng Yang, Ming Li

    Abstract: Automatic rib labeling and anatomical centerline extraction are common prerequisites for various clinical applications. Prior studies either use in-house datasets that are inaccessible to communities, or focus on rib segmentation that neglects the clinical significance of rib labeling. To address these issues, we extend our prior dataset (RibSeg) on the binary rib segmentation task to a comprehens…

    Submitted 1 August, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, journal

  7. arXiv:2206.07150

    eess.SY

    Attacks on Perception-Based Control Systems: Modeling and Fundamental Limits

    Authors: Amir Khazraei, Henry Pfister, Miroslav Pajic

    Abstract: We study the performance of perception-based control systems in the presence of attacks, and provide methods for modeling and analysis of their resiliency to stealthy attacks on both physical and perception-based sensing. Specifically, we consider a general setup with a nonlinear affine physical plant controlled with a perception-based controller that maps both the physical (e.g., IMUs) and percep…

    Submitted 27 August, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

  8. arXiv:2204.02844

    cs.CV eess.IV

    Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training

    Authors: Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Yulun Zhang, Hanspeter Pfister, Donglai Wei

    Abstract: Existing deep learning real denoising methods require a large amount of noisy-clean image pairs for supervision. Nonetheless, capturing a real noisy-clean dataset is an unacceptable expensive and cumbersome procedure. To alleviate this problem, this work investigates how to generate realistic noisy images. Firstly, we formulate a simple yet reasonable noise model that treats each real noisy pixel…

    Submitted 14 September, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2021

  9. arXiv:2112.05754

    eess.IV cs.CV q-bio.QM

    PyTorch Connectomics: A Scalable and Flexible Segmentation Framework for EM Connectomics

    Authors: Zudi Lin, Donglai Wei, Jeff Lichtman, Hanspeter Pfister

    Abstract: We present PyTorch Connectomics (PyTC), an open-source deep-learning framework for the semantic and instance segmentation of volumetric microscopy images, built upon PyTorch. We demonstrate the effectiveness of PyTC in the field of connectomics, which aims to segment and reconstruct neurons, synapses, and other organelles like mitochondria at nanometer resolution for understanding neuronal communi…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Technical report

  10. arXiv:2110.14795

    cs.CV cs.AI cs.LG eess.IV

    MedMNIST v2 -- A large-scale lightweight benchmark for 2D and 3D biomedical image classification

    Authors: Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, Bingbing Ni

    Abstract: We introduce MedMNIST v2, a large-scale MNIST-like dataset collection of standardized biomedical images, including 12 datasets for 2D and 6 datasets for 3D. All images are pre-processed into a small size of 28x28 (2D) or 28x28x28 (3D) with the corresponding classification labels so that no background knowledge is required for users. Covering primary data modalities in biomedical images, MedMNIST v…

    Submitted 25 September, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: The data and code are publicly available at https://medmnist.com/. arXiv admin note: text overlap with arXiv:2010.14925

    Journal ref: Scientific Data 2023

  11. arXiv:2109.09521

    eess.IV cs.AI cs.CV cs.GR cs.LG

    RibSeg Dataset and Strong Point Cloud Baselines for Rib Segmentation from CT Scans

    Authors: Jiancheng Yang, Shixuan Gu, Donglai Wei, Hanspeter Pfister, Bingbing Ni

    Abstract: Manual rib inspections in computed tomography (CT) scans are clinically critical but labor-intensive, as 24 ribs are typically elongated and oblique in 3D volumes. Automatic rib segmentation methods can speed up the process through rib measurement and visualization. However, prior arts mostly use in-house labeled datasets that are publicly unavailable and work on dense 3D volumes that are computat…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: MICCAI 2021. The dataset, code, and model are available at https://github.com/M3DV/RibSeg

  12. arXiv:2109.08684

    eess.IV cs.AI cs.CV cs.LG

    Asymmetric 3D Context Fusion for Universal Lesion Detection

    Authors: Jiancheng Yang, Yi He, Kaiming Kuang, Zudi Lin, Hanspeter Pfister, Bingbing Ni

    Abstract: Modeling 3D context is essential for high-performance 3D medical image analysis. Although 2D networks benefit from large-scale 2D supervised pretraining, it is weak in capturing 3D context. 3D networks are strong in 3D context yet lack supervised pretraining. As an emerging technique, 3D context fusion operator, which enables conversion from 2D pretrained networks, leverages the advantages…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: MICCAI 2021. The code and model are available at https://github.com/M3DV/AlignShift

  13. arXiv:2010.14258

    eess.SP cs.AI cs.IT stat.ML

    Physics-Based Deep Learning for Fiber-Optic Communication Systems

    Authors: Christian Häger, Henry D. Pfister

    Abstract: We propose a new machine-learning approach for fiber-optic communication systems whose signal propagation is governed by the nonlinear Schrödinger equation (NLSE). Our main observation is that the popular split-step method (SSM) for numerically solving the NLSE has essentially the same functional form as a deep multi-layer neural network; in both cases, one alternates linear steps and pointwise no…

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: 15 pages, 11 figures, submitted to IEEE J. Sel. Areas Commun., code available at https://github.com/chaeger/LDBP, extension of arXiv:1710.06234(1), arXiv:1804.02799(1), arXiv:1901.07592(2)

  14. arXiv:2010.12313

    eess.SP cs.IT stat.ML

    Model-Based Machine Learning for Joint Digital Backpropagation and PMD Compensation

    Authors: Rick M. Bütler, Christian Häger, Henry D. Pfister, Gabriele Liga, Alex Alvarado

    Abstract: In this paper, we propose a model-based machine-learning approach for dual-polarization systems by parameterizing the split-step Fourier method for the Manakov-PMD equation. The resulting method combines hardware-friendly time-domain nonlinearity mitigation via the recently proposed learned digital backpropagation (LDBP) with distributed compensation of polarization-mode dispersion (PMD). We refer…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 10 pages, 11 figures, to appear in the IEEE/OSA Journal of Lightwave Technology

  15. Revisiting Efficient Multi-Step Nonlinearity Compensation with Machine Learning: An Experimental Demonstration

    Authors: Vinícius Oliari, Sebastiaan Goossens, Christian Häger, Gabriele Liga, Rick M. Bütler, Menno van den Hout, Sjoerd van der Heide, Henry D. Pfister, Chigo Okonkwo, Alex Alvarado

    Abstract: Efficient nonlinearity compensation in fiber-optic communication systems is considered a key element to go beyond the "capacity crunch". One guiding principle for previous work on the design of practical nonlinearity compensation schemes is that fewer steps lead to better systems. In this paper, we challenge this assumption and show how to carefully design multi-step approaches that provide bette…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 10 pages, 5 figures. Author version of a paper published in the Journal of Lightwave Technology. OSA/IEEE copyright may apply

    Journal ref: Journal of Lightwave Technology, vol. 38, no. 12, pp. 3114-3124, 15 June, 2020

  16. arXiv:2001.09277

    eess.SP cs.IT cs.LG stat.ML

    Model-Based Machine Learning for Joint Digital Backpropagation and PMD Compensation

    Authors: Christian Häger, Henry D. Pfister, Rick M. Bütler, Gabriele Liga, Alex Alvarado

    Abstract: We propose a model-based machine-learning approach for polarization-multiplexed systems by parameterizing the split-step method for the Manakov-PMD equation. This approach performs hardware-friendly DBP and distributed PMD compensation with performance close to the PMD-free case.

    Submitted 25 January, 2020; originally announced January 2020.

    Comments: 3 pages, 2 figures

  17. arXiv:1904.09807

    eess.SP cs.AI cs.IT stat.ML

    Revisiting Multi-Step Nonlinearity Compensation with Machine Learning

    Authors: Christian Häger, Henry D. Pfister, Rick M. Bütler, Gabriele Liga, Alex Alvarado

    Abstract: For the efficient compensation of fiber nonlinearity, one of the guiding principles appears to be: fewer steps are better and more efficient. We challenge this assumption and show that carefully designed multi-step approaches can lead to better performance-complexity trade-offs than their few-step counterparts.

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: 4 pages, 3 figures, This is a preprint of a paper submitted to the 2019 European Conference on Optical Communication