Showing 1–12 of 12 results for author: Gong, B

Searching in archive eess.

  1. arXiv:2405.12367  [pdf, other]

    eess.IV cs.CV

    Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning

    Authors: Zheyuan Zhang, Elif Keles, Gorkem Durak, Yavuz Taktak, Onkar Susladkar, Vandan Gorade, Debesh Jha, Asli C. Ormeci, Alpay Medetalibeyoglu, Lanhong Yao, Bin Wang, Ilkin Sevgi Isler, Linkai Peng, Hongyi Pan, Camila Lopes Vendrami, Amir Bourhani, Yury Velichko, Boqing Gong, Concetto Spampinato, Ayis Pyrros, Pallavi Tiwari, Derk C. F. Klatte, Megan Engels, Sanne Hoogenboom, Candice W. Bolan, et al. (13 additional authors not shown)

    Abstract: Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st…

    Submitted 25 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: under review version

  2. arXiv:2404.15339  [pdf, other]

    eess.IV

    Efficient EndoNeRF Reconstruction and Its Application for Data-driven Surgical Simulation

    Authors: Yuehao Wang, Bingchen Gong, Yonghao Long, Siu Hin Fan, Qi Dou

    Abstract: The healthcare industry has a growing need for realistic modeling and efficient simulation of surgical scenes. With effective models of deformable surgical scenes, clinicians are able to conduct surgical planning and surgery training on scenarios close to real-world cases. However, a significant challenge in achieving such a goal is the scarcity of high-quality soft tissue models with accurate sha…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures. Accepted by International Journal of Computer Assisted Radiology and Surgery

  3. arXiv:2309.09565  [pdf, other]

    eess.SP

    A Covariance Adaptive Student's t Based Kalman Filter

    Authors: Benyang Gong, Jiacheng He, Gang Wang, Bei Peng

    Abstract: In the classical Kalman filter (KF), the estimated state is a linear combination of the one-step predicted state and the measurement state; their confidence levels change when the prediction mean square error matrix and the covariance matrix of the measurement noise vary. The existing Student's t based Kalman filter (TKF) works similarly to the KF; they both work well with impulse noise, but when it co…

    Submitted 18 September, 2023; originally announced September 2023.

  4. arXiv:2304.02720  [pdf, other]

    eess.IV cs.CR cs.CV

    Domain Generalization with Adversarial Intensity Attack for Medical Image Segmentation

    Authors: Zheyuan Zhang, Bin Wang, Lanhong Yao, Ugur Demir, Debesh Jha, Ismail Baris Turkbey, Boqing Gong, Ulas Bagci

    Abstract: Most statistical learning algorithms rely on an over-simplified assumption, that is, the train and test data are independent and identically distributed. In real-world scenarios, however, it is common for models to encounter data from new and different domains to which they were not exposed during training. This is often the case in medical imaging applications due to differences in acquisition…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Code is available upon publication

  5. arXiv:2303.08561  [pdf, other]

    cs.SD eess.AS

    Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation

    Authors: Yulin Pan, Xiangteng He, Biao Gong, Yuxin Peng, Yiliang Lv

    Abstract: Existing audio analysis methods generally first transform the audio stream into a spectrogram, and then feed it into a CNN for further analysis. A standard CNN recognizes specific visual patterns over the feature map, then pools for a high-level representation, which overlooks the positional information of the recognized patterns. However, unlike a natural image, the semantics of an audio spectrogram is sensitive to…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 8 pages, 4 figures

  6. arXiv:2104.11178  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM eess.IV

    VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text

    Authors: Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong

    Abstract: We present a framework for learning multimodal representations from unlabeled data using convolution-free Transformer architectures. Specifically, our Video-Audio-Text Transformer (VATT) takes raw signals as inputs and extracts multimodal representations that are rich enough to benefit a variety of downstream tasks. We train VATT end-to-end from scratch using multimodal contrastive losses and eval…

    Submitted 6 December, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Published in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  7. arXiv:2005.01651  [pdf, ps, other]

    eess.SP

    Structured Distributed Compressive Channel Estimation over Doubly Selective Channels

    Authors: Qibo Qin, Lin Gui, Bo Gong, Xiang Ren, Wen Chen

    Abstract: For an orthogonal frequency-division multiplexing (OFDM) system over a doubly selective (DS) channel, a large number of pilot subcarriers are needed to estimate the numerous channel parameters, resulting in low spectral efficiency. In this paper, by exploiting temporal correlation of practical wireless channels, we propose a highly efficient structured distributed compressive sensing (SDCS) based…

    Submitted 23 April, 2020; originally announced May 2020.

    Comments: IEEE TVT

  8. arXiv:2004.10018  [pdf, ps, other]

    eess.SP

    Block Distributed Compressive Sensing Based Doubly Selective Channel Estimation and Pilot Design for Large-Scale MIMO Systems

    Authors: Bo Gong, Lin Gui, Qibo Qin, Xiang Ren, Wen Chen

    Abstract: The doubly selective (DS) channel estimation in the large-scale multiple-input multiple-output (MIMO) systems is a challenging problem due to the large number of the channel coefficients to be estimated, which requires unaffordable and prohibitive pilot overhead. In this paper, firstly we conduct the analysis about the common sparsity of the basis expansion model (BEM) coefficients among all the B…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: TVT

  9. arXiv:2003.02698  [pdf, ps, other]

    eess.SP

    Position-Based Interference Elimination for High Mobility OFDM Channel Estimation in Multi-cell Systems

    Authors: Xiang Ren, Wen Chen, Bo Gong, Qibo Qin, Lin Gui

    Abstract: Orthogonal frequency-division multiplexing (OFDM) and multi-cell architecture are widely adopted in current high speed train (HST) systems for providing high data rate wireless communications. In this paper, a typical multi-antenna OFDM HST communication system with multi-cell architecture is considered, where the inter-carrier interference (ICI) caused by high mobility and multi-cell interferenc…

    Submitted 1 March, 2020; originally announced March 2020.

  10. arXiv:2002.00169  [pdf, other]

    cs.CV cs.LG eess.IV

    Deep Multi-View Enhancement Hashing for Image Retrieval

    Authors: Chenggang Yan, Biao Gong, Yuxuan Wei, Yue Gao

    Abstract: Hashing is an efficient method for nearest neighbor search in large-scale data space by embedding high-dimensional feature descriptors into a similarity preserving Hamming space with a low dimension. However, large-scale high-speed retrieval through binary code has a certain degree of reduction in retrieval accuracy compared to traditional retrieval methods. We have noticed that multi-view methods…

    Submitted 15 June, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

  11. arXiv:1912.11684  [pdf, other]

    cs.CV cs.LG cs.RO cs.SD eess.AS

    Look, Listen, and Act: Towards Audio-Visual Embodied Navigation

    Authors: Chuang Gan, Yiwei Zhang, Jiajun Wu, Boqing Gong, Joshua B. Tenenbaum

    Abstract: A crucial ability of mobile intelligent agents is to integrate the evidence from multiple sensory inputs in an environment and to make a sequence of actions to reach their goals. In this paper, we attempt to approach the problem of Audio-Visual Embodied Navigation, the task of planning the shortest path from a random starting location in a scene to the sound source in an indoor environment, given…

    Submitted 7 March, 2020; v1 submitted 25 December, 2019; originally announced December 2019.

    Comments: Accepted by ICRA 2020. Project page: http://avn.csail.mit.edu

  12. Image Super-Resolution via Deterministic-Stochastic Synthesis and Local Statistical Rectification

    Authors: Weifeng Ge, Bingchen Gong, Yizhou Yu

    Abstract: Single image superresolution has been a popular research topic in the last two decades and has recently received a new wave of interest due to deep neural networks. In this paper, we approach this problem from a different perspective. With respect to a downsampled low resolution image, we model a high resolution image as a combination of two components, a deterministic component and a stochastic c…

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: to appear in SIGGRAPH Asia 2018