Skip to main content

Showing 1–8 of 8 results for author: Konwer, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.04308  [pdf, other

    cs.CV

    Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation

    Authors: Aishik Konwer, Xiaoling Hu, Joseph Bae, Xuan Xu, Chao Chen, Prateek Prasanna

    Abstract: In medical vision, different imaging modalities provide complementary information. However, in practice, not all modalities may be available during inference or even training. Previous approaches, e.g., knowledge distillation or image synthesis, often assume the availability of full modalities for all patients during training; this is unrealistic and impractical due to the variability in data coll… ▽ More

    Submitted 22 August, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted in ICCV 2023

  2. arXiv:2203.01933  [pdf, other

    eess.IV cs.CV

    Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations

    Authors: Aishik Konwer, Xuan Xu, Joseph Bae, Chao Chen, Prateek Prasanna

    Abstract: Clinical outcome or severity prediction from medical images has largely focused on learning representations from single-timepoint or snapshot scans. It has been shown that disease progression can be better characterized by temporal imaging. We therefore hypothesized that outcome predictions can be improved by utilizing the disease progression information from sequential images. We present a deep l… ▽ More

    Submitted 30 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted in CVPR 2022 (ORAL)

  3. arXiv:2107.08330  [pdf, other

    eess.IV cs.CV

    Attention-based Multi-scale Gated Recurrent Encoder with Novel Correlation Loss for COVID-19 Progression Prediction

    Authors: Aishik Konwer, Joseph Bae, Gagandeep Singh, Rishabh Gattu, Syed Ali, Jeremy Green, Tej Phatak, Prateek Prasanna

    Abstract: COVID-19 image analysis has mostly focused on diagnostic tasks using single timepoint scans acquired upon disease presentation or admission. We present a deep learning-based approach to predict lung infiltrate progression from serial chest radiographs (CXRs) of COVID-19 patients. Our method first utilizes convolutional neural networks (CNNs) for feature extraction from patches within the concerned… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: The paper is early accepted to MICCAI 2021

  4. Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory

    Authors: Sauradip Nag, Ayan Kumar Bhunia, Aishik Konwer, Partha Pratim Roy

    Abstract: Facial micro-expressions are sudden involuntary minute muscle movements which reveal true emotions that people try to conceal. Spotting a micro-expression and recognizing it is a major challenge owing to its short duration and intensity. Many works pursued traditional and deep learning based approaches to solve this issue but compromised on learning low-level features and higher accuracy due to un… ▽ More

    Submitted 18 April, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: International Conference on Acoustics, Speech, and Signal Processing(ICASSP), 2019

  5. arXiv:1801.07211  [pdf

    cs.CV

    Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network

    Authors: Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Aishik Konwer, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal

    Abstract: In this paper, we introduce a novel technique to recover the pen trajectory of offline characters which is a crucial step for handwritten character recognition. Generally, online acquisition approach has more advantage than its offline counterpart as the online technique keeps track of the pen movement. Hence, pen tip trajectory retrieval from offline text can bridge the gap between online and off… ▽ More

    Submitted 3 June, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

    Comments: To be appeared in ICPR 2018, 2018 International Conference on Pattern Recognition, Code Link: https://drive.google.com/file/d/1clT-UuXgPp6uFn1tmIXx481qvPUcY0fV/view

  6. arXiv:1801.07156  [pdf

    cs.CV

    Word Level Font-to-Font Image Translation using Convolutional Recurrent Generative Adversarial Networks

    Authors: Ankan Kumar Bhunia, Ayan Kumar Bhunia, Prithaj Banerjee, Aishik Konwer, Abir Bhowmick, Partha Pratim Roy, Umapada Pal

    Abstract: Conversion of one font to another font is very useful in real life applications. In this paper, we propose a Convolutional Recurrent Generative model to solve the word level font transfer problem. Our network is able to convert the font style of any printed text images from its current font to the required font. The network is trained end-to-end for the complete word images. Thus it eliminates the… ▽ More

    Submitted 23 May, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

    Comments: To be appeared in ICPR 2018, 2018 International Conference on Pattern Recognition

  7. arXiv:1801.07141  [pdf

    cs.CV

    Staff line Removal using Generative Adversarial Networks

    Authors: Aishik Konwer, Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal

    Abstract: Staff line removal is a crucial pre-processing step in Optical Music Recognition. It is a challenging task to simultaneously reduce the noise and also retain the quality of music symbol context in ancient degraded music score images. In this paper we propose a novel approach for staff line removal, based on Generative Adversarial Networks. We convert staff line images into patches and feed them in… ▽ More

    Submitted 5 June, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

    Comments: To be appeared in ICPR 2018, 2018 International Conference on Pattern Recognition(Oral)

  8. Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

    Authors: Ankan Kumar Bhunia, Aishik Konwer, Ayan Kumar Bhunia, Abir Bhowmick, Partha P. Roy, Umapada Pal

    Abstract: Script identification plays a significant role in analysing documents and videos. In this paper, we focus on the problem of script identification in scene text images and video scripts. Because of low image quality, complex background and similar layout of characters shared by some scripts like Greek, Latin, etc., text recognition in those cases become challenging. In this paper, we propose a nove… ▽ More

    Submitted 7 August, 2018; v1 submitted 1 January, 2018; originally announced January 2018.

    Comments: The first and second authors contributed equally. Accepted in Pattern Recognition Journal