Showing 1–18 of 18 results for author: Blumenstein, M

  1. arXiv:2406.19631  [pdf, other]

    cs.LG cs.DC

    Personalized Interpretation on Federated Learning: A Virtual Concepts approach

    Authors: Peng Yan, Guodong Long, Jing Jiang, Michael Blumenstein

    Abstract: Tackling non-IID data is an open challenge in federated learning research. Existing FL methods, including robust FL and personalized FL, are designed to improve model performance without considering how to interpret non-IID data across clients. This paper aims to design a novel FL method that is robust to, and can interpret, the non-IID data across clients. Specifically, we interpret each client's dataset as a mixt…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2403.19211  [pdf, other]

    cs.LG cs.AI cs.CL

    Dual-Personalizing Adapter for Federated Foundation Models

    Authors: Yiyuan Yang, Guodong Long, Tao Shen, Jing Jiang, Michael Blumenstein

    Abstract: Recently, foundation models, particularly large language models (LLMs), have demonstrated an impressive ability to adapt to various tasks by fine-tuning large amounts of instruction data. Notably, federated foundation models emerge as a privacy preservation method to fine-tune models collaboratively under federated learning (FL) settings by leveraging many distributed datasets with non-IID data. T…

    Submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2308.02905  [pdf, other]

    cs.CV cs.MM

    FAST: Font-Agnostic Scene Text Editing

    Authors: Alloy Das, Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

    Abstract: Scene Text Editing (STE) is a challenging research problem, and it aims to modify existing texts in an image while preserving the background and the font style of the original text of the image. Due to its various real-life applications, researchers have explored several approaches toward STE in recent years. However, most of the existing STE methods show inferior editing performance because of (1…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 13 pages, in submission

  4. arXiv:2304.11993  [pdf, other]

    cs.CV cs.MM

    MMC: Multi-Modal Colorization of Images using Textual Descriptions

    Authors: Subhankar Ghosh, Saumik Bhattacharya, Prasun Roy, Umapada Pal, Michael Blumenstein

    Abstract: Handling various objects with different colors is a significant challenge for image colorization techniques. Thus, for complex real-world scenes, the existing image colorization algorithms often fail to maintain color consistency. In this work, we attempt to integrate textual descriptions as an auxiliary condition, along with the grayscale image that is to be colorized, to improve the fidelity of…

    Submitted 25 April, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 9 pages

  5. arXiv:2302.14728  [pdf, other]

    cs.CV cs.MM

    Global Context-Aware Person Image Generation

    Authors: Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal, Michael Blumenstein

    Abstract: We propose a data-driven approach for context-aware person image generation. Specifically, we attempt to generate a person image such that the synthesized instance can blend into a complex scene. In our method, the position, scale, and appearance of the generated person are semantically conditioned on the existing persons in the scene. The proposed technique is divided into three sequential steps.…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 14 pages

  6. arXiv:2208.02843  [pdf, other]

    cs.CV

    TIC: Text-Guided Image Colorization

    Authors: Subhankar Ghosh, Prasun Roy, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

    Abstract: Image colorization is a well-known problem in computer vision. However, due to the ill-posed nature of the task, image colorization is inherently challenging. Though several attempts have been made by researchers to make the colorization pipeline automatic, these processes often produce unrealistic results due to a lack of conditioning. In this work, we attempt to integrate textual descriptions as…

    Submitted 4 August, 2022; originally announced August 2022.

  7. arXiv:2207.11718  [pdf, other]

    cs.CV cs.MM

    TIPS: Text-Induced Pose Synthesis

    Authors: Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

    Abstract: In computer vision, human pose synthesis and transfer deal with probabilistic image generation of a person in a previously unseen pose from an already available observation of that person. Though researchers have recently proposed several methods to achieve this task, most of these techniques derive the target pose directly from the desired target image on a specific dataset, making the underlying…

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted in The European Conference on Computer Vision (ECCV) 2022

  8. arXiv:2206.02717  [pdf, other]

    cs.CV cs.MM

    Scene Aware Person Image Generation through Global Contextual Conditioning

    Authors: Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

    Abstract: Person image generation is an intriguing yet challenging problem. However, this task becomes even more difficult under constrained situations. In this work, we propose a novel pipeline to generate and insert contextually relevant person images into an existing scene while preserving the global semantics. More specifically, we aim to insert a person such that the location, pose, and scale of the pe…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted in The International Conference on Pattern Recognition (ICPR) 2022

  9. arXiv:2005.12524  [pdf]

    cs.CV cs.MM

    A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video

    Authors: Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein

    Abstract: Detecting text located on the torsos of marathon runners and sports players in video is a challenging issue due to poor quality and adverse effects caused by flexible/colorful clothing, and different structures of human bodies or actions. This paper presents a new unified method for tackling the above challenges. The proposed method fuses gradient magnitude and direction coherence of text pixels i…

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted in Pattern Recognition, Elsevier

  10. Robust Tensor Decomposition for Image Representation Based on Generalized Correntropy

    Authors: Miaohua Zhang, Yongsheng Gao, Changming Sun, Michael Blumenstein

    Abstract: Traditional tensor decomposition methods, e.g., two dimensional principal component analysis and two dimensional singular value decomposition, that minimize mean square errors, are sensitive to outliers. To overcome this problem, in this paper we propose a new robust tensor decomposition method using generalized correntropy criterion (Corr-Tensor). A Lagrange multiplier method is used to effective…

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 13 pages

  11. arXiv:2005.04541  [pdf]

    cs.CV

    A Robust Matching Pursuit Algorithm Using Information Theoretic Learning

    Authors: Miaohua Zhang, Yongsheng Gao, Changming Sun, Michael Blumenstein

    Abstract: Current orthogonal matching pursuit (OMP) algorithms calculate the correlation between two vectors using the inner product operation and minimize the mean square error, which are both suboptimal when there are non-Gaussian noises or outliers in the observation data. To overcome these problems, a new OMP algorithm is developed based on the information theoretic learning (ITL), which is built on the…

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: Accepted by "Pattern Recognition"

  12. arXiv:2002.10061  [pdf, other]

    cs.LG stat.ML

    Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification

    Authors: Wensi Tang, Guodong Long, Lu Liu, Tianyi Zhou, Michael Blumenstein, Jing Jiang

    Abstract: The Receptive Field (RF) size has been one of the most important factors for One Dimensional Convolutional Neural Networks (1D-CNNs) on time series classification tasks. Large efforts have been taken to choose the appropriate size because it has a huge influence on the performance and differs significantly for each dataset. In this paper, we propose an Omni-Scale block (OS-block) for 1D-CNNs, wher…

    Submitted 17 June, 2022; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: Accepted by ICLR 2022

    Journal ref: The Tenth International Conference on Learning Representations (ICLR 2022)

  13. Intra-Variable Handwriting Inspection Reinforced with Idiosyncrasy Analysis

    Authors: Chandranath Adak, Bidyut B. Chaudhuri, Chin-Teng Lin, Michael Blumenstein

    Abstract: In this paper, we work on intra-variable handwriting, where the writing samples of an individual can vary significantly. Such within-writer variation throws a challenge for automatic writer inspection, where the state-of-the-art methods do not perform well. To deal with intra-variability, we analyze the idiosyncrasy in individual handwriting. We identify/verify the writer from highly idiosyncratic…

    Submitted 7 May, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Journal ref: IEEE Transactions on Information Forensics and Security, 2020

  14. Temporal Self-Attention Network for Medical Concept Embedding

    Authors: Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang, Michael Blumenstein

    Abstract: In longitudinal electronic health records (EHRs), the event records of a patient are distributed over a long period of time and the temporal relations between the events reflect sufficient domain knowledge to benefit prediction tasks such as the rate of inpatient mortality. Medical concept embedding as a feature extraction method that transforms a set of medical concepts with a specific time stamp…

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: 10 pages, 7 figures, accepted at IEEE ICDM 2019

    MSC Class: 68T30; ACM Class: I.2.1

  15. FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition

    Authors: Qingqing Wang, Wenjing Jia, Xiangjian He, Yue Lu, Michael Blumenstein, Ye Huang

    Abstract: Scene text recognition has recently been widely treated as a sequence-to-sequence prediction problem, where traditional fully-connected-LSTM (FC-LSTM) has played a critical role. Due to the limitation of FC-LSTM, existing methods have to convert 2-D feature maps into 1-D sequential feature vectors, resulting in severe damage to the valuable spatial and structural information of text images. In th…

    Submitted 5 January, 2020; v1 submitted 20 April, 2019; originally announced April 2019.

    Comments: Accepted by Science China Information Science

  16. arXiv:1807.06772  [pdf, ps, other]

    cs.CV

    Bag-of-Visual-Words for Signature-Based Multi-Script Document Retrieval

    Authors: Ranju Mandal, Partha Pratim Roy, Umapada Pal, Michael Blumenstein

    Abstract: An end-to-end architecture for multi-script document retrieval using handwritten signatures is proposed in this paper. The user supplies a query signature sample and the system exclusively returns a set of documents that contain the query signature. In the first stage, a component-wise classification technique separates the potential signature components from all other components. A bag-of-visual-…

    Submitted 18 July, 2018; originally announced July 2018.

  17. arXiv:1709.02243  [pdf, other]

    cs.CV

    Towards a Dedicated Computer Vision Tool set for Crowd Simulation Models

    Authors: Sultan Daud Khan, Muhammad Saqib, Michael Blumenstein

    Abstract: As the population of the world is increasing, and becoming even more concentrated in urban areas, ensuring public safety is becoming a daunting job for security personnel and crowd managers. Mass events like sports, festivals, concerts, and political gatherings attract thousands of people in a constrained environment; therefore, adequate safety measures should be adopted. Despite safety measures, crowd disasters still…

    Submitted 1 September, 2017; originally announced September 2017.

  18. An Empirical Study on Writer Identification & Verification from Intra-variable Individual Handwriting

    Authors: Chandranath Adak, Bidyut B. Chaudhuri, Michael Blumenstein

    Abstract: The handwriting of an individual may vary substantially with factors such as mood, time, space, writing speed, writing medium and tool, writing topic, etc. It becomes challenging to perform automated writer verification/identification on a particular set of handwritten patterns (e.g., speedy handwriting) of a person, especially when the system is trained using a different set of writing patterns (…

    Submitted 20 January, 2019; v1 submitted 10 August, 2017; originally announced August 2017.