Skip to main content

Showing 1–50 of 54 results for author: Hasan, M K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.08635  [pdf

    cs.CV

    BdSLW60: A Word-Level Bangla Sign Language Dataset

    Authors: Husne Ara Rubaiyeat, Hasan Mahmud, Ahsan Habib, Md. Kamrul Hasan

    Abstract: Sign language discourse is an essential mode of daily communication for the deaf and hard-of-hearing people. However, research on Bangla Sign Language (BdSL) faces notable limitations, primarily due to the lack of datasets. Recognizing wordlevel signs in BdSL (WL-BdSL) presents a multitude of challenges, including the need for well-annotated datasets, capturing the dynamic nature of sign gestures… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  2. arXiv:2310.15693  [pdf, other

    cs.CL cs.AI

    Towards Automated Recipe Genre Classification using Semi-Supervised Learning

    Authors: Nazmus Sakib, G. M. Shahariar, Md. Mohsinul Kabir, Md. Kamrul Hasan, Hasan Mahmud

    Abstract: Sharing cooking recipes is a great way to exchange culinary ideas and provide instructions for food preparation. However, categorizing raw recipes found online into appropriate food genres can be challenging due to a lack of adequate labeled data. In this study, we present a dataset named the ``Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Recipe Dataset" that contains t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  3. arXiv:2309.00831  [pdf, other

    eess.IV cs.CV

    Multi-scale, Data-driven and Anatomically Constrained Deep Learning Image Registration for Adult and Fetal Echocardiography

    Authors: Md. Kamrul Hasan, Haobo Zhu, Guang Yang, Choon Hwai Yap

    Abstract: Temporal echocardiography image registration is a basis for clinical quantifications such as cardiac motion estimation, myocardial strain assessments, and stroke volume quantifications. In past studies, deep learning image registration (DLIR) has shown promising results and is consistently accurate and precise, requiring less computational time. We propose that a greater focus on the warped moving… ▽ More

    Submitted 11 September, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: Our data-driven and anatomically constrained DLIR method's source code will be publicly available at https://github.com/kamruleee51/DdC-AC-DLIR

  4. arXiv:2306.13899  [pdf, other

    cs.CL cs.AI

    Math Word Problem Solving by Generating Linguistic Variants of Problem Statements

    Authors: Syed Rifat Raiyan, Md. Nafis Faiyaz, Shah Md. Jawad Kabir, Mohsinul Kabir, Hasan Mahmud, Md Kamrul Hasan

    Abstract: The art of mathematical reasoning stands as a fundamental pillar of intellectual progress and is a central catalyst in cultivating human ingenuity. Researchers have recently published a plethora of works centered around the task of solving Math Word Problems (MWP) $-$ a crucial stride towards general AI. These existing models are susceptible to dependency on shallow heuristics and spurious correla… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Student Research Workshop (ACL-SRW 2023), 17 pages, 2 figures, 7 tables

  5. arXiv:2305.06595  [pdf

    cs.CL

    BanglaBook: A Large-scale Bangla Dataset for Sentiment Analysis from Book Reviews

    Authors: Mohsinul Kabir, Obayed Bin Mahfuz, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan

    Abstract: The analysis of consumer sentiment, as expressed through reviews, can provide a wealth of insight regarding the quality of a product. While the study of sentiment analysis has been widely explored in many popular languages, relatively less attention has been given to the Bangla language, mostly due to a lack of relevant data and cross-domain adaptability. To address this limitation, we present Ban… ▽ More

    Submitted 8 June, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted in Findings of the Association for Computational Linguistics: ACL 2023

  6. arXiv:2305.01044   

    cs.CV cs.AI

    Venn Diagram Multi-label Class Interpretation of Diabetic Foot Ulcer with Color and Sharpness Enhancement

    Authors: Md Mahamudul Hasan, Moi Hoon Yap, Md Kamrul Hasan

    Abstract: DFU is a severe complication of diabetes that can lead to amputation of the lower limb if not treated properly. Inspired by the 2021 Diabetic Foot Ulcer Grand Challenge, researchers designed automated multi-class classification of DFU, including infection, ischaemia, both of these conditions, and none of these conditions. However, it remains a challenge as classification accuracy is still not sati… ▽ More

    Submitted 5 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: The Paper is not complete, more modifications are needed

  7. Assorted, Archetypal and Annotated Two Million (3A2M) Cooking Recipes Dataset based on Active Learning

    Authors: Nazmus Sakib, G. M. Shahariar, Md. Mohsinul Kabir, Md. Kamrul Hasan, Hasan Mahmud

    Abstract: Cooking recipes allow individuals to exchange culinary ideas and provide food preparation instructions. Due to a lack of adequate labeled data, categorizing raw recipes found online to the appropriate food genres is a challenging task in this domain. Utilizing the knowledge of domain experts to categorize recipes could be a solution. In this study, we present a novel dataset of two million culinar… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Journal ref: International Conference on Machine Intelligence and Emerging Technologies. MIET 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 491, pp 188-203, Springer, Cham

  8. arXiv:2303.15430  [pdf, other

    cs.CL cs.LG

    TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in Pre-trained Language Models

    Authors: Md Kamrul Hasan, Md Saiful Islam, Sangwu Lee, Wasifur Rahman, Iftekhar Naim, Mohammed Ibrahim Khan, Ehsan Hoque

    Abstract: Pre-trained large language models have recently achieved ground-breaking performance in a wide variety of language understanding tasks. However, the same model can not be applied to multimodal behavior understanding tasks (e.g., video sentiment/humor detection) unless non-verbal features (e.g., acoustic and visual) can be integrated with language. Jointly modeling multiple modalities significantly… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  9. arXiv:2212.11486  [pdf, other

    cs.CR eess.SP

    Over-the-Air Federated Learning with Enhanced Privacy

    Authors: Xiaochan Xue, Moh Khalid Hasan, Shucheng Yu, Laxima Niure Kandel, Min Song

    Abstract: Federated learning (FL) has emerged as a promising learning paradigm in which only local model parameters (gradients) are shared. Private user data never leaves the local devices thus preserving data privacy. However, recent research has shown that even when local data is never shared by a user, exchanging model parameters without protection can also leak private information. Moreover, in wireless… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 6 pages

  10. arXiv:2210.04483  [pdf, other

    cs.HC

    Auxilio: A Sensor-Based Wireless Head-Mounted Mouse for People with Upper Limb Disability

    Authors: Mohammad Ridwan Kabir, Mohammad Ishrak Abedin, Rizvi Ahmed, Saad Bin Ashraf, Hasan Mahmud, Md. Kamrul Hasan

    Abstract: Upper limb disability may be caused either due to accidents, neurological disorders, or even birth defects, imposing limitations and restrictions on the interaction with a computer for the concerned individuals using a generic optical mouse. Our work proposes the design and development of a working prototype of a sensor-based wireless head-mounted Assistive Mouse Controller (AMC), Auxilio, facilit… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 28 pages, 9 figures, 5 tables

  11. arXiv:2209.08807  [pdf, other

    eess.IV cs.CV

    A Deep Learning Approach for Parallel Imaging and Compressed Sensing MRI Reconstruction

    Authors: Farhan Sadik, Md. Kamrul Hasan

    Abstract: Parallel imaging accelerates MRI data acquisition by acquiring additional sensitivity information with an array of receiver coils, resulting in fewer phase encoding steps. Because of fewer data requirements than parallel imaging, compressed sensing magnetic resonance imaging (CS-MRI) has gained popularity in the field of medical imaging. Parallel imaging and compressed sensing (CS) both reduce the… ▽ More

    Submitted 17 December, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: 13 pages, 11 figures

  12. A survey, review, and future trends of skin lesion segmentation and classification

    Authors: Md. Kamrul Hasan, Md. Asif Ahamad, Choon Hwai Yap, Guang Yang

    Abstract: The Computer-aided Diagnosis or Detection (CAD) approach for skin lesion analysis is an emerging field of research that has the potential to alleviate the burden and cost of skin cancer screening. Researchers have recently indicated increasing interest in develo** such CAD systems, with the intention of providing a user-friendly tool to dermatologists to reduce the challenges encountered or asso… ▽ More

    Submitted 2 February, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: This manuscript has been accepted to be published in Computers in Biology and Medicine and has a total of 106 pages (single column and double spacing), 13 figures, and 11 tables

    Journal ref: Computers in biology and medicine (2023): 106624

  13. arXiv:2203.08490  [pdf, other

    cs.SD cs.LG eess.AS

    Learning Audio Representations with MLPs

    Authors: Mashrur M. Morshed, Ahmad Omar Ahsan, Hasan Mahmud, Md. Kamrul Hasan

    Abstract: In this paper, we propose an efficient MLP-based approach for learning audio representations, namely timestamp and scene-level audio embeddings. We use an encoder consisting of sequentially stacked gated MLP blocks, which accept 2D MFCCs as inputs. In addition, we also provide a simple temporal interpolation-based algorithm for computing scene-level embeddings from timestamp embeddings. The audio… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: In submission to Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track

  14. arXiv:2202.06128  [pdf, other

    eess.SP cs.AI cs.HC cs.RO

    Grasp-and-Lift Detection from EEG Signal Using Convolutional Neural Network

    Authors: Md. Kamrul Hasan, Sifat Redwan Wahid, Faria Rahman, Shanjida Khan Maliha, Sauda Binte Rahman

    Abstract: People undergoing neuromuscular dysfunctions and amputated limbs require automatic prosthetic appliances. In develo** such prostheses, the precise detection of brain motor actions is imperative for the Grasp-and-Lift (GAL) tasks. Because of the low-cost and non-invasive essence of Electroencephalography (EEG), it is widely preferred for detecting motor actions during the controls of prosthetic t… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: Accepted in https://icaeee2022.com/

  15. VIS-iTrack: Visual Intention through Gaze Tracking using Low-Cost Webcam

    Authors: Shahed Anzarus Sabab, Mohammad Ridwan Kabir, Sayed Rizban Hussain, Hasan Mahmud, Md. Kamrul Hasan, Husne Ara Rubaiyeat

    Abstract: Human intention is an internal, mental characterization for acquiring desired information. From interactive interfaces containing either textual or graphical information, intention to perceive desired information is subjective and strongly connected with eye gaze. In this work, we determine such intention by analyzing real-time eye gaze data with a low-cost regular webcam. We extracted unique feat… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: 15 pages, 9 figures, 4 tables

    ACM Class: I.4; I.5.2

  16. arXiv:2201.00458  [pdf, other

    eess.IV cs.CV cs.LG

    Lung-Originated Tumor Segmentation from Computed Tomography Scan (LOTUS) Benchmark

    Authors: Parnian Afshar, Arash Mohammadi, Konstantinos N. Plataniotis, Keyvan Farahani, Justin Kirby, Anastasia Oikonomou, Amir Asif, Leonard Wee, Andre Dekker, Xin Wu, Mohammad Ariful Haque, Shahruk Hossain, Md. Kamrul Hasan, Uday Kamal, Winston Hsu, Jhih-Yuan Lin, M. Sohel Rahman, Nabil Ibtehaz, Sh. M. Amir Foisol, Kin-Man Lam, Zhong Guang, Runze Zhang, Sumohana S. Channappayya, Shashank Gupta, Chander Dev

    Abstract: Lung cancer is one of the deadliest cancers, and in part its effective diagnosis and treatment depend on the accurate delineation of the tumor. Human-centered segmentation, which is currently the most common approach, is subject to inter-observer variability, and is also time-consuming, considering the fact that only experts are capable of providing annotations. Automatic and semi-automatic tumor… ▽ More

    Submitted 2 January, 2022; originally announced January 2022.

  17. arXiv:2111.10776  [pdf

    cs.CL cs.HC cs.LG

    A Case Study on the Independence of Speech Emotion Recognition in Bangla and English Languages using Language-Independent Prosodic Features

    Authors: Fardin Saad, Hasan Mahmud, Mohammad Ridwan Kabir, Md. Alamin Shaheen, Paresha Farastu, Md. Kamrul Hasan

    Abstract: A language agnostic approach to recognizing emotions from speech remains an incomplete and challenging task. In this paper, we performed a step-by-step comparative analysis of Speech Emotion Recognition (SER) using Bangla and English languages to assess whether distinguishing emotions from speech is independent of language. Six emotions were categorized for this study, such as - happy, angry, neut… ▽ More

    Submitted 13 May, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

    Comments: 13 pages [currently under review]

  18. arXiv:2111.00601  [pdf

    cs.LG cs.CR cs.NI

    Explainable Artificial Intelligence for Smart City Application: A Secure and Trusted Platform

    Authors: M. Humayn Kabir, Khondokar Fida Hasan, Mohammad Kamrul Hasan, Keyvan Ansari

    Abstract: Artificial Intelligence (AI) is one of the disruptive technologies that is sha** the future. It has growing applications for data-driven decisions in major smart city solutions, including transportation, education, healthcare, public governance, and power systems. At the same time, it is gaining popularity in protecting critical cyber infrastructure from cyber threats, attacks, damages, or unaut… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Book_Chapter, Springer Nature

  19. arXiv:2109.07702  [pdf, other

    eess.IV cs.CV cs.LG

    A Multi-Task Cross-Task Learning Architecture for Ad-hoc Uncertainty Estimation in 3D Cardiac MRI Image Segmentation

    Authors: S. M. Kamrul Hasan, Cristian A. Linte

    Abstract: Medical image segmentation has significantly benefitted thanks to deep learning architectures. Furthermore, semi-supervised learning (SSL) has recently been a growing trend for improving a model's overall performance by leveraging abundant unlabeled data. Moreover, learning multiple tasks within the same model further improves model generalizability. To generate smoother and accurate segmentation… ▽ More

    Submitted 2 October, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted to 2021 Computing in Cardiology (CinC); Code is available at https://github.com/SMKamrulHasan/MTCTL

  20. ANTASID: A Novel Temporal Adjustment to Shannon's Index of Difficulty for Quantifying the Perceived Difficulty of Uncontrolled Pointing Tasks

    Authors: Mohammad Ridwan Kabir, Mohammad Ishrak Abedin, Rizvi Ahmed, Hasan Mahmud, Md. Kamrul Hasan

    Abstract: Shannon's Index of Difficulty ($ID$), reputable for quantifying the perceived difficulty of pointing tasks as a logarithmic relationship between movement-amplitude ($A$) and target-width ($W$), is used for modelling the corresponding observed movement-times ($MT_O$) in such tasks in controlled experimental setup. However, real-life pointing tasks are both spatially and temporally uncontrolled, bei… ▽ More

    Submitted 29 December, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: 14 pages, 7 figures, 7 tables

    ACM Class: G.3; H.1.2; H.5.2

  21. arXiv:2109.03631  [pdf, other

    cs.HC

    Renovo: Prototype of a Low-Cost Sensor-Based Therapeutic System for Upper Limb Rehabilitation

    Authors: Mohammad Ridwan Kabir, Mohammad Anas Jawad, Mohaimin Ehsan, Hasan Mahmud, Md. Kamrul Hasan

    Abstract: Stroke patients with Upper Limb Disability (ULD) are re-acclimated to their lost motor capability through therapeutic interventions, following assessment by Physiotherapists (PTs) using various qualitative assessment protocols. However, the assessments are often biased and prone to errors. Real-time visualization and quantitative analysis of various Performance Metrics (PMs) of patient's motion da… ▽ More

    Submitted 17 October, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 27 pages, 10 figures, 5 tables

  22. arXiv:2107.02543  [pdf, other

    cs.CV cs.HC cs.LG

    A Deep Learning-based Multimodal Depth-Aware Dynamic Hand Gesture Recognition System

    Authors: Hasan Mahmud, Mashrur M. Morshed, Md. Kamrul Hasan

    Abstract: The dynamic hand gesture recognition task has seen studies on various unimodal and multimodal methods. Previously, researchers have explored depth and 2D-skeleton-based multimodal fusion CRNNs (Convolutional Recurrent Neural Networks) but have had limitations in getting expected recognition results. In this paper, we revisit this approach to hand gesture recognition and suggest several improvement… ▽ More

    Submitted 5 November, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

  23. Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods

    Authors: Ovishake Sen, Mohtasim Fuad, MD. Nazrul Islam, Jakaria Rabbi, Mehedi Masud, MD. Kamrul Hasan, Md. Abdul Awal, Awal Ahmed Fime, Md. Tahmid Hasan Fuad, Delowar Sikder, MD. Akil Raihan Iftee

    Abstract: The Bangla language is the seventh most spoken language, with 265 million native and non-native speakers worldwide. However, English is the predominant language for online resources and technical knowledge, journals, and documentation. Consequently, many Bangla-speaking people, who have limited command of English, face hurdles to utilize English resources. To bridge the gap between limited support… ▽ More

    Submitted 9 April, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accedpted in IEEE Access and it has 46 pages. Link: https://ieeexplore.ieee.org/document/9751052 (Early Access - April 10, 2022)

  24. arXiv:2105.03995  [pdf, other

    eess.IV cs.CV cs.LG

    Acute Lymphoblastic Leukemia Detection from Microscopic Images Using Weighted Ensemble of Convolutional Neural Networks

    Authors: Chayan Mondal, Md. Kamrul Hasan, Md. Tasnim Jawad, Aishwariya Dutta, Md. Rabiul Islam, Md. Abdul Awal, Mohiuddin Ahmad

    Abstract: Acute Lymphoblastic Leukemia (ALL) is a blood cell cancer characterized by numerous immature lymphocytes. Even though automation in ALL prognosis is an essential aspect of cancer diagnosis, it is challenging due to the morphological correlation between malignant and normal cells. The traditional ALL classification strategy demands experienced pathologists to carefully read the cell images, which i… ▽ More

    Submitted 9 May, 2021; originally announced May 2021.

    Comments: 31 pages, 9 figures

  25. arXiv:2102.06169  [pdf, other

    eess.IV cs.CV cs.LG

    COVID-19 identification from volumetric chest CT scans using a progressively resized 3D-CNN incorporating segmentation, augmentation, and class-rebalancing

    Authors: Md. Kamrul Hasan, Md. Tasnim Jawad, Kazi Nasim Imtiaz Hasan, Sajal Basak Partha, Md. Masum Al Masba, Shumit Saha

    Abstract: The novel COVID-19 is a global pandemic disease overgrowing worldwide. Computer-aided screening tools with greater sensitivity is imperative for disease diagnosis and prognosis as early as possible. It also can be a helpful tool in triage for testing and clinical supervision of COVID-19 patients. However, designing such an automated tool from non-invasive radiographic images is challenging as many… ▽ More

    Submitted 14 April, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: 33 pages

  26. arXiv:2102.01824  [pdf, other

    eess.IV cs.CV cs.LG

    Dermo-DOCTOR: A framework for concurrent skin lesion detection and recognition using a deep convolutional neural network with end-to-end dual encoders

    Authors: Md. Kamrul Hasan, Shidhartho Roy, Chayan Mondal, Md. Ashraful Alam, Md. Toufick E Elahi, Aishwariya Dutta, S. M. Taslim Uddin Raju, Md. Tasnim Jawad, Mohiuddin Ahmad

    Abstract: Automated skin lesion analysis for simultaneous detection and recognition is still challenging for inter-class homogeneity and intra-class heterogeneity, leading to low generic capability of a Single Convolutional Neural Network (CNN) with limited datasets. This article proposes an end-to-end deep CNN-based framework for simultaneous detection and recognition of the skin lesions, named Dermo-DOCTO… ▽ More

    Submitted 23 February, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: 39 Pages

  27. arXiv:2102.01822  [pdf, other

    eess.IV cs.CV

    Multi-class probabilistic atlas-based whole heart segmentation method in cardiac CT and MRI

    Authors: Tarun Kanti Ghosh, Md. Kamrul Hasan, Shidhartho Roy, Md. Ashraful Alam, Eklas Hossain, Mohiuddin Ahmad

    Abstract: Accurate and robust whole heart substructure segmentation is crucial in develo** clinical applications, such as computer-aided diagnosis and computer-aided surgery. However, segmentation of different heart substructures is challenging because of inadequate edge or boundary information, the complexity of the background and texture, and the diversity in different substructures' sizes and shapes. T… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 17 pages

  28. arXiv:2007.11993  [pdf, other

    eess.IV cs.CV

    CVR-Net: A deep convolutional neural network for coronavirus recognition from chest radiography images

    Authors: Md. Kamrul Hasan, Md. Ashraful Alam, Md. Toufick E Elahi, Shidhartho Roy, Sifat Redwan Wahid

    Abstract: The novel Coronavirus Disease 2019 (COVID-19) is a global pandemic disease spreading rapidly around the world. A robust and automatic early recognition of COVID-19, via auxiliary computer-aided diagnostic tools, is essential for disease cure and control. The chest radiography images, such as Computed Tomography (CT) and X-ray, and deep Convolutional Neural Networks (CNNs), can be a significant and… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: 31 Pages

  29. Better User Recommendations using Enhancing Software Development Process Repository

    Authors: Ziaur Rahman, Md. Kamrul Hasan

    Abstract: Reusing previously completed software repository to enhance the development process is a common phenomenon. If developers get suggestions from the existing projects they might be benefited a lot what they eventually expect while coding. The strategies available in this field have been rapidly changing day by day. There are a number of efforts that have been focusing on mining process and construct… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 6 Pages 6 Figures 8 Tables

    ACM Class: K.6.3

    Journal ref: 2015 18th International Conference on Computer and Information Technology (ICCIT), Dhaka, 2015, pp. 70-75

  30. arXiv:2006.02578  [pdf, other

    eess.IV cs.CV cs.LG

    DFR-TSD: A Deep Learning Based Framework for Robust Traffic Sign Detection Under Challenging Weather Conditions

    Authors: Sabbir Ahmed, Uday Kamal, Md. Kamrul Hasan

    Abstract: Robust traffic sign detection and recognition (TSDR) is of paramount importance for the successful realization of autonomous vehicle technology. The importance of this task has led to a vast amount of research efforts and many promising methods have been proposed in the existing literature. However, the SOTA (SOTA) methods have been evaluated on clean and challenge-free datasets and overlooked the… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

  31. Opportunities of Optical Spectrum for Future Wireless Communications

    Authors: Mostafa Zaman Chowdhury, Moh Khalid Hasan, Md Shahjalal, Eun Bi Shin, Yeong Min Jang

    Abstract: The requirements in terms of service quality such as data rate, latency, power consumption, number of connectivity of future fifth-generation (5G) communication is very high. Moreover, in Internet of Things (IoT) requires massive connectivity. Optical wireless communication (OWC) technologies such as visible light communication, light fidelity, optical camera communication, and free space optical… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)

  32. Optical wireless hybrid networks for 5G and beyond communications

    Authors: Mostafa Zaman Chowdhury, Moh Khalid Hasan, Md Shahjalal, Md Tanvir Hossan, Yeong Min Jang

    Abstract: The next 5 th generation (5G) and above ultra-high speed, ultra-low latency, and extremely high reliable communication systems will consist of heterogeneous networks. These heterogeneous networks will consist not only radio frequency (RF) based systems but also optical wireless based systems. Hybrid architectures among different networks is an excellent approach for achieving the required level of… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 2018 International Conference on Information and Communication Technology Convergence (ICTC)

  33. arXiv:2004.11253  [pdf, other

    eess.IV cs.CV

    L-CO-Net: Learned Condensation-Optimization Network for Clinical Parameter Estimation from Cardiac Cine MRI

    Authors: S. M. Kamrul Hasan, Cristian A. Linte

    Abstract: In this work, we implement a fully convolutional segmenter featuring both a learned group structure and a regularized weight-pruner to reduce the high computational cost in volumetric image segmentation. We validated our framework on the ACDC dataset featuring one healthy and four pathology groups imaged throughout the cardiac cycle. Our technique achieved Dice scores of 96.8% (LV blood-pool), 93.… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: 6 pages, 5 figures, IEEE Conference. arXiv admin note: text overlap with arXiv:2004.02249

  34. arXiv:2004.02249  [pdf, other

    eess.IV cs.CV cs.LG

    CondenseUNet: A Memory-Efficient Condensely-Connected Architecture for Bi-ventricular Blood Pool and Myocardium Segmentation

    Authors: S. M. Kamrul Hasan, Cristian A. Linte

    Abstract: With the advent of Cardiac Cine Magnetic Resonance (CMR) Imaging, there has been a paradigm shift in medical technology, thanks to its capability of imaging different structures within the heart without ionizing radiation. However, it is very challenging to conduct pre-operative planning of minimally invasive cardiac procedures without accurate segmentation and identification of the left ventricle… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 7 pages, 3 figures

  35. arXiv:1911.05305  [pdf, other

    cs.HC

    Emotion Recognition with Forearm-based Electromyography

    Authors: Muhammad Shihab Rashid, Zubayet Zaman, Hasan Mahmud, Md. Kamrul Hasan

    Abstract: Electromyography is an unexplored field of study when it comes to alternate input modality while interacting with a computer. However, to make computers understand human emotions is pivotal in the area of human-computer interaction and in assistive technology. Traditional input devices used currently have limitations and restrictions when it comes to express human emotions. The applications regard… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  36. arXiv:1910.02579  [pdf

    eess.IV cs.CV

    A Novel Technique of Noninvasive Hemoglobin Level Measurement Using HSV Value of Fingertip Image

    Authors: Md Kamrul Hasan, Nazmus Sakib, Joshua Field, Richard R. Love, Sheikh I. Ahamed

    Abstract: Over the last decade, smartphones have changed radically to support us with mHealth technology, cloud computing, and machine learning algorithm. Having its multifaceted facilities, we present a novel smartphone-based noninvasive hemoglobin (Hb) level prediction model by analyzing hue, saturation and value (HSV) of a fingertip video. Here, we collect 60 videos of 60 subjects from two different loca… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

  37. arXiv:1908.05787  [pdf, other

    cs.LG cs.CL stat.ML

    Integrating Multimodal Information in Large Pretrained Transformers

    Authors: Wasifur Rahman, Md. Kamrul Hasan, Sangwu Lee, Amir Zadeh, Chengfeng Mao, Louis-Philippe Morency, Ehsan Hoque

    Abstract: Recent Transformer-based contextual word representations, including BERT and XLNet, have shown state-of-the-art performance in multiple disciplines within NLP. Fine-tuning the trained contextual models on task-specific datasets has been the key to achieving superior performance downstream. While fine-tuning these pre-trained models is straightforward for lexical applications (applications with onl… ▽ More

    Submitted 21 November, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

  38. arXiv:1907.04424  [pdf, other

    cs.CV

    Automatic Mass Detection in Breast Using Deep Convolutional Neural Network and SVM Classifier

    Authors: Md. Kamrul Hasan, Tajwar Abrar Aleef

    Abstract: Mammography is the most widely used gold standard for screening breast cancer, where, mass detection is considered as the prominent step. Detecting mass in the breast is, however, an arduous problem as they usually have large variations between them in terms of shape, size, boundary, and texture. In this literature, the process of mass detection is automated with the use of transfer learning techn… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 11 pages

  39. arXiv:1907.04305  [pdf, other

    eess.IV cs.CV

    DSNet: Automatic Dermoscopic Skin Lesion Segmentation

    Authors: Md. Kamrul Hasan, Lavsen Dahal, Prasad N. Samarakoon, Fakrul Islam Tushar, Robert Marti Marly

    Abstract: Automatic segmentation of skin lesion is considered a crucial step in Computer Aided Diagnosis (CAD) for melanoma diagnosis. Despite its significance, skin lesion segmentation remains a challenging task due to their diverse color, texture, and indistinguishable boundaries and forms an open problem. Through this study, we present a new and automatic semantic segmentation network for robust skin les… ▽ More

    Submitted 23 January, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: 25 pages

  40. arXiv:1905.08392  [pdf, other

    cs.LG cs.CL stat.ML

    A Causality-Guided Prediction of the TED Talk Ratings from the Speech-Transcripts using Neural Networks

    Authors: Md Iftekhar Tanveer, Md Kamrul Hasan, Daniel Gildea, M. Ehsan Hoque

    Abstract: Automated prediction of public speaking performance enables novel systems for tutoring public speaking skills. We use the largest open repository---TED Talks---to predict the ratings provided by the online viewers. The dataset contains over 2200 talk transcripts and the associated meta information including over 5.5 million ratings from spontaneous visitors to the website. We carefully removed the… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

  41. arXiv:1904.06618  [pdf, other

    cs.LG cs.CL stat.ML

    UR-FUNNY: A Multimodal Language Dataset for Understanding Humor

    Authors: Md Kamrul Hasan, Wasifur Rahman, Amir Zadeh, Jianyuan Zhong, Md Iftekhar Tanveer, Louis-Philippe Morency, Mohammed, Hoque

    Abstract: Humor is a unique and creative communicative behavior displayed during social interactions. It is produced in a multimodal manner, through the usage of words (text), gestures (vision) and prosodic cues (acoustic). Understanding humor from these three modalities falls within boundaries of multimodal language; a recent research trend in natural language processing that models natural language as it… ▽ More

    Submitted 13 April, 2019; originally announced April 2019.

    Journal ref: EMNLP-IJCNLP, 2019, 2046-2056

  42. arXiv:1904.03075  [pdf, other

    cs.CV

    Comparative Analysis of Automatic Skin Lesion Segmentation with Two Different Implementations

    Authors: Md. Kamrul Hasan, Basel Alyafi, Fakrul Islam Tushar

    Abstract: Lesion segmentation from the surrounding skin is the first task for develo** automatic Computer-Aided Diagnosis of skin cancer. Variant features of lesion like uneven distribution of color, irregular shape, border and texture make this task challenging. The contribution of this paper is to present and compare two different approaches to skin lesion segmentation. The first approach uses watershed… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 4 pages, 4 figures, 4 tables, 4 sections

    MSC Class: 68U10 ACM Class: I.4.6; I.5.3

  43. arXiv:1904.00068  [pdf

    cs.CV

    Brain Tissue Segmentation Using NeuroNet With Different Pre-processing Techniques

    Authors: Fakrul Islam Tushar, Basel Alyafi, Md. Kamrul Hasan, Lavsen Dahal

    Abstract: Automatic segmentation of brain Magnetic Resonance Imaging (MRI) images is one of the vital steps for quantitative analysis of brain for further inspection. In this paper, NeuroNet has been adopted to segment the brain tissues (white matter (WM), grey matter (GM) and cerebrospinal fluid (CSF)) which uses Residual Network (ResNet) in encoder and Fully Convolution Network (FCN) in the decoder. To ac… ▽ More

    Submitted 29 March, 2019; originally announced April 2019.

    Comments: 3rd International Conference on Imaging, Vision & Pattern Recognition (IVPR)2019

  44. arXiv:1902.08994  [pdf, other

    cs.CV

    U-NetPlus: A Modified Encoder-Decoder U-Net Architecture for Semantic and Instance Segmentation of Surgical Instrument

    Authors: S. M. Kamrul Hasan, Cristian A. Linte

    Abstract: Conventional therapy approaches limit surgeons' dexterity control due to limited field-of-view. With the advent of robot-assisted surgery, there has been a paradigm shift in medical technology for minimally invasive surgery. However, it is very challenging to track the position of the surgical instruments in a surgical scene, and accurate detection & identification of surgical tools is paramount.… ▽ More

    Submitted 24 February, 2019; originally announced February 2019.

    Comments: 7 pages, 6 figures, IEEE conference submission

  45. arXiv:1810.04637  [pdf

    q-bio.QM cs.CV physics.med-ph

    Quantification of Trabeculae Inside the Heart from MRI Using Fractal Analysis

    Authors: Md. Kamrul Hasan, Fakrul Islam Tushar

    Abstract: Left ventricular non-compaction (LVNC) is a rare cardiomyopathy (CMP) that should be considered as a possible diagnosis because of its potential complications which are heart failure, ventricular arrhythmias, and embolic events. For analysis cardiac functionality, extracting information from the Left ventricular (LV) is already a broad field of Medical Imaging. Different algorithms and strategies… ▽ More

    Submitted 14 October, 2018; v1 submitted 30 September, 2018; originally announced October 2018.

  46. arXiv:1810.02600  [pdf

    cs.NI

    An Implementation Approach and Performance Analysis of Image Sensor Based Multilateral Indoor Localization and Navigation System

    Authors: Md. Shahjalal, Md. Tanvir Hossan, Moh. Khalid Hasan, Mostafa Zaman Chowdhury, Nam Tuan Le, Yeong Min Jang

    Abstract: Optical camera communication (OCC) exhibits considerable importance nowadays in various indoor camera based services such as smart home and robot-based automation. An android smart phone camera that is mounted on a mobile robot (MR) offers a uniform communication distance when the camera remains at the same level that can reduce the communication error rate. Indoor mobile robot navigation (MRN) is… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Journal ref: Wireless Communications and Mobile Computing, 2018

  47. Integrated RF/Optical Wireless Networks for Improving QoS in Indoor and Transportation Applications

    Authors: Mostafa Zaman Chowdhury, Md. Tanvir Hossan, Moh. Khalid Hasan, Yeong Min Jang

    Abstract: Communications based solely on radio frequency (RF) networks cannot provide adequate quality of service for the rapidly growing demands of wireless connectivity. Since devices operating in the optical spectrum do not interfere with those using the RF spectrum, wireless networks based on the optical spectrum can be added to existing RF networks to fulfill this demand. Hence, optical wireless commun… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Journal ref: Sept. 2018

  48. A New Vehicle Localization Scheme Based on Combined Optical Camera Communication and Photogrammetry

    Authors: Md. Tanvir Hossan, Mostafa Zaman Chowdhury, Moh. Khalid Hasan, Md. Shahjalal, Trang Nguyen, Nam Tuan Le, Yeong Min Jang

    Abstract: The demand for autonomous vehicles is increasing gradually owing to their enormous potential benefits. However, several challenges, such as vehicle localization, are involved in the development of autonomous vehicles. A simple and secure algorithm for vehicle positioning is proposed herein without massively modifying the existing transportation infrastructure. For vehicle localization, vehicles on… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Journal ref: Mobile Information Systems, vol. 2018, March 2018

  49. arXiv:1709.02414  [pdf

    cs.HC

    Automated Dyadic Data Recorder (ADDR) Framework and Analysis of Facial Cues in Deceptive Communication

    Authors: Tayan Sen, Md Kamrul Hasan, Zach Teicher, M. Ehsan Hoque

    Abstract: We developed an online framework that can automatically pair two crowd-sourced participants, prompt them to follow a research protocol, and record their audio and video on a remote server. The framework comprises two web applications: an Automatic Quality Gatekeeper for ensuring only high quality crowd-sourced participants are recruited for the study, and a Session Controller which directs partici… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

  50. Buildup of Speaking Skills in an Online Learning Community: A Network-Analytic Exploration

    Authors: Rasoul Shafipour, Raiyan Abdul Baten, Md Kamrul Hasan, Gourab Ghoshal, Gonzalo Mateos, Mohammed Ehsan Hoque

    Abstract: In this study, we explore peer-interaction effects in online networks on speaking skill development. In particular, we present an evidence for gradual buildup of skills in a small-group setting that has not been reported in the literature. We introduce a novel dataset of six online communities consisting of 158 participants focusing on improving their speaking skills. They video-record speeches fo… ▽ More

    Submitted 12 March, 2018; v1 submitted 6 July, 2017; originally announced July 2017.

    Journal ref: Palgrave Communications, vol. 4, June 2018