Skip to main content

Showing 1–50 of 56 results for author: Sarker, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05912  [pdf

    cs.CV cs.AI

    BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Develo** Division: Dhaka, BD

    Authors: Ovi Paul, Abu Bakar Siddik Nayem, Anis Sarker, Amin Ahsan Ali, M Ashraful Amin, AKM Mahbubur Rahman

    Abstract: Land Use Land Cover (LULC) analysis on satellite images using deep learning-based methods is significantly helpful in understanding the geography, socio-economic conditions, poverty levels, and urban sprawl in develo** countries. Recent works involve segmentation with LULC classes such as farmland, built-up areas, forests, meadows, water bodies, etc. Training deep learning methods on satellite i… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 26 pages, 15 figures and 12 tables

  2. arXiv:2405.19519  [pdf, other

    cs.CL cs.AI

    Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data

    Authors: Sudeshna Das, Yao Ge, Yuting Guo, Swati Rajwal, JaMor Hairston, Jeanne Powell, Drew Walker, Snigdha Peddireddy, Sahithi Lakamana, Selen Bozkurt, Matthew Reyna, Reza Sameni, Yunyu Xiao, Sangmi Kim, Rasheeta Chandler, Natalie Hernandez, Danielle Mowery, Rachel Wightman, Jennifer Love, Anthony Spadaro, Jeanmarie Perrone, Abeed Sarker

    Abstract: Retrieval augmented generation (RAG) provides the capability to constrain generative model outputs, and mitigate the possibility of hallucination, by providing relevant in-context text. The number of tokens a generative large language model (LLM) can incorporate as context is finite, thus limiting the volume of knowledge from which to generate an answer. We propose a two-layer RAG framework for qu… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.18015  [pdf, other

    cs.CL

    MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction

    Authors: Xiang Dai, Sarvnaz Karimi, Abeed Sarker, Ben Hachey, Cecile Paris

    Abstract: Objective. Active adverse event surveillance monitors Adverse Drug Events (ADE) from different data sources, such as electronic health records, medical literature, social media and search engine logs. Over years, many datasets are created, and shared tasks are organised to facilitate active adverse event surveillance. However, most-if not all-datasets or shared tasks focus on extracting ADEs from… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Under review; feedback welcome

  4. arXiv:2405.06145  [pdf, other

    cs.CL cs.AI cs.LG

    Reddit-Impacts: A Named Entity Recognition Dataset for Analyzing Clinical and Social Effects of Substance Use Derived from Social Media

    Authors: Yao Ge, Sudeshna Das, Karen O'Connor, Mohammed Ali Al-Garadi, Graciela Gonzalez-Hernandez, Abeed Sarker

    Abstract: Substance use disorders (SUDs) are a growing concern globally, necessitating enhanced understanding of the problem and its trends through data-driven research. Social media are unique and important sources of information about SUDs, particularly since the data in such sources are often generated by people with lived experiences. In this paper, we introduce Reddit-Impacts, a challenging Named Entit… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure, 4 tables

  5. arXiv:2405.05204  [pdf

    cs.CL

    CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation

    Authors: Drew Walker, Annie Thorne, Sudeshna Das, Jennifer Love, Hannah LF Cooper, Melvin Livingston III, Abeed Sarker

    Abstract: Objective: To detect and classify features of stigmatizing and biased language in intensive care electronic health records (EHRs) using natural language processing techniques. Materials and Methods: We first created a lexicon and regular expression lists from literature-driven stem words for linguistic features of stigmatizing patient labels, doubt markers, and scare quotes within EHRs. The lexico… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 28 pages, 3 figures, 4 tables. 5 Appendices

  6. arXiv:2403.19031  [pdf

    cs.CL cs.AI cs.LG

    Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data

    Authors: Yuting Guo, Anthony Ovadje, Mohammed Ali Al-Garadi, Abeed Sarker

    Abstract: Large language models (LLMs) have demonstrated remarkable success in NLP tasks. However, there is a paucity of studies that attempt to evaluate their performances on social media-based health-related natural language processing tasks, which have traditionally been difficult to achieve high scores in. We benchmarked one supervised classic machine learning model based on Support Vector Machines (SVM… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  7. arXiv:2403.15721  [pdf, other

    cs.DC

    Design and Implementation of an Analysis Pipeline for Heterogeneous Data

    Authors: Arup Kumar Sarker, Aymen Alsaadi, Niranda Perera, Mills Staylor, Gregor von Laszewski, Matteo Turilli, Ozgur Ozan Kilic, Mikhail Titov, Andre Merzky, Shantenu Jha, Geoffrey Fox

    Abstract: Managing and preparing complex data for deep learning, a prevalent approach in large-scale data science can be challenging. Data transfer for model training also presents difficulties, impacting scientific fields like genomics, climate modeling, and astronomy. A large-scale solution like Google Pathways with a distributed execution environment for deep learning models exists but is proprietary. In… ▽ More

    Submitted 7 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: 14 pages, 16 figures, 2 tables

    ACM Class: H.2.4; D.2.7; D.2.2

  8. arXiv:2403.00821  [pdf, other

    cs.CL cs.LG cs.SI

    Social Media as a Sensor: Analyzing Twitter Data for Breast Cancer Medication Effects Using Natural Language Processing

    Authors: Seibi Kobara, Alireza Rafiei, Masoud Nateghi, Selen Bozkurt, Rishikesan Kamaleswaran, Abeed Sarker

    Abstract: Breast cancer is a significant public health concern and is the leading cause of cancer-related deaths among women. Despite advances in breast cancer treatments, medication non-adherence remains a major problem. As electronic health records do not typically capture patient-reported outcomes that may reveal information about medication-related experiences, social media presents an attractive resour… ▽ More

    Submitted 26 February, 2024; originally announced March 2024.

  9. arXiv:2402.01826  [pdf, other

    cs.CL cs.AI

    Leveraging Large Language Models for Analyzing Blood Pressure Variations Across Biological Sex from Scientific Literature

    Authors: Yuting Guo, Seyedeh Somayyeh Mousavi, Reza Sameni, Abeed Sarker

    Abstract: Hypertension, defined as blood pressure (BP) that is above normal, holds paramount significance in the realm of public health, as it serves as a critical precursor to various cardiovascular diseases (CVDs) and significantly contributes to elevated mortality rates worldwide. However, many existing BP measurement technologies and standards might be biased because they do not consider clinical outcom… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  10. arXiv:2402.01598  [pdf, other

    q-bio.QM cs.LG stat.AP

    Learning from Two Decades of Blood Pressure Data: Demography-Specific Patterns Across 75 Million Patient Encounters

    Authors: Seyedeh Somayyeh Mousavi, Yuting Guo, Abeed Sarker, Reza Sameni

    Abstract: Hypertension is a global health concern with an increasing prevalence, underscoring the need for effective monitoring and analysis of blood pressure (BP) dynamics. We analyzed a substantial BP dataset comprising 75,636,128 records from 2,054,462 unique patients collected between 2000 and 2022 at Emory Healthcare in Georgia, USA, representing a demographically diverse population. We examined and co… ▽ More

    Submitted 23 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  11. arXiv:2308.10783  [pdf, other

    cs.CL cs.LG

    Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

    Authors: Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

    Abstract: The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the rec… ▽ More

    Submitted 4 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at LREC-COLING 2024. Zero-Shot Prompting, Few-Shot Prompting, LLMs, Comparative Study, Fine-tuned Models, Bangla, Sentiment Analysis

    MSC Class: 68T50 ACM Class: I.2.7

  12. arXiv:2307.01394  [pdf, ps, other

    cs.DC cs.AI cs.IR cs.LG

    In-depth Analysis On Parallel Processing Patterns for High-Performance Dataframes

    Authors: Niranda Perera, Arup Kumar Sarker, Mills Staylor, Gregor von Laszewski, Kaiying Shan, Supun Kamburugamuve, Chathura Widanage, Vibhatha Abeykoon, Thejaka Amila Kanewela, Geoffrey Fox

    Abstract: The Data Science domain has expanded monumentally in both research and industry communities during the past decade, predominantly owing to the Big Data revolution. Artificial Intelligence (AI) and Machine Learning (ML) are bringing more complexities to data engineering applications, which are now integrated into data processing pipelines to process terabytes of data. Typically, a significant amoun… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Report number: FGCS-D-23-00577R1

  13. arXiv:2304.04314  [pdf, ps, other

    cs.IT eess.SP

    RIS-aided Mixed RF-FSO Wireless Networks: Secrecy Performance Analysis with Simultaneous Eavesdrop**

    Authors: Md. Mijanur Rahman, A. S. M. Badrudduza, Noor Ahmad Sarker, Md. Ibrahim, Imran Shafique Ansari

    Abstract: The appearance of sixth-generation networks has resulted in the proposal of several solutions to tackle signal loss. One of these solutions is the utilization of reconfigurable intelligent surfaces (RIS), which can reflect or refract signals as required. This integration offers significant potential to improve the coverage area from the sender to the receiver. In this paper, we present a comprehen… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: No comments

  14. arXiv:2301.11806  [pdf, other

    cs.CV cs.SE

    PCV: A Point Cloud-Based Network Verifier

    Authors: Arup Kumar Sarker, Farzana Yasmin Ahmad, Matthew B. Dwyer

    Abstract: 3D vision with real-time LiDAR-based point cloud data became a vital part of autonomous system research, especially perception and prediction modules use for object classification, segmentation, and detection. Despite their success, point cloud-based network models are vulnerable to multiple adversarial attacks, where the certain factor of changes in the validation set causes significant performan… ▽ More

    Submitted 30 January, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 11 pages, 12 figures

    ACM Class: D.2.2; D.2.3; D.2.4; D.2.5; I.2.10; I.5.4

  15. arXiv:2301.07896  [pdf, other

    cs.DC cs.DB

    Supercharging Distributed Computing Environments For High Performance Data Engineering

    Authors: Niranda Perera, Kaiying Shan, Supun Kamburugamuwe, Thejaka Amila Kanewela, Chathura Widanage, Arup Sarker, Mills Staylor, Tianle Zhong, Vibhatha Abeykoon, Geoffrey Fox

    Abstract: The data engineering and data science community has embraced the idea of using Python & R dataframes for regular applications. Driven by the big data revolution and artificial intelligence, these applications are now essential in order to process terabytes of data. They can easily exceed the capabilities of a single machine, but also demand significant developer time & effort. Therefore it is esse… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  16. arXiv:2301.04591  [pdf, ps, other

    cs.CR

    MVAM: Multi-variant Attacks on Memory for IoT Trust Computing

    Authors: Arup Kumar Sarker, Md Khairul Islam, Yuan Tian

    Abstract: With the significant development of the Internet of Things and low-cost cloud services, the sensory and data processing requirements of IoT systems are continually going up. TrustZone is a hardware-protected Trusted Execution Environment (TEE) for ARM processors specifically designed for IoT handheld systems. It provides memory isolation techniques to protect trusted application data from being ex… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: 12 pages, 6 figures, 6 code blocks

    ACM Class: F.2.2; I.2.7

  17. arXiv:2212.13732  [pdf, ps, other

    cs.DC

    Hybrid Cloud and HPC Approach to High-Performance Dataframes

    Authors: Kaiying Shan, Niranda Perera, Damitha Lenadora, Tianle Zhong, Arup Sarker, Supun Kamburugamuve, Thejaka Amila Kanewela, Chathura Widanage, Geoffrey Fox

    Abstract: Data pre-processing is a fundamental component in any data-driven application. With the increasing complexity of data processing operations and volume of data, Cylon, a distributed dataframe system, is developed to facilitate data processing both as a standalone application and as a library, especially for Python applications. While Cylon shows promising performance results, we experienced difficu… ▽ More

    Submitted 29 December, 2022; v1 submitted 28 December, 2022; originally announced December 2022.

  18. arXiv:2212.12454  [pdf

    cs.CL

    Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

    Authors: Yuting Guo, Swati Rajwal, Sahithi Lakamana, Chia-Chun Chiang, Paul C. Menell, Adnan H. Shahid, Yi-Chieh Chen, Nikita Chhabra, Wan-Ju Chao, Chieh-Ju Chao, Todd J. Schwedt, Imon Banerjee, Abeed Sarker

    Abstract: Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text cla… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted by AMIA 2023 Informatics Summit

  19. arXiv:2211.10443  [pdf

    cs.CL cs.AI

    Social media mining for toxicovigilance of prescription medications: End-to-end pipeline, challenges and future work

    Authors: Abeed Sarker

    Abstract: Substance use, substance use disorder, and overdoses related to substance use are major public health problems globally and in the United States. A key aspect of addressing these problems from a public health standpoint is improved surveillance. Traditional surveillance systems are laggy, and social media are potentially useful sources of timely data. However, mining knowledge from social media is… ▽ More

    Submitted 2 September, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  20. arXiv:2210.01849  [pdf, other

    cs.SI cs.DS math.AT

    Link Partitioning on Simplicial Complexes Using Higher-Order Laplacians

    Authors: Xinyi Wu, Arnab Sarker, Ali Jadbabaie

    Abstract: Link partitioning is a popular approach in network science used for discovering overlap** communities by identifying clusters of strongly connected links. Current link partitioning methods are specifically designed for networks modelled by graphs representing pairwise relationships. Therefore, these methods are incapable of utilizing higher-order information about group interactions in network d… ▽ More

    Submitted 10 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted to 22nd IEEE International Conference on Data Mining (ICDM 2022). Fixed some typos in v1

  21. arXiv:2207.11335  [pdf, other

    cs.SI math.AT stat.AP

    Generalizing Homophily to Simplicial Complexes

    Authors: Arnab Sarker, Natalie Northrup, Ali Jadbabaie

    Abstract: Group interactions occur frequently in social settings, yet their properties beyond pairwise relationships in network models remain unexplored. In this work, we study homophily, the nearly ubiquitous phenomena wherein similar individuals are more likely than random to form connections with one another, and define it on simplicial complexes, a generalization of network models that goes beyond dyadi… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: Preprint submitted to International Conference on Complex Networks and their Applications

  22. arXiv:2204.14081  [pdf

    cs.CL cs.LG

    Few-shot learning for medical text: A systematic review

    Authors: Yao Ge, Yuting Guo, Yuan-Chi Yang, Mohammed Ali Al-Garadi, Abeed Sarker

    Abstract: Objective: Few-shot learning (FSL) methods require small numbers of labeled instances for training. As many medical topics have limited annotated textual data in practical settings, FSL-based natural language processing (NLP) methods hold substantial promise. We aimed to conduct a systematic review to explore the state of FSL methods for medical NLP. Materials and Methods: We searched for articles… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  23. arXiv:2201.04960  [pdf, other

    stat.ML cs.LG stat.AP

    Unifying Epidemic Models with Mixtures

    Authors: Arnab Sarker, Ali Jadbabaie, Devavrat Shah

    Abstract: The COVID-19 pandemic has emphasized the need for a robust understanding of epidemic models. Current models of epidemics are classified as either mechanistic or non-mechanistic: mechanistic models make explicit assumptions on the dynamics of disease, whereas non-mechanistic models make assumptions on the form of observed time series. Here, we introduce a simple mixture-based model which bridges th… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

  24. arXiv:2112.07723  [pdf, other

    cs.RO cs.CV

    Autonomous Navigation System from Simultaneous Localization and Map**

    Authors: Micheal Caracciolo, Owen Casciotti, Christopher Lloyd, Ernesto Sola-Thomas, Matthew Weaver, Kyle Bielby, Md Abdul Baset Sarker, Masudul H. Imtiaz

    Abstract: This paper presents the development of a Simultaneous Localization and Map** (SLAM) based Autonomous Navigation system. The motivation for this study was to find a solution for navigating interior spaces autonomously. Interior navigation is challenging as it can be forever evolving. Solving this issue is necessary for multitude of services, like cleaning, the health industry, and in manufacturin… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  25. arXiv:2109.05171  [pdf, other

    cs.IT eess.SP

    On the Intercept Probability and Secure Outage Analysis of Mixed ($α$-$κ$-$μ$)-shadowed and Málaga Turbulent Model

    Authors: N. A. Sarker, A. S. M. Badrudduza, S. M. R. Islam, S. H. Islam, M. K. Kundu, I. S. Ansari, K. -S. Kwak

    Abstract: This work deals with the secrecy performance analysis of a dual-hop RF-FSO DF relaying network composed of a source, a relay, a destination, and an eavesdropper. We assume the eavesdropper is located close to the destination and overhears the relay's transmitted optical signal. The RF and FSO links undergo ($α$-$κ$-$μ$)-shadowed fading and unified Málaga turbulence with pointing error. The secrecy… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  26. arXiv:2108.02091  [pdf, other

    cs.SI math.AT

    Which Bridges Are Weak Ties? Algebraic Topological Insights on Network Structure and Tie Strength

    Authors: Arnab Sarker, Jean-Baptiste Seby, Austin R. Benson, Ali Jadbabaie

    Abstract: Bridging relationships between individuals situated in different parts of a social network are important conduits for information and resources in social and organizational settings. Dyadic tie strength has often been used as an indicator for whether a relationship is bridging, under the assumption that bridging ties are always weak ties. However, recent empirical evidence suggests that bridging t… ▽ More

    Submitted 5 January, 2023; v1 submitted 4 August, 2021; originally announced August 2021.

  27. A Novel Disaster Image Dataset and Characteristics Analysis using Attention Model

    Authors: Fahim Faisal Niloy, Arif, Abu Bakar Siddik Nayem, Anis Sarker, Ovi Paul, M. Ashraful Amin, Amin Ahsan Ali, Moinul Islam Zaber, AKM Mahbubur Rahman

    Abstract: The advancement of deep learning technology has enabled us to develop systems that outperform any other classification technique. However, success of any empirical system depends on the quality and diversity of the data available to train the proposed system. In this research, we have carefully accumulated a relatively challenging dataset that contains images collected from various sources for thr… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: ICPR 2020

  28. arXiv:2106.06951  [pdf, other

    cs.IT eess.SP

    Effects of Eavesdropper on the Performance of Mixed η-μ and DGG Cooperative Relaying System

    Authors: Noor Ahmed Sarker, A. S. M. Badrudduza, Milton Kumar Kundu, Imran Shafique Ansari

    Abstract: Free-space optical (FSO) channel offers line-of-sight wireless communication with high data rates and high secrecy utilizing unlicensed optical spectrum and also paves the way to the solution of the last-mile access problem. Since atmospheric turbulence is a hindrance to an enhanced secrecy performance, the mixed radio frequency (RF)-FSO system is gaining enormous research interest in recent days.… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  29. arXiv:2104.14029  [pdf, other

    cs.CV cs.AI cs.LG

    Reducing Risk and Uncertainty of Deep Neural Networks on Diagnosing COVID-19 Infection

    Authors: Krishanu Sarker, Sharbani Pandit, Anupam Sarker, Saeid Belkasim, Shihao Ji

    Abstract: Effective and reliable screening of patients via Computer-Aided Diagnosis can play a crucial part in the battle against COVID-19. Most of the existing works focus on develo** sophisticated methods yielding high detection performance, yet not addressing the issue of predictive uncertainty. In this work, we introduce uncertainty estimation to detect confusing cases for expert referral to address t… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: AAAI, TAIH workshop, 2021

  30. arXiv:2104.07849  [pdf, other

    cs.RO

    Task Space Planning with Complementarity Constraint-based Obstacle Avoidance

    Authors: Anirban Sinha, Anik Sarker, Nilanjan Chakraborty

    Abstract: In this paper, we present a task space-based local motion planner that incorporates collision avoidance and constraints on end-effector motion during the execution of a task. Our key technical contribution is the development of a novel kinematic state evolution model of the robot where the collision avoidance is encoded as a complementarity constraint. We show that the kinematic state evolution wi… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  31. arXiv:2103.16761  [pdf, ps, other

    cs.IT

    Blockwise Phase Rotation-Aided Analog Transmit Beamforming for 5G mmWave Systems

    Authors: Md. Abdul Latif Sarker, Igbafe Orikumhi, Dong Seog Han, Sunwoo Kim

    Abstract: In this letter, we propose a blockwise phase rotation-aided analog transmit beamforming (BPR-ATB) scheme to improve the spectral efficiency and the bit-error-rate (BER) performance in millimeter wave (mmWave) communication systems. Due to the phase angle optimization issues of the conventional analog beamforming, we design the BPR-ATB for reducing the rotated beamspace of the equivalent channel an… ▽ More

    Submitted 27 July, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 5 pages, 3 figures, 2 tables, Submit to IEEE Wireless Communication Letters

  32. arXiv:2011.12847  [pdf, other

    cs.CV cs.LG

    Deep-learning coupled with novel classification method to classify the urban environment of the develo** world

    Authors: Qianwei Cheng, AKM Mahbubur Rahman, Anis Sarker, Abu Bakar Siddik Nayem, Ovi Paul, Amin Ahsan Ali, M Ashraful Amin, Ryosuke Shibasaki, Moinul Zaber

    Abstract: Rapid globalization and the interdependence of humanity that engender tremendous in-flow of human migration towards the urban spaces. With advent of high definition satellite images, high resolution data, computational methods such as deep neural network, capable hardware; urban planning is seeing a paradigm shift. Legacy data on urban environments are now being complemented with high-volume, high… ▽ More

    Submitted 7 January, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: Accepted paper at 2nd International Conference on Signal Processing and Machine Learning (SIGML 2021); 20 pages, 7 figures, 1 table

  33. arXiv:2010.10192  [pdf, other

    cs.MA cs.AI

    A Particle Swarm Inspired Approach for Continuous Distributed Constraint Optimization Problems

    Authors: Moumita Choudhury, Amit Sarker, Md. Mosaddek Khan, William Yeoh

    Abstract: Distributed Constraint Optimization Problems (DCOPs) are a widely studied framework for coordinating interactions in cooperative multi-agent systems. In classical DCOPs, variables owned by agents are assumed to be discrete. However, in many applications, such as target tracking or sleep scheduling in sensor networks, continuous-valued variables are more suitable than discrete ones. To better model… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  34. arXiv:2008.10736  [pdf

    cs.CV cs.LG eess.IV

    LULC Segmentation of RGB Satellite Image Using FCN-8

    Authors: Abu Bakar Siddik Nayem, Anis Sarker, Ovi Paul, Amin Ali, Md. Ashraful Amin, AKM Mahbubur Rahman

    Abstract: This work presents use of Fully Convolutional Network (FCN-8) for semantic segmentation of high-resolution RGB earth surface satel-lite images into land use land cover (LULC) categories. Specically, we propose a non-overlap** grid-based approach to train a Fully Convo-lutional Network (FCN-8) with vgg-16 weights to segment satellite im-ages into four (forest, built-up, farmland and water) classe… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Accepted paper at 3rd SLAAI-International Conference on Artificial Intelligence; 13 pages, 7 figures, 3 tables

  35. arXiv:2006.12687  [pdf, other

    eess.SY cs.LG math.OC stat.ML

    Accurate Parameter Estimation for Risk-aware Autonomous Systems

    Authors: Arnab Sarker, Peter Fisher, Joseph E. Gaudio, Anuradha M. Annaswamy

    Abstract: Analysis and synthesis of safety-critical autonomous systems are carried out using models which are often dynamic. Two central features of these dynamic systems are parameters and unmodeled dynamics. This paper addresses the use of a spectral lines-based approach for estimating parameters of the dynamic model of an autonomous system. Existing literature has treated all unmodeled components of the… ▽ More

    Submitted 16 March, 2022; v1 submitted 22 June, 2020; originally announced June 2020.

  36. arXiv:2005.00072  [pdf, other

    econ.EM cs.LG stat.AP

    Two Burning Questions on COVID-19: Did shutting down the economy help? Can we (partially) reopen the economy without risking the second wave?

    Authors: Anish Agarwal, Abdullah Alomar, Arnab Sarker, Devavrat Shah, Dennis Shen, Cindy Yang

    Abstract: As we reach the apex of the COVID-19 pandemic, the most pressing question facing us is: can we even partially reopen the economy without risking a second wave? We first need to understand if shutting down the economy helped. And if it did, is it possible to achieve similar gains in the war against the pandemic while partially opening up the economy? To do so, it is critical to understand the effec… ▽ More

    Submitted 10 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  37. arXiv:2002.12427  [pdf, other

    cs.AI cs.MA

    C-CoCoA: A Continuous Cooperative Constraint Approximation Algorithm to Solve Functional DCOPs

    Authors: Amit Sarker, Abdullahil Baki Arif, Moumita Choudhury, Md. Mosaddek Khan

    Abstract: Distributed Constraint Optimization Problems (DCOPs) have been widely used to coordinate interactions (i.e. constraints) in cooperative multi-agent systems. The traditional DCOP model assumes that variables owned by the agents can take only discrete values and constraints' cost functions are defined for every possible value assignment of a set of variables. While this formulation is often reasonab… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 7 pages, 4 figures

  38. arXiv:1909.13184  [pdf, ps, other

    cs.CL cs.SI

    Towards Automatic Bot Detection in Twitter for Health-related Tasks

    Authors: Anahita Davoudi, Ari Z. Klein, Abeed Sarker, Graciela Gonzalez-Hernandez

    Abstract: With the increasing use of social media data for health-related research, the credibility of the information from this source has been questioned as the posts may originate from automated accounts or "bots". While automatic bot detection approaches have been proposed, there are none that have been evaluated on users posting health-related information. In this paper, we extend an existing bot detec… ▽ More

    Submitted 28 September, 2019; originally announced September 2019.

  39. arXiv:1904.05308  [pdf

    cs.CL cs.IR cs.LG

    Deep Neural Networks Ensemble for Detecting Medication Mentions in Tweets

    Authors: Davy Weissenbacher, Abeed Sarker, Ari Klein, Karen O'Connor, Arjun Magge Ranganatha, Graciela Gonzalez-Hernandez

    Abstract: Objective: After years of research, Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step to incorporating Twitter data in pharmacoepidemiological research is to automatically recognize medication mentions in tweets. Given that lexical searches for medication names may fail due to misspellings or ambi… ▽ More

    Submitted 30 September, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: This is a pre-copy-editing, author-produced PDF of an article accepted for publication in JAMIA following peer review. The definitive publisher-authenticated version is "D. Weissenbacher, A. Sarker, A. Klein, K. O'Connor, A. Magge, G. Gonzalez-Hernandez, Deep neural networks ensemble for detecting medication mentions in tweets, Journal of the American Medical Informatics Association, ocz156, 2019"

    Journal ref: Journal of the American Medical Informatics Association, ocz156, 2019

  40. A Review of Sensing and Communication, Human Factors, and Controller Aspects for Information-Aware Connected and Automated Vehicles

    Authors: Ankur Sarker, Haiying Shen, Mizanur Rahman, Mashrur Chowdhury, Kakan Dey, Fangjian Li, Yue Wang, Husnu S. Narman

    Abstract: Information-aware connected and automated vehicles (CAVs) have drawn great attention in recent years due to its potentially significant positive impacts on roadway safety and operational efficiency. In this paper, we conduct an in-depth review of three basic and key interrelated aspects of a CAV: sensing and communication technologies, human factors, and information-aware controller design. First,… ▽ More

    Submitted 20 March, 2019; originally announced March 2019.

    Comments: IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS

  41. Automatically Detecting Self-Reported Birth Defect Outcomes on Twitter for Large-scale Epidemiological Research

    Authors: Ari Z. Klein, Abeed Sarker, Davy Weissenbacher, Graciela Gonzalez-Hernandez

    Abstract: In recent work, we identified and studied a small cohort of Twitter users whose pregnancies with birth defect outcomes could be observed via their publicly available tweets. Exploiting social media's large-scale potential to complement the limited methods for studying birth defects, the leading cause of infant mortality, depends on the further development of automatic methods. The primary objectiv… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Journal ref: npj Digital Medicine. 2019;2:96

  42. An unsupervised and customizable misspelling generator for mining noisy health-related text sources

    Authors: Abeed Sarker, Graciela Gonzalez-Hernandez

    Abstract: In this paper, we present a customizable datacentric system that automatically generates common misspellings for complex health-related terms. The spelling variant generator relies on a dense vector model learned from large unlabeled text, which is used to find semantically close terms to the original/seed keyword, followed by the filtering of terms that are lexically dissimilar beyond a given thr… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

    Journal ref: J Biomed Inform. 2018 Dec;88:98-107. Epub 2018 Nov 13

  43. Distortion-free Golden-Hadamard Codebook Design for MISO Systems

    Authors: Md. Abdul Latif Sarker, Md. Fazlul Kader, Moon Ho Lee, Dong Seog Han

    Abstract: In this letter, a novel Golden-Hadamard codebook (GHC) scheme is proposed to improve the performance of the traditional precoded Alamouti coding for multiple-input and single-output systems. Although the traditional discrete Fourier transform codebook (DFTC) performs satisfactorily with Alamouti coding and offers numerous benefits for the Rayleigh fading channel, this scheme inherently generates h… ▽ More

    Submitted 10 October, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: 4 pages,4 figures,2 table, Published (Early Access) in IEEE Communications Letters

  44. arXiv:1706.08162  [pdf, other

    cs.CL

    Automated text summarisation and evidence-based medicine: A survey of two domains

    Authors: Abeed Sarker, Diego Molla, Cecile Paris

    Abstract: The practice of evidence-based medicine (EBM) urges medical practitioners to utilise the latest research evidence when making clinical decisions. Because of the massive and growing volume of published research on various medical topics, practitioners often find themselves overloaded with information. As such, natural language processing research has recently commenced exploring techniques for perf… ▽ More

    Submitted 25 June, 2017; originally announced June 2017.

  45. arXiv:1702.02261  [pdf, ps, other

    cs.CL

    Social media mining for identification and exploration of health-related information from pregnant women

    Authors: Pramod Bharadwaj Chandrashekar, Arjun Magge, Abeed Sarker, Graciela Gonzalez

    Abstract: Widespread use of social media has led to the generation of substantial amounts of information about individuals, including health-related information. Social media provides the opportunity to study health-related information about selected population groups who may be of interest for a particular study. In this paper, we explore the possibility of utilizing social media to perform targeted data c… ▽ More

    Submitted 7 February, 2017; originally announced February 2017.

    Comments: 9 pages

  46. arXiv:1612.02145  [pdf

    cs.IT

    A Unified Linear Precoding Design for Multi-user MIMO Systems

    Authors: Md. Abdul Latif Sarker

    Abstract: We address the problem of the bit error rate (BER) performance gap between the sub-optimal and optimal linear precoder (LP) for a multiuser (MU) multiple input and multiple output (MIMO) broadcast systems in this paper. Particularly, mobile users suffer noise enhancement effect due to a sub-optimal LP that can be suppressed by an optimal LP matrix. A sub-optimal LP matrix such as a linear zero-for… ▽ More

    Submitted 7 December, 2016; originally announced December 2016.

    Comments: 4

  47. arXiv:1610.02567  [pdf

    cs.CL cs.CY

    Mining the Web for Pharmacovigilance: the Case Study of Duloxetine and Venlafaxine

    Authors: Abbas Chokor, Abeed Sarker, Graciela Gonzalez

    Abstract: Adverse reactions caused by drugs following their release into the market are among the leading causes of death in many countries. The rapid growth of electronically available health related information, and the ability to process large volumes of them automatically, using natural language processing (NLP) and machine learning algorithms, have opened new opportunities for pharmacovigilance. Survey… ▽ More

    Submitted 8 October, 2016; originally announced October 2016.

    Comments: Masters project report

  48. arXiv:1609.00775  [pdf

    cs.IT

    An Error Covariance Splitting Technique for Multi-User MIMO Interference Environment

    Authors: Md. Abdul Latif Sarker

    Abstract: This paper investigates an error covariance matrix splitting technique for multiuser multiple input and multiple output (MIMO) interference downlink channel. Most of the related work has thus far considered the traditional error covariance matrix which has not been well-shaped for maximizing the system capacity. Thus, we split and propose a new iterative error covariance matrix to mitigate the sys… ▽ More

    Submitted 2 September, 2016; originally announced September 2016.

  49. arXiv:1606.07137  [pdf, other

    cs.AI cs.CL cs.IR

    Automated Extraction of Number of Subjects in Randomised Controlled Trials

    Authors: Abeed Sarker

    Abstract: We present a simple approach for automatically extracting the number of subjects involved in randomised controlled trials (RCT). Our approach first applies a set of rule-based techniques to extract candidate study sizes from the abstracts of the articles. Supervised classification is then performed over the candidates with support vector machines, using a small set of lexical, structural, and cont… ▽ More

    Submitted 22 June, 2016; originally announced June 2016.

    Comments: unpublished

  50. arXiv:1509.01880  [pdf

    cs.IT

    Mean Capacity of Spatially Semi-Correlated MIMO Fading Channel

    Authors: Md. Abdul Latif Sarker, Moon Ho Lee

    Abstract: This study investigates the mean capacity of multiple-input multiple-output (MIMO) systems for spatially semi-correlated flat fading channels. In reality, the capacity degrades dramatic due to the channel covariance (CC) when correlations exist at the transmitter or receiver or on both sides. Most existing works have so far considered the traditional channel covariance matrices that have not been… ▽ More

    Submitted 23 September, 2016; v1 submitted 6 September, 2015; originally announced September 2015.

    Comments: 4