Skip to main content

Showing 1–50 of 73 results for author: Saad, A

Searching in archive cs. Search in all archives.
.
  1. GenSQL: A Probabilistic Programming System for Querying Generative Models of Database Tables

    Authors: Mathieu Huot, Matin Ghavami, Alexander K. Lew, Ulrich Schaechtle, Cameron E. Freer, Zane Shelby, Martin C. Rinard, Feras A. Saad, Vikash K. Mansinghka

    Abstract: This article presents GenSQL, a probabilistic programming system for querying probabilistic generative models of database tables. By augmenting SQL with only a few key primitives for querying probabilistic models, GenSQL enables complex Bayesian inference workflows to be concisely implemented. GenSQL's query planner rests on a unified programmatic interface for interacting with probabilistic model… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 54 pages, 30 figures, 1 table, published at PLDI 2024

  2. arXiv:2406.12658  [pdf, other

    cs.CV cs.LG

    Federated Learning with a Single Shared Image

    Authors: Sunny Soni, Aaqib Saeed, Yuki M. Asano

    Abstract: Federated Learning (FL) enables multiple machines to collaboratively train a machine learning model without sharing of private training data. Yet, especially for heterogeneous models, a key bottleneck remains the transfer of knowledge gained from each client model with the server. One popular method, FedDF, uses distillation to tackle this task with the use of a common, shared dataset on which pre… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 8 Pages, 3 Figures, Appendix 4 Pages, CVPRW 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 7782-7790

  3. arXiv:2406.00696  [pdf, ps, other

    cs.CV

    Bilinear-Convolutional Neural Network Using a Matrix Similarity-based Joint Loss Function for Skin Disease Classification

    Authors: Belal Ahmad, Mohd Usama, Tanvir Ahmad, Adnan Saeed, Shabnam Khatoon, Long Hu

    Abstract: In this study, we proposed a model for skin disease classification using a Bilinear Convolutional Neural Network (BCNN) with a Constrained Triplet Network (CTN). BCNN can capture rich spatial interactions between features in image data. This computes the outer product of feature vectors from two different CNNs by a bilinear pooling. The resulting features encode second-order statistics, enabling t… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 16 pages, 11 figures, 2 tables

  4. arXiv:2403.10561   

    cs.LG cs.AI

    A collection of the accepted papers for the Human-Centric Representation Learning workshop at AAAI 2024

    Authors: Dimitris Spathis, Aaqib Saeed, Ali Etemad, Sana Tonekaboni, Stefanos Laskaridis, Shohreh Deldari, Chi Ian Tang, Patrick Schwab, Shyam Tailor

    Abstract: This non-archival index is not complete, as some accepted papers chose to opt-out of inclusion. The list of all accepted papers is available on the workshop website.

    Submitted 14 March, 2024; originally announced March 2024.

  5. arXiv:2402.16486  [pdf, other

    cs.CV cs.AI

    Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification

    Authors: Ahmad Saeed, Haasha Bin Atif, Usman Habib, Mohsin Bilal

    Abstract: Precise aircraft recognition in low-resolution remote sensing imagery is a challenging yet crucial task in aviation, especially combat identification. This research addresses this problem with a novel, scalable, and AI-driven solution. The primary hurdle in combat identification in remote sensing imagery is the accurate recognition of Novel/Unknown types of aircraft in addition to Known types. Tra… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2401.14211  [pdf, other

    cs.LG cs.DC

    Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

    Authors: Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi, Nirvana Meratnia

    Abstract: Federated Learning (FL) is a promising technique for the collaborative training of deep neural networks across multiple devices while preserving data privacy. Despite its potential benefits, FL is hindered by excessive communication costs due to repeated server-client communication during training. To address this challenge, model compression techniques, such as sparsification and weight clusterin… ▽ More

    Submitted 25 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures, Accepted on ICASSP 2024

  7. arXiv:2401.14107  [pdf, other

    cs.LG eess.SP

    Learning under Label Noise through Few-Shot Human-in-the-Loop Refinement

    Authors: Aaqib Saeed, Dimitris Spathis, Jungwoo Oh, Edward Choi, Ali Etemad

    Abstract: Wearable technologies enable continuous monitoring of various health metrics, such as physical activity, heart rate, sleep, and stress levels. A key challenge with wearable data is obtaining quality labels. Unlike modalities like video where the videos themselves can be effectively used to label objects or events, wearable data do not contain obvious cues about the physical manifestation of the us… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  8. Hands-On Robotics: Enabling Communication Through Direct Gesture Control

    Authors: Max Pascher, Alia Saad, Jonathan Liebers, Roman Heger, Jens Gerken, Stefan Schneegass, Uwe Gruene

    Abstract: Effective Human-Robot Interaction (HRI) is fundamental to seamlessly integrating robotic systems into our daily lives. However, current communication modes require additional technological interfaces, which can be cumbersome and indirect. This paper presents a novel approach, using direct motion-based communication by moving a robot's end effector. Our strategy enables users to communicate with a… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

  9. arXiv:2312.07981  [pdf

    cs.LG cs.SD eess.SP

    Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation

    Authors: Haiming Yi, Lei Hou, Yuhong **, Nasser A. Saeed, Ali Kandil, Hao Duan

    Abstract: Diffusion models have demonstrated powerful data generation capabilities in various research fields such as image generation. However, in the field of vibration signal generation, the criteria for evaluating the quality of the generated signal are different from that of image generation and there is a fundamental difference between them. At present, there is no research on the ability of diffusion… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Journal ref: Mechanical Systems and Signal Processing, 2024, 216: 111481

  10. arXiv:2311.17299  [pdf, other

    cs.LG cs.CV cs.DC

    Federated Fine-Tuning of Foundation Models via Probabilistic Masking

    Authors: Vasileios Tsouvalas, Yuki Asano, Aaqib Saeed

    Abstract: Foundation Models (FMs) have revolutionized machine learning with their adaptability and high performance across tasks; yet, their integration into Federated Learning (FL) is challenging due to substantial communication overhead from their extensive parameterization. Current communication-efficient FL strategies, such as gradient compression, reduce bitrates to around $1$ bit-per-parameter (bpp).… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 19 pages, 9 figures

  11. arXiv:2311.11714  [pdf, other

    cs.CV

    On the Importance of Large Objects in CNN Based Object Detection Algorithms

    Authors: Ahmed Ben Saad, Gabriele Facciolo, Axel Davy

    Abstract: Object detection models, a prominent class of machine learning algorithms, aim to identify and precisely locate objects in images or videos. However, this task might yield uneven performances sometimes caused by the objects sizes and the quality of the images and labels used for training. In this paper, we highlight the importance of large objects in learning features that are critical for all siz… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Jan 2024, WAIKOLOA, HAWAII, United States

  12. arXiv:2310.00525  [pdf, other

    cs.RO cs.AI

    Reinforcement learning adaptive fuzzy controller for lighting systems: application to aircraft cabin

    Authors: Kritika Vashishtha, Anas Saad, Reza Faieghi, Fengfeng Xi

    Abstract: The lighting requirements are subjective and one light setting cannot work for all. However, there is little work on develo** smart lighting algorithms that can adapt to user preferences. To address this gap, this paper uses fuzzy logic and reinforcement learning to develop an adaptive lighting algorithm. In particular, we develop a baseline fuzzy inference system (FIS) using the domain knowledg… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  13. arXiv:2307.09607  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Sequential Monte Carlo Learning for Time Series Structure Discovery

    Authors: Feras A. Saad, Brian J. Patton, Matthew D. Hoffman, Rif A. Saurous, Vikash K. Mansinghka

    Abstract: This paper presents a new approach to automatically discovering accurate models of complex time series data. Working within a Bayesian nonparametric prior over a symbolic space of Gaussian process time series models, we present a novel structure learning algorithm that integrates sequential Monte Carlo (SMC) and involutive MCMC for highly effective posterior inference. Our method can be used both… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 17 pages, 8 figures, 2 tables. Appearing in ICML 2023

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:29473-29489, 2023

  14. arXiv:2305.03058  [pdf, other

    eess.AS cs.LG cs.SD

    Plug-and-Play Multilingual Few-shot Spoken Words Recognition

    Authors: Aaqib Saeed, Vasileios Tsouvalas

    Abstract: As technology advances and digital devices become prevalent, seamless human-machine communication is increasingly gaining significance. The growing adoption of mobile, wearable, and other Internet of Things (IoT) devices has changed how we interact with these smart devices, making accurate spoken words recognition a crucial component for effective interaction. However, building robust spoken words… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Code: https://github.com/FewshotML/plix

  15. arXiv:2303.10988  [pdf, ps, other

    cs.RO

    This Was (Not) Intended: How Intent Communication and Biometrics Can Enhance Social Interactions With Robots

    Authors: Khaled Kassem, Alia Saad

    Abstract: Socially Assistive Robots (SARs) are robots that are designed to replicate the role of a caregiver, coach, or teacher, providing emotional, cognitive, and social cues to support a specific group. SARs are becoming increasingly prevalent, especially in elderly care. Effective communication, both explicit and implicit, is a critical aspect of human-robot interaction involving SARs. Intent communicat… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Report number: SARTMI/2023/8

  16. arXiv:2301.04205  [pdf, other

    cs.LO

    A Performance Verification Methodology for Resource Allocation Heuristics

    Authors: Saksham Goel, Benjamin Mikek, Jehad Aly, Venkat Arun, Ahmed Saeed, Aditya Akella

    Abstract: Performance verification is a nascent but promising tool for understanding the performance and limitations of heuristics under realistic assumptions. Bespoke performance verification tools have already demonstrated their value in settings like congestion control and packet scheduling. In this paper, we aim to emphasize the broad applicability and utility of performance verification. To that end, w… ▽ More

    Submitted 28 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: 12 pages, 11 figures

  17. arXiv:2211.10177  [pdf, other

    cs.CV

    Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information

    Authors: Ahmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy, Adrien Courtois, Gabriele Facciolo

    Abstract: Self-supervised representation learning based on Contrastive Learning (CL) has been the subject of much attention in recent years. This is due to the excellent results obtained on a variety of subsequent tasks (in particular classification), without requiring a large amount of labeled samples. However, most reference CL algorithms (such as SimCLR and MoCo, but also BYOL and Barlow Twins) are not a… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted for WACV 2023

  18. Active Learning of Non-semantic Speech Tasks with Pretrained Models

    Authors: Harlin Lee, Aaqib Saeed, Andrea L. Bertozzi

    Abstract: Pretraining neural networks with massive unlabeled datasets has become popular as it equips the deep models with a better prior to solve downstream tasks. However, this approach generally assumes that the downstream tasks have access to annotated data of sufficient size. In this work, we propose ALOE, a novel system for improving the data- and label-efficiency of non-semantic speech tasks with act… ▽ More

    Submitted 25 February, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted at: ICASSP'23, Code: https://github.com/HarlinLee/ALOE

  19. arXiv:2210.15283  [pdf, other

    cs.SD cs.LG eess.AS

    On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors

    Authors: Zaharah Bukhsh, Aaqib Saeed

    Abstract: Out-of-distribution (OOD) detection is concerned with identifying data points that do not belong to the same distribution as the model's training data. For the safe deployment of predictive models in a real-world environment, it is critical to avoid making confident predictions on OOD inputs as it can lead to potentially dangerous consequences. However, OOD detection largely remains an under-explo… ▽ More

    Submitted 25 February, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted at ICASSP'23. Webpage: https://zaharah.github.io/ood_audio, Code: https://github.com/Zaharah/ood_audio

  20. arXiv:2208.09378  [pdf, other

    cs.LG

    Labeling Chaos to Learning Harmony: Federated Learning with Noisy Labels

    Authors: Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi, Nirvana Meratnia

    Abstract: Federated Learning (FL) is a distributed machine learning paradigm that enables learning models from decentralized private datasets, where the labeling effort is entrusted to the clients. While most existing FL approaches assume high-quality labels are readily available on users' devices; in reality, label noise can naturally occur in FL and is closely related to clients' characteristics. Due to s… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

  21. arXiv:2208.00467  [pdf, other

    cs.CV cs.LG

    COCOA: Cross Modality Contrastive Learning for Sensor Data

    Authors: Shohreh Deldari, Hao Xue, Aaqib Saeed, Daniel V. Smith, Flora D. Salim

    Abstract: Self-Supervised Learning (SSL) is a new paradigm for learning discriminative representations without labelled data and has reached comparable or even state-of-the-art results in comparison to supervised counterparts. Contrastive Learning (CL) is one of the most well-known approaches in SSL that attempts to learn general, informative representations of data. CL methods have been mostly developed fo… ▽ More

    Submitted 3 August, 2022; v1 submitted 31 July, 2022; originally announced August 2022.

    Comments: 27 pages, 10 figures, 6 tables, Accepted with minor revision at IMWUT Vol. 6 No. 3

  22. arXiv:2207.06921  [pdf, other

    eess.SP cs.LG

    Automatic Sleep Scoring from Large-scale Multi-channel Pediatric EEG

    Authors: Harlin Lee, Aaqib Saeed

    Abstract: Sleep is particularly important to the health of infants, children, and adolescents, and sleep scoring is the first step to accurate diagnosis and treatment of potentially life-threatening conditions. But pediatric sleep is severely under-researched compared to adult sleep in the context of machine learning for health, and sleep scoring algorithms developed for adults usually perform poorly on inf… ▽ More

    Submitted 26 October, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: Learning from Time Series for Health. Workshop at NeurIPS 2022

  23. Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource Devices

    Authors: Harlin Lee, Aaqib Saeed

    Abstract: This work introduces BRILLsson, a novel binary neural network-based representation learning model for a broad range of non-semantic speech tasks. We train the model with knowledge distillation from a large and real-valued TRILLsson model with only a fraction of the dataset used to train TRILLsson. The resulting BRILLsson models are only 2MB in size with a latency less than 8ms, making them suitabl… ▽ More

    Submitted 2 December, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Journal ref: Pattern Recognition Letters, vol. 177, pp. 15-19, 2024

  24. arXiv:2206.09029  [pdf, other

    cs.LG

    Binary Early-Exit Network for Adaptive Inference on Low-Resource Devices

    Authors: Aaqib Saeed

    Abstract: Deep neural networks have significantly improved performance on a range of tasks with the increasing demand for computational resources, leaving deployment on low-resource devices (with limited memory and battery power) infeasible. Binary neural networks (BNNs) tackle the issue to an extent with extreme compression and speed-up gains compared to real-valued models. We propose a simple but effectiv… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Interspeech 2022

  25. arXiv:2206.08429  [pdf, other

    cs.CV

    Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

    Authors: Xiang Hao, **gxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid

    Abstract: To help customers make better-informed viewing choices, video-streaming services try to moderate their content and provide more visibility into which portions of their movies and TV episodes contain age-appropriate material (e.g., nudity, sex, violence, or drug-use). Supervised models to localize these sensitive activities require large amounts of clip-level labeled data which is hard to obtain, w… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  26. arXiv:2206.02353  [pdf, other

    cs.LG cs.CV

    Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

    Authors: Shohreh Deldari, Hao Xue, Aaqib Saeed, Jiayuan He, Daniel V. Smith, Flora D. Salim

    Abstract: Recently, Self-Supervised Representation Learning (SSRL) has attracted much attention in the field of computer vision, speech, natural language processing (NLP), and recently, with other types of modalities, including time series from sensors. The popularity of self-supervised learning is driven by the fact that traditional models typically require a huge amount of well-annotated data for training… ▽ More

    Submitted 7 June, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 36 pages, 5 figures, 9 tables, Survey paper

  27. Medical Dataset Classification for Kurdish Short Text over Social Media

    Authors: Ari M. Saeed, Shnya R. Hussein, Chro M. Ali, Tarik A. Rashid

    Abstract: The Facebook application is used as a resource for collecting the comments of this dataset, The dataset consists of 6756 comments to create a Medical Kurdish Dataset (MKD). The samples are comments of users, which are gathered from different posts of pages (Medical, News, Economy, Education, and Sport). Six steps as a preprocessing technique are performed on the raw dataset to clean and remove noi… ▽ More

    Submitted 26 March, 2022; originally announced April 2022.

    Comments: 11 pages

    Journal ref: DIB, 2020

  28. arXiv:2202.12363  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Estimators of Entropy and Information via Inference in Probabilistic Models

    Authors: Feras A. Saad, Marco Cusumano-Towner, Vikash K. Mansinghka

    Abstract: Estimating information-theoretic quantities such as entropy and mutual information is central to many problems in statistics and machine learning, but challenging in high dimensions. This paper presents estimators of entropy via inference (EEVI), which deliver upper and lower bounds on many information quantities for arbitrary variables in a probabilistic generative model. These estimators use imp… ▽ More

    Submitted 12 December, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 18 pages, 8 figures. Appearing in AISTATS 2022

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:5604-5621, 2022

  29. A comprehensive review and evaluation on text predictive and entertainment systems

    Authors: Hozan K. Hamarashid, Soran A. Saeed, Tarik A. Rashid

    Abstract: One of the most important ways to experience communication and interact with the systems is by handling the prediction of the most likely words to happen after ty** letters or words. It is helpful for people with disabilities due to disabling people who could type or enter texts at a limited slow speed. Also, it is beneficial for people with dyslexia and those people who are not well with spells… ▽ More

    Submitted 8 January, 2022; originally announced January 2022.

    Comments: 42 pages

    Journal ref: Soft computing, 2022

  30. arXiv:2112.00725  [pdf, other

    cs.CV

    The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image

    Authors: Yuki M. Asano, Aaqib Saeed

    Abstract: What can neural networks learn about the visual world when provided with only a single image as input? While any image obviously cannot contain the multitudes of all existing objects, scenes and lighting conditions - within the space of all 256^(3x224x224) possible 224-sized square images, it might still provide a strong prior for natural images. To analyze this `augmented image prior' hypothesis,… ▽ More

    Submitted 24 January, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted at ICLR'23. Webpage: https://single-image-distill.github.io/, code: https://github.com/yukimasano/single-img-extrapolating

  31. arXiv:2109.13192  [pdf, other

    cs.LG

    Consistency Training of Multi-exit Architectures for Sensor Data

    Authors: Aaqib Saeed

    Abstract: Deep neural networks have become larger over the years with increasing demand of computational resources for inference; incurring exacerbate costs and leaving little room for deployment on devices with limited battery and other resources for real-time applications. The multi-exit architectures are type of deep neural network that are interleaved with several output (or exit) layers at varying dept… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  32. arXiv:2108.12811  [pdf

    cs.CV eess.IV

    Airplane Type Identification Based on Mask RCNN and Drone Images

    Authors: W. T Alshaibani, Mustafa Helvaci, Ibraheem Shayea, Sawsan A. Saad, Azizul Azizan, Fitri Yakub

    Abstract: For dealing with traffic bottlenecks at airports, aircraft object detection is insufficient. Every airport generally has a variety of planes with various physical and technological requirements as well as diverse service requirements. Detecting the presence of new planes will not address all traffic congestion issues. Identifying the type of airplane, on the other hand, will entirely fix the probl… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: 14 page

  33. arXiv:2108.07208  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Hierarchical Infinite Relational Model

    Authors: Feras A. Saad, Vikash K. Mansinghka

    Abstract: This paper describes the hierarchical infinite relational model (HIRM), a new probabilistic generative model for noisy, sparse, and heterogeneous relational data. Given a set of relations defined over a collection of domains, the model first infers multiple non-overlap** clusters of relations using a top-level Chinese restaurant process. Within each cluster of relations, a Dirichlet process mixt… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 11 pages, 6 figures, 4 tables. Appearing in UAI 2021

    Journal ref: Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence, PMLR 161:1067-1077, 2021

  34. arXiv:2107.06877  [pdf, other

    cs.LG cs.DC cs.SD eess.AS

    Federated Self-Training for Semi-Supervised Audio Recognition

    Authors: Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi

    Abstract: Federated Learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones and virtual assistants, labeling is entrusted to the clients, or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result,… ▽ More

    Submitted 25 February, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

  35. arXiv:2105.11999  [pdf, other

    cs.CY cs.MA cs.NI cs.RO eess.SY

    Throughput-Fairness Tradeoffs in Mobility Platforms

    Authors: Arjun Balasingam, Karthik Gopalakrishnan, Radhika Mittal, Venkat Arun, Ahmed Saeed, Mohammad Alizadeh, Hamsa Balakrishnan, Hari Balakrishnan

    Abstract: This paper studies the problem of allocating tasks from different customers to vehicles in mobility platforms, which are used for applications like food and package delivery, ridesharing, and mobile sensing. A mobility platform should allocate tasks to vehicles and schedule them in order to optimize both throughput and fairness across customers. However, existing approaches to scheduling tasks in… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: Technical report for paper to appear at ACM MobiSys 2021

  36. Evaluating e-Government Services in Kurdistan Institution for Strategic Studies and Scientific Research Using the EGOVSAT Model

    Authors: Bryar A. Hassan, Aram M. Ahmed, Soran A. Saeed, Awin A. Saeed

    Abstract: Office automation is an initiative used to digitally deliver services to citizens, private and public sectors. It is used to digitally collect, store, create, and manipulate office information as a need of accomplishing basic tasks. Azya Office Automation has been implemented as a pilot project in Kurdistan Institution for Strategic Studies and Scientific Research (KISSR) since 2013. The efficienc… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  37. Scouting the Path to a Million-Client Server

    Authors: Yimeng Zhao, Ahmed Saeed, Mostafa Ammar, Ellen Zegura

    Abstract: To keep up with demand, servers will scale up to handle hundreds of thousands of clients simultaneously. Much of the focus of the community has been on scaling servers in terms of aggregate traffic intensity (packets transmitted per second). However, bottlenecks caused by the increasing number of concurrent clients, resulting in a large number of concurrent flows, have received little attention. I… ▽ More

    Submitted 10 June, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: In: Hohlfeld O., Lutu A., Levin D. (eds) Passive and Active Measurement. PAM 2021. Lecture Notes in Computer Science, vol 12671. Springer, Cham

  38. arXiv:2104.00721  [pdf, other

    cs.LG cs.AI

    ProcessTransformer: Predictive Business Process Monitoring with Transformer Network

    Authors: Zaharah A. Bukhsh, Aaqib Saeed, Remco M. Dijkman

    Abstract: Predictive business process monitoring focuses on predicting future characteristics of a running process using event logs. The foresight into process execution promises great potentials for efficient operations, better resource management, and effective customer services. Deep learning-based approaches have been widely adopted in process mining to address the limitations of classical algorithms fo… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  39. arXiv:2102.09099  [pdf

    eess.IV cs.CV cs.LG q-bio.QM

    NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation

    Authors: Mohamed Amgad, Lamees A. Atteya, Hagar Hussein, Kareem Hosny Mohammed, Ehab Hafiz, Maha A. T. Elsebaie, Ahmed M. Alhusseiny, Mohamed Atef AlMoslemany, Abdelmagid M. Elmatboly, Philip A. Pappalardo, Rokia Adel Sakr, Pooya Mobadersany, Ahmad Rachid, Anas M. Saad, Ahmad M. Alkashash, Inas A. Ruhban, Anas Alrefai, Nada M. Elgazar, Ali Abdulkarim, Abo-Alela Farag, Amira Etman, Ahmed G. Elsaeed, Yahya Alagha, Yomna A. Amer, Ahmed M. Raslan , et al. (12 additional authors not shown)

    Abstract: High-resolution map** of cells and tissue structures provides a foundation for develo** interpretable machine-learning models for computational pathology. Deep learning algorithms can provide accurate map**s given large numbers of labeled instances for training and validation. Generating adequate volume of quality labels has emerged as a critical barrier in computational pathology given the… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: GigaScience, 11 (2022)

  40. Damage detection using in-domain and cross-domain transfer learning

    Authors: Zaharah A. Bukhsh, Nils Jansen, Aaqib Saeed

    Abstract: We investigate the capabilities of transfer learning in the area of structural health monitoring. In particular, we are interested in damage detection for concrete structures. Typical image datasets for such problems are relatively small, calling for the transfer of learned representation from a related large-scale dataset. Past efforts of damage detection using images have mainly considered cross… ▽ More

    Submitted 5 October, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 16 pages, 8 figures, 7 tables

    Journal ref: Neural Comput & Applic (2021)

  41. arXiv:2010.13694  [pdf, other

    eess.SP cs.LG

    Learning from Heterogeneous EEG Signals with Differentiable Channel Reordering

    Authors: Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour

    Abstract: We propose CHARM, a method for training a single neural network across inconsistent input channels. Our work is motivated by Electroencephalography (EEG), where data collection protocols from different headsets result in varying channel ordering and number, which limits the feasibility of transferring trained systems across datasets. Our approach builds upon attention mechanisms to estimate a late… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  42. arXiv:2010.13082  [pdf, other

    eess.IV cs.CV

    Context Aware 3D UNet for Brain Tumor Segmentation

    Authors: Parvez Ahmad, Saqib Qamar, Linlin Shen, Adnan Saeed

    Abstract: Deep convolutional neural network (CNN) achieves remarkable performance for medical image analysis. UNet is the primary source in the performance of 3D CNN architectures for medical imaging tasks, including brain tumor segmentation. The skip connection in the UNet architecture concatenates features from both encoder and decoder paths to extract multi-contextual information from image data. The mul… ▽ More

    Submitted 27 November, 2020; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: Accepted for MICCAI 2020 Brain Lesions (BrainLes) Workshop

  43. arXiv:2010.10915  [pdf, other

    cs.SD cs.LG eess.AS

    Contrastive Learning of General-Purpose Audio Representations

    Authors: Aaqib Saeed, David Grangier, Neil Zeghidour

    Abstract: We introduce COLA, a self-supervised pre-training approach for learning a general-purpose representation of audio. Our approach is based on contrastive learning: it learns a representation which assigns high similarity to audio segments extracted from the same recording while assigning lower similarity to segments from different recordings. We build on top of recent advances in contrastive learnin… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  44. arXiv:2010.03485  [pdf, other

    cs.PL cs.LG cs.SC stat.CO stat.ML

    SPPL: Probabilistic Programming with Fast Exact Symbolic Inference

    Authors: Feras A. Saad, Martin C. Rinard, Vikash K. Mansinghka

    Abstract: We present the Sum-Product Probabilistic Language (SPPL), a new probabilistic programming language that automatically delivers exact solutions to a broad range of probabilistic inference queries. SPPL translates probabilistic programs into sum-product expressions, a new symbolic representation and associated semantic domain that extends standard sum-product networks to support mixed-type distribut… ▽ More

    Submitted 11 June, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI '21), June 20-25, 2021, Virtual, Canada. ACM, New York, NY, USA

  45. arXiv:2009.13233  [pdf, other

    cs.LG stat.ML

    Sense and Learn: Self-Supervision for Omnipresent Sensors

    Authors: Aaqib Saeed, Victor Ungureanu, Beat Gfeller

    Abstract: Learning general-purpose representations from multisensor data produced by the omnipresent sensing systems (or IoT in general) has numerous applications in diverse use cases. Existing purely supervised end-to-end deep learning techniques depend on the availability of a massive amount of well-curated data, acquiring which is notoriously difficult but required to achieve a sufficient level of genera… ▽ More

    Submitted 6 September, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  46. arXiv:2008.06971  [pdf

    eess.SP cs.LG

    Physical Action Categorization using Signal Analysis and Machine Learning

    Authors: Asad Mansoor Khan, Ayesha Sadiq, Sajid Gul Khawaja, Norah Saleh Alghamdi, Muhammad Usman Akram, Ali Saeed

    Abstract: Daily life of thousands of individuals around the globe suffers due to physical or mental disability related to limb movement. The quality of life for such individuals can be made better by use of assistive applications and systems. In such scenario, map** of physical actions from movement to a computer aided application can lead the way for solution. Surface Electromyography (sEMG) presents a n… ▽ More

    Submitted 1 February, 2022; v1 submitted 16 August, 2020; originally announced August 2020.

  47. Next word prediction based on the N-gram model for Kurdish Sorani and Kurmanji

    Authors: Hozan K. Hamarashid, Soran A. Saeed, Tarik A. Rashid

    Abstract: Next word prediction is an input technology that simplifies the process of ty** by suggesting the next word to a user to select, as ty** in a conversation consumes time. A few previous studies have focused on the Kurdish language, including the use of next word prediction. However, the lack of a Kurdish text corpus presents a challenge. Moreover, the lack of a sufficient number of N-grams for… ▽ More

    Submitted 27 July, 2020; originally announced August 2020.

    Comments: 37 pages

    Journal ref: Neural Computing and Applications, NCAA-D-19-02773R1, 2020

  48. Federated Self-Supervised Learning of Multi-Sensor Representations for Embedded Intelligence

    Authors: Aaqib Saeed, Flora D. Salim, Tanir Ozcelebi, Johan Lukkien

    Abstract: Smartphones, wearables, and Internet of Things (IoT) devices produce a wealth of data that cannot be accumulated in a centralized repository for learning supervised models due to privacy, bandwidth limitations, and the prohibitive cost of annotations. Federated learning provides a compelling framework for learning models from decentralized data, but conventionally, it assumes the availability of l… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at IEEE Internet of Things Journal

  49. arXiv:2005.03066  [pdf, other

    cs.CL cs.AI cs.LG

    Weakly-Supervised Neural Response Selection from an Ensemble of Task-Specialised Dialogue Agents

    Authors: Asir Saeed, Khai Mai, Pham Minh, Nguyen Tuan Duc, Danushka Bollegala

    Abstract: Dialogue engines that incorporate different types of agents to converse with humans are popular. However, conversations are dynamic in the sense that a selected response will change the conversation on-the-fly, influencing the subsequent utterances in the conversation, which makes the response selection a challenging problem. We model the problem of selecting the best response from a set of re… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

  50. arXiv:2003.03830  [pdf, other

    stat.CO cs.DM cs.DS cs.IT math.PR

    The Fast Loaded Dice Roller: A Near-Optimal Exact Sampler for Discrete Probability Distributions

    Authors: Feras A. Saad, Cameron E. Freer, Martin C. Rinard, Vikash K. Mansinghka

    Abstract: This paper introduces a new algorithm for the fundamental problem of generating a random integer from a discrete probability distribution using a source of independent and unbiased random coin flips. We prove that this algorithm, which we call the Fast Loaded Dice Roller (FLDR), is highly efficient in both space and time: (i) the size of the sampler is guaranteed to be linear in the number of bits… ▽ More

    Submitted 1 June, 2020; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: 12 pages, 5 figures, 1 table. Appearing in AISTATS 2020

    Journal ref: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, PMLR 108:1036-1046, 2020