Showing 1–46 of 46 results for author: Minhas, F

  1. arXiv:2405.02040  [pdf]

    cs.CL

    Large Multimodal Model based Standardisation of Pathology Reports with Confidence and their Prognostic Significance

    Authors: Ethar Alzaid, Gabriele Pergola, Harriet Evans, David Snead, Fayyaz Minhas

    Abstract: Pathology reports are rich in clinical and pathological details but are often presented in free-text format. The unstructured nature of these reports presents a significant challenge limiting the accessibility of their content. In this work, we present a practical approach based on the use of large multimodal models (LMMs) for automatically extracting information from scanned images of pathology r…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 19 pages, 6 figures

  2. arXiv:2404.01626  [pdf, other]

    cs.CL cs.IR

    Entity Disambiguation via Fusion Entity Decoding

    Authors: Junxiong Wang, Ali Mousavi, Omar Attia, Ronak Pradeep, Saloni Potdar, Alexander M. Rush, Umar Farooq Minhas, Yunyao Li

    Abstract: Entity disambiguation (ED), which links the mentions of ambiguous entities to their referent entities in a knowledge base, serves as a core component in entity linking (EL). Existing generative approaches demonstrate improved accuracy compared to classification approaches under the standardized ZELDA benchmark. Nevertheless, generative approaches suffer from the need for large-scale pre-training a…

    Submitted 7 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL'24 main

  3. arXiv:2402.09990  [pdf, other]

    eess.IV cs.CV cs.HC cs.LG

    TIAViz: A Browser-based Visualization Tool for Computational Pathology Models

    Authors: Mark Eastwood, John Pocock, Mostafa Jahanifar, Adam Shephard, Skiros Habib, Ethar Alzaid, Abdullah Alsalemi, Jan Lukas Robertus, Nasir Rajpoot, Shan Raza, Fayyaz Minhas

    Abstract: Digital pathology has gained significant traction in modern healthcare systems. This shift from optical microscopes to digital imagery brings with it the potential for improved diagnosis, efficiency, and the integration of AI tools into the pathologist's workflow. A critical aspect of this is visualization. Throughout the development of a machine learning (ML) model in digital pathology, it is cruc…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Application note to be submitted to bioinformatics

  4. arXiv:2401.02879  [pdf, other]

    quant-ph cs.LG

    Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational Training

    Authors: M. Emre Sahin, Benjamin C. B. Symons, Pushpak Pati, Fayyaz Minhas, Declan Millar, Maria Gabrani, Jan Lukas Robertus, Stefano Mensa

    Abstract: Quantum machine learning with quantum kernels for classification problems is a growing area of research. Recently, quantum kernel alignment techniques that parameterise the kernel have been developed, allowing the kernel to be trained and therefore aligned with a specific dataset. While quantum kernel alignment is a promising technique, it has been hampered by considerable training costs because t…

    Submitted 5 January, 2024; originally announced January 2024.

  5. arXiv:2311.15781  [pdf, other]

    cs.AI cs.CL cs.LG

    Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs

    Authors: Simone Conia, Min Li, Daniel Lee, Umar Farooq Minhas, Ihab Ilyas, Yunyao Li

    Abstract: Recent work in Natural Language Processing and Computer Vision has been using textual information -- e.g., entity names and descriptions -- available in knowledge graphs to ground neural models to high-quality structured data. However, when it comes to non-English languages, the quantity and quality of textual information are comparatively scarce. To address this issue, we introduce the novel task…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Camera ready for EMNLP 2023

  6. arXiv:2310.19656  [pdf, other]

    eess.IV cs.CV cs.LG

    Domain Generalization in Computational Pathology: Survey and Guidelines

    Authors: Mostafa Jahanifar, Manahil Raza, Kesi Xu, Trinh Vuong, Rob Jewsbury, Adam Shephard, Neda Zamanitajeddin, Jin Tae Kwak, Shan E Ahmed Raza, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Deep learning models have exhibited exceptional effectiveness in Computational Pathology (CPath) by tackling intricate tasks across an array of histology image analysis applications. Nevertheless, the presence of out-of-distribution data (stemming from a multitude of sources such as disparate imaging devices and diverse tissue preparation methods) can cause domain shift (DS). DS decreases t…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Extended Version

  7. arXiv:2307.03757  [pdf]

    q-bio.QM cs.CV eess.IV

    A Fully Automated and Explainable Algorithm for the Prediction of Malignant Transformation in Oral Epithelial Dysplasia

    Authors: Adam J Shephard, Raja Muhammad Saad Bashir, Hanya Mahmood, Mostafa Jahanifar, Fayyaz Minhas, Shan E Ahmed Raza, Kris D McCombe, Stephanie G Craig, Jacqueline James, Jill Brooks, Paul Nankivell, Hisham Mehanna, Syed Ali Khurram, Nasir M Rajpoot

    Abstract: Oral epithelial dysplasia (OED) is a premalignant histopathological diagnosis given to lesions of the oral cavity. Its grading suffers from significant inter-/intra-observer variability, and does not reliably predict malignancy progression, potentially leading to suboptimal treatment decisions. To address this, we developed a novel artificial intelligence algorithm that can assign an Oral Maligna…

    Submitted 6 July, 2023; originally announced July 2023.

  8. Growing and Serving Large Open-domain Knowledge Graphs

    Authors: Ihab F. Ilyas, JP Lacerda, Yunyao Li, Umar Farooq Minhas, Ali Mousavi, Jeffrey Pound, Theodoros Rekatsinas, Chiraag Sumanth

    Abstract: Applications of large open-domain knowledge graphs (KGs) to real-world problems pose many unique challenges. In this paper, we present extensions to Saga, our platform for continuous construction and serving of knowledge at scale. In particular, we describe a pipeline for training knowledge graph embeddings that powers key capabilities such as fact ranking, fact verification, a related entities ser…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: To be published in SIGMOD 2023

  9. arXiv:2305.05006  [pdf, other]

    eess.IV cs.CV

    Synthesis of Annotated Colorectal Cancer Tissue Images from Gland Layout

    Authors: Srijay Deshpande, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Generating realistic tissue images with annotations is a challenging task that is important in many computational histopathology applications. Synthetically generated images and annotations are valuable for training and evaluating algorithms in this domain. To address this, we propose an interactive framework generating pairs of realistic colorectal cancer histology images with corresponding gland…

    Submitted 4 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

  10. arXiv:2304.01926  [pdf]

    cs.DB cs.AI cs.LG

    High-Throughput Vector Similarity Search in Knowledge Graphs

    Authors: Jason Mohoney, Anil Pacaci, Shihabur Rahman Chowdhury, Ali Mousavi, Ihab F. Ilyas, Umar Farooq Minhas, Jeffrey Pound, Theodoros Rekatsinas

    Abstract: There is an increasing adoption of machine learning for encoding data into vectors to serve online recommendation and search use cases. As a result, recent data management systems propose augmenting query processing with online vector similarity search. In this work, we explore vector similarity search in the context of Knowledge Graphs (KGs). Motivated by the tasks of finding related KG queries a…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 13 pages, 7 figures, to be published in ACM SIGMOD 2023

  11. arXiv:2303.06274  [pdf]

    cs.CV cs.LG

    CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay, et al. (64 additional authors not shown)

    Abstract: Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we set up a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of repro…

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  12. arXiv:2302.12653  [pdf, other]

    cs.CV cs.LG

    MesoGraph: Automatic Profiling of Malignant Mesothelioma Subtypes from Histological Images

    Authors: Mark Eastwood, Heba Sailem, Silviu Tudor, Xiaohong Gao, Judith Offman, Emmanouil Karteris, Angeles Montero Fernandez, Danny Jonigk, William Cookson, Miriam Moffatt, Sanjay Popat, Fayyaz Minhas, Jan Lukas Robertus

    Abstract: Malignant mesothelioma is classified into three histological subtypes, Epithelioid, Sarcomatoid, and Biphasic according to the relative proportions of epithelioid and sarcomatoid tumor cells present. Biphasic tumors display significant populations of both cell types. This subtyping is subjective and limited by current diagnostic guidelines and can differ even between expert thoracic pathologists w…

    Submitted 23 February, 2023; originally announced February 2023.

  13. arXiv:2301.09624  [pdf, other]

    eess.IV cs.CV cs.LG

    Maximum Mean Discrepancy Kernels for Predictive and Prognostic Modeling of Whole Slide Images

    Authors: Piotr Keller, Muhammad Dawood, Fayyaz ul Amir Afsar Minhas

    Abstract: How similar are two images? In computational pathology, where Whole Slide Images (WSIs) of digitally scanned tissue samples from patients can be multi-gigapixels in size, determination of degree of similarity between two WSIs is a challenging task with a number of practical applications. In this work, we explore a novel strategy based on kernelized Maximum Mean Discrepancy (MMD) analysis for deter…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: * Joint first authorship. Accepted at IEEE ISBI 2023 (International Symposium on Biomedical Imaging)

  14. Nuclear Segmentation and Classification: On Color & Compression Generalization

    Authors: Quoc Dang Vu, Robert Jewsbury, Simon Graham, Mostafa Jahanifar, Shan E Ahmed Raza, Fayyaz Minhas, Abhir Bhalerao, Nasir Rajpoot

    Abstract: Since the introduction of digital and computational pathology as a field, one of the major problems in the clinical application of algorithms has been the struggle to generalize well to examples outside the distribution of the training data. Existing work to address this in both pathology and natural images has focused almost exclusively on classification tasks. We explore and evaluate the robustn…

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Oral presentation at MICCAI MLMI 2022, 7 pages, 6 figures

  15. arXiv:2212.13780  [pdf, other]

    eess.IV cs.CV cs.LG

    SynCLay: Interactive Synthesis of Histology Images from Bespoke Cellular Layouts

    Authors: Srijay Deshpande, Muhammad Dawood, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Automated synthesis of histology images has several potential applications in computational pathology. However, no existing method can generate realistic tissue images with a bespoke cellular layout or user-defined histology parameters. In this work, we propose a novel framework called SynCLay (Synthesis from Cellular Layouts) that can construct realistic and high-quality histology images from use…

    Submitted 28 December, 2022; originally announced December 2022.

  16. arXiv:2208.12587  [pdf, other]

    cs.CV

    Mitosis Detection, Fast and Slow: Robust and Efficient Detection of Mitotic Figures

    Authors: Mostafa Jahanifar, Adam Shephard, Neda Zamanitajeddin, Simon Graham, Shan E Ahmed Raza, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Counting of mitotic figures is a fundamental step in grading and prognostication of several cancers. However, manual mitosis counting is tedious and time-consuming. In addition, variation in the appearance of mitotic figures causes a high degree of discordance among pathologists. With advances in deep learning models, several automatic mitosis detection algorithms have been proposed but they are s…

    Submitted 25 September, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Extended version of the work done for MIDOG challenge submission

  17. arXiv:2203.00077  [pdf, other]

    eess.IV cs.CV

    One Model is All You Need: Multi-Task Learning Enables Simultaneous Histology Image Segmentation and Classification

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Shan E Ahmed Raza, Fayyaz Minhas, David Snead, Nasir Rajpoot

    Abstract: The recent surge in performance for image analysis of digitised pathology slides can largely be attributed to the advances in deep learning. Deep models can be used to initially localise various structures in the tissue and hence facilitate the extraction of interpretable features for biomarker discovery. However, these models are typically trained for a single task and therefore scale poorly as w…

    Submitted 14 November, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

  18. arXiv:2202.00001  [pdf]

    q-bio.QM cs.LG

    Insights into performance evaluation of compound-protein interaction prediction methods

    Authors: Adiba Yaseen, Imran Amin, Naeem Akhter, Asa Ben-Hur, Fayyaz Minhas

    Abstract: Motivation: Machine learning based prediction of compound-protein interactions (CPIs) is important for drug design, screening and repurposing studies and can improve the efficiency and cost-effectiveness of wet lab assays. Despite the publication of many research papers reporting CPI predictors in recent years, we have observed a number of fundamental issues in experiment design that lead to o…

    Submitted 28 January, 2022; originally announced February 2022.

    Comments: Supplementary information: Supplementary data files are available as part of the GitHub repository

  19. arXiv:2201.12311  [pdf]

    cs.LG cs.CV

    REET: Robustness Evaluation and Enhancement Toolbox for Computational Pathology

    Authors: Alex Foote, Amina Asif, Nasir Rajpoot, Fayyaz Minhas

    Abstract: Motivation: Digitization of pathology laboratories through digital slide scanners and advances in deep learning approaches for objective histological assessment have resulted in rapid progress in the field of computational pathology (CPath) with wide-ranging applications in medical and pharmaceutical research as well as clinical workflows. However, the estimation of robustness of CPath models to v…

    Submitted 28 January, 2022; originally announced January 2022.

  20. arXiv:2112.09496  [pdf]

    eess.IV cs.CV cs.LG

    Towards Launching AI Algorithms for Cellular Pathology into Clinical & Pharmaceutical Orbits

    Authors: Amina Asif, Kashif Rajpoot, David Snead, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Computational Pathology (CPath) is an emerging field concerned with the study of tissue pathology via computational algorithms for the processing and analysis of digitized high-resolution images of tissue slides. Recent deep learning based developments in CPath have successfully leveraged the sheer volume of raw pixel data in histology images for predicting target parameters in the domains of diagnost…

    Submitted 17 December, 2021; originally announced December 2021.

  21. arXiv:2111.14905  [pdf, other]

    cs.DB cs.LG

    Bounding the Last Mile: Efficient Learned String Indexing

    Authors: Benjamin Spector, Andreas Kipf, Kapil Vaidya, Chi Wang, Umar Farooq Minhas, Tim Kraska

    Abstract: We introduce the RadixStringSpline (RSS) learned index structure for efficiently indexing strings. RSS is a tree of radix splines each indexing a fixed number of bytes. RSS approaches or exceeds the performance of traditional string indexes while using 7-70× less memory. RSS achieves this by using the minimal string prefix to sufficiently distinguish the data, unlike most learned approaches…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 3rd International Workshop on Applied AI for Database Systems and Applications (AIDB'21), August 20, 2021, Copenhagen, Denmark

  22. arXiv:2111.14485  [pdf, other]

    cs.CV

    CoNIC: Colon Nuclei Identification and Counting Challenge 2022

    Authors: Simon Graham, Mostafa Jahanifar, Quoc Dang Vu, Giorgos Hadjigeorghiou, Thomas Leech, David Snead, Shan E Ahmed Raza, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Nuclear segmentation, classification and quantification within Haematoxylin & Eosin stained histology images enable the extraction of interpretable cell-based features that can be used in downstream explainable models in computational pathology (CPath). However, automatic recognition of different nuclei is faced with a major challenge in that there are several different types of nuclei, some of t…

    Submitted 29 November, 2021; originally announced November 2021.

  23. arXiv:2110.06042  [pdf, other]

    cs.CV cs.LG

    SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer

    Authors: Wenqi Lu, Michael Toss, Emad Rakha, Nasir Rajpoot, Fayyaz Minhas

    Abstract: Human epidermal growth factor receptor 2 (HER2) is an important prognostic and predictive factor which is overexpressed in 15-20% of breast cancer (BCa). The determination of its status is a key clinical decision-making step for selection of treatment regimen and prognostication. HER2 status is evaluated using transcriptomics or immunohistochemistry (IHC) through in situ hybridisation (ISH) which req…

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 20 pages, 11 figures, 3 tables

  24. arXiv:2109.00853  [pdf, ps, other]

    cs.CV

    Stain-Robust Mitotic Figure Detection for the Mitosis Domain Generalization Challenge

    Authors: Mostafa Jahanifar, Adam Shephard, Neda Zamani Tajeddin, R. M. Saad Bashir, Mohsin Bilal, Syed Ali Khurram, Fayyaz Minhas, Nasir Rajpoot

    Abstract: The detection of mitotic figures from different scanners/sites remains an important topic of research, owing to its potential in assisting clinicians with tumour grading. The MItosis DOmain Generalization (MIDOG) challenge aims to test the robustness of detection models on unseen data from multiple scanners for this task. We present a short summary of the approach employed by the TIA Centre team t…

    Submitted 29 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: MIDOG challenge at MICCAI 2021

  25. arXiv:2108.11195  [pdf, other]

    cs.CV cs.LG

    Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation and Classification

    Authors: Simon Graham, Mostafa Jahanifar, Ayesha Azam, Mohammed Nimir, Yee-Wah Tsang, Katherine Dodd, Emily Hero, Harvir Sahota, Atisha Tank, Ksenija Benes, Noorul Wahab, Fayyaz Minhas, Shan E Ahmed Raza, Hesham El Daly, Kishore Gopalakrishnan, David Snead, Nasir Rajpoot

    Abstract: The development of deep segmentation models for computational pathology (CPath) can help foster the investigation of interpretable morphological biomarkers. Yet, there is a major bottleneck in the success of such approaches because supervised deep learning models require an abundance of accurately labelled data. This issue is exacerbated in the field of CPath because the generation of detailed ann…

    Submitted 29 November, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

  26. arXiv:2108.10446  [pdf]

    eess.IV cs.AI cs.CV q-bio.QM

    All You Need is Color: Image based Spatial Gene Expression Prediction using Neural Stain Learning

    Authors: Muhammad Dawood, Kim Branson, Nasir M. Rajpoot, Fayyaz ul Amir Afsar Minhas

    Abstract: "Is it possible to predict expression levels of different genes at a given spatial location in the routine histology image of a tumor section by modeling its stain absorption characteristics?" In this work, we propose a "stain-aware" machine learning approach for prediction of spatial transcriptomic gene expression profiles using digital pathology image of a routine Hematoxylin & Eosin (H&E) histo…

    Submitted 26 August, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: 14 pages, 4 figures, 1 table

  27. arXiv:2108.10365  [pdf]

    cs.LG stat.AP

    L1-regularized neural ranking for risk stratification and its application to prediction of time to distant metastasis in luminal node negative chemotherapy naïve breast cancer patients

    Authors: Fayyaz Minhas, Michael S. Toss, Noor ul Wahab, Emad Rakha, Nasir M. Rajpoot

    Abstract: Can we predict if an early stage cancer patient is at high risk of developing distant metastasis and what clinicopathological factors are associated with such a risk? In this paper, we propose a ranking based censoring-aware machine learning model for answering such questions. The proposed model is able to generate an interpretable formula for risk stratification using a minimal number of clinico…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted in proc. (Machine Learning for Pharma and Healthcare Applications PharML 2021) European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 13-17, 2021

  28. arXiv:2108.08306  [pdf, other]

    q-bio.QM eess.IV

    ALBRT: Cellular Composition Prediction in Routine Histology Images

    Authors: Muhammad Dawood, Kim Branson, Nasir M. Rajpoot, Fayyaz ul Amir Afsar Minhas

    Abstract: Cellular composition prediction, i.e., predicting the presence and counts of different types of cells in the tumor microenvironment from a digitized image of a Hematoxylin and Eosin (H&E) stained tissue section can be used for various tasks in computational pathology such as the analysis of cellular topology and interactions, subtype prediction, survival analysis, etc. In this work, we propose an…

    Submitted 26 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 11 pages, 5 figures

  29. arXiv:2106.13689  [pdf]

    eess.IV cs.CV cs.LG

    Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations

    Authors: Noorul Wahab, Islam M Miligy, Katherine Dodd, Harvir Sahota, Michael Toss, Wenqi Lu, Mostafa Jahanifar, Mohsin Bilal, Simon Graham, Young Park, Giorgos Hadjigeorghiou, Abhir Bhalerao, Ayat Lashen, Asmaa Ibrahim, Ayaka Katayama, Henry O Ebili, Matthew Parkin, Tom Sorell, Shan E Ahmed Raza, Emily Hero, Hesham Eldaly, Yee Wah Tsang, Kishore Gopalakrishnan, David Snead, Emad Rakha, et al. (2 additional authors not shown)

    Abstract: Recent advances in whole slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence (AI) based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilize information embedded in pathology WSIs beyond what we obtain through visual assessment. For automated analysis of WSIs and…

    Submitted 25 June, 2021; originally announced June 2021.

  30. arXiv:2106.08153  [pdf]

    eess.IV cs.LG

    Now You See It, Now You Don't: Adversarial Vulnerabilities in Computational Pathology

    Authors: Alex Foote, Amina Asif, Ayesha Azam, Tim Marshall-Cox, Nasir Rajpoot, Fayyaz Minhas

    Abstract: Deep learning models are routinely employed in computational pathology (CPath) for solving problems of diagnostic and prognostic significance. Typically, the generalization performance of CPath models is analyzed using evaluation protocols such as cross-validation and testing on multi-centric cohorts. However, to ensure that such CPath solutions are robust and safe for use in a clinical setting, a…

    Submitted 16 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages

  31. APEX: A High-Performance Learned Index on Persistent Memory

    Authors: Baotong Lu, Jialin Ding, Eric Lo, Umar Farooq Minhas, Tianzheng Wang

    Abstract: The recently released persistent memory (PM) offers high performance, persistence, and is cheaper than DRAM. This opens up new possibilities for indexes that operate and persist data directly on the memory bus. Recent learned indexes exploit data distribution and have shown great potential for some workloads. However, none support persistence or instant recovery, and existing PM-based indexes typi…

    Submitted 6 December, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: To appear at VLDB 2022 (PVLDB Vol. 15 Issue 3)

  32. arXiv:2011.11381  [pdf, other]

    cs.AI cs.MA physics.soc-ph

    Elementary Effects Analysis of factors controlling COVID-19 infections in computational simulation reveals the importance of Social Distancing and Mask Usage

    Authors: Kelvin K. F. Li, Stephen A. Jarvis, Fayyaz Minhas

    Abstract: COVID-19 was declared a pandemic by the World Health Organization (WHO) on March 11th, 2020. With half of the world's countries in lockdown as of April due to this pandemic, monitoring and understanding the spread of the virus and infection rates and how these factors relate to behavioural and societal parameters is crucial for effective policy making. This paper aims to investigate the effectiven…

    Submitted 27 February, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

    Comments: 14 pages, 10 figures

    Journal ref: Computers in Biology and Medicine 134 (2021) 104369

  33. arXiv:2008.04526  [pdf, other]

    eess.IV cs.CV

    SAFRON: Stitching Across the Frontier for Generating Colorectal Cancer Histology Images

    Authors: Srijay Deshpande, Fayyaz Minhas, Simon Graham, Nasir Rajpoot

    Abstract: Synthetic images can be used for the development and evaluation of deep learning algorithms in the context of limited availability of data. In the field of computational pathology, where histology images are large in size and visual context is crucial, synthesis of large high resolution images via generative modeling is a challenging task. This is due to memory and computational constraints hinder…

    Submitted 26 March, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

  34. arXiv:2004.10898  [pdf, other]

    cs.DB cs.DS cs.LG

    Qd-tree: Learning Data Layouts for Big Data Analytics

    Authors: Zongheng Yang, Badrish Chandramouli, Chi Wang, Johannes Gehrke, Yinan Li, Umar Farooq Minhas, Per-Åke Larson, Donald Kossmann, Rajeev Acharya

    Abstract: Corporations today collect data at an unprecedented and accelerating scale, making the need to run queries on large datasets increasingly important. Technologies such as columnar block-based data organization and compression have become standard practice in most commercial database systems. However, the problem of best assigning records to data blocks on storage is still open. For example, today's…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: ACM SIGMOD 2020

  35. arXiv:1912.12187  [pdf]

    cs.LG cs.NE stat.ML

    Learning Neural Activations

    Authors: Fayyaz ul Amir Afsar Minhas, Amina Asif

    Abstract: An artificial neuron is modelled as a weighted summation followed by an activation function which determines its output. A wide variety of activation functions such as rectified linear units (ReLU), leaky-ReLU, Swish, MISH, etc. have been explored in the literature. In this short paper, we explore what happens when the activation function of each neuron in an artificial neural network is learned n…

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: 10 pages

  36. arXiv:1912.01978  [pdf, other]

    cs.LG stat.ML

    FANNet: Formal Analysis of Noise Tolerance, Training Bias and Input Sensitivity in Neural Networks

    Authors: Mahum Naseer, Mishal Fatima Minhas, Faiq Khalid, Muhammad Abdullah Hanif, Osman Hasan, Muhammad Shafique

    Abstract: With a constant improvement in the network architectures and training methodologies, Neural Networks (NNs) are increasingly being deployed in real-world Machine Learning systems. However, despite their impressive performance on "known inputs", these NNs can fail absurdly on the "unseen inputs", especially if these real-time inputs deviate from the training dataset distributions, or contain certain…

    Submitted 14 May, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: To appear at the 23rd Design, Automation and Test in Europe (DATE 2020). Grenoble, France

  37. arXiv:1911.06106  [pdf]

    q-bio.BM cs.LG stat.ML

    AMP0: Species-Specific Prediction of Anti-microbial Peptides using Zero and Few Shot Learning

    Authors: Sadaf Gull, Fayyaz Minhas

    Abstract: The evolution of drug-resistant microbial species is one of the major challenges to global health. The development of new antimicrobial treatments such as antimicrobial peptides needs to be accelerated to combat this threat. However, the discovery of novel antimicrobial peptides is hampered by low-throughput biochemical assays. Computational techniques can be used for rapid screening of promising…

    Submitted 28 October, 2019; originally announced November 2019.

    Comments: Under journal submission, 2019

  38. arXiv:1911.00896  [pdf]

    cs.LG stat.ML

    Generalized Learning with Rejection for Classification and Regression Problems

    Authors: Amina Asif, Fayyaz ul Amir Afsar Minhas

    Abstract: Learning with rejection (LWR) allows development of machine learning systems with the ability to discard low confidence decisions generated by a prediction model. That is, just like human experts, LWR allows machine models to abstain from generating a prediction when reliability of the prediction is expected to be low. Several frameworks for this learning with rejection have been proposed in the l…

    Submitted 3 November, 2019; originally announced November 2019.

  39. arXiv:1905.08898  [pdf, other]

    cs.DB cs.DS cs.LG

    ALEX: An Updatable Adaptive Learned Index

    Authors: Jialin Ding, Umar Farooq Minhas, Jia Yu, Chi Wang, Jaeyoung Do, Yinan Li, Hantian Zhang, Badrish Chandramouli, Johannes Gehrke, Donald Kossmann, David Lomet, Tim Kraska

    Abstract: Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be learned. The original work by Kraska et al. shows that a learned index beats a B+Tree by a factor of up to three in search time and by an order of magnitude in memory…

    Submitted 20 May, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    Report number: MSR-TR-2020-12

  40. An embarrassingly simple approach to neural multiple instance classification

    Authors: Amina Asif, Fayyaz ul Amir Afsar Minhas

    Abstract: Multiple Instance Learning (MIL) is a weak supervision learning paradigm that allows modeling of machine learning problems in which labels are available only for groups of examples called bags. A positive bag may contain one or more positive examples but it is not known which examples in the bag are positive. All examples in a negative bag belong to the negative class. Such problems arise frequent…

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: 7 pages

    Journal ref: Pattern Recognition Letters, vol. 128, pp. 474-479, Dec. 1, 2019

  41. arXiv:1901.01686  [pdf]

    cs.LG stat.ML

    Ten ways to fool the masses with machine learning

    Authors: Fayyaz Minhas, Amina Asif, Asa Ben-Hur

    Abstract: If you want to tell people the truth, make them laugh, otherwise they'll kill you. (source unclear) Machine learning and deep learning are the technologies of the day for developing intelligent automatic systems. However, a key hurdle for progress in the field is the literature itself: we often encounter papers that report results that are difficult to reconstruct or reproduce, results that mis-…

    Submitted 7 January, 2019; originally announced January 2019.

    Comments: 11 pages, 8 figures

  42. arXiv:1811.06885  [pdf]

    cs.LG stat.ML

    A Generalized Meta-loss function for regression and classification using privileged information

    Authors: Amina Asif, Muhammad Dawood, Fayyaz ul Amir Afsar Minhas

    Abstract: Learning using privileged information (LUPI) is a powerful heterogenous feature space machine learning framework that allows a machine learning model to learn from highly informative or privileged features which are available during training only to generate test predictions using input space features which are available both during training and testing. LUPI can significantly improve prediction p…

    Submitted 25 March, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

  43. arXiv:1811.04463  [pdf]

    cs.LG cs.CV eess.IV stat.ML

    Machine Learning with Abstention for Automated Liver Disease Diagnosis

    Authors: Kanza Hamid, Amina Asif, Wajid Abbasi, Durre Sabih, Fayyaz Minhas

    Abstract: This paper presents a novel approach for detection of liver abnormalities in an automated manner using ultrasound images. For this purpose, we have implemented a machine learning model that can not only generate labels (normal and abnormal) for a given ultrasound image but it can also detect when its prediction is likely to be incorrect. The proposed model abstains from generating the label of a t…

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: Preprint version before submission for publication. complete version published in proc. 15th International Conference on Frontiers of Information Technology (FIT 2017), December 18-20, 2017, Islamabad, Pakistan. http://ieeexplore.ieee.org/document/8261064/

    Journal ref: 15th IEEE International Conference on Frontiers of Information Technology (FIT 2017), December 18-20, 2017, Islamabad, Pakistan

  44. ISLAND: In-Silico Prediction of Proteins Binding Affinity Using Sequence Descriptors

    Authors: Wajid Arshad Abbasi, Fahad Ul Hassan, Adiba Yaseen, Fayyaz Ul Amir Afsar Minhas

    Abstract: Determination of binding affinity of proteins in the formation of protein complexes requires sophisticated, expensive and time-consuming experimentation which can be replaced with computational methods. Most computational prediction techniques require protein structures which limit their applicability to protein complexes with known structures. In this work, we explore sequence based protein bindi…

    Submitted 22 March, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Keywords: Protein sequence analysis, Protein-protein interaction, Support vector machines, Web services, Binding affinity

    Journal ref: BioData Mining, 2020 13:20

  45. arXiv:1711.07886  [pdf]

    cs.LG q-bio.QM stat.ML

    Training large margin host-pathogen protein-protein interaction predictors

    Authors: Abdul Hannan Basit, Wajid Arshad Abbasi, Amina Asif, Fayyaz Ul Amir Afsar Minhas

    Abstract: Detection of protein-protein interactions (PPIs) plays a vital role in molecular biology. Particularly, infections are caused by the interactions of host and pathogen proteins. It is important to identify host-pathogen interactions (HPIs) to discover new drugs to counter infectious diseases. Conventional wet lab PPI prediction techniques have limitations in terms of large scale application and bud…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 12 pages

    Report number: Vol. 16, No. 04 1850014

    Journal ref: Journal of Bioinformatics and Computational Biology 2018

  46. arXiv:1711.04913  [pdf]

    cs.LG stat.ML

    pyLEMMINGS: Large Margin Multiple Instance Classification and Ranking for Bioinformatics Applications

    Authors: Amina Asif, Wajid Arshad Abbasi, Farzeen Munir, Asa Ben-Hur, Fayyaz ul Amir Afsar Minhas

    Abstract: Motivation: A major challenge in the development of machine learning based methods in computational biology is that data may not be accurately labeled due to the time and resources required for experimentally annotating properties of proteins and DNA sequences. Standard supervised learning algorithms assume accurate instance-level labeling of training data. Multiple instance learning is a paradigm…

    Submitted 13 November, 2017; originally announced November 2017.