Showing 1–13 of 13 results for author: Diallo, B

Searching in archive cs.
  1. arXiv:2406.08884  [pdf, other]

    cs.CV cs.LG stat.ML

    The Penalized Inverse Probability Measure for Conformal Classification

    Authors: Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre da Costa

    Abstract: The deployment of safe and trustworthy machine learning systems, and particularly complex black box neural networks, in real-world applications requires reliable and certified guarantees on their performance. The conformal prediction framework offers such formal guarantees by transforming any point into a set predictor with valid, finite-set, guarantees on the coverage of the true at a chosen leve…

    Submitted 13 June, 2024; originally announced June 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Jun 2024, Seattle, United States

  2. arXiv:2405.02374  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Protein binding affinity prediction under multiple substitutions applying eGNNs on Residue and Atomic graphs combined with Language model information: eGRAL

    Authors: Arturo Fiorellini-Bernardis, Sebastien Boyer, Christoph Brunken, Bakary Diallo, Karim Beguir, Nicolas Lopez-Carranza, Oliver Bent

    Abstract: Protein-protein interactions (PPIs) play a crucial role in numerous biological processes. Developing methods that predict binding affinity changes under substitution mutations is fundamental for modelling and re-engineering biological systems. Deep learning is increasingly recognized as a powerful tool capable of bridging the gap between in-silico predictions and in-vitro observations. With this c…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2404.02580  [pdf, other]

    cs.CV cs.AI

    Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation

    Authors: Bart M. van Marrewijk, Charbel Dandjinou, Dan Jeric Arcega Rustia, Nicolas Franco Gonzalez, Boubacar Diallo, Jérôme Dias, Paul Melki, Pieter M. Blok

    Abstract: Optimizing deep learning models requires large amounts of annotated images, a process that is both time-intensive and costly. Especially for semantic segmentation models in which every pixel must be annotated. A potential strategy to mitigate annotation effort is active learning. Active learning facilitates the identification and selection of the most informative images from a large unlabelled poo…

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2308.15094  [pdf, other]

    cs.CV cs.LG stat.AP stat.ML

    Group-Conditional Conformal Prediction via Quantile Regression Calibration for Crop and Weed Classification

    Authors: Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre da Costa

    Abstract: As deep learning predictive models become an integral part of a large spectrum of precision agricultural systems, a barrier to the adoption of such automated solutions is the lack of user trust in these highly complex, opaque and uncertain models. Indeed, deep neural networks are not equipped with any explicit guarantees that can be used to certify the system's performance, especially in highly va…

    Submitted 29 August, 2023; originally announced August 2023.

    Journal ref: 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), IEEE/CVF, Oct 2023, Paris, France

  5. arXiv:2306.00114  [pdf, other]

    cs.CV cs.AI cs.LG

    The Canadian Cropland Dataset: A New Land Cover Dataset for Multitemporal Deep Learning Classification in Agriculture

    Authors: Amanda A. Boatswain Jacques, Abdoulaye Baniré Diallo, Etienne Lord

    Abstract: Monitoring land cover using remote sensing is vital for studying environmental changes and ensuring global food security through crop yield forecasting. Specifically, multitemporal remote sensing imagery provides relevant information about the dynamics of a scene, which has proven to lead to better land cover classification results. Nevertheless, few studies have benefited from high spatial and te…

    Submitted 4 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: 24 pages, 5 figures, dataset descriptor

  6. Prior Density Learning in Variational Bayesian Phylogenetic Parameters Inference

    Authors: Amine M. Remita, Golrokh Vitae, Abdoulaye Baniré Diallo

    Abstract: The advances in variational inference are providing promising paths in Bayesian estimation problems. These advances make variational phylogenetic inference an alternative approach to Markov Chain Monte Carlo methods for approximating the phylogenetic posterior. However, one of the main drawbacks of such approaches is modelling the prior through fixed distributions, which could bias the posterior a…

    Submitted 8 September, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted as a full paper for publication at RECOMB-CG 2023 (LNBI proof version). 15 pages (excluding references), 6 tables and 1 figure

    Journal ref: In Jahn, K., Vinař, T. (eds) Comparative Genomics. RECOMB-CG 2023. Lecture Notes in Computer Science, vol 13883. Springer, Cham

  7. EvoVGM: a Deep Variational Generative Model for Evolutionary Parameter Estimation

    Authors: Amine M. Remita, Abdoulaye Baniré Diallo

    Abstract: Most evolutionary-oriented deep generative models do not explicitly consider the underlying evolutionary dynamics of biological sequences as it is performed within the Bayesian phylogenetic inference framework. In this study, we propose a method for a deep variational Bayesian generative model (EvoVGM) that jointly approximates the true posterior of local evolutionary parameters and generates sequ…

    Submitted 30 June, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted as a full paper for publication in ACM-BCB 2022 (Camera-ready version)

    Journal ref: In 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB '22), August 7-10, 2022, Northbrook, IL, USA. ACM, New York, NY, USA, 10 pages

  8. Active learning with MaskAL reduces annotation effort for training Mask R-CNN

    Authors: Pieter M. Blok, Gert Kootstra, Hakim Elchaoui Elghor, Boubacar Diallo, Frits K. van Evert, Eldert J. van Henten

    Abstract: The generalisation performance of a convolutional neural network (CNN) is influenced by the quantity, quality, and variety of the training images. Training images must be annotated, and this is time consuming and expensive. The goal of our work was to reduce the number of annotated images needed to train a CNN while maintaining its performance. We hypothesised that the performance of a CNN can be…

    Submitted 26 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: 30 pages, 10 figures, 3 tables

    Journal ref: Computers and Electronics in Agriculture, 197 (2022)

  9. arXiv:2001.03260  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Supporting supervised learning in fungal Biosynthetic Gene Cluster discovery: new benchmark datasets

    Authors: Hayda Almeida, Adrian Tsang, Abdoulaye Baniré Diallo

    Abstract: Fungal Biosynthetic Gene Clusters (BGCs) of secondary metabolites are clusters of genes capable of producing natural products, compounds that play an important role in the production of a wide variety of bioactive compounds, including antibiotics and pharmaceuticals. Identifying BGCs can lead to the discovery of novel natural products to benefit human health. Previous work has been focused on deve…

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: Accepted to Machine Learning and Artificial Intelligence in Bioinformatics and Medical Informatics (MABM2019) at IEEE BIBM 2019

  10. arXiv:1910.05421  [pdf, other]

    cs.LG q-bio.GN stat.ML

    Statistical Linear Models in Virus Genomic Alignment-free Classification: Application to Hepatitis C Viruses

    Authors: Amine M. Remita, Abdoulaye Baniré Diallo

    Abstract: Viral sequence classification is an important task in pathogen detection, epidemiological surveys and evolutionary studies. Statistical learning methods are widely used to classify and identify viral sequences in samples from environments. These methods face several challenges associated with the nature and properties of viral genomes such as recombination, mutation rate and diversity. Also, new g…

    Submitted 28 May, 2024; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted as a regular paper for publication in IEEE BIBM 2019 [v3: Fix indices in Markov classifier]

    Journal ref: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), San Diego, CA, USA, 2019, pp. 474-481

  11. arXiv:1604.00045  [pdf, other]

    q-bio.BM cs.OH

    PGR: A Graph Repository of Protein 3D-Structures

    Authors: Wajdi Dhifli, Abdoulaye Baniré Diallo

    Abstract: Graph theory and graph mining constitute rich fields of computational techniques to study the structures, topologies and properties of graphs. These techniques constitute a good asset in bioinformatics if there exist efficient methods for transforming biological data into graphs. In this paper, we present Protein Graph Repository (PGR), a novel database of protein 3D-structures transformed into gr…

    Submitted 24 January, 2016; originally announced April 2016.

  12. ProtNN: Fast and Accurate Nearest Neighbor Protein Function Prediction based on Graph Embedding in Structural and Topological Space

    Authors: Wajdi Dhifli, Abdoulaye Baniré Diallo

    Abstract: Studying the function of proteins is important for understanding the molecular mechanisms of life. The number of publicly available protein structures has increasingly become extremely large. Still, the determination of the function of a protein structure remains a difficult, costly, and time consuming task. The difficulties are often due to the essential role of spatial and topological structures…

    Submitted 24 January, 2016; v1 submitted 2 November, 2015; originally announced November 2015.

    Journal ref: BMC BioData Mining, 9:30, 2016

  13. arXiv:1511.00725  [pdf, ps, other]

    cs.LG cs.AI cs.DB cs.IR

    Toward an Efficient Multi-class Classification in an Open Universe

    Authors: Wajdi Dhifli, Abdoulaye Baniré Diallo

    Abstract: Classification is a fundamental task in machine learning and data mining. Existing classification methods are designed to classify unknown instances within a set of previously known training classes. Such a classification takes the form of a prediction within a closed-set of classes. However, a more realistic scenario that fits real-world applications is to consider the possibility of encountering…

    Submitted 1 March, 2018; v1 submitted 2 November, 2015; originally announced November 2015.