Skip to main content

Showing 1–50 of 131 results for author: Ibrahim, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00116  [pdf, other

    cs.LG cs.AI

    Generative AI for Synthetic Data Across Multiple Medical Modalities: A Systematic Review of Recent Developments and Challenges

    Authors: Mahmoud Ibrahim, Yasmina Al Khalil, Sina Amirrajab, Chang Sun, Marcel Breeuwer, Josien Pluim, Bart Elen, Gokhan Ertaylan, Michel Dumontier

    Abstract: This paper presents a comprehensive systematic review of generative models (GANs, VAEs, DMs, and LLMs) used to synthesize various medical data types, including imaging (dermoscopic, mammographic, ultrasound, CT, MRI, and X-ray), text, time-series, and tabular data (EHR). Unlike previous narrowly focused reviews, our study encompasses a broad array of medical data modalities and explores various ge… ▽ More

    Submitted 2 July, 2024; v1 submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:2406.10743  [pdf, other

    cs.LG cs.AI

    Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations?

    Authors: Mark Ibrahim, David Klindt, Randall Balestriero

    Abstract: Deep Learning is often depicted as a trio of data-architecture-loss. Yet, recent Self Supervised Learning (SSL) solutions have introduced numerous additional design choices, e.g., a projector network, positive views, or teacher-student networks. These additions pose two challenges. First, they limit the impact of theoretical studies that often fail to incorporate all those intertwined designs. Sec… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.05183  [pdf, other

    cs.LG cs.AI cs.CL

    The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

    Authors: Ouail Kitouni, Niklas Nolte, Diane Bouchacourt, Adina Williams, Mike Rabbat, Mark Ibrahim

    Abstract: Today's best language models still struggle with hallucinations: factually incorrect generations, which impede their ability to reliably retrieve information seen during training. The reversal curse, where models cannot recall information when probed in a different order than was encountered during training, exemplifies this in information retrieval. We reframe the reversal curse as a factorizatio… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  4. arXiv:2406.00360  [pdf, other

    cs.AR

    L2R-CIPU: Efficient CNN Computation with Left-to-Right Composite Inner Product Units

    Authors: Malik Zohaib Nisar, Mohammad Sohail Ibrahim, Muhammad Usman, Jeong-A Lee

    Abstract: This paper proposes a composite inner-product computation unit based on left-to-right (LR) arithmetic for the acceleration of convolution neural networks (CNN) on hardware. The efficacy of the proposed L2R-CIPU method has been shown on the VGG-16 network, and assessment is done on various performance metrics. The L2R-CIPU design achieves 1.06x to 6.22x greater performance, 4.8x to 15x more TOPS/W,… ▽ More

    Submitted 10 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.17247  [pdf, other

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie , et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.13356  [pdf, other

    cs.AI

    Large Language Models (LLMs) Assisted Wireless Network Deployment in Urban Settings

    Authors: Nurullah Sevim, Mostafa Ibrahim, Sabit Ekin

    Abstract: The advent of Large Language Models (LLMs) has revolutionized language understanding and human-like text generation, drawing interest from many other fields with this question in mind: What else are the LLMs capable of? Despite their widespread adoption, ongoing research continues to explore new ways to integrate LLMs into diverse systems. This paper explores new techniques to harness the power… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2405.00740  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Modeling Caption Diversity in Contrastive Vision-Language Pretraining

    Authors: Samuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mahmoud Assran, Andrew Gordon Wilson, Aaron Courville, Nicolas Ballas

    Abstract: There are a thousand ways to caption an image. Contrastive Language Pretraining (CLIP) on the other hand, works by map** an image and its caption to a single vector -- limiting how well CLIP-like models can represent the diverse ways to describe an image. In this work, we introduce Llip, Latent Language Image Pretraining, which models the diversity of captions that could match an image. Llip's v… ▽ More

    Submitted 14 May, 2024; v1 submitted 29 April, 2024; originally announced May 2024.

    Comments: 14 pages, 8 figures, 7 tables, to be published at ICML2024

  8. arXiv:2404.16717  [pdf, other

    cs.CV cs.AI cs.HC

    Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

    Authors: Mazda Moayeri, Michael Rabbat, Mark Ibrahim, Diane Bouchacourt

    Abstract: Vision-language models enable open-world classification of objects without the need for any retraining. While this zero-shot paradigm marks a significant advance, even today's best models exhibit skewed performance when objects are dissimilar from their typical depiction. Real world objects such as pears appear in a variety of forms -- from diced to whole, on a table or in a bowl -- yet standard V… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to FAccT 2024

  9. arXiv:2404.10960  [pdf, other

    cs.CL cs.AI

    Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

    Authors: Christian Tomani, Kamalika Chaudhuri, Ivan Evtimov, Daniel Cremers, Mark Ibrahim

    Abstract: A major barrier towards the practical deployment of large language models (LLMs) is their lack of reliability. Three situations where this is particularly apparent are correctness, hallucinations when given unanswerable questions, and safety. In all three cases, models should ideally abstain from responding, much like humans, whose ability to understand uncertainty makes us refrain from answering… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  10. arXiv:2404.04173  [pdf, other

    cs.AR cs.LG

    H3DFact: Heterogeneous 3D Integrated CIM for Factorization with Holographic Perceptual Representations

    Authors: Zishen Wan, Che-Kai Liu, Mohamed Ibrahim, Hanchen Yang, Samuel Spetalnick, Tushar Krishna, Arijit Raychowdhury

    Abstract: Disentangling attributes of various sensory signals is central to human-like perception and reasoning and a critical task for higher-order cognitive and neuro-symbolic AI systems. An elegant approach to represent this intricate factorization is via high-dimensional holographic vectors drawing on brain-inspired vector symbolic architectures. However, holographic factorization involves iterative com… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 2024 Design Automation and Test in Europe (DATE); The first two authors have equal contributions

  11. arXiv:2403.20297  [pdf, other

    cs.AR cs.DC

    Balanced Data Placement for GEMV Acceleration with Processing-In-Memory

    Authors: Mohamed Assem Ibrahim, Mahzabeen Islam, Shaizeen Aga

    Abstract: With unprecedented demand for generative AI (GenAI) inference, acceleration of primitives that dominate GenAI such as general matrix-vector multiplication (GEMV) is receiving considerable attention. A challenge with GEMVs is the high memory bandwidth this primitive demands. Multiple memory vendors have proposed commercially viable processing-in-memory (PIM) prototypes that attain bandwidth boost o… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  12. arXiv:2403.14124  [pdf, other

    cs.CV

    Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling

    Authors: Yong He, Hongshan Yu, Muhammad Ibrahim, Xiaoyan Liu, Tongjia Chen, Anwaar Ulhaq, Ajmal Mian

    Abstract: Point cloud processing methods leverage local and global point features %at the feature level to cater to downstream tasks, yet they often overlook the task-level context inherent in point clouds during the encoding stage. We argue that integrating task-level information into the encoding stage significantly enhances performance. To that end, we propose SMTransformer which incorporates task-level… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  13. arXiv:2402.07329  [pdf, other

    cs.CV

    The Bias of Harmful Label Associations in Vision-Language Models

    Authors: Caner Hazirbas, Alicia Sun, Yonathan Efroni, Mark Ibrahim

    Abstract: Despite the remarkable performance of foundation vision-language models, the shared representation space for text and vision can also encode harmful label associations detrimental to fairness. While prior work has uncovered bias in vision-language models' (VLMs) classification performance across geography, work has been limited along the important axis of harmful label associations due to a lack o… ▽ More

    Submitted 15 April, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  14. arXiv:2402.01933  [pdf, other

    eess.AS cs.SD

    ToMoBrush: Exploring Dental Health Sensing using a Sonic Toothbrush

    Authors: Kuang Yuan, Mohamed Ibrahim, Yiwen Song, Guoxiang Deng, Suvendra Vijayan, Robert Nerone, Akshay Gadre, Swarun Kumar

    Abstract: Early detection of dental disease is crucial to prevent adverse outcomes. Today, dental X-rays are currently the most accurate gold standard for dental disease detection. Unfortunately, regular X-ray exam is still a privilege for billions of people around the world. In this paper, we ask: "Can we develop a low-cost sensing system that enables dental self-examination in the comfort of one's home?"… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    ACM Class: J.3; C.3; H.5.2

  15. arXiv:2401.14109  [pdf, other

    cs.CL cs.AI cs.LG quant-ph

    CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks

    Authors: Andrei Tomut, Saeed S. Jahromi, Abhijoy Sarkar, Uygar Kurt, Sukhbinder Singh, Faysal Ishtiaq, Cesar Muñoz, Prabdeep Singh Bajaj, Ali Elborady, Gianni del Bimbo, Mehrazin Alizadeh, David Montero, Pablo Martin-Ramiro, Muhammad Ibrahim, Oussama Tahiri Alaoui, John Malcolm, Samuel Mugel, Roman Orus

    Abstract: Large Language Models (LLMs) such as ChatGPT and LlaMA are advancing rapidly in generative Artificial Intelligence (AI), but their immense size poses significant challenges, such as huge training and inference costs, substantial energy demands, and limitations for on-site deployment. Traditional compression methods such as pruning, distillation, and low-rank approximation focus on reducing the eff… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 2 tables, and supplementary information of 2 pages and 1 figure. Revised version with new benchmarks for LlaMA2-7B

  16. arXiv:2401.13472  [pdf, other

    eess.IV cs.CV

    Segmenting Cardiac Muscle Z-disks with Deep Neural Networks

    Authors: Mihaela Croitor Ibrahim, Nishant Ravikumar, Alistair Curd, Joanna Leng, Oliver Umney, Michelle Peckham

    Abstract: Z-disks are complex structures that delineate repeating sarcomeres in striated muscle. They play significant roles in cardiomyocytes such as providing mechanical stability for the contracting sarcomere, cell signalling and autophagy. Changes in Z-disk architecture have been associated with impaired cardiac function. Hence, there is a strong need to create tools to segment Z-disks from microscopy i… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  17. arXiv:2401.10271  [pdf, other

    cs.DB cs.AI

    Querying Triadic Concepts through Partial or Complete Matching of Triples

    Authors: Pedro Henrique B. Ruas, Rokia Missaoui, Mohamed Hamza Ibrahim

    Abstract: In this paper, we introduce a new method for querying triadic concepts through partial or complete matching of triples using an inverted index, to retrieve already computed triadic concepts that contain a set of terms in their extent, intent, and/or modus. As opposed to the approximation approach described in Ananias, this method (i) does not need to keep the initial triadic context or its three d… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  18. arXiv:2401.01764  [pdf, other

    cs.CV cs.LG

    Understanding the Detrimental Class-level Effects of Data Augmentation

    Authors: Polina Kirichenko, Mark Ibrahim, Randall Balestriero, Diane Bouchacourt, Ramakrishna Vedantam, Hamed Firooz, Andrew Gordon Wilson

    Abstract: Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. The… ▽ More

    Submitted 7 December, 2023; originally announced January 2024.

    Comments: Neural Information Processing Systems (NeurIPS), 2023

  19. arXiv:2312.14421  [pdf, other

    cs.AI

    Enhancing Actionable Formal Concept Identification with Base-Equivalent Conceptual-Relevance

    Authors: Ayao Bobi, Rokia Missaoui, Mohamed Hamza Ibrahim

    Abstract: In knowledge discovery applications, the pattern set generated from data can be tremendously large and hard to explore by analysts. In the Formal Concept Analysis (FCA) framework, there have been studies to identify important formal concepts through the stability index and other quality measures. In this paper, we introduce the Base-Equivalent Conceptual Relevance (BECR) score, a novel conceptual… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  20. arXiv:2311.15930  [pdf, other

    cs.CL cs.AI

    WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models

    Authors: Youssef Benchekroun, Megi Dervishi, Mark Ibrahim, Jean-Baptiste Gaya, Xavier Martinet, Grégoire Mialon, Thomas Scialom, Emmanuel Dupoux, Dieuwke Hupkes, Pascal Vincent

    Abstract: We propose WorldSense, a benchmark designed to assess the extent to which LLMs are consistently able to sustain tacit world models, by testing how they draw simple inferences from descriptions of simple arrangements of entities. Worldsense is a synthetic benchmark with three problem types, each with their own trivial control, which explicitly avoids bias by decorrelating the abstract structure of… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  21. arXiv:2311.10025  [pdf, other

    cs.LG cs.AI cs.DC cs.NE

    A Novel Neural Network-Based Federated Learning System for Imbalanced and Non-IID Data

    Authors: Mahfuzur Rahman Chowdhury, Muhammad Ibrahim

    Abstract: With the growth of machine learning techniques, privacy of data of users has become a major concern. Most of the machine learning algorithms rely heavily on large amount of data which may be collected from various sources. Collecting these data yet maintaining privacy policies has become one of the most challenging tasks for the researchers. To combat this issue, researchers have introduced federa… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 48 pages

  22. arXiv:2311.08815  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  23. arXiv:2311.05716  [pdf, other

    cs.AR

    ML-based Real-Time Control at the Edge: An Approach Using hls4ml

    Authors: R. Shi, S. Ogrenci, J. M. Arnold, J. R. Berlioz, P. Hanlet, K. J. Hazelwood, M. A. Ibrahim, H. Liu, V. P. Nagaslaev, A. Narayanan 1, D. J. Nicklaus, J. Mitrevski, G. Pradhan, A. L. Saewert, B. A. Schupbach, K. Seiya, M. Thieme, R. M. Thurman-Keup, N. V. Tran

    Abstract: This study focuses on implementing a real-time control system for a particle accelerator facility that performs high energy physics experiments. A critical operating parameter in this facility is beam loss, which is the fraction of particles deviating from the accelerated proton beam into a cascade of secondary particles. Accelerators employ a large number of sensors to monitor beam loss. The data… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  24. arXiv:2311.05034  [pdf, other

    cs.AR cs.DC

    Just-in-time Quantization with Processing-In-Memory for Efficient ML Training

    Authors: Mohamed Assem Ibrahim, Shaizeen Aga, Ada Li, Suchita Pati, Mahzabeen Islam

    Abstract: Data format innovations have been critical for machine learning (ML) scaling, which in turn fuels ground-breaking ML capabilities. However, even in the presence of low-precision formats, model weights are often stored in both high-precision and low-precision during training. Furthermore, with emerging directional data formats (e.g., MX9, MX6, etc.) multiple low-precision weight copies can be requi… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  25. arXiv:2310.19909  [pdf, other

    cs.CV cs.LG

    Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

    Authors: Micah Goldblum, Hossein Souri, Renkun Ni, Manli Shu, Viraj Prabhu, Gowthami Somepalli, Prithvijit Chattopadhyay, Mark Ibrahim, Adrien Bardes, Judy Hoffman, Rama Chellappa, Andrew Gordon Wilson, Tom Goldstein

    Abstract: Neural network based computer vision systems are typically built on a backbone, a pretrained or randomly initialized feature extractor. Several years ago, the default option was an ImageNet-trained convolutional neural network. However, the recent past has seen the emergence of countless backbones pretrained using various algorithms and datasets. While this abundance of choice has led to performan… ▽ More

    Submitted 19 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  26. arXiv:2310.13269  [pdf, other

    cs.LG cs.AI cs.IR

    An Exploratory Study on Simulated Annealing for Feature Selection in Learning-to-Rank

    Authors: Mohd. Sayemul Haque, Md. Fahim, Muhammad Ibrahim

    Abstract: Learning-to-rank is an applied domain of supervised machine learning. As feature selection has been found to be effective for improving the accuracy of learning models in general, it is intriguing to investigate this process for learning-to-rank domain. In this study, we investigate the use of a popular meta-heuristic approach called simulated annealing for this task. Under the general framework o… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 29 pages

  27. arXiv:2309.16748  [pdf, other

    cs.LG cs.AI stat.ML

    Discovering environments with XRM

    Authors: Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

    Abstract: Successful out-of-distribution generalization requires environment annotations. Unfortunately, these are resource-intensive to obtain, and their relevance to model performance is limited by the expectations and perceptual biases of human annotators. Therefore, to enable robust AI systems across applications, we must develop algorithms to automatically discover environments inducing broad generaliz… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  28. arXiv:2309.15251  [pdf, other

    cs.CV cs.AI

    VPA: Fully Test-Time Visual Prompt Adaptation

    Authors: Jiachen Sun, Mark Ibrahim, Melissa Hall, Ivan Evtimov, Z. Morley Mao, Cristian Canton Ferrer, Caner Hazirbas

    Abstract: Textual prompt tuning has demonstrated significant performance improvements in adapting natural language processing models to a variety of downstream tasks by treating hand-engineered prompts as trainable parameters. Inspired by the success of textual prompting, several studies have investigated the efficacy of visual prompt tuning. In this work, we present Visual Prompt Adaptation (VPA), the firs… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  29. arXiv:2309.07984  [pdf, other

    cs.AR

    Inclusive-PIM: Hardware-Software Co-design for Broad Acceleration on Commercial PIM Architectures

    Authors: Johnathan Alsop, Shaizeen Aga, Mohamed Ibrahim, Mahzabeen Islam, Andrew Mccrabb, Nuwan Jayasena

    Abstract: Continual demand for memory bandwidth has made it worthwhile for memory vendors to reassess processing in memory (PIM), which enables higher bandwidth by placing compute units in/near-memory. As such, memory vendors have recently proposed commercially viable PIM designs. However, these proposals are largely driven by the needs of (a narrow set of) machine learning (ML) primitives. While such propo… ▽ More

    Submitted 17 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  30. arXiv:2309.07610  [pdf, other

    cs.LG cs.AI cs.IR

    Feature Engineering in Learning-to-Rank for Community Question Answering Task

    Authors: Nafis Sajid, Md Rashidul Hasan, Muhammad Ibrahim

    Abstract: Community question answering (CQA) forums are Internet-based platforms where users ask questions about a topic and other expert users try to provide solutions. Many CQA forums such as Quora, Stackoverflow, Yahoo!Answer, StackExchange exist with a lot of user-generated data. These data are leveraged in automated CQA ranking systems where similar questions (and answers) are presented in response to… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 20 pages

  31. arXiv:2309.06019  [pdf, other

    cs.AR cs.AI cs.PF

    DSLOT-NN: Digit-Serial Left-to-Right Neural Network Accelerator

    Authors: Muhammad Sohail Ibrahim, Muhammad Usman, Malik Zohaib Nisar, Jeong-A Lee

    Abstract: We propose a Digit-Serial Left-tO-righT (DSLOT) arithmetic based processing technique called DSLOT-NN with aim to accelerate inference of the convolution operation in the deep neural networks (DNNs). The proposed work has the ability to assess and terminate the ineffective convolutions which results in massive power and energy savings. The processing engine is comprised of low-latency most-signifi… ▽ More

    Submitted 21 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Presented at 2023 26th Euromicro Conference on Digital System Design (DSD)

  32. Information Fusion for Assistance Systems in Production Assessment

    Authors: Fernando Arévalo, Christian Alison M. Piolo, M. Tahasanul Ibrahim, Andreas Schwung

    Abstract: We propose a novel methodology to define assistance systems that rely on information fusion to combine different sources of information while providing an assessment. The main contribution of this paper is providing a general framework for the fusion of n number of information sources using the evidence theory. The fusion provides a more robust prediction and an associated uncertainty that can be… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 21 Pages, 10 Figures

  33. arXiv:2308.15395  [pdf, other

    cs.LG q-bio.MN q-bio.QM

    The CausalBench challenge: A machine learning contest for gene network inference from single-cell perturbation data

    Authors: Mathieu Chevalley, Jacob Sackett-Sanders, Yusuf Roohani, Pascal Notin, Artemy Bakulin, Dariusz Brzezinski, Kaiwen Deng, Yuanfang Guan, Justin Hong, Michael Ibrahim, Wojciech Kotlowski, Marcin Kowiel, Panagiotis Misiakos, Achille Nazaret, Markus Püschel, Chris Wendler, Arash Mehrjou, Patrick Schwab

    Abstract: In drug discovery, map** interactions between genes within cellular systems is a crucial early step. This helps formulate hypotheses regarding molecular mechanisms that could potentially be targeted by future medicines. The CausalBench Challenge was an initiative to invite the machine learning community to advance the state of the art in constructing gene-gene interaction networks. These network… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  34. arXiv:2308.13276  [pdf, ps, other

    cs.SE

    Knowledge-Based Version Incompatibility Detection for Deep Learning

    Authors: Zhongkai Zhao, Bonan Kou, Mohamed Yilmaz Ibrahim, Muhao Chen, Tianyi Zhang

    Abstract: Version incompatibility issues are rampant when reusing or reproducing deep learning models and applications. Existing techniques are limited to library dependency specifications declared in PyPI. Therefore, these techniques cannot detect version issues due to undocumented version constraints or issues involving hardware drivers or OS. To address this challenge, we propose to leverage the abundant… ▽ More

    Submitted 28 August, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 12 pages, FSE 2023

  35. FaceTouch: Detecting hand-to-face touch with supervised contrastive learning to assist in tracing infectious disease

    Authors: Mohamed R. Ibrahim, Terry Lyons

    Abstract: Through our respiratory system, many viruses and diseases frequently spread and pass from one person to another. Covid-19 served as an example of how crucial it is to track down and cut back on contacts to stop its spread. There is a clear gap in finding automatic methods that can detect hand-to-face contact in complex urban scenes or indoors. In this paper, we introduce a computer vision framewor… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Set to be published in the PLoS ONE Journal

  36. arXiv:2308.03977  [pdf, other

    cs.CV cs.LG

    PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

    Authors: Florian Bordes, Shashank Shekhar, Mark Ibrahim, Diane Bouchacourt, Pascal Vincent, Ari S. Morcos

    Abstract: Synthetic image datasets offer unmatched advantages for designing and evaluating deep neural networks: they make it possible to (i) render as many data samples as needed, (ii) precisely control each scene and yield granular ground truth labels (and captions), (iii) precisely control distribution shifts between training and testing to isolate variables of interest for sound experimentation. Despite… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  37. arXiv:2308.03973  [pdf, other

    cs.AR cs.DC

    Collaborative Acceleration for FFT on Commercial Processing-In-Memory Architectures

    Authors: Mohamed Assem Ibrahim, Shaizeen Aga

    Abstract: This paper evaluates the efficacy of recent commercial processing-in-memory (PIM) solutions to accelerate fast Fourier transform (FFT), an important primitive across several domains. Specifically, we observe that efficient implementations of FFT on modern GPUs are memory bandwidth bound. As such, the memory bandwidth boost availed by commercial PIM solutions makes a case for PIM to accelerate FFT.… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  38. arXiv:2307.13136  [pdf, other

    cs.CV cs.LG

    Does Progress On Object Recognition Benchmarks Improve Real-World Generalization?

    Authors: Megan Richards, Polina Kirichenko, Diane Bouchacourt, Mark Ibrahim

    Abstract: For more than a decade, researchers have measured progress in object recognition on ImageNet-based generalization benchmarks such as ImageNet-A, -C, and -R. Recent advances in foundation models, trained on orders of magnitude more data, have begun to saturate these standard benchmarks, but remain brittle in practice. This suggests standard benchmarks, which tend to focus on predefined or synthetic… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  39. arXiv:2307.00741  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    UnLoc: A Universal Localization Method for Autonomous Vehicles using LiDAR, Radar and/or Camera Input

    Authors: Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Ajmal Mian

    Abstract: Localization is a fundamental task in robotics for autonomous navigation. Existing localization methods rely on a single input data modality or train several computational models to process different modalities. This leads to stringent computational requirements and sub-optimal results that fail to capitalize on the complementary information in other data streams. This paper proposes UnLoc, a nove… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: UnLoc: A Universal Localization Method for Autonomous Vehicles using LiDAR, Radar and/or Camera Input has been accepted for publication in the Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  40. arXiv:2305.14362  [pdf, ps, other

    math.NT cs.CR math.CO

    On the Eight Levels theorem and applications towards Lucas-Lehmer primality test for Mersenne primes, I

    Authors: Moustafa Ibrahim

    Abstract: Lucas-Lehmer test is the current standard algorithm used for testing the primality of Mersenne numbers, but it may have limitations in terms of its efficiency and accuracy. Develo** new algorithms or improving upon existing ones could potentially improve the search for Mersenne primes and the understanding of the distribution of Mersenne primes and composites. The development of new versions of… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 33 pages, 4 tables, 3 new versions for the Lucas-Lehmer primality test, 4 new combinatorial identities. arXiv admin note: substantial text overlap with arXiv:2108.13792

    MSC Class: 11Y16; 11Y55; 11Y11; 11A51; 11B37; 11B83; 11B75; 11B39; 11B37 ACM Class: G.2.1; K.2; K.7.3; J.7; G.1.0; G.4

    Journal ref: Arab Journal of Basic and Applied Sciences, 30:1, 267-284

  41. arXiv:2304.12210  [pdf, other

    cs.LG cs.CV

    A Cookbook of Self-Supervised Learning

    Authors: Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Gei**, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann LeCun, Micah Goldblum

    Abstract: Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high barrier to entry. While many components are familiar, successfully training a SSL method involves a dizzying set of choices from the pretext tasks to training hyper-parameters. Our goal is to lower the barrier… ▽ More

    Submitted 28 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

  42. arXiv:2304.05391  [pdf, other

    cs.CV

    Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies

    Authors: Laura Gustafson, Megan Richards, Melissa Hall, Caner Hazirbas, Diane Bouchacourt, Mark Ibrahim

    Abstract: Despite impressive advances in object-recognition, deep learning systems' performance degrades significantly across geographies and lower income levels raising pressing concerns of inequity. Addressing such performance gaps remains a challenge, as little is understood about why performance degrades across incomes or geographies. We take a step in this direction by annotating images from Dollar Str… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  43. arXiv:2304.04314  [pdf, ps, other

    cs.IT eess.SP

    RIS-aided Mixed RF-FSO Wireless Networks: Secrecy Performance Analysis with Simultaneous Eavesdrop**

    Authors: Md. Mijanur Rahman, A. S. M. Badrudduza, Noor Ahmad Sarker, Md. Ibrahim, Imran Shafique Ansari

    Abstract: The appearance of sixth-generation networks has resulted in the proposal of several solutions to tackle signal loss. One of these solutions is the utilization of reconfigurable intelligent surfaces (RIS), which can reflect or refract signals as required. This integration offers significant potential to improve the coverage area from the sender to the receiver. In this paper, we present a comprehen… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: No comments

  44. arXiv:2303.09063  [pdf, other

    cs.CV cs.LG

    Plant Disease Detection using Region-Based Convolutional Neural Network

    Authors: Hasin Rehana, Muhammad Ibrahim, Md. Haider Ali

    Abstract: Agriculture plays an important role in the food and economy of Bangladesh. The rapid growth of population over the years also has increased the demand for food production. One of the major reasons behind low crop production is numerous bacteria, virus and fungal plant diseases. Early detection of plant diseases and proper usage of pesticides and fertilizers are vital for preventing the diseases an… ▽ More

    Submitted 12 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 23 pages

  45. arXiv:2303.00261  [pdf, other

    cs.LG cs.CV

    Speeding Up EfficientNet: Selecting Update Blocks of Convolutional Neural Networks using Genetic Algorithm in Transfer Learning

    Authors: Md. Mehedi Hasana, Muhammad Ibrahim, Md. Sawkat Ali

    Abstract: The performance of convolutional neural networks (CNN) depends heavily on their architectures. Transfer learning performance of a CNN relies quite strongly on selection of its trainable layers. Selecting the most effective update layers for a certain target dataset often requires expert knowledge on CNN architecture which many practitioners do not posses. General users prefer to use an available a… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 9 pages

  46. arXiv:2302.10876  [pdf, ps, other

    cs.IT

    Effects of Co-channel Interference on RIS Empowered Wireless Networks amid Multiple Eavesdrop** Attempts

    Authors: Md. Roisul Ajom Ruku, Md. Ibrahim, A. S. M. Badrudduza, Imran Shafique Ansari

    Abstract: This letter is concerned with the secrecy performance of reconfigurable intelligent surfaces (RIS)-aided wireless networks in the existence of multiple interferers towards the destination. To be more precise, we analyze three critical issues in the design of secure RIS-assisted networks: 1) How do interferers affect the performance of secure wireless networks? 2) Which of the two groups of eavesdr… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: No

  47. arXiv:2302.10257  [pdf, other

    cs.IT eess.SP

    Secrecy Outage Analysis of Energy Harvesting Relay-based Mixed UOWC-RF Network with Multiple Eavesdroppers

    Authors: Moloy Kumar Ghosh, Milton Kumar Kundu, Md Ibrahim, A. S. M. Badrudduza, Md. Shamim Anower, Imran Shafique Ansari, Ali A. Shaikhi, Mohammed A. Mohandes

    Abstract: This work deals with the physical layer security performance of a dual-hop underwater optical communication (UOWC)-radio frequency (RF) network under the intruding attempts of multiple eavesdroppers via RF links. The intermediate decode and forward relay node between the underwater source and the destination transforms the optical signal into electrical form and re-transmits it to the destination… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: No

  48. arXiv:2302.02191  [pdf, ps, other

    cs.IT cs.LG eess.SP

    Unsupervised Learning for Pilot-free Transmission in 3GPP MIMO Systems

    Authors: Omar M. Sleem, Mohamed Salah Ibrahim, Akshay Malhotra, Mihaela Beluri, Philip Pietraski

    Abstract: Reference signals overhead reduction has recently evolved as an effective solution for improving the system spectral efficiency. This paper introduces a new downlink data structure that is free from demodulation reference signals (DM-RS), and hence does not require any channel estimation at the receiver. The new proposed data transmission structure involves a simple repetition step of part of the… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

  49. arXiv:2301.08957  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

    Authors: Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal Mian

    Abstract: Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pre-text task that reorganizes the slices of a $360^\circ$ LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systemat… ▽ More

    Submitted 13 August, 2023; v1 submitted 21 January, 2023; originally announced January 2023.

    Comments: Accepted in IEEE International Conference on Robotics and Automation (ICRA), 2023

  50. arXiv:2212.10567  [pdf, other

    q-bio.QM cs.LG eess.SP

    Anticancer Peptides Classification using Kernel Sparse Representation Classifier

    Authors: Ehtisham Fazal, Muhammad Sohail Ibrahim, Seongyong Park, Imran Naseem, Abdul Wahab

    Abstract: Cancer is one of the most challenging diseases because of its complexity, variability, and diversity of causes. It has been one of the major research topics over the past decades, yet it is still poorly understood. To this end, multifaceted therapeutic frameworks are indispensable. \emph{Anticancer peptides} (ACPs) are the most promising treatment option, but their large-scale identification and s… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.