Skip to main content

Showing 1–21 of 21 results for author: Shehata, S

.
  1. arXiv:2406.11073  [pdf, other

    cs.CL

    Exploring the Limitations of Detecting Machine-Generated Text

    Authors: Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, Zeerak Talat

    Abstract: Recent improvements in the quality of the generations by large language models have spurred research into identifying machine-generated text. Systems proposed for the task often achieve high performance. However, humans and machines can produce text in different styles and in different domains, and it remains unclear whether machine generated-text detection models favour particular styles or domai… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.09933  [pdf, other

    cs.SD cs.AI cs.HC cs.LG

    What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark

    Authors: Adham Ibrahim, Shady Shehata, A**kya Kulkarni, Mukhtar Mohamed, Muhammad Abdul-Mageed

    Abstract: Speech emotion recognition (SER) is essential for enhancing human-computer interaction in speech-based applications. Despite improvements in specific emotional datasets, there is still a research gap in SER's capability to generalize across real-world situations. In this paper, we investigate approaches to generalize the SER system across different emotion datasets. In particular, we incorporate 1… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: ACCEPTED AT INTERSPEECH 2024, GREECE

  3. arXiv:2405.19525  [pdf, other

    cs.CV

    Lifelong Learning Using a Dynamically Growing Tree of Sub-networks for Domain Generalization in Video Object Segmentation

    Authors: Islam Osman, Mohamed S. Shehata

    Abstract: Current state-of-the-art video object segmentation models have achieved great success using supervised learning with massive labeled training datasets. However, these models are trained using a single source domain and evaluated using videos sampled from the same source domain. When these models are evaluated using videos sampled from a different target domain, their performance degrades significa… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.10444  [pdf, other

    cs.CV

    A Novel Bounding Box Regression Method for Single Object Tracking

    Authors: Omar Abdelaziz, Mohamed Sami Shehata

    Abstract: Locating an object in a sequence of frames, given its appearance in the first frame of the sequence, is a hard problem that involves many stages. Usually, state-of-the-art methods focus on bringing novel ideas in the visual encoding or relational modelling phases. However, in this work, we show that bounding box regression from learned joint search and template features is of high importance as we… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  5. arXiv:2404.18473  [pdf

    math.RA

    On Rings of MAL'CEV-NEUMANN Series

    Authors: Mohammad. H. Fahmy, Refaat. M. Salem, Shaimaa. Sh. Shehata

    Abstract: In this paper, we investigate the conditions for the Mal'cev-Neumann series ring Λ = R((G;σ;τ)) to be left fusible and an SA-ring. Also, we show that: if G is a quasitotally ordered group and U a Σ-compatible semiprime ideal of R, then R((G;σ;τ)) is a Σ(U((G; σ; τ)))-zip ring if and only if R is a Σ(U )-zip ring.

    Submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2402.12840  [pdf, other

    cs.CL

    ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

    Authors: Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin

    Abstract: The focus of language model evaluation has transitioned towards reasoning and knowledge-intensive tasks, driven by advancements in pretraining large models. While state-of-the-art models are partially trained on large Arabic texts, evaluating their performance in Arabic remains challenging due to the limited availability of relevant datasets. To bridge this gap, we present ArabicMMLU, the first mu… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  7. arXiv:2401.13802  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Investigating the Efficacy of Large Language Models for Code Clone Detection

    Authors: Mohamad Khajezade, Jie JW Wu, Fatemeh Hendijani Fard, Gema Rodríguez-Pérez, Mohamed Sami Shehata

    Abstract: Large Language Models (LLMs) have demonstrated remarkable success in various natural language processing and software engineering tasks, such as code generation. The LLMs are mainly utilized in the prompt-based zero/few-shot paradigm to guide the model in accomplishing the task. GPT-based models are one of the popular ones studied for tasks such as code comment generation or test generation. These… ▽ More

    Submitted 30 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  8. arXiv:2309.11627  [pdf, other

    cs.CV

    GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis

    Authors: Youssef Abdelkareem, Shady Shehata, Fakhri Karray

    Abstract: Novel view synthesis (NVS) of multi-human scenes imposes challenges due to the complex inter-human occlusions. Layered representations handle the complexities by dividing the scene into multi-layered radiance fields, however, they are mainly constrained to per-scene optimization making them inefficient. Generalizable human view synthesis methods combine the pre-fitted 3D human meshes with image fe… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted to GCPR 2023

  9. arXiv:2309.09510  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

    Authors: Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee

    Abstract: Text language models have shown remarkable zero-shot capability in generalizing to unseen tasks when provided with well-formulated instructions. However, existing studies in speech processing primarily focus on limited or specific tasks. Moreover, the lack of standardized benchmarks hinders a fair comparison across different approaches. Thus, we present Dynamic-SUPERB, a benchmark designed for bui… ▽ More

    Submitted 22 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: To appear in the proceedings of ICASSP 2024

  10. arXiv:2306.04368  [pdf, other

    cs.SD cs.CL eess.AS

    Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation

    Authors: Massa Baali, Ibrahim Almakky, Shady Shehata, Fakhri Karray

    Abstract: Despite major advancements in Automatic Speech Recognition (ASR), the state-of-the-art ASR systems struggle to deal with impaired speech even with high-resource languages. In Arabic, this challenge gets amplified, with added complexities in collecting data from dysarthric speakers. In this paper, we aim to improve the performance of Arabic dysarthric automatic speech recognition through a multi-st… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to Interspeech 2023

  11. arXiv:2305.14534  [pdf, other

    cs.CL cs.AI

    Detecting Propaganda Techniques in Code-Switched Social Media Text

    Authors: Muhammad Umar Salman, Asif Hanif, Shady Shehata, Preslav Nakov

    Abstract: Propaganda is a form of communication intended to influence the opinions and the mindset of the public to promote a particular agenda. With the rise of social media, propaganda has spread rapidly, leading to the need for automatic propaganda detection systems. Most work on propaganda detection has focused on high-resource languages, such as English, and little effort has been made to detect propag… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  12. arXiv:2303.01736  [pdf

    cs.CV

    Multi-Plane Neural Radiance Fields for Novel View Synthesis

    Authors: Youssef Abdelkareem, Shady Shehata, Fakhri Karray

    Abstract: Novel view synthesis is a long-standing problem that revolves around rendering frames of scenes from novel camera viewpoints. Volumetric approaches provide a solution for modeling occlusions through the explicit 3D representation of the camera frustum. Multi-plane Images (MPI) are volumetric methods that represent the scene using front-parallel planes at distinct depths but suffer from depth discr… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: ICDIPV 2023

  13. arXiv:2204.07501  [pdf, other

    cs.SE

    Evaluating few shot and Contrastive learning Methods for Code Clone Detection

    Authors: Mohamad Khajezade, Fatemeh Hendijani Fard, Mohamed S. Shehata

    Abstract: Context: Code Clone Detection (CCD) is a software engineering task that is used for plagiarism detection, code search, and code comprehension. Recently, deep learning-based models have achieved an F1 score (a metric used to assess classifiers) of $\sim$95\% on the CodeXGLUE benchmark. These models require many training data, mainly fine-tuned on Java or C++ datasets. However, no previous study eva… ▽ More

    Submitted 9 November, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

  14. Automated Human Cell Classification in Sparse Datasets using Few-Shot Learning

    Authors: Reece Walsh, Mohamed H. Abdelpakey, Mohamed S. Shehata, Mostafa M. Mohamed

    Abstract: Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets c… ▽ More

    Submitted 11 March, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: 12 pages, 4 figures

    Journal ref: Scientific Reports 12.1 (2022): 1-11

  15. arXiv:2107.03924  [pdf

    cs.CY cs.AI cs.HC cs.MA cs.NI

    Smart Healthcare in the Age of AI: Recent Advances, Challenges, and Future Prospects

    Authors: Mahmoud Nasr, MD. Milon Islam, Shady Shehata, Fakhri Karray, Yuri Quintana

    Abstract: The significant increase in the number of individuals with chronic ailments (including the elderly and disabled) has dictated an urgent need for an innovative model for healthcare systems. The evolved model will be more personalized and less reliant on traditional brick-and-mortar healthcare institutions such as hospitals, nursing homes, and long-term healthcare centers. The smart healthcare syste… ▽ More

    Submitted 24 June, 2021; originally announced July 2021.

  16. arXiv:2005.08669  [pdf

    cs.CY

    Translating the Concept of Goal Setting into Practice -- What 'Else' does it Require than a Goal Setting Tool?

    Authors: Gábor Kismihók, Catherine Zhao, Michaéla C. Schippers, Stefan T. Mol, Scott Harrison, Shady Shehata

    Abstract: This conceptual paper reviews the current status of goal setting in the area of technology enhanced learning and education. Besides a brief literature review, three current projects on goal setting are discussed. The paper shows that the main barriers for goal setting applications in education are not related to the technology, the available data or analytical methods, but rather the human factor.… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: This paper has been accepted to be published in the proceedings of CSEDU 2020 by SciTePress

  17. arXiv:2004.12058  [pdf, other

    cs.LG stat.ML

    NullSpaceNet: Nullspace Convoluional Neural Network with Differentiable Loss Function

    Authors: Mohamed H. Abdelpakey, Mohamed S. Shehata

    Abstract: We propose NullSpaceNet, a novel network that maps from the pixel level input to a joint-nullspace (as opposed to the traditional feature space), where the newly learned joint-nullspace features have clearer interpretation and are more separable. NullSpaceNet ensures that all inputs from the same class are collapsed into one point in this new joint-nullspace, and the different classes are collapse… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: 17 pages

  18. arXiv:1908.07905  [pdf, other

    cs.CV

    DomainSiam: Domain-Aware Siamese Network for Visual Object Tracking

    Authors: Mohamed H. Abdelpakey, Mohamed S. Shehata

    Abstract: Visual object tracking is a fundamental task in the field of computer vision. Recently, Siamese trackers have achieved state-of-the-art performance on recent benchmarks. However, Siamese trackers do not fully utilize semantic and objectness information from pre-trained networks that have been trained on the image classification task. Furthermore, the pre-trained Siamese architecture is sparsely ac… ▽ More

    Submitted 21 August, 2019; originally announced August 2019.

    Comments: 13 pages

    Journal ref: 14th International Symposium on Visual Computing (ISVC2019)

  19. arXiv:1809.02714  [pdf, other

    cs.CV

    DensSiam: End-to-End Densely-Siamese Network with Self-Attention Model for Object Tracking

    Authors: Mohamed H. Abdelpakey, Mohamed S. Shehata, Mostafa M. Mohamed

    Abstract: Convolutional Siamese neural networks have been recently used to track objects using deep features. Siamese architecture can achieve real time speed, however it is still difficult to find a Siamese architecture that maintains the generalization capability, high accuracy and speed while decreasing the number of shared parameters especially when it is very deep. Furthermore, a conventional Siamese a… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 11 pages, 3 figures, Accepted by ISVC18

  20. arXiv:1602.03012  [pdf, other

    cs.CV

    EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

    Authors: Andru P. Twinanda, Sherif Shehata, Didier Mutter, Jacques Marescaux, Michel de Mathelin, Nicolas Padoy

    Abstract: Surgical workflow recognition has numerous potential medical applications, such as the automatic indexing of surgical video databases and the optimization of real-time operating room scheduling, among others. As a result, phase recognition has been studied in the context of several kinds of surgeries, such as cataract, neurological, and laparoscopic surgeries. In the literature, two types of featu… ▽ More

    Submitted 23 May, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: Video: https://www.youtube.com/watch?v=6v0NWrFOUUM

  21. arXiv:0710.0518  [pdf

    cond-mat.mtrl-sci

    Self-directed growth of AlGaAs core-shell nanowires for visible light applications

    Authors: C. Chen, S. Shehata, C. Fradin, R. LaPierre, C. Couteau, G. Weihs

    Abstract: Al(0.37)Ga(0.63)As nanowires (NWs) were grown in a molecular beam epitaxy system on GaAs(111)B substrates. Micro-photoluminescence measurements and energy dispersive X-ray spectroscopy indicated a core-shell structure and Al composition gradient along the NW axis, producing a potential minimum for carrier confinement. The core-shell structure formed during the growth as a consequence of the diff… ▽ More

    Submitted 2 October, 2007; originally announced October 2007.

    Comments: 20 pages, 7 figures

    Journal ref: Nano Lett.; (Letter); 2007; 7(9); 2584-2589