Skip to main content

Showing 1–20 of 20 results for author: Abdelmoniem, A M

.
  1. arXiv:2407.05268  [pdf, other

    cs.LG cs.AI cs.CV

    Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients

    Authors: Shaoyuan Chen, Linlin You, Rui Liu, Shuo Yu, Ahmed M. Abdelmoniem

    Abstract: The training of large models, involving fine-tuning, faces the scarcity of high-quality data. Compared to the solutions based on centralized data centers, updating large models in the Internet of Things (IoT) faces challenges in coordinating knowledge from distributed clients by using their private and heterogeneous data. To tackle such a challenge, we propose KOALA (Federated Knowledge Transfer F… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2404.03048  [pdf, other

    cs.CY cs.CL

    Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

    Authors: Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

    Abstract: The recent development of decentralised and interoperable social networks (such as the "fediverse") creates new challenges for content moderators. This is because millions of posts generated on one server can easily "spread" to another, even if the recipient server has very different moderation policies. An obvious solution would be to leverage moderation tools to automatically tag (and filter) po… ▽ More

    Submitted 16 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted at International AAAI Conference on Web and Social Media (ICWSM) 2024. Please cite accordingly!

  3. arXiv:2402.05558  [pdf, other

    cs.LG cs.AI cs.CV cs.DC

    Flashback: Understanding and Mitigating Forgetting in Federated Learning

    Authors: Mohammed Aljahdali, Ahmed M. Abdelmoniem, Marco Canini, Samuel Horváth

    Abstract: In Federated Learning (FL), forgetting, or the loss of knowledge across rounds, hampers algorithm convergence, particularly in the presence of severe data heterogeneity among clients. This study explores the nuances of this issue, emphasizing the critical role of forgetting in FL's inefficient learning within heterogeneous data contexts. Knowledge loss occurs in both client-local updates and serve… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2401.04850  [pdf, other

    cs.NI

    FairQ: Fair and Fast Rate Allocation in Data Centers

    Authors: Ahmed M. Abdelmoniem, Brahim Bensaou

    Abstract: The peculiar congestion patterns in data centers are caused by the bursty and composite nature of traffic, the small bandwidth-delay product, and the tiny switch buffers. It is not practical to modify TCP to adapt to data centers, especially in public clouds where multiple congestion control protocols coexist. In this work, we design a switch-based method to address such congestion issues; our app… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.00339

  5. arXiv:2312.15375  [pdf, other

    cs.LG cs.CR cs.DC

    An Empirical Study of Efficiency and Privacy of Federated Learning Algorithms

    Authors: Sofia Zahri, Hajar Bennouri, Ahmed M. Abdelmoniem

    Abstract: In today's world, the rapid expansion of IoT networks and the proliferation of smart devices in our daily lives, have resulted in the generation of substantial amounts of heterogeneous data. These data forms a stream which requires special handling. To handle this data effectively, advanced data processing technologies are necessary to guarantee the preservation of both privacy and efficiency. Fed… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  6. A Meta-learning based Stacked Regression Approach for Customer Lifetime Value Prediction

    Authors: Karan Gadgil, Sukhpal Singh Gill, Ahmed M. Abdelmoniem

    Abstract: Companies across the globe are keen on targeting potential high-value customers in an attempt to expand revenue and this could be achieved only by understanding the customers more. Customer Lifetime Value (CLV) is the total monetary value of transactions/purchases made by a customer with the business over an intended period of time and is used as means to estimate future customer interactions. CLV… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 11 pages, 7 figures

    Journal ref: Elsevier Journal of Economy and Technology 2024

  7. arXiv:2308.04419  [pdf

    cs.AI cs.LG

    Stock Market Price Prediction: A Hybrid LSTM and Sequential Self-Attention based Approach

    Authors: Karan Pardeshi, Sukhpal Singh Gill, Ahmed M. Abdelmoniem

    Abstract: One of the most enticing research areas is the stock market, and projecting stock prices may help investors profit by making the best decisions at the correct time. Deep learning strategies have emerged as a critical technique in the field of the financial market. The stock market is impacted due to two aspects, one is the geo-political, social and global events on the bases of which the price tre… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 13 pages, 11 figures

  8. arXiv:2307.13602  [pdf

    physics.soc-ph cs.DC

    Fortaleza: The emergence of a network hub

    Authors: Eric Bragion, Habiba Akter, Mohit Kumar, Minxian Xu, Ahmed M. Abdelmoniem, Sukhpal Singh Gill

    Abstract: Digitalisation, accelerated by the pandemic, has brought the opportunity for companies to expand their businesses beyond their geographic location and has considerably affected networks around the world. Cloud services have a better acceptance nowadays, and it is foreseen that this industry will grow exponentially in the following years. With more distributed networks that need to support customer… ▽ More

    Submitted 28 June, 2023; originally announced July 2023.

    Journal ref: Published in Internet of Things and Cyber-Physical Systems, Volume 3, 2023, Pages 272-279

  9. arXiv:2306.10848  [pdf, other

    cs.LG cs.DC

    Leveraging The Edge-to-Cloud Continuum for Scalable Machine Learning on Decentralized Data

    Authors: Ahmed M. Abdelmoniem

    Abstract: With mobile, IoT and sensor devices becoming pervasive in our life and recent advances in Edge Computational Intelligence (e.g., Edge AI/ML), it became evident that the traditional methods for training AI/ML models are becoming obsolete, especially with the growing concerns over privacy and security. This work tries to highlight the key challenges that prohibit Edge AI/ML from seeing wide-range ad… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  10. AI-based Fog and Edge Computing: A Systematic Review, Taxonomy and Future Directions

    Authors: Sundas Iftikhar, Sukhpal Singh Gill, Chenghao Song, Minxian Xu, Mohammad Sadegh Aslanpour, Adel N. Toosi, Junhui Du, Huaming Wu, Shreya Ghosh, Deepraj Chowdhury, Muhammed Golec, Mohit Kumar, Ahmed M. Abdelmoniem, Felix Cuadrado, Blesson Varghese, Omer Rana, Schahram Dustdar, Steve Uhlig

    Abstract: Resource management in computing is a very challenging problem that involves making sequential decisions. Resource limitations, resource heterogeneity, dynamic and diverse nature of workload, and the unpredictability of fog/edge computing environments have made resource management even more challenging to be considered in the fog landscape. Recently Artificial Intelligence (AI) and Machine Learnin… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 49 page, 15 figures, 10 tables

    Journal ref: Preprint for Publication in Elsevier IoT Journal 2022

  11. arXiv:2209.04729  [pdf, other

    cs.NI

    The Switch from Conventional to SDN: The Case for Transport-Agnostic Congestion Control

    Authors: Ahmed M. Abdelmoniem, Brahim Bensaou

    Abstract: To meet the timing requirements of interactive applications, the no-frills congestion-agnostic transport protocols like UDP are increasingly deployed side-by-side in the same network with congestion-responsive TCP. In cloud platforms, even though the computation and storage is totally virtualized, they lack a true virtualization mechanism for the network (i.e., the underlying data centers networks… ▽ More

    Submitted 10 September, 2022; originally announced September 2022.

    Comments: This work combines works published in IEEE ICC 2016 and IEEE LCN 2017

  12. Towards Energy-Aware Federated Learning on Battery-Powered Clients

    Authors: Amna Arouj, Ahmed M. Abdelmoniem

    Abstract: Federated learning (FL) is a newly emerged branch of AI that facilitates edge devices to collaboratively train a global machine learning model without centralizing data and with privacy by default. However, despite the remarkable advancement, this paradigm comes with various challenges. Specifically, in large-scale deployments, client heterogeneity is the norm which impacts training quality such a… ▽ More

    Submitted 24 August, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted in FedEdge workshop of ACM MobiCom 2022

  13. Resource-Efficient Federated Learning

    Authors: Ahmed M. Abdelmoniem, Atal Narayan Sahu, Marco Canini, Suhaib A. Fahmy

    Abstract: Federated Learning (FL) enables distributed training by learners using local data, thereby enhancing privacy and reducing communication. However, it presents numerous challenges relating to the heterogeneity of the data distribution, device capabilities, and participant availability as deployments scale, which can impact both model convergence and bias. Existing FL schemes use random participant s… ▽ More

    Submitted 4 November, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted to appear in ACM EuroSys 2023

  14. arXiv:2108.00951  [pdf, other

    cs.LG cs.DC math.OC

    Rethinking gradient sparsification as total error minimization

    Authors: Atal Narayan Sahu, Aritra Dutta, Ahmed M. Abdelmoniem, Trambak Banerjee, Marco Canini, Panos Kalnis

    Abstract: Gradient compression is a widely-established remedy to tackle the communication bottleneck in distributed training of large deep neural networks (DNNs). Under the error-feedback framework, Top-$k$ sparsification, sometimes with $k$ as little as $0.1\%$ of the gradient size, enables training to the same model quality as the uncompressed case for a similar iteration count. From the optimization pers… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 33 pages, 31 figures

  15. arXiv:2106.14100  [pdf, other

    cs.NI cs.DC

    Implementation and Evaluation of Data Center Congestion Controller with Switch Assistance

    Authors: Ahmed M. Abdelmoniem, Brahim Bensaou

    Abstract: In this work, we provide the design and implementation of a switch-assisted congestion control algorithm for data center networks (DCNs). In particular, we provide a prototype of the switch-driven congestion control algorithm and deploy it in a real data center. The prototype is based on few simple modifications to the switch software. The modifications imposed by the algorithm on the switch are t… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.00339

  16. arXiv:2102.07500  [pdf, other

    cs.LG cs.DC cs.PF

    On the Impact of Device and Behavioral Heterogeneity in Federated Learning

    Authors: Ahmed M. Abdelmoniem, Chen-Yu Ho, Pantelis Papageorgiou, Muhammad Bilal, Marco Canini

    Abstract: Federated learning (FL) is becoming a popular paradigm for collaborative learning over distributed, private datasets owned by non-trusting entities. FL has seen successful deployment in production environments, and it has been adopted in services such as virtual keyboards, auto-completion, item recommendation, and several IoT applications. However, FL comes with the challenge of performing trainin… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

  17. arXiv:2102.07477  [pdf, other

    cs.NI cs.DC cs.PF

    T-RACKs: A Faster Recovery Mechanism for TCP in Data Center Networks

    Authors: Ahmed M. Abdelmoniem, Brahim Bensaou

    Abstract: Cloud interactive data-driven applications generate swarms of small TCP flows that compete for the small buffer space in data-center switches. Such applications require a short flow completion time (FCT) to perform their jobs effectively. However, TCP is oblivious to the composite nature of application data and artificially inflates the FCT of such flows by several orders of magnitude. This is due… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted for Publication in ACM/IEEE Transactions on Networking (ToN)

  18. arXiv:2101.10761  [pdf, other

    cs.LG cs.DC

    An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems

    Authors: Ahmed M. Abdelmoniem, Ahmed Elzanaty, Mohamed-Slim Alouini, Marco Canini

    Abstract: The recent many-fold increase in the size of deep neural networks makes efficient distributed training challenging. Many proposals exploit the compressibility of the gradients and propose lossy compression techniques to speed up the communication stage of distributed training. Nevertheless, compression comes at the cost of reduced model quality and extra computation overhead. In this work, we desi… ▽ More

    Submitted 17 March, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: Accepted at the 2021 Machine Learning and Systems (MLSys) Conference

  19. arXiv:2012.00339  [pdf, other

    cs.NI

    Design and Implementation of Fair Congestion Control for Data Centers Networks

    Authors: Ahmed M. Abdelmoniem, Brahim Bensaou

    Abstract: In data centers, the nature of the composite bursty traffic along with the small bandwidth-delay product and switch buffers lead to several congestion problems that are not handled well by traditional congestion control mechanisms such as TCP. Existing work try to address the problem by modifying TCP to suit the operational nature of data centers. This is practically feasible in private settings,… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  20. arXiv:1911.08250  [pdf, other

    cs.DC cs.LG math.OC

    On the Discrepancy between the Theoretical Analysis and Practical Implementations of Compressed Communication for Distributed Deep Learning

    Authors: Aritra Dutta, El Houcine Bergou, Ahmed M. Abdelmoniem, Chen-Yu Ho, Atal Narayan Sahu, Marco Canini, Panos Kalnis

    Abstract: Compressed communication, in the form of sparsification or quantization of stochastic gradients, is employed to reduce communication costs in distributed data-parallel training of deep neural networks. However, there exists a discrepancy between theory and practice: while theoretical analysis of most existing compression methods assumes compression is applied to the gradients of the entire model,… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: To Appear In Proceedings of Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

    Journal ref: In Proceedings of Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020