Skip to main content

Showing 1–15 of 15 results for author: Afzal, S

Searching in archive cs. Search in all archives.
.
  1. Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding

    Authors: Vignesh V Menon, **gwen Zhu, Prajit T Rajendran, Samira Afzal, Klaus Schoeffmann, Patrick Le Callet, Christian Timmerer

    Abstract: In HTTP adaptive live streaming applications, video segments are encoded at a fixed set of bitrate-resolution pairs known as bitrate ladder. Live encoders use the fastest available encoding configuration, referred to as preset, to ensure the minimum possible latency in video encoding. However, an optimized preset and optimized number of CPU threads for each encoding instance may result in (i) incr… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 2024 Mile High Video (MHV)

  2. arXiv:2401.09854  [pdf, other

    cs.MM

    A Survey on Energy Consumption and Environmental Impact of Video Streaming

    Authors: Samira Afzal, Narges Mehran, Zoha Azimi Ourimi, Farzad Tashtarian, Hadi Amirpour, Radu Prodan, Christian Timmerer

    Abstract: Climate change challenges require a notable decrease in worldwide greenhouse gas (GHG) emissions across technology sectors. Digital technologies, especially video streaming, accounting for most Internet traffic, make no exception. Video streaming demand increases with remote working, multimedia communication services (e.g., WhatsApp, Skype), video streaming content (e.g., YouTube, Netflix), video… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  3. arXiv:2311.08074  [pdf, other

    cs.MM

    Content-Adaptive Variable Framerate Encoding Scheme for Green Live Streaming

    Authors: Vignesh V Menon, Samira Afzal, Prajit T Rajendran, Klaus Schoeffmann, Radu Prodan, Christian Timmerer

    Abstract: Adaptive live video streaming applications use a fixed predefined configuration for the bitrate ladder with constant framerate and encoding presets in a session. However, selecting optimized framerates and presets for every bitrate ladder representation can enhance perceptual quality, improve computational resource allocation, and thus, the streaming energy efficiency. In particular, low framerate… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  4. arXiv:2310.09570  [pdf, other

    cs.MM

    Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Streaming

    Authors: Vignesh V Menon, Reza Farahani, Prajit T Rajendran, Samira Afzal, Klaus Schoeffmann, Christian Timmerer

    Abstract: With the emergence of multiple modern video codecs, streaming service providers are forced to encode, store, and transmit bitrate ladders of multiple codecs separately, consequently suffering from additional energy costs for encoding, storage, and transmission. To tackle this issue, we introduce an online energy-efficient Multi-Codec Bitrate ladder Estimation scheme (MCBE) for adaptive video strea… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted in IEEE International Conference on Visual Communications and Image Processing (VCIP), 2023

  5. arXiv:2305.07665  [pdf, other

    cs.AI

    A Comprehensive Survey on Affective Computing; Challenges, Trends, Applications, and Future Directions

    Authors: Sitara Afzal, Haseeb Ali Khan, Imran Ullah Khan, Md. Jalil Piran, Jong Weon Lee

    Abstract: As the name suggests, affective computing aims to recognize human emotions, sentiments, and feelings. There is a wide range of fields that study affective computing, including languages, sociology, psychology, computer science, and physiology. However, no research has ever been done to determine how machine learning (ML) and mixed reality (XR) interact together. This paper discusses the significan… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  6. arXiv:2305.07445  [pdf, other

    eess.AS cs.CL cs.SD

    QVoice: Arabic Speech Pronunciation Learning Application

    Authors: Yassine El Kheir, Fouad Khnaisser, Shammur Absar Chowdhury, Hamdy Mubarak, Shazia Afzal, Ahmed Ali

    Abstract: This paper introduces a novel Arabic pronunciation learning application QVoice, powered with end-to-end mispronunciation detection and feedback generator module. The application is designed to support non-native Arabic speakers in enhancing their pronunciation skills, while also hel** native speakers mitigate any potential influence from regional dialects on their Modern Standard Arabic (MSA) pr… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 2 pages, Accepted InterSpeech23 Show & Tell Demo Session

    Journal ref: InterSpeech 2023

  7. arXiv:2302.09688  [pdf, other

    cs.HC cs.AI cs.LG

    AutoDOViz: Human-Centered Automation for Decision Optimization

    Authors: Daniel Karl I. Weidele, Shazia Afzal, Abel N. Valente, Cole Makuch, Owen Cornec, Long Vu, Dharmashankar Subramanian, Werner Geyer, Rahul Nair, Inge Vejsbjerg, Radu Marinescu, Paulito Palmes, Elizabeth M. Daly, Loraine Franke, Daniel Haehn

    Abstract: We present AutoDOViz, an interactive user interface for automated decision optimization (AutoDO) using reinforcement learning (RL). Decision optimization (DO) has classically being practiced by dedicated DO researchers where experts need to spend long periods of time fine tuning a solution through trial-and-error. AutoML pipeline search has sought to make it easier for a data scientist to find the… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

  8. arXiv:2211.00923  [pdf, other

    cs.SD cs.CL eess.AS

    SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation

    Authors: Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali, Hamdy Mubarak, Shazia Afzal

    Abstract: The lack of labeled second language (L2) speech data is a major challenge in designing mispronunciation detection models. We introduce SpeechBlender - a fine-grained data augmentation pipeline for generating mispronunciation errors to overcome such data scarcity. The SpeechBlender utilizes varieties of masks to target different regions of phonetic units, and use the mixing factors to linearly inte… ▽ More

    Submitted 12 July, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 5 pages

  9. Towards Battery-Free Machine Learning and Inference in Underwater Environments

    Authors: Yuchen Zhao, Sayed Saad Afzal, Waleed Akbar, Osvy Rodriguez, Fan Mo, David Boyle, Fadel Adib, Hamed Haddadi

    Abstract: This paper is motivated by a simple question: Can we design and build battery-free devices capable of machine learning and inference in underwater environments? An affirmative answer to this question would have significant implications for a new generation of underwater sensing and monitoring applications for environmental monitoring, scientific exploration, and climate/weather prediction. To an… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 6 pages, HotMobile '22, March 9-10, 2022, Tempe, AZ, USA

  10. arXiv:2108.05935  [pdf, other

    cs.LG

    Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets

    Authors: Nitin Gupta, Hima Patel, Shazia Afzal, Naveen Panwar, Ruhi Sharma Mittal, Shanmukha Guttula, Abhinav Jain, Lokesh Nagalapatti, Sameep Mehta, Sandeep Hans, Pranay Lohia, Aniya Aggarwal, Diptikalyan Saha

    Abstract: The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks. However these techniques are not applicable to detect data issues in the context of machine learning tasks, like noisy labels, existence of overlap** classes… ▽ More

    Submitted 5 September, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

  11. arXiv:2107.10232  [pdf, other

    cs.CR

    A low-overhead approach for self-sovereign identity in IoT

    Authors: Geovane Fedrecheski, Laisa C. P. Costa, Samira Afzal, Jan M. Rabaey, Roseli D. Lopes, Marcelo K. Zuffo

    Abstract: We present a low-overhead mechanism for self-sovereign identification and communication of IoT agents in constrained networks. Our main contribution is to enable native use of Decentralized Identifiers (DIDs) and DID-based secure communication on constrained networks, whereas previous works either did not consider the issue or relied on proxy-based architectures. We propose a new extension to DIDs… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

  12. arXiv:2010.11897  [pdf

    cs.HC

    A Visual Analytics Based Decision Making Environment for COVID-19 Modeling and Visualization

    Authors: Shehzad Afzal, Sohaib Ghani, Hank C. Jenkins-Smith, David S. Ebert, Markus Hadwiger, Ibrahim Hoteit

    Abstract: Public health officials dealing with pandemics like COVID-19 have to evaluate and prepare response plans. This planning phase requires not only looking into the spatiotemporal dynamics and impact of the pandemic using simulation models, but they also need to plan and ensure the availability of resources under different spread scenarios. To this end, we have developed a visual analytics environment… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  13. arXiv:2010.07213  [pdf, other

    cs.DB cs.AI

    Data Readiness Report

    Authors: Shazia Afzal, Rajmohan C, Manish Kesarwani, Sameep Mehta, Hima Patel

    Abstract: Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which limits their reuse and results in loss of productivity. We introduce the concept of a Data Readiness Report as an accompanying documentation to a dataset that allow… ▽ More

    Submitted 15 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

  14. arXiv:1909.10173  [pdf, other

    cs.HC

    Route Packing: Geospatially-Accurate Visualization of Route Networks

    Authors: Jieqiong Zhao, Morteza Karimzadeh, Hanye Xu, Abish Malik, Shehzad Afzal, Guizhen Wang, Niklas Elmqvist, David S. Ebert

    Abstract: We present route packing, a novel (geo)visualization technique for displaying several routes simultaneously on a geographic map while preserving the geospatial layout, identity, directionality, and volume of individual routes. The technique collects variable-width route lines side by side while minimizing crossings, encodes them with categorical colors, and decorates them with glyphs to show their… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: 10 pages, 11 figures, 2 tables, The 53rd Hawaii International Conference on System Sciences (HICSS-53)

  15. arXiv:1906.06184  [pdf, other

    cs.NI cs.MM

    A Holistic Survey of Wireless Multipath Video Streaming

    Authors: Samira Afzal, Vanessa Testoni, Christian Esteve Rothenberg, Prakash Kolan, Imed Bouazizi

    Abstract: Most of today's mobile devices are equipped with multiple network interfaces and one of the main bandwidth-hungry applications that would benefit from multipath communications is wireless video streaming. However, most of the current transport protocols do not match the requirements of video streaming applications or are not designed to address relevant issues, such as delay constraints, networks… ▽ More

    Submitted 21 September, 2021; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: 44 pages. 11 figures. 9 Tables. 228 References. Preprint article version under submission to Journal of Network and Computer Applications (JNCA) 2021