-
Towards Enabling High-Five Over WiFi
Authors:
Vineet Gokhale,
Mohamad Eid,
Kees Kroep,
R. Venkatesha Prasad,
Vijay Rao
Abstract:
The next frontier for immersive applications is enabling sentience over the Internet. Tactile Internet (TI) envisages transporting skills by providing Ultra-Low Latency (ULL) communications for transporting touch senses. In this work, we focus our study on the first/last mile communication, where the future generation WiFi-7 is pitched as the front-runner for ULL applications. We discuss a few can…
▽ More
The next frontier for immersive applications is enabling sentience over the Internet. Tactile Internet (TI) envisages transporting skills by providing Ultra-Low Latency (ULL) communications for transporting touch senses. In this work, we focus our study on the first/last mile communication, where the future generation WiFi-7 is pitched as the front-runner for ULL applications. We discuss a few candidate features of WiFi-7 and highlight its major pitfalls with respect to ULL communication. Further, through a specific implementation of WiFi-7 (vanilla WiFi-7) in our custom simulator, we demonstrate the impact of one of the pitfalls - standard practice of using jitter buffer in conjunction with frame aggregation - on TI communication. To circumvent this, we propose Non-Buffered Scheme (NoBuS) - a simple MAC layer enhancement for enabling TI applications on WiFi-7. NoBuS trades off packet loss for latency enabling swift synchronization between the master and controlled domains. Our findings reveal that employing NoBuS yields a significant improvement in RMSE of TI signals. Further, we show that the worst-case WiFi latency with NoBuS is 3.72 ms - an order of magnitude lower than vanilla WiFi-7 even under highly congested network conditions.
△ Less
Submitted 2 November, 2021;
originally announced November 2021.
-
ETVO: Effectively Measuring Tactile Internet with Experimental Validation
Authors:
H. J. C. Kroep,
V. Gokhale,
J. Verburg,
R. Venkatesha Prasad
Abstract:
The next frontier in communications is teleoperation -- manipulation and control of remote environments with feedback. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) is primary. Realizing ULL communication demands significant redesign of conventional networking techniques, and the network infrastructure envisioned for ach…
▽ More
The next frontier in communications is teleoperation -- manipulation and control of remote environments with feedback. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) is primary. Realizing ULL communication demands significant redesign of conventional networking techniques, and the network infrastructure envisioned for achieving this is termed as Tactile Internet (TI). The design of the network infrastructure and meaningful performance metrics are crucial for seamless TI communication. However, existing performance metrics fall severely short of comprehensively characterizing TI performance. We take the first step towards bridging this gap. We take Dynamic Time War**(DTW) as the basis of our work and identify necessary changes for characterizing TI performance. Through substantial refinements to DTW, we design Effective Time- and Value-Offset (ETVO) -- a new method for measuring the fine-grained performance of TI systems. Through an in-depth objective analysis, we demonstrate the improvements of ETVO over DTW. Through human-in-the-loop subjective experiments, we demonstrate how and why existing QoS and QoE methods fall short of estimating the TI session performance accurately. Using subjective experiments, we demonstrate the behavior of the proposed metrics, their ability to match theoretically derived performance, and finally their ability to reflect user satisfaction in a practical setting. The results are highly encouraging.
△ Less
Submitted 12 July, 2021;
originally announced July 2021.
-
FEEL: Fast, Energy-Efficient Localization for Autonomous Indoor Vehicles
Authors:
Vineet Gokhale,
Gerardo Moyers Barrera,
R. Venkatesha Prasad
Abstract:
Autonomous vehicles have created a sensation in both outdoor and indoor applications. The famous indoor use-case is process automation inside a warehouse using Autonomous Indoor Vehicles (AIV). These vehicles need to locate themselves not only with an accuracy of a few centimetres but also within a few milliseconds in an energy-efficient manner. Due to these challenges, localization is a holy grai…
▽ More
Autonomous vehicles have created a sensation in both outdoor and indoor applications. The famous indoor use-case is process automation inside a warehouse using Autonomous Indoor Vehicles (AIV). These vehicles need to locate themselves not only with an accuracy of a few centimetres but also within a few milliseconds in an energy-efficient manner. Due to these challenges, localization is a holy grail. In this paper, we propose FEEL - an indoor localization system that uses a fusion of three low-energy sensors: IMU, UWB, and radar. We provide detailed software and hardware architecture of FEEL. Further, we propose Adaptive Sensing Algorithm (ASA) for opportunistically minimizing energy consumption of FEEL by adjusting the sensing frequency to the dynamics of the physical environment. Our extensive performance evaluation over diverse test settings reveal that FEEL provides a localization accuracy of <7cm with ultra-low latency of around 3ms. Further, ASA yields up to 20% energy saving with only a marginal trade-off in accuracy.
△ Less
Submitted 1 February, 2021;
originally announced February 2021.
-
Setting the Yardstick: A Quantitative Metric for Effectively Measuring Tactile Internet
Authors:
J. P. Verburg,
H. J. C. Kroep,
V. Gokhale,
R. Venkatesha Prasad,
V. Rao
Abstract:
The next frontier in communications is teleoperation -- manipulation and control of remote environments. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) being the primary one. Teleoperation, along with a host of other applications requiring ULL communication, is termed as Tactile Internet (TI). A significant redesign of co…
▽ More
The next frontier in communications is teleoperation -- manipulation and control of remote environments. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) being the primary one. Teleoperation, along with a host of other applications requiring ULL communication, is termed as Tactile Internet (TI). A significant redesign of conventional networking techniques is necessary to realize TI applications. Further, these advancements can be evaluated only when meaningful performance metrics are available. However, existing TI performance metrics fall severely short of comprehensively characterizing TI performance. In this paper, we take the first step towards bridging this gap. To this end, we propose a method that captures the fine-grained performance of TI in terms of delay and precision. We take Dynamic Time War** (DTW) as the basis of our work and identify whether it is sufficient in characterizing TI systems. We refine DTW by develo** a framework called Effective Time- and Value-Offset (ETVO) that extracts fine-grained time and value offsets between input and output signals of TI. Using ETVO, we present two quantitative metrics for TI -- Effective Delay-Derivative (EDD) and Effective Root Mean Square Error. Through rigorous experiments conducted on a realistic TI setup, we demonstrate the potential of the proposed metrics to precisely characterize TI interactions.
△ Less
Submitted 27 January, 2020; v1 submitted 6 January, 2020;
originally announced January 2020.
-
On QoS-Compliant Telehaptic Communication over Shared Networks
Authors:
Vineet Gokhale,
Jayakrishnan Nair,
Subhasis Chaudhuri,
Jan Fesl
Abstract:
The development of communication protocols for teleoperation with force feedback (generally known as telehaptics) has gained widespread interest over the past decade. Several protocols have been proposed for performing telehaptic interaction over shared networks. However, a comprehensive analysis of the impact of network cross-traffic on telehaptic streams, and the feasibility of Quality of Servic…
▽ More
The development of communication protocols for teleoperation with force feedback (generally known as telehaptics) has gained widespread interest over the past decade. Several protocols have been proposed for performing telehaptic interaction over shared networks. However, a comprehensive analysis of the impact of network cross-traffic on telehaptic streams, and the feasibility of Quality of Service (QoS) compliance is lacking in the literature. In this paper, we seek to fill this gap. Specifically, we explore the QoS experienced by two classes of telehaptic protocols on shared networks - Constant Bitrate (CBR) protocols and adaptive sampling based protocols, accounting for CBR as well as TCP cross-traffic. Our treatment of CBR-based telehaptic protocols is based on a micro-analysis of the interplay between TCP and CBR flows on a shared bottleneck link, which is broadly applicable for performance evaluation of CBR-based media streaming applications. Based on our analytical characterization of telehaptic QoS, and via extensive simulations and real network experiments, we formulate a set of sufficient conditions for telehaptic QoS-compliance. These conditions provide guidelines for designers of telehaptic protocols, and for network administrators to configure their networks for guaranteeing QoS-compliant telehaptic communication.
△ Less
Submitted 28 May, 2018;
originally announced May 2018.
-
Snowflake: A Model Agnostic Accelerator for Deep Convolutional Neural Networks
Authors:
Vinayak Gokhale,
Aliasger Zaidy,
Andre Xian Ming Chang,
Eugenio Culurciello
Abstract:
Deep convolutional neural networks (CNNs) are the deep learning model of choice for performing object detection, classification, semantic segmentation and natural language processing tasks. CNNs require billions of operations to process a frame. This computational complexity, combined with the inherent parallelism of the convolution operation make CNNs an excellent target for custom accelerators.…
▽ More
Deep convolutional neural networks (CNNs) are the deep learning model of choice for performing object detection, classification, semantic segmentation and natural language processing tasks. CNNs require billions of operations to process a frame. This computational complexity, combined with the inherent parallelism of the convolution operation make CNNs an excellent target for custom accelerators. However, when optimizing for different CNN hierarchies and data access patterns, it is difficult for custom accelerators to achieve close to 100% computational efficiency. In this work, we present Snowflake, a scalable and efficient accelerator that is agnostic to CNN workloads, and was designed to always perform at near-peak hardware utilization. Snowflake is able to achieve a computational efficiency of over 91% on modern CNN models. Snowflake, implemented on a Xilinx Zynq XC7Z045 SoC is capable of achieving a peak throughput of 128G-ops/s and a measured throughput of 100 frames per second and 120 G-ops/s on the AlexNet CNN model, 36 frames per second and 116G- ops/s on the GoogLeNet CNN model and 17 frames per second and 122 G-ops/s on the ResNet-50 CNN model. To the best of our knowledge, Snowflake is the only implemented system capable of achieving over 91% efficiency on modern CNNs and the only implemented system with GoogLeNet and ResNet as part of the benchmark suite.
△ Less
Submitted 8 August, 2017;
originally announced August 2017.
-
Compiling Deep Learning Models for Custom Hardware Accelerators
Authors:
Andre Xian Ming Chang,
Aliasger Zaidy,
Vinayak Gokhale,
Eugenio Culurciello
Abstract:
Convolutional neural networks (CNNs) are the core of most state-of-the-art deep learning algorithms specialized for object detection and classification. CNNs are both computationally complex and embarrassingly parallel. Two properties that leave room for potential software and hardware optimizations for embedded systems. Given a programmable hardware accelerator with a CNN oriented custom instruct…
▽ More
Convolutional neural networks (CNNs) are the core of most state-of-the-art deep learning algorithms specialized for object detection and classification. CNNs are both computationally complex and embarrassingly parallel. Two properties that leave room for potential software and hardware optimizations for embedded systems. Given a programmable hardware accelerator with a CNN oriented custom instructions set, the compiler's task is to exploit the hardware's full potential, while abiding with the hardware constraints and maintaining generality to run different CNN models with varying workload properties. Snowflake is an efficient and scalable hardware accelerator implemented on programmable logic devices. It implements a control pipeline for a custom instruction set. The goal of this paper is to present Snowflake's compiler that generates machine level instructions from Torch7 model description files. The main software design points explored in this work are: model structure parsing, CNN workload breakdown, loop rearrangement for memory bandwidth optimizations and memory access balancing. The performance achieved by compiler generated instructions matches against hand optimized code for convolution layers. Generated instructions also efficiently execute AlexNet and ResNet18 inference on Snowflake. Snowflake with $256$ processing units was synthesized on Xilinx's Zynq XC7Z045 FPGA. At $250$ MHz, AlexNet achieved in $93.6$ frames/s and $1.2$ GB/s of off-chip memory bandwidth, and $21.4$ frames/s and $2.2$ GB/s for ResNet18. Total on-chip power is $5$ W.
△ Less
Submitted 10 December, 2017; v1 submitted 31 July, 2017;
originally announced August 2017.
-
Congestion Control for Network-Aware Telehaptic Communication
Authors:
Vineet Gokhale,
Jayakrishnan Nair,
Subhasis Chaudhuri
Abstract:
Telehaptic applications involve delay-sensitive multimedia communication between remote locations with distinct Quality of Service (QoS) requirements for different media components. These QoS constraints pose a variety of challenges, especially when the communication occurs over a shared network, with unknown and time-varying cross-traffic. In this work, we propose a transport layer congestion con…
▽ More
Telehaptic applications involve delay-sensitive multimedia communication between remote locations with distinct Quality of Service (QoS) requirements for different media components. These QoS constraints pose a variety of challenges, especially when the communication occurs over a shared network, with unknown and time-varying cross-traffic. In this work, we propose a transport layer congestion control protocol for telehaptic applications operating over shared networks, termed as dynamic packetization module (DPM). DPM is a lossless, network-aware protocol which tunes the telehaptic packetization rate based on the level of congestion in the network. To monitor the network congestion, we devise a novel network feedback module, which communicates the end-to-end delays encountered by the telehaptic packets to the respective transmitters with negligible overhead. Via extensive simulations, we show that DPM meets the QoS requirements of telehaptic applications over a wide range of network cross-traffic conditions. We also report qualitative results of a real-time telepottery experiment with several human subjects, which reveal that DPM preserves the quality of telehaptic activity even under heavily congested network scenarios. Finally, we compare the performance of DPM with several previously proposed telehaptic communication protocols and demonstrate that DPM outperforms these protocols.
△ Less
Submitted 11 January, 2017; v1 submitted 3 October, 2016;
originally announced October 2016.