-
Wireless MapReduce Arrays for Coded Distributed Computing
Authors:
Elizabath Peter,
K. K. Krishnan Namboodiri,
B. Sundar Rajan
Abstract:
We consider a wireless distributed computing system based on the MapReduce framework, which consists of three phases: Map, Shuffle, and Reduce. The system consists of a set of distributed nodes assigned to compute arbitrary output functions depending on a file library. The computation of the output functions is decomposed into Map and Reduce functions, and the Shuffle phase, which involves the data exchange, links the two. In our model, the Shuffle phase communication happens over a full-duplex wireless interference channel. For this setting, a coded wireless MapReduce distributed computing scheme exists in the literature, achieving optimal performance under one-shot linear schemes. However, the scheme requires the number of input files to be very large, growing exponentially with the number of nodes. We present schemes that require the number of files to be in the order of the number of nodes and achieve the same performance as the existing scheme. The schemes are obtained by designing a structure called wireless MapReduce array that succinctly represents all three phases in a single array. The wireless MapReduce arrays can also be obtained from the extended placement delivery arrays known for multi-antenna coded caching schemes.
Submitted 22 June, 2024;
originally announced June 2024.
-
Placement Delivery Arrays for Coded Caching with Shared and Private Caches
Authors:
K. K. Krishnan Namboodiri,
Elizabath Peter,
B. Sundar Rajan
Abstract:
We consider a coded caching network consisting of a server with a library of $N$ files connected to $K$ users, where each user is equipped with a dedicated cache of size $M_p$ units. In addition to that, the network consists of $\Lambda \leq K$ helper caches, each with a size $M_h$ units. Each helper cache can serve an arbitrary number of users; however, each user can access only a single helper cache. Also, we assume that the server knows the user-to-helper cache association, defined as the sets of users connected to each helper cache, during the cache placement phase. We propose a solution for the aforementioned coded caching problem by introducing a combinatorial structure called a Shared and Private Placement Delivery Array (SP-PDA). These SP-PDAs describe the helper cache placement, private cache placement, and the server transmissions in a single array. Further, we propose a novel construction of SP-PDAs using two Placement Delivery Arrays (PDAs). Interestingly, we observe that the permutations of the columns of the two chosen PDAs result in SP-PDAs with different performances. Moreover, we characterize the conditions for selecting the best column permutations of the chosen PDAs. Furthermore, the coded caching schemes resulting from SP-PDAs subsume two existing coded caching schemes as special cases. Additionally, SP-PDAs enable the construction of coded caching schemes with much smaller subpacketization numbers (the subpacketization number is the number of subfiles into which a file is divided) compared to the existing schemes, without paying much in terms of rate (the size of the transmission in the delivery phase).
Submitted 4 May, 2024;
originally announced May 2024.
-
Two-Dimensional Multi-Access Coded Caching with Multiple Transmit Antennas
Authors:
K. K. Krishnan Namboodiri,
Elizabath Peter,
B. Sundar Rajan
Abstract:
This work introduces a multi-antenna coded caching problem in a two-dimensional multi-access network, where a server with $L$ transmit antennas and $N$ files communicates to $K_1K_2$ users, each with a single receive antenna, through a wireless broadcast link. The network consists of $K_1K_2$ cache nodes and $K_1K_2$ users. The cache nodes, each with capacity $M$, are placed on a rectangular grid with $K_1$ rows and $K_2$ columns, and the users are placed regularly on the square grid such that a user can access $r^2$ neighbouring caches in a cyclic wrap-around fashion. For a given cache memory $M$, the goal of the coded caching problem is to serve the user demands with a minimum delivery time. We propose a solution for the aforementioned coded caching problem by designing two arrays: a caching array and a delivery array. Further, we present two classes of caching and delivery arrays and obtain corresponding multi-access coded caching schemes. The first scheme achieves a normalized delivery time (NDT) $\frac{K_1K_2(1-r^2\frac{M}{N})}{L+K_1K_2\frac{M}{N}}$. The second scheme achieves an NDT $\frac{K_1K_2(1-r^2\frac{M}{N})}{L+K_1K_2r^2\frac{M}{N}}$ when $M/N=1/(K_1K_2)$ and $L=K_1K_2-r^2$, which is optimal under uncoded placement and one-shot delivery.
Submitted 4 May, 2024;
originally announced May 2024.
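As a rough illustration of the two NDT expressions quoted in the abstract above, the following minimal sketch evaluates them with exact fractions. The function names and the sample values are hypothetical and not taken from the paper, and the condition on the second scheme is read here as $M/N = 1/(K_1K_2)$, which is an assumption about the intended grouping.

```python
from fractions import Fraction


def ndt_first_scheme(K1, K2, r, L, mem_ratio):
    """NDT of the first scheme: K1*K2*(1 - r^2*M/N) / (L + K1*K2*M/N).

    mem_ratio is the normalized cache size M/N, given as a Fraction.
    """
    K = K1 * K2
    return K * (1 - r * r * mem_ratio) / (L + K * mem_ratio)


def ndt_second_scheme(K1, K2, r, L, mem_ratio):
    """NDT of the second scheme: K1*K2*(1 - r^2*M/N) / (L + K1*K2*r^2*M/N).

    The abstract states this expression only for M/N = 1/(K1*K2) and
    L = K1*K2 - r^2 (assumed reading), so other inputs are rejected.
    """
    K = K1 * K2
    if mem_ratio != Fraction(1, K) or L != K - r * r:
        raise ValueError("expression is stated only for M/N = 1/(K1*K2), L = K1*K2 - r^2")
    return K * (1 - r * r * mem_ratio) / (L + K * r * r * mem_ratio)


# Illustrative values only: a 3x3 grid, each user accessing r^2 = 4 caches.
K1, K2, r = 3, 3, 2
mem_ratio = Fraction(1, K1 * K2)   # M/N = 1/9
L = K1 * K2 - r * r                # L = 5 antennas
first = ndt_first_scheme(K1, K2, r, L, mem_ratio)
second = ndt_second_scheme(K1, K2, r, L, mem_ratio)
print(first, second)
```

Under the stated conditions the second expression simplifies to $(K_1K_2 - r^2)/(K_1K_2)$, since its denominator becomes $L + r^2 = K_1K_2$.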
-
Multi-Antenna Coded Caching for Multi-Access Networks with Cyclic Wrap-Around
Authors:
Elizabath Peter,
K. K. Krishnan Namboodiri,
B. Sundar Rajan
Abstract:
This work explores a multiple transmit antenna setting in a multi-access coded caching (MACC) network where each user accesses more than one cache. A MACC network has $K$ users and $K$ caches, and each user has access to $r < K$ consecutive caches in a cyclic wrap-around manner. There are $L$ antennas at the server, and each cache has a normalized size of $M/N \leq 1$. The cyclic wrap-around MACC network with a single antenna at the server has been a well-investigated topic, and several coded caching schemes and improved lower bounds on the performance are known for the same. However, this MACC network has not yet been studied under multi-antenna settings in the coded caching literature. We study the multi-antenna MACC problem and propose a solution for the same by constructing a pair of arrays called caching and delivery arrays. We present three constructions of caching and delivery arrays for different scenarios and obtain corresponding multi-antenna MACC schemes for the same. Two schemes resulting from the above constructions achieve optimal performance under uncoded placement and one-shot delivery. The optimality is shown by matching the performance of the multi-antenna MACC scheme to that of an optimal multi-antenna scheme for a dedicated cache network having an identical number of users, and each user has a normalized cache size of $rM/N$. Further, as a special case, one of the proposed schemes subsumes an existing optimal MACC scheme for the single-antenna setting.
Submitted 13 October, 2023;
originally announced October 2023.
-
Coded Caching with Shared Caches and Private Caches
Authors:
Elizabath Peter,
K. K. Krishnan Namboodiri,
B. Sundar Rajan
Abstract:
This work studies the coded caching problem in a setting where the users are simultaneously endowed with a private cache and a shared cache. The setting consists of a server connected to a set of users, assisted by a smaller number of helper nodes that are equipped with their own storage. In addition to the helper cache, each user possesses a dedicated cache which is also used to prefetch file contents. Each helper cache can serve an arbitrary number of users, but each user gets served by only one helper cache. We consider two scenarios: (a) the server has no prior information about the user-to-helper cache association, and (b) the server knows the user-to-helper cache association at the placement phase itself. We design centralized coded caching schemes under uncoded placement for the above two settings. For case (b), two schemes are proposed that are optimal in certain memory regimes. Further, a cut-set based lower bound is derived and used to show that one of the proposed schemes for case (b) is optimal in a certain memory regime.
Submitted 24 March, 2024; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Extended Placement Delivery Arrays for Multi-Antenna Coded Caching Scheme
Authors:
K. K. Krishnan Namboodiri,
Elizabath Peter,
B. Sundar Rajan
Abstract:
The multi-antenna coded caching problem, where a server with $L$ transmit antennas communicates with $K$ users through a wireless broadcast link, is addressed. In the problem setting, the server has a library of $N$ files, and each user is equipped with a dedicated cache of capacity $M$. The idea of extended placement delivery array (EPDA), an array which consists of a special symbol $\star$ and integers in a set $\{1,2,\dots,S\}$, is proposed to obtain a novel solution for the aforementioned multi-antenna coded caching problem. From a $(K,L,F,Z,S)$ EPDA, a multi-antenna coded caching scheme with $K$ users, and the server with $L$ transmit antennas, can be obtained in which the normalized memory $\frac{M}{N}=\frac{Z}{F}$, and the delivery time $T=\frac{S}{F}$. The placement delivery array (for single-antenna coded caching scheme) is a special class of EPDAs with $L=1$. For the multi-antenna coded caching schemes constructed from EPDAs, it is shown that the maximum possible degrees of freedom (DoF) that can be achieved is $t+L$, where $t=\frac{KM}{N}$ is an integer. Furthermore, two constructions of EPDAs are proposed: a) $K=t+L$, and b) $K=nt+(n-1)L$, $L\geq t$, where $n\geq 2$ is an integer. The resulting multi-antenna schemes from those EPDAs achieve the full DoF, while requiring a subpacketization number $\frac{K}{\text{gcd}(K,t,L)}$. This subpacketization number is less than that required by previously known schemes in the literature.
Submitted 25 January, 2022;
originally announced January 2022.
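The parameter relations stated in the EPDA abstract above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and the sample tuple $(K,L,F,Z,S)=(4,2,2,1,1)$ is a made-up set of numbers chosen to be arithmetically consistent with $K=t+L$, not an array taken from the paper.

```python
from fractions import Fraction
from math import gcd


def epda_scheme_parameters(K, L, F, Z, S):
    """Return (M/N, T) for a (K, L, F, Z, S) EPDA, using M/N = Z/F and T = S/F."""
    return Fraction(Z, F), Fraction(S, F)


def full_dof_subpacketization(K, t, L):
    """Subpacketization K / gcd(K, t, L) quoted for the two proposed constructions."""
    return K // gcd(gcd(K, t), L)


# Hypothetical parameters (not from the paper): K = 4 users, L = 2 antennas.
K, L, F, Z, S = 4, 2, 2, 1, 1
mem_ratio, delivery_time = epda_scheme_parameters(K, L, F, Z, S)
t = K * mem_ratio                      # t = KM/N, must be an integer
assert t.denominator == 1
subpacketization = full_dof_subpacketization(K, int(t), L)
print(mem_ratio, delivery_time, int(t), subpacketization)
```

With these illustrative numbers, $t=2$ and $K=t+L$, so the first construction's condition holds and the quoted subpacketization is $4/\gcd(4,2,2)=2$.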
-
Shared Cache Coded Caching Schemes with known User-to-Cache Association Profile using Placement Delivery Arrays
Authors:
Elizabath Peter,
K. K. Krishnan Namboodiri,
B. Sundar Rajan
Abstract:
This work considers the coded caching problem with shared caches, where users share the caches, and each user gets access only to one cache. The user-to-cache association is assumed to be known at the server during the placement phase. We focus on the schemes derived using placement delivery arrays (PDAs). The PDAs were originally designed to address the sub-packetization bottleneck of coded caching in a dedicated cache setup. We observe that in the setup of this paper permuting the columns of the PDA results in schemes with different performance for the same problem, but the sub-packetization level remains the same. This is contrary to what was observed for dedicated cache networks. We propose a procedure to identify the ordering of columns that gives the best performance possible for the PDA employed for the given problem. Further, some specific classes of PDAs are chosen and the performance gain achieved by reordering the columns of the PDA is illustrated.
Submitted 27 January, 2022; v1 submitted 25 January, 2022;
originally announced January 2022.
-
A Secretive Coded Caching for Shared Cache Systems using PDAs
Authors:
Elizabath Peter,
K. K. Krishnan Namboodiri,
B. Sundar Rajan
Abstract:
This paper considers the secretive coded caching problem with shared caches in which no user must have access to the files that it did not demand. In a shared cache network, the users are served by a smaller number of helper caches and each user is connected to exactly one helper cache. To ensure the secrecy constraint in shared cache networks, each user is required to have an individual cache of at least unit file size. For this setting, a secretive coded caching scheme was proposed recently in the literature ("Secretive Coded Caching with Shared Caches", in IEEE Communications Letters, 2021), and it requires a subpacketization level which is in the exponential order of the number of helper caches. By utilizing the PDA constructions, we propose a procedure to obtain new secretive coded caching schemes for shared caches with reduced subpacketization levels. We also show that the existing secretive coded caching scheme for shared caches can be recovered using our procedure. Furthermore, we derive a lower bound on the secretive transmission rate using cut-set arguments and demonstrate the order-optimality of the proposed secretive coded caching scheme.
Submitted 21 October, 2021;
originally announced October 2021.
-
Coded Caching with Shared Caches from Generalized Placement Delivery Arrays
Authors:
Elizabath Peter,
B. Sundar Rajan
Abstract:
We consider the coded caching problem with shared caches where several users share a cache, but each user has access to only a single cache. For this network, the fundamental limits of coded caching are known for centralized and decentralized settings under uncoded placement. In the centralized case, to achieve the gains offered by coded caching, one requires a sub-packetization which increases exponentially with the number of caches. The dedicated cache networks had a similar issue, and placement delivery arrays (PDAs) were introduced as a solution to it. Using the PDA framework, we propose a procedure to obtain new coded caching schemes for shared caches with lower sub-packetization requirements. The advantage of this procedure is that we can transform all the existing PDA structures into coded caching schemes for shared caches, thus resulting in low sub-packetization schemes. We also show that the optimal scheme given by Parrinello, Unsal and Elia (Fundamental Limits of Coded Caching with Multiple Antennas, Shared Caches and Uncoded Prefetching) can be recovered using a Maddah-Ali Niesen PDA.
Submitted 1 July, 2021;
originally announced July 2021.
-
Decentralized and Online Coded Caching with Shared Caches: Fundamental Limits with Uncoded Prefetching
Authors:
Elizabath Peter,
B. Sundar Rajan
Abstract:
The decentralized coded caching scheme, introduced by Maddah-Ali and Niesen, assumes that the caches are filled with no coordination. This work identifies a decentralized coded caching scheme -- under the assumption of uncoded placement -- for a shared cache network, where each cache serves multiple users. Each user has access to only a single cache and the number of caches is less than or equal to the number of users. For this setting, we derive the optimal worst-case delivery time for any user-to-cache association profile where each such profile describes the number of users served by each cache. The optimality is shown using an index-coding based converse. Further, we improve the delivery scheme to accommodate redundant demands. Also, an optimal linear error correcting delivery scheme is proposed for the worst-case demand scenario. Next, we consider the Least Recently Sent (LRS) online coded caching scheme where the caches need to be updated based on the sequence of demands made by the users. Cache update happens if any of the demanded files was not partially cached at the users. The update is done by replacing the least recently sent file with the new file. But the least recently sent file need not be unique. In that case, there needs to be some ordering of the files which are getting partially cached, or else centralized coordination would have to be assumed, which does not exist. If each user removes any of the least recently used files at random, then the next delivery phase will not serve the purpose. A modification is suggested for the scheme by incorporating an ordering of files. Moreover, all the above results with shared caches are extended to the online setting.
Submitted 23 January, 2021;
originally announced January 2021.
-
Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries
Authors:
Maciej Besta,
Robert Gerstenberger,
Emanuel Peter,
Marc Fischer,
Michał Podstawski,
Claude Barthels,
Gustavo Alonso,
Torsten Hoefler
Abstract:
Graph processing has become an important part of multiple areas of computer science, such as machine learning, computational sciences, medical applications, social network analysis, and many others. Numerous graphs such as web or social networks may contain up to trillions of edges. Often, these graphs are also dynamic (their structure changes over time) and have domain-specific rich data associated with vertices and edges. Graph database systems such as Neo4j enable storing, processing, and analyzing such large, evolving, and rich datasets. Due to the sheer size of such datasets, combined with the irregular nature of graph processing, these systems face unique design challenges. To facilitate the understanding of this emerging domain, we present the first survey and taxonomy of graph database systems. We focus on identifying and analyzing fundamental categories of these systems (e.g., triple stores, tuple stores, native graph database systems, or object-oriented systems), the associated graph models (e.g., RDF or Labeled Property Graph), data organization techniques (e.g., storing graph data in indexing structures or dividing data into records), and different aspects of data distribution and query execution (e.g., support for sharding and ACID). 51 graph database systems are presented and compared, including Neo4j, OrientDB, or Virtuoso. We outline graph database queries and relationships with associated domains (NoSQL stores, graph streaming, and dynamic graph algorithms). Finally, we describe research and engineering challenges to outline the future of graph databases.
Submitted 30 August, 2023; v1 submitted 20 October, 2019;
originally announced October 2019.
-
Fundamental Results for a Generic Implementation of Barriers using Optical Interconnects
Authors:
Sandeep Chandran,
Eldhose Peter,
Preeti Ranjan Panda,
Smruti R. Sarangi
Abstract:
In this report, we present some fundamental results and bounds on the number of messages and storage required to implement barriers using futuristic on-chip optical and RF networks. We prove that it is necessary to maintain a count to at least N (number of threads) in memory and to broadcast the barrier id at least once, and that, if we elect a co-ordinator, we can reduce the number of messages by a factor of O(N).
Submitted 1 October, 2015;
originally announced October 2015.
-
Exciton photon strong-coupling regime for a single quantum dot in a microcavity
Authors:
Emmanuelle Peter,
Pascale Senellart,
David Martrou,
Aristide Lemaitre,
Jacqueline Bloch,
Julien Hours,
Jean-Michel Gerard
Abstract:
We report on the observation of the strong coupling regime between a single GaAs quantum dot and a microdisk optical mode. Photoluminescence is performed at various temperatures to tune the quantum dot exciton with respect to the optical mode. At resonance, we observe an anticrossing, signature of the strong coupling regime with a well resolved doublet. The Vacuum Rabi splitting amounts to 400 μeV and is twice as large as the individual linewidths.
Submitted 3 December, 2004; v1 submitted 10 November, 2004;
originally announced November 2004.