Skip to main content

Showing 1–7 of 7 results for author: Maturana, F

.
  1. arXiv:2405.09010  [pdf, ps, other

    cs.IT

    On Low Field Size Constructions of Access-Optimal Convertible Codes

    Authors: Saransh Chopra, Francisco Maturana, K. V. Rashmi

    Abstract: Most large-scale storage systems employ erasure coding to provide resilience against disk failures. Recent work has shown that tuning this redundancy to changes in disk failure rates leads to substantial storage savings. This process requires code conversion, wherein data encoded using an $[n^{I\mskip-2mu},k^{I\mskip-2mu}]$ initial code has to be transformed into data encoded using an… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: This is an extended version of an IEEE ISIT 2024 paper with the same title

  2. arXiv:2205.06793  [pdf, other

    cs.IT cs.DC cs.NI

    Bandwidth Cost of Code Conversions in the Split Regime

    Authors: Francisco Maturana, K. V. Rashmi

    Abstract: Distributed storage systems must store large amounts of data over long periods of time. To avoid data loss due to device failures, an $[n,k]$ erasure code is used to encode $k$ data symbols into a codeword of $n$ symbols that are stored across different devices. However, device failure rates change throughout the life of the data, and tuning $n$ and $k$ according to these changes has been shown to… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Extended version of paper accepted in the 2022 IEEE International Symposium on Information Theory (ISIT)

  3. arXiv:2103.08191  [pdf, other

    cs.DC

    PACEMAKER: Avoiding HeART attacks in storage clusters with disk-adaptive redundancy

    Authors: Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger

    Abstract: Data redundancy provides resilience in large-scale storage clusters, but imposes significant cost overhead. Substantial space-savings can be realized by tuning redundancy schemes to observed disk failure rates. However, prior design proposals for such tuning are unusable in real-world clusters, because the IO load of transitions between schemes overwhelms the storage infrastructure (termed transit… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Published in USENIX Symposium on Operating Systems Design and Implementation (OSDI) 2020

    ACM Class: B.8.1; C.4; D.4.2; D.4.5

    Journal ref: 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020, (pp. 369-385)

  4. arXiv:2008.12707  [pdf, other

    cs.IT cs.DC cs.NI

    Bandwidth Cost of Code Conversions in Distributed Storage: Fundamental Limits and Optimal Constructions

    Authors: Francisco Maturana, K. V. Rashmi

    Abstract: Erasure codes have become an integral part of distributed storage systems as a tool for providing data reliability and durability under the constant threat of device failures. In such systems, an $[n, k]$ code over a finite field $\mathbb{F}_q$ encodes $k$ message symbols into $n$ codeword symbols from $\mathbb{F}_q$ which are then stored on $n$ different nodes in the system. Recent work has shown… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

  5. arXiv:2006.03042  [pdf, other

    cs.IT cs.DC cs.NI

    Access-optimal Linear MDS Convertible Codes for All Parameters

    Authors: Francisco Maturana, V. S. Chaitanya Mukka, K. V. Rashmi

    Abstract: In large-scale distributed storage systems, erasure codes are used to achieve fault tolerance in the face of node failures. Tuning code parameters to observed failure rates has been shown to significantly reduce storage cost. Such tuning of redundancy requires "code conversion", i.e., a change in code dimension and length on already encoded data. Convertible codes are a new class of codes designed… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: This is an extended version of an IEEE ISIT 2020 paper with the same title

  6. arXiv:1907.13119  [pdf, other

    cs.IT cs.DC cs.NI

    Convertible Codes: Efficient Conversion of Coded Data in Distributed Storage

    Authors: Francisco Maturana, K. V. Rashmi

    Abstract: Large-scale distributed storage systems typically use erasure codes to provide durability of data in the face of failures. A set of $k$ blocks to be stored is encoded using an $[n, k]$ code to generate $n$ blocks that are then stored on different storage nodes. The redundancy configuration is chosen based on the failure rates of storage devices, and is typically kept constant. However, a recent wo… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

  7. arXiv:1707.00827  [pdf, ps, other

    cs.DB

    Document Spanners for Extracting Incomplete Information: Expressiveness and Complexity

    Authors: Francisco Maturana, Cristian Riveros, Domagoj Vrgoč

    Abstract: Rule-based information extraction has lately received a fair amount of attention from the database community, with several languages appearing in the last few years. Although information extraction systems are intended to deal with semistructured data, all language proposals introduced so far are designed to output relations, thus making them incapable of handling incomplete information. To remedy… ▽ More

    Submitted 29 December, 2017; v1 submitted 4 July, 2017; originally announced July 2017.