Skip to main content

Showing 1–3 of 3 results for author: Knorr, E

.
  1. arXiv:2401.08858  [pdf, ps, other

    cs.OS

    File System Aging

    Authors: Alex Conway, Ainesh Bakshi, Arghya Bhattacharya, Rory Bennett, Yizheng Jiao, Eric Knorr, Yang Zhan, Michael A. Bender, William Jannen, Rob Johnson, Bradley C. Kuszmaul, Donald E. Porter, Jun Yuan, Martin Farach-Colton

    Abstract: File systems must allocate space for files without knowing what will be added or removed in the future. Over the life of a file system, this may cause suboptimal file placement decisions that eventually lead to slower performance, or aging. Conventional wisdom suggests that file system aging is a solved problem in the common case; heuristics to avoid aging, such as colocating related files and dat… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 36 pages, 12 figures. Article is an extension of Conway et al. FAST 17. (see https://www.usenix.org/conference/fast17/technical-sessions/presentation/conway) and Conway et al. HotStorage 19. (see https://www.usenix.org/conference/hotstorage19/presentation/conway)

    ACM Class: H.3.2; D.4.3; D.4.2; D.4.8; E.1; E.5; H.3.4

  2. arXiv:2207.01503  [pdf, other

    cs.DB cs.DS cs.LG

    Proteus: A Self-Designing Range Filter

    Authors: Eric R. Knorr, Baptiste Lemaire, Andrew Lim, Siqiang Luo, Huanchen Zhang, Stratos Idreos, Michael Mitzenmacher

    Abstract: We introduce Proteus, a novel self-designing approximate range filter, which configures itself based on sampled data in order to optimize its false positive rate (FPR) for a given space requirement. Proteus unifies the probabilistic and deterministic design spaces of state-of-the-art range filters to achieve robust performance across a larger variety of use cases. At the core of Proteus lies our C… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: 14 pages, 9 figures, originally published in the Proceedings of the 2022 International Conference on Management of Data (SIGMOD'22), ISBN: 9781450392495

    ACM Class: F.2.m; H.3.3

    Journal ref: SIGMOD'22: Proceedings of the 2022 International Conference on Management of Data, June 2022, 1670-1684

  3. arXiv:2006.03176  [pdf, other

    cs.DS cs.DB cs.LG

    Partitioned Learned Bloom Filter

    Authors: Kapil Vaidya, Eric Knorr, Tim Kraska, Michael Mitzenmacher

    Abstract: Bloom filters are space-efficient probabilistic data structures that are used to test whether an element is a member of a set, and may return false positives. Recently, variations referred to as learned Bloom filters were developed that can provide improved performance in terms of the rate of false positives, by using a learned model for the represented set. However, previous methods for learned B… ▽ More

    Submitted 4 October, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 13 pages, 3 figures