Skip to main content

Showing 1–8 of 8 results for author: Kourtis, K

.
  1. arXiv:2006.04658  [pdf, other

    cs.DB cs.PF

    Toward a Better Understanding and Evaluation of Tree Structures on Flash SSDs

    Authors: Diego Didona, Nikolas Ioannou, Radu Stoica, Kornilios Kourtis

    Abstract: Solid-state drives (SSDs) are extensively used to deploy persistent data stores, as they provide low latency random access, high write throughput, high data density, and low cost. Tree-based data structures are widely used to build persistent data stores, and indeed they lie at the backbone of many of the data management systems used in production and research today. In this paper, we show that be… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  2. arXiv:2003.04293  [pdf, other

    cs.DC cs.ET

    Compiling Neural Networks for a Computational Memory Accelerator

    Authors: Kornilios Kourtis, Martino Dazzi, Nikolas Ioannou, Tobias Grosser, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Computational memory (CM) is a promising approach for accelerating inference on neural networks (NN) by using enhanced memories that, in addition to storing data, allow computations on them. One of the main challenges of this approach is defining a hardware/software interface that allows a compiler to map NN models for efficient execution on the underlying CM accelerator. This is a non-trivial tas… ▽ More

    Submitted 24 April, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted at SPMA '20

  3. arXiv:2002.11528  [pdf, other

    cs.DC cs.OS

    Safe and Efficient Remote Application Code Execution on Disaggregated NVM Storage with eBPF

    Authors: Kornilios Kourtis, Animesh Trivedi, Nikolas Ioannou

    Abstract: With rapid improvements in NVM storage devices, the performance bottleneck is gradually shifting to the network, thus giving rise to the notion of "data movement wall". To reduce the amount of data movement over the network, researchers have proposed near-data computing by ship** operations and compute-extensions closer to storage devices. However, running arbitrary, user-provided extensions in… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  4. arXiv:1911.08367  [pdf, other

    cs.OS

    Cichlid: Explicit physical memory management for large machines

    Authors: Simon Gerber, Gerd Zellweger, Reto Achermann, Moritz Hoffmann, Kornilios Kourtis, Timothy Roscoe, Dejan Milojicic

    Abstract: In this paper, we rethink how an OS supports virtual memory. Classical VM is an opaque abstraction of RAM, backed by demand paging. However, most systems today (from phones to data-centers) do not page, and indeed may require the performance benefits of non-paged physical memory, precise NUMA allocation, etc. Moreover, MMU hardware is now useful for other purposes, such as detecting page access or… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

  5. arXiv:1909.04885  [pdf, other

    cs.LG cs.DC cs.PF stat.ML

    Addressing Algorithmic Bottlenecks in Elastic Machine Learning with Chicle

    Authors: Michael Kaufmann, Kornilios Kourtis, Celestine Mendler-Dünner, Adrian Schüpbach, Thomas Parnell

    Abstract: Distributed machine learning training is one of the most common and important workloads running on data centers today, but it is rarely executed alone. Instead, to reduce costs, computing resources are consolidated and shared by different applications. In this scenario, elasticity and proper load balancing are vital to maximize efficiency, fairness, and utilization. Currently, most distributed tra… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

  6. arXiv:1811.02322  [pdf, other

    cs.LG cs.HC stat.ML

    Elastic CoCoA: Scaling In to Improve Convergence

    Authors: Michael Kaufmann, Thomas Parnell, Kornilios Kourtis

    Abstract: In this paper we experimentally analyze the convergence behavior of CoCoA and show, that the number of workers required to achieve the highest convergence rate at any point in time, changes over the course of the training. Based on this observation, we build Chicle, an elastic framework that dynamically adjusts the number of workers based on feedback from the training algorithm, in order to select… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  7. arXiv:1811.01564  [pdf, other

    cs.LG stat.ML

    Parallel training of linear models without compromising convergence

    Authors: Nikolas Ioannou, Celestine Dünner, Kornilios Kourtis, Thomas Parnell

    Abstract: In this paper we analyze, evaluate, and improve the performance of training generalized linear models on modern CPUs. We start with a state-of-the-art asynchronous parallel training algorithm, identify system-level performance bottlenecks, and apply optimizations that improve data parallelism, cache line locality, and cache line prefetching of the algorithm. These modifications reduce the per-epoc… ▽ More

    Submitted 19 December, 2018; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Presented at the Workshop on Systems for ML and Open Source Software at NeurIPS 2018

  8. Elevating commodity storage with the SALSA host translation layer

    Authors: Nikolas Ioannou, Kornilios Kourtis, Ioannis Koltsidas

    Abstract: To satisfy increasing storage demands in both capacity and performance, industry has turned to multiple storage technologies, including Flash SSDs and SMR disks. These devices employ a translation layer that conceals the idiosyncrasies of their mediums and enables random access. Device translation layers are, however, inherently constrained: resources on the drive are scarce, they cannot be adapte… ▽ More

    Submitted 10 January, 2019; v1 submitted 17 January, 2018; originally announced January 2018.

    Comments: Presented at 2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)