Skip to main content

Showing 1–8 of 8 results for author: Poppe, O

.
  1. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  2. arXiv:2101.00361  [pdf, other

    cs.DB cs.PF

    To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams

    Authors: Olga Poppe, Chuan Lei, Lei Ma, Allison Rozet, Elke A. Rundensteiner

    Abstract: Complex event processing (CEP) systems continuously evaluate large workloads of pattern queries under tight time constraints. Event trend aggregation queries with Kleene patterns are commonly used to retrieve summarized insights about the recent trends in event streams. State-of-art methods are limited either due to repetitive computations or unnecessary trend construction. Existing shared approac… ▽ More

    Submitted 3 March, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: Technical report for the paper in SIGMOD 2021

  3. arXiv:2010.02989  [pdf, other

    cs.DB cs.PF

    Sharon: Shared Online Event Sequence Aggregation

    Authors: Olga Poppe, Allison Rozet, Chuan Lei, Elke A. Rundensteiner, David Maier

    Abstract: Streaming systems evaluate massive workloads of event sequence aggregation queries. State-of-the-art approaches suffer from long delays caused by not sharing intermediate results of similar queries and by constructing event sequences prior to their aggregation. To overcome these limitations, our Shared Online Event Sequence Aggregation (Sharon) approach shares intermediate aggregates among multipl… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Technical report for the paper in ICDE 2018

  4. arXiv:2010.02988  [pdf, other

    cs.DS cs.DB cs.PF

    GRETA: Graph-based Real-time Event Trend Aggregation

    Authors: Olga Poppe, Chuan Lei, Elke A. Rundensteiner, David Maier

    Abstract: Streaming applications from algorithmic trading to traffic management deploy Kleene patterns to detect and aggregate arbitrarily-long event sequences, called event trends. State-of-the-art systems process such queries in two steps. Namely, they first construct all trends and then aggregate them. Due to the exponential costs of trend construction, this two-step approach suffers from both a long del… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Technical report for the paper in VLDB 2017

  5. arXiv:2010.02987  [pdf, other

    cs.DB cs.PF

    Event Trend Aggregation Under Rich Event Matching Semantics

    Authors: Olga Poppe, Chuan Lei, Elke A. Rundensteiner, David Maier

    Abstract: Streaming applications from health care analytics to algorithmic trading deploy Kleene queries to detect and aggregate event trends. Rich event matching semantics determine how to compose events into trends. The expressive power of state-of-the-art systems remains limited in that they do not support the rich variety of these semantics. Worse yet, they suffer from long delays and high memory costs… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Technical report for the paper in SIGMOD 2019

  6. arXiv:2009.12922  [pdf, other

    cs.DC cs.DB cs.LG cs.PF

    Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation

    Authors: Olga Poppe, Tayo Amuneke, Dalitso Banda, Aritra De, Ari Green, Manon Knoertzer, Ehi Nosakhare, Karthik Rajendran, Deepak Shankargouda, Meina Wang, Alan Au, Carlo Curino, Qun Guo, Alekh **dal, Ajay Kalhan, Morgan Oslake, Sonia Parchani, Vijay Ramani, Raj Sellappan, Saikat Sen, Sheetal Shrotri, Soundararajan Srinivasan, ** Xia, Shize Xu, Alicia Yang , et al. (1 additional authors not shown)

    Abstract: Microsoft Azure is dedicated to guarantee high quality of service to its customers, in particular, during periods of high customer activity, while controlling cost. We employ a Data Science (DS) driven solution to predict user load and leverage these predictions to optimize resource allocation. To this end, we built the Seagull infrastructure that processes per-server telemetry, validates the data… ▽ More

    Submitted 16 October, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Technical report for the paper in VLDB 2021

  7. arXiv:2006.02155  [pdf, other

    cs.DC cs.DB cs.LG cs.PF cs.SE

    MLOS: An Infrastructure for Automated Software Performance Engineering

    Authors: Carlo Curino, Neha Godwal, Brian Kroth, Sergiy Kuryata, Greg Lapinski, Siqi Liu, Slava Oks, Olga Poppe, Adam Smiechowski, Ed Thayer, Markus Weimer, Yiwen Zhu

    Abstract: Develo** modern systems software is a complex task that combines business logic programming and Software Performance Engineering (SPE). The later is an experimental and labor-intensive activity focused on optimizing the system for a given hardware, software, and workload (hw/sw/wl) context. Today's SPE is performed during build/release phases by specialized teams, and cursed by: 1) lack of sta… ▽ More

    Submitted 4 June, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 4 pages, DEEM 2020

  8. arXiv:1909.00084  [pdf, other

    cs.DB cs.DC cs.LG

    Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML

    Authors: Ashvin Agrawal, Rony Chatterjee, Carlo Curino, Avrilia Floratou, Neha Gowdal, Matteo Interlandi, Alekh **dal, Kostantinos Karanasos, Subru Krishnan, Brian Kroth, Jyoti Leeka, Kwanghyun Park, Hiren Patel, Olga Poppe, Fotis Psallidas, Raghu Ramakrishnan, Abhishek Roy, Karla Saur, Rathijit Sen, Markus Weimer, Travis Wright, Yiwen Zhu

    Abstract: Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex… ▽ More

    Submitted 27 December, 2019; v1 submitted 30 August, 2019; originally announced September 2019.