Skip to main content

Showing 1–10 of 10 results for author: Chockchowwat, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13856  [pdf, other

    cs.DB

    Kishu: Time-Traveling for Computational Notebooks

    Authors: Zhaoheng Li, Supawit Chockchowwat, Ribhav Sahu, Areet Sheth, Yongjoo Park

    Abstract: Computational notebooks (e.g., Jupyter, Google Colab) are widely used by data scientists. A key feature of notebooks is the interactive computing model of iteratively executing cells (i.e., a set of statements) and observing the result (e.g., model or plot). Unfortunately, existing notebook systems do not offer time-traveling to past states: when the user executes a cell, the notebook session stat… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. AirIndex: Versatile Index Tuning Through Data and Storage

    Authors: Supawit Chockchowwat, Wenjie Liu, Yongjoo Park

    Abstract: The end-to-end lookup latency of a hierarchical index -- such as a B-tree or a learned index -- is determined by its structure such as the number of layers, the kinds of branching functions appearing in each layer, the amount of data we must fetch from layers, etc. Our primary observation is that by optimizing those structural parameters (or designs) specifically to a target system's I/O character… ▽ More

    Submitted 1 September, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: 13 pages, 3 appendices, 19 figures, to appear at SIGMOD 2024

  3. arXiv:2305.08770  [pdf, other

    cs.DB cs.LG cs.PL

    Transactional Python for Durable Machine Learning: Vision, Challenges, and Feasibility

    Authors: Supawit Chockchowwat, Zhaoheng Li, Yongjoo Park

    Abstract: In machine learning (ML), Python serves as a convenient abstraction for working with key libraries such as PyTorch, scikit-learn, and others. Unlike DBMS, however, Python applications may lose important data, such as trained models and extracted features, due to machine failures or human errors, leading to a waste of time and resources. Specifically, they lack four essential properties that could… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures, to appear at DEEM 2023

  4. arXiv:2303.04103  [pdf, other

    cs.DB

    A Step Toward Deep Online Aggregation (Extended Version)

    Authors: Nikhil Sheoran, Supawit Chockchowwat, Arav Chheda, Suwen Wang, Riya Verma, Yongjoo Park

    Abstract: For exploratory data analysis, it is often desirable to know what answers you are likely to get before actually obtaining those answers. This can potentially be achieved by designing systems to offer the estimates of a data operation result -- say op(data) -- earlier in the process based on partial data processing. Those estimates continuously refine as more data is processed and finally converge… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 16 pages, 13 figures, 3 appendices, to appear at SIGMOD 2023

  5. arXiv:2208.03823  [pdf, other

    cs.DB

    Automatically Finding Optimal Index Structure

    Authors: Supawit Chockchowwat, Wenjie Liu, Yongjoo Park

    Abstract: Existing learned indexes (e.g., RMI, ALEX, PGM) optimize the internal regressor of each node, not the overall structure such as index height, the size of each layer, etc. In this paper, we share our recent findings that we can achieve significantly faster lookup speed by optimizing the structure as well as internal regressors. Specifically, our approach (called AirIndex) expresses the end-to-end l… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: 5 pages, to be published in AIDB at VLDB 2022

  6. arXiv:2206.12959  [pdf, other

    cs.CV cs.CE cs.LG

    Probabilistic PolarGMM: Unsupervised Cluster Learning of Very Noisy Projection Images of Unknown Pose

    Authors: Supawit Chockchowwat, Chandrajit L. Bajaj

    Abstract: A crucial step in single particle analysis (SPA) of cryogenic electron microscopy (Cryo-EM), 2D classification and alignment takes a collection of noisy particle images to infer orientations and group similar images together. Averaging these aligned and clustered noisy images produces a set of clean images, ready for further analysis such as 3D reconstruction. Fourier-Bessel steerable principal co… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: 13 pages, including appendices

  7. arXiv:2112.13323  [pdf, other

    cs.DB cs.IR

    Airphant: Cloud-oriented Document Indexing

    Authors: Supawit Chockchowwat, Chaitanya Sood, Yongjoo Park

    Abstract: Modern data warehouses can scale compute nodes independently of storage. These systems persist their data on cloud storage, which is always available and cost-efficient. Ad-hoc compute nodes then fetch necessary data on-demand from cloud storage. This ability to quickly scale or shrink data systems is highly beneficial if query workloads may change over time. We apply this new architecture to sear… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 17 pages, to be published in ICDE 2022

  8. arXiv:2106.10408  [pdf, other

    cs.SI

    Step Out of Your Comfort Zone: More Inclusive Content Recommendation for Networked Systems

    Authors: Jiaxin Wu, Supawit Chockchowwat

    Abstract: Networked systems are widely applicable in real-world scenarios such as social networks, infrastructure networks, and biological networks. Among those applications, we are interested in social networks due to their complexity and popularity. One crucial task on the social network is to recommend new content based on special characteristics of the graph structure. In this project, we aim to enhance… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  9. arXiv:1906.01408  [pdf, other

    cs.LG cs.AI stat.ML

    Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning

    Authors: Caleb Chuck, Supawit Chockchowwat, Scott Niekum

    Abstract: Deep reinforcement learning (DRL) is capable of learning high-performing policies on a variety of complex high-dimensional tasks, ranging from video games to robotic manipulation. However, standard DRL methods often suffer from poor sample efficiency, partially because they aim to be entirely problem-agnostic. In this work, we introduce a novel approach to exploration and hierarchical skill learni… ▽ More

    Submitted 3 March, 2020; v1 submitted 27 May, 2019; originally announced June 2019.

    Comments: Submitted to IROS 2020

  10. Functional Generative Design: An Evolutionary Approach to 3D-Printing

    Authors: Cem C. Tutum, Supawit Chockchowwat, Etienne Vouga, Risto Miikkulainen

    Abstract: Consumer-grade printers are widely available, but their ability to print complex objects is limited. Therefore, new designs need to be discovered that serve the same function, but are printable. A representative such problem is to produce a working, reliable mechanical spring. The proposed methodology for discovering solutions to this problem consists of three components: First, an effective searc… ▽ More

    Submitted 19 April, 2018; originally announced April 2018.

    Comments: 8 pages, 12 figures, GECCO'18