Skip to main content

Showing 1–11 of 11 results for author: Zeighami, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11318  [pdf, other

    cs.LG cs.CY cs.DB

    BiasBuster: a Neural Approach for Accurate Estimation of Population Statistics using Biased Location Data

    Authors: Sepanta Zeighami, Cyrus Shahabi

    Abstract: While extremely useful (e.g., for COVID-19 forecasting and policy-making, urban mobility analysis and marketing, and obtaining business insights), location data collected from mobile devices often contain data from a biased population subset, with some communities over or underrepresented in the collected datasets. As a result, aggregate statistics calculated from such datasets (as is done by vari… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  2. arXiv:2306.10651  [pdf, other

    cs.DB cs.DS cs.LG

    On Distribution Dependent Sub-Logarithmic Query Time of Learned Indexing

    Authors: Sepanta Zeighami, Cyrus Shahabi

    Abstract: A fundamental problem in data management is to find the elements in an array that match a query. Recently, learned indexes are being extensively used to solve this problem, where they learn a model to predict the location of the items in the array. They are empirically shown to outperform non-learned methods (e.g., B-trees or binary search that answer queries in $O(\log n)$ time) by orders of magn… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Proceedings of 40th International Conference on Machine Learning, ICML 2023

  3. arXiv:2211.10832  [pdf, other

    cs.DB

    NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks

    Authors: Sepanta Zeighami, Cyrus Shahabi, Vatsal Sharan

    Abstract: Range aggregate queries (RAQs) are an integral part of many real-world applications, where, often, fast and approximate answers for the queries are desired. Recent work has studied answering RAQs using machine learning (ML) models, where a model of the data is learned to answer the queries. However, there is no theoretical understanding of why and when the ML based approaches perform well. Further… ▽ More

    Submitted 7 April, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: Conference paper in SIGMOD 2023. arXiv admin note: text overlap with arXiv:2107.04922

  4. arXiv:2208.09744  [pdf, other

    cs.DB cs.CR

    A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy

    Authors: Ritesh Ahuja, Sepanta Zeighami, Gabriel Ghinita, Cyrus Shahabi

    Abstract: Several companies (e.g., Meta, Google) have initiated "data-for-good" projects where aggregate location data are first sanitized and released publicly, which is useful to many applications in transportation, public health (e.g., COVID-19 spread) and urban planning. Differential privacy (DP) is the protection model of choice to ensure the privacy of the individuals who generated the raw location da… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

    Comments: SIGMOD 2023

  5. arXiv:2108.01496  [pdf, other

    cs.DB

    A Neural Database for Differentially Private Spatial Range Queries

    Authors: Sepanta Zeighami, Ritesh Ahuja, Gabriel Ghinita, Cyrus Shahabi

    Abstract: Mobile apps and location-based services generate large amounts of location data that can benefit research on traffic optimization, context-aware notifications and public health (e.g., spread of contagious diseases). To preserve individual privacy, one must first sanitize location data, which is commonly done using the powerful differential privacy (DP) concept. However, existing solutions fall sho… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  6. arXiv:2107.04922  [pdf, other

    cs.DB

    NeuroDB: A Neural Network Framework for Answering Range Aggregate Queries and Beyond

    Authors: Sepanta Zeighami, Cyrus Shahabi

    Abstract: Range aggregate queries (RAQs) are an integral part of many real-world applications, where, often, fast and approximate answers for the queries are desired. Recent work has studied answering RAQs using machine learning models, where a model of the data is learned to answer the queries. However, such modelling choices fail to utilize any query specific information. To capture such information, we o… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

  7. arXiv:2012.07283  [pdf, other

    cs.LG stat.ML

    Towards Accurate Spatiotemporal COVID-19 Risk Scores using High Resolution Real-World Mobility Data

    Authors: Sirisha Rambhatla, Sepanta Zeighami, Kameron Shahabi, Cyrus Shahabi, Yan Liu

    Abstract: As countries look towards re-opening of economic activities amidst the ongoing COVID-19 pandemic, ensuring public health has been challenging. While contact tracing only aims to track past activities of infected users, one path to safe reopening is to develop reliable spatiotemporal risk scores to indicate the propensity of the disease. Existing works which aim to develop risk scores either rely o… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  8. arXiv:2012.06987  [pdf, other

    cs.SI physics.soc-ph

    Estimating Spread of Contact-Based Contagions in a Population Through Sub-Sampling

    Authors: Sepanta Zeighami, Cyrus Shahabi, John Krumm

    Abstract: Physical contacts result in the spread of various phenomena such as viruses, gossips, ideas, packages and marketing pamphlets across a population. The spread depends on how people move and co-locate with each other, or their mobility patterns. How far such phenomena spread has significance for both policy making and personal decision making, e.g., studying the spread of COVID-19 under different in… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

    Journal ref: Proc. VLDB Endow. 14, 9, 1557-1569 (2021)

  9. arXiv:2003.01064  [pdf, ps, other

    cs.DB

    Bridging the Gap Between Theory and Practice on Insertion-Intensive Database

    Authors: Sepanta Zeighami, Raymond Chi-Wing Wong

    Abstract: With the prevalence of online platforms, today, data is being generated and accessed by users at a very high rate. Besides, applications such as stock trading or high frequency trading require guaranteed low delays for performing an operation on a database. It is consequential to design databases that guarantee data insertion and query at a consistently high rate without introducing any long delay… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  10. arXiv:2003.00051  [pdf, ps, other

    cs.DB

    Dynamic Skyline Queries on Encrypted Data Using Result Materialization

    Authors: Sepanta Zeighami, Gabriel Ghinita, Cyrus Shahabi

    Abstract: Skyline computation is an increasingly popular query, with broad applicability in domains such as healthcare, travel and finance. Given the recent trend to outsource databases and query evaluation, and due to the proprietary and sometimes highly sensitivity nature of the data (e.g., in healthcare), it is essential to evaluate skylines on encrypted datasets. Several research efforts acknowledged th… ▽ More

    Submitted 28 February, 2020; originally announced March 2020.

  11. arXiv:1810.08047  [pdf, ps, other

    cs.DB cs.IR cs.LG

    Finding Average Regret Ratio Minimizing Set in Database

    Authors: Sepanta Zeighami, Raymong Chi-Wing Wong

    Abstract: Selecting a certain number of data points (or records) from a database which "best" satisfy users' expectations is a very prevalent problem with many applications. One application is a hotel booking website showing a certain number of hotels on a single page. However, this problem is very challenging since the selected points should "collectively" satisfy the expectation of all users. Showing a ce… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: Submitted to ICDE '19