Skip to main content

Showing 1–3 of 3 results for author: Talaei, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16755  [pdf, other

    cs.LG cs.AI cs.DB

    CHESS: Contextual Harnessing for Efficient SQL Synthesis

    Authors: Shayan Talaei, Mohammadreza Pourreza, Yu-Chen Chang, Azalia Mirhoseini, Amin Saberi

    Abstract: Utilizing large language models (LLMs) for transforming natural language questions into SQL queries (text-to-SQL) is a promising yet challenging approach, particularly when applied to real-world databases with complex and extensive schemas. In particular, effectively incorporating data catalogs and database values for SQL generation remains an obstacle, leading to suboptimal solutions. We address… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2210.07703  [pdf, other

    cs.LG cs.DC

    Hybrid Decentralized Optimization: First- and Zeroth-Order Optimizers Can Be Jointly Leveraged For Faster Convergence

    Authors: Shayan Talaei, Giorgi Nadiradze, Dan Alistarh

    Abstract: Distributed optimization has become one of the standard ways of speeding up machine learning training, and most of the research in the area focuses on distributed first-order, gradient-based methods. Yet, there are settings where some computationally-bounded nodes may not be able to implement first-order, gradient-based optimization, while they could still contribute to joint optimization tasks. I… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  3. arXiv:2206.10032  [pdf, other

    cs.LG

    Communication-Efficient Federated Learning With Data and Client Heterogeneity

    Authors: Hossein Zakerinia, Shayan Talaei, Giorgi Nadiradze, Dan Alistarh

    Abstract: Federated Learning (FL) enables large-scale distributed training of machine learning models, while still allowing individual nodes to maintain data locally. However, executing FL at scale comes with inherent practical challenges: 1) heterogeneity of the local node data distributions, 2) heterogeneity of node computational speeds (asynchrony), but also 3) constraints in the amount of commun… ▽ More

    Submitted 3 June, 2023; v1 submitted 20 June, 2022; originally announced June 2022.