Showing 1–2 of 2 results for author: Sivek, G

Search v0.5.6 released 2020-02-24

arXiv:2209.11311 [pdf, other]

cs.HC

doi 10.1145/3546737

Spatial model personalization in Gboard

Authors: Gary Sivek, Michael Riley

Abstract: We introduce a framework for adapting a virtual keyboard to individual user behavior by modifying a Gaussian spatial model to use personalized key center offset means and, optionally, learned covariances. Through numerous real-world studies, we determine the importance of training data quantity and weights, as well as the number of clusters into which to group keys to avoid overfitting. While past… ▽ More We introduce a framework for adapting a virtual keyboard to individual user behavior by modifying a Gaussian spatial model to use personalized key center offset means and, optionally, learned covariances. Through numerous real-world studies, we determine the importance of training data quantity and weights, as well as the number of clusters into which to group keys to avoid overfitting. While past research has shown potential of this technique using artificially-simple virtual keyboards and games or fixed ty** prompts, we demonstrate effectiveness using the highly-tuned Gboard app with a representative set of users and their real ty** behaviors. Across a variety of top languages, we achieve small-but-significant improvements in both ty** speed and decoder accuracy. △ Less

Submitted 22 September, 2022; originally announced September 2022.

Comments: 17 pages, to be published in the Proceedings of the 24th International Conference on Mobile Human-Computer Interaction (MobileHCI 2022)
arXiv:1902.00146 [pdf, other]

cs.LG stat.ML

Agnostic Federated Learning

Authors: Mehryar Mohri, Gary Sivek, Ananda Theertha Suresh

Abstract: A key learning scenario in large-scale applications is that of federated learning, where a centralized model is trained based on data originating from a large number of clients. We argue that, with the existing training and inference, federated models can be biased towards different clients. Instead, we propose a new framework of agnostic federated learning, where the centralized model is optimize… ▽ More A key learning scenario in large-scale applications is that of federated learning, where a centralized model is trained based on data originating from a large number of clients. We argue that, with the existing training and inference, federated models can be biased towards different clients. Instead, we propose a new framework of agnostic federated learning, where the centralized model is optimized for any target distribution formed by a mixture of the client distributions. We further show that this framework naturally yields a notion of fairness. We present data-dependent Rademacher complexity guarantees for learning with this objective, which guide the definition of an algorithm for agnostic federated learning. We also give a fast stochastic optimization algorithm for solving the corresponding optimization problem, for which we prove convergence bounds, assuming a convex loss function and hypothesis set. We further empirically demonstrate the benefits of our approach in several datasets. Beyond federated learning, our framework and algorithm can be of interest to other learning scenarios such as cloud computing, domain adaptation, drifting, and other contexts where the training and test distributions do not coincide. △ Less

Submitted 31 January, 2019; originally announced February 2019.

Comments: 30 pages

Search v0.5.6 released 2020-02-24