Showing 1–12 of 12 results for author: Shiebler, D

  1. arXiv:2203.09018  [pdf, other]

    cs.LG stat.ML

    Kan Extensions in Data Science and Machine Learning

    Authors: Dan Shiebler

    Abstract: A common problem in data science is "use this function defined over this small set to generate predictions over that larger set." Extrapolation, interpolation, statistical inference and forecasting all reduce to this problem. The Kan extension is a powerful tool in category theory that generalizes this notion. In this work we explore several applications of Kan extensions to data science. We begin…

    Submitted 26 July, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  2. arXiv:2109.10262  [pdf, other]

    math.OC cs.LG stat.ML

    Generalized Optimization: A First Step Towards Category Theoretic Learning Theory

    Authors: Dan Shiebler

    Abstract: The Cartesian reverse derivative is a categorical generalization of reverse-mode automatic differentiation. We use this operator to generalize several optimization algorithms, including a straightforward generalization of gradient descent and a novel generalization of Newton's method. We then explore which properties of these algorithms are preserved in this generalized setting. First, we show tha…

    Submitted 20 September, 2021; originally announced September 2021.

  3. arXiv:2106.07032  [pdf, ps, other]

    cs.LG

    Category Theory in Machine Learning

    Authors: Dan Shiebler, Bruno Gavranović, Paul Wilson

    Abstract: Over the past two decades machine learning has permeated almost every realm of technology. At the same time, many researchers have begun using category theory as a unifying language, facilitating communication between different scientific disciplines. It is therefore unsurprising that there is a burgeoning interest in applying category theory to machine learning. We aim to document the motivations…

    Submitted 13 June, 2021; originally announced June 2021.

  4. arXiv:2105.09293  [pdf, other]

    cs.IR

    Lessons Learned Addressing Dataset Bias in Model-Based Candidate Generation at Twitter

    Authors: Alim Virani, Jay Baxter, Dan Shiebler, Philip Gautier, Shivam Verma, Yan Xia, Apoorv Sharma, Sumit Binnani, Linlin Chen, Chenguang Yu

    Abstract: Traditionally, heuristic methods are used to generate candidates for large scale recommender systems. Model-based candidate generation promises multiple potential advantages, primarily that we can explicitly optimize the same objective as the downstream ranking model. However, large scale model-based candidate generation approaches suffer from dataset bias problems caused by the infeasibility of o…

    Submitted 13 May, 2021; originally announced May 2021.

  5. arXiv:2104.14734  [pdf, other]

    cs.LG cs.AI

    Flattening Multiparameter Hierarchical Clustering Functors

    Authors: Dan Shiebler

    Abstract: We bring together topological data analysis, applied category theory, and machine learning to study multiparameter hierarchical clustering. We begin by introducing a procedure for flattening multiparameter hierarchical clusterings. We demonstrate that this procedure is a functor from a category of multiparameter hierarchical partitions to a category of binary integer programs. We also include empi…

    Submitted 29 April, 2021; originally announced April 2021.

  6. Functorial Manifold Learning

    Authors: Dan Shiebler

    Abstract: We adapt previous research on category theory and topological unsupervised learning to develop a functorial perspective on manifold learning, also known as nonlinear dimensionality reduction. We first characterize manifold learning algorithms as functors that map pseudometric spaces to optimization objectives and that factor through hierarchical clustering functors. We then use this characteriza…

    Submitted 3 November, 2022; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: In Proceedings ACT 2021, arXiv:2211.01102

    Journal ref: EPTCS 372, 2022, pp. 1-13

  7. arXiv:2009.12192  [pdf, other]

    cs.IR cs.CL cs.LG

    Tuning Word2vec for Large Scale Recommendation Systems

    Authors: Benjamin P. Chamberlain, Emanuele Rossi, Dan Shiebler, Suvash Sedhain, Michael M. Bronstein

    Abstract: Word2vec is a powerful machine learning tool that emerged from Natural Language Processing (NLP) and is now applied in multiple domains, including recommender systems, forecasting, and network analysis. As Word2vec is often used off the shelf, we address the question of whether the default hyperparameters are suitable for recommender systems. The answer is emphatically no. In this paper, we first…

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 11 pages, 4 figures, Fourteenth ACM Conference on Recommender Systems

    Journal ref: Fourteenth ACM Conference on Recommender Systems (RecSys '20), September 22–26, 2020, Virtual Event, Brazil

  8. Categorical Stochastic Processes and Likelihood

    Authors: Dan Shiebler

    Abstract: In this work we take a Category Theoretic perspective on the relationship between probabilistic modeling and function approximation. We begin by defining two extensions of function composition to stochastic process subordination: one based on the co-Kleisli category under the comonad (Omega x -) and one based on the parameterization of a category with a Lawvere theory. We show how these extensions…

    Submitted 9 January, 2022; v1 submitted 10 May, 2020; originally announced May 2020.

    Journal ref: Compositionality 3, 1 (2021)

  9. arXiv:2001.02296  [pdf, other]

    cs.FL cs.AI cs.LO

    Incremental Monoidal Grammars

    Authors: Dan Shiebler, Alexis Toumi, Mehrnoosh Sadrzadeh

    Abstract: In this work we define formal grammars in terms of free monoidal categories, along with a functor from the category of formal grammars to the category of automata. Generalising from the Booleans to arbitrary semirings, we extend our construction to weighted formal grammars and weighted automata. This allows us to link the categorical viewpoint on natural language to the standard machine learning n…

    Submitted 10 January, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

  10. arXiv:1809.07703  [pdf, other]

    cs.SI cs.LG stat.ML

    Fighting Redundancy and Model Decay with Embeddings

    Authors: Dan Shiebler, Luca Belli, Jay Baxter, Hanchen Xiong, Abhishek Tayal

    Abstract: Every day, hundreds of millions of new Tweets containing over 40 languages of ever-shifting vernacular flow through Twitter. Models that attempt to extract insight from this firehose of information must face the torrential covariate shift that is endemic to the Twitter platform. While regularly-retrained algorithms can maintain performance in the face of this shift, fixed model features that fail…

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: Presented at the Common Model Infrastructure Workshop at KDD 2018 (link: https://cmi2018.sdsc.edu/)

  11. arXiv:1809.03497  [pdf, other]

    cs.IR cs.LG stat.ML

    A Correlation Maximization Approach for Cross Domain Co-Embeddings

    Authors: Dan Shiebler

    Abstract: Although modern recommendation systems can exploit the structure in users' item feedback, most are powerless in the face of new users who provide no structure for them to exploit. In this paper we introduce ImplicitCE, an algorithm for recommending items to new users during their sign-up flow. ImplicitCE works by transforming users' implicit feedback towards auxiliary domain items into an embeddin…

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: Submitted to AAAI 2019

  12. arXiv:1805.08819  [pdf, other]

    cs.CV

    Learning what and where to attend

    Authors: Drew Linsley, Dan Shiebler, Sven Eberhardt, Thomas Serre

    Abstract: Most recent gains in visual recognition have originated from the inclusion of attention mechanisms in deep convolutional networks (DCNs). Because these networks are optimized for object recognition, they learn where to attend using only a weak form of supervision derived from image class labels. Here, we demonstrate the benefit of using stronger supervisory signals by teaching DCNs to attend to im…

    Submitted 11 June, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: Previously called Global-and-local attention networks for visual recognition. Current version published in ICLR 2019: https://openreview.net/forum?id=BJgLg3R9KQ