Showing 1–6 of 6 results for author: Berens, P

Searching in archive stat.
  1. arXiv:2403.14385  [pdf, other]

    stat.ML cs.LG econ.EM stat.ME

    Estimating Causal Effects with Double Machine Learning -- A Method Evaluation

    Authors: Jonathan Fuhr, Philipp Berens, Dominik Papies

    Abstract: The estimation of causal effects with observational data continues to be a very active research area. In recent years, researchers have developed new frameworks which use machine learning to relax classical assumptions necessary for the estimation of causal effects. In this paper, we review one of the most prominent methods - "double/debiased machine learning" (DML) - and empirically evaluate it b…

    Submitted 30 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2206.04841  [pdf, other]

    cs.LG stat.ML

    Hierarchical mixtures of Gaussians for combined dimensionality reduction and clustering

    Authors: Sacha Sokoloski, Philipp Berens

    Abstract: To avoid the curse of dimensionality, a common approach to clustering high-dimensional data is to first project the data into a space of reduced dimension, and then cluster the projected data. Although effective, this two-stage approach prevents joint optimization of the dimensionality-reduction and clustering models, and obscures how well the complete model describes the data. Here, we show how a…

    Submitted 9 June, 2022; originally announced June 2022.

  3. arXiv:2007.08902  [pdf, other]

    cs.LG stat.ML

    Attraction-Repulsion Spectrum in Neighbor Embeddings

    Authors: Jan Niklas Böhm, Philipp Berens, Dmitry Kobak

    Abstract: Neighbor embeddings are a family of methods for visualizing complex high-dimensional datasets using $k$NN graphs. To find the low-dimensional embedding, these algorithms combine an attractive force between neighboring pairs of points with a repulsive force between all points. One of the most popular examples of such algorithms is t-SNE. Here we empirically show that changing the balance between th…

    Submitted 18 October, 2022; v1 submitted 17 July, 2020; originally announced July 2020.

    Journal ref: JMLR 23(95):1-32, 2022

  4. arXiv:2006.10411  [pdf, other]

    cs.LG stat.ML

    Sparse bottleneck neural networks for exploratory non-linear visualization of Patch-seq data

    Authors: Yves Bernaerts, Philipp Berens, Dmitry Kobak

    Abstract: Patch-seq, a recently developed experimental technique, allows neuroscientists to obtain transcriptomic and electrophysiological information from the same neurons. Efficiently analyzing and visualizing such paired multivariate data in order to extract biologically meaningful interpretations has, however, remained a challenge. Here, we use sparse deep neural networks with and without a two-dimensio…

    Submitted 25 January, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 17 pages, 16 figures

  5. Heavy-tailed kernels reveal a finer cluster structure in t-SNE visualisations

    Authors: Dmitry Kobak, George Linderman, Stefan Steinerberger, Yuval Kluger, Philipp Berens

    Abstract: T-distributed stochastic neighbour embedding (t-SNE) is a widely used data visualisation technique. It differs from its predecessor SNE by the low-dimensional similarity kernel: the Gaussian kernel was replaced by the heavy-tailed Cauchy kernel, solving the "crowding problem" of SNE. Here, we develop an efficient implementation of t-SNE for a $t$-distribution kernel with an arbitrary degree of fre…

    Submitted 4 April, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

    Journal ref: ECML PKDD 2019

  6. arXiv:1503.00135  [pdf]

    stat.ML stat.AP

    Supervised learning sets benchmark for robust spike detection from calcium imaging signals

    Authors: Lucas Theis, Philipp Berens, Emmanouil Froudarakis, Jacob Reimer, Miroslav Román Rosón, Tom Baden, Thomas Euler, Andreas Tolias, Matthias Bethge

    Abstract: A fundamental challenge in calcium imaging has been to infer the timing of action potentials from the measured noisy calcium fluorescence traces. We systematically evaluate a range of spike inference algorithms on a large benchmark dataset recorded from varying neural tissue (V1 and retina) using different calcium indicators (OGB-1 and GCamp6). We show that a new algorithm based on supervised lear…

    Submitted 28 February, 2015; originally announced March 2015.