Skip to main content

Showing 1–28 of 28 results for author: Mishne, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01969  [pdf, other

    cs.LG

    Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

    Authors: Jiancheng Xie, Lou C. Kohler Voinov, Noga Mudrik, Gal Mishne, Adam Charles

    Abstract: Recurrent neural networks (RNNs) are a widely used tool for sequential data analysis, however, they are still often seen as black boxes of computation. Understanding the functional principles of these networks is critical to develo** ideal model architectures and optimization strategies. Previous studies typically only emphasize the network representation post-training, overlooking their evoluti… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2402.14202  [pdf, other

    cs.LG

    Comparing Graph Transformers via Positional Encodings

    Authors: Mitchell Black, Zhengchao Wan, Gal Mishne, Amir Nayyeri, Yusu Wang

    Abstract: The distinguishing power of graph transformers is closely tied to the choice of positional encoding: features used to augment the base transformer with information about the graph. There are two primary types of positional encoding: absolute positional encodings (APEs) and relative positional encodings (RPEs). APEs assign features to each node and are given as input to the transformer. RPEs instea… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: accepted to ICML 2024

  3. arXiv:2402.08811  [pdf, ps, other

    eess.IV cs.LG q-bio.QM

    Deep and shallow data science for multi-scale optical neuroscience

    Authors: Gal Mishne, Adam Charles

    Abstract: Optical imaging of the brain has expanded dramatically in the past two decades. New optics, indicators, and experimental paradigms are now enabling in-vivo imaging from the synaptic to the cortex-wide scales. To match the resulting flood of data across scales, computational methods are continuously being developed to meet the need of extracting biologically relevant information. In this pursuit, c… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 6 pages

  4. arXiv:2402.08105  [pdf, other

    cs.LG stat.ML

    Learning Cartesian Product Graphs with Laplacian Constraints

    Authors: Changhao Shi, Gal Mishne

    Abstract: Graph Laplacian learning, also known as network topology inference, is a problem of great interest to multiple communities. In Gaussian graphical models (GM), graph learning amounts to endowing covariance selection with the Laplacian structure. In graph signal processing (GSP), it is essential to infer the unobserved graph from the outputs of a filtering system. In this paper, we study the problem… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to AISTATS 2024

  5. arXiv:2312.14254  [pdf, other

    cs.LG cs.NE

    Contextual Feature Selection with Conditional Stochastic Gates

    Authors: Ram Dyuthi Sristi, Ofir Lindenbaum, Shira Lifshitz, Maria Lavzin, Jackie Schiller, Gal Mishne, Hadas Benisty

    Abstract: Feature selection is a crucial tool in machine learning and is widely applied across various scientific disciplines. Traditional supervised methods generally identify a universal set of informative features for the entire population. However, feature relevance often varies with context, while the context itself may not directly affect the outcome variable. Here, we propose a novel architecture for… ▽ More

    Submitted 7 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to ICML 2024

  6. arXiv:2312.10295  [pdf, other

    math.OC cs.DM

    On a Generalization of Wasserstein Distance and the Beckmann Problem to Connection Graphs

    Authors: Sawyer Robertson, Dhruv Kohli, Gal Mishne, Alexander Cloninger

    Abstract: The intersection of connection graphs and discrete optimal transport presents a novel paradigm for understanding complex graphs and node interactions. In this paper, we delve into this unexplored territory by focusing on the Beckmann problem within the context of connection graphs. Our study establishes feasibility conditions for the resulting convex optimization problem on connection graphs. Furt… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 19 pages, 6 figures

    MSC Class: 65K10; 05C21; 90C25; 68R10; 05C50

  7. arXiv:2308.09690  [pdf, other

    cs.DM

    Random Walks, Conductance, and Resistance for the Connection Graph Laplacian

    Authors: Alexander Cloninger, Gal Mishne, Andreas Oslandsbotn, Sawyer Jack Robertson, Zhengchao Wan, Yusu Wang

    Abstract: We investigate the concept of effective resistance in connection graphs, expanding its traditional application from undirected graphs. We propose a robust definition of effective resistance in connection graphs by focusing on the duality of Dirichlet-type and Poisson-type problems on connection graphs. Additionally, we delve into random walks, taking into account both node transitions and vector r… ▽ More

    Submitted 20 August, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  8. arXiv:2308.00142  [pdf, other

    cs.LG

    Semi-Supervised Laplacian Learning on Stiefel Manifolds

    Authors: Chester Holtz, Pengwen Chen, Alexander Cloninger, Chung-Kuan Cheng, Gal Mishne

    Abstract: Motivated by the need to address the degeneracy of canonical Laplace learning algorithms in low label rates, we propose to reformulate graph-based semi-supervised learning as a nonconvex generalization of a \emph{Trust-Region Subproblem} (TRS). This reformulation is motivated by the well-posedness of Laplacian eigenvectors in the limit of infinite unlabeled data. To solve this problem, we first sh… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  9. arXiv:2306.08201  [pdf, other

    cs.LG eess.SP

    Graph Laplacian Learning with Exponential Family Noise

    Authors: Changhao Shi, Gal Mishne

    Abstract: Graph signal processing (GSP) is a prominent framework for analyzing signals on non-Euclidean domains. The graph Fourier transform (GFT) uses the combinatorial graph Laplacian matrix to reveal the spectral decomposition of signals in the graph frequency domain. However, a common challenge in applying GSP methods is that in many scenarios the underlying graph of a system is unknown. A solution in s… ▽ More

    Submitted 11 June, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  10. arXiv:2306.04817  [pdf, other

    stat.ML cs.LG

    SiBBlInGS: Similarity-driven Building-Block Inference using Graphs across States

    Authors: Noga Mudrik, Gal Mishne, Adam S. Charles

    Abstract: Time series data across scientific domains are often collected under distinct states (e.g., tasks), wherein latent processes (e.g., biological factors) create complex inter- and intra-state variability. A key approach to capture this complexity is to uncover fundamental interpretable units within the data, Building Blocks (BBs), which modulate their activity and adjust their structure across obser… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2024

    Journal ref: The Forty-first International Conference on Machine Learning. https://openreview.net/forum?id=h8aTi32tul

  11. arXiv:2305.18962  [pdf, other

    cs.LG

    Hyperbolic Diffusion Embedding and Distance for Hierarchical Representation Learning

    Authors: Ya-Wei Eileen Lin, Ronald R. Coifman, Gal Mishne, Ronen Talmon

    Abstract: Finding meaningful representations and distances of hierarchical data is important in many fields. This paper presents a new method for hierarchical data embedding and distance. Our method relies on combining diffusion geometry, a central approach to manifold learning, and hyperbolic geometry. Specifically, using diffusion geometry, we build multi-scale densities on the data, aimed to reveal their… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  12. arXiv:2211.05314  [pdf, other

    cs.LG stat.ML

    DiSC: Differential Spectral Clustering of Features

    Authors: Ram Dyuthi Sristi, Gal Mishne, Ariel Jaffe

    Abstract: Selecting subsets of features that differentiate between two conditions is a key task in a broad range of scientific domains. In many applications, the features of interest form clusters with similar effects on the data at hand. To recover such clusters we develop DiSC, a data-driven approach for detecting groups of features that differentiate between conditions. For each condition, we construct a… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022

  13. arXiv:2211.03329  [pdf, other

    cs.LG

    Implicit Graphon Neural Representation

    Authors: Xinyue Xia, Gal Mishne, Yusu Wang

    Abstract: Graphons are general and powerful models for generating graphs of varying size. In this paper, we propose to directly model graphons using neural networks, obtaining Implicit Graphon Neural Representation (IGNR). Existing work in modeling and reconstructing graphons often approximates a target graphon by a fixed resolution piece-wise constant representation. Our IGNR has the benefit that it can re… ▽ More

    Submitted 30 March, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 3 figures

  14. arXiv:2211.00181  [pdf, other

    cs.LG math.NA

    The Numerical Stability of Hyperbolic Representation Learning

    Authors: Gal Mishne, Zhengchao Wan, Yusu Wang, Sheng Yang

    Abstract: Given the exponential growth of the volume of the ball w.r.t. its radius, the hyperbolic space is capable of embedding trees with arbitrarily small distortion and hence has received wide attention for representing hierarchical datasets. However, this exponential growth property comes at a price of numerical instability such that training hyperbolic learning models will sometimes lead to catastroph… ▽ More

    Submitted 27 June, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

  15. arXiv:2210.11513  [pdf, other

    cs.LG

    Learning Sample Reweighting for Accuracy and Adversarial Robustness

    Authors: Chester Holtz, Tsui-Wei Weng, Gal Mishne

    Abstract: There has been great interest in enhancing the robustness of neural network classifiers to defend against adversarial perturbations through adversarial training, while balancing the trade-off between robust accuracy and standard accuracy. We propose a novel adversarial training framework that learns to reweight the loss associated with individual training samples based on a notion of class-conditi… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  16. arXiv:2210.01760  [pdf, other

    cs.LG stat.ML

    Evaluating Disentanglement in Generative Models Without Knowledge of Latent Factors

    Authors: Chester Holtz, Gal Mishne, Alexander Cloninger

    Abstract: Probabilistic generative models provide a flexible and systematic framework for learning the underlying geometry of data. However, model selection in this setting is challenging, particularly when selecting for ill-defined qualities such as disentanglement or interpretability. In this work, we address this gap by introducing a method for ranking generative models based on the training dynamics exh… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  17. arXiv:2101.11055  [pdf, other

    math.SP cs.LG math.AP stat.ML

    LDLE: Low Distortion Local Eigenmaps

    Authors: Dhruv Kohli, Alexander Cloninger, Gal Mishne

    Abstract: We present Low Distortion Local Eigenmaps (LDLE), a manifold learning technique which constructs a set of low distortion local views of a dataset in lower dimension and registers them to obtain a global embedding. The local views are constructed using the global eigenvectors of the graph Laplacian and are registered using Procrustes analysis. The choice of these eigenvectors may vary across the re… ▽ More

    Submitted 14 October, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: 66 pages, 32 figures, preprint. Rearranged figures

    Journal ref: JMLR (2021) 22: 64

  18. arXiv:2101.09387  [pdf, other

    cs.LG cs.CR cs.CV

    Online Adversarial Purification based on Self-Supervision

    Authors: Changhao Shi, Chester Holtz, Gal Mishne

    Abstract: Deep neural networks are known to be vulnerable to adversarial examples, where a perturbation in the input space leads to an amplified shift in the latent network representation. In this paper, we combine canonical supervised learning with self-supervised representation learning, and present Self-supervised Online Adversarial Purification (SOAP), a novel defense strategy that uses a self-supervise… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted to ICLR 2021

  19. arXiv:2009.04142  [pdf, other

    stat.ML cs.LG math.DS physics.comp-ph

    Kernel-based parameter estimation of dynamical systems with unknown observation functions

    Authors: Ofir Lindenbaum, Amir Sagiv, Gal Mishne, Ronen Talmon

    Abstract: A low-dimensional dynamical system is observed in an experiment as a high-dimensional signal; for example, a video of a chaotic pendulums system. Assuming that we know the dynamical model up to some unknown parameters, can we estimate the underlying system's parameters by measuring its time-evolution only once? The key information for performing this estimation lies in the temporal inter-dependenc… ▽ More

    Submitted 4 April, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

  20. arXiv:2007.00041  [pdf, other

    eess.SP cs.LG stat.ML

    Multi-way Graph Signal Processing on Tensors: Integrative analysis of irregular geometries

    Authors: Jay S. Stanley III, Eric C. Chi, Gal Mishne

    Abstract: Graph signal processing (GSP) is an important methodology for studying data residing on irregular structures. As acquired data is increasingly taking the form of multi-way tensors, new signal processing tools are needed to maximally utilize the multi-way structure within the data. In this paper, we review modern signal processing frameworks generalizing GSP to multi-way data, starting from graph s… ▽ More

    Submitted 27 July, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

    Comments: In review for IEEE Signal Processing Magazine

  21. arXiv:1908.02831  [pdf, other

    cs.LG cs.NE stat.ML

    Visualizing the PHATE of Neural Networks

    Authors: Scott Gigante, Adam S. Charles, Smita Krishnaswamy, Gal Mishne

    Abstract: Understanding why and how certain neural networks outperform others is key to guiding future development of network architectures and optimization methods. To this end, we introduce a novel visualization algorithm that reveals the internal geometry of such networks: Multislice PHATE (M-PHATE), the first method designed explicitly to visualize how a neural network's hidden representations of data e… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Journal ref: Neural Information Processing Systems (2019)

  22. arXiv:1810.10695  [pdf, other

    cs.LG math.SP stat.ML

    Spectral Embedding Norm: Looking Deep into the Spectrum of the Graph Laplacian

    Authors: Xiuyuan Cheng, Gal Mishne

    Abstract: The extraction of clusters from a dataset which includes multiple clusters and a significant background component is a non-trivial task of practical importance. In image analysis this manifests for example in anomaly detection and target detection. The traditional spectral clustering algorithm, which relies on the leading $K$ eigenvectors to detect $K$ clusters, fails in such cases. In this paper… ▽ More

    Submitted 22 August, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

  23. arXiv:1810.06803  [pdf, other

    stat.ML cs.LG stat.ME

    Co-manifold learning with missing data

    Authors: Gal Mishne, Eric C. Chi, Ronald R. Coifman

    Abstract: Representation learning is typically applied to only one mode of a data matrix, either its rows or columns. Yet in many applications, there is an underlying geometry to both the rows and the columns. We propose utilizing this coupled structure to perform co-manifold learning: uncovering the underlying geometry of both the rows and the columns of a given matrix, where we focus on a missing data set… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: 16 pages, 9 figures

  24. arXiv:1711.04712  [pdf, other

    math.CO cs.DM cs.DS math.PR stat.ML

    Randomized Near Neighbor Graphs, Giant Components, and Applications in Data Science

    Authors: George C. Linderman, Gal Mishne, Yuval Kluger, Stefan Steinerberger

    Abstract: If we pick $n$ random points uniformly in $[0,1]^d$ and connect each point to its $k-$nearest neighbors, then it is well known that there exists a giant connected component with high probability. We prove that in $[0,1]^d$ it suffices to connect every point to $ c_{d,1} \log{\log{n}}$ points chosen randomly among its $ c_{d,2} \log{n}-$nearest neighbors to ensure a giant component of size… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

  25. arXiv:1708.05768  [pdf, other

    stat.ML cs.LG q-bio.QM

    Data-Driven Tree Transforms and Metrics

    Authors: Gal Mishne, Ronen Talmon, Israel Cohen, Ronald R. Coifman, Yuval Kluger

    Abstract: We consider the analysis of high dimensional data given in the form of a matrix with columns consisting of observations and rows consisting of features. Often the data is such that the observations do not reside on a regular grid, and the given order of the features is arbitrary and does not convey a notion of locality. Therefore, traditional transforms and metrics cannot be used for data organiza… ▽ More

    Submitted 18 August, 2017; originally announced August 2017.

    Comments: 16 pages, 5 figures. Accepted to IEEE Transactions on Signal and Information Processing over Networks

  26. arXiv:1506.07840  [pdf, other

    stat.ML cs.LG math.CA

    Diffusion Nets

    Authors: Gal Mishne, Uri Shaham, Alexander Cloninger, Israel Cohen

    Abstract: Non-linear manifold learning enables high-dimensional data analysis, but requires out-of-sample-extension methods to process new data points. In this paper, we propose a manifold learning algorithm based on deep learning to create an encoder, which maps a high-dimensional dataset and its low-dimensional embedding, and a decoder, which takes the embedded data back to the high-dimensional space. Sta… ▽ More

    Submitted 25 June, 2015; originally announced June 2015.

    Comments: 24 pages, 12 figures

  27. arXiv:1210.7350  [pdf, other

    cs.IR cs.DB

    Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture

    Authors: Gilad Mishne, Jeff Dalton, Zhenghua Li, Aneesh Sharma, Jimmy Lin

    Abstract: We present the architecture behind Twitter's real-time related query suggestion and spelling correction service. Although these tasks have received much attention in the web search literature, the Twitter context introduces a real-time "twist": after significant breaking news events, we aim to provide relevant results within minutes. This paper provides a case study illustrating the challenges of… ▽ More

    Submitted 27 October, 2012; originally announced October 2012.

  28. arXiv:1205.6855  [pdf, other

    cs.IR cs.SI

    A Study of "Churn" in Tweets and Real-Time Search Queries (Extended Version)

    Authors: Jimmy Lin, Gilad Mishne

    Abstract: The real-time nature of Twitter means that term distributions in tweets and in search queries change rapidly: the most frequent terms in one hour may look very different from those in the next. Informally, we call this phenomenon "churn". Our interest in analyzing churn stems from the perspective of real-time search. Nearly all ranking functions, machine-learned or otherwise, depend on term statis… ▽ More

    Submitted 30 May, 2012; originally announced May 2012.

    Comments: This is an extended version of a similarly-titled paper at the 6th International AAAI Conference on Weblogs and Social Media (ICWSM 2012)