Showing 1–32 of 32 results for author: Hayes, P

  1. arXiv:2405.05132  [pdf, other]

    cs.DC cs.DS

    Low-Distortion Clustering in Bounded Growth Graphs

    Authors: Yi-Jun Chang, Varsha Dani, Thomas P. Hayes

    Abstract: The well-known clustering algorithm of Miller, Peng, and Xu (SPAA 2013) is useful for many applications, including low-diameter decomposition and low-energy distributed algorithms. One nice property of their clustering, shown in previous work by Chang, Dani, Hayes, and Pettie (PODC 2020), is that distances in the cluster graph are rescaled versions of distances in the original graph, up to an…

    Submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2402.12177  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning

    Authors: Mingtian Zhang, Shawn Lan, Peter Hayes, David Barber

    Abstract: Retrieval Augmented Generation (RAG) has emerged as an effective solution for mitigating hallucinations in Large Language Models (LLMs). The retrieval stage in RAG typically involves a pre-trained embedding model, which converts queries and passages into vectors to capture their semantics. However, a standard pre-trained embedding model may exhibit sub-optimal performance when applied to specific…

    Submitted 12 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2402.08114  [pdf, other]

    cs.LG cs.AI cs.CL

    Active Preference Learning for Large Language Models

    Authors: William Muldrew, Peter Hayes, Mingtian Zhang, David Barber

    Abstract: As large language models (LLMs) become more capable, fine-tuning techniques for aligning with human intent are increasingly important. A key consideration for aligning these models is how to most effectively use human resources, or model resources in the case where LLMs themselves are used as oracles. Reinforcement learning from Human or AI preferences (RLHF/RLAIF) is the most prominent example of…

    Submitted 28 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures, 6 tables

  4. arXiv:2307.07727  [pdf, ps, other]

    cs.DM cs.DS

    Optimal Mixing via Tensorization for Random Independent Sets on Arbitrary Trees

    Authors: Charilaos Efthymiou, Thomas P. Hayes, Daniel Stefankovic, Eric Vigoda

    Abstract: We study the mixing time of the single-site update Markov chain, known as the Glauber dynamics, for generating a random independent set of a tree. Our focus is obtaining optimal convergence results for arbitrary trees. We consider the more general problem of sampling from the Gibbs distribution in the hard-core model where independent sets are weighted by a parameter $λ>0$; the special case $λ=1$…

    Submitted 18 February, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: The optimum mixing result (Theorem 1.2) of version 1 of the manuscript has been removed due to an error

  5. arXiv:2209.07396  [pdf, other]

    stat.ML cs.LG

    Towards Healing the Blindness of Score Matching

    Authors: Mingtian Zhang, Oscar Key, Peter Hayes, David Barber, Brooks Paige, François-Xavier Briol

    Abstract: Score-based divergences have been widely used in machine learning and statistics applications. Despite their empirical success, a blindness problem has been observed when using these for multi-modal distributions. In this work, we discuss the blindness problem and propose a new family of divergences that can mitigate the blindness problem. We illustrate our proposed divergence in the context of de…

    Submitted 15 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  6. arXiv:2206.09496  [pdf, other]

    cs.LG

    Integrated Weak Learning

    Authors: Peter Hayes, Mingtian Zhang, Raza Habib, Jordan Burgess, Emine Yilmaz, David Barber

    Abstract: We introduce Integrated Weak Learning, a principled framework that integrates weak supervision into the training process of machine learning models. Our approach jointly trains the end-model and a label model that aggregates multiple sources of weak supervision. We introduce a label model that can learn to aggregate weak supervision sources differently for different datapoints and takes into consi…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 figures

  7. arXiv:2205.12830  [pdf, other]

    cs.DC

    How to Wake Up Your Neighbors: Safe and Nearly Optimal Generic Energy Conservation in Radio Networks

    Authors: Varsha Dani, Thomas P. Hayes

    Abstract: Recent work has shown that it is sometimes feasible to significantly reduce the energy usage of some radio-network algorithms by adaptively powering down the radio receiver when it is not needed. Although past work has focused on modifying specific network algorithms in this way, we now ask the question of whether this problem can be solved in a generic way, treating the algorithm as a kind of bla…

    Submitted 25 May, 2022; originally announced May 2022.

  8. arXiv:2205.11640  [pdf, other]

    stat.ML cs.LG

    Generalization Gap in Amortized Inference

    Authors: Mingtian Zhang, Peter Hayes, David Barber

    Abstract: The ability of likelihood-based probabilistic models to generalize to unseen data is central to many machine learning applications such as lossless compression. In this work, we study the generalization of a popular class of probabilistic model - the Variational Auto-Encoder (VAE). We discuss the two generalization gaps that affect VAEs and show that overfitting is usually dominated by amortized i…

    Submitted 15 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  9. arXiv:2109.12043  [pdf, other]

    cs.LG stat.ML

    Sample Efficient Model Evaluation

    Authors: Emine Yilmaz, Peter Hayes, Raza Habib, Jordan Burgess, David Barber

    Abstract: Labelling data is a major practical bottleneck in training and testing classifiers. Given a collection of unlabelled data points, we address how to select which subset to label to best estimate test metrics such as accuracy, $F_1$ score or micro/macro $F_1$. We consider two sampling based approaches, namely the well-known Importance Sampling and we introduce a novel application of Poisson Sampling…

    Submitted 24 September, 2021; originally announced September 2021.

  10. arXiv:2108.12326  [pdf]

    cs.ET

    CeMux: Maximizing the Accuracy of Stochastic Mux Adders and an Application to Filter Design

    Authors: Timothy J. Baker, John P. Hayes

    Abstract: Stochastic computing (SC) is a low-cost computational paradigm that has promising applications in digital filter design, image processing and neural networks. Fundamental to these applications is the weighted addition operation which is most often implemented by a multiplexer (mux) tree. Mux-based adders have very low area but typically require long bit-streams to reach practical accuracy threshol…

    Submitted 30 August, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    ACM Class: B.2

  11. arXiv:2107.14323  [pdf, other]

    cs.CG cs.SI math.PR physics.soc-ph stat.ML

    Reconstruction of Random Geometric Graphs: Breaking the Omega(r) distortion barrier

    Authors: Varsha Dani, Josep Díaz, Thomas P. Hayes, Cristopher Moore

    Abstract: Embedding graphs in a geographical or latent space, i.e., inferring locations for vertices in Euclidean space or on a smooth manifold or submanifold, is a common task in network analysis, statistical inference, and graph visualization. We consider the classic model of random geometric graphs where $n$ points are scattered uniformly in a square of area $n$, and two points have an edge between them…

    Submitted 17 May, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: v1 on arxiv was titled "Improved Reconstruction of Random Geometric Graphs." An extended abstract with the above title appeared in ICALP 2022. The current version includes the proofs that were omitted from the ICALP version and adds the section "Missing Edges."

    ACM Class: G.2.2; G.3

  12. arXiv:2105.12433  [pdf, other]

    cs.LG

    Estimating the Uncertainty of Neural Network Forecasts for Influenza Prevalence Using Web Search Activity

    Authors: Michael Morris, Peter Hayes, Ingemar J. Cox, Vasileios Lampos

    Abstract: Influenza is an infectious disease with the potential to become a pandemic, and hence, forecasting its prevalence is an important undertaking for planning an effective response. Research has found that web search activity can be used to improve influenza models. Neural networks (NN) can provide state-of-the-art forecasting accuracy but do not commonly incorporate uncertainty in their estimates, so…

    Submitted 26 May, 2021; originally announced May 2021.

  13. arXiv:2104.09096  [pdf, other]

    cs.DS cs.DC

    Wake Up and Join Me! An Energy-Efficient Algorithm for Maximal Matching in Radio Networks

    Authors: Varsha Dani, Aayush Gupta, Thomas P. Hayes, Seth Pettie

    Abstract: We consider networks of small, autonomous devices that communicate with each other wirelessly. Minimizing energy usage is an important consideration in designing algorithms for such networks, as battery life is a crucial and limited resource. Working in a model where both sending and listening for messages deplete energy, we consider the problem of finding a maximal matching of the nodes in a radi…

    Submitted 16 April, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: 14 pages, 2 figures, 3 algorithms

  14. arXiv:2007.09816  [pdf, ps, other]

    cs.DS cs.DC

    The Energy Complexity of BFS in Radio Networks

    Authors: Yi-Jun Chang, Varsha Dani, Thomas P. Hayes, Seth Pettie

    Abstract: We consider a model of energy complexity in Radio Networks in which transmitting or listening on the channel costs one unit of energy and computation is free. This simplified model captures key aspects of battery-powered sensors: that battery life is most influenced by transceiver usage, and that at low transmission powers, the actual cost of transmitting and listening are very similar. The ener…

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: To appear in PODC 2020

  15. arXiv:1909.07059  [pdf, ps, other]

    cs.DM cs.DS

    Improved Strong Spatial Mixing for Colorings on Trees

    Authors: Charilaos Efthymiou, Andreas Galanis, Thomas P. Hayes, Daniel Stefankovic, Eric Vigoda

    Abstract: Strong spatial mixing (SSM) is a form of correlation decay that has played an essential role in the design of approximate counting algorithms for spin systems. A notable example is the algorithm of Weitz (2006) for the hard-core model on weighted independent sets. We study SSM for the $q$-colorings problem on the infinite $(d+1)$-regular tree. Weak spatial mixing (WSM) captures whether the influen…

    Submitted 16 September, 2019; originally announced September 2019.

  16. arXiv:1904.00943  [pdf, ps, other]

    cs.DS cs.LG

    Distributed Metropolis Sampler with Optimal Parallelism

    Authors: Weiming Feng, Thomas P. Hayes, Yitong Yin

    Abstract: The Metropolis-Hastings algorithm is a fundamental Markov chain Monte Carlo (MCMC) method for sampling and inference. With the advent of Big Data, distributed and parallel variants of MCMC methods are attracting increased attention. In this paper, we give a distributed algorithm that can correctly simulate sequential single-site Metropolis chains without any bias in a fully asynchronous message-pa…

    Submitted 14 July, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

  17. arXiv:1811.08968  [pdf, other]

    stat.ML cs.LG

    Spread Divergence

    Authors: Mingtian Zhang, Peter Hayes, Tom Bird, Raza Habib, David Barber

    Abstract: For distributions $\mathbb{P}$ and $\mathbb{Q}$ with different supports or undefined densities, the divergence $\textrm{D}(\mathbb{P}||\mathbb{Q})$ may not exist. We define a Spread Divergence $\tilde{\textrm{D}}(\mathbb{P}||\mathbb{Q})$ on modified $\mathbb{P}$ and $\mathbb{Q}$ and describe sufficient conditions for the existence of such a divergence. We demonstrate how to maximize the discrimina…

    Submitted 4 December, 2022; v1 submitted 21 November, 2018; originally announced November 2018.

    Journal ref: Volume 119: International Conference on Machine Learning, 13-18 July 2020, Virtual

  18. arXiv:1802.06953  [pdf, ps, other]

    cs.DS

    Distributed Symmetry Breaking in Sampling (Optimal Distributed Randomly Coloring with Fewer Colors)

    Authors: Weiming Feng, Thomas P. Hayes, Yitong Yin

    Abstract: We examine the problem of almost-uniform sampling proper $q$-colorings of a graph whose maximum degree is $Δ$. A famous result, discovered independently by Jerrum (1995) and Salas and Sokal (1997), is that, assuming $q > (2+δ) Δ$, the Glauber dynamics (a.k.a. single-site dynamics) for this problem has mixing time $O(n \log n)$, where $n$ is the number of vertices, and thus provides a nearly linear t…

    Submitted 21 June, 2018; v1 submitted 19 February, 2018; originally announced February 2018.

  19. arXiv:1710.01800  [pdf, other]

    cs.DC cs.DS

    The Energy Complexity of Broadcast

    Authors: Yi-Jun Chang, Varsha Dani, Thomas P. Hayes, Qizheng He, Wenzheng Li, Seth Pettie

    Abstract: Energy is often the most constrained resource in networks of battery-powered devices, and as devices become smaller, they spend a larger fraction of their energy on communication (transceiver usage) not computation. As an imperfect proxy for true energy usage, we define energy complexity to be the number of time slots a device transmits/listens; idle time and computation are free. In this paper…

    Submitted 4 October, 2017; originally announced October 2017.

  20. arXiv:1707.03796  [pdf, other]

    cs.DM math.CO

    Sampling Random Colorings of Sparse Random Graphs

    Authors: Charilaos Efthymiou, Thomas P. Hayes, Daniel Stefankovic, Eric Vigoda

    Abstract: We study the mixing properties of the single-site Markov chain known as the Glauber dynamics for sampling $k$-colorings of a sparse random graph $G(n,d/n)$ for constant $d$. The best known rapid mixing results for general graphs are in terms of the maximum degree $Δ$ of the input graph $G$ and hold when $k>11Δ/6$ for all $G$. Improved results hold when $k>αΔ$ for graphs with girth $\geq 5$ and…

    Submitted 12 July, 2017; originally announced July 2017.

  21. arXiv:1706.02344  [pdf]

    cs.AR

    Energy-Efficient Hybrid Stochastic-Binary Neural Networks for Near-Sensor Computing

    Authors: Vincent T. Lee, Armin Alaghi, John P. Hayes, Visvesh Sathe, Luis Ceze

    Abstract: Recent advances in neural networks (NNs) exhibit unprecedented success at transforming large, unstructured data streams into compact higher-level semantic information for tasks such as handwriting recognition, image classification, and speech recognition. Ideally, systems would employ near-sensor computation to execute these tasks at sensor endpoints to maximize data reduction and minimize data mo…

    Submitted 7 June, 2017; originally announced June 2017.

    Comments: 6 pages, 3 figures, Design, Automation and Test in Europe (DATE) 2017

  22. arXiv:1612.05943  [pdf, other]

    cs.CR cs.IT

    Distributed Computing with Channel Noise

    Authors: Abhinav Aggarwal, Varsha Dani, Thomas P. Hayes, Jared Saia

    Abstract: A group of $n$ users want to run a distributed protocol $π$ over a network where communication occurs via private point-to-point channels. Unfortunately, an adversary, who knows $π$, is able to maliciously flip bits on the channels. Can we efficiently simulate $π$ in the presence of such an adversary? We show that this is possible, even when $L$, the number of bits sent in $π$, and $T$, the number…

    Submitted 24 July, 2017; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: 29 pages, 6 figures

  23. arXiv:1609.01582  [pdf, ps, other]

    math.CO cs.DM math.PR

    Codes, Lower Bounds, and Phase Transitions in the Symmetric Rendezvous Problem

    Authors: Varsha Dani, Thomas P. Hayes, Cristopher Moore, Alexander Russell

    Abstract: In the rendezvous problem, two parties with different labelings of the vertices of a complete graph are trying to meet at some vertex at the same time. It is well-known that if the parties have predetermined roles, then the strategy where one of them waits at one vertex, while the other visits all $n$ vertices in random order is optimal, taking at most $n$ steps and averaging about $n/2$. Anderson…

    Submitted 6 September, 2016; originally announced September 2016.

    MSC Class: 60C05

    ACM Class: G.3

  24. arXiv:1605.06170  [pdf, other]

    cs.LG

    Evaluation System for a Bayesian Optimization Service

    Authors: Ian Dewancker, Michael McCourt, Scott Clark, Patrick Hayes, Alexandra Johnson, George Ke

    Abstract: Bayesian optimization is an elegant solution to the hyperparameter optimization problem in machine learning. Building a reliable and robust Bayesian optimization service requires careful testing methodology and sound statistical analysis. In this talk we will outline our development of an evaluation framework to rigorously test and measure the impact of changes to the SigOpt optimization service.…

    Submitted 19 May, 2016; originally announced May 2016.

  25. arXiv:1604.01422  [pdf, ps, other]

    cs.DM math.PR

    Convergence of MCMC and Loopy BP in the Tree Uniqueness Region for the Hard-Core Model

    Authors: Charilaos Efthymiou, Thomas P. Hayes, Daniel Stefankovic, Eric Vigoda, Yitong Yin

    Abstract: We study the hard-core model defined on independent sets of an input graph where the independent sets are weighted by a parameter $λ>0$. For constant $Δ$, previous work of Weitz (2006) established an FPTAS for the partition function for graphs of maximum degree $Δ$ when $λ< λ_c(Δ)$. The threshold $λ_c(Δ)$ is the critical point for the phase transition for uniqueness/non-uniqueness on the infinite…

    Submitted 29 August, 2016; v1 submitted 5 April, 2016; originally announced April 2016.

    ACM Class: G.2.1; F.2.2

  26. arXiv:1603.09441  [pdf, other]

    cs.LG stat.ML

    A Stratified Analysis of Bayesian Optimization Methods

    Authors: Ian Dewancker, Michael McCourt, Scott Clark, Patrick Hayes, Alexandra Johnson, George Ke

    Abstract: Empirical analysis serves as an important complement to theoretical analysis for studying practical Bayesian optimization. Often empirical insights expose strengths and weaknesses inaccessible to theoretical analysis. We define two metrics for comparing the performance of Bayesian optimization methods and propose a ranking mechanism for summarizing performance within various genres or strata of te…

    Submitted 30 March, 2016; originally announced March 2016.

  27. arXiv:1504.06316  [pdf, ps, other]

    cs.DS cs.DC cs.IT cs.NI

    Interactive Communication with Unknown Noise Rate

    Authors: Varsha Dani, Thomas P. Hayes, Mahnush Movahedi, Jared Saia, Maxwell Young

    Abstract: Alice and Bob want to run a protocol over a noisy channel, where a certain number of bits are flipped adversarially. Several results take a protocol requiring $L$ bits of noise-free communication and make it robust over such a channel. In a recent breakthrough result, Haeupler described an algorithm that sends a number of bits that is conjectured to be near optimal in such a model. However, his al…

    Submitted 13 August, 2015; v1 submitted 23 April, 2015; originally announced April 2015.

    Comments: Made substantial improvements to the algorithm and analysis. Previous version had a subtle error involving the adversary's ability to attack fingerprints

  28. arXiv:1407.1930  [pdf, other]

    cs.CC cond-mat.stat-mech

    Lower Bounds on the Critical Density in the Hard Disk Model via Optimized Metrics

    Authors: Thomas P. Hayes, Cristopher Moore

    Abstract: We prove a new lower bound on the critical density $ρ_c$ of the hard disk model, i.e., the density below which it is possible to efficiently sample random configurations of $n$ non-overlapping disks in a unit torus. We use a classic Markov chain which moves one disk at a time, but with an improved path coupling analysis. Our main tool is an optimized metric on neighboring pairs of configurations,…

    Submitted 7 July, 2014; originally announced July 2014.

  29. arXiv:1301.3527  [pdf, other]

    cs.LG math.NA

    Block Coordinate Descent for Sparse NMF

    Authors: Vamsi K. Potluru, Sergey M. Plis, Jonathan Le Roux, Barak A. Pearlmutter, Vince D. Calhoun, Thomas P. Hayes

    Abstract: Nonnegative matrix factorization (NMF) has become a ubiquitous tool for data analysis. An important variant is the sparse NMF problem which arises when we explicitly require the learnt features to be sparse. A natural measure of sparsity is the L$_0$ norm, however its optimization is NP-hard. Mixed norms, such as L$_1$/L$_2$ measure, have been shown to model sparsity robustly, based on intuitive a…

    Submitted 18 March, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

  30. arXiv:1112.0829  [pdf, other]

    math.PR cs.GT

    How Not to Win a Million Dollars: A Counterexample to a Conjecture of L. Breiman

    Authors: Thomas P. Hayes

    Abstract: Consider a gambling game in which we are allowed to repeatedly bet a portion of our bankroll at favorable odds. We investigate the question of how to minimize the expected number of rounds needed to increase our bankroll to a given target amount. Specifically, we disprove a 50-year-old conjecture of L. Breiman, that there exists a threshold strategy that optimizes the expected number of rounds;…

    Submitted 4 December, 2011; originally announced December 2011.

    Comments: 6 pages, 1 figure

  31. arXiv:0705.0017  [pdf, ps, other]

    quant-ph cs.ET

    Checking Equivalence of Quantum Circuits and States

    Authors: George F. Viamontes, Igor L. Markov, John P. Hayes

    Abstract: Quantum computing promises exponential speed-ups for important simulation and optimization problems. It also poses new CAD problems that are similar to, but more challenging than, the related problems in classical (non-quantum) CAD, such as determining if two states or circuits are functionally equivalent. While differences in classical states are easy to detect, quantum states, which are repres…

    Submitted 1 May, 2007; v1 submitted 1 May, 2007; originally announced May 2007.

    Comments: 9 pages, 13 figures, 3 tables

    Journal ref: Proc. Int'l Conf. on Computer-Aided Design (ICCAD), pp. 69-74, San Jose, CA, November 2007.

  32. arXiv:cs/0602053  [pdf, ps, other]

    cs.DS cs.LG

    How to Beat the Adaptive Multi-Armed Bandit

    Authors: Varsha Dani, Thomas P. Hayes

    Abstract: The multi-armed bandit is a concise model for the problem of iterated decision-making under uncertainty. In each round, a gambler must pull one of $K$ arms of a slot machine, without any foreknowledge of their payouts, except that they are uniformly bounded. A standard objective is to minimize the gambler's regret, defined as the gambler's total payout minus the largest payout which would have b…

    Submitted 14 February, 2006; originally announced February 2006.