-
Improving tracking algorithms with machine learning: a case for line-segment tracking at the High Luminosity LHC
Authors:
Jonathan Guiang,
Slava Krutelyov,
Manos Vourliotis,
Yanxi Gu,
Avi Yagil,
Balaji Venkat Sathia Narayanan,
Matevz Tadel,
Philip Chang,
Mayra Silva,
Gavin Niendorf,
Peter Wittich,
Tres Reid,
Peter Elmer
Abstract:
In this work, we present a study on ways that tracking algorithms can be improved with machine learning (ML). We base this study on the line segment tracking (LST) algorithm that we have designed to be naturally parallelized and vectorized in order to efficiently run on modern processors. LST has been developed specifically for the CMS Experiment at the LHC, towards the High Luminosity LHC (HL-LHC…
▽ More
In this work, we present a study on ways that tracking algorithms can be improved with machine learning (ML). We base this study on the line segment tracking (LST) algorithm that we have designed to be naturally parallelized and vectorized in order to efficiently run on modern processors. LST has been developed specifically for the CMS Experiment at the LHC, towards the High Luminosity LHC (HL-LHC) upgrade. Moreover, we have already shown excellent efficiency and performance results as we iteratively improve LST, leveraging a full simulation of the CMS detector. At the same time, promising deep-learning-based tracking algorithms, such as Graph Neural Networks (GNNs), are being pioneered on the simplified TrackML dataset. These results suggest that parts of LST could be improved or replaced by ML. Thus, a thorough, step-by-step investigation of exactly how and where ML can be utilized, while still meeting realistic HL-LHC performance and efficiency constraints, is implemented as follows. First, a lightweight neural network is used to replace and improve upon explicitly defined track quality selections. This neural network is shown to be highly efficient and robust to displaced tracks while having little-to-no impact on the runtime of LST. These results clearly establish that ML can be used to improve LST without penalty. Next, exploratory studies of GNN track-building algorithms are described. In particular, low-level track objects from LST are considered as nodes in a graph, where edges represent higher-level objects or even entire track candidates. Then, an edge-classifier GNN is trained, and the efficiency of the resultant edge scores is compared with that of the existing LST track quality selections. These GNN studies provide insights into the practicality and performance of using more ambitious and complex ML algorithms for HL-LHC tracking at the CMS Experiment.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Human-AI Co-Creation of Worked Examples for Programming Classes
Authors:
Mohammad Hassany,
Peter Brusilovsky,
Jiaze Ke,
Kamil Akhuseyinoglu,
Arun Balajiee Lekshmi Narayanan
Abstract:
Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel…
▽ More
Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarely have time to provide line-by-line explanations for a large number of examples typically used in a programming class. In this paper, we explore and assess a human-AI collaboration approach to authoring worked examples for Java programming. We introduce an authoring system for creating Java worked examples that generates a starting version of code explanations and presents it to the instructor to edit if necessary.We also present a study that assesses the quality of explanations created with this approach
△ Less
Submitted 29 February, 2024; v1 submitted 25 February, 2024;
originally announced February 2024.
-
Colouring random subgraphs
Authors:
Boris Bukh,
Michael Krivelevich,
Bhargav Narayanan
Abstract:
We study several basic problems about colouring the $p$-random subgraph $G_p$ of an arbitrary graph $G$, focusing primarily on the chromatic number and colouring number of $G_p$. In particular, we show that there exist infinitely many $k$-regular graphs $G$ for which the colouring number (i.e., degeneracy) of $G_{1/2}$ is at most $k/3 + o(k)$ with high probability, thus disproving the natural pred…
▽ More
We study several basic problems about colouring the $p$-random subgraph $G_p$ of an arbitrary graph $G$, focusing primarily on the chromatic number and colouring number of $G_p$. In particular, we show that there exist infinitely many $k$-regular graphs $G$ for which the colouring number (i.e., degeneracy) of $G_{1/2}$ is at most $k/3 + o(k)$ with high probability, thus disproving the natural prediction that such random graphs must have colouring number at least $k/2 - o(k)$.
△ Less
Submitted 6 May, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Clique Supersaturation
Authors:
Quentin Dubroff,
Benjamin Gunby,
Bhargav Narayanan,
Sam Spiro
Abstract:
We study how many copies of a graph $F$ that another graph $G$ with a given number of cliques is guaranteed to have. For example, one of our main results states that for all $t\ge 2$, if $G$ is an $n$ vertex graph with $kn^{3/2}$ triangles and $k$ is sufficiently large in terms of $t$, then $G$ contains at least
\[Ω(\min\{k^t n^{3/2},k^{\frac{2t^2}{3t-1}}n^{\frac{5t-2}{3t-1}}\})\]
copies of…
▽ More
We study how many copies of a graph $F$ that another graph $G$ with a given number of cliques is guaranteed to have. For example, one of our main results states that for all $t\ge 2$, if $G$ is an $n$ vertex graph with $kn^{3/2}$ triangles and $k$ is sufficiently large in terms of $t$, then $G$ contains at least
\[Ω(\min\{k^t n^{3/2},k^{\frac{2t^2}{3t-1}}n^{\frac{5t-2}{3t-1}}\})\]
copies of $K_{2,t}$, and furthermore, we show these bounds are essentially best-possible provided either $k\ge n^{1/2t}$ or if certain bipartite-analogues of well known conjectures for Turán numbers hold.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Authoring Worked Examples for Java Programming with Human-AI Collaboration
Authors:
Mohammad Hassany,
Peter Brusilovsky,
Jiaze Ke,
Kamil Akhuseyinoglu,
Arun Balajiee Lekshmi Narayanan
Abstract:
Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel…
▽ More
Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarely have time to provide line-by-line explanations for a large number of examples typically used in a programming class. In this paper, we explore and assess a human-AI collaboration approach to authoring worked examples for Java programming. We introduce an authoring system for creating Java worked examples that generates a starting version of code explanations and presents it to the instructor to edit if necessary. We also present a study that assesses the quality of explanations created with this approach.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Enhancing Programming eTextbooks with ChatGPT Generated Counterfactual-Thinking-Inspired Questions
Authors:
Arun Balajiee Lekshmi Narayanan,
Rully Agus Hendrawan,
Venktesh V
Abstract:
Digital textbooks have become an integral part of everyday learning tasks. In this work, we consider the use of digital textbooks for programming classes. Generally, students struggle with utilizing textbooks on programming to the maximum, with a possible reason being that the example programs provided as illustration of concepts in these textbooks don't offer sufficient interactivity for students…
▽ More
Digital textbooks have become an integral part of everyday learning tasks. In this work, we consider the use of digital textbooks for programming classes. Generally, students struggle with utilizing textbooks on programming to the maximum, with a possible reason being that the example programs provided as illustration of concepts in these textbooks don't offer sufficient interactivity for students, and thereby not sufficiently motivating to explore or understand these programming examples better. In our work, we explore the idea of enhancing the navigability of intelligent textbooks with the use of ``counterfactual'' questions, to make students think critically about these programs and enhance possible program comprehension. Inspired from previous works on nudging students on counter factual thinking, we present the possibility to enhance digital textbooks with questions generated using GPT.
△ Less
Submitted 6 June, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
GenQ: Automated Question Generation to Support Caregivers While Reading Stories with Children
Authors:
Arun Balajiee Lekshmi Narayanan,
Ligia E. Gomez,
Martha Michelle Soto Fernandez,
Tri Nguyen,
Chris Blais,
M. Adelaida Restrepo,
Art Glenberg
Abstract:
When caregivers ask open--ended questions to motivate dialogue with children, it facilitates the child's reading comprehension skills.Although there is scope for use of technological tools, referred here as "intelligent tutoring systems", to scaffold this process, it is currently unclear whether existing intelligent systems that generate human--language like questions is beneficial. Additionally,…
▽ More
When caregivers ask open--ended questions to motivate dialogue with children, it facilitates the child's reading comprehension skills.Although there is scope for use of technological tools, referred here as "intelligent tutoring systems", to scaffold this process, it is currently unclear whether existing intelligent systems that generate human--language like questions is beneficial. Additionally, training data used in the development of these automated question generation systems is typically sourced without attention to demographics, but people with different cultural backgrounds may ask different questions. As a part of a broader project to design an intelligent reading support app for Latinx children, we crowdsourced questions from Latinx caregivers and noncaregivers as well as caregivers and noncaregivers from other demographics. We examine variations in question--asking within this dataset mediated by individual, cultural, and contextual factors. We then design a system that automatically extracts templates from this data to generate open--ended questions that are representative of those asked by Latinx caregivers.
△ Less
Submitted 25 September, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Antichain Codes
Authors:
Benjamin Gunby,
Xiaoyu He,
Bhargav Narayanan,
Sam Spiro
Abstract:
A family of sets $A$ is said to be an antichain if $x\not\subset y$ for all distinct $x,y\in A$, and it is said to be a distance-$r$ code if every pair of distinct elements of $A$ has Hamming distance at least $r$. Here, we prove that if $A\subset 2^{[n]}$ is both an antichain and a distance-$(2r+1)$ code, then $|A| = O_r(2^n n^{-r-1/2})$. This result, which is best-possible up to the implied cons…
▽ More
A family of sets $A$ is said to be an antichain if $x\not\subset y$ for all distinct $x,y\in A$, and it is said to be a distance-$r$ code if every pair of distinct elements of $A$ has Hamming distance at least $r$. Here, we prove that if $A\subset 2^{[n]}$ is both an antichain and a distance-$(2r+1)$ code, then $|A| = O_r(2^n n^{-r-1/2})$. This result, which is best-possible up to the implied constant, is a purely combinatorial strengthening of a number of results in Littlewood--Offord theory; for example, our result gives a short combinatorial proof of Hálasz's theorem, while all previously known proofs of this result are Fourier-analytic.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Reconstructing random pictures
Authors:
Bhargav Narayanan,
Corrine Yap
Abstract:
Given a random binary picture $P_n$ of size $n$, i.e., an $n\times n$ grid filled with zeros and ones uniformly at random, when is it possible to reconstruct $P_n$ from its $k$-deck, i.e., the multiset of all its $k\times k$ subgrids? We demonstrate ``two-point concentration'' for the reconstruction threshold by showing that there is an integer $k_c(n) \sim (2 \log n)^{1/2}$ such that if…
▽ More
Given a random binary picture $P_n$ of size $n$, i.e., an $n\times n$ grid filled with zeros and ones uniformly at random, when is it possible to reconstruct $P_n$ from its $k$-deck, i.e., the multiset of all its $k\times k$ subgrids? We demonstrate ``two-point concentration'' for the reconstruction threshold by showing that there is an integer $k_c(n) \sim (2 \log n)^{1/2}$ such that if $k > k_c$, then $P_n$ is reconstructible from its $k$-deck with high probability, and if $k < k_c$, then with high probability, it is impossible to reconstruct $P_n$ from its $k$-deck. The proof of this result uses a combination of interface-exploration arguments and entropic arguments.
△ Less
Submitted 21 February, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.
-
A Counterexample to a Directed KKL Inequality
Authors:
Quentin Dubroff,
Shivam Nadimpalli,
Bhargav Narayanan
Abstract:
We show that the natural directed analogues of the KKL theorem [KKL88] and the Eldan--Gross inequality [EG20] from the analysis of Boolean functions fail to hold. This is in contrast to several other isoperimetric inequalities on the Boolean hypercube (such as the Poincare inequality, Margulis's inequality [Mar74] and Talagrand's inequality [Tal93]) for which directed strengthenings have recently…
▽ More
We show that the natural directed analogues of the KKL theorem [KKL88] and the Eldan--Gross inequality [EG20] from the analysis of Boolean functions fail to hold. This is in contrast to several other isoperimetric inequalities on the Boolean hypercube (such as the Poincare inequality, Margulis's inequality [Mar74] and Talagrand's inequality [Tal93]) for which directed strengthenings have recently been established.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
Segment Linking: A Highly Parallelizable Track Reconstruction Algorithm for HL-LHC
Authors:
Philip Chang,
Peter Elmer,
Yanxi Gu,
Vyacheslav Krutelyov,
Gavin Niendorf,
Michael Reid,
Balaji Venkat Sathia Narayanan,
Matevž Tadel,
Emmanouil Vourliotis,
Bei Wang,
Peter Wittich,
Avraham Yagil
Abstract:
The High Luminosity upgrade of the Large Hadron Collider (HL-LHC) will produce particle collisions with up to 200 simultaneous proton-proton interactions. These unprecedented conditions will create a combinatorial complexity for charged-particle track reconstruction that demands a computational cost that is expected to surpass the projected computing budget using conventional CPUs. Motivated by th…
▽ More
The High Luminosity upgrade of the Large Hadron Collider (HL-LHC) will produce particle collisions with up to 200 simultaneous proton-proton interactions. These unprecedented conditions will create a combinatorial complexity for charged-particle track reconstruction that demands a computational cost that is expected to surpass the projected computing budget using conventional CPUs. Motivated by this and taking into account the prevalence of heterogeneous computing in cutting-edge High Performance Computing centers, we propose an efficient, fast and highly parallelizable bottom-up approach to track reconstruction for the HL-LHC, along with an associated implementation on GPUs, in the context of the Phase 2 CMS outer tracker. Our algorithm, called Segment Linking (or Line Segment Tracking), takes advantage of localized track stub creation, combining individual stubs to progressively form higher level objects that are subject to kinematical and geometrical requirements compatible with genuine physics tracks. The local nature of the algorithm makes it ideal for parallelization under the Single Instruction, Multiple Data paradigm, as hundreds of objects can be built simultaneously. The computing and physics performance of the algorithm has been tested on an NVIDIA Tesla V100 GPU, already yielding efficiency and timing measurements that are on par with the latest, multi-CPU versions of existing CMS tracking algorithms.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Line Segment Tracking in the HL-LHC
Authors:
Gavin Niendorf,
Tres Reid,
Peter Wittich,
Peter Elmer,
Bei Wang,
Philip Chang,
Yanxi Gu,
Vyacheslav Krutelyov,
Balaji Venkat Sathia Narayanan,
Matevz Tadel,
Emmanouil Vourliotis,
Avi Yagil
Abstract:
The major challenge posed by the high instantaneous luminosity in the High Luminosity LHC (HL-LHC) motivates efficient and fast reconstruction of charged particle tracks in a high pile-up environment. While there have been efforts to use modern techniques like vectorization to improve the existing classic Kalman Filter based reconstruction algorithms, Line Segment Tracking takes a fundamentally di…
▽ More
The major challenge posed by the high instantaneous luminosity in the High Luminosity LHC (HL-LHC) motivates efficient and fast reconstruction of charged particle tracks in a high pile-up environment. While there have been efforts to use modern techniques like vectorization to improve the existing classic Kalman Filter based reconstruction algorithms, Line Segment Tracking takes a fundamentally different approach by doing a bottom-up reconstruction of tracks. Small track stubs from adjoining detector regions are constructed, and then these track stubs that are consistent with typical track trajectories are successively linked. Since the production of these track stubs is localized, they can be made in parallel, which lends way into using architectures like GPUs and multi-CPUs to take advantage of the parallelism. The algorithm is implemented in the context of the CMS Phase-2 Tracker and runs on NVIDIA Tesla V100 GPUs. Good physics and timing performance has been obtained, and step** stones for the future are elaborated.
△ Less
Submitted 28 September, 2022; v1 submitted 17 July, 2022;
originally announced July 2022.
-
Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation
Authors:
Paul Albert,
Mohamed Saadeldin,
Badri Narayanan,
Brian Mac Namee,
Deirdre Hennessy,
Aisling H. O'Connor,
Noel E. O'Connor,
Kevin McGuinness
Abstract:
Sward species composition estimation is a tedious one. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p…
▽ More
Sward species composition estimation is a tedious one. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of pasture alone. Deep learning approaches have, however, struggled to generalize to distant geographical locations and necessitated further data collection to retrain and perform optimally in different climates. In this work, we enhance the deep learning solution by reducing the need for ground-truthed (GT) images when training the neural network. We demonstrate how unsupervised contrastive learning can be used in the sward composition prediction problem and compare with the state-of-the-art on the publicly available GrassClover dataset collected in Denmark as well as a more recent dataset from Ireland where we tackle herbage mass and height estimation.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation
Authors:
Paul Albert,
Mohamed Saadeldin,
Badri Narayanan,
Jaime Fernandez,
Brian Mac Namee,
Deirdre Hennessey,
Noel E. O'Connor,
Kevin McGuinness
Abstract:
Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of…
▽ More
Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of over-fertilization on biodiversity and the environment. In this context, deep learning algorithms offer a tempting alternative to the usual means of sward composition estimation, which involves the destructive process of cutting a sample from the herbage field and sorting by hand all plant species in the herbage. The process is labour intensive and time consuming and so not utilised by farmers. Deep learning has been successfully applied in this context on images collected by high-resolution cameras on the ground. Moving the deep learning solution to drone imaging, however, has the potential to further improve the herbage mass yield and composition estimation task by extending the ground-level estimation to the large surfaces occupied by fields/paddocks. Drone images come at the cost of lower resolution views of the fields taken from a high altitude and requires further herbage ground-truth collection from the large surfaces covered by drone images. This paper proposes to transfer knowledge learned on ground-level images to raw drone images in an unsupervised manner. To do so, we use unpaired image style translation to enhance the resolution of drone images by a factor of eight and modify them to appear closer to their ground-level counterparts. We then ... ~\url{www.github.com/PaulAlbert31/Clover_SSL}.
△ Less
Submitted 18 April, 2022;
originally announced April 2022.
-
Down-set thresholds
Authors:
Benjamin Gunby,
Xiaoyu He,
Bhargav Narayanan
Abstract:
We elucidate the relationship between the threshold and the expectation-threshold of a down-set. Qualitatively, our main result demonstrates that there exist down-sets with polynomial gaps between their thresholds and expectation-thresholds; in particular, the logarithmic gap predictions of Kahn--Kalai and Talagrand (recently proved by Park--Pham and Frankston--Kahn--Narayanan--Park) about up-sets…
▽ More
We elucidate the relationship between the threshold and the expectation-threshold of a down-set. Qualitatively, our main result demonstrates that there exist down-sets with polynomial gaps between their thresholds and expectation-thresholds; in particular, the logarithmic gap predictions of Kahn--Kalai and Talagrand (recently proved by Park--Pham and Frankston--Kahn--Narayanan--Park) about up-sets do not apply to down-sets. Quantitatively, we show that any collection $\mathcal{G}$ of graphs on $[n]$ that covers the family of all triangle-free graphs on $[n]$ satisfies the inequality $\sum_{G \in \mathcal{G}} \exp(-δe(G^c) / \sqrt{n}) < 1/2$ for some universal $δ> 0$, and this is essentially best-possible.
△ Less
Submitted 2 February, 2023; v1 submitted 15 December, 2021;
originally announced December 2021.
-
Applications of Random Algebraic Constructions to Hardness of Approximation
Authors:
Boris Bukh,
Karthik C. S.,
Bhargav Narayanan
Abstract:
In this paper, we show how one may (efficiently) construct two types of extremal combinatorial objects whose existence was previously conjectural.
(*) Panchromatic Graphs: For fixed integer k, a k-panchromatic graph is, roughly speaking, a balanced bipartite graph with one partition class equipartitioned into k colour classes in which the common neighbourhoods of panchromatic k-sets of vertices…
▽ More
In this paper, we show how one may (efficiently) construct two types of extremal combinatorial objects whose existence was previously conjectural.
(*) Panchromatic Graphs: For fixed integer k, a k-panchromatic graph is, roughly speaking, a balanced bipartite graph with one partition class equipartitioned into k colour classes in which the common neighbourhoods of panchromatic k-sets of vertices are much larger than those of k-sets that repeat a colour. The question of their existence was raised by Karthik and Manurangsi [Combinatorica 2020].
(*) Threshold Graphs: For fixed integer k, a k-threshold graph is, roughly speaking, a balanced bipartite graph in which the common neighbourhoods of k-sets of vertices on one side are much larger than those of (k+1)-sets. The question of their existence was raised by Lin [JACM 2018].
As applications of our constructions, we show the following conditional time lower bounds on the parameterized set intersection problem where, given a collection of n sets over universe [n] and a parameter k, the goal is to find k sets with the largest intersection.
(*) Assuming ETH, for any computable function F, no $n^{o(k)}$-time algorithm can approximate the parameterized set intersection problem up to factor F(k). This improves considerably on the previously best-known result under ETH due to Lin [JACM 2018], who ruled out any $n^{o(\sqrt{k})}$ time approximation algorithm for this problem.
(*) Assuming SETH, for every $\varepsilon>0$ and any computable function F, no $n^{k-\varepsilon}$-time algorithm can approximate the parameterized set intersection problem up to factor F(k). No result of comparable strength was previously known under SETH, even for solving this problem exactly.
△ Less
Submitted 9 November, 2021;
originally announced November 2021.
-
Semi-supervised dry herbage mass estimation using automatic data and synthetic images
Authors:
Paul Albert,
Mohamed Saadeldin,
Badri Narayanan,
Brian Mac Namee,
Deirdre Hennessy,
Aisling O'Connor,
Noel O'Connor,
Kevin McGuinness
Abstract:
Monitoring species-specific dry herbage biomass is an important aspect of pasture-based milk production systems. Being aware of the herbage biomass in the field enables farmers to manage surpluses and deficits in herbage supply, as well as using targeted nitrogen fertilization when necessary. Deep learning for computer vision is a powerful tool in this context as it can accurately estimate the dry…
▽ More
Monitoring species-specific dry herbage biomass is an important aspect of pasture-based milk production systems. Being aware of the herbage biomass in the field enables farmers to manage surpluses and deficits in herbage supply, as well as using targeted nitrogen fertilization when necessary. Deep learning for computer vision is a powerful tool in this context as it can accurately estimate the dry biomass of a herbage parcel using images of the grass canopy taken using a portable device. However, the performance of deep learning comes at the cost of an extensive, and in this case destructive, data gathering process. Since accurate species-specific biomass estimation is labor intensive and destructive for the herbage parcel, we propose in this paper to study low supervision approaches to dry biomass estimation using computer vision. Our contributions include: a synthetic data generation algorithm to generate data for a herbage height aware semantic segmentation task, an automatic process to label data using semantic segmentation maps, and a robust regression network trained to predict dry biomass using approximate biomass labels and a small trusted dataset with gold standard labels. We design our approach on a herbage mass estimation dataset collected in Ireland and also report state-of-the-art results on the publicly released Grass-Clover biomass estimation dataset from Denmark. Our code is available at https://git.io/J0L2a
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
Friendly bisections of random graphs
Authors:
Asaf Ferber,
Matthew Kwan,
Bhargav Narayanan,
Ashwin Sah,
Mehtaab Sawhney
Abstract:
Resolving a conjecture of Füredi from 1988, we prove that with high probability, the random graph $G(n,1/2)$ admits a friendly bisection of its vertex set, i.e., a partition of its vertex set into two parts whose sizes differ by at most one in which $n-o(n)$ vertices have at least as many neighbours in their own part as across. The engine of our proof is a new method to study stochastic processes…
▽ More
Resolving a conjecture of Füredi from 1988, we prove that with high probability, the random graph $G(n,1/2)$ admits a friendly bisection of its vertex set, i.e., a partition of its vertex set into two parts whose sizes differ by at most one in which $n-o(n)$ vertices have at least as many neighbours in their own part as across. The engine of our proof is a new method to study stochastic processes driven by degree information in random graphs; this involves combining enumeration techniques with an abstract second moment argument.
△ Less
Submitted 7 June, 2021; v1 submitted 27 May, 2021;
originally announced May 2021.
-
Sharp estimates for spanning trees
Authors:
Steven Klee,
Bhargav Narayanan,
Lisa Sauermann
Abstract:
We prove the following sharp estimate for the number of spanning trees of a graph in terms of its vertex-degrees: a simple graph $G$ on $n$ vertices has at most $(1/n^{2}) \prod_{v \in V(G)} (d(v)+1)$ spanning trees. This result is tight (for complete graphs), and improves earlier estimates of Alon from 1990 and Kostochka from 1995 by a factor of about $1/n$ (for dense graphs). We additionally sho…
▽ More
We prove the following sharp estimate for the number of spanning trees of a graph in terms of its vertex-degrees: a simple graph $G$ on $n$ vertices has at most $(1/n^{2}) \prod_{v \in V(G)} (d(v)+1)$ spanning trees. This result is tight (for complete graphs), and improves earlier estimates of Alon from 1990 and Kostochka from 1995 by a factor of about $1/n$ (for dense graphs). We additionally show that an analogous bound holds for the weighted spanning tree enumerator of a (nonnegatively) weighted graph as well.
△ Less
Submitted 12 April, 2022; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Extracting Pasture Phenotype and Biomass Percentages using Weakly Supervised Multi-target Deep Learning on a Small Dataset
Authors:
Badri Narayanan,
Mohamed Saadeldin,
Paul Albert,
Kevin McGuinness,
Brian Mac Namee
Abstract:
The dairy industry uses clover and grass as fodder for cows. Accurate estimation of grass and clover biomass yield enables smart decisions in optimizing fertilization and seeding density, resulting in increased productivity and positive environmental impact. Grass and clover are usually planted together, since clover is a nitrogen-fixing plant that brings nutrients to the soil. Adjusting the right…
▽ More
The dairy industry uses clover and grass as fodder for cows. Accurate estimation of grass and clover biomass yield enables smart decisions in optimizing fertilization and seeding density, resulting in increased productivity and positive environmental impact. Grass and clover are usually planted together, since clover is a nitrogen-fixing plant that brings nutrients to the soil. Adjusting the right percentages of clover and grass in a field reduces the need for external fertilization. Existing approaches for estimating the grass-clover composition of a field are expensive and time consuming - random samples of the pasture are clipped and then the components are physically separated to weigh and calculate percentages of dry grass, clover and weeds in each sample. There is growing interest in develo** novel deep learning based approaches to non-destructively extract pasture phenotype indicators and biomass yield predictions of different plant species from agricultural imagery collected from the field. Providing these indicators and predictions from images alone remains a significant challenge. Heavy occlusions in the dense mixture of grass, clover and weeds make it difficult to estimate each component accurately. Moreover, although supervised deep learning models perform well with large datasets, it is tedious to acquire large and diverse collections of field images with precise ground truth for different biomass yields. In this paper, we demonstrate that applying data augmentation and transfer learning is effective in predicting multi-target biomass percentages of different plant species, even with a small training dataset. The scheme proposed in this paper used a training set of only 261 images and provided predictions of biomass percentages of grass, clover, white clover, red clover, and weeds with mean absolute error of 6.77%, 6.92%, 6.21%, 6.89%, and 4.80% respectively.
△ Less
Submitted 8 January, 2021;
originally announced January 2021.
-
Simplicial homeomorphs and trace-bounded hypergraphs
Authors:
Jason Long,
Bhargav Narayanan,
Corrine Yap
Abstract:
Our first main result is a uniform bound, in every dimension $k \in \mathbb N$, on the topological Turán numbers of $k$-dimensional simplicial complexes: for each $k \in \mathbb N$, there is a $λ_k \ge k^{-2k^2}$ such that for any $k$-complex $\mathcal{S}$, every $k$-complex on $n \ge n_0(\mathcal{S})$ vertices with at least $n^{k+1 - λ_k}$ facets contains a homeomorphic copy of $\mathcal{S}$. Thi…
▽ More
Our first main result is a uniform bound, in every dimension $k \in \mathbb N$, on the topological Turán numbers of $k$-dimensional simplicial complexes: for each $k \in \mathbb N$, there is a $λ_k \ge k^{-2k^2}$ such that for any $k$-complex $\mathcal{S}$, every $k$-complex on $n \ge n_0(\mathcal{S})$ vertices with at least $n^{k+1 - λ_k}$ facets contains a homeomorphic copy of $\mathcal{S}$. This was previously known only in dimensions one and two, both by highly dimension-specific arguments: the existence of $λ_1$ is a result of Mader from 1967, and the existence of $λ_2$ was suggested by Linial in 2006 and recently proved by Keevash-Long-Narayanan-Scott. We deduce this geometric fact from a purely combinatorial result about trace-bounded hypergraphs, where an $r$-partite $r$-graph $H$ with partite classes $V_1, V_2, \dots, V_r$ is said to be $d$-trace-bounded if for each $2 \le i \le r$, all the vertices of $V_i$ have degree at most $d$ in the trace of $H$ on $V_1 \cup V_2 \cup \dots \cup V_i$. Our second main result is the following estimate for the Turán numbers of degenerate trace-bounded hypergraphs: for all $r \ge 2$ and $d\in\mathbb N$, there is an $α_{r,d} \ge (5rd)^{1-r}$ such that for any $d$-trace-bounded $r$-partite $r$-graph $H$, every $r$-graph on $n \ge n_0(H)$ vertices with at least $n^{r - α_{r,d}}$ edges contains a copy of $H$. This strengthens a result of Conlon-Fox-Sudakov from 2009 who showed that such a bound holds for $r$-partite $r$-graphs $H$ satisfying the stronger hypothesis that the vertex-degrees in all but one of its partite classes are bounded (in $H$, as opposed to in its traces).
△ Less
Submitted 5 July, 2022; v1 submitted 16 November, 2020;
originally announced November 2020.
-
The threshold for the square of a Hamilton cycle
Authors:
Jeff Kahn,
Bhargav Narayanan,
**young Park
Abstract:
Resolving a conjecture of Kühn and Osthus from 2012, we show that $p= 1/\sqrt{n}$ is the threshold for the random graph $G_{n,p}$ to contain the square of a Hamilton cycle.
Resolving a conjecture of Kühn and Osthus from 2012, we show that $p= 1/\sqrt{n}$ is the threshold for the random graph $G_{n,p}$ to contain the square of a Hamilton cycle.
△ Less
Submitted 16 October, 2020;
originally announced October 2020.
-
Active Learning A Neural Network Model For Gold Clusters \& Bulk From Sparse First Principles Training Data
Authors:
Troy D Loeffler,
Sukriti Manna,
Tarak K Patra,
Henry Chan,
Badri Narayanan,
Subramanian Sankaranarayanan
Abstract:
Small metal clusters are of fundamental scientific interest and of tremendous significance in catalysis. These nanoscale clusters display diverse geometries and structural motifs depending on the cluster size; a knowledge of this size-dependent structural motifs and their dynamical evolution has been of longstanding interest. Classical MD typically employ predefined functional forms which limits t…
▽ More
Small metal clusters are of fundamental scientific interest and of tremendous significance in catalysis. These nanoscale clusters display diverse geometries and structural motifs depending on the cluster size; a knowledge of this size-dependent structural motifs and their dynamical evolution has been of longstanding interest. Classical MD typically employ predefined functional forms which limits their ability to capture such complex size-dependent structural and dynamical transformation. Neural Network (NN) based potentials represent flexible alternatives and in principle, well-trained NN potentials can provide high level of flexibility, transferability and accuracy on-par with the reference model used for training. A major challenge, however, is that NN models are interpolative and requires large quantities of training data to ensure that the model adequately samples the energy landscape both near and far-from-equilibrium. Here, we introduce an active learning (AL) scheme that trains a NN model on-the-fly with minimal amount of first-principles based training data. Our AL workflow is initiated with a sparse training dataset (1 to 5 data points) and is updated on-the-fly via a Nested Ensemble Monte Carlo scheme that iteratively queries the energy landscape in regions of failure and updates the training pool to improve the network performance. Using a representative system of gold clusters, we demonstrate that our AL workflow can train a NN with ~500 total reference calculations. Our NN predictions are within 30 meV/atom and 40 meV/Åof the reference DFT calculations. Moreover, our AL-NN model also adequately captures the various size-dependent structural and dynamical properties of gold clusters in excellent agreement with DFT calculations and available experiments.
△ Less
Submitted 5 June, 2020;
originally announced June 2020.
-
A universal exponent for homeomorphs
Authors:
Peter Keevash,
Jason Long,
Bhargav Narayanan,
Alex Scott
Abstract:
We prove a uniform bound on the topological Turán number of an arbitrary two-dimensional simplicial complex $S$: any $n$-vertex two-dimensional complex with at least $C_S n^{3-1/5}$ facets contains a homeomorphic copy of $S$, where $C_S > 0$ is an absolute constant depending on $S$ alone. This result, a two-dimensional analogue of a classical result of Mader for one-dimensional complexes, sheds so…
▽ More
We prove a uniform bound on the topological Turán number of an arbitrary two-dimensional simplicial complex $S$: any $n$-vertex two-dimensional complex with at least $C_S n^{3-1/5}$ facets contains a homeomorphic copy of $S$, where $C_S > 0$ is an absolute constant depending on $S$ alone. This result, a two-dimensional analogue of a classical result of Mader for one-dimensional complexes, sheds some light on an old problem of Linial from 2006.
△ Less
Submitted 6 April, 2020;
originally announced April 2020.
-
Subgraphs of large connectivity and chromatic number
Authors:
António Girão,
Bhargav Narayanan
Abstract:
Resolving a problem raised by Norin, we show that for each $k \in \mathbb{N}$, there exists an $f(k) \le 7k$ such that every graph $G$ with chromatic number at least $f(k)+1$ contains a subgraph $H$ with both connectivity and chromatic number at least $k$. This result is best-possible up to multiplicative constants, and sharpens earlier results of Alon-Kleitman-Thomassen-Saks-Seymour from 1987 sho…
▽ More
Resolving a problem raised by Norin, we show that for each $k \in \mathbb{N}$, there exists an $f(k) \le 7k$ such that every graph $G$ with chromatic number at least $f(k)+1$ contains a subgraph $H$ with both connectivity and chromatic number at least $k$. This result is best-possible up to multiplicative constants, and sharpens earlier results of Alon-Kleitman-Thomassen-Saks-Seymour from 1987 showing that $f(k) = O(k^3)$, and of Chudnovsky-Penev-Scott-Trotignon from 2013 showing that $f(k) = O(k^2)$. Our methods are robust enough to handle list colouring as well: we also show that for each $k \in \mathbb{N}$, there exists an $f_\ell(k) \le 4k$ such that every graph $G$ with list chromatic number at least $f_\ell(k)+1$ contains a subgraph $H$ with both connectivity and list chromatic number at least $k$. This result is again best-possible up to multiplicative constants; here, unlike with $f(\cdot)$, even the existence of $f_\ell(\cdot)$ appears to have been previously unknown.
△ Less
Submitted 2 April, 2020; v1 submitted 1 April, 2020;
originally announced April 2020.
-
BLAST: Bridging Length/time scales via Atomistic Simulation Toolkit
Authors:
Henry Chan,
Badri Narayanan,
Mathew Cherukara,
Troy D. Loeffler,
Michael G. Sternberg,
Anthony Avarca,
Subramanian K. R. S. Sankaranarayanan
Abstract:
The ever-increasing power of supercomputers coupled with highly scalable simulation codes have made molecular dynamics an indispensable tool in applications ranging from predictive modeling of materials to computational design and discovery of new materials for a broad range of applications. Multi-fidelity scale bridging between the various flavors of molecular dynamics i.e. ab-initio, classical a…
▽ More
The ever-increasing power of supercomputers coupled with highly scalable simulation codes have made molecular dynamics an indispensable tool in applications ranging from predictive modeling of materials to computational design and discovery of new materials for a broad range of applications. Multi-fidelity scale bridging between the various flavors of molecular dynamics i.e. ab-initio, classical and coarse-grained models has remained a long-standing challenge. Here, we introduce our framework BLAST (Bridging Length/time scales via Atomistic Simulation Toolkit) that leverages machine learning principles to address this challenge. BLAST is a multi-fidelity scale bridging framework that provide users with the capabilities to train and develop their own classical atomistic and coarse-grained interatomic potentials (force fields) for molecular simulations. BLAST is designed to address several long-standing problems in the molecular simulations community, such as unintended misuse of existing force fields due to knowledge gap between developers and users, bottlenecks in traditional force field development approaches, and other issues relating to the accuracy, efficiency, and transferability of force fields. Here, we discuss several important aspects in force field development and highlight features in BLAST that enable its functionalities and ease of use.
△ Less
Submitted 21 February, 2020;
originally announced February 2020.
-
Counting independent sets in regular hypergraphs
Authors:
Jozsef Balogh,
Bela Bollobas,
Bhargav Narayanan
Abstract:
Amongst $d$-regular $r$-uniform hypergraphs on $n$ vertices, which ones have the largest number of independent sets? While the analogous problem for graphs (originally raised by Granville) is now well-understood, it is not even clear what the correct general conjecture ought to be; our goal here is propose such a generalisation. Lending credence to our conjecture, we verify it within the class of…
▽ More
Amongst $d$-regular $r$-uniform hypergraphs on $n$ vertices, which ones have the largest number of independent sets? While the analogous problem for graphs (originally raised by Granville) is now well-understood, it is not even clear what the correct general conjecture ought to be; our goal here is propose such a generalisation. Lending credence to our conjecture, we verify it within the class of `quasi-bipartite' hypergraphs (a generalisation of bipartite graphs that seems natural in this context) by adopting the entropic approach of Kahn.
△ Less
Submitted 23 February, 2020;
originally announced February 2020.
-
Understanding Deep Neural Network Predictions for Medical Imaging Applications
Authors:
Barath Narayanan Narayanan,
Manawaduge Supun De Silva,
Russell C. Hardie,
Nathan K. Kueterman,
Redha Ali
Abstract:
Computer-aided detection has been a research area attracting great interest in the past decade. Machine learning algorithms have been utilized extensively for this application as they provide a valuable second opinion to the doctors. Despite several machine learning models being available for medical imaging applications, not many have been implemented in the real-world due to the uninterpretable…
▽ More
Computer-aided detection has been a research area attracting great interest in the past decade. Machine learning algorithms have been utilized extensively for this application as they provide a valuable second opinion to the doctors. Despite several machine learning models being available for medical imaging applications, not many have been implemented in the real-world due to the uninterpretable nature of the decisions made by the network. In this paper, we investigate the results provided by deep neural networks for the detection of malaria, diabetic retinopathy, brain tumor, and tuberculosis in different imaging modalities. We visualize the class activation map**s for all the applications in order to enhance the understanding of these networks. This type of visualization, along with the corresponding network performance metrics, would aid the data science experts in better understanding of their models as well as assisting doctors in their decision-making process.
△ Less
Submitted 19 December, 2019;
originally announced December 2019.
-
Thresholds versus fractional expectation-thresholds
Authors:
Keith Frankston,
Jeff Kahn,
Bhargav Narayanan,
**young Park
Abstract:
Proving a conjecture of Talagrand, a fractional version of the 'expectation-threshold' conjecture of Kalai and the second author, we show for any increasing family $F$ on a finite set $X$ that $p_c (F) =O( q_f (F) \log \ell(F))$, where $p_c(F)$ and $q_f(F)$ are the threshold and 'fractional expectation-threshold' of $F$, and $\ell(F)$ is the largest size of a minimal member of $F$. This easily imp…
▽ More
Proving a conjecture of Talagrand, a fractional version of the 'expectation-threshold' conjecture of Kalai and the second author, we show for any increasing family $F$ on a finite set $X$ that $p_c (F) =O( q_f (F) \log \ell(F))$, where $p_c(F)$ and $q_f(F)$ are the threshold and 'fractional expectation-threshold' of $F$, and $\ell(F)$ is the largest size of a minimal member of $F$. This easily implies several heretofore difficult results and conjectures in probabilistic combinatorics, including thresholds for perfect hypergraph matchings (Johansson--Kahn--Vu), bounded-degree spanning trees (Montgomery), and bounded-degree spanning graphs (new). We also resolve (and vastly extend) the 'axial' version of the random multi-dimensional assignment problem (earlier considered by Martin--Mézard--Rivoire and Frieze--Sorkin). Our approach builds on a recent breakthrough of Alweiss, Lovett, Wu and Zhang on the Erdős--Rado 'Sunflower Conjecture'.
△ Less
Submitted 10 December, 2019; v1 submitted 29 October, 2019;
originally announced October 2019.
-
A coarse-grained deep neural network model for liquid water
Authors:
Tarak K Patra,
Troy D. Loeffler,
Henry Chan,
Mathew J. Cherukara,
Badri Narayanan,
Subramanian K. R. S. Sankaranarayanan
Abstract:
We introduce a coarse-grained deep neural network model (CG-DNN) for liquid water that utilizes 50 rotational and translational invariant coordinates, and is trained exclusively against energies of ~30,000 bulk water configurations. Our CG-DNN potential accurately predicts both the energies and molecular forces of water; within 0.9 meV/molecule and 54 meV/angstrom of a reference (coarse-grained bo…
▽ More
We introduce a coarse-grained deep neural network model (CG-DNN) for liquid water that utilizes 50 rotational and translational invariant coordinates, and is trained exclusively against energies of ~30,000 bulk water configurations. Our CG-DNN potential accurately predicts both the energies and molecular forces of water; within 0.9 meV/molecule and 54 meV/angstrom of a reference (coarse-grained bond-order potential) model. The CG-DNN water model also provides good prediction of several structural, thermodynamic, and temperature dependent properties of liquid water, with values close to that obtained from the reference model. More importantly, CG-DNN captures the well-known density anomaly of liquid water observed in experiments. Our work lays the groundwork for a scheme where existing empirical water models can be utilized to develop fully flexible neural network framework that can subsequently be trained against sparse data from high-fidelity albeit expensive beyond-DFT calculations.
△ Less
Submitted 14 October, 2019; v1 submitted 1 October, 2019;
originally announced October 2019.
-
On symmetric intersecting families of vectors
Authors:
Sean Eberhard,
Jeff Kahn,
Bhargav Narayanan,
Sophie Spirkl
Abstract:
A family of vectors $A \subset [k]^n$ is said to be intersecting if any two elements of $A$ agree on at least one coordinate. We prove, for fixed $k \ge 3$, that the size of a symmetric intersecting subfamily of $[k]^n$ is $o(k^n)$, which is in stark contrast to the case of the Boolean hypercube (where $k =2$). Our main contribution addresses limitations of existing technology: while there is now…
▽ More
A family of vectors $A \subset [k]^n$ is said to be intersecting if any two elements of $A$ agree on at least one coordinate. We prove, for fixed $k \ge 3$, that the size of a symmetric intersecting subfamily of $[k]^n$ is $o(k^n)$, which is in stark contrast to the case of the Boolean hypercube (where $k =2$). Our main contribution addresses limitations of existing technology: while there is now some spectral machinery, developed by Ellis and the third author, to tackle extremal problems in set theory involving symmetry, this machinery relies crucially on the interplay between up-sets and biased product measures on the Boolean hypercube, features that are notably absent in the problem at hand; here, we describe a method for circumventing these barriers.
△ Less
Submitted 31 July, 2020; v1 submitted 25 September, 2019;
originally announced September 2019.
-
Disproportionate division
Authors:
Logan Crew,
Bhargav Narayanan,
Sophie Spirkl
Abstract:
We study the disproportionate version of the classical cake-cutting problem: how efficiently can we divide a cake, here $[0,1]$, among $n$ agents with different demands $α_1, α_2, \dots, α_n$ summing to $1$? When all the agents have equal demands of $α_1 = α_2 = \dots = α_n = 1/n$, it is well-known that there exists a fair division with $n-1$ cuts, and this is optimal. For arbitrary demands on the…
▽ More
We study the disproportionate version of the classical cake-cutting problem: how efficiently can we divide a cake, here $[0,1]$, among $n$ agents with different demands $α_1, α_2, \dots, α_n$ summing to $1$? When all the agents have equal demands of $α_1 = α_2 = \dots = α_n = 1/n$, it is well-known that there exists a fair division with $n-1$ cuts, and this is optimal. For arbitrary demands on the other hand, folklore arguments from algebraic topology show that $O(n\log n)$ cuts suffice, and this has been the state of the art for decades. Here, we improve the state of affairs in two ways: we prove that disproportionate division may always be achieved with $3n-4$ cuts, and give an effective combinatorial procedure to construct such a division. We also offer a topological conjecture that implies that $2n-2$ cuts suffice in general, which would be optimal.
△ Less
Submitted 16 September, 2019;
originally announced September 2019.
-
Slowdown for the geodesic-biased random walk
Authors:
Mikhail Beliayeu,
Petr Chmel,
Bhargav Narayanan,
Jan Petr
Abstract:
Given a connected graph $G$ with some subset of its vertices excited and a fixed target vertex, in the geodesic-biased random walk on $G$, a random walker moves as follows: from an unexcited vertex, she moves to a uniformly random neighbour, whereas from an excited vertex, she takes one step along some fixed shortest path towards the target vertex. We show, perhaps counterintuitively, that the geo…
▽ More
Given a connected graph $G$ with some subset of its vertices excited and a fixed target vertex, in the geodesic-biased random walk on $G$, a random walker moves as follows: from an unexcited vertex, she moves to a uniformly random neighbour, whereas from an excited vertex, she takes one step along some fixed shortest path towards the target vertex. We show, perhaps counterintuitively, that the geodesic-bias can slow the random walker down exponentially: there exist connected, bounded-degree $n$-vertex graphs with excitations where the expected hitting time of a fixed target is at least $\exp (\sqrt[4]{n} / 100)$.
△ Less
Submitted 12 September, 2019;
originally announced September 2019.
-
Turán theorems for unavoidable patterns
Authors:
António Girão,
Bhargav Narayanan
Abstract:
We prove Turán-type theorems for two related Ramsey problems raised by Bollobás and by Fox and Sudakov. First, for $t \ge 3$, we show that any two-colouring of the complete graph on $n$ vertices that is $δ$-far from being monochromatic contains an \emph{unavoidable $t$-colouring} when $δ\gg n^{-1/t}$, where an unavoidable $t$-colouring is any two-colouring of a clique of order $2t$ in which one co…
▽ More
We prove Turán-type theorems for two related Ramsey problems raised by Bollobás and by Fox and Sudakov. First, for $t \ge 3$, we show that any two-colouring of the complete graph on $n$ vertices that is $δ$-far from being monochromatic contains an \emph{unavoidable $t$-colouring} when $δ\gg n^{-1/t}$, where an unavoidable $t$-colouring is any two-colouring of a clique of order $2t$ in which one colour forms either a clique of order $t$ or two disjoint cliques of order $t$. Next, for $ t\ge 3$, we show that any tournament on $n$ vertices that is $δ$-far from being transitive contains an \emph{unavoidable $t$-tournament} when $δ\gg n^{-1/\lceil t/2 \rceil}$, where an unavoidable $t$-tournament is the blow-up of a cyclic triangle obtained by replacing each vertex of the triangle by a transitive tournament of order $t$. Conditional on a well-known conjecture about bipartite Turán numbers, both results are sharp up to implied constants and hence determine the order of magnitude of the corresponding off-diagonal Ramsey numbers.
△ Less
Submitted 1 July, 2019;
originally announced July 2019.
-
Sharp thresholds for nonlinear Hamiltonian cycles in hypergraphs
Authors:
Bhargav Narayanan,
Mathias Schacht
Abstract:
For positive integers $r > \ell$, an $r$-uniform hypergraph is called an $\ell$-cycle if there exists a cyclic ordering of its vertices such that each of its edges consists of $r$ consecutive vertices, and such that every pair of consecutive edges (in the natural ordering of the edges) intersect in precisely $\ell$ vertices. Such cycles are said to be linear when $\ell = 1$, and nonlinear when…
▽ More
For positive integers $r > \ell$, an $r$-uniform hypergraph is called an $\ell$-cycle if there exists a cyclic ordering of its vertices such that each of its edges consists of $r$ consecutive vertices, and such that every pair of consecutive edges (in the natural ordering of the edges) intersect in precisely $\ell$ vertices. Such cycles are said to be linear when $\ell = 1$, and nonlinear when $\ell > 1$. We determine the sharp threshold for nonlinear Hamiltonian cycles and show that for all $r > \ell > 1$, the threshold $p^*_{r, \ell} (n)$ for the appearance of a Hamiltonian $\ell$-cycle in the random $r$-uniform hypergraph on $n$ vertices is sharp and is $p^*_{r, \ell} (n) = λ(r,\ell) (\frac{\mathrm{e}}{n})^{r - \ell}$ for an explicitly specified function $λ$. This resolves several questions raised by Dudek and Frieze in 2011.
△ Less
Submitted 12 June, 2019;
originally announced June 2019.
-
Machine Learning Prediction of Accurate Atomization Energies of Organic Molecules from Low-Fidelity Quantum Chemical Calculations
Authors:
Logan Ward,
Ben Blaiszik,
Ian Foster,
Rajeev S. Assary,
Badri Narayanan,
Larry Curtiss
Abstract:
Recent studies illustrate how machine learning (ML) can be used to bypass a core challenge of molecular modeling: the tradeoff between accuracy and computational cost. Here, we assess multiple ML approaches for predicting the atomization energy of organic molecules. Our resulting models learn the difference between low-fidelity, B3LYP, and high-accuracy, G4MP2, atomization energies, and predict th…
▽ More
Recent studies illustrate how machine learning (ML) can be used to bypass a core challenge of molecular modeling: the tradeoff between accuracy and computational cost. Here, we assess multiple ML approaches for predicting the atomization energy of organic molecules. Our resulting models learn the difference between low-fidelity, B3LYP, and high-accuracy, G4MP2, atomization energies, and predict the G4MP2 atomization energy to 0.005 eV (mean absolute error) for molecules with less than 9 heavy atoms and 0.012 eV for a small set of molecules with between 10 and 14 heavy atoms. Our two best models, which have different accuracy/speed tradeoffs, enable the efficient prediction of G4MP2-level energies for large molecules and are available through a simple web interface.
△ Less
Submitted 7 June, 2019;
originally announced June 2019.
-
Graphene Overcoats for Ultra-High Storage Density Magnetic Media
Authors:
N. Dwivedi,
A. K. Ott,
K. Sasikumar,
C. Dou,
R. J. Yeo,
B. Narayanan,
U. Sassi,
D. De Fazio,
G. Soavi,
T. Dutta,
S. K. R. S. Sankaranarayanan,
A. C. Ferrari,
C. S. Bhatia
Abstract:
Hard disk drives (HDDs) are used as secondary storage in a number of digital electronic devices owing to low cost ($<$0.1\$/GB at 2016 prices) and large data storage capacity (10TB with a 3.5 inch HDD). Due to the exponentially increasing amount of data, there is a need to increase areal storage densities beyond$\sim$1Tb/in$^2$. This requires the thickness of carbon overcoats (COCs) to be$<…
▽ More
Hard disk drives (HDDs) are used as secondary storage in a number of digital electronic devices owing to low cost ($<$0.1\$/GB at 2016 prices) and large data storage capacity (10TB with a 3.5 inch HDD). Due to the exponentially increasing amount of data, there is a need to increase areal storage densities beyond$\sim$1Tb/in$^2$. This requires the thickness of carbon overcoats (COCs) to be$<$2nm. Friction, wear, corrosion, and thermal stability are critical concerns$<$2nm, where most of the protective properties of current COCs are lost. This limits current technology and restricts COC integration with heat assisted magnetic recording technology (HAMR), since this also requires laser irradiation stability. Here we show that graphene-based overcoats can overcome all these limitations. 2-4 layers of graphene enable two-fold reduction in friction and provide better corrosion and wear than state-of-the-art COCs. A single graphene layer is enough to reduce corrosion$\sim$2.5 times. We also show that graphene can withstand HAMR conditions. Thus, graphene-based overcoats can enable ultrahigh areal density HDDs$>$10Tb/in$^2$.
△ Less
Submitted 2 June, 2019;
originally announced June 2019.
-
Product-free sets in the free semigroup
Authors:
Imre Leader,
Shoham Letzter,
Bhargav Narayanan,
Mark Walters
Abstract:
In this paper, we study product-free subsets of the free semigroup over a finite alphabet $A$. We prove that the maximum density of a product-free subset of the free semigroup over $A$, with respect to the natural measure that assigns a weight of $|A|^{-n}$ to each word of length $n$, is precisely $1/2$.
In this paper, we study product-free subsets of the free semigroup over a finite alphabet $A$. We prove that the maximum density of a product-free subset of the free semigroup over $A$, with respect to the natural measure that assigns a weight of $|A|^{-n}$ to each word of length $n$, is precisely $1/2$.
△ Less
Submitted 11 December, 2018;
originally announced December 2018.
-
Comparing optimization strategies for force field parameterization
Authors:
Fatih G. Sen,
Badri Narayanan,
Jeffrey Larson,
Alper Kinaci,
Kiran Sasikumar,
Michael J. Davis,
Stefan M. Wild,
Stephen K. Gray,
Subramanian K. R. S. Sankaranarayanan,
Maria K. Y. Chan
Abstract:
Classical molecular dynamics (MD) simulations enable modeling of materials and examination of microscopic details that are not accessible experimentally. The predictive capability of MD relies on the force field (FF) used to describe interatomic interactions. FF parameters are typically determined to reproduce selected material properties computed from density functional theory (DFT) and/or measur…
▽ More
Classical molecular dynamics (MD) simulations enable modeling of materials and examination of microscopic details that are not accessible experimentally. The predictive capability of MD relies on the force field (FF) used to describe interatomic interactions. FF parameters are typically determined to reproduce selected material properties computed from density functional theory (DFT) and/or measured experimentally. A common practice in parameterizing FFs is to use least-squares local minimization algorithms. Genetic algorithms (GAs) have also been demonstrated as a viable global optimization approach, even for complex FFs. However, an understanding of the relative effectiveness and efficiency of different optimization techniques for the determination of FF parameters is still lacking. In this work, we evaluate various FF parameter optimization schemes, using as example a training data set calculated from DFT for different polymorphs of Ir$O_2$. The Morse functional form is chosen for the pairwise interactions and the optimization of the parameters against the training data is carried out using (1) multi-start local optimization algorithms: Simplex, Levenberg-Marquardt, and POUNDERS, (2) single-objective GA, and (3) multi-objective GA. Using random search as a baseline, we compare the algorithms in terms of reaching the lowest error, and number of function evaluations. We also compare the effectiveness of different approaches for FF parameterization using a test data set with known ground truth (i.e generated from a specific Morse FF). We find that the performance of optimization approaches differs when using the Test data vs. the DFT data. Overall, this study provides insight for selecting a suitable optimization method for FF parameterization, which in turn can enable more accurate prediction of material properties and chemical phenomena.
△ Less
Submitted 1 December, 2018;
originally announced December 2018.
-
Spanning surfaces in 3-graphs
Authors:
Agelos Georgakopoulos,
John Haslegrave,
Richard Montgomery,
Bhargav Narayanan
Abstract:
We prove a topological extension of Dirac's theorem suggested by Gowers in 2005: for any connected, closed surface $\mathscr{S}$, we show that any two-dimensional simplicial complex on $n$ vertices in which each pair of vertices belongs to at least $n/3 + o(n)$ facets contains a homeomorph of $\mathscr{S}$ spanning all the vertices. This result is asymptotically sharp, and implies in particular th…
▽ More
We prove a topological extension of Dirac's theorem suggested by Gowers in 2005: for any connected, closed surface $\mathscr{S}$, we show that any two-dimensional simplicial complex on $n$ vertices in which each pair of vertices belongs to at least $n/3 + o(n)$ facets contains a homeomorph of $\mathscr{S}$ spanning all the vertices. This result is asymptotically sharp, and implies in particular that any 3-uniform hypergraph on $n$ vertices with minimum codegree exceeding $n/3+o(n)$ contains a spanning triangulation of the $2$-sphere.
△ Less
Submitted 21 August, 2018;
originally announced August 2018.
-
Exceptional graphs for the random walk
Authors:
Juhan Aru,
Carla Groenland,
Tom Johnston,
Bhargav Narayanan,
Alex Roberts,
Alex Scott
Abstract:
If $\mathcal{W}$ is the simple random walk on the square lattice $\mathbb{Z}^2$, then $\mathcal{W}$ induces a random walk $\mathcal{W}_G$ on any spanning subgraph $G\subset \mathbb{Z}^2$ of the lattice as follows: viewing $\mathcal{W}$ as a uniformly random infinite word on the alphabet $\{\mathbf{x}, -\mathbf{x}, \mathbf{y}, -\mathbf{y} \}$, the walk $\mathcal{W}_G$ starts at the origin and follo…
▽ More
If $\mathcal{W}$ is the simple random walk on the square lattice $\mathbb{Z}^2$, then $\mathcal{W}$ induces a random walk $\mathcal{W}_G$ on any spanning subgraph $G\subset \mathbb{Z}^2$ of the lattice as follows: viewing $\mathcal{W}$ as a uniformly random infinite word on the alphabet $\{\mathbf{x}, -\mathbf{x}, \mathbf{y}, -\mathbf{y} \}$, the walk $\mathcal{W}_G$ starts at the origin and follows the directions specified by $\mathcal{W}$, only accepting steps of $\mathcal{W}$ along which the walk $\mathcal{W}_G$ does not exit $G$. For any fixed subgraph $G \subset \mathbb{Z}^2$, the walk $\mathcal{W}_G$ is distributed as the simple random walk on $G$, and hence $\mathcal{W}_G$ is almost surely recurrent in the sense that $\mathcal{W}_G$ visits every site reachable from the origin in $G$ infinitely often. This fact naturally leads us to ask the following: does $\mathcal{W}$ almost surely have the property that $\mathcal{W}_G$ is recurrent for \emph{every} subgraph $G \subset \mathbb{Z}^2$? We answer this question negatively, demonstrating that exceptional subgraphs exist almost surely. In fact, we show more to be true: exceptional subgraphs continue to exist almost surely for a countable collection of independent simple random walks, but on the other hand, there are almost surely no exceptional subgraphs for a branching random walk.
△ Less
Submitted 6 September, 2018; v1 submitted 16 May, 2018;
originally announced May 2018.
-
Pressure-Induced Phase Transformation in $β$-Eucryptite: an X-Ray Diffraction and Density Functional Theory Study
Authors:
Yachao Chen,
Sukriti Manna,
Badri Narayanan,
Zhongwu Wang,
Ivar E. Reimanis,
Cristian V. Ciobanu
Abstract:
Certain alumino-silicates display exotic properties enabled by their framework structure made of corner-sharing tetrahedral rigid units. Using \textit{in situ} diamond-anvil cell x-ray diffraction (XRD), we study the pressure-induced transformation of $β$ eucryptite, a prototypical alumino-silicate. $β$ eucryptite undergoes a phase transformation at moderate pressures, but the atomic structure of…
▽ More
Certain alumino-silicates display exotic properties enabled by their framework structure made of corner-sharing tetrahedral rigid units. Using \textit{in situ} diamond-anvil cell x-ray diffraction (XRD), we study the pressure-induced transformation of $β$ eucryptite, a prototypical alumino-silicate. $β$ eucryptite undergoes a phase transformation at moderate pressures, but the atomic structure of the new phase has not yet been reported. Based on density functional theory stability studies and Rietveld analysis of XRD patterns, we find that the pressure-stabilized phase belongs to the Pna2$_1$ space group. Furthermore, we discover two other possible pressure-stabilized polymorphs, P1c1 and Pca2$_1$.
△ Less
Submitted 20 March, 2018;
originally announced March 2018.
-
On regular 3-wise intersecting families
Authors:
Keith Frankston,
Jeff Kahn,
Bhargav Narayanan
Abstract:
Ellis and the third author showed, verifying a conjecture of Frankl, that any $3$-wise intersecting family of subsets of $\{1,2,\dots,n\}$ admitting a transitive automorphism group has cardinality $o(2^n)$, while a construction of Frankl demonstrates that the same conclusion need not hold under the weaker constraint of being regular. Answering a question of Cameron, Frankl and Kantor from 1989, we…
▽ More
Ellis and the third author showed, verifying a conjecture of Frankl, that any $3$-wise intersecting family of subsets of $\{1,2,\dots,n\}$ admitting a transitive automorphism group has cardinality $o(2^n)$, while a construction of Frankl demonstrates that the same conclusion need not hold under the weaker constraint of being regular. Answering a question of Cameron, Frankl and Kantor from 1989, we show that the restriction of admitting a transitive automorphism group may be relaxed significantly: we prove that any $3$-wise intersecting family of subsets of $\{1,2,\dots,n\}$ that is regular and increasing has cardinality $o(2^n)$.
△ Less
Submitted 27 December, 2017;
originally announced December 2017.
-
Long cycles in Hamiltonian graphs
Authors:
António Girão,
Teeradej Kittipassorn,
Bhargav Narayanan
Abstract:
We prove that if an $n$-vertex graph with minimum degree at least $3$ contains a Hamiltonian cycle, then it contains another cycle of length $n-o(n)$; this implies, in particular, that a well-known conjecture of Sheehan from 1975 holds asymptotically. Our methods, which combine constructive, poset-based techniques and non-constructive, parity-based arguments, may be of independent interest.
We prove that if an $n$-vertex graph with minimum degree at least $3$ contains a Hamiltonian cycle, then it contains another cycle of length $n-o(n)$; this implies, in particular, that a well-known conjecture of Sheehan from 1975 holds asymptotically. Our methods, which combine constructive, poset-based techniques and non-constructive, parity-based arguments, may be of independent interest.
△ Less
Submitted 16 September, 2017; v1 submitted 14 September, 2017;
originally announced September 2017.
-
Coppersmith's lattices and "focus groups": an attack on small-exponent RSA
Authors:
Stephen D. Miller,
Bhargav Narayanan,
Ramarathnam Venkatesan
Abstract:
We present a principled technique for reducing the lattice and matrix size in some applications of Coppersmith's lattice method for finding roots of modular polynomial equations. Motivated by ideas from machine learning, it relies on extrapolating patterns from the actual behavior of Coppersmith's attack for smaller parameter sizes, which can be thought of as "focus group" testing. When applied to…
▽ More
We present a principled technique for reducing the lattice and matrix size in some applications of Coppersmith's lattice method for finding roots of modular polynomial equations. Motivated by ideas from machine learning, it relies on extrapolating patterns from the actual behavior of Coppersmith's attack for smaller parameter sizes, which can be thought of as "focus group" testing. When applied to the small-exponent RSA problem, our technique reduces lattice dimensions and consequently running times, and hence can be applied to a wider range of exponents. Moreover, in many difficult examples our attack is not only faster but also more successful in recovering the RSA secret key. We include a discussion of subtleties concerning whether or not existing metrics (such as enabling condition bounds) are decisive in predicting the true efficacy of attacks based on Coppersmith's method. Finally, indications are given which suggest certain lattice basis reduction algorithms (such as Nguyen-Stehlé's L2) may be particularly well-suited for Coppersmith's method.
△ Less
Submitted 16 December, 2020; v1 submitted 30 August, 2017;
originally announced August 2017.
-
Reconstructing random jigsaws
Authors:
Paul Balister,
Béla Bollobás,
Bhargav Narayanan
Abstract:
A colouring of the edges of an $n \times n$ grid is said to be \emph{reconstructible} if the colouring is uniquely determined by the multiset of its $n^2$ \emph{tiles}, where the tile corresponding to a vertex of the grid specifies the colours of the edges incident to that vertex in some fixed order. In 2015, Mossel and Ross asked the following question: if the edges of an $n \times n$ grid are co…
▽ More
A colouring of the edges of an $n \times n$ grid is said to be \emph{reconstructible} if the colouring is uniquely determined by the multiset of its $n^2$ \emph{tiles}, where the tile corresponding to a vertex of the grid specifies the colours of the edges incident to that vertex in some fixed order. In 2015, Mossel and Ross asked the following question: if the edges of an $n \times n$ grid are coloured independently and uniformly at random using $q=q(n)$ different colours, then is the resulting colouring reconstructible with high probability? From below, Mossel and Ross showed that such a colouring is not reconstructible when $q = o(n^{2/3})$ and from above, Bordenave, Feige and Mossel and Nenadov, Pfister and Steger independently showed, for any fixed $ε> 0$, that such a colouring is reconstructible when $q \ge n^{1+ε}$. Here, we improve on these results and prove the following: there exist absolute constants $C, c > 0$ such that, as $n \to \infty$, the probability that a random colouring as above is reconstructible tends to $1$ if $q \ge Cn$ and to $0$ if $q \le cn$.
△ Less
Submitted 15 July, 2017;
originally announced July 2017.
-
The number of hypergraphs without linear cycles
Authors:
József Balogh,
Bhargav Narayanan,
Jozef Skokan
Abstract:
The $r$-uniform linear $k$-cycle $C^r_k$ is the $r$-uniform hypergraph on $k(r-1)$ vertices whose edges are sets of $r$ consecutive vertices in a cyclic ordering of the vertex set chosen in such a way that every pair of consecutive edges share exactly one vertex. Here, we prove a balanced supersaturation result for linear cycles which we then use in conjunction with the method of hypergraph contai…
▽ More
The $r$-uniform linear $k$-cycle $C^r_k$ is the $r$-uniform hypergraph on $k(r-1)$ vertices whose edges are sets of $r$ consecutive vertices in a cyclic ordering of the vertex set chosen in such a way that every pair of consecutive edges share exactly one vertex. Here, we prove a balanced supersaturation result for linear cycles which we then use in conjunction with the method of hypergraph containers to show that for any fixed pair of integers $r, k \ge 3$, the number of $C^r_k$-free $r$-uniform hypergraphs on $n$ vertices is $2^{Θ(n^{r-1})}$, thereby settling a conjecture due to Mubayi and Wang.
△ Less
Submitted 5 June, 2017;
originally announced June 2017.
-
Diffusion on graphs is eventually periodic
Authors:
Jason Long,
Bhargav Narayanan
Abstract:
We study a variant of the chip-firing game called \emph{diffusion}. In diffusion on a graph, each vertex of the graph is initially labelled with an integer interpreted as the number of chips at that vertex, and at each subsequent step, each vertex simultaneously fires one chip to each of its neighbours with fewer chips. Since this firing rule may result in negative labels, diffusion, unlike the pa…
▽ More
We study a variant of the chip-firing game called \emph{diffusion}. In diffusion on a graph, each vertex of the graph is initially labelled with an integer interpreted as the number of chips at that vertex, and at each subsequent step, each vertex simultaneously fires one chip to each of its neighbours with fewer chips. Since this firing rule may result in negative labels, diffusion, unlike the parallel chip-firing game, is not obviously periodic. In 2016, Duffy, Lidbetter, Messinger and Nowakowski nevertheless conjectured that diffusion is always eventually periodic, and moreover, that the process eventually has period either 1 or 2. Here, we establish this conjecture.
△ Less
Submitted 5 June, 2017; v1 submitted 13 April, 2017;
originally announced April 2017.
-
An improved lower bound for Folkman's theorem
Authors:
József Balogh,
Sean Eberhard,
Bhargav Narayanan,
Andrew Treglown,
Adam Zsolt Wagner
Abstract:
Folkman's Theorem asserts that for each $k \in \mathbb{N}$, there exists a natural number $n = F(k)$ such that whenever the elements of $[n]$ are two-coloured, there exists a set $A \subset [n]$ of size $k$ with the property that all the sums of the form $\sum_{x \in B} x$, where $B$ is a nonempty subset of $A$, are contained in $[n]$ and have the same colour. In 1989, Erdős and Spencer showed tha…
▽ More
Folkman's Theorem asserts that for each $k \in \mathbb{N}$, there exists a natural number $n = F(k)$ such that whenever the elements of $[n]$ are two-coloured, there exists a set $A \subset [n]$ of size $k$ with the property that all the sums of the form $\sum_{x \in B} x$, where $B$ is a nonempty subset of $A$, are contained in $[n]$ and have the same colour. In 1989, Erdős and Spencer showed that $F(k) \ge 2^{ck^2/ \log k}$, where $c >0$ is an absolute constant; here, we improve this bound significantly by showing that $F(k) \ge 2^{2^{k-1}/k}$ for all $k\in \mathbb{N}$.
△ Less
Submitted 5 June, 2017; v1 submitted 7 March, 2017;
originally announced March 2017.
-
Perovskite Quantum Organismoids
Authors:
Fan Zuo,
Priyadarshini Panda,
Michele Kotiuga,
Jiarui Li,
Min Gu Kang,
Claudio Mazzoli,
Hua Zhou,
Andi Barbour,
Stuart Wilkins,
Badri Narayanan,
Mathew Cherukara,
Zhen Zhang,
Subramanian K. R. S. Sankaranarayanan,
Riccardo Comin,
Karin M. Rabe,
Kaushik Roy,
Shriram Ramanathan
Abstract:
A central characteristic of living beings is the ability to learn from and respond to their environment leading to habit formation and decision making1-3. This behavior, known as habituation, is universal among forms of life with a central nervous system, and interestingly observed even in single cellular organisms that do not possess a brain4-5. Here, we report the discovery of habituation based…
▽ More
A central characteristic of living beings is the ability to learn from and respond to their environment leading to habit formation and decision making1-3. This behavior, known as habituation, is universal among forms of life with a central nervous system, and interestingly observed even in single cellular organisms that do not possess a brain4-5. Here, we report the discovery of habituation based plasticity utilizing a perovskite quantum system by dynamical modulation of electron localization via reversible dopant incorporation. Microscopic mechanisms and pathways that enable this organismic collective charge-lattice interaction are elucidated by a combination of first-principles theory, synchrotron investigations, ab-initio dynamical simulations and in-situ environmental breathing studies. We implement a new learning algorithm inspired from the conductance relaxation behavior of perovskites that naturally incorporates habituation and demonstrate "learning to forget": a key feature of animal and human brains6. Most surprisingly, our results show that incorporating this elementary skill in learning dramatically boosts the capability of artificial cognitive systems.
△ Less
Submitted 3 March, 2017;
originally announced March 2017.