-
Arborescences and Shortest Path Trees when Colors Matter
Authors:
P. S. Ardra,
Jasine Babu,
Kritika Kashyap,
R. Krithika,
Sreejith K. Pallathumadam,
Deepak Rajendraprasad
Abstract:
Color-constrained subgraph problems are those where we are given an edge-colored (directed or undirected) graph and the task is to find a specific type of subgraph, like a spanning tree, an arborescence, a single-source shortest path tree, a perfect matching etc., with constraints on the number of edges of each color. Some of these problems, like color-constrained spanning tree, have elegant solut…
▽ More
Color-constrained subgraph problems are those where we are given an edge-colored (directed or undirected) graph and the task is to find a specific type of subgraph, like a spanning tree, an arborescence, a single-source shortest path tree, a perfect matching etc., with constraints on the number of edges of each color. Some of these problems, like color-constrained spanning tree, have elegant solutions and some of them, like color-constrained perfect matching, are longstanding open questions. In this work, we study color-constrained arborescences and shortest path trees. Computing a color-constrained shortest path tree on weighted digraphs turns out to be NP-hard in general but polynomial-time solvable when all cycles have positive weight. This polynomial-time solvability is due to the fact that the solution space is essentially the set of all color-constrained arborescences of a directed acyclic subgraph of the original graph. While finding color-constrained arborescence of digraphs is NP-hard in general, we give efficient algorithms when the input graph is acyclic. Consequently, a color-constrained shortest path tree on weighted digraphs having only positive weight cycles can be efficiently computed. Our algorithms also generalize to the problem of finding a color-constrained shortest path tree with minimum total weight. En route, we sight nice connections to colored matroids and color-constrained bases.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
What Values Do ImageNet-trained Classifiers Enact?
Authors:
Will Penman,
Joshua Babu,
Abhinaya Raghunathan
Abstract:
We identify "values" as actions that classifiers take that speak to open questions of significant social concern. Investigating a classifier's values builds on studies of social bias that uncover how classifiers participate in social processes beyond their creators' forethought. In our case, this participation involves what counts as nutritious, what it means to be modest, and more. Unlike AI soci…
▽ More
We identify "values" as actions that classifiers take that speak to open questions of significant social concern. Investigating a classifier's values builds on studies of social bias that uncover how classifiers participate in social processes beyond their creators' forethought. In our case, this participation involves what counts as nutritious, what it means to be modest, and more. Unlike AI social bias, however, a classifier's values are not necessarily morally loathsome. Attending to image classifiers' values can facilitate public debate and introspection about the future of society. To substantiate these claims, we report on an extensive examination of both ImageNet training/validation data and ImageNet-trained classifiers with custom testing data. We identify perceptual decision boundaries in 118 categories that address open questions in society, and through quantitative testing of rival datasets we find that ImageNet-trained classifiers enact at least 7 values through their perceptual decisions. To contextualize these results, we develop a conceptual framework that integrates values, social bias, and accuracy, and we describe a rhetorical method for identifying how context affects the values that a classifier enacts. We also discover that classifier performance does not straightforwardly reflect the proportions of subgroups in a training set. Our findings bring a rich sense of the social world to ML researchers that can be applied to other domains beyond computer vision.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Structural and double magnetic transitions in the frustrated spin-$\frac{1}{2}$ capped-kagome antiferromagnet (RbCl)Cu$_{5}$P$_{2}$O$_{10}$
Authors:
S. Mohanty,
J. Babu,
Y. Furukawa,
R. Nath
Abstract:
The structural and magnetic properties of the geometrically frustrated spin-$1/2$ capped-kagome antiferromagnet (RbCl)Cu$_{5}$P$_{2}$O$_{10}$ are investigated via temperature dependent x-ray diffraction, magnetization, heat capacity, and $^{31}$P NMR experiments on a polycrystalline sample. It undergoes a structural transition at around $T_{\rm t} \simeq 310$ K from a high temperature trigonal (…
▽ More
The structural and magnetic properties of the geometrically frustrated spin-$1/2$ capped-kagome antiferromagnet (RbCl)Cu$_{5}$P$_{2}$O$_{10}$ are investigated via temperature dependent x-ray diffraction, magnetization, heat capacity, and $^{31}$P NMR experiments on a polycrystalline sample. It undergoes a structural transition at around $T_{\rm t} \simeq 310$ K from a high temperature trigonal ($P\bar{3}m1$) to a low temperature monoclinic ($C2/c$) unit cell, where the low temperature structure features the capped-kagome geometry of Cu$^{2+}$ ions. Interestingly, it shows the onset of two successive magnetic transitions at $T_{\rm N1} \simeq 20$ K and $T_{\rm N2} \simeq 7$ K. The shape of the $^{31}$P NMR spectra unfold the possible nature of the transitions below $T_{\rm N1}$ and $T_{\rm N2}$ to be of incommensurate and commensurate antiferromagnetic type, respectively. A large value of the Curie-Weiss temperature as compared to $T_{\rm N1}$ sets the frustration parameter $f \simeq 8$, ensuring strong magnetic frustration in the compound. From the $^{31}$P NMR spin-lattice relaxation rate, the leading antiferromagnetic exchange coupling is estimated to be $J/k_{\rm B} \simeq 117$ K. These unusual double magnetic transitions make this compound beguiling for further investigations.
△ Less
Submitted 22 September, 2023; v1 submitted 23 July, 2023;
originally announced July 2023.
-
Microscopic characterization of the magnetic properties of the itinerant antiferromagnet La2Ni7 by 139La NMR/NQR measurements
Authors:
Q. -P. Ding,
J. Babu,
K. Rana,
Y. Lee,
S. L. Bud'ko,
R. A. Ribeiro,
P. C. Canfield,
Y. Furukawa
Abstract:
139La nuclear magnetic resonance (NMR) and nuclear quadrupole resonance (NQR) measurements have been performed to investigate the magnetic properties of the itinerant magnet La2Ni7 which shows a series of antiferromagnetic (AFM) phase transitions at $T_{N1}$=61 K, $T_{N2}$=56 K, and $T_{N3}$=42 K under zero magnetic field. Two distinct La NMR signals were observed due to the two crystallographical…
▽ More
139La nuclear magnetic resonance (NMR) and nuclear quadrupole resonance (NQR) measurements have been performed to investigate the magnetic properties of the itinerant magnet La2Ni7 which shows a series of antiferromagnetic (AFM) phase transitions at $T_{N1}$=61 K, $T_{N2}$=56 K, and $T_{N3}$=42 K under zero magnetic field. Two distinct La NMR signals were observed due to the two crystallographically inequivalent La sites in La2Ni7 (La1 and La2 in the La2Ni4 and the LaNi5 sub-units of the La2Ni7 unit cell, respectively). From the 139La NQR spectrum in the AFM state below $T_{N3}$, the AFM state was revealed to be a commensurate state where Ni ordered moments align along the crystalline c axis. Owing to the two different La sites, we were able to estimate the average values of the Ni ordered moments ($\sim$0.09-0.10 $μ_{B}$/Ni and $\sim$0.17$μ_{B}$/Ni around La1 and La2, respectively) from 139La NMR spectrum measurements in the AFM state below $T_{N3}$, suggesting a non-uniform distribution of the Ni-ordered moments in the AFM state. In contrast, a more uniform distribution of the Ni-ordered moments in the saturated paramagnetic state induced by the application of high magnetic fields is observed. The temperature dependence of the sublattice magnetization measured by the internal field at the La2 site in the AFM state was reproduced by a local moment model better than the self-consistent renormalization (SCR) theory for weak itinerant antiferromagnets. Given the small Ni-ordered moments in the magnetically ordered state, our results suggest that La2Ni7 has characteristics of both itinerant and localized natures in its magnetism. With this in mind, it is noteworthy that the temperature dependence of nuclear spin-relaxation rates in the paramagnetic state above $T_{N1}$ measured at zero magnetic field can be explained qualitatively by both the SCR theory and the local-moment model.
△ Less
Submitted 31 July, 2023; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Locally nilpotent derivations on $\mathbb{A}^2$-fibrations with $\mathbb{A}^1$-fibration kernels
Authors:
Janaki Raman Babu,
Prosenjit Das,
Animesh Lahiri
Abstract:
In this paper, we give a characterization of locally nilpotent derivations on $\mathbb{A}^2$-fibrations over Noetherian domains containing $\mathbb{Q}$ having kernel isomorphic to an $\mathbb{A}^1$-fibration.
In this paper, we give a characterization of locally nilpotent derivations on $\mathbb{A}^2$-fibrations over Noetherian domains containing $\mathbb{Q}$ having kernel isomorphic to an $\mathbb{A}^1$-fibration.
△ Less
Submitted 21 February, 2023; v1 submitted 8 December, 2022;
originally announced December 2022.
-
A criterion to determine residual coordinates of $\mathbb{A}^2$-fibrations
Authors:
Janaki Raman Babu,
Prosenjit Das
Abstract:
This article discusses a criterion to determine residual variables of an $\mathbb{A}^2$-fibration over a Noetherian domain containing $\mathbb{Q}$.
This article discusses a criterion to determine residual variables of an $\mathbb{A}^2$-fibration over a Noetherian domain containing $\mathbb{Q}$.
△ Less
Submitted 14 February, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Rank and rigidity of locally nilpotent derivations of affine fibrations
Authors:
Janaki Raman Babu,
Prosenjit Das,
Swapnil A. Lokhande
Abstract:
In this exposition, we propose a notion of rank and rigidity of locally nilpotent derivations of affine fibrations. We show that the concept is analogous to the perception of rank and rigidity of locally nilpotent derivations of polynomial algebras. Our results characterize locally nilpotent derivations of $\mathbb{A}^3$-fibrations having slice by classifying the fixed point free locally nilpotent…
▽ More
In this exposition, we propose a notion of rank and rigidity of locally nilpotent derivations of affine fibrations. We show that the concept is analogous to the perception of rank and rigidity of locally nilpotent derivations of polynomial algebras. Our results characterize locally nilpotent derivations of $\mathbb{A}^3$-fibrations having slice by classifying the fixed point free locally nilpotent derivations in terms of their ranks.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
Investigating the impact of BTI, HCI and time-zero variability on neuromorphic spike event generation circuits
Authors:
Shaik Jani Babu,
Rohit Singh,
Siona Menezes Picardo,
Nilesh Goel,
Sonal Singhal
Abstract:
Neuromorphic computing refers to brain-inspired computers, that differentiate it from von Neumann architecture. Analog VLSI based neuromorphic circuits is a current research interest. Two simpler spiking integrate and fire neuron model namely axon-Hillock (AH) and voltage integrate, and fire (VIF) circuits are commonly used for generating spike events. This paper discusses the impact of reliabilit…
▽ More
Neuromorphic computing refers to brain-inspired computers, that differentiate it from von Neumann architecture. Analog VLSI based neuromorphic circuits is a current research interest. Two simpler spiking integrate and fire neuron model namely axon-Hillock (AH) and voltage integrate, and fire (VIF) circuits are commonly used for generating spike events. This paper discusses the impact of reliability issues like Bias Temperature instability (BTI) and Hot Carrier Injection (HCI), and timezero variability on these CMOS based neuromorphic circuits. AH and VIF circuits are implemented using HKMG based 45nm technology. For reliability analysis, industry standard Cadence RelXpert tool is used. For time-zero variability analysis, 1000 Monte-Carlo simulations are performed.
△ Less
Submitted 19 May, 2022;
originally announced May 2022.
-
Eternal vertex cover number of maximal outerplanar graphs
Authors:
Jasine Babu,
K. Murali Krishnan,
Veena Prabhakaran,
Nandini J. Warrier
Abstract:
Eternal vertex cover problem is a variant of the classical vertex cover problem modeled as a two player attacker-defender game. Computing eternal vertex cover number of graphs is known to be NP-hard in general and the complexity status of the problem for bipartite graphs is open. There is a quadratic complexity algorithm known for this problem for chordal graphs. Maximal outerplanar graphs forms a…
▽ More
Eternal vertex cover problem is a variant of the classical vertex cover problem modeled as a two player attacker-defender game. Computing eternal vertex cover number of graphs is known to be NP-hard in general and the complexity status of the problem for bipartite graphs is open. There is a quadratic complexity algorithm known for this problem for chordal graphs. Maximal outerplanar graphs forms a subclass of chordal graphs, for which no algorithm of sub-quadratic time complexity is known. In this paper, we obtain a recursive algorithm of linear time for computing eternal vertex cover number of maximal outerplanar graphs.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Incorporating Measurement Error in Astronomical Object Classification
Authors:
Sarah Shy,
Hyungsuk Tak,
Eric D. Feigelson,
John D. Timlin,
G. Jogesh Babu
Abstract:
Most general-purpose classification methods, such as support-vector machine (SVM) and random forest (RF), fail to account for an unusual characteristic of astronomical data: known measurement error uncertainties. In astronomical data, this information is often given in the data but discarded because popular machine learning classifiers cannot incorporate it. We propose a simulation-based approach…
▽ More
Most general-purpose classification methods, such as support-vector machine (SVM) and random forest (RF), fail to account for an unusual characteristic of astronomical data: known measurement error uncertainties. In astronomical data, this information is often given in the data but discarded because popular machine learning classifiers cannot incorporate it. We propose a simulation-based approach that incorporates heteroscedastic measurement error into existing classification method to better quantify uncertainty in classification. The proposed method first simulates perturbed realizations of the data from a Bayesian posterior predictive distribution of a Gaussian measurement error model. Then, a chosen classifier is fit to each simulation. The variation across the simulations naturally reflects the uncertainty propagated from the measurement errors in both labeled and unlabeled data sets. We demonstrate the use of this approach via two numerical studies. The first is a thorough simulation study applying the proposed procedure to SVM and RF, which are well-known hard and soft classifiers, respectively. The second study is a realistic classification problem of identifying high-$z$ $(2.9 \leq z \leq 5.1)$ quasar candidates from photometric data. The data are from merged catalogs of the Sloan Digital Sky Survey, the $Spitzer$ IRAC Equatorial Survey, and the $Spitzer$-HETDEX Exploratory Large-Area Survey. The proposed approach reveals that out of 11,847 high-$z$ quasar candidates identified by a random forest without incorporating measurement error, 3,146 are potential misclassifications with measurement error. Additionally, out of $1.85$ million objects not identified as high-$z$ quasars without measurement error, 936 can be considered new candidates with measurement error.
△ Less
Submitted 2 May, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Improved Bounds for the Oriented Radius of Mixed Multigraphs
Authors:
Jasine Babu,
Deepu Benson,
Deepak Rajendraprasad
Abstract:
A mixed multigraph is a multigraph which may contain both undirected and directed edges. An orientation of a mixed multigraph $G$ is an assignment of exactly one direction to each undirected edge of $G$. A mixed multigraph $G$ can be oriented to a strongly connected digraph if and only if $G$ is bridgeless and strongly connected [Boesch and Tindell, Am. Math. Mon., 1980]. For each…
▽ More
A mixed multigraph is a multigraph which may contain both undirected and directed edges. An orientation of a mixed multigraph $G$ is an assignment of exactly one direction to each undirected edge of $G$. A mixed multigraph $G$ can be oriented to a strongly connected digraph if and only if $G$ is bridgeless and strongly connected [Boesch and Tindell, Am. Math. Mon., 1980]. For each $r \in \mathbb{N}$, let $f(r)$ denote the smallest number such that any strongly connected bridgeless mixed multigraph with radius $r$ can be oriented to a digraph of radius at most $f(r)$. We improve the current best upper bound of $4r^2+4r$ on $f(r)$ [Chung, Garey and Tarjan, Networks, 1985] to $1.5 r^2 + r + 1$. Our upper bound is tight upto a multiplicative factor of $1.5$ since, $\forall r \in \mathbb{N}$, there exists an undirected bridgeless graph of radius $r$ such that every orientation of it has radius at least $r^2 + r$ [Chvátal and Thomassen, J. Comb. Theory. Ser. B., 1978]. We prove a marginally better lower bound, $f(r) \geq r^2 + 3r + 1$, for mixed multigraphs. While this marginal improvement does not help with asymptotic estimates, it clears a natural suspicion that, like undirected graphs, $f(r)$ may be equal to $r^2 + r$ even for mixed multigraphs. En route, we show that if each edge of $G$ lies in a cycle of length at most $η$, then the oriented radius of $G$ is at most $1.5 r η$. All our proofs are constructive and lend themselves to polynomial time algorithms.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
A Statistician Teaches Deep Learning
Authors:
G. Jogesh Babu,
David Banks,
Hyunsoon Cho,
David Han,
Hailin Sang,
Shouyi Wang
Abstract:
Deep learning (DL) has gained much attention and become increasingly popular in modern data science. Computer scientists led the way in develo** deep learning techniques, so the ideas and perspectives can seem alien to statisticians. Nonetheless, it is important that statisticians become involved -- many of our students need this expertise for their careers. In this paper, developed as part of a…
▽ More
Deep learning (DL) has gained much attention and become increasingly popular in modern data science. Computer scientists led the way in develo** deep learning techniques, so the ideas and perspectives can seem alien to statisticians. Nonetheless, it is important that statisticians become involved -- many of our students need this expertise for their careers. In this paper, developed as part of a program on DL held at the Statistical and Applied Mathematical Sciences Institute, we address this culture gap and provide tips on how to teach deep learning to statistics graduate students. After some background, we list ways in which DL and statistical perspectives differ, provide a recommended syllabus that evolved from teaching two iterations of a DL graduate course, offer examples of suggested homework assignments, give an annotated list of teaching resources, and discuss DL in the context of two research areas.
△ Less
Submitted 3 February, 2021; v1 submitted 28 January, 2021;
originally announced February 2021.
-
Ultrafast Insight into High energy (C, D) Excitons in Few Layer WS2
Authors:
Tanmay Goswami,
Himanshu Bhatt,
K. Justice Babu,
Gurpreet Kaur,
Nandan Ghorai,
Hirendra N. Ghosh
Abstract:
High energy (C, D) excitons possess remarkable influence over the optical properties of layered transition metal dichalcogenides (TMDCs) and comprehensive understanding of these may have revolutionary effect on 2D opto-electronic devices. Herein, we employed transient absorption spectroscopy to monitor the underlying photo-physical processes involved with C, D excitons in few layer WS2. We observe…
▽ More
High energy (C, D) excitons possess remarkable influence over the optical properties of layered transition metal dichalcogenides (TMDCs) and comprehensive understanding of these may have revolutionary effect on 2D opto-electronic devices. Herein, we employed transient absorption spectroscopy to monitor the underlying photo-physical processes involved with C, D excitons in few layer WS2. We observed a strong inter-valley coupling across the momentum space. C, D dynamics were significantly slower as compared to canonical A, B excitons, as a consequence of the indirect Lambda-Gamma relaxation in C, D, unlike K-K direct combination in A, B. Optical behaviour of D excitons was found to be more like A, B, contrary to C, which enjoy unique band nesting effects. Also, C excitons do not hold in any specific position of the momentum space, rather depends upon the photon energy. All these excitons immensely influence each other irrespective of the excitation energy.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
21st Century Statistical and Computational Challenges in Astrophysics
Authors:
Eric D. Feigelson,
Rafael S. de Souza,
Emille E. O. Ishida,
Gutti Jogesh Babu
Abstract:
Modern astronomy has been rapidly increasing our ability to see deeper into the universe, acquiring enormous samples of cosmic populations. Gaining astrophysical insights from these datasets requires a wide range of sophisticated statistical and machine learning methods. Long-standing problems in cosmology include characterization of galaxy clustering and estimation of galaxy distances from photom…
▽ More
Modern astronomy has been rapidly increasing our ability to see deeper into the universe, acquiring enormous samples of cosmic populations. Gaining astrophysical insights from these datasets requires a wide range of sophisticated statistical and machine learning methods. Long-standing problems in cosmology include characterization of galaxy clustering and estimation of galaxy distances from photometric colors. Bayesian inference, central to linking astronomical data to nonlinear astrophysical models, addresses problems in solar physics, properties of star clusters, and exoplanet systems. Likelihood-free methods are growing in importance. Detection of faint signals in complicated noise is needed to find periodic behaviors in stars and detect explosive gravitational wave events. Open issues concern treatment of heteroscedastic measurement errors and understanding probability distributions characterizing astrophysical systems. The field of astrostatistics needs increased collaboration with statisticians in the design and analysis stages of research projects, and to jointly develop new statistical methodologies. Together, they will draw more astrophysical insights into astronomical populations and the cosmos itself.
△ Less
Submitted 26 May, 2020;
originally announced May 2020.
-
A Linear Time Algorithm for Computing the Eternal Vertex Cover Number of Cactus Graphs
Authors:
Jasine Babu,
Veena Prabhakaran,
Arko Sharma
Abstract:
The eternal vertex cover problem is a dynamic variant of the classical vertex cover problem. It is NP-hard to compute the eternal vertex cover number of graphs and known algorithmic results for the problem are very few. This paper presents a linear time recursive algorithm for computing the eternal vertex cover number of cactus graphs. Unlike other graph classes for which polynomial time algorithm…
▽ More
The eternal vertex cover problem is a dynamic variant of the classical vertex cover problem. It is NP-hard to compute the eternal vertex cover number of graphs and known algorithmic results for the problem are very few. This paper presents a linear time recursive algorithm for computing the eternal vertex cover number of cactus graphs. Unlike other graph classes for which polynomial time algorithms for eternal vertex cover number are based on efficient computability of a known lower bound directly derived from minimum vertex cover, we show that it is a certain substructure property that helps the efficient computation of eternal vertex cover number of cactus graphs. An extension of the result to graphs in which each block is an edge, a cycle or a biconnected chordal graph is also presented.
△ Less
Submitted 16 May, 2020;
originally announced May 2020.
-
A Note on Arc-Disjoint Cycles in Bipartite Tournaments
Authors:
Jasine Babu,
Ajay Saju Jacob,
R. Krithika,
Deepak Rajendraprasad
Abstract:
We show that for each non-negative integer k, every bipartite tournament either contains k arc-disjoint cycles or has a feedback arc set of size at most 7(k - 1).
We show that for each non-negative integer k, every bipartite tournament either contains k arc-disjoint cycles or has a feedback arc set of size at most 7(k - 1).
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
Structure of $\mathbb{A}^2$-fibrations Having Fixed Point Free Locally Nilpotent Derivations
Authors:
Janaki Raman Babu,
Prosenjit Das
Abstract:
In this article, we show that a fixed point free locally nilpotent derivation of an $\mathbb{A}^2$-fibration over a Noetherian ring containing $\mathbb{Q}$ has slice.
In this article, we show that a fixed point free locally nilpotent derivation of an $\mathbb{A}^2$-fibration over a Noetherian ring containing $\mathbb{Q}$ has slice.
△ Less
Submitted 14 July, 2020; v1 submitted 21 January, 2020;
originally announced January 2020.
-
An Improvement to Chvátal and Thomassen's Upper Bound for Oriented Diameter
Authors:
Jasine Babu,
Deepu Benson,
Deepak Rajendraprasad,
Sai Nishant Vaka
Abstract:
An orientation of an undirected graph $G$ is an assignment of exactly one direction to each edge of $G$. The oriented diameter of a graph $G$ is the smallest diameter among all the orientations of $G$. The maximum oriented diameter of a family of graphs $\mathscr{F}$ is the maximum oriented diameter among all the graphs in $\mathscr{F}$. Chvátal and Thomassen [JCTB, 1978] gave a lower bound of…
▽ More
An orientation of an undirected graph $G$ is an assignment of exactly one direction to each edge of $G$. The oriented diameter of a graph $G$ is the smallest diameter among all the orientations of $G$. The maximum oriented diameter of a family of graphs $\mathscr{F}$ is the maximum oriented diameter among all the graphs in $\mathscr{F}$. Chvátal and Thomassen [JCTB, 1978] gave a lower bound of $\frac{1}{2}d^2+d$ and an upper bound of $2d^2+2d$ for the maximum oriented diameter of the family of $2$-edge connected graphs of diameter $d$. We improve this upper bound to $ 1.373 d^2 + 6.971d-1 $, which outperforms the former upper bound for all values of $d$ greater than or equal to $8$. For the family of $2$-edge connected graphs of diameter $3$, Kwok, Liu and West [JCTB, 2010] obtained improved lower and upper bounds of $9$ and $11$ respectively. For the family of $2$-edge connected graphs of diameter $4$, the bounds provided by Chvátal and Thomassen are $12$ and $40$ and no better bounds were known. By extending the method we used for diameter $d$ graphs, along with an asymmetric extension of a technique used by Chvátal and Thomassen, we have improved this upper bound to $21$.
△ Less
Submitted 29 January, 2020; v1 submitted 10 January, 2020;
originally announced January 2020.
-
Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era
Authors:
Brian Nord,
Andrew J. Connolly,
Jamie Kinney,
Jeremy Kubica,
Gautaum Narayan,
Joshua E. G. Peek,
Chad Schafer,
Erik J. Tollerud,
Camille Avestruz,
G. Jogesh Babu,
Simon Birrer,
Douglas Burke,
João Caldeira,
Douglas A. Caldwell,
Joleen K. Carlberg,
Yen-Chi Chen,
Chuanfei Dong,
Eric D. Feigelson,
V. Zach Golkhou,
Vinay Kashyap,
T. S. Li,
Thomas Loredo,
Luisa Lucie-Smith,
Kaisey S. Mandel,
J. R. Martínez-Galarza
, et al. (13 additional authors not shown)
Abstract:
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our…
▽ More
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).
△ Less
Submitted 4 November, 2019;
originally announced November 2019.
-
A new lower bound for eternal vertex cover number
Authors:
Jasine Babu,
Veena Prabhakaran
Abstract:
We obtain a new lower bound for the eternal vertex cover number of an arbitrary graph $G$, in terms of the cardinality of a vertex cover of minimum size in $G$ containing all its cut vertices. The consequences of the lower bound includes a quadratic time algorithm for computing the eternal vertex cover number of chordal graphs.
We obtain a new lower bound for the eternal vertex cover number of an arbitrary graph $G$, in terms of the cardinality of a vertex cover of minimum size in $G$ containing all its cut vertices. The consequences of the lower bound includes a quadratic time algorithm for computing the eternal vertex cover number of chordal graphs.
△ Less
Submitted 5 July, 2020; v1 submitted 11 October, 2019;
originally announced October 2019.
-
Advanced Astrophysics Discovery Technology in the Era of Data Driven Astronomy
Authors:
Richard K. Barry,
Jogesh G. Babu,
John G. Baker,
Eric D. Feigelson,
Amanpreet Kaur,
Alan J. Kogut,
Steven B. Kraemer,
James P. Mason,
Piyush Mehrotra,
Gregory Olmschenk,
Jeremy D. Schnittman,
Amalie Stokholm,
Eric R. Switzer,
Brian A. Thomas,
Raymond J. Walker
Abstract:
Experience suggests that structural issues in how institutional Astrophysics approaches data-driven science and the development of discovery technology may be hampering the community's ability to respond effectively to a rapidly changing environment in which increasingly complex, heterogeneous datasets are challenging our existing information infrastructure and traditional approaches to analysis.…
▽ More
Experience suggests that structural issues in how institutional Astrophysics approaches data-driven science and the development of discovery technology may be hampering the community's ability to respond effectively to a rapidly changing environment in which increasingly complex, heterogeneous datasets are challenging our existing information infrastructure and traditional approaches to analysis. We stand at the confluence of a new epoch of multimessenger science, remote co-location of data and processing power and new observing strategies based on miniaturized spacecraft. Significant effort will be required by the community to adapt to this rapidly evolving range of possible discovery moduses. In the suggested creation of a new Astrophysics element, Advanced Astrophysics Discovery Technology, we offer an affirmative solution that places the visibility of discovery technologies at a level that we suggest is fully commensurate with their importance to the future of the field.
△ Less
Submitted 24 July, 2019;
originally announced July 2019.
-
A local characterization for perfect plane near-triangulations
Authors:
Sameera M. Salam,
Jasine Babu,
K. Murali Krishnan
Abstract:
We derive a local criterion for a plane near-triangulated graph to be perfect. It is shown that a plane near-triangulated graph is perfect if and only if it does not contain either a vertex, an edge or a triangle, the neighbourhood of which has an odd hole as its boundary. The characterization leads to an $O(n^2)$ algorithm for checking perfectness of plane near-triangulations.
We derive a local criterion for a plane near-triangulated graph to be perfect. It is shown that a plane near-triangulated graph is perfect if and only if it does not contain either a vertex, an edge or a triangle, the neighbourhood of which has an odd hole as its boundary. The characterization leads to an $O(n^2)$ algorithm for checking perfectness of plane near-triangulations.
△ Less
Submitted 7 July, 2020; v1 submitted 14 June, 2019;
originally announced June 2019.
-
AutoRegressive Planet Search: Application to the Kepler Mission
Authors:
Gabriel A. Caceres,
Eric D. Feigelson,
G. Jogesh Babu,
Natalia Bahamonde,
Alejandra Christen,
Karine Bertin,
Cristian Meza,
Michel Curé
Abstract:
The 4-year light curves of 156,717 stars observed with NASA's Kepler mission are analyzed using the AutoRegressive Planet Search (ARPS) methodology described by Caceres et al. (2019). The three stages of processing are: maximum likelihood ARIMA modeling of the light curves to reduce stellar brightness variations; constructing the Transit Comb Filter periodogram to identify transit-like periodic di…
▽ More
The 4-year light curves of 156,717 stars observed with NASA's Kepler mission are analyzed using the AutoRegressive Planet Search (ARPS) methodology described by Caceres et al. (2019). The three stages of processing are: maximum likelihood ARIMA modeling of the light curves to reduce stellar brightness variations; constructing the Transit Comb Filter periodogram to identify transit-like periodic dips in the ARIMA residuals; Random Forest classification trained on Kepler Team confirmed planets using several dozen features from the analysis. Orbital periods between 0.2 and 100 days are examined. The result is a recovery of 76% of confirmed planets, 97% when period and transit depth constraints are added. The classifier is then applied to the full Kepler dataset; 1,004 previously noticed and 97 new stars have light curve criteria consistent with the confirmed planets, after subjective vetting removes clear False Alarms and False Positive cases. The 97 Kepler ARPS Candidate Transits mostly have periods $P<10$ days; many are UltraShort Period hot planets with radii $<1$% of the host star. Extensive tabular and graphical output from the ARPS time series analysis is provided to assist in other research relating to the Kepler sample.
△ Less
Submitted 23 May, 2019;
originally announced May 2019.
-
Autoregressive Times Series Methods for Time Domain Astronomy
Authors:
Eric D. Feigelson,
G. Jogesh Babu,
Gabriel A. Caceres
Abstract:
Celestial objects exhibit a wide range of variability in brightness at different wavebands. Surprisingly, the most common methods for characterizing time series in statistics -- parametric autoregressive modeling -- is rarely used to interpret astronomical light curves. We review standard ARMA, ARIMA and ARFIMA (autoregressive moving average fractionally integrated) models that treat short-memory…
▽ More
Celestial objects exhibit a wide range of variability in brightness at different wavebands. Surprisingly, the most common methods for characterizing time series in statistics -- parametric autoregressive modeling -- is rarely used to interpret astronomical light curves. We review standard ARMA, ARIMA and ARFIMA (autoregressive moving average fractionally integrated) models that treat short-memory autocorrelation, long-memory $1/f^α$ `red noise', and nonstationary trends. Though designed for evenly spaced time series, moderately irregular cadences can be treated as evenly-spaced time series with missing data. Fitting algorithms are efficient and software implementations are widely available. We apply ARIMA models to light curves of four variable stars, discussing their effectiveness for different temporal characteristics. A variety of extensions to ARIMA are outlined, with emphasis on recently developed continuous-time models like CARMA and CARFIMA designed for irregularly spaced time series. Strengths and weakness of ARIMA-type modeling for astronomical data analysis and astrophysical insights are reviewed.
△ Less
Submitted 23 January, 2019;
originally announced January 2019.
-
AutoRegressive Planet Search: Methodology
Authors:
Gabriel A. Caceres,
Eric D. Feigelson,
G. Jogesh Babu,
Natalia Bahamonde,
Alejandra Christen,
Karine Bertin,
Cristian Meza,
Michel Curé
Abstract:
The detection of periodic signals from transiting exoplanets is often impeded by extraneous aperiodic photometric variability, either intrinsic to the star or arising from the measurement process. Frequently, these variations are autocorrelated wherein later flux values are correlated with previous ones. In this work, we present the methodology of the Autoregessive Planet Search (ARPS) project whi…
▽ More
The detection of periodic signals from transiting exoplanets is often impeded by extraneous aperiodic photometric variability, either intrinsic to the star or arising from the measurement process. Frequently, these variations are autocorrelated wherein later flux values are correlated with previous ones. In this work, we present the methodology of the Autoregessive Planet Search (ARPS) project which uses Autoregressive Integrated Moving Average (ARIMA) and related statistical models that treat a wide variety of stochastic processes, as well as nonstationarity, to improve detection of new planetary transits. Providing a time series is evenly spaced or can be placed on an evenly spaced grid with missing values, these low-dimensional parametric models can prove very effective. We introduce a planet-search algorithm to detect periodic transits in the residuals after the application of ARIMA models. Our matched-filter algorithm, the Transit Comb Filter (TCF), is closely related to the traditional Box-fitting Least Squares and provides an analogous periodogram. Finally, if a previously identified or simulated sample of planets is available, selected scalar features from different stages of the analysis -- the original light curves, ARIMA fits, TCF periodograms, and folded light curves -- can be collectively used with a multivariate classifier to identify promising candidates while efficiently rejecting false alarms. We use Random Forests for this task, in conjunction with Receiver Operating Characteristic (ROC) curves, to define discovery criteria for new, high fidelity planetary candidates. The ARPS methodology can be applied to both evenly spaced satellite light curves and densely cadenced ground-based photometric surveys.
△ Less
Submitted 14 May, 2019; v1 submitted 15 January, 2019;
originally announced January 2019.
-
On Graphs whose Eternal Vertex Cover Number and Vertex Cover Number Coincide
Authors:
Jasine Babu,
L. Sunil Chandran,
Mathew Francis,
Veena Prabhakaran,
Deepak Rajendraprasad,
J. Nandini Warrier
Abstract:
The eternal vertex cover problem is a variant of the classical vertex cover problem where a set of guards on the vertices have to be dynamically reconfigured from one vertex cover to another in every round of an attacker-defender game. The minimum number of guards required to protect a graph $G$ from an infinite sequence of attacks is the eternal vertex cover number of $G$, denoted by $evc(G)$. It…
▽ More
The eternal vertex cover problem is a variant of the classical vertex cover problem where a set of guards on the vertices have to be dynamically reconfigured from one vertex cover to another in every round of an attacker-defender game. The minimum number of guards required to protect a graph $G$ from an infinite sequence of attacks is the eternal vertex cover number of $G$, denoted by $evc(G)$. It is known that, given a graph $G$ and an integer $k$, checking whether $evc(G) \le k$ is NP-hard. However, it is unknown whether this problem is in NP or not. Precise value of eternal vertex cover number is known only for certain very basic graph classes like trees, cycles and grids.
For any graph $G$, it is known that $mvc(G) \le evc(G) \le 2 mvc(G)$, where $mvc(G)$ is the minimum vertex cover number of $G$. Though a characterization is known for graphs for which $evc(G) = 2 mvc(G)$, a characterization of graphs for which $evc(G) = mvc(G)$ remained open. Here, we achieve such a characterization for a class of graphs that includes chordal graphs and internally triangulated planar graphs. For some graph classes including biconnected chordal graphs, our characterization leads to a polynomial time algorithm to precisely determine $evc(G)$ and to determine a safe strategy of guard movement in each round of the game with $evc(G)$ guards.
The characterization also leads to NP-completeness results for the eternal vertex cover problem for some graph classes including biconnected internally triangulated planar graphs. To the best of our knowledge, these are the first NP-completeness results known for the problem for any graph class.
△ Less
Submitted 30 April, 2019; v1 submitted 12 December, 2018;
originally announced December 2018.
-
Some Optimizations on Detecting Gravitational Wave Using Convolutional Neural Network
Authors:
Xiangru Li,
Woliang Yu,
Xilong Fan,
G. Jogesh Babu
Abstract:
This work investigates the problem of detecting gravitational wave (GW) events based on simulated damped sinusoid signals contaminated with white Gaussian noise. It is treated as a classification problem with one class for the interesting events. The proposed scheme consists of the following two successive steps: decomposing the data using a wavelet packet, representing the GW signal and noise usi…
▽ More
This work investigates the problem of detecting gravitational wave (GW) events based on simulated damped sinusoid signals contaminated with white Gaussian noise. It is treated as a classification problem with one class for the interesting events. The proposed scheme consists of the following two successive steps: decomposing the data using a wavelet packet, representing the GW signal and noise using the derived decomposition coefficients; and determining the existence of any GW event using a convolutional neural network (CNN) with a logistic regression output layer. The characteristics of this work is its comprehensive investigations on CNN structure, detection window width, data resolution, wavelet packet decomposition and detection window overlap scheme. Extensive simulation experiments show excellent performances for reliable detection of signals with a range of GW model parameters and signal-to-noise ratios. While we use a simple waveform model in this study, we expect the method to be particularly valuable when the potential GW shapes are too complex to be characterized with a template bank.
△ Less
Submitted 29 May, 2020; v1 submitted 1 December, 2017;
originally announced December 2017.
-
A fix-point characterization of Herbrand equivalence of expressions in data flow frameworks
Authors:
Jasine Babu,
K. Murali Krishnan,
Vineeth Paleri
Abstract:
The problem of determining Herbrand equivalence of terms at each program point in a data flow framework is a central and well studied question in program analysis. Most of the well-known algorithms for the computation of Herbrand equivalence in data flow frameworks proceed via iterative fix-point computation on some abstract lattice of short expressions relevant to the given flow graph. However th…
▽ More
The problem of determining Herbrand equivalence of terms at each program point in a data flow framework is a central and well studied question in program analysis. Most of the well-known algorithms for the computation of Herbrand equivalence in data flow frameworks proceed via iterative fix-point computation on some abstract lattice of short expressions relevant to the given flow graph. However the mathematical definition of Herbrand equivalence is based on a meet over all path characterization over the (infinite) set of all possible expressions. The aim of this paper is to develop a lattice theoretic fix-point formulation of Herbrand equivalence on the (infinite) concrete lattice defined over the set of all terms constructible from variables, constants and operators of a program. The present characterization uses an axiomatic formulation of the notion of Herbrand congruence and defines the (infinite) concrete lattice of Herbrand congruences. Transfer functions and non-deterministic assignments are formulated as monotone functions over this concrete lattice. Herbrand equivalence is defined as the maximum fix point of a composite transfer function defined over an appropriate product lattice of the above concrete lattice. A re-formulation of the classical meet-over-all-paths definition of Herbrand equivalence in the above lattice theoretic framework is also presented and is proven to be equivalent to the new lattice theoretic fix-point characterization.
△ Less
Submitted 20 October, 2017; v1 submitted 16 August, 2017;
originally announced August 2017.
-
On Induced Colourful Paths in Triangle-free Graphs
Authors:
Jasine Babu,
Manu Basavaraju,
L. Sunil Chandran,
Mathew C. Francis
Abstract:
Given a graph $G=(V,E)$ whose vertices have been properly coloured, we say that a path in $G$ is "colourful" if no two vertices in the path have the same colour. It is a corollary of the Gallai-Roy-Vitaver Theorem that every properly coloured graph contains a colourful path on $χ(G)$ vertices. We explore a conjecture that states that every properly coloured triangle-free graph $G$ contains an indu…
▽ More
Given a graph $G=(V,E)$ whose vertices have been properly coloured, we say that a path in $G$ is "colourful" if no two vertices in the path have the same colour. It is a corollary of the Gallai-Roy-Vitaver Theorem that every properly coloured graph contains a colourful path on $χ(G)$ vertices. We explore a conjecture that states that every properly coloured triangle-free graph $G$ contains an induced colourful path on $χ(G)$ vertices and prove its correctness when the girth of $G$ is at least $χ(G)$. Recent work on this conjecture by Gyárfás and Sárközy, and Scott and Seymour has shown the existence of a function $f$ such that if $χ(G)\geq f(k)$, then an induced colourful path on $k$ vertices is guaranteed to exist in any properly coloured triangle-free graph $G$.
△ Less
Submitted 18 January, 2019; v1 submitted 20 April, 2016;
originally announced April 2016.
-
Excess Vibrational Density of States and the Brittle to Ductile Transition in Crystalline and Amorphous Solids
Authors:
Jeetu S. Babu,
Chandana Mondal,
Surajit Sengupta,
Smarajit Karmakar
Abstract:
The conditions which determine whether a material behaves in a brittle or ductile fashion on mechanical loading are still elusive and comprise a topic of active research among materials physicists and engineers. In this study, we present results of {\em in silico} mechanical deformation experiments from two very different model solids in two and three dimensions. The first consists of particles in…
▽ More
The conditions which determine whether a material behaves in a brittle or ductile fashion on mechanical loading are still elusive and comprise a topic of active research among materials physicists and engineers. In this study, we present results of {\em in silico} mechanical deformation experiments from two very different model solids in two and three dimensions. The first consists of particles interacting with isotropic potentials and the other has strongly direction dependent interactions. We show that in both cases, the excess vibrational density of states is the fundamental quantity which characterises the ductility of the material. Our results can be checked using careful experiments on colloidal solids.
△ Less
Submitted 1 September, 2015; v1 submitted 27 August, 2015;
originally announced August 2015.
-
Sublinear Approximation Algorithms for Boxicity and Related Problems
Authors:
Abhi** Adiga,
Jasine Babu,
L. Sunil Chandran
Abstract:
Boxicity of a graph G(V, E) is the minimum integer k such that G can be represented as the intersection graph of axis parallel boxes in $\mathbb{R}^k$. Cubicity is a variant of boxicity, where the axis parallel boxes in the intersection representation are restricted to be of unit length sides. Deciding whether boxicity (resp. cubicity) of a graph is at most k is NP-hard, even for k=2 or 3. Computi…
▽ More
Boxicity of a graph G(V, E) is the minimum integer k such that G can be represented as the intersection graph of axis parallel boxes in $\mathbb{R}^k$. Cubicity is a variant of boxicity, where the axis parallel boxes in the intersection representation are restricted to be of unit length sides. Deciding whether boxicity (resp. cubicity) of a graph is at most k is NP-hard, even for k=2 or 3. Computing these parameters is inapproximable within $O(n^{1 - ε})$-factor, for any $ε>0$ in polynomial time unless NP=ZPP, even for many simple graph classes.
In this paper, we give a polynomial time $κ(n)$ factor approximation algorithm for computing boxicity and a $κ(n)\lceil \log \log n\rceil$ factor approximation algorithm for computing the cubicity, where $κ(n) =2\left\lceil\frac{n\sqrt{\log \log n}}{\sqrt{\log n}}\right\rceil$. These o(n) factor approximation algorithms also produce the corresponding box (resp. cube) representations. As a special case, this resolves the question paused by Spinrad about polynomial time construction of o(n) dimensional box representations for boxicity 2 graphs. Other consequences of our approximation algorithm include $O(κ(n))$ factor approximation algorithms for computing the following parameters: the partial order dimension of finite posets, the interval dimension of finite posets, minimum chain cover of bipartite graphs, threshold dimension of split graphs and Ferrer's dimension of digraphs. Each of these parameters is inapproximable within an $O(n^{1 - ε})$-factor, for any $ε>0$ in polynomial time unless NP=ZPP and the algorithms we derive seem to be the first o(n) factor approximation algorithms known for all these problems.
△ Less
Submitted 7 June, 2015; v1 submitted 19 May, 2015;
originally announced May 2015.
-
Approximating the Cubicity of Trees
Authors:
Jasine Babu,
Manu Basavaraju,
L Sunil Chandran,
Deepak Rajendraprasad,
Naveen Sivadasan
Abstract:
Cubicity of a graph $G$ is the smallest dimension $d$, for which $G$ is a unit disc graph in ${\mathbb{R}}^d$, under the $l^\infty$ metric, i.e. $G$ can be represented as an intersection graph of $d$-dimensional (axis-parallel) unit hypercubes. We call such an intersection representation a $d$-dimensional cube representation of $G$. Computing cubicity is known to be inapproximable in polynomial ti…
▽ More
Cubicity of a graph $G$ is the smallest dimension $d$, for which $G$ is a unit disc graph in ${\mathbb{R}}^d$, under the $l^\infty$ metric, i.e. $G$ can be represented as an intersection graph of $d$-dimensional (axis-parallel) unit hypercubes. We call such an intersection representation a $d$-dimensional cube representation of $G$. Computing cubicity is known to be inapproximable in polynomial time, within an $O(n^{1-ε})$ factor for any $ε>0$, unless NP=ZPP.
In this paper, we present a randomized algorithm that runs in polynomial time and computes cube representations of trees, of dimension within a constant factor of the optimum. It is also shown that the cubicity of trees can be approximated within a constant factor in deterministic polynomial time, if the cube representation is not required to be computed. As far as we know, this is the first constant factor approximation algorithm for computing the cubicity of trees. It is not yet clear whether computing the cubicity of trees is NP-hard or not.
△ Less
Submitted 25 February, 2014;
originally announced February 2014.
-
VOStat: A Statistical Web Service for Astronomers
Authors:
Arnab Chakraborty,
Eric D. Feigelson,
G. Jogesh Babu
Abstract:
VOStat is a Web service providing interactive statistical analysis of astronomical tabular datasets. It is integrated into the suite of analysis and visualization tools associated with the international Virtual Observatory (VO) through the SAMP communication system. A user supplies VOStat with a dataset extracted from the VO, or otherwise acquired, and chooses among $\sim 60$ statistical functions…
▽ More
VOStat is a Web service providing interactive statistical analysis of astronomical tabular datasets. It is integrated into the suite of analysis and visualization tools associated with the international Virtual Observatory (VO) through the SAMP communication system. A user supplies VOStat with a dataset extracted from the VO, or otherwise acquired, and chooses among $\sim 60$ statistical functions. These include data transformations, plots and summaries, density estimation, one- and two-sample hypothesis tests, global and local regressions, multivariate analysis and clustering, spatial analysis, directional statistics, survival analysis (for censored data like upper limits), and time series analysis. The statistical operations are performed using the public domain {\bf R} statistical software environment, including a small fraction of its $>4000$ {\bf CRAN} add-on packages. The purpose of VOStat is to facilitate a wider range of statistical analyses than are commonly used in astronomy, and to promote use of more advanced methodology in {\bf R} and {\bf CRAN}.
△ Less
Submitted 2 February, 2013;
originally announced February 2013.
-
2-connecting Outerplanar Graphs without Blowing Up the Pathwidth
Authors:
Jasine Babu,
Manu Basavaraju,
L. Sunil Chandran,
Deepak Rajendraprasad
Abstract:
Given a connected outerplanar graph G of pathwidth p, we give an algorithm to add edges to G to get a supergraph of G, which is 2-vertex-connected, outerplanar and of pathwidth O(p). This settles an open problem raised by Biedl, in the context of computing minimum height planar straight line drawings of outerplanar graphs, with their vertices placed on a two dimensional grid. In conjunction with t…
▽ More
Given a connected outerplanar graph G of pathwidth p, we give an algorithm to add edges to G to get a supergraph of G, which is 2-vertex-connected, outerplanar and of pathwidth O(p). This settles an open problem raised by Biedl, in the context of computing minimum height planar straight line drawings of outerplanar graphs, with their vertices placed on a two dimensional grid. In conjunction with the result of this paper, the constant factor approximation algorithm for this problem obtained by Biedl for 2-vertex-connected outerplanar graphs will work for all outer planar graphs.
△ Less
Submitted 1 January, 2014; v1 submitted 27 December, 2012;
originally announced December 2012.
-
The Astrophysical Multimessenger Observatory Network (AMON)
Authors:
M. W. E. Smith,
D. B. Fox,
D. F. Cowen,
P. Mészáros,
G. Tešić,
J. Fixelle,
I. Bartos,
P. Sommers,
Abhay Ashtekar,
G. Jogesh Babu,
S. D. Barthelmy,
S. Coutu,
T. DeYoung,
A. D. Falcone,
L. S. Finn,
Shan Gao,
B. Hashemi,
A. Homeier,
S. Márka,
B. J. Owen,
I. Taboada
Abstract:
We summarize the science opportunity, design elements, current and projected partner observatories, and anticipated science returns of the Astrophysical Multimessenger Observatory Network (AMON). AMON will link multiple current and future high-energy, multimessenger, and follow-up observatories together into a single network, enabling near real-time coincidence searches for multimessenger astrophy…
▽ More
We summarize the science opportunity, design elements, current and projected partner observatories, and anticipated science returns of the Astrophysical Multimessenger Observatory Network (AMON). AMON will link multiple current and future high-energy, multimessenger, and follow-up observatories together into a single network, enabling near real-time coincidence searches for multimessenger astrophysical transients and their electromagnetic counterparts. Candidate and high-confidence multimessenger transient events will be identified, characterized, and distributed as AMON alerts within the network and to interested external observers, leading to follow-up observations across the electromagnetic spectrum. In this way, AMON aims to evoke the discovery of multimessenger transients from within observatory subthreshold data streams and facilitate the exploitation of these transients for purposes of astronomy and fundamental physics. As a central hub of global multimessenger science, AMON will also enable cross-collaboration analyses of archival datasets in search of rare or exotic astrophysical phenomena.
△ Less
Submitted 23 November, 2012;
originally announced November 2012.
-
Fixed-Orientation Equilateral Triangle Matching of Point Sets
Authors:
Jasine Babu,
Ahmad Biniaz,
Anil Maheshwari,
Michiel Smid
Abstract:
Given a point set $P$ and a class $\mathcal{C}$ of geometric objects, $G_\mathcal{C}(P)$ is a geometric graph with vertex set $P$ such that any two vertices $p$ and $q$ are adjacent if and only if there is some $C \in \mathcal{C}$ containing both $p$ and $q$ but no other points from $P$. We study $G_{\bigtriangledown}(P)$ graphs where $\bigtriangledown$ is the class of downward equilateral triangl…
▽ More
Given a point set $P$ and a class $\mathcal{C}$ of geometric objects, $G_\mathcal{C}(P)$ is a geometric graph with vertex set $P$ such that any two vertices $p$ and $q$ are adjacent if and only if there is some $C \in \mathcal{C}$ containing both $p$ and $q$ but no other points from $P$. We study $G_{\bigtriangledown}(P)$ graphs where $\bigtriangledown$ is the class of downward equilateral triangles (ie. equilateral triangles with one of their sides parallel to the x-axis and the corner opposite to this side below that side). For point sets in general position, these graphs have been shown to be equivalent to half-$Θ_6$ graphs and TD-Delaunay graphs.
The main result in our paper is that for point sets $P$ in general position, $G_{\bigtriangledown}(P)$ always contains a matching of size at least $\lceil\frac{n-2}{3}\rceil$ and this bound cannot be improved above $\lceil\frac{n-1}{3}\rceil$.
We also give some structural properties of $G_{\davidsstar}(P)$ graphs, where $\davidsstar$ is the class which contains both upward and downward equilateral triangles. We show that for point sets in general position, the block cut point graph of $G_{\davidsstar}(P)$ is simply a path. Through the equivalence of $G_{\davidsstar}(P)$ graphs with $Θ_6$ graphs, we also derive that any $Θ_6$ graph can have at most $5n-11$ edges, for point sets in general position.
△ Less
Submitted 12 November, 2012;
originally announced November 2012.
-
Extraction of Deep Phylogenetic Signal and Improved Resolution of Evolutionary Events within the recA/RAD51 Phylogeny
Authors:
Sree V. Chintapalli,
Gaurav Bhardwaj,
Jagadish Babu,
Loukia Hadjiyianni,
Yoo** Hong,
Zhenhai Zhang,
Xiaofan Zhou,
Hong Ma,
Andriy Anishkin,
Damian B. van Rossum,
Randen L. Patterson
Abstract:
The recA/RAD51 gene family encodes a diverse set of recombinase proteins that effect homologous recombination, DNA-repair, and genome stability. The recA gene family is expressed in almost all species of Eubacteria, Archaea, and Eukaryotes, and even in some viruses. To date, efforts to resolve the deep evolutionary origins of this ancient protein family have been hindered, in part, by the high seq…
▽ More
The recA/RAD51 gene family encodes a diverse set of recombinase proteins that effect homologous recombination, DNA-repair, and genome stability. The recA gene family is expressed in almost all species of Eubacteria, Archaea, and Eukaryotes, and even in some viruses. To date, efforts to resolve the deep evolutionary origins of this ancient protein family have been hindered, in part, by the high sequence divergence between families (i.e. ~30% identity between paralogous groups). Through (i) large taxon sampling, (ii) the use of a phylogenetic algorithm designed for measuring highly divergent paralogs, and (iii) novel Evolutionary Spatial Dynamics simulation and analytical tools, we obtained a robust, parsimonious and more refined phylogenetic history of the recA/RAD51 superfamily. Taken together, our model for the evolution of recA/RAD51 family provides a better understanding of ancient origin of recA proteins and multiple events leading to the diversification of recA homologs in eukaryotes, including the discovery of additional RAD51 sub-families.
△ Less
Submitted 14 June, 2012;
originally announced June 2012.
-
Statistical Methods for Astronomy
Authors:
Eric D. Feigelson,
G. Jogesh Babu
Abstract:
This review outlines concepts of mathematical statistics, elements of probability theory, hypothesis tests and point estimation for use in the analysis of modern astronomical data. Least squares, maximum likelihood, and Bayesian approaches to statistical inference are treated. Resampling methods, particularly the bootstrap, provide valuable procedures when distributions functions of statistics are…
▽ More
This review outlines concepts of mathematical statistics, elements of probability theory, hypothesis tests and point estimation for use in the analysis of modern astronomical data. Least squares, maximum likelihood, and Bayesian approaches to statistical inference are treated. Resampling methods, particularly the bootstrap, provide valuable procedures when distributions functions of statistics are not known. Several approaches to model selection and good- ness of fit are considered. Applied statistics relevant to astronomical research are briefly discussed: nonparametric methods for use when little is known about the behavior of the astronomical populations or processes; data smoothing with kernel density estimation and nonparametric regression; unsupervised clustering and supervised classification procedures for multivariate problems; survival analysis for astronomical datasets with nondetections; time- and frequency-domain times series analysis for light curves; and spatial statistics to interpret the spatial distributions of points in low dimensions. Two types of resources are presented: about 40 recommended texts and monographs in various fields of statistics, and the public domain R software system for statistical analysis. Together with its \sim 3500 (and growing) add-on CRAN packages, R implements a vast range of statistical procedures in a coherent high-level language with advanced graphics.
△ Less
Submitted 9 May, 2012;
originally announced May 2012.
-
Parameterized and Approximation Algorithms for Boxicity
Authors:
Abhi** Adiga,
Jasine Babu,
L. Sunil Chandran
Abstract:
Boxicity of a graph $G(V,$ $E)$, denoted by $box(G)$, is the minimum integer $k$ such that $G$ can be represented as the intersection graph of axis parallel boxes in $\mathbb{R}^k$. The problem of computing boxicity is inapproximable even for graph classes like bipartite, co-bipartite and split graphs within $O(n^{1 - ε})$-factor, for any $ε>0$ in polynomial time unless $NP=ZPP$. We give FPT appro…
▽ More
Boxicity of a graph $G(V,$ $E)$, denoted by $box(G)$, is the minimum integer $k$ such that $G$ can be represented as the intersection graph of axis parallel boxes in $\mathbb{R}^k$. The problem of computing boxicity is inapproximable even for graph classes like bipartite, co-bipartite and split graphs within $O(n^{1 - ε})$-factor, for any $ε>0$ in polynomial time unless $NP=ZPP$. We give FPT approximation algorithms for computing the boxicity of graphs, where the parameter used is the vertex or edge edit distance of the given graph from families of graphs of bounded boxicity. This can be seen as a generalization of the parameterizations discussed in \cite{Adiga2}.
Extending the same idea in one of our algorithms, we also get an $O\left(\frac{n\sqrt{\log \log n}}{\sqrt{\log n}}\right)$ factor approximation algorithm for computing boxicity and an $O\left(\frac{n {(\log \log n)}^{\frac{3}{2}}}{\sqrt{\log n}}\right)$ factor approximation algorithm for computing the cubicity. These seem to be the first $o(n)$ factor approximation algorithms known for both boxicity and cubicity. As a consequence of this result, a $o(n)$ factor approximation algorithm for computing the partial order dimension of finite posets and a $o(n)$ factor approximation algorithm for computing the threshold dimension of split graphs would follow.
△ Less
Submitted 5 March, 2014; v1 submitted 28 January, 2012;
originally announced January 2012.
-
Limit theorems for functions of marginal quantiles
Authors:
G. Jogesh Babu,
Zhidong Bai,
Kwok Pui Choi,
Vasudevan Mangalam
Abstract:
Multivariate distributions are explored using the joint distributions of marginal sample quantiles. Limit theory for the mean of a function of order statistics is presented. The results include a multivariate central limit theorem and a strong law of large numbers. A result similar to Bahadur's representation of quantiles is established for the mean of a function of the marginal quantiles. In part…
▽ More
Multivariate distributions are explored using the joint distributions of marginal sample quantiles. Limit theory for the mean of a function of order statistics is presented. The results include a multivariate central limit theorem and a strong law of large numbers. A result similar to Bahadur's representation of quantiles is established for the mean of a function of the marginal quantiles. In particular, it is shown that \[\sqrt{n}\Biggl(\frac{1}{n}\sum_{i=1}^nφ\bigl(X_{n:i}^{(1)},...,X_{n:i}^{(d)}\bigr)-\barγ\Biggr)=\frac{1}{\sqrt{n}}\sum_{i=1}^nZ_{n,i}+\mathrm{o}_P(1)\] as $n\rightarrow\infty$, where $\barγ$ is a constant and $Z_{n,i}$ are i.i.d. random variables for each $n$. This leads to the central limit theorem. Weak convergence to a Gaussian process using equicontinuity of functions is indicated. The results are established under very general conditions. These conditions are shown to be satisfied in many commonly occurring situations.
△ Less
Submitted 22 April, 2011;
originally announced April 2011.
-
A Constant Factor Approximation Algorithm for Boxicity of Circular Arc Graphs
Authors:
Abhi** Adiga,
Jasine Babu,
L. Sunil Chandran
Abstract:
Boxicity of a graph $G(V,E)$ is the minimum integer $k$ such that $G$ can be represented as the intersection graph of $k$-dimensional axis parallel rectangles in $\mathbf{R}^k$. Equivalently, it is the minimum number of interval graphs on the vertex set $V$ such that the intersection of their edge sets is $E$. It is known that boxicity cannot be approximated even for graph classes like bipartite,…
▽ More
Boxicity of a graph $G(V,E)$ is the minimum integer $k$ such that $G$ can be represented as the intersection graph of $k$-dimensional axis parallel rectangles in $\mathbf{R}^k$. Equivalently, it is the minimum number of interval graphs on the vertex set $V$ such that the intersection of their edge sets is $E$. It is known that boxicity cannot be approximated even for graph classes like bipartite, co-bipartite and split graphs below $O(n^{0.5 - ε})$-factor, for any $ε>0$ in polynomial time unless $NP=ZPP$. Till date, there is no well known graph class of unbounded boxicity for which even an $n^ε$-factor approximation algorithm for computing boxicity is known, for any $ε<1$. In this paper, we study the boxicity problem on Circular Arc graphs - intersection graphs of arcs of a circle. We give a $(2+\frac{1}{k})$-factor polynomial time approximation algorithm for computing the boxicity of any circular arc graph along with a corresponding box representation, where $k \ge 1$ is its boxicity. For Normal Circular Arc(NCA) graphs, with an NCA model given, this can be improved to an additive 2-factor approximation algorithm. The time complexity of the algorithms to approximately compute the boxicity is $O(mn+n^2)$ in both these cases and in $O(mn+kn^2)= O(n^3)$ time we also get their corresponding box representations, where $n$ is the number of vertices of the graph and $m$ is its number of edges. The additive 2-factor algorithm directly works for any Proper Circular Arc graph, since computing an NCA model for it can be done in polynomial time.
△ Less
Submitted 8 February, 2011;
originally announced February 2011.
-
A statistical model for the relation between exoplanets and their host stars
Authors:
E. Martinez-Gomez,
G. J. Babu
Abstract:
A general model is proposed to explain the relation between the extrasolar planets (or exoplanets) detected until June 2008 and the main characteristics of their host stars through statistical techniques. The main goal is to establish a mathematical relation among the set of variables which better describe the physical characteristics of the host star and the planet itself. The host star is char…
▽ More
A general model is proposed to explain the relation between the extrasolar planets (or exoplanets) detected until June 2008 and the main characteristics of their host stars through statistical techniques. The main goal is to establish a mathematical relation among the set of variables which better describe the physical characteristics of the host star and the planet itself. The host star is characterized by its distance, age, effective temperature, mass, metallicity, radius and magnitude. The exoplanet is described through its physical parameters (radius and mass) and its orbital parameters (distance, period, eccentricity, inclination and major semiaxis). As a first approach we consider that only the mass of the exoplanet is being determined by the physical properties of its host star. The proposed model is then validated through statistical analysis. Finally we discuss the categorical behavior of the dependent variable through binary models.
△ Less
Submitted 27 August, 2009;
originally announced August 2009.
-
Object detection in multi-epoch data
Authors:
G. Jogesh Babu,
Ashish Mahabal,
S. G. Djorgovski,
R. Williams
Abstract:
In astronomy multiple images are frequently obtained at the same position of the sky for follow-up co-addition as it helps one go deeper and look for fainter objects. With large scale panchromatic synoptic surveys becoming more common, image co-addition has become even more necessary as new observations start to get compared with co-added fiducial sky in real time. The standard co-addition techn…
▽ More
In astronomy multiple images are frequently obtained at the same position of the sky for follow-up co-addition as it helps one go deeper and look for fainter objects. With large scale panchromatic synoptic surveys becoming more common, image co-addition has become even more necessary as new observations start to get compared with co-added fiducial sky in real time. The standard co-addition techniques have included straight averages, variance weighted averages, medians etc. A more sophisticated nonlinear response chi-square method is also used when it is known that the data are background noise limited and the point spread function is homogenized in all channels. A more robust object detection technique capable of detecting faint sources, even those not seen at all epochs which will normally be smoothed out in traditional methods, is described. The analysis at each pixel level is based on a formula similar to Mahalanobis distance. The method does not depend on the point spread function.
△ Less
Submitted 22 December, 2006;
originally announced December 2006.
-
Grist: Grid-based Data Mining for Astronomy
Authors:
Joseph C. Jacob,
Roy Williams,
Jogesh Babu,
S. George Djorgovski,
Matthew J. Graham,
Daniel S. Katz,
Ashish Mahabal,
Craig D. Miller,
Robert Nichol,
Daniel E. Vanden Berk,
Harshpreet Walia
Abstract:
The Grist project (http://grist.caltech.edu/) is develo** a grid-technology based system as a research environment for astronomy with massive and complex datasets. This knowledge extraction system will consist of a library of distributed grid services controlled by a workflow system, compliant with standards emerging from the grid computing, web services, and virtual observatory communities. T…
▽ More
The Grist project (http://grist.caltech.edu/) is develo** a grid-technology based system as a research environment for astronomy with massive and complex datasets. This knowledge extraction system will consist of a library of distributed grid services controlled by a workflow system, compliant with standards emerging from the grid computing, web services, and virtual observatory communities. This new technology is being used to find high redshift quasars, study peculiar variable objects, search for transients in real time, and fit SDSS QSO spectra to measure black hole masses. Grist services are also a component of the ``hyperatlas'' project to serve high-resolution multi-wavelength imagery over the Internet. In support of these science and outreach objectives, the Grist framework will provide the enabling fabric to tie together distributed grid services in the areas of data access, federation, mining, subsetting, source extraction, image mosaicking, statistics, and visualization.
△ Less
Submitted 19 November, 2004;
originally announced November 2004.
-
Statistical Challenges in Modern Astronomy
Authors:
E. D. Feigelson,
G. J. Babu
Abstract:
Despite centuries of close association, statistics and astronomy are surprisingly distant today. Most observational astronomical research relies on an inadequate toolbox of methodological tools. Yet the needs are substantial: astronomy encounters sophisticated problems involving sampling theory, survival analysis, multivariate classification and analysis, time series analysis, wavelet analysis,…
▽ More
Despite centuries of close association, statistics and astronomy are surprisingly distant today. Most observational astronomical research relies on an inadequate toolbox of methodological tools. Yet the needs are substantial: astronomy encounters sophisticated problems involving sampling theory, survival analysis, multivariate classification and analysis, time series analysis, wavelet analysis, spatial point processes, nonlinear regression, bootstrap resampling and model selection. We review the recent resurgence of astrostatistical research, and outline new challenges raised by the emerging Virtual Observatory. Our essay ends with a list of research challenges and infrastructure for astrostatistics in the coming decade.
△ Less
Submitted 20 January, 2004;
originally announced January 2004.
-
Three types of gamma-ray bursts
Authors:
Soma Mukherjee,
Eric D. Feigelson,
Gutti Jogesh Babu,
Fionn Murtagh,
Chris Fraley,
Adrian Raftery
Abstract:
A multivariate analysis of gamma-ray burst (GRB) bulk properties is presented to discriminate between distinct classes of GRBs. Several variables representing burst duration, fluence and spectral hardness are considered. Two multivariate clustering procedures are used on a sample of 797 bursts from the Third BATSE Catalog: a nonparametric average linkage hierarchical agglomerative clustering pro…
▽ More
A multivariate analysis of gamma-ray burst (GRB) bulk properties is presented to discriminate between distinct classes of GRBs. Several variables representing burst duration, fluence and spectral hardness are considered. Two multivariate clustering procedures are used on a sample of 797 bursts from the Third BATSE Catalog: a nonparametric average linkage hierarchical agglomerative clustering procedure validated with Wilks' $Λ^*$ and other MANOVA tests; and a parametric maximum likelihood model-based clustering procedure assuming multinormal populations calculated with the EM Algorithm and validated with the Bayesian Information Criterion.
The two methods yield very similar results. The BATSE GRB population consists of three classes with the following Duration/Fluence/Spectrum bulk properties: Class I with long/bright/intermediate bursts, Class II with short/hard/faint bursts, and Class III with intermediate/intermediate/soft bursts. One outlier with poor data is also present. Classes I and II correspond to those reported by Kouveliotou et al. (1993), but Class III is clearly defined here for the first time.
△ Less
Submitted 7 February, 1998;
originally announced February 1998.