-
Simulation-based Inference of Developmental EEG Maturation with the Spectral Graph Model
Authors:
Danilo Bernardo,
Xihe Xie,
Parul Verma,
Jonathan Kim,
Virginia Liu,
Adam Numis,
Ye Wu,
Hannah C. Glass,
Pew-Thian Yap,
Srikantan Nagarajan,
Ashish Raj
Abstract:
The spectral content of macroscopic neural activity evolves throughout development, yet how this maturation relates to underlying brain network formation and dynamics remains unknown. Here, we assess the developmental maturation of electroencephalogram spectra via Bayesian model inversion of the spectral graph model, a parsimonious whole-brain model of spatiospectral neural activity derived from l…
▽ More
The spectral content of macroscopic neural activity evolves throughout development, yet how this maturation relates to underlying brain network formation and dynamics remains unknown. Here, we assess the developmental maturation of electroencephalogram spectra via Bayesian model inversion of the spectral graph model, a parsimonious whole-brain model of spatiospectral neural activity derived from linearized neural field models coupled by the structural connectome. Simulation-based inference was used to estimate age-varying spectral graph model parameter posterior distributions from electroencephalogram spectra spanning the developmental period. This model-fitting approach accurately captures observed developmental electroencephalogram spectral maturation via a neurobiologically consistent progression of key neural parameters: long-range coupling, axonal conduction speed, and excitatory:inhibitory balance. These results suggest that the spectral maturation of macroscopic neural activity observed during typical development is supported by age-dependent functional adaptations in localized neural dynamics and their long-range coupling across the macroscopic structural network.
△ Less
Submitted 11 July, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
An Analysis of $D^α$ seeding for $k$-means
Authors:
Etienne Bamas,
Sai Ganesh Nagarajan,
Ola Svensson
Abstract:
One of the most popular clustering algorithms is the celebrated $D^α$ seeding algorithm (also know as $k$-means++ when $α=2$) by Arthur and Vassilvitskii (2007), who showed that it guarantees in expectation an $O(2^{2α}\cdot \log k)$-approximate solution to the ($k$,$α$)-means cost (where euclidean distances are raised to the power $α$) for any $α\ge 1$. More recently, Balcan, Dick, and White (201…
▽ More
One of the most popular clustering algorithms is the celebrated $D^α$ seeding algorithm (also know as $k$-means++ when $α=2$) by Arthur and Vassilvitskii (2007), who showed that it guarantees in expectation an $O(2^{2α}\cdot \log k)$-approximate solution to the ($k$,$α$)-means cost (where euclidean distances are raised to the power $α$) for any $α\ge 1$. More recently, Balcan, Dick, and White (2018) observed experimentally that using $D^α$ seeding with $α>2$ can lead to a better solution with respect to the standard $k$-means objective (i.e. the $(k,2)$-means cost).
In this paper, we provide a rigorous understanding of this phenomenon. For any $α>2$, we show that $D^α$ seeding guarantees in expectation an approximation factor of $$ O_α\left((g_α)^{2/α}\cdot \left(\frac{σ_{\mathrm{max}}}{σ_{\mathrm{min}}}\right)^{2-4/α}\cdot (\min\{\ell,\log k\})^{2/α}\right)$$ with respect to the standard $k$-means cost of any underlying clustering; where $g_α$ is a parameter capturing the concentration of the points in each cluster, $σ_{\mathrm{max}}$ and $σ_{\mathrm{min}}$ are the maximum and minimum standard deviation of the clusters around their means, and $\ell$ is the number of distinct mixing weights in the underlying clustering (after rounding them to the nearest power of $2$). We complement these results by some lower bounds showing that the dependency on $g_α$ and $σ_{\mathrm{max}}/σ_{\mathrm{min}}$ is tight.
Finally, we provide an experimental confirmation of the effects of the aforementioned parameters when using $D^α$ seeding. Further, we corroborate the observation that $α>2$ can indeed improve the $k$-means cost compared to $D^2$ seeding, and that this advantage remains even if we run Lloyd's algorithm after the seeding.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Scan-specific Self-supervised Bayesian Deep Non-linear Inversion for Undersampled MRI Reconstruction
Authors:
Andrew P. Leynes,
Nikhil Deveshwar,
Srikantan S. Nagarajan,
Peder E. Z. Larson
Abstract:
Magnetic resonance imaging is subject to slow acquisition times due to the inherent limitations in data sampling. Recently, supervised deep learning has emerged as a promising technique for reconstructing sub-sampled MRI. However, supervised deep learning requires a large dataset of fully-sampled data. Although unsupervised or self-supervised deep learning methods have emerged to address the limit…
▽ More
Magnetic resonance imaging is subject to slow acquisition times due to the inherent limitations in data sampling. Recently, supervised deep learning has emerged as a promising technique for reconstructing sub-sampled MRI. However, supervised deep learning requires a large dataset of fully-sampled data. Although unsupervised or self-supervised deep learning methods have emerged to address the limitations of supervised deep learning approaches, they still require a database of images. In contrast, scan-specific deep learning methods learn and reconstruct using only the sub-sampled data from a single scan. Here, we introduce Scan-Specific Self-Supervised Bayesian Deep Non-Linear Inversion (DNLINV) that does not require an auto calibration scan region. DNLINV utilizes a deep image prior-type generative modeling approach and relies on approximate Bayesian inference to regularize the deep convolutional neural network. We demonstrate our approach on several anatomies, contrasts, and sampling patterns and show improved performance over existing approaches in scan-specific calibrationless parallel imaging and compressed sensing.
△ Less
Submitted 12 February, 2024; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Efficient Hierarchical Bayesian Inference for Spatio-temporal Regression Models in Neuroimaging
Authors:
Ali Hashemi,
Yi**g Gao,
Chang Cai,
Sanjay Ghosh,
Klaus-Robert Müller,
Srikantan S. Nagarajan,
Stefan Haufe
Abstract:
Several problems in neuroimaging and beyond require inference on the parameters of multi-task sparse hierarchical regression models. Examples include M/EEG inverse problems, neural encoding models for task-based fMRI analyses, and climate science. In these domains, both the model parameters to be inferred and the measurement noise may exhibit a complex spatio-temporal structure. Existing work eith…
▽ More
Several problems in neuroimaging and beyond require inference on the parameters of multi-task sparse hierarchical regression models. Examples include M/EEG inverse problems, neural encoding models for task-based fMRI analyses, and climate science. In these domains, both the model parameters to be inferred and the measurement noise may exhibit a complex spatio-temporal structure. Existing work either neglects the temporal structure or leads to computationally demanding inference schemes. Overcoming these limitations, we devise a novel flexible hierarchical Bayesian framework within which the spatio-temporal dynamics of model parameters and noise are modeled to have Kronecker product covariance structure. Inference in our framework is based on majorization-minimization optimization and has guaranteed convergence properties. Our highly efficient algorithms exploit the intrinsic Riemannian geometry of temporal autocovariance matrices. For stationary dynamics described by Toeplitz matrices, the theory of circulant embeddings is employed. We prove convex bounding properties and derive update rules of the resulting algorithms. On both synthetic and real neural data from M/EEG, we demonstrate that our methods lead to improved performance.
△ Less
Submitted 23 November, 2021; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Stochastic Multiplicative Weights Updates in Zero-Sum Games
Authors:
James P. Bailey,
Sai Ganesh Nagarajan,
Georgios Piliouras
Abstract:
We study agents competing against each other in a repeated network zero-sum game while applying the multiplicative weights update (MWU) algorithm with fixed learning rates. In our implementation, agents select their strategies probabilistically in each iteration and update their weights/strategies using the realized vector payoff of all strategies, i.e., stochastic MWU with full information. We sh…
▽ More
We study agents competing against each other in a repeated network zero-sum game while applying the multiplicative weights update (MWU) algorithm with fixed learning rates. In our implementation, agents select their strategies probabilistically in each iteration and update their weights/strategies using the realized vector payoff of all strategies, i.e., stochastic MWU with full information. We show that the system results in an irreducible Markov chain where agent strategies diverge from the set of Nash equilibria. Further, we show that agents will play pure strategies with probability 1 in the limit.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Extremal mild solutions for Hilfer fractional evolution equation with mixed monotone Impulsive conditions
Authors:
Divya Raghavan,
Sukavanam Nagarajan
Abstract:
The well established mixed monotone iterative technique that is used to study the existence and uniqueness of fractional order system is studied explicitly for impulsive system with Hilfer fractional order in this paper. The procedure of finding mild $L$-quasi solution of such impulsive evolution equation with noncomapct semigroups involves measure of non-compactness and Sadovskii's fixed point th…
▽ More
The well established mixed monotone iterative technique that is used to study the existence and uniqueness of fractional order system is studied explicitly for impulsive system with Hilfer fractional order in this paper. The procedure of finding mild $L$-quasi solution of such impulsive evolution equation with noncomapct semigroups involves measure of non-compactness and Sadovskii's fixed point theorem as well. An example is provided to illustrate the main results.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Extremal mild solutions of Hilfer fractional Impulsive systems
Authors:
Divya Raghavan,
Sukavanam Nagarajan
Abstract:
The well established monotone iterative technique that is used to study the existence and uniqueness of fractional impulsive system is extended to Hilfer fractional order in this paper. The results are derived by using the method of upper and lower solution and Gronwall inequality. Also, conditions on non-compactness of measure is used effectively to prove the main result.
The well established monotone iterative technique that is used to study the existence and uniqueness of fractional impulsive system is extended to Hilfer fractional order in this paper. The results are derived by using the method of upper and lower solution and Gronwall inequality. Also, conditions on non-compactness of measure is used effectively to prove the main result.
△ Less
Submitted 23 November, 2020;
originally announced November 2020.
-
Generalized Mittag-Leffler stability of fractional impulsive differential system
Authors:
Divya Raghavan,
Sukavanam Nagarajan,
Chengbo Zhai
Abstract:
This paper establishes integral representations of mild solutions of impulsive Hilfer fractional differential equations with impulsive conditions and fluctuating lower bounds at impulsive points. Further, the paper provides sufficient conditions for generalized Mittag-Leffler stability of a class of impulsive fractional differential systems with Hilfer order. The analysis extends through both, ins…
▽ More
This paper establishes integral representations of mild solutions of impulsive Hilfer fractional differential equations with impulsive conditions and fluctuating lower bounds at impulsive points. Further, the paper provides sufficient conditions for generalized Mittag-Leffler stability of a class of impulsive fractional differential systems with Hilfer order. The analysis extends through both, instantaneous and non-instantaneous impulsive conditions. The theory utilizes continuous Lyapunov functions, to ascertain the stability conditions. An example is provided to study the solution of the system with a changeable lower bound for the non-instantaneous impulsive conditions.
△ Less
Submitted 17 May, 2022; v1 submitted 20 September, 2020;
originally announced September 2020.
-
Efficient Statistics for Sparse Graphical Models from Truncated Samples
Authors:
Arnab Bhattacharyya,
Rathin Desai,
Sai Ganesh Nagarajan,
Ioannis Panageas
Abstract:
In this paper, we study high-dimensional estimation from truncated samples. We focus on two fundamental and classical problems: (i) inference of sparse Gaussian graphical models and (ii) support recovery of sparse linear models.
(i) For Gaussian graphical models, suppose $d$-dimensional samples ${\bf x}$ are generated from a Gaussian $N(μ,Σ)$ and observed only if they belong to a subset…
▽ More
In this paper, we study high-dimensional estimation from truncated samples. We focus on two fundamental and classical problems: (i) inference of sparse Gaussian graphical models and (ii) support recovery of sparse linear models.
(i) For Gaussian graphical models, suppose $d$-dimensional samples ${\bf x}$ are generated from a Gaussian $N(μ,Σ)$ and observed only if they belong to a subset $S \subseteq \mathbb{R}^d$. We show that $μ$ and $Σ$ can be estimated with error $ε$ in the Frobenius norm, using $\tilde{O}\left(\frac{\textrm{nz}(Σ^{-1})}{ε^2}\right)$ samples from a truncated $\mathcal{N}(μ,Σ)$ and having access to a membership oracle for $S$. The set $S$ is assumed to have non-trivial measure under the unknown distribution but is otherwise arbitrary.
(ii) For sparse linear regression, suppose samples $({\bf x},y)$ are generated where $y = {\bf x}^\top{Ω^*} + \mathcal{N}(0,1)$ and $({\bf x}, y)$ is seen only if $y$ belongs to a truncation set $S \subseteq \mathbb{R}$. We consider the case that $Ω^*$ is sparse with a support set of size $k$. Our main result is to establish precise conditions on the problem dimension $d$, the support size $k$, the number of observations $n$, and properties of the samples and the truncation that are sufficient to recover the support of $Ω^*$. Specifically, we show that under some mild assumptions, only $O(k^2 \log d)$ samples are needed to estimate $Ω^*$ in the $\ell_\infty$-norm up to a bounded error.
For both problems, our estimator minimizes the sum of the finite population negative log-likelihood function and an $\ell_1$-regularization term.
△ Less
Submitted 17 June, 2020;
originally announced June 2020.
-
Properties of relaxed trajectories of non-linear fractional impulsive control systems
Authors:
Divya Raghavan,
Sukavanam Nagarajan
Abstract:
A non-convex control system governed by a nonlinear impulsive evolution equation of Hilfer fractional order in a Banach space is considered. The existence of admissible state-control pair is established. Then the introduction of suitable measure-valued control convexifies the system, and the relaxed system is obtained. Further, the relaxation theorem for the described class is proved along with th…
▽ More
A non-convex control system governed by a nonlinear impulsive evolution equation of Hilfer fractional order in a Banach space is considered. The existence of admissible state-control pair is established. Then the introduction of suitable measure-valued control convexifies the system, and the relaxed system is obtained. Further, the relaxation theorem for the described class is proved along with the existence of optimal relaxed control.
△ Less
Submitted 28 May, 2022; v1 submitted 23 April, 2020;
originally announced April 2020.
-
Better Depth-Width Trade-offs for Neural Networks through the lens of Dynamical Systems
Authors:
Vaggos Chatziafratis,
Sai Ganesh Nagarajan,
Ioannis Panageas
Abstract:
The expressivity of neural networks as a function of their depth, width and type of activation units has been an important question in deep learning theory. Recently, depth separation results for ReLU networks were obtained via a new connection with dynamical systems, using a generalized notion of fixed points of a continuous map $f$, called periodic points. In this work, we strengthen the connect…
▽ More
The expressivity of neural networks as a function of their depth, width and type of activation units has been an important question in deep learning theory. Recently, depth separation results for ReLU networks were obtained via a new connection with dynamical systems, using a generalized notion of fixed points of a continuous map $f$, called periodic points. In this work, we strengthen the connection with dynamical systems and we improve the existing width lower bounds along several aspects. Our first main result is period-specific width lower bounds that hold under the stronger notion of $L^1$-approximation error, instead of the weaker classification error. Our second contribution is that we provide sharper width lower bounds, still yielding meaningful exponential depth-width separations, in regimes where previous results wouldn't apply. A byproduct of our results is that there exists a universal constant characterizing the depth-width trade-offs, as long as $f$ has odd periods. Technically, our results follow by unveiling a tighter connection between the following three quantities of a given function: its period, its Lipschitz constant and the growth rate of the number of oscillations arising under compositions of the function $f$ with itself.
△ Less
Submitted 20 July, 2020; v1 submitted 2 March, 2020;
originally announced March 2020.
-
Last iterate convergence in no-regret learning: constrained min-max optimization for convex-concave landscapes
Authors:
Qi Lei,
Sai Ganesh Nagarajan,
Ioannis Panageas,
Xiao Wang
Abstract:
In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate convergence of the so called "Optimistic Gradient Descent/Ascent" for the case of \textit{unconstrained} min-max optimization. Moreover, in \cite{Metal} the authors…
▽ More
In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate convergence of the so called "Optimistic Gradient Descent/Ascent" for the case of \textit{unconstrained} min-max optimization. Moreover, in \cite{Metal} the authors show that Mirror Descent with an extra gradient step displays last iterate convergence for convex-concave problems (both constrained and unconstrained), though their algorithm does not follow the online learning framework; it uses extra information rather than \textit{only} the history to compute the next iteration. In this work, we show that "Optimistic Multiplicative-Weights Update (OMWU)" which follows the no-regret online learning framework, exhibits last iterate convergence locally for convex-concave games, generalizing the results of \cite{DP19} where last iterate convergence of OMWU was shown only for the \textit{bilinear case}. We complement our results with experiments that indicate fast convergence of the method.
△ Less
Submitted 21 February, 2020; v1 submitted 16 February, 2020;
originally announced February 2020.
-
Depth-Width Trade-offs for ReLU Networks via Sharkovsky's Theorem
Authors:
Vaggos Chatziafratis,
Sai Ganesh Nagarajan,
Ioannis Panageas,
Xiao Wang
Abstract:
Understanding the representational power of Deep Neural Networks (DNNs) and how their structural properties (e.g., depth, width, type of activation unit) affect the functions they can compute, has been an important yet challenging question in deep learning and approximation theory. In a seminal paper, Telgarsky highlighted the benefits of depth by presenting a family of functions (based on simple…
▽ More
Understanding the representational power of Deep Neural Networks (DNNs) and how their structural properties (e.g., depth, width, type of activation unit) affect the functions they can compute, has been an important yet challenging question in deep learning and approximation theory. In a seminal paper, Telgarsky highlighted the benefits of depth by presenting a family of functions (based on simple triangular waves) for which DNNs achieve zero classification error, whereas shallow networks with fewer than exponentially many nodes incur constant error. Even though Telgarsky's work reveals the limitations of shallow neural networks, it does not inform us on why these functions are difficult to represent and in fact he states it as a tantalizing open question to characterize those functions that cannot be well-approximated by smaller depths.
In this work, we point to a new connection between DNNs expressivity and Sharkovsky's Theorem from dynamical systems, that enables us to characterize the depth-width trade-offs of ReLU networks for representing functions based on the presence of generalized notion of fixed points, called periodic points (a fixed point is a point of period 1). Motivated by our observation that the triangle waves used in Telgarsky's work contain points of period 3 - a period that is special in that it implies chaotic behavior based on the celebrated result by Li-Yorke - we proceed to give general lower bounds for the width needed to represent periodic functions as a function of the depth. Technically, the crux of our approach is based on an eigenvalue analysis of the dynamical system associated with such functions.
△ Less
Submitted 9 December, 2019;
originally announced December 2019.
-
Skin Cancer Recognition using Deep Residual Network
Authors:
Brij Rokad,
Dr. Sureshkumar Nagarajan
Abstract:
The advances in technology have enabled people to access internet from every part of the world. But to date, access to healthcare in remote areas is sparse. This proposed solution aims to bridge the gap between specialist doctors and patients. This prototype will be able to detect skin cancer from an image captured by the phone or any other camera. The network is deployed on cloud server-side proc…
▽ More
The advances in technology have enabled people to access internet from every part of the world. But to date, access to healthcare in remote areas is sparse. This proposed solution aims to bridge the gap between specialist doctors and patients. This prototype will be able to detect skin cancer from an image captured by the phone or any other camera. The network is deployed on cloud server-side processing for an even more accurate result. The Deep Residual learning model has been used for predicting the probability of cancer for server side The ResNet has three parametric layers. Each layer has Convolutional Neural Network, Batch Normalization, Maxpool and ReLU. Currently the model achieves an accuracy of 77% on the ISIC - 2017 challenge.
△ Less
Submitted 14 May, 2019;
originally announced May 2019.
-
On the Analysis of EM for truncated mixtures of two Gaussians
Authors:
Sai Ganesh Nagarajan,
Ioannis Panageas
Abstract:
Motivated by a recent result of Daskalakis et al. 2018, we analyze the population version of Expectation-Maximization (EM) algorithm for the case of \textit{truncated} mixtures of two Gaussians. Truncated samples from a $d$-dimensional mixture of two Gaussians $\frac{1}{2} \mathcal{N}(\vecμ, \vecΣ)+ \frac{1}{2} \mathcal{N}(-\vecμ, \vecΣ)$ means that a sample is only revealed if it falls in some su…
▽ More
Motivated by a recent result of Daskalakis et al. 2018, we analyze the population version of Expectation-Maximization (EM) algorithm for the case of \textit{truncated} mixtures of two Gaussians. Truncated samples from a $d$-dimensional mixture of two Gaussians $\frac{1}{2} \mathcal{N}(\vecμ, \vecΣ)+ \frac{1}{2} \mathcal{N}(-\vecμ, \vecΣ)$ means that a sample is only revealed if it falls in some subset $S \subset \mathbb{R}^d$ of positive (Lebesgue) measure. We show that for $d=1$, EM converges almost surely (under random initialization) to the true mean (variance $σ^2$ is known) for any measurable set $S$. Moreover, for $d>1$ we show EM almost surely converges to the true mean for any measurable set $S$ when the map of EM has only three fixed points, namely $-\vecμ, \vec{0}, \vecμ$ (covariance matrix $\vecΣ$ is known), and prove local convergence if there are more than three fixed points. We also provide convergence rates of our findings. Our techniques deviate from those of Daskalakis et al. 2017, which heavily depend on symmetry that the untruncated problem exhibits. For example, for an arbitrary measurable set $S$, it is impossible to compute a closed form of the update rule of EM. Moreover, arbitrarily truncating the mixture, induces further correlations among the variables. We circumvent these challenges by using techniques from dynamical systems, probability and statistics; implicit function theorem, stability analysis around the fixed points of the update rule of EM and correlation inequalities (FKG).
△ Less
Submitted 9 May, 2020; v1 submitted 19 February, 2019;
originally announced February 2019.
-
Distributed Anomaly Detection using Autoencoder Neural Networks in WSN for IoT
Authors:
Tie Luo,
Sai G. Nagarajan
Abstract:
Wireless sensor networks (WSN) are fundamental to the Internet of Things (IoT) by bridging the gap between the physical and the cyber worlds. Anomaly detection is a critical task in this context as it is responsible for identifying various events of interests such as equipment faults and undiscovered phenomena. However, this task is challenging because of the elusive nature of anomalies and the vo…
▽ More
Wireless sensor networks (WSN) are fundamental to the Internet of Things (IoT) by bridging the gap between the physical and the cyber worlds. Anomaly detection is a critical task in this context as it is responsible for identifying various events of interests such as equipment faults and undiscovered phenomena. However, this task is challenging because of the elusive nature of anomalies and the volatility of the ambient environments. In a resource-scarce setting like WSN, this challenge is further elevated and weakens the suitability of many existing solutions. In this paper, for the first time, we introduce autoencoder neural networks into WSN to solve the anomaly detection problem. We design a two-part algorithm that resides on sensors and the IoT cloud respectively, such that (i) anomalies can be detected at sensors in a fully distributed manner without the need for communicating with any other sensors or the cloud, and (ii) the relatively more computation-intensive learning task can be handled by the cloud with a much lower (and configurable) frequency. In addition to the minimal communication overhead, the computational load on sensors is also very low (of polynomial complexity) and readily affordable by most COTS sensors. Using a real WSN indoor testbed and sensor data collected over 4 consecutive months, we demonstrate via experiments that our proposed autoencoder-based anomaly detection mechanism achieves high detection accuracy and low false alarm rate. It is also able to adapt to unforeseeable and new changes in a non-stationary environment, thanks to the unsupervised learning feature of our chosen autoencoder neural networks.
△ Less
Submitted 12 December, 2018;
originally announced December 2018.
-
Simulation of Astronomical Images from Optical Survey Telescopes using a Comprehensive Photon Monte Carlo Approach
Authors:
J. R. Peterson,
J. G. Jernigan,
S. M. Kahn,
A. P. Rasmussen,
E. Peng,
Z. Ahmad,
J. Bankert,
C. Chang,
C. Claver,
D. K. Gilmore,
E. Grace,
M. Hannel,
M. Hodge,
S. Lorenz,
A. Lupu,
A. Meert,
S. Nagarajan,
N. Todd,
A. Winans,
M. Young
Abstract:
We present a comprehensive methodology for the simulation of astronomical images from optical survey telescopes. We use a photon Monte Carlo approach to construct images by sampling photons from models of astronomical source populations, and then simulating those photons through the system as they interact with the atmosphere, telescope, and camera. We demonstrate that all physical effects for opt…
▽ More
We present a comprehensive methodology for the simulation of astronomical images from optical survey telescopes. We use a photon Monte Carlo approach to construct images by sampling photons from models of astronomical source populations, and then simulating those photons through the system as they interact with the atmosphere, telescope, and camera. We demonstrate that all physical effects for optical light that determine the shapes, locations, and brightnesses of individual stars and galaxies can be accurately represented in this formalism. By using large scale grid computing, modern processors, and an efficient implementation that can produce 400,000 photons/second, we demonstrate that even very large optical surveys can be now be simulated. We demonstrate that we are able to: 1) construct kilometer scale phase screens necessary for wide-field telescopes, 2) reproduce atmospheric point-spread-function moments using a fast novel hybrid geometric/Fourier technique for non-diffraction limited telescopes, 3) accurately reproduce the expected spot diagrams for complex aspheric optical designs, and 4) recover system effective area predicted from analytic photometry integrals. This new code, the photon simulator (PhoSim), is publicly available. We have implemented the Large Synoptic Survey Telescope (LSST) design, and it can be extended to other telescopes. We expect that because of the comprehensive physics implemented in PhoSim, it will be used by the community to plan future observations, interpret detailed existing observations, and quantify systematics related to various astronomical measurements. Future development and validation by comparisons with real data will continue to improve the fidelity and usability of the code.
△ Less
Submitted 24 April, 2015;
originally announced April 2015.
-
Effect of Measurement Errors on Predicted Cosmological Constraints from Shear Peak Statistics with LSST
Authors:
D. Bard,
J. M. Kratochvil,
C. Chang,
M. May,
S. M. Kahn,
Y. AlSayyad,
Z. Ahmad,
J. Bankert,
A. Connolly,
R. R. Gibson,
K. Gilmore,
E. Grace,
Z. Haiman,
M. Hannel,
K. M. Huffenberger,
J. G. Jernigan,
L. Jones,
S. Krughoff,
S. Lorenz,
S. Marshall,
A. Meert,
S. Nagarajan,
E. Peng,
J. Peterson,
A. P. Rasmussen
, et al. (4 additional authors not shown)
Abstract:
The statistics of peak counts in reconstructed shear maps contain information beyond the power spectrum, and can improve cosmological constraints from measurements of the power spectrum alone if systematic errors can be controlled. We study the effect of galaxy shape measurement errors on predicted cosmological constraints from the statistics of shear peak counts with the Large Synoptic Survey Tel…
▽ More
The statistics of peak counts in reconstructed shear maps contain information beyond the power spectrum, and can improve cosmological constraints from measurements of the power spectrum alone if systematic errors can be controlled. We study the effect of galaxy shape measurement errors on predicted cosmological constraints from the statistics of shear peak counts with the Large Synoptic Survey Telescope (LSST). We use the LSST image simulator in combination with cosmological N-body simulations to model realistic shear maps for different cosmological models. We include both galaxy shape noise and, for the first time, measurement errors on galaxy shapes. We find that the measurement errors considered have relatively little impact on the constraining power of shear peak counts for LSST.
△ Less
Submitted 4 January, 2013;
originally announced January 2013.
-
Effectiveness of sparse Bayesian algorithm for MVAR coefficient estimation in MEG/EEG source-space causality analysis
Authors:
Kensuke Sekihara,
Hagai Attias,
Julia P. Owen,
Srikantan S. Nagarajan
Abstract:
This paper examines the effectiveness of a sparse Bayesian algorithm to estimate multivariate autoregressive coefficients when a large amount of background interference exists. This paper employs computer experiments to compare two methods in the source-space causality analysis: the conventional least-squares method and a sparse Bayesian method. Results of our computer experiments show that the in…
▽ More
This paper examines the effectiveness of a sparse Bayesian algorithm to estimate multivariate autoregressive coefficients when a large amount of background interference exists. This paper employs computer experiments to compare two methods in the source-space causality analysis: the conventional least-squares method and a sparse Bayesian method. Results of our computer experiments show that the interference affects the least-squares method in a very severe manner. It produces large false-positive results, unless the signal-to-interference ratio is very high. On the other hand, the sparse Bayesian method is relatively insensitive to the existence of interference. However, this robustness of the sparse Bayesian method is attained on the scarifies of the detectability of true causal relationship. Our experiments also show that the surrogate data bootstrap** method tends to give a statistical threshold that are too low for the sparse method.
The permutation-test-based method gives a higher (more conservative) threshold and it should be used with the sparse Bayesian method whenever the control period is available.
△ Less
Submitted 14 November, 2012;
originally announced November 2012.
-
Atmospheric PSF Interpolation for Weak Lensing in Short Exposure Imaging Data
Authors:
C. Chang,
P. J. Marshall,
J. G. Jernigan,
J. R. Peterson,
S. M. Kahn,
S. F. Gull,
Y. AlSayyad,
Z. Ahmad,
J. Bankert,
D. Bard,
A. Connolly,
R. R. Gibson,
K. Gilmore,
E. Grace,
M. Hannel,
M. A. Hodge,
L. Jones,
S. Krughoff,
S. Lorenz,
S. Marshall,
A. Meert,
S. Nagarajan,
E. Peng,
A. P. Rasmussen,
M. Shmakova
, et al. (3 additional authors not shown)
Abstract:
A main science goal for the Large Synoptic Survey Telescope (LSST) is to measure the cosmic shear signal from weak lensing to extreme accuracy. One difficulty, however, is that with the short exposure time ($\simeq$15 seconds) proposed, the spatial variation of the Point Spread Function (PSF) shapes may be dominated by the atmosphere, in addition to optics errors. While optics errors mainly cause…
▽ More
A main science goal for the Large Synoptic Survey Telescope (LSST) is to measure the cosmic shear signal from weak lensing to extreme accuracy. One difficulty, however, is that with the short exposure time ($\simeq$15 seconds) proposed, the spatial variation of the Point Spread Function (PSF) shapes may be dominated by the atmosphere, in addition to optics errors. While optics errors mainly cause the PSF to vary on angular scales similar or larger than a single CCD sensor, the atmosphere generates stochastic structures on a wide range of angular scales. It thus becomes a challenge to infer the multi-scale, complex atmospheric PSF patterns by interpolating the sparsely sampled stars in the field. In this paper we present a new method, PSFent, for interpolating the PSF shape parameters, based on reconstructing underlying shape parameter maps with a multi-scale maximum entropy algorithm. We demonstrate, using images from the LSST Photon Simulator, the performance of our approach relative to a 5th-order polynomial fit (representing the current standard) and a simple boxcar smoothing technique. Quantitatively, PSFent predicts more accurate PSF models in all scenarios and the residual PSF errors are spatially less correlated. This improvement in PSF interpolation leads to a factor of 3.5 lower systematic errors in the shear power spectrum on scales smaller than $\sim13'$, compared to polynomial fitting. We estimate that with PSFent and for stellar densities greater than $\simeq1/{\rm arcmin}^{2}$, the spurious shear correlation from PSF interpolation, after combining a complete 10-year dataset from LSST, is lower than the corresponding statistical uncertainties on the cosmic shear power spectrum, even under a conservative scenario.
△ Less
Submitted 12 November, 2012; v1 submitted 6 June, 2012;
originally announced June 2012.
-
Spurious Shear in Weak Lensing with LSST
Authors:
C. Chang,
S. M. Kahn,
J. G. Jernigan,
J. R. Peterson,
Y. AlSayyad,
Z. Ahmad,
J. Bankert,
D. Bard,
A. Connolly,
R. R. Gibson,
K. Gilmore,
E. Grace,
M. Hannel,
M. A. Hodge,
M. J. Jee,
L. Jones,
S. Krughoff,
S. Lorenz,
P. J. Marshall,
S. Marshall,
A. Meert,
S. Nagarajan,
E. Peng,
A. P. Rasmussen,
M. Shmakova
, et al. (3 additional authors not shown)
Abstract:
The complete 10-year survey from the Large Synoptic Survey Telescope (LSST) will image $\sim$ 20,000 square degrees of sky in six filter bands every few nights, bringing the final survey depth to $r\sim27.5$, with over 4 billion well measured galaxies. To take full advantage of this unprecedented statistical power, the systematic errors associated with weak lensing measurements need to be controll…
▽ More
The complete 10-year survey from the Large Synoptic Survey Telescope (LSST) will image $\sim$ 20,000 square degrees of sky in six filter bands every few nights, bringing the final survey depth to $r\sim27.5$, with over 4 billion well measured galaxies. To take full advantage of this unprecedented statistical power, the systematic errors associated with weak lensing measurements need to be controlled to a level similar to the statistical errors.
This work is the first attempt to quantitatively estimate the absolute level and statistical properties of the systematic errors on weak lensing shear measurements due to the most important physical effects in the LSST system via high fidelity ray-tracing simulations. We identify and isolate the different sources of algorithm-independent, \textit{additive} systematic errors on shear measurements for LSST and predict their impact on the final cosmic shear measurements using conventional weak lensing analysis techniques. We find that the main source of the errors comes from an inability to adequately characterise the atmospheric point spread function (PSF) due to its high frequency spatial variation on angular scales smaller than $\sim10'$ in the single short exposures, which propagates into a spurious shear correlation function at the $10^{-4}$--$10^{-3}$ level on these scales. With the large multi-epoch dataset that will be acquired by LSST, the stochastic errors average out, bringing the final spurious shear correlation function to a level very close to the statistical errors. Our results imply that the cosmological constraints from LSST will not be severely limited by these algorithm-independent, additive systematic effects.
△ Less
Submitted 16 October, 2012; v1 submitted 6 June, 2012;
originally announced June 2012.
-
Model for Predicting End User Web Page Response Time
Authors:
Sathya Narayanan Nagarajan,
Srijith Ravikumar
Abstract:
Perceived responsiveness of a web page is one of the most important and least understood metrics of web page design, and is critical for attracting and maintaining a large audience. Web pages can be designed to meet performance SLAs early in the product lifecycle if there is a way to predict the apparent responsiveness of a particular page layout. Response time of a web page is largely influenced…
▽ More
Perceived responsiveness of a web page is one of the most important and least understood metrics of web page design, and is critical for attracting and maintaining a large audience. Web pages can be designed to meet performance SLAs early in the product lifecycle if there is a way to predict the apparent responsiveness of a particular page layout. Response time of a web page is largely influenced by page layout and various network characteristics. Since the network characteristics vary widely from country to country, accurately modeling and predicting the perceived responsiveness of a web page from the end user's perspective has traditionally proven very difficult. We propose a model for predicting end user web page response time based on web page, network, browser download and browser rendering characteristics. We start by understanding the key parameters that affect perceived response time. We then model each of these parameters individually using experimental tests and statistical techniques. Finally, we demonstrate the effectiveness of this model by conducting an experimental study with Yahoo! web pages in two countries and compare it with 3rd party measurement application.
△ Less
Submitted 27 April, 2012;
originally announced April 2012.