-
Learning Optimal Filters Using Variational Inference
Authors:
Enoch Luk,
Eviatar Bach,
Ricardo Baptista,
Andrew Stuart
Abstract:
Filtering-the task of estimating the conditional distribution of states of a dynamical system given partial, noisy, observations-is important in many areas of science and engineering, including weather and climate prediction. However, the filtering distribution is generally intractable to obtain for high-dimensional, nonlinear systems. Filters used in practice, such as the ensemble Kalman filter (…
▽ More
Filtering-the task of estimating the conditional distribution of states of a dynamical system given partial, noisy, observations-is important in many areas of science and engineering, including weather and climate prediction. However, the filtering distribution is generally intractable to obtain for high-dimensional, nonlinear systems. Filters used in practice, such as the ensemble Kalman filter (EnKF), are biased for nonlinear systems and have numerous tuning parameters. Here, we present a framework for learning a parameterized analysis map-the map that takes a forecast distribution and observations to the filtering distribution-using variational inference. We show that this methodology can be used to learn gain matrices for filtering linear and nonlinear dynamical systems, as well as inflation and localization parameters for an EnKF. Future work will apply this framework to learn new filtering algorithms.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Coupled Input-Output Dimension Reduction: Application to Goal-oriented Bayesian Experimental Design and Global Sensitivity Analysis
Authors:
Qiao Chen,
Elise Arnaud,
Ricardo Baptista,
Olivier Zahm
Abstract:
We introduce a new method to jointly reduce the dimension of the input and output space of a high-dimensional function. Choosing a reduced input subspace influences which output subspace is relevant and vice versa. Conventional methods focus on reducing either the input or output space, even though both are often reduced simultaneously in practice. Our coupled approach naturally supports goal-orie…
▽ More
We introduce a new method to jointly reduce the dimension of the input and output space of a high-dimensional function. Choosing a reduced input subspace influences which output subspace is relevant and vice versa. Conventional methods focus on reducing either the input or output space, even though both are often reduced simultaneously in practice. Our coupled approach naturally supports goal-oriented dimension reduction, where either an input or output quantity of interest is prescribed. We consider, in particular, goal-oriented sensor placement and goal-oriented sensitivity analysis, which can be viewed as dimension reduction where the most important output or, respectively, input components are chosen. Both applications present difficult combinatorial optimization problems with expensive objectives such as the expected information gain and Sobol indices. By optimizing gradient-based bounds, we can determine the most informative sensors and most sensitive parameters as the largest diagonal entries of some diagnostic matrices, thus bypassing the combinatorial optimization and objective evaluation.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Neural Approximate Mirror Maps for Constrained Diffusion Models
Authors:
Berthy T. Feng,
Ricardo Baptista,
Katherine L. Bouman
Abstract:
Diffusion models excel at creating visually-convincing images, but they often struggle to meet subtle constraints inherent in the training data. Such constraints could be physics-based (e.g., satisfying a PDE), geometric (e.g., respecting symmetry), or semantic (e.g., including a particular number of objects). When the training data all satisfy a certain constraint, enforcing this constraint on a…
▽ More
Diffusion models excel at creating visually-convincing images, but they often struggle to meet subtle constraints inherent in the training data. Such constraints could be physics-based (e.g., satisfying a PDE), geometric (e.g., respecting symmetry), or semantic (e.g., including a particular number of objects). When the training data all satisfy a certain constraint, enforcing this constraint on a diffusion model not only improves its distribution-matching accuracy but also makes it more reliable for generating valid synthetic data and solving constrained inverse problems. However, existing methods for constrained diffusion models are inflexible with different types of constraints. Recent work proposed to learn mirror diffusion models (MDMs) in an unconstrained space defined by a mirror map and to impose the constraint with an inverse mirror map, but analytical mirror maps are challenging to derive for complex constraints. We propose neural approximate mirror maps (NAMMs) for general constraints. Our approach only requires a differentiable distance function from the constraint set. We learn an approximate mirror map that pushes data into an unconstrained space and a corresponding approximate inverse that maps data back to the constraint set. A generative model, such as an MDM, can then be trained in the learned mirror space and its samples restored to the constraint set by the inverse map. We validate our approach on a variety of constraints, showing that compared to an unconstrained diffusion model, a NAMM-based MDM substantially improves constraint satisfaction. We also demonstrate how existing diffusion-based inverse-problem solvers can be easily applied in the learned mirror space to solve constrained inverse problems.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Computational Hypergraph Discovery, a Gaussian Process framework for connecting the dots
Authors:
Théo Bourdais,
Pau Batlle,
Xian** Yang,
Ricardo Baptista,
Nicolas Rouquette,
Houman Owhadi
Abstract:
Most scientific challenges can be framed into one of the following three levels of complexity of function approximation. Type 1: Approximate an unknown function given input/output data. Type 2: Consider a collection of variables and functions, some of which are unknown, indexed by the nodes and hyperedges of a hypergraph (a generalized graph where edges can connect more than two vertices). Given p…
▽ More
Most scientific challenges can be framed into one of the following three levels of complexity of function approximation. Type 1: Approximate an unknown function given input/output data. Type 2: Consider a collection of variables and functions, some of which are unknown, indexed by the nodes and hyperedges of a hypergraph (a generalized graph where edges can connect more than two vertices). Given partial observations of the variables of the hypergraph (satisfying the functional dependencies imposed by its structure), approximate all the unobserved variables and unknown functions. Type 3: Expanding on Type 2, if the hypergraph structure itself is unknown, use partial observations of the variables of the hypergraph to discover its structure and approximate its unknown functions. While most Computational Science and Engineering and Scientific Machine Learning challenges can be framed as Type 1 and Type 2 problems, many scientific problems can only be categorized as Type 3. Despite their prevalence, these Type 3 challenges have been largely overlooked due to their inherent complexity. Although Gaussian Process (GP) methods are sometimes perceived as well-founded but old technology limited to Type 1 curve fitting, their scope has recently been expanded to Type 2 problems. In this paper, we introduce an interpretable GP framework for Type 3 problems, targeting the data-driven discovery and completion of computational hypergraphs. Our approach is based on a kernel generalization of Row Echelon Form reduction from linear systems to nonlinear ones and variance-based analysis. Here, variables are linked via GPs and those contributing to the highest data variance unveil the hypergraph's structure. We illustrate the scope and efficiency of the proposed approach with applications to (algebraic) equation discovery, network discovery (gene pathways, chemical, and mechanical) and raw data analysis.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Structured Neural Networks for Density Estimation and Causal Inference
Authors:
Asic Q. Chen,
Ruian Shi,
Xiang Gao,
Ricardo Baptista,
Rahul G. Krishnan
Abstract:
Injecting structure into neural networks enables learning functions that satisfy invariances with respect to subsets of inputs. For instance, when learning generative models using neural networks, it is advantageous to encode the conditional independence structure of observed variables, often in the form of Bayesian networks. We propose the Structured Neural Network (StrNN), which injects structur…
▽ More
Injecting structure into neural networks enables learning functions that satisfy invariances with respect to subsets of inputs. For instance, when learning generative models using neural networks, it is advantageous to encode the conditional independence structure of observed variables, often in the form of Bayesian networks. We propose the Structured Neural Network (StrNN), which injects structure through masking pathways in a neural network. The masks are designed via a novel relationship we explore between neural network architectures and binary matrix factorization, to ensure that the desired independencies are respected. We devise and study practical algorithms for this otherwise NP-hard design problem based on novel objectives that control the model architecture. We demonstrate the utility of StrNN in three applications: (1) binary and Gaussian density estimation with StrNN, (2) real-valued density estimation with Structured Autoregressive Flows (StrAFs) and Structured Continuous Normalizing Flows (StrCNF), and (3) interventional and counterfactual analysis with StrAFs for causal inference. Our work opens up new avenues for learning neural networks that enable data-efficient generative modeling and the use of normalizing flows for causal effect estimation.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Distributed Nonlinear Filtering using Triangular Transport Maps
Authors:
Daniel Grange,
Ricardo Baptista,
Amirhossein Taghvaei,
Allen Tannenbaum,
Sean Phillips
Abstract:
The distributed filtering problem sequentially estimates a global state variable using observations from a network of local sensors with different measurement models. In this work, we introduce a novel methodology for distributed nonlinear filtering by combining techniques from transportation of measures, dimensionality reduction, and consensus algorithms. We illustrate our methodology on a satell…
▽ More
The distributed filtering problem sequentially estimates a global state variable using observations from a network of local sensors with different measurement models. In this work, we introduce a novel methodology for distributed nonlinear filtering by combining techniques from transportation of measures, dimensionality reduction, and consensus algorithms. We illustrate our methodology on a satellite pose estimation problem from a network of direct and indirect observers. The numerical results serve as a proof of concept, offering new venues for theoretical and applied research in the domain of distributed filtering.
△ Less
Submitted 29 October, 2023;
originally announced October 2023.
-
Efficient Neural Network Approaches for Conditional Optimal Transport with Applications in Bayesian Inference
Authors:
Zheyu Oliver Wang,
Ricardo Baptista,
Youssef Marzouk,
Lars Ruthotto,
Deepanshu Verma
Abstract:
We present two neural network approaches that approximate the solutions of static and dynamic conditional optimal transport (COT) problems, respectively. Both approaches enable sampling and density estimation of conditional probability distributions, which are core tasks in Bayesian inference. Our methods represent the target conditional distributions as transformations of a tractable reference di…
▽ More
We present two neural network approaches that approximate the solutions of static and dynamic conditional optimal transport (COT) problems, respectively. Both approaches enable sampling and density estimation of conditional probability distributions, which are core tasks in Bayesian inference. Our methods represent the target conditional distributions as transformations of a tractable reference distribution and, therefore, fall into the framework of measure transport. COT maps are a canonical choice within this framework, with desirable properties such as uniqueness and monotonicity. However, the associated COT problems are computationally challenging, even in moderate dimensions. To improve the scalability, our numerical algorithms leverage neural networks to parameterize COT maps. Our methods exploit the structure of the static and dynamic formulations of the COT problem. PCP-Map models conditional transport maps as the gradient of a partially input convex neural network (PICNN) and uses a novel numerical implementation to increase computational efficiency compared to state-of-the-art alternatives. COT-Flow models conditional transports via the flow of a regularized neural ODE; it is slower to train but offers faster sampling. We demonstrate their effectiveness and efficiency by comparing them with state-of-the-art approaches using benchmark datasets and Bayesian inverse problems.
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
An adaptive ensemble filter for heavy-tailed distributions: tuning-free inflation and localization
Authors:
Mathieu Le Provost,
Ricardo Baptista,
Jeff D. Eldredge,
Youssef Marzouk
Abstract:
Heavy tails is a common feature of filtering distributions that results from the nonlinear dynamical and observation processes as well as the uncertainty from physical sensors. In these settings, the Kalman filter and its ensemble version - the ensemble Kalman filter (EnKF) - that have been designed under Gaussian assumptions result in degraded performance. t-distributions are a parametric family…
▽ More
Heavy tails is a common feature of filtering distributions that results from the nonlinear dynamical and observation processes as well as the uncertainty from physical sensors. In these settings, the Kalman filter and its ensemble version - the ensemble Kalman filter (EnKF) - that have been designed under Gaussian assumptions result in degraded performance. t-distributions are a parametric family of distributions whose tail-heaviness is modulated by a degree of freedom $ν$. Interestingly, Cauchy and Gaussian distributions correspond to the extreme cases of a t-distribution for $ν= 1$ and $ν= \infty$, respectively. Leveraging tools from measure transport (Spantini et al., SIAM Review, 2022), we present a generalization of the EnKF whose prior-to-posterior update leads to exact inference for t-distributions. We demonstrate that this filter is less sensitive to outlying synthetic observations generated by the observation model for small $ν$. Moreover, it recovers the Kalman filter for $ν= \infty$. For nonlinear state-space models with heavy-tailed noise, we propose an algorithm to estimate the prior-to-posterior update from samples of joint forecast distribution of the states and observations. We rely on a regularized expectation-maximization (EM) algorithm to estimate the mean, scale matrix, and degree of freedom of heavy-tailed \textit{t}-distributions from limited samples (Finegold and Drton, arXiv preprint, 2014). Leveraging the conditional independence of the joint forecast distribution, we regularize the scale matrix with an $l1$ sparsity-promoting penalization of the log-likelihood at each iteration of the EM algorithm. By sequentially estimating the degree of freedom at each analysis step, our filter can adapt its prior-to-posterior update to the tail-heaviness of the data. We demonstrate the benefits of this new ensemble filter on challenging filtering problems.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Computational Optimal Transport and Filtering on Riemannian manifolds
Authors:
Daniel Grange,
Mohammad Al-Jarrah,
Ricardo Baptista,
Amirhossein Taghvaei,
Tryphon T. Georgiou,
Sean Phillips,
Allen Tannenbaum
Abstract:
In this paper we extend recent developments in computational optimal transport to the setting of Riemannian manifolds. In particular, we show how to learn optimal transport maps from samples that relate probability distributions defined on manifolds. Specializing these maps for sampling conditional probability distributions provides an ensemble approach for solving nonlinear filtering problems def…
▽ More
In this paper we extend recent developments in computational optimal transport to the setting of Riemannian manifolds. In particular, we show how to learn optimal transport maps from samples that relate probability distributions defined on manifolds. Specializing these maps for sampling conditional probability distributions provides an ensemble approach for solving nonlinear filtering problems defined on such geometries. The proposed computational methodology is illustrated with examples of transport and nonlinear filtering on Lie groups, including the circle $S^1$, the special Euclidean group $SE(2)$, and the special orthogonal group $SO(3)$.
△ Less
Submitted 29 October, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Evidence of fractal structures in hadrons
Authors:
Rafael P. Baptista,
Lucas Q. Rocha,
D. P. Menezes,
Luis A. Trevisan,
Constantino Tsallis,
Airton Deppman
Abstract:
This study focuses on the presence of (multi)fractal structures in confined hadronic matter through the momentum distributions of mesons produced in proton-proton collisions between 23 GeV and 63 GeV. The analysis demonstrates that the $q$-exponential behaviour of the particle momentum distributions is consistent with fractal characteristics, exhibiting fractal structures in confined hadronic matt…
▽ More
This study focuses on the presence of (multi)fractal structures in confined hadronic matter through the momentum distributions of mesons produced in proton-proton collisions between 23 GeV and 63 GeV. The analysis demonstrates that the $q$-exponential behaviour of the particle momentum distributions is consistent with fractal characteristics, exhibiting fractal structures in confined hadronic matter with features similar to those observed in the deconfined quark-gluon plasma (QGP) regime. Furthermore, the systematic analysis of meson production in hadronic collisions at energies below 1 TeV suggests that specific fractal parameters are universal, independently of confinement or deconfinement, while others may be influenced by the quark content of the produced meson. These results pave the way for further research exploring the implications of fractal structures on various physical distributions and offer insights into the nature of the phase transition between confined and deconfined regimes.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
A generative flow for conditional sampling via optimal transport
Authors:
Jason Alfonso,
Ricardo Baptista,
Anupam Bhakta,
Noam Gal,
Alfin Hou,
Isa Lyubimova,
Daniel Pocklington,
Josef Sajonz,
Giulio Trigila,
Ryan Tsai
Abstract:
Sampling conditional distributions is a fundamental task for Bayesian inference and density estimation. Generative models, such as normalizing flows and generative adversarial networks, characterize conditional distributions by learning a transport map that pushes forward a simple reference (e.g., a standard Gaussian) to a target distribution. While these approaches successfully describe many non-…
▽ More
Sampling conditional distributions is a fundamental task for Bayesian inference and density estimation. Generative models, such as normalizing flows and generative adversarial networks, characterize conditional distributions by learning a transport map that pushes forward a simple reference (e.g., a standard Gaussian) to a target distribution. While these approaches successfully describe many non-Gaussian problems, their performance is often limited by parametric bias and the reliability of gradient-based (adversarial) optimizers to learn these transformations. This work proposes a non-parametric generative model that iteratively maps reference samples to the target. The model uses block-triangular transport maps, whose components are shown to characterize conditionals of the target distribution. These maps arise from solving an optimal transport problem with a weighted $L^2$ cost function, thereby extending the data-driven approach in [Trigila and Tabak, 2016] for conditional sampling. The proposed approach is demonstrated on a two dimensional example and on a parameter inference problem involving nonlinear ODEs.
△ Less
Submitted 9 July, 2023;
originally announced July 2023.
-
Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models
Authors:
Zhong Yi Wan,
Ricardo Baptista,
Yi-fan Chen,
John Anderson,
Anudhyan Boral,
Fei Sha,
Leonardo Zepeda-Núñez
Abstract:
We introduce a two-stage probabilistic framework for statistical downscaling using unpaired data. Statistical downscaling seeks a probabilistic map to transform low-resolution data from a biased coarse-grained numerical scheme to high-resolution data that is consistent with a high-fidelity scheme. Our framework tackles the problem by composing two transformations: (i) a debiasing step via an optim…
▽ More
We introduce a two-stage probabilistic framework for statistical downscaling using unpaired data. Statistical downscaling seeks a probabilistic map to transform low-resolution data from a biased coarse-grained numerical scheme to high-resolution data that is consistent with a high-fidelity scheme. Our framework tackles the problem by composing two transformations: (i) a debiasing step via an optimal transport map, and (ii) an upsampling step achieved by a probabilistic diffusion model with a posteriori conditional sampling. This approach characterizes a conditional distribution without needing paired data, and faithfully recovers relevant physical statistics from biased samples. We demonstrate the utility of the proposed approach on one- and two-dimensional fluid flow problems, which are representative of the core difficulties present in numerical simulations of weather and climate. Our method produces realistic high-resolution outputs from low-resolution inputs, by upsampling resolutions of 8x and 16x. Moreover, our procedure correctly matches the statistics of physical quantities, even when the low-frequency content of the inputs and outputs do not match, a crucial but difficult-to-satisfy assumption needed by current state-of-the-art alternatives. Code for this work is available at: https://github.com/google-research/swirl-dynamics/tree/main/swirl_dynamics/projects/probabilistic_diffusion.
△ Less
Submitted 30 October, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
An Approximation Theory Framework for Measure-Transport Sampling Algorithms
Authors:
Ricardo Baptista,
Bamdad Hosseini,
Nikola B. Kovachki,
Youssef M. Marzouk,
Amir Sagiv
Abstract:
This article presents a general approximation-theoretic framework to analyze measure transport algorithms for probabilistic modeling. A primary motivating application for such algorithms is sampling -- a central task in statistical inference and generative modeling. We provide a priori error estimates in the continuum limit, i.e., when the measures (or their densities) are given, but when the tran…
▽ More
This article presents a general approximation-theoretic framework to analyze measure transport algorithms for probabilistic modeling. A primary motivating application for such algorithms is sampling -- a central task in statistical inference and generative modeling. We provide a priori error estimates in the continuum limit, i.e., when the measures (or their densities) are given, but when the transport map is discretized or approximated using a finite-dimensional function space. Our analysis relies on the regularity theory of transport maps and on classical approximation theory for high-dimensional functions. A third element of our analysis, which is of independent interest, is the development of new stability estimates that relate the distance between two maps to the distance~(or divergence) between the pushforward measures they define. We present a series of applications of our framework, where quantitative convergence rates are obtained for practical problems using Wasserstein metrics, maximum mean discrepancy, and Kullback--Leibler divergence. Specialized rates for approximations of the popular triangular Kn{ö}the-Rosenblatt maps are obtained, followed by numerical experiments that demonstrate and extend our theory.
△ Less
Submitted 29 June, 2023; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Score-based Diffusion Models in Function Space
Authors:
Jae Hyun Lim,
Nikola B. Kovachki,
Ricardo Baptista,
Christopher Beckham,
Kamyar Azizzadenesheli,
Jean Kossaifi,
Vikram Voleti,
Jiaming Song,
Karsten Kreis,
Jan Kautz,
Christopher Pal,
Arash Vahdat,
Anima Anandkumar
Abstract:
Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many…
▽ More
Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many domains where the data has a functional form such as in scientific computing and 3D geometric data analysis. In this work, we introduce a mathematically rigorous framework called Denoising Diffusion Operators (DDOs) for training diffusion models in function space. In DDOs, the forward process perturbs input functions gradually using a Gaussian process. The generative process is formulated by integrating a function-valued Langevin dynamic. Our approach requires an appropriate notion of the score for the perturbed data distribution, which we obtain by generalizing denoising score matching to function spaces that can be infinite-dimensional. We show that the corresponding discretized algorithm generates accurate samples at a fixed cost that is independent of the data resolution. We theoretically and numerically verify the applicability of our approach on a set of problems, including generating solutions to the Navier-Stokes equation viewed as the push-forward distribution of forcings from a Gaussian Random Field (GRF).
△ Less
Submitted 22 November, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.
-
Ensemble transport smoothing. Part II: Nonlinear updates
Authors:
Maximilian Ramgraber,
Ricardo Baptista,
Dennis McLaughlin,
Youssef Marzouk
Abstract:
Smoothing is a specialized form of Bayesian inference for state-space models that characterizes the posterior distribution of a collection of states given an associated sequence of observations. Ramgraber et al. (2023) proposes a general framework for transport-based ensemble smoothing, which includes linear Kalman-type smoothers as special cases. Here, we build on this foundation to realize and d…
▽ More
Smoothing is a specialized form of Bayesian inference for state-space models that characterizes the posterior distribution of a collection of states given an associated sequence of observations. Ramgraber et al. (2023) proposes a general framework for transport-based ensemble smoothing, which includes linear Kalman-type smoothers as special cases. Here, we build on this foundation to realize and demonstrate nonlinear backward ensemble transport smoothers. We discuss parameterization and regularization of the associated transport maps, and then examine the performance of these smoothers for nonlinear and chaotic dynamical systems that exhibit non-Gaussian behavior. In these settings, our nonlinear transport smoothers yield lower estimation error than conventional linear smoothers and state-of-the-art iterative ensemble Kalman smoothers, for comparable numbers of model evaluations.
△ Less
Submitted 22 November, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Ensemble transport smoothing. Part I: Unified framework
Authors:
Maximilian Ramgraber,
Ricardo Baptista,
Dennis McLaughlin,
Youssef Marzouk
Abstract:
Smoothers are algorithms for Bayesian time series re-analysis. Most operational smoothers rely either on affine Kalman-type transformations or on sequential importance sampling. These strategies occupy opposite ends of a spectrum that trades computational efficiency and scalability for statistical generality and consistency: non-Gaussianity renders affine Kalman updates inconsistent with the true…
▽ More
Smoothers are algorithms for Bayesian time series re-analysis. Most operational smoothers rely either on affine Kalman-type transformations or on sequential importance sampling. These strategies occupy opposite ends of a spectrum that trades computational efficiency and scalability for statistical generality and consistency: non-Gaussianity renders affine Kalman updates inconsistent with the true Bayesian solution, while the ensemble size required for successful importance sampling can be prohibitive. This paper revisits the smoothing problem from the perspective of measure transport, which offers the prospect of consistent prior-to-posterior transformations for Bayesian inference. We leverage this capacity by proposing a general ensemble framework for transport-based smoothing. Within this framework, we derive a comprehensive set of smoothing recursions based on nonlinear transport maps and detail how they exploit the structure of state-space models in fully non-Gaussian settings. We also describe how many standard Kalman-type smoothing algorithms emerge as special cases of our framework. A companion paper (Ramgraber et al., 2023) explores the implementation of nonlinear ensemble transport smoothers in greater depth.
△ Less
Submitted 22 November, 2023; v1 submitted 30 October, 2022;
originally announced October 2022.
-
Gradient-based data and parameter dimension reduction for Bayesian models: an information theoretic perspective
Authors:
Ricardo Baptista,
Youssef Marzouk,
Olivier Zahm
Abstract:
We consider the problem of reducing the dimensions of parameters and data in non-Gaussian Bayesian inference problems. Our goal is to identify an "informed" subspace of the parameters and an "informative" subspace of the data so that a high-dimensional inference problem can be approximately reformulated in low-to-moderate dimensions, thereby improving the computational efficiency of many inference…
▽ More
We consider the problem of reducing the dimensions of parameters and data in non-Gaussian Bayesian inference problems. Our goal is to identify an "informed" subspace of the parameters and an "informative" subspace of the data so that a high-dimensional inference problem can be approximately reformulated in low-to-moderate dimensions, thereby improving the computational efficiency of many inference techniques. To do so, we exploit gradient evaluations of the log-likelihood function. Furthermore, we use an information-theoretic analysis to derive a bound on the posterior error due to parameter and data dimension reduction. This bound relies on logarithmic Sobolev inequalities, and it reveals the appropriate dimensions of the reduced variables. We compare our method with classical dimension reduction techniques, such as principal component analysis and canonical correlation analysis, on applications ranging from mechanics to image processing.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
Bayesian model calibration for block copolymer self-assembly: Likelihood-free inference and expected information gain computation via measure transport
Authors:
Ricardo Baptista,
Lianghao Cao,
Joshua Chen,
Omar Ghattas,
Fengyi Li,
Youssef M. Marzouk,
J. Tinsley Oden
Abstract:
We consider the Bayesian calibration of models describing the phenomenon of block copolymer (BCP) self-assembly using image data produced by microscopy or X-ray scattering techniques. To account for the random long-range disorder in BCP equilibrium structures, we introduce auxiliary variables to represent this aleatory uncertainty. These variables, however, result in an integrated likelihood for h…
▽ More
We consider the Bayesian calibration of models describing the phenomenon of block copolymer (BCP) self-assembly using image data produced by microscopy or X-ray scattering techniques. To account for the random long-range disorder in BCP equilibrium structures, we introduce auxiliary variables to represent this aleatory uncertainty. These variables, however, result in an integrated likelihood for high-dimensional image data that is generally intractable to evaluate. We tackle this challenging Bayesian inference problem using a likelihood-free approach based on measure transport together with the construction of summary statistics for the image data. We also show that expected information gains (EIGs) from the observed data about the model parameters can be computed with no significant additional cost. Lastly, we present a numerical case study based on the Ohta--Kawasaki model for diblock copolymer thin film self-assembly and top-down microscopy characterization. For calibration, we introduce several domain-specific energy- and Fourier-based summary statistics, and quantify their informativeness using EIG. We demonstrate the power of the proposed approach to study the effect of data corruptions and experimental designs on the calibration results.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
A low-rank ensemble Kalman filter for elliptic observations
Authors:
Mathieu Le Provost,
Ricardo Baptista,
Youssef Marzouk,
Jeff D. Eldredge
Abstract:
We propose a regularization method for ensemble Kalman filtering (EnKF) with elliptic observation operators. Commonly used EnKF regularization methods suppress state correlations at long distances. For observations described by elliptic partial differential equations, such as the pressure Poisson equation (PPE) in incompressible fluid flows, distance localization cannot be applied, as we cannot di…
▽ More
We propose a regularization method for ensemble Kalman filtering (EnKF) with elliptic observation operators. Commonly used EnKF regularization methods suppress state correlations at long distances. For observations described by elliptic partial differential equations, such as the pressure Poisson equation (PPE) in incompressible fluid flows, distance localization cannot be applied, as we cannot disentangle slowly decaying physical interactions from spurious long-range correlations. This is particularly true for the PPE, in which distant vortex elements couple nonlinearly to induce pressure. Instead, these inverse problems have a low effective dimension: low-dimensional projections of the observations strongly inform a low-dimensional subspace of the state space. We derive a low-rank factorization of the Kalman gain based on the spectrum of the Jacobian of the observation operator. The identified eigenvectors generalize the source and target modes of the multipole expansion, independently of the underlying spatial distribution of the problem. Given rapid spectral decay, inference can be performed in the low-dimensional subspace spanned by the dominant eigenvectors. This low-rank EnKF is assessed on dynamical systems with Poisson observation operators, where we seek to estimate the positions and strengths of point singularities over time from potential or pressure observations. We also comment on the broader applicability of this approach to elliptic inverse problems outside the context of filtering.
△ Less
Submitted 24 October, 2022; v1 submitted 9 March, 2022;
originally announced March 2022.
-
Challenging the disk instability model: I -- The case of YZ LMi
Authors:
Raymundo Baptista,
Wagner Schlindwein
Abstract:
Observations of YZ LMi show enhanced emission along the stream trajectory beyond impact at disk rim during outbursts as well as when the quiescent disk is large. We investigated whether these features can be explained in terms of either gas stream overflow or penetration within the frameworks of the disk-instability (DIM) and the mass-transfer instability (MTIM) models of outbursting disks. Gas st…
▽ More
Observations of YZ LMi show enhanced emission along the stream trajectory beyond impact at disk rim during outbursts as well as when the quiescent disk is large. We investigated whether these features can be explained in terms of either gas stream overflow or penetration within the frameworks of the disk-instability (DIM) and the mass-transfer instability (MTIM) models of outbursting disks. Gas stream overflow is not possible because the vertical scaleheight of the stream is significantly lower than that of the outer disk and because there is no combination of parameters which enables stream overflow on a larger disk while preventing it on a smaller disk. Stream penetration requires the gas stream to be denser than the outer disk regions. This requirement cannot be met by a low-viscosity DIM disk because its density is significantly larger than that of the gas stream over the whole range of mass transfer rates where the thermal-viscous instability occurs. On the other hand, the high-viscosity MTIM disk has much lower densities which decrease with increasing radius, easily allowing for gas stream penetration during outbursts (when mass transfer rate and stream density increase) as well as in large quiescent disks. The observed features are not consistent with DIM, but can be plausibly explained by MTIM. These results suggest that the outbursts of YZ LMi are the response of a high-viscosity disk to bursts of enhanced mass transfer rate. In this case, the outburst decline timescale of (2-3) d implies a viscosity parameter in the range alpha=3-4.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
Diagonal Nonlinear Transformations Preserve Structure in Covariance and Precision Matrices
Authors:
Rebecca E Morrison,
Ricardo Baptista,
Estelle L Basor
Abstract:
For a multivariate normal distribution, the sparsity of the covariance and precision matrices encodes complete information about independence and conditional independence properties. For general distributions, the covariance and precision matrices reveal correlations and so-called partial correlations between variables, but these do not, in general, have any correspondence with respect to independ…
▽ More
For a multivariate normal distribution, the sparsity of the covariance and precision matrices encodes complete information about independence and conditional independence properties. For general distributions, the covariance and precision matrices reveal correlations and so-called partial correlations between variables, but these do not, in general, have any correspondence with respect to independence properties. In this paper, we prove that, for a certain class of non-Gaussian distributions, these correspondences still hold, exactly for the covariance and approximately for the precision. The distributions -- sometimes referred to as "nonparanormal" -- are given by diagonal transformations of multivariate normal random variables. We provide several analytic and numerical examples illustrating these results.
△ Less
Submitted 20 September, 2021; v1 submitted 8 July, 2021;
originally announced July 2021.
-
Search for magnetic accretion in SW Sextantis systems
Authors:
I. J. Lima,
C. V. Rodrigues,
C. E. Ferreira Lopes,
P. Szkody,
F. J. Jablonski,
A. S. Oliveira,
K. M. G. Silva,
D. Belloni,
M. S. Palhares,
S. Shugarov,
R. Baptista,
L. A. Almeida
Abstract:
SW Sextantis systems are nova-like cataclysmic variables that have unusual spectroscopic properties, which are thought to be caused by an accretion geometry having part of the mass flux trajectory out of the orbital plane. Accretion onto a magnetic white dwarf is one of the proposed scenarios for these systems. To verify this possibility, we analysed photometric and polarimetric time-series data f…
▽ More
SW Sextantis systems are nova-like cataclysmic variables that have unusual spectroscopic properties, which are thought to be caused by an accretion geometry having part of the mass flux trajectory out of the orbital plane. Accretion onto a magnetic white dwarf is one of the proposed scenarios for these systems. To verify this possibility, we analysed photometric and polarimetric time-series data for a sample of six SW Sex stars. We report possible modulated circular polarization in BO Cet, SW Sex, and UU Aqr with periods of 11.1, 41.2 and 25.7 min, respectively, and less significant periodicities for V380 Oph at 22 min and V442 Oph at 19.4 min. We confirm previous results that LS Peg shows variable circular polarization. However, we determine a period of 18.8 min, which is different from the earlier reported value. We interpret these periods as the spin periods of the white dwarfs. Our polarimetric results indicate that 15% of the SW Sex systems have direct evidence of magnetic accretion. We also discuss SW Sex objects within the perspective of being magnetic systems, considering the latest findings about cataclysmic variables demography, formation and evolution.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Learning non-Gaussian graphical models via Hessian scores and triangular transport
Authors:
Ricardo Baptista,
Youssef Marzouk,
Rebecca E. Morrison,
Olivier Zahm
Abstract:
Undirected probabilistic graphical models represent the conditional dependencies, or Markov properties, of a collection of random variables. Knowing the sparsity of such a graphical model is valuable for modeling multivariate distributions and for efficiently performing inference. While the problem of learning graph structure from data has been studied extensively for certain parametric families o…
▽ More
Undirected probabilistic graphical models represent the conditional dependencies, or Markov properties, of a collection of random variables. Knowing the sparsity of such a graphical model is valuable for modeling multivariate distributions and for efficiently performing inference. While the problem of learning graph structure from data has been studied extensively for certain parametric families of distributions, most existing methods fail to consistently recover the graph structure for non-Gaussian data. Here we propose an algorithm for learning the Markov structure of continuous and non-Gaussian distributions. To characterize conditional independence, we introduce a score based on integrated Hessian information from the joint log-density, and we prove that this score upper bounds the conditional mutual information for a general class of distributions. To compute the score, our algorithm SING estimates the density using a deterministic coupling, induced by a triangular transport map, and iteratively exploits sparse structure in the map to reveal sparsity in the graph. For certain non-Gaussian datasets, we show that our algorithm recovers the graph structure even with a biased approximation to the density. Among other examples, we apply SING to learn the dependencies between the states of a chaotic dynamical system with local interactions.
△ Less
Submitted 25 February, 2023; v1 submitted 8 January, 2021;
originally announced January 2021.
-
On the representation and learning of monotone triangular transport maps
Authors:
Ricardo Baptista,
Youssef Marzouk,
Olivier Zahm
Abstract:
Transportation of measure provides a versatile approach for modeling complex probability distributions, with applications in density estimation, Bayesian inference, generative modeling, and beyond. Monotone triangular transport maps$\unicode{x2014}$approximations of the Knothe$\unicode{x2013}$Rosenblatt (KR) rearrangement$\unicode{x2014}$are a canonical choice for these tasks. Yet the representati…
▽ More
Transportation of measure provides a versatile approach for modeling complex probability distributions, with applications in density estimation, Bayesian inference, generative modeling, and beyond. Monotone triangular transport maps$\unicode{x2014}$approximations of the Knothe$\unicode{x2013}$Rosenblatt (KR) rearrangement$\unicode{x2014}$are a canonical choice for these tasks. Yet the representation and parameterization of such maps have a significant impact on their generality and expressiveness, and on properties of the optimization problem that arises in learning a map from data (e.g., via maximum likelihood estimation). We present a general framework for representing monotone triangular maps via invertible transformations of smooth functions. We establish conditions on the transformation such that the associated infinite-dimensional minimization problem has no spurious local minima, i.e., all local minima are global minima; and we show for target distributions satisfying certain tail conditions that the unique global minimizer corresponds to the KR map. Given a sample from the target, we then propose an adaptive algorithm that estimates a sparse semi-parametric approximation of the underlying KR map. We demonstrate how this framework can be applied to joint and conditional density estimation, likelihood-free inference, and structure learning of directed graphical models, with stable generalization performance across a range of sample sizes.
△ Less
Submitted 24 February, 2024; v1 submitted 21 September, 2020;
originally announced September 2020.
-
Conditional Sampling with Monotone GANs: from Generative Models to Likelihood-Free Inference
Authors:
Ricardo Baptista,
Bamdad Hosseini,
Nikola B. Kovachki,
Youssef Marzouk
Abstract:
We present a novel framework for conditional sampling of probability measures, using block triangular transport maps. We develop the theoretical foundations of block triangular transport in a Banach space setting, establishing general conditions under which conditional sampling can be achieved and drawing connections between monotone block triangular maps and optimal transport. Based on this theor…
▽ More
We present a novel framework for conditional sampling of probability measures, using block triangular transport maps. We develop the theoretical foundations of block triangular transport in a Banach space setting, establishing general conditions under which conditional sampling can be achieved and drawing connections between monotone block triangular maps and optimal transport. Based on this theory, we then introduce a computational approach, called monotone generative adversarial networks (M-GANs), to learn suitable block triangular maps. Our algorithm uses only samples from the underlying joint probability measure and is hence likelihood-free. Numerical experiments with M-GAN demonstrate accurate sampling of conditional measures in synthetic examples, Bayesian inverse problems involving ordinary and partial differential equations, and probabilistic image in-painting.
△ Less
Submitted 5 June, 2023; v1 submitted 11 June, 2020;
originally announced June 2020.
-
Towards Generalization of 3D Human Pose Estimation In The Wild
Authors:
Renato Baptista,
Alexandre Saint,
Kassem Al Ismaeil,
Djamila Aouada
Abstract:
In this paper, we propose 3DBodyTex.Pose, a dataset that addresses the task of 3D human pose estimation in-the-wild. Generalization to in-the-wild images remains limited due to the lack of adequate datasets. Existent ones are usually collected in indoor controlled environments where motion capture systems are used to obtain the 3D ground-truth annotations of humans. 3DBodyTex.Pose offers high qual…
▽ More
In this paper, we propose 3DBodyTex.Pose, a dataset that addresses the task of 3D human pose estimation in-the-wild. Generalization to in-the-wild images remains limited due to the lack of adequate datasets. Existent ones are usually collected in indoor controlled environments where motion capture systems are used to obtain the 3D ground-truth annotations of humans. 3DBodyTex.Pose offers high quality and rich data containing 405 different real subjects in various clothing and poses, and 81k image samples with ground-truth 2D and 3D pose annotations. These images are generated from 200 viewpoints among which 70 challenging extreme viewpoints. This data was created starting from high resolution textured 3D body scans and by incorporating various realistic backgrounds. Retraining a state-of-the-art 3D pose estimation approach using data augmented with 3DBodyTex.Pose showed promising improvement in the overall performance, and a sensible decrease in the per joint position error when testing on challenging viewpoints. The 3DBodyTex.Pose is expected to offer the research community with new possibilities for generalizing 3D pose estimation from monocular in-the-wild images.
△ Less
Submitted 21 April, 2020;
originally announced April 2020.
-
An Ostrogradsky Instability Analysis of Non-minimally Coupled Weyl Connection Gravity Theories
Authors:
Rodrigo Baptista,
Orfeu Bertolami
Abstract:
We study the Hamiltonian formalism of the non-minimally coupled Weyl connection gravity (NMCWCG) in order to check whether Ostrogradsky instabilities are present. The Hamiltonian of the NMCWCG theories is obtained by foliating space-time into a real line (representing time) and \mbox{3-dimensional} space-like hypersurfaces, and by considering the spatial metric and the extrinsic curvature of the h…
▽ More
We study the Hamiltonian formalism of the non-minimally coupled Weyl connection gravity (NMCWCG) in order to check whether Ostrogradsky instabilities are present. The Hamiltonian of the NMCWCG theories is obtained by foliating space-time into a real line (representing time) and \mbox{3-dimensional} space-like hypersurfaces, and by considering the spatial metric and the extrinsic curvature of the hypersurfaces as the canonical coordinates of the theory. Given the fact that the theory we study contains an additional dynamical vector field compared to the usual NMC models, which do not have Ostrogradsky instabilities, we are able to construct an effective theory without these instabilities, by constraining this Weyl field.
△ Less
Submitted 22 January, 2021; v1 submitted 1 April, 2020;
originally announced April 2020.
-
Infrared photometry of the dwarf nova V2051 Ophiuchi: II -- The quiescent accretion disc and its spiral arms
Authors:
Raymundo Baptista,
Eduardo Wojcikiewicz
Abstract:
We report the analysis of time-series of infrared $JHK_s$ photometry of the dwarf nova V2051 Oph in quiescence with eclipse map** techniques to investigate structures and the spectrum of its accretion disc. The light curves after removal of the ellipsoidal variations caused by the mass-donor star show a double-wave modulation signalling the presence of two asymmetric light sources in the accreti…
▽ More
We report the analysis of time-series of infrared $JHK_s$ photometry of the dwarf nova V2051 Oph in quiescence with eclipse map** techniques to investigate structures and the spectrum of its accretion disc. The light curves after removal of the ellipsoidal variations caused by the mass-donor star show a double-wave modulation signalling the presence of two asymmetric light sources in the accretion disc. Eclipse maps reveal two spiral arms on top of the disc emission, one at $R_1= 0.28\pm 0.02 \,R_\mathrm{L1}$ and the other at $R_2= 0.42\pm 0.02 \,R_\mathrm{L1}$ (where $R_\mathrm{L1}$ is the distance from disc centre to the inner Lagrangian point), which are seen face-on at binary phases consistent with the maxima of the double-wave modulation. The wide open angle inferred for the spiral arms ($θ_s= 21^o \pm 4^o$) suggests the quiescent accretion disc of V2051 Oph has high viscosity. The accretion disc is hot and optically thin in its inner regions ($T_\mathrm{gas}\sim 10-12 \times 10^3\,K$ and surface densities $\sim 10^{-3}-10^{-2}\,g\,cm^{-2}$), and becomes cool and opaque in its outer regions.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
Cosmological Solutions of the Non-minimally Coupled Weyl Connection Gravity Theories
Authors:
Rodrigo Baptista,
Orfeu Bertolami
Abstract:
We consider a non-minimally coupled curvature-matter gravity model (NMC) with a Weyl connection, a theory referred to as non-minimally coupled Weyl connection gravity (NMCWCG). The Weyl connection is an affine connection that is not compatible with the metric, and involves a vector field. Assuming a vacuum expectation value for the vector field and a matter Lagrangian that only contains the contri…
▽ More
We consider a non-minimally coupled curvature-matter gravity model (NMC) with a Weyl connection, a theory referred to as non-minimally coupled Weyl connection gravity (NMCWCG). The Weyl connection is an affine connection that is not compatible with the metric, and involves a vector field. Assuming a vacuum expectation value for the vector field and a matter Lagrangian that only contains the contributions of the vacuum energy, we show that the model admits solutions in the space-form with a reference curvature that can be fine-tuned to much smaller values than the contribution of the matter fields. This shows that, at least in principle, the model admits a workable cosmological description.
△ Less
Submitted 27 March, 2020; v1 submitted 12 November, 2019;
originally announced November 2019.
-
Coupling techniques for nonlinear ensemble filtering
Authors:
Alessio Spantini,
Ricardo Baptista,
Youssef Marzouk
Abstract:
We consider filtering in high-dimensional non-Gaussian state-space models with intractable transition kernels, nonlinear and possibly chaotic dynamics, and sparse observations in space and time. We propose a novel filtering methodology that harnesses transportation of measures, convex optimization, and ideas from probabilistic graphical models to yield robust ensemble approximations of the filteri…
▽ More
We consider filtering in high-dimensional non-Gaussian state-space models with intractable transition kernels, nonlinear and possibly chaotic dynamics, and sparse observations in space and time. We propose a novel filtering methodology that harnesses transportation of measures, convex optimization, and ideas from probabilistic graphical models to yield robust ensemble approximations of the filtering distribution in high dimensions. Our approach can be understood as the natural generalization of the ensemble Kalman filter (EnKF) to nonlinear updates, using stochastic or deterministic couplings. The use of nonlinear updates can reduce the intrinsic bias of the EnKF at a marginal increase in computational cost. We avoid any form of importance sampling and introduce non-Gaussian localization approaches for dimension scalability. Our framework achieves state-of-the-art tracking performance on challenging configurations of the Lorenz-96 model in the chaotic regime.
△ Less
Submitted 6 April, 2022; v1 submitted 30 June, 2019;
originally announced July 2019.
-
Constraining a nonminimally coupled curvature-matter gravity model with ocean experiments
Authors:
Riccardo March,
Orfeu Bertolami,
Marco Muccino,
Rodrigo Baptista,
Simone Dell'Agnello
Abstract:
We examine the constraints on the Yukawa regime from the non-minimally coupled curvature-matter gravity theory arising from deep underwater ocean experiments. We consider the geophysical experiment of Zumberge et al. of 1991 for searching deviations of Newton's inverse square law in ocean. In the context of non-minimally coupled curvature-matter theory of gravity the results of Zumberge et al. can…
▽ More
We examine the constraints on the Yukawa regime from the non-minimally coupled curvature-matter gravity theory arising from deep underwater ocean experiments. We consider the geophysical experiment of Zumberge et al. of 1991 for searching deviations of Newton's inverse square law in ocean. In the context of non-minimally coupled curvature-matter theory of gravity the results of Zumberge et al. can be used to obtain an upper bound both on the strength $α$ and range $λ$ of the Yukawa potential arising from the non-relativistic limit of the non-minimally coupled theory. The existence of an upper bound on $λ$ is related to the presence of an extra force, specific of the nonminimally coupled theory, which depends on $λ$ and on the gradient of mass density, and has an effect in the ocean because of compressibility of seawater.
These results can be achieved after a suitable treatment of the conversion of pressure to depth in the ocean by resorting to the equation of state of seawater and taking into account the effect of the extra force on hydrostatic equilibrium. If the sole Yukawa interaction were present the experiment would yield only a bound on $α$, while, in the presence of the extra force we find an upper bound on the range: $λ_{\rm max}= 57.4$ km. In the interval $1 \,{\rm m}<λ<λ_{\rm max}$ the upper bound on $α$ is consistent with the constraint $α<0.002$ found in Zumberge et al.
△ Less
Submitted 18 November, 2019; v1 submitted 29 April, 2019;
originally announced April 2019.
-
VVV-WIT-07: another Boyajian's star or a Mamajek's object?
Authors:
Roberto K. Saito,
Dante Minniti,
Valentin D. Ivanov,
Márcio Catelan,
Felipe Gran,
Raymundo Baptista,
Rodolfo Angeloni,
Claudio Caceres,
Juan Carlos Beamin
Abstract:
We report the discovery of VVV-WIT-07, an unique and intriguing variable source presenting a sequence of recurrent dips with a likely deep eclipse in July 2012. The object was found serendipitously in the near-IR data obtained by the VISTA Variables in the Vía Láctea (VVV) ESO Public Survey. Our analysis is based on VVV variability, multicolor, and proper motion (PM) data. Complementary data from…
▽ More
We report the discovery of VVV-WIT-07, an unique and intriguing variable source presenting a sequence of recurrent dips with a likely deep eclipse in July 2012. The object was found serendipitously in the near-IR data obtained by the VISTA Variables in the Vía Láctea (VVV) ESO Public Survey. Our analysis is based on VVV variability, multicolor, and proper motion (PM) data. Complementary data from the VVV eXtended survey (VVVX) as well as archive data and spectroscopic follow-up observations aided in the analysis and interpretation of VVV-WIT-07. A search for periodicity in the VVV Ks-band light curve of VVV-WIT-07 results in two tentative periods at P~322 days and P~170 days. Colors and PM are consistent either with a reddened MS star or a pre-MS star in the foreground disk. The near-IR spectra of VVV-WIT-07 appear featureless, having no prominent lines in emission or absorption. Features found in the light curve of VVV-WIT-07 are similar to those seen in J1407 (Mamajek's object), a pre-MS K5 dwarf with a ring system eclipsing the star or, alternatively, to KIC 8462852 (Boyajian's star), an F3 IV/V star showing irregular and aperiodic dips in its light curve. Alternative scenarios, none of which is fully consistent with the available data, are also briefly discussed, including a young stellar object, a T Tauri star surrounded by clumpy dust structure, a main sequence star eclipsed by a nearby extended object, a self-eclipsing R CrB variable star, and even a long-period, high-inclination X-ray binary.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
Bayesian Optimization of Combinatorial Structures
Authors:
Ricardo Baptista,
Matthias Poloczek
Abstract:
The optimization of expensive-to-evaluate black-box functions over combinatorial structures is an ubiquitous task in machine learning, engineering and the natural sciences. The combinatorial explosion of the search space and costly evaluations pose challenges for current techniques in discrete optimization and machine learning, and critically require new algorithmic ideas. This article proposes, t…
▽ More
The optimization of expensive-to-evaluate black-box functions over combinatorial structures is an ubiquitous task in machine learning, engineering and the natural sciences. The combinatorial explosion of the search space and costly evaluations pose challenges for current techniques in discrete optimization and machine learning, and critically require new algorithmic ideas. This article proposes, to the best of our knowledge, the first algorithm to overcome these challenges, based on an adaptive, scalable model that identifies useful combinatorial structure even when data is scarce. Our acquisition function pioneers the use of semidefinite programming to achieve efficiency and scalability. Experimental evaluations demonstrate that this algorithm consistently outperforms other methods from combinatorial and Bayesian optimization.
△ Less
Submitted 10 October, 2018; v1 submitted 22 June, 2018;
originally announced June 2018.
-
Map** the accretion disc of the short period eclipsing binary SDSS J0926+3624
Authors:
Wagner Schlindwein,
Raymundo Baptista
Abstract:
We report the analysis of time-series of optical photometry of SDSS J0926+3624 collected with the Liverpool Robotic Telescope between 2012 February and March while the object was in quiescence. We combined our median eclipse timing with those in the literature to revise the ephemeris and confirm that the binary period is increasing at a rate $\dot{P}=(3.2 \pm 0.4)\times 10^{-13} \, s/s$. The light…
▽ More
We report the analysis of time-series of optical photometry of SDSS J0926+3624 collected with the Liverpool Robotic Telescope between 2012 February and March while the object was in quiescence. We combined our median eclipse timing with those in the literature to revise the ephemeris and confirm that the binary period is increasing at a rate $\dot{P}=(3.2 \pm 0.4)\times 10^{-13} \, s/s$. The light curves show no evidence of either the orbital hump produced by a bright spot at disc rim or of superhumps; the average out-of-eclipse brightness level is consistently lower than previously reported. The eclipse map from the average light curve shows a hot white dwarf surrounded by a faint, cool accretion disc plus enhanced emission along the gas stream trajectory beyond the impact point at the outer disc rim, suggesting the occurrence of gas stream overflow/penetration at that epoch. We estimate a disc mass input rate of $\dot{M}=(9 \pm 1)\times 10^{-12}\,M_\odot \,yr^{-1}$, more than an order of magnitude lower than that expected from binary evolution with conservative mass transfer.
△ Less
Submitted 14 May, 2018; v1 submitted 18 April, 2018;
originally announced April 2018.
-
Infrared photometry of the dwarf nova V2051 Ophiuchi: I - The mass donor star and the distance
Authors:
Eduardo Wojcikiewicz,
Raymundo Baptista,
Tiago Ribeiro
Abstract:
We report the analysis of time-series of infrared $JHK_s$ photometry of the dwarf nova V2051 Oph in quiescence. We modelled the ellipsoidal variations caused by the distorted mass-donor star to infer its $JHK_s$ fluxes. From its infrared colors we estimate a spectral type of $M(8.0\pm 1.5)$ and an equivalent blackbody temperature of $T_\mathrm{BB}=(2700\pm270)\,K$. We used the Barnes & Evans relat…
▽ More
We report the analysis of time-series of infrared $JHK_s$ photometry of the dwarf nova V2051 Oph in quiescence. We modelled the ellipsoidal variations caused by the distorted mass-donor star to infer its $JHK_s$ fluxes. From its infrared colors we estimate a spectral type of $M(8.0\pm 1.5)$ and an equivalent blackbody temperature of $T_\mathrm{BB}=(2700\pm270)\,K$. We used the Barnes & Evans relation to infer a photometric parallax distance of $d_\mathrm{BE}=(102\pm16)$ pc to the binary. At this short distance, the corresponding accretion disc temperatures in outburst are too low to be explained by the disc-instability model for dwarf nova outbursts, underscoring a previous suggestion that the outbursts of this binary are powered by mass-transfer bursts.
△ Less
Submitted 19 January, 2018; v1 submitted 4 January, 2018;
originally announced January 2018.
-
Beyond normality: Learning sparse probabilistic graphical models in the non-Gaussian setting
Authors:
Rebecca E. Morrison,
Ricardo Baptista,
Youssef Marzouk
Abstract:
We present an algorithm to identify sparse dependence structure in continuous and non-Gaussian probability distributions, given a corresponding set of data. The conditional independence structure of an arbitrary distribution can be represented as an undirected graph (or Markov random field), but most algorithms for learning this structure are restricted to the discrete or Gaussian cases. Our new a…
▽ More
We present an algorithm to identify sparse dependence structure in continuous and non-Gaussian probability distributions, given a corresponding set of data. The conditional independence structure of an arbitrary distribution can be represented as an undirected graph (or Markov random field), but most algorithms for learning this structure are restricted to the discrete or Gaussian cases. Our new approach allows for more realistic and accurate descriptions of the distribution in question, and in turn better estimates of its sparse Markov structure. Sparsity in the graph is of interest as it can accelerate inference, improve sampling methods, and reveal important dependencies between variables. The algorithm relies on exploiting the connection between the sparsity of the graph and the sparsity of transport maps, which deterministically couple one probability measure to another.
△ Less
Submitted 6 November, 2017; v1 submitted 2 November, 2017;
originally announced November 2017.
-
SOAR observations of the high-viscosity accretion disc of the dwarf nova V4140 Sagitarii in quiescence and in outburst
Authors:
Raymundo Baptista,
Bernardo W. Borges,
Alexandre S. Oliveira
Abstract:
We report the analysis of 22 B-band light curves of the dwarf nova V4140 Sgr obtained with SOI/SOAR during two nights along the decline of a superoutburst in 2006 Sep 12-24 and in quiescence over 50 days following the superoutburst. Three-dimensional eclipse map** of the outburst light curves indicates that the accretion disc is elliptical (eccentricity e=0.13) and that superhump maximum occurs…
▽ More
We report the analysis of 22 B-band light curves of the dwarf nova V4140 Sgr obtained with SOI/SOAR during two nights along the decline of a superoutburst in 2006 Sep 12-24 and in quiescence over 50 days following the superoutburst. Three-dimensional eclipse map** of the outburst light curves indicates that the accretion disc is elliptical (eccentricity e=0.13) and that superhump maximum occurs when the mass donor star is aligned with the bulge of the elliptical disc. The accretion disc is geometrically thin both in outburst and in quiescence; it fills the primary Roche lobe in outburst and shrinks to about half this size in quiescence. The stability of the eclipse shape, width and depth along quiescence and the derived disc surface brightness distribution indicate that the quiescent accretion disc is in a high-viscosity, steady-state. Flickering map** of the quiescent data reveal that the low-frequency flickering arises from an azimuthally-extended stream-disc impact region at disc rim and from the innermost disc region, whereas the high-frequency flickering originates in the accretion disc. Assuming the disc-related flickering to be caused by fluctuations in the energy dissipation rate induced by magneto-hydrodynamic turbulence (Gertseema & Achterberg 1992), we find that the quiescent disc viscosity parameter is large (alpha ~ 0.2-0.4) at all radii. The high-viscosity quiescent disc and the inferred low disc temperatures in superoutburst are inconsistent with expectations of the disc-instability model, and lead to the conclusion that the outbursts of V4140 Sgr are powered by mass transfer bursts from its donor star.
△ Less
Submitted 12 September, 2016;
originally announced September 2016.
-
Eclipse Map**: Astrotomography of Accretion Discs
Authors:
Raymundo Baptista
Abstract:
The Eclipse Map** Method is an indirect imaging technique that transforms the shape of the eclipse light curve into a map of the surface brightness distribution of the occulted regions. Three decades of application of this technique to the investigation of the structure, the spectrum and the time evolution of accretion discs around white dwarfs in cataclysmic variables have enriched our understa…
▽ More
The Eclipse Map** Method is an indirect imaging technique that transforms the shape of the eclipse light curve into a map of the surface brightness distribution of the occulted regions. Three decades of application of this technique to the investigation of the structure, the spectrum and the time evolution of accretion discs around white dwarfs in cataclysmic variables have enriched our understanding of these accretion devices with a wealth of details such as (but not limited to) moving heating/cooling waves during outbursts in dwarf novae, tidally-induced spiral shocks of emitting gas with sub-Keplerian velocities, elliptical precessing discs associated to superhumps, and measurements of the radial run of the disc viscosity through the map** of the disc flickering sources. This chapter reviews the principles of the method, discusses its performance, limitations, useful error propagation procedures, as well as highlights a selection of applications aimed at showing the possible scientific problems that have been and may be addresses with it.
△ Less
Submitted 12 August, 2015;
originally announced August 2015.
-
Two-loop finiteness of self-energies in higher-derivative SQED3
Authors:
E. A. Gallegos,
R. Baptista
Abstract:
In the superfield formalism, two higher-derivative kinetic operators (Lee-Wick operators) are implemented into the standard three dimensional supersymmetric quantum electrodynamics for improving its ultraviolet behavior. It is shown in particular that the ghosts associated with these Lee-Wick operators allow to remove all ultraviolet divergences in the scalar and gauge self-energies at two-loop le…
▽ More
In the superfield formalism, two higher-derivative kinetic operators (Lee-Wick operators) are implemented into the standard three dimensional supersymmetric quantum electrodynamics for improving its ultraviolet behavior. It is shown in particular that the ghosts associated with these Lee-Wick operators allow to remove all ultraviolet divergences in the scalar and gauge self-energies at two-loop level.
△ Less
Submitted 22 August, 2013;
originally announced August 2013.
-
Accretion and activity on the post-common-envelope binary RR~Cae
Authors:
T. Ribeiro,
R. Baptista,
S. Kafka,
P. Dufour,
A. Gianninas,
G. Fontaine
Abstract:
Current scenarios for the evolution of interacting close binaries - such as cataclysmic variables (CVs) - rely mainly on our understanding of low-mass star angular momentum loss (AML) mechanisms. The coupling of stellar wind with its magnetic field, i.e., magnetic braking, is the most promising mechanism to drive AML in these stars. There are basically two properties driving magnetic braking: the…
▽ More
Current scenarios for the evolution of interacting close binaries - such as cataclysmic variables (CVs) - rely mainly on our understanding of low-mass star angular momentum loss (AML) mechanisms. The coupling of stellar wind with its magnetic field, i.e., magnetic braking, is the most promising mechanism to drive AML in these stars. There are basically two properties driving magnetic braking: the stellar magnetic field and the stellar wind. Understanding the mechanisms that drive AML therefore requires a comprehensive understanding of these two properties. RRCae is a well-known nearby (d=20pc) eclipsing DA+M binary with an orbital period of P=7.29h. The system harbors a metal-rich cool white dwarf (WD) and a highly active M-dwarf locked in synchronous rotation. The metallicity of the WD suggests that wind accretion is taking place, which provides a good opportunity to obtain the mass-loss rate of the M-dwarf component. We analyzed multi-epoch time-resolved high-resolution spectra of RRCae in search for traces of magnetic activity and accretion. We selected a number of well-known activity indicators and studied their short and long-term behavior. Indirect-imaging tomographic techniques were also applied to provide the surface brightness distribution of the magnetically active M-dwarf, and reveals a polar feature similar to those observed in fast-rotating solar-type stars. The blue part of the spectrum was modeled using a atmosphere model to constrain the WD properties and its metal enrichment. The latter was used to improve the determination of the mass-accretion rate from the M-dwarf wind. The presence of metals in the WD spectrum suggests that this component arises from accretion of the M-dwarf wind. A model fit to the WD gives Teff=(7260+/-250)K and logg=(7.8+/-0.1) dex with a metallicity of <log[X/Xsun]>=(-2.8+/-0.1)dex, and a mass-accretion rate of dotMacc=(7+/-2)x1e-16Msun/yr.
△ Less
Submitted 22 July, 2013;
originally announced July 2013.
-
Eclipse map** the flickering sources in the dwarf nova HT Cassiopeia
Authors:
R. Baptista,
B. Borges,
V. Kolokotronis,
O. Giannakis,
C. J. Papadimitriou
Abstract:
We report results of the eclipse map** analysis of an ensemble of light curves of HT Cas. The fast response of the white dwarf to the increase in mass transfer rate, the expansion rate of the accretion disc at the same time, and the relative amplitude of the high-frequency flickering indicate that the quiescent disc of HT Has has high viscosity, alpha ~ 0.3-0.7. This is in marked disagreement wi…
▽ More
We report results of the eclipse map** analysis of an ensemble of light curves of HT Cas. The fast response of the white dwarf to the increase in mass transfer rate, the expansion rate of the accretion disc at the same time, and the relative amplitude of the high-frequency flickering indicate that the quiescent disc of HT Has has high viscosity, alpha ~ 0.3-0.7. This is in marked disagreement with the disc-instability model and implies that the outbursts of HT Cas are caused by bursts of enhanced mass-transfer rate from its donor star.
△ Less
Submitted 6 May, 2011;
originally announced May 2011.
-
The stunted outbursts of UU Aquarii are likely mass-transfer events
Authors:
R. Baptista,
A. Bortoletto,
R. K. Honeycutt
Abstract:
We report a time-lapse eclipse map** analysis of B-band time-series of the nova-like variable UU Aqr along a typical stunted outburst in 2002 August. Disc asymmetries rotating in the prograde sense in the eclipse maps are interpreted as a precessing elliptical disc with enhanced emission at periastron. From the disc expansion velocity a disc viscosity alpha_{hot}= 0.2 is inferred. The outburst s…
▽ More
We report a time-lapse eclipse map** analysis of B-band time-series of the nova-like variable UU Aqr along a typical stunted outburst in 2002 August. Disc asymmetries rotating in the prograde sense in the eclipse maps are interpreted as a precessing elliptical disc with enhanced emission at periastron. From the disc expansion velocity a disc viscosity alpha_{hot}= 0.2 is inferred. The outburst starts with a 10-fold increase in uneclipsed light, probably arising in an enhanced disc wind; the disc response is delayed by 2 d. The results are inconsistent with the disc instability model and suggest that the stunted outburst of UU Aqr are the response of its viscous accretion disc to enhanced mass-transfer events.
△ Less
Submitted 6 May, 2011;
originally announced May 2011.
-
Near-Infrared SOAR Photometric Observations of Post Common Envelope Binaries
Authors:
Tiago Ribeiro,
Raymundo Baptista
Abstract:
{From a number of today known Post Common Envelopes Binaries (PCEB) only a handful has yet been observed at near-infrared (NIR) wavelengths and an even smaller number has modeled NIR light curves. At shorter wavelengths one has access to the cooler and larger components of these systems and has the chance to detect emission from its faint and heavily irradiated atmospheres. } {By modeling NIR ligh…
▽ More
{From a number of today known Post Common Envelopes Binaries (PCEB) only a handful has yet been observed at near-infrared (NIR) wavelengths and an even smaller number has modeled NIR light curves. At shorter wavelengths one has access to the cooler and larger components of these systems and has the chance to detect emission from its faint and heavily irradiated atmospheres. } {By modeling NIR light curves of PCEBs we intent to constrain their system parameters and study the properties of the system components.} {Here we present simultaneous NIR $JHK_s$ light curves of two PCEBs obtained with the $4m$ SOAR telescope.} {%For this work we have selected 3 systems from a previously selected sample of 8 PCEBs. KV Vel and TW Crv are long period (P$_{\rm orb} =$ 8.6h and 7.9h, respectively) PCEBs with large irradiation effects. The results of light curve fitting provided solutions with inclination $i = (47\pm5)^\circ$, mass ratio $q = 0.3\pm0.1$ and radius of the secondary $R_2/a = 0.24^{+0.05}_{-0.03}$ (where $a$ is the orbital separation) for KV Vel, and $i = (42\pm9)\degr$, $q = 0.28\pm0.04$ and $R_2/a = 0.22\pm0.01$ for TW Crv, respectively. For KV Vel, we obtain an average value for the albedo of the secondary star of $α= 0.43$, consistent in the $J$, $H$ and $K_s$-bands. For TW Crv, on the other hand, we obtain values of $α_{J} = (0.4\pm0.1)$ and $α_{H} = (0.3\pm0.1)$ for the $J$- and $H$-bands, respectively.
△ Less
Submitted 26 November, 2010;
originally announced November 2010.
-
Spectral Map** of the Intermediate Polar DQ Herculis
Authors:
Roberto K. Saito,
Raymundo Baptista,
Keith Horne,
Phillip Martell
Abstract:
We report an eclipse map** study of the intermediate polar DQ Her based on time-resolved optical spectroscopy (~3800-5000A) covering 4 eclipses. Eclipse maps of the HeII 4686 line indicate that an azimuthally-and vertically-extended bright spot at disk rim is important source of reprocessing of x-rays from the magnetic poles. The disk spectrum is flat with no Balmer or Helium lines in the inner…
▽ More
We report an eclipse map** study of the intermediate polar DQ Her based on time-resolved optical spectroscopy (~3800-5000A) covering 4 eclipses. Eclipse maps of the HeII 4686 line indicate that an azimuthally-and vertically-extended bright spot at disk rim is important source of reprocessing of x-rays from the magnetic poles. The disk spectrum is flat with no Balmer or Helium lines in the inner regions, and shows double-peaked emission lines in the intermediate and outer disk regions while the slope of the continuum becomes progressively redder with increasing radius. The inferred disk temperatures are in the range T~13500-5000K and can be reasonably well described by a steady-state disk with mass accretion rate of dM/dt=(2.7+/-1.0)x10^-9 Msun/yr. A comparison of the radial intensity distribution for the Balmer lines reveals a linear correlation between the slope of the distribution and the transition energy. The spectrum of the uneclipsed light is dominated by Balmer and HeI lines in emission with narrow absorption cores. The observed narrow and redshifted CaII 3934 absorption line in the total light spectra plus the inverse P-Cygni profiles of the Balmer and HeII 4686 emission lines in spectra of the asymmetric component indicate radial inflow of gas in the innermost disk regions and are best explained in terms of magnetically-controlled accretion inside the white dwarf magnetosphere. We infer projected radial inflow velocities of ~200-500km/s, significantly lower than both the rotational and the free-fall velocities for the corresponding range of radii. A combined net emission HeII plus Hbeta low-velocity eclipse map reveals a twisted dipole emitting pattern near disk center. This is interpreted as being the projection of accretion curtains onto the orbital plane at two specific spin phases, as a consequence of the selection in velocity provided by the spectral eclipse map**.
△ Less
Submitted 10 May, 2010;
originally announced May 2010.
-
Activity on the M star of QS Vir
Authors:
T. Ribeiro,
S. Kafka,
R. Baptista,
C. Tappert
Abstract:
We report analysis of VRIJH photometry, and phase-resolved optical spectroscopy of the eclipsing DA white dwarf plus dMe dwarf binary QS Vir. Modeling of the photometric data yields an inclination of $i = 74.9\pm0.6$ and a mass ratio of $q = M_2/M_1 = 0.50\pm0.05$. Our Doppler maps indicate the presence of material in the Roche lobe of the white dwarf, at a location near the M star, likely due t…
▽ More
We report analysis of VRIJH photometry, and phase-resolved optical spectroscopy of the eclipsing DA white dwarf plus dMe dwarf binary QS Vir. Modeling of the photometric data yields an inclination of $i = 74.9\pm0.6$ and a mass ratio of $q = M_2/M_1 = 0.50\pm0.05$. Our Doppler maps indicate the presence of material in the Roche lobe of the white dwarf, at a location near the M star, likely due to accretion from the stellar wind of the M star (as opposed to Roche-lobe overflow accretion). We also constructed images of the brightness distribution of the M star at different epochs which reveal the location of two stable active regions. Doppler tomography shows that the majority of the Hydrogen and Ca II H&K emission originates on the active M dwarf, likely distributed in two preferred activity longitudes, similar to active regions on BY Dra and FK Comae systems.
△ Less
Submitted 4 December, 2009;
originally announced December 2009.
-
New Complexities in the Low-State line profiles of AM Herculis
Authors:
S. Kafka,
T. Ribeiro,
R. Baptista,
R. K. Honeycutt,
J. W. Robertson
Abstract:
When accretion temporarily ceases in the polar AM Her, the emission line profiles are known to develop several distinct components, whose origin remains poorly understood. The new low-state spectra reported here have a more favorable combination of spectral resolution (R~4500), time resolution (~3-min exposures), and S/N than earlier work, revealing additional details of the orbital dependence o…
▽ More
When accretion temporarily ceases in the polar AM Her, the emission line profiles are known to develop several distinct components, whose origin remains poorly understood. The new low-state spectra reported here have a more favorable combination of spectral resolution (R~4500), time resolution (~3-min exposures), and S/N than earlier work, revealing additional details of the orbital dependence of the line profiles. The central strong feature of H-alpha is found to be composed of two components of similar strength, one having K~100 km/sec and phased with the motion of the secondary star, the other having little or no detectable radial velocity variations. We attribute the central line component to gas near the coupling region, perhaps with a contribution from irradiation of the secondary star. The two satellite components have RV offsets of ~+/-250 km/sec on either side of the central strong H-alpha peak. These satellites most likely arise in large loops of magnetically confined gas near the secondary star due to magnetic activity on the donor star and/or interactions of the magnetic fields of the two stars. Doppler maps show that these two satellite features have concentrations at velocities that match the velocity locations of L4 and L5 in the system.
△ Less
Submitted 14 October, 2008;
originally announced October 2008.
-
A two-armed pattern in flickering maps of the nova-like variable UU Aquarii
Authors:
Raymundo Baptista,
Alexandre Bortoletto
Abstract:
We report the analysis of a uniform sample of 31 light curves of the nova-like variable UU Aqr with eclipse map** techniques. The data were combined to derive eclipse maps of the average steady-light component, the long-term brightness changes, and low- and high-frequency flickering components. The long-term variability responsible for the 'low' and 'high' brightness states is explained in ter…
▽ More
We report the analysis of a uniform sample of 31 light curves of the nova-like variable UU Aqr with eclipse map** techniques. The data were combined to derive eclipse maps of the average steady-light component, the long-term brightness changes, and low- and high-frequency flickering components. The long-term variability responsible for the 'low' and 'high' brightness states is explained in terms of the response of a viscous disk to changes of 20-50 per cent in the mass transfer rate from the donor star. Low- and high-frequency flickering maps are dominated by emission from two asymmetric arcs reminiscent of those seen in the outbursting dwarf nova IP Peg, and are similarly interpreted as manifestation of a tidally-induced spiral shock wave in the outer regions of a large accretion disk. The asymmetric arcs are also seen in the map of the steady-light aside of the broad brightness distribution of a roughly steady-state disk. The arcs account for 25 per cent of the steady-light flux and are a long-lasting feature in the accretion disk of UU Aqr. We infer an opening angle of 10+/-3 degrees for the spiral arcs. The results suggest that the flickering in UU Aqr is caused by turbulence generated after the collision of disk gas with the density-enhanced spiral wave in the accretion disk.
△ Less
Submitted 12 December, 2007;
originally announced December 2007.
-
Cyclical period changes in HT Cas: a clear difference between systems above and below the period gap
Authors:
B. W. Borges,
R. Baptista,
C. Papadimitriou,
O. Giannakis
Abstract:
We report the identification of cyclical changes in the orbital period of the eclipsing cataclysmic variable HT Cas. We measured new white dwarf mid-eclipse timings and combined with published measurements to construct an observed-minus-calculated diagram covering 29 years of observations. The data present cyclical variations that can be fitted by a linear plus sinusoidal function with period 36…
▽ More
We report the identification of cyclical changes in the orbital period of the eclipsing cataclysmic variable HT Cas. We measured new white dwarf mid-eclipse timings and combined with published measurements to construct an observed-minus-calculated diagram covering 29 years of observations. The data present cyclical variations that can be fitted by a linear plus sinusoidal function with period 36 yr and semi-amplitude ~ 40 s. The statistical significance of this period by an F-test is larger than 99.9 per cent. We combine our results with those in the literature to revisit the issue of cyclical period changes in cataclysmic variables and their interpretation in terms of a solar-type magnetic activity cycle in the secondary star. A diagram of fractional period change (Delta P/P) versus the angular velocity of the active star (Omega) for cataclysmic variables, RS CVn, W UMa and Algols reveal that close binaries with periods above the gap (secondaries with convective envelopes) satisfy a relationship Delta P/P \propto Omega^{-0.7+/-0.1}. Cataclysmic variables below the period gap (with fully convective secondaries) deviate from this relationship by more than 3-sigma, with average fractional period changes ~ 5 times smaller than those of the systems above the gap.
△ Less
Submitted 22 November, 2007;
originally announced November 2007.
-
A multicolor near-infrared study of the dwarf nova IP Peg
Authors:
T. Ribeiro,
R. Baptista,
E. T. Harlaftis,
V. S. Dhillon,
R. G. M. Rutten
Abstract:
We report the analysis of $JHK_{s}$ light curves of the eclipsing dwarf nova IP Peg in quiescence. The light curves are dominated by the ellipsoidal variation of the mass-donor star, with additional contributions from the accretion disc and anisotropic emission from the bright spot. A secondary eclipse is visible in the $J$ and $H$ light curves, with 2% and 3% of the flux disappearing at minimum…
▽ More
We report the analysis of $JHK_{s}$ light curves of the eclipsing dwarf nova IP Peg in quiescence. The light curves are dominated by the ellipsoidal variation of the mass-donor star, with additional contributions from the accretion disc and anisotropic emission from the bright spot. A secondary eclipse is visible in the $J$ and $H$ light curves, with 2% and 3% of the flux disappearing at minimum light, respectively. We modeled the observed ellipsoidal variation of the secondary star (including possible illumination effects on its inner face) to find a mass ratio of $q = 0.42$ and an inclination of $i = 84.5^{o} $, consistent in the three bands within the uncertainties. Illumination effects are negligible. The secondary is responsible for 83%, 84% and 88% of the flux in $J$, $H$ and $K_{s}$, respectively. We fitted a black body spectrum to the $JHK_{s}$ fluxes of the secondary star to find a temperature of $T_{bb} = 3100\pm500 K$ and a distance of $d=115\pm30$ pc to the system. We subtracted the contribution of the secondary star and applied 3-D eclipse map** techniques to the resulting light curves to map the surface brightness of a disc with half-opening angle $α$ and a circular rim at the radius of the bright spot. The eclipse maps show enhanced emission along the stream trajectory ahead of the bright spot position, providing evidence of gas stream overflow. The inferred radial brightness-temperature distribution in the disc is flat for $R < 0.3R_{L1}$ with temperatures $\simeq 3500K$ and colors consistent with those of cool opaque radiators.
△ Less
Submitted 22 August, 2007;
originally announced August 2007.
-
A study of the evolution of the accretion disk of V2051 Oph through two outburst cycles
Authors:
R. Baptista,
R. F. Santos,
M. Faundez-Abans,
A. Bortoletto
Abstract:
We follow the changes in the structure of the accretion disk of the dwarf nova V2051 Oph along two separate outbursts in order to investigate the causes of its recurrent outbursts. We apply eclipse map** techniques to a set of light curves covering a normal (July 2000) and a low-amplitude (August 2002) outburst to derive maps of the disk surface brightness distribution at different phases alon…
▽ More
We follow the changes in the structure of the accretion disk of the dwarf nova V2051 Oph along two separate outbursts in order to investigate the causes of its recurrent outbursts. We apply eclipse map** techniques to a set of light curves covering a normal (July 2000) and a low-amplitude (August 2002) outburst to derive maps of the disk surface brightness distribution at different phases along the outburst cycles. The sequence of eclipse maps of the 2000 July outburst reveal that the disk shrinks at outburst onset while an uneclipsed component of 13 per cent of the total light develops. The derived radial intensity distributions suggest the presence of an outward-moving heating wave during rise and of an inward-moving cooling wave during decline. The inferred speed of the outward-moving heating wave is ~ 1.6 km/s, while the speed of the cooling wave is a fraction of that. A comparison of the measured cooling wave velocity on consecutive nights indicates that the cooling wave accelerates as it travels towards disk center, in contradiction with the prediction of the disk instability model. From the inferred speed of the heating wave we derive a viscosity parameter alpha_{hot} ~ 0.13, comparable to the measured viscosity parameter in quiescence. The 2002 August outburst had lower amplitude (ΔB ~ 0.8 mag) and the disk at outburst maximum was smaller than on 2000 July. For an assumed distance of 92 pc, we find that along both outbursts the disk brightness temperatures remain below the minimum expected according to the disk instability model. The results suggest that the outbursts of V2051 Oph are caused by bursts of increased mass transfer from the mass-donor star.
△ Less
Submitted 16 May, 2007;
originally announced May 2007.