Search | arXiv e-print repository

arXiv:2406.06488 [pdf, other]

Computationally efficient permutation tests for the multivariate two-sample problem based on energy distance or maximum mean discrepancy statistics

Authors: Elias Chaibub Neto

Abstract: Non-parametric two-sample tests based on energy distance or maximum mean discrepancy are widely used statistical tests for comparing multivariate data from two populations. While these tests enjoy desirable statistical properties, their test statistics can be expensive to compute as they require the computation of 3 distinct Euclidean distance (or kernel) matrices between samples, where the time c… ▽ More Non-parametric two-sample tests based on energy distance or maximum mean discrepancy are widely used statistical tests for comparing multivariate data from two populations. While these tests enjoy desirable statistical properties, their test statistics can be expensive to compute as they require the computation of 3 distinct Euclidean distance (or kernel) matrices between samples, where the time complexity of each of these computations (namely, $O(n_{x}^2 p)$, $O(n_{y}^2 p)$, and $O(n_{x} n_{y} p)$) scales quadratically with the number of samples ($n_x$, $n_y$) and linearly with the number of variables ($p$). Since the standard permutation test requires repeated re-computations of these expensive statistics it's application to large datasets can become unfeasible. While several statistical approaches have been proposed to mitigate this issue, they all sacrifice desirable statistical properties to decrease the computational cost (e.g., trade computation speed by a decrease in statistical power). A better computational strategy is to first pre-compute the Euclidean distance (kernel) matrix of the concatenated data, and then permute indexes and retrieve the corresponding elements to compute the re-sampled statistics. While this strategy can reduce the computation cost relative to the standard permutation test, it relies on the computation of a larger Euclidean distance (kernel) matrix with complexity $O((n_x + n_y)^2 p)$. In this paper, we present a novel computationally efficient permutation algorithm which only requires the pre-computation of the 3 smaller matrices and achieves large computational speedups without sacrificing finite-sample validity or statistical power. We illustrate its computational gains in a series of experiments and compare its statistical power to the current state-of-the-art approach for balancing computational cost and statistical performance. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 22 pages, 3 figures

arXiv:2311.06422 [pdf, other]

Statistical disclosure control for numeric microdata via sequential joint probability preserving data shuffling

Authors: Elias Chaibub Neto

Abstract: Traditional perturbative statistical disclosure control (SDC) approaches such as microaggregation, noise addition, rank swap**, etc, perturb the data in an ``ad-hoc" way in the sense that while they manage to preserve some particular aspects of the data, they end up modifying others. Synthetic data approaches based on the fully conditional specification data synthesis paradigm, on the other hand… ▽ More Traditional perturbative statistical disclosure control (SDC) approaches such as microaggregation, noise addition, rank swap**, etc, perturb the data in an ``ad-hoc" way in the sense that while they manage to preserve some particular aspects of the data, they end up modifying others. Synthetic data approaches based on the fully conditional specification data synthesis paradigm, on the other hand, aim to generate new datasets that follow the same joint probability distribution as the original data. These synthetic data approaches, however, rely either on parametric statistical models, or non-parametric machine learning models, which need to fit well the original data in order to generate credible and useful synthetic data. Another important drawback is that they tend to perform better when the variables are synthesized in the correct causal order (i.e., in the same order as the true data generating process), which is often unknown in practice. To circumvent these issues, we propose a fully non-parametric and model free perturbative SDC approach that approximates the joint distribution of the original data via sequential applications of restricted permutations to the numerical microdata (where the restricted permutations are guided by the joint distribution of a discretized version of the data). Empirical comparisons against popular SDC approaches, using both real and simulated datasets, suggest that the proposed approach is competitive in terms of the trade-off between confidentiality and data utility. △ Less

Submitted 10 November, 2023; originally announced November 2023.

Comments: 25 page, 12 figures

arXiv:2310.08988 [pdf]

Reroute Prediction Service

Authors: Ítalo Romani de Oliveira, Samet Ayhan, Michael Biglin, Pablo Costas, Euclides C. Pinto Neto

Abstract: The cost of delays was estimated as 33 billion US dollars only in 2019 for the US National Airspace System, a peak value following a growth trend in past years. Aiming to address this huge inefficiency, we designed and developed a novel Data Analytics and Machine Learning system, which aims at reducing delays by proactively supporting re-routing decisions. Given a time interval up to a few days… ▽ More The cost of delays was estimated as 33 billion US dollars only in 2019 for the US National Airspace System, a peak value following a growth trend in past years. Aiming to address this huge inefficiency, we designed and developed a novel Data Analytics and Machine Learning system, which aims at reducing delays by proactively supporting re-routing decisions. Given a time interval up to a few days in the future, the system predicts if a reroute advisory for a certain Air Route Traffic Control Center or for a certain advisory identifier will be issued, which may impact the pertinent routes. To deliver such predictions, the system uses historical reroute data, collected from the System Wide Information Management (SWIM) data services provided by the FAA, and weather data, provided by the US National Centers for Environmental Prediction (NCEP). The data is huge in volume, and has many items streamed at high velocity, uncorrelated and noisy. The system continuously processes the incoming raw data and makes it available for the next step where an interim data store is created and adaptively maintained for efficient query processing. The resulting data is fed into an array of ML algorithms, which compete for higher accuracy. The best performing algorithm is used in the final prediction, generating the final results. Mean accuracy values higher than 90% were obtained in our experiments with this system. Our algorithm divides the area of interest in units of aggregation and uses temporal series of the aggregate measures of weather forecast parameters in each geographical unit, in order to detect correlations with reroutes and where they will most likely occur. Aiming at practical application, the system is formed by a number of microservices, which are deployed in the cloud, making the system distributed, scalable and highly available. △ Less

Submitted 13 October, 2023; originally announced October 2023.

Comments: Submitted to the 2023 IEEE/AIAA Digital Aviation Systems Conference (DASC)

arXiv:2307.14883 [pdf]

Stochastic Flight Plan Optimization

Authors: Ítalo Romani de Oliveira, Steve Altus, Sergey Tiourine, Euclides C. Pinto Neto, Alexandre Leite, Felipe C. F. de Azevedo

Abstract: Airline operations are subject to many uncertainties, such as weather, varying demand, maintenance events, congestion, etc. Large amounts of information are currently ignored due to difficulties in processing big data sets. We explore the use of ensemble weather forecast, which presents several distinct weather predictions for the same time horizon. So far, ensemble forecasts have been very little… ▽ More Airline operations are subject to many uncertainties, such as weather, varying demand, maintenance events, congestion, etc. Large amounts of information are currently ignored due to difficulties in processing big data sets. We explore the use of ensemble weather forecast, which presents several distinct weather predictions for the same time horizon. So far, ensemble forecasts have been very little exploited for flight planning purposes. Currently, airlines already carry out lots of statistical analyses on past data, and devise effective policies for how much fuel and payload an aircraft should carry and how much of time buffer should be used in the schedule. But these buffers can be further reduced by doing forward-looking stochastic optimization. The use of ensemble forecast allows to select a trajectory that optimizes the expected outcome of a flight for an array of scenarios, instead of optimizing for a single one. Besides, aircraft payload is another considerable source of uncertainty. We tested stochastic optimization, first with the objective of optimizing single flights, then with the objective of optimizing whole schedules. In one of the experiments, it was observed that, in 55.8% of the cases, stochastic optimization outperforms conventional optimization in terms of fuel consumption; in only 0.4% of the cases, conventional optimization wins; and, in the remaining 43.8% of the cases, they achieve equal results. The experiments with stochastic payload demonstrated that the use of payload uncertainty can squeeze a bit more fuel savings from the flight plan outcomes. But the use of this technology is not driven only by reducing overall fuel consumption. One optimization criterion can be the minimization of diversions or fuel emergencies, that is, choosing the candidate that minimizes the maximum fuel consumption (minimax). △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: Preprint to the 2023 IEEE/AIAA Digital Aviation Systems Conference, October 1-5, 2023, Barcelona, Spain

arXiv:2305.07846 [pdf, other]

A note on bounded distance-based information loss metrics for statistical disclosure control of numeric microdata

Authors: Elias Chaibub Neto

Abstract: In the field of statistical disclosure control, the tradeoff between data confidentiality and data utility is measured by comparing disclosure risk and information loss metrics. Distance based metrics such as the mean absolute error (MAE), mean squared error (MSE), mean variation (IL1), and its scaled alternative (IL1s) are popular information loss measures for numerical microdata. However, the fa… ▽ More In the field of statistical disclosure control, the tradeoff between data confidentiality and data utility is measured by comparing disclosure risk and information loss metrics. Distance based metrics such as the mean absolute error (MAE), mean squared error (MSE), mean variation (IL1), and its scaled alternative (IL1s) are popular information loss measures for numerical microdata. However, the fact that these measures are unbounded makes it is difficult to compare them against disclosure risk measures which are usually bounded between 0 and 1. In this note, we propose rank-based versions of the MAE and MSE metrics that are bounded in the same range as the disclosure risk metrics. We empirically compare the proposed bounded metrics against the distance-based metrics in a series of experiments where the metrics are evaluated over multiple masked datasets, generated by the application of increasing amounts of perturbation (e.g., by adding increasing amounts of noise). Our results show that the proposed bounded metrics produce similar rankings as the traditional ones (as measured by Spearman correlation), suggesting that they are a viable additions to the toolbox of distance-based information loss metrics currently in use in the SDC literature. △ Less

Submitted 13 May, 2023; originally announced May 2023.

arXiv:2211.15330 [pdf, other]

UAS in the Airspace: A Review on Integration, Simulation, Optimization, and Open Challenges

Authors: Euclides Carlos Pinto Neto, Derick Moreira Baum, Jorge Rady de Almeida Jr., Joao Batista Camargo Jr., Paulo Sergio Cugnasca

Abstract: Air transportation is essential for society, and it is increasing gradually due to its importance. To improve the airspace operation, new technologies are under development, such as Unmanned Aircraft Systems (UAS). In fact, in the past few years, there has been a growth in UAS numbers in segregated airspace. However, there is an interest in integrating these aircraft into the National Airspace Sys… ▽ More Air transportation is essential for society, and it is increasing gradually due to its importance. To improve the airspace operation, new technologies are under development, such as Unmanned Aircraft Systems (UAS). In fact, in the past few years, there has been a growth in UAS numbers in segregated airspace. However, there is an interest in integrating these aircraft into the National Airspace System (NAS). The UAS is vital to different industries due to its advantages brought to the airspace (e.g., efficiency). Conversely, the relationship between UAS and Air Traffic Control (ATC) needs to be well-defined due to the impacts on ATC capacity these aircraft may present. Throughout the years, this impact may be lower than it is nowadays because the current lack of familiarity in this relationship contributes to higher workload levels. Thereupon, the primary goal of this research is to present a comprehensive review of the advancements in the integration of UAS in the National Airspace System (NAS) from different perspectives. We consider the challenges regarding simulation, final approach, and optimization of problems related to the interoperability of such systems in the airspace. Finally, we identify several open challenges in the field based on the existing state-of-the-art proposals. △ Less

Submitted 24 November, 2022; originally announced November 2022.

arXiv:2208.09312 [pdf]

Blockchain-based traffic management for Advanced Air Mobility

Authors: I. Romani de Oliveira, T. Matsumoto, E. C. Pinto Neto

Abstract: The large public interest in Advanced Air Mobility (AAM) will soon lead to congested skies overhead cities, analogously to what happened with other transportation means, including commercial aviation. In the latter case, the combination of large distances and demanded number flights is such that a system with centralized control, with most of the decisions made by human operators, is safe. However… ▽ More The large public interest in Advanced Air Mobility (AAM) will soon lead to congested skies overhead cities, analogously to what happened with other transportation means, including commercial aviation. In the latter case, the combination of large distances and demanded number flights is such that a system with centralized control, with most of the decisions made by human operators, is safe. However, for AAM, it is expected a much higher demand, because it will be used for people's daily commutes. Thus, higher automation levels will become a requirement for coordinating this traffic, which might not be effectively managed by humans. The establishment of fixed air routes can abate complexity, however at the cost of limiting capacity and decreasing efficiency. Another alternative is the use of a powerful central system based on Artificial Intelligence (AI), which would allow flexible trajectories and higher efficiency. However, such system would require concentrated investment, could contain Single-Points-of-Failure (SPoFs), would be a highly sought target of malicious attacks, and would be subject to periods of unavailability. This work proposes a new technology that solves the problem of managing the high complexity of the AAM traffic with a secure distributed approach, without the need for a proprietary centralized automation system. This technology enables distributed airspace allocation management and conflict resolution by means of trusted shared data structures and associated smart contracts running on a blockchain ecosystem. This way, it greatly reduces the risk of system outages due to SPoFs, by allowing peer-to-peer conflict resolution, and being more resilient to failures in the ground communication infrastructure. Furthermore, it provides priority-based balancing mechanisms that help to regulate fairness among participants in the utilization of the airspace. △ Less

Submitted 19 August, 2022; originally announced August 2022.

Comments: Submitted to the XIX Air Transportation Symposium, Oct. 25-27, Sao Jose dos Campos, Brazil

arXiv:2108.11329 [pdf]

Decentralized Air Traffic Management for Advanced Air Mobility

Authors: Ítalo Romani de Oliveira, Euclides Carlos Pinto Neto, Thiago Toshio Matsumoto, Huafeng Yu

Abstract: Leading proposals for Advanced Air Mobility (AAM) recognize the need for strategic and tactical airspace deconfliction, where the need for coordination appears in different forms and nuances. One recurring element is the use of pre-defined airways or corridors, a natural way to create order, with proven success from the conventional, manned, Air Traffic Management (ATM). But, while ATM is evolving… ▽ More Leading proposals for Advanced Air Mobility (AAM) recognize the need for strategic and tactical airspace deconfliction, where the need for coordination appears in different forms and nuances. One recurring element is the use of pre-defined airways or corridors, a natural way to create order, with proven success from the conventional, manned, Air Traffic Management (ATM). But, while ATM is evolving to 4D Trajectory-Based Operations, when we apply the ATM principles to very dense and potentially more complex operations such as AAM, we have to consider their limitations in such demanding new environment. The requirement of following circulation corridors most often increase flight distance and inevitably create bottlenecks, hence we explore the hypothesis of not using corridors, testing such option via a simple and scalable simulation model. Other motivations for comparing different forms of traffic coordination are redundancy and diversity, which have potential to increase system safety. Relying on a single method to maintain traffic separation of course would not be allowed in practice. However, the concepts that we have seen so far leave a gap between two very distinct and co-existing methods: one, cooperative and centered on a ground-based Provider of Services for UAM (PSU), and another, which is mostly non-cooperative and independent, centered on the individual aircraft, commonly referred to as Detect-And-Avoid (DAA). This duality achieves a welcomed diversity, however presents several points for improvement. In-between these opposite methods, this paper analyzes the performance of an airborne cooperative method to coordinate traffic which is capable of safely solving conflicts of multiple aircraft (more than two) and achieve higher efficiency than DAA alone, thus with potential for being an alternative or a live fallback for ground-based traffic coordination. △ Less

Submitted 24 August, 2021; originally announced August 2021.

Comments: Accepted for the 2021 Integrated Communications, Navigation and Surveillance (2012) Conference, 5-7 April, 2021

arXiv:2108.11316 [pdf]

Comparing the Performance of Traffic Coordination Methods for Advanced Aerial Mobility

Authors: Ítalo Romani de Oliveira, Euclides C. Pinto Neto, Thiago Matsumoto, Huafeng Yu, Emiliano Bartolomé, Guillermo Frontera, Aaron Mayne

Abstract: Traffic Management in Advanced Aerial Mobility (AAM) inherits many elements of conventional Air Traffic Management (ATM), but brings new complexities and challenges of its own. One of its ways of guaranteeing separation is the use of airborne, stand-alone Detect-And-Avoid, an operational concept where each aircraft decides its avoidance maneuvers independently, observing right-of-way rules and, in… ▽ More Traffic Management in Advanced Aerial Mobility (AAM) inherits many elements of conventional Air Traffic Management (ATM), but brings new complexities and challenges of its own. One of its ways of guaranteeing separation is the use of airborne, stand-alone Detect-And-Avoid, an operational concept where each aircraft decides its avoidance maneuvers independently, observing right-of-way rules and, in specific implementations, some form of pairwise coordination. This is a fundamental safety element for autonomous aircraft but, according to our research, is not sufficient for high-density airspaces as envisioned for urban environments. In these environments, some way of explicit and strategic traffic coordination must be in place, as done for conventional ATM. For efficiency reasons, ATM is evolving to more flexible uses of the airspace, such that the use of dynamically allocated corridors is a rising concept for AAM. These strategic forms of traffic coordination are potentially highly efficient if the aircraft adhere to their trajectory contracts and there are no significant perturbations to the traffic. However, if significant perturbations occur, such as loss of data communication, or the sudden appearance of an intruder, a centralized system may not react appropriately in due time. In busy scenarios, even small deviations from plans may compound so rapidly as to result in large differences in the overall achieved scenario, resulting in congestions and convoluted conflicts. Therefore, it is worth studying traffic coordination techniques that work locally with shorter look-ahead times. To that end, we explore an airborne collaborative method for traffic coordination, which is capable of safely solving conflicts with multiple aircraft, stressing its capabilities throughout a large number of scenarios and comparing its performance with established methods. △ Less

Submitted 26 August, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

Comments: Accepted for the 40th Digital Aviation Systems Conference (DASC), October 3-7, 2021, San Antonio, TX. (copyright by IEEE)

arXiv:2011.04605 [pdf, other]

Causality-aware counterfactual confounding adjustment as an alternative to linear residualization in anticausal prediction tasks based on linear learners

Authors: Elias Chaibub Neto

Abstract: Linear residualization is a common practice for confounding adjustment in machine learning (ML) applications. Recently, causality-aware predictive modeling has been proposed as an alternative causality-inspired approach for adjusting for confounders. The basic idea is to simulate counterfactual data that is free from the spurious associations generated by the observed confounders. In this paper, w… ▽ More Linear residualization is a common practice for confounding adjustment in machine learning (ML) applications. Recently, causality-aware predictive modeling has been proposed as an alternative causality-inspired approach for adjusting for confounders. The basic idea is to simulate counterfactual data that is free from the spurious associations generated by the observed confounders. In this paper, we compare the linear residualization approach against the causality-aware confounding adjustment in anticausal prediction tasks, and show that the causality-aware approach tends to (asymptotically) outperform the residualization adjustment in terms of predictive performance in linear learners. Importantly, our results still holds even when the true model is not linear. We illustrate our results in both regression and classification tasks, where we compared the causality-aware and residualization approaches using mean squared errors and classification accuracy in synthetic data experiments where the linear regression model is mispecified, as well as, when the linear model is correctly specified. Furthermore, we illustrate how the causality-aware approach is more stable than residualization with respect to dataset shifts in the joint distribution of the confounders and outcome variables. △ Less

Submitted 9 November, 2020; originally announced November 2020.

Comments: This paper draws some material from arXiv:2001.03998

arXiv:2011.04128 [pdf, other]

Stable predictions for health related anticausal prediction tasks affected by selection biases: the need to deconfound the test set features

Authors: Elias Chaibub Neto, Phil Snyder, Solveig K Sieberts, Larsson Omberg

Abstract: In health related machine learning applications, the training data often corresponds to a non-representative sample from the target populations where the learners will be deployed. In anticausal prediction tasks, selection biases often make the associations between confounders and the outcome variable unstable across different target environments. As a consequence, the predictions from confounded… ▽ More In health related machine learning applications, the training data often corresponds to a non-representative sample from the target populations where the learners will be deployed. In anticausal prediction tasks, selection biases often make the associations between confounders and the outcome variable unstable across different target environments. As a consequence, the predictions from confounded learners are often unstable, and might fail to generalize in shifted test environments. Stable prediction approaches aim to solve this problem by producing predictions that are stable across unknown test environments. These approaches, however, are sometimes applied to the training data alone with the hope that training an unconfounded model will be enough to generate stable predictions in shifted test sets. Here, we show that this is insufficient, and that improved stability can be achieved by deconfounding the test set features as well. We illustrate these observations using both synthetic data and real world data from a mobile health study. △ Less

Submitted 8 November, 2020; originally announced November 2020.

Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract. This workshop paper draws some material from arXiv:2001.03998 and arXiv:2004.09466

arXiv:2004.09466 [pdf, other]

Causality-aware counterfactual confounding adjustment for feature representations learned by deep models

Authors: Elias Chaibub Neto

Abstract: Causal modeling has been recognized as a potential solution to many challenging problems in machine learning (ML). Here, we describe how a recently proposed counterfactual approach developed to deconfound linear structural causal models can still be used to deconfound the feature representations learned by deep neural network (DNN) models. The key insight is that by training an accurate DNN using… ▽ More Causal modeling has been recognized as a potential solution to many challenging problems in machine learning (ML). Here, we describe how a recently proposed counterfactual approach developed to deconfound linear structural causal models can still be used to deconfound the feature representations learned by deep neural network (DNN) models. The key insight is that by training an accurate DNN using softmax activation at the classification layer, and then adopting the representation learned by the last layer prior to the output layer as our features, we have that, by construction, the learned features will fit well a (multi-class) logistic regression model, and will be linearly associated with the labels. As a consequence, deconfounding approaches based on simple linear models can be used to deconfound the feature representations learned by DNNs. We validate the proposed methodology using colored versions of the MNIST dataset. Our results illustrate how the approach can effectively combat confounding and improve model stability in the context of dataset shifts generated by selection biases. △ Less

Submitted 19 November, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

arXiv:2001.03998 [pdf, other]

Towards causality-aware predictions in static anticausal machine learning tasks: the linear structural causal model case

Authors: Elias Chaibub Neto

Abstract: We propose a counterfactual approach to train ``causality-aware" predictive models that are able to leverage causal information in static anticausal machine learning tasks (i.e., prediction tasks where the outcome influences the features). In applications plagued by confounding, the approach can be used to generate predictions that are free from the influence of observed confounders. In applicatio… ▽ More We propose a counterfactual approach to train ``causality-aware" predictive models that are able to leverage causal information in static anticausal machine learning tasks (i.e., prediction tasks where the outcome influences the features). In applications plagued by confounding, the approach can be used to generate predictions that are free from the influence of observed confounders. In applications involving observed mediators, the approach can be used to generate predictions that only capture the direct or the indirect causal influences. Mechanistically, we train supervised learners on (counterfactually) simulated features which retain only the associations generated by the causal relations of interest. We focus on linear models, where analytical results connecting covariances, causal effects, and prediction mean squared errors are readily available. Quite importantly, we show that our approach does not require knowledge of the full causal graph. It suffices to know which variables represent potential confounders and/or mediators. We discuss the stability of the method with respect to dataset shifts generated by selection biases and validate the approach using synthetic data experiments. △ Less

Submitted 30 November, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

Comments: Causal Discovery & Causality-Inspired Machine Learning Workshop at Neural Information Processing Systems 2020. (Contains some common material with arXiv:2011.04128.)

arXiv:1911.05139 [pdf, other]

Causality-based tests to detect the influence of confounders on mobile health diagnostic applications: a comparison with restricted permutations

Authors: Elias Chaibub Neto, Meghasyam Tummalacherla, Lara Mangravite, Larsson Omberg

Abstract: Machine learning practice is often impacted by confounders. Confounding can be particularly severe in remote digital health studies where the participants self-select to enter the study. While many different confounding adjustment approaches have been proposed in the literature, most of these methods rely on modeling assumptions, and it is unclear how robust they are to violations of these assumpt… ▽ More Machine learning practice is often impacted by confounders. Confounding can be particularly severe in remote digital health studies where the participants self-select to enter the study. While many different confounding adjustment approaches have been proposed in the literature, most of these methods rely on modeling assumptions, and it is unclear how robust they are to violations of these assumptions. This realization has recently motivated the development of restricted permutation methods to quantify the influence of observed confounders on the predictive performance of a machine learning models and evaluate if confounding adjustment methods are working as expected. In this paper we show, nonetheless, that restricted permutations can generate biased estimates of the contribution of the confounders to the predictive performance of a learner, and we propose an alternative approach to tackle this problem. By viewing a classification task from a causality perspective, we are able to leverage conditional independence tests between predictions and test set labels and confounders in order to detect confounding on the predictive performance of a classifier. We illustrate the application of our causality-based approach to data collected from mHealth study in Parkinson's disease. △ Less

Submitted 12 November, 2019; originally announced November 2019.

Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

arXiv:1910.01165 [pdf]

Indicators of retention in remote digital health studies: A cross-study evaluation of 100,000 participants

Authors: Abhishek Pratap, Elias Chaibub Neto, Phil Snyder, Carl Stepnowsky, Noémie Elhadad, Daniel Grant, Matthew H. Mohebbi, Sean Mooney, Christine Suver, John Wilbanks, Lara Mangravite, Patrick Heagerty, Pat Arean, Larsson Omberg

Abstract: Digital technologies such as smartphones are transforming the way scientists conduct biomedical research using real-world data. Several remotely-conducted studies have recruited thousands of participants over a span of a few months. Unfortunately, these studies are hampered by substantial participant attrition, calling into question the representativeness of the collected data including generaliza… ▽ More Digital technologies such as smartphones are transforming the way scientists conduct biomedical research using real-world data. Several remotely-conducted studies have recruited thousands of participants over a span of a few months. Unfortunately, these studies are hampered by substantial participant attrition, calling into question the representativeness of the collected data including generalizability of findings from these studies. We report the challenges in retention and recruitment in eight remote digital health studies comprising over 100,000 participants who participated for more than 850,000 days, completing close to 3.5 million remote health evaluations. Survival modeling surfaced several factors significantly associated(P < 1e-16) with increase in median retention time i) Clinician referral(increase of 40 days), ii) Effect of compensation (22 days), iii) Clinical conditions of interest to the study (7 days) and iv) Older adults(4 days). Additionally, four distinct patterns of daily app usage behavior that were also associated(P < 1e-10) with participant demographics were identified. Most studies were not able to recruit a representative sample, either demographically or regionally. Combined together these findings can help inform recruitment and retention strategies to enable equitable participation of populations in future digital health research. △ Less

Submitted 2 October, 2019; originally announced October 2019.

arXiv:1908.08651 [pdf]

Trajectory-Based Urban Air Mobility (UAM) Operations Simulator (TUS)

Authors: Euclides C. Pinto Neto, Derick M. Baum, Jorge Rady de Almeida Junior, João Batista Camargo Junior, Paulo Sérgio Cugnasca

Abstract: Nowadays, the demand for optimized services in urban environments to provide better society wellness is increasing. In this sense, ground transportation in dense urban environments has been facing challenges for many years (e.g., congestion and resilience). One import outcome of the effort made toward the creation of new concepts for enhancing urban transportation is the Urban Air Mobility (UAM) c… ▽ More Nowadays, the demand for optimized services in urban environments to provide better society wellness is increasing. In this sense, ground transportation in dense urban environments has been facing challenges for many years (e.g., congestion and resilience). One import outcome of the effort made toward the creation of new concepts for enhancing urban transportation is the Urban Air Mobility (UAM) concept. UAM aims at enhancing city transportation services using manned and unmanned vehicles. However, these operations bring many challenges to be faced, e.g., the interaction between the controller agent and autonomous vehicles. Furthermore, trajectory planning is not a simple task due to several factors. Firstly, the trajectories must consider a reduced minimum separation as eVTOL vehicle are expected to operate in complex urban environments. This leads the trajectory planning process to observe safety primitives more restrictively once the airspace is expected to comport many vehicles that follow small minimum separation standards. Thereupon, the main goal of the Trajectory-Based UAM Operations Simulator (TUS) is to simulate the Trajectory-Based UAM operations in urban environments considering the presence of both manned and unmanned eVTOL vehicles. For this, a Discrete Event Simulation (DES) approach is adopted, which considers an input (i.e., the eVTOL vehicles, their origin and destination, and their respective trajectories) and produces an output (which describes if the trajectories are safe and the elapsed operation time). The main contribution of this simulation tool is to provide a simulated environment for testing and measuring the effectiveness (e.g., flight duration) of trajectories planned for eVTOL vehicles. △ Less

Submitted 22 August, 2019; originally announced August 2019.

arXiv:1811.11920 [pdf, other]

Using permutations to assess confounding in machine learning applications for digital health

Authors: Elias Chaibub Neto, Abhishek Pratap, Thanneer M Perumal, Meghasyam Tummalacherla, Brian M Bot, Lara Mangravite, Larsson Omberg

Abstract: Clinical machine learning applications are often plagued with confounders that can impact the generalizability and predictive performance of the learners. Confounding is especially problematic in remote digital health studies where the participants self-select to enter the study, thereby making it challenging to balance the demographic characteristics of participants. One effective approach to com… ▽ More Clinical machine learning applications are often plagued with confounders that can impact the generalizability and predictive performance of the learners. Confounding is especially problematic in remote digital health studies where the participants self-select to enter the study, thereby making it challenging to balance the demographic characteristics of participants. One effective approach to combat confounding is to match samples with respect to the confounding variables in order to balance the data. This procedure, however, leads to smaller datasets and hence impact the inferences drawn from the learners. Alternatively, confounding adjustment methods that make more efficient use of the data (e.g., inverse probability weighting) usually rely on modeling assumptions, and it is unclear how robust these methods are to violations of these assumptions. Here, rather than proposing a new approach to control for confounding, we develop novel permutation based statistical methods to detect and quantify the influence of observed confounders, and estimate the unconfounded performance of the learner. Our tools can be used to evaluate the effectiveness of existing confounding adjustment methods. We illustrate their application using real-life data from a Parkinson's disease mobile health study collected in an uncontrolled environment. △ Less

Submitted 28 November, 2018; originally announced November 2018.

Comments: This workshop article draws some material from arXiv:1805.07465. Main text and Supplement. Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

Report number: ML4H/2018/193

arXiv:1805.07465 [pdf, other]

Using permutations to detect, quantify and correct for confounding in machine learning predictions

Authors: Elias Chaibub Neto

Abstract: Clinical machine learning applications are often plagued with confounders that are clinically irrelevant, but can still artificially boost the predictive performance of the algorithms. Confounding is especially problematic in mobile health studies run "in the wild", where it is challenging to balance the demographic characteristics of participants that self select to enter the study. An effective… ▽ More Clinical machine learning applications are often plagued with confounders that are clinically irrelevant, but can still artificially boost the predictive performance of the algorithms. Confounding is especially problematic in mobile health studies run "in the wild", where it is challenging to balance the demographic characteristics of participants that self select to enter the study. An effective approach to remove the influence of confounders is to match samples in order to improve the balance in the data. The caveat is that we end-up with a smaller number of participants to train and evaluate the machine learning algorithm. Alternative confounding adjustment methods that make more efficient use of the data (e.g., inverse probability weighting) usually rely on modeling assumptions, and it is unclear how robust these methods are to violations of these assumptions. Here, rather than proposing a new approach to prevent/reduce the learning of confounding signals by a machine learning algorithm, we develop novel statistical tools to detect, quantify and correct for the influence of observed confounders. Our tools are based on restricted and standard permutation approaches and can be used to evaluate how well a confounding adjustment method is actually working. We use restricted permutations to test if an algorithm has learned disease signal in the presence of confounding signal, and to develop a novel statistical test to detect confounding learning per se. Furthermore, we prove that restricted permutations provide an alternative method to compute partial correlations, and use this result as a motivation to develop a novel approach to estimate the corrected predictive performance of a learner. We evaluate the statistical properties of our methods in simulation studies. △ Less

Submitted 27 November, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

arXiv:1802.07714 [pdf, other]

Detecting Learning vs Memorization in Deep Neural Networks using Shared Structure Validation Sets

Authors: Elias Chaibub Neto

Abstract: The roles played by learning and memorization represent an important topic in deep learning research. Recent work on this subject has shown that the optimization behavior of DNNs trained on shuffled labels is qualitatively different from DNNs trained with real labels. Here, we propose a novel permutation approach that can differentiate memorization from learning in deep neural networks (DNNs) trai… ▽ More The roles played by learning and memorization represent an important topic in deep learning research. Recent work on this subject has shown that the optimization behavior of DNNs trained on shuffled labels is qualitatively different from DNNs trained with real labels. Here, we propose a novel permutation approach that can differentiate memorization from learning in deep neural networks (DNNs) trained as usual (i.e., using the real labels to guide the learning, rather than shuffled labels). The evaluation of weather the DNN has learned and/or memorized, happens in a separate step where we compare the predictive performance of a shallow classifier trained with the features learned by the DNN, against multiple instances of the same classifier, trained on the same input, but using shuffled labels as outputs. By evaluating these shallow classifiers in validation sets that share structure with the training set, we are able to tell apart learning from memorization. Application of our permutation approach to multi-layer perceptrons and convolutional neural networks trained on image data corroborated many findings from other groups. Most importantly, our illustrations also uncovered interesting dynamic patterns about how DNNs memorize over increasing numbers of training epochs, and support the surprising result that DNNs are still able to learn, rather than only memorize, when trained with pure Gaussian noise as input. △ Less

Submitted 21 February, 2018; originally announced February 2018.

arXiv:1712.03120 [pdf, other]

Learning Disease vs Participant Signatures: a permutation test approach to detect identity confounding in machine learning diagnostic applications

Authors: Elias Chaibub Neto, Abhishek Pratap, Thanneer M Perumal, Meghasyam Tummalacherla, Brian M Bot, Andrew D Trister, Stephen H Friend, Lara Mangravite, Larsson Omberg

Abstract: Recently, Saeb et al (2017) showed that, in diagnostic machine learning applications, having data of each subject randomly assigned to both training and test sets (record-wise data split) can lead to massive underestimation of the cross-validation prediction error, due to the presence of "subject identity confounding" caused by the classifier's ability to identify subjects, instead of recognizing… ▽ More Recently, Saeb et al (2017) showed that, in diagnostic machine learning applications, having data of each subject randomly assigned to both training and test sets (record-wise data split) can lead to massive underestimation of the cross-validation prediction error, due to the presence of "subject identity confounding" caused by the classifier's ability to identify subjects, instead of recognizing disease. To solve this problem, the authors recommended the random assignment of the data of each subject to either the training or the test set (subject-wise data split). The adoption of subject-wise split has been criticized in Little et al (2017), on the basis that it can violate assumptions required by cross-validation to consistently estimate generalization error. In particular, adopting subject-wise splitting in heterogeneous data-sets might lead to model under-fitting and larger classification errors. Hence, Little et al argue that perhaps the overestimation of prediction errors with subject-wise cross-validation, rather than underestimation with record-wise cross-validation, is the reason for the discrepancies between prediction error estimates generated by the two splitting strategies. In order to shed light on this controversy, we focus on simpler classification performance metrics and develop permutation tests that can detect identity confounding. By focusing on permutation tests, we are able to evaluate the merits of record-wise and subject-wise data splits under more general statistical dependencies and distributional structures of the data, including situations where cross-validation breaks down. We illustrate the application of our tests using synthetic and real data from a Parkinson's disease study. △ Less

Submitted 6 July, 2018; v1 submitted 8 December, 2017; originally announced December 2017.

arXiv:1706.09574 [pdf, other]

On the analysis of personalized medication response and classification of case vs control patients in mobile health studies: the mPower case study

Authors: Elias Chaibub Neto, Thanneer M Perumal, Abhishek Pratap, Brian M Bot, Lara Mangravite, Larsson Omberg

Abstract: In this work we provide a couple of contributions to the analysis of longitudinal data collected by smartphones in mobile health applications. First, we propose a novel statistical approach to disentangle personalized treatment and "time-of-the-day" effects in observational studies. Under the assumption of no unmeasured confounders, we show how to use conditional independence relations in the data… ▽ More In this work we provide a couple of contributions to the analysis of longitudinal data collected by smartphones in mobile health applications. First, we propose a novel statistical approach to disentangle personalized treatment and "time-of-the-day" effects in observational studies. Under the assumption of no unmeasured confounders, we show how to use conditional independence relations in the data in order to determine if a difference in performance between activity tasks performed before and after the participant has taken medication, are potentially due to an effect of the medication or to a "time-of-the-day" effect (or still to both). Second, we show that smartphone data collected from a given study participant can represent a "digital fingerprint" of the participant, and that classifiers of case/control labels, constructed using longitudinal data, can show artificially improved performance when data from each participant is included in both training and test sets. We illustrate our contributions using data collected during the first 6 months of the mPower study. △ Less

Submitted 29 June, 2017; originally announced June 2017.

Comments: 27 pages, 12 figures

arXiv:1607.00091 [pdf, ps, other]

Reducing overfitting in challenge-based competitions

Authors: Elias Chaibub Neto, Bruce R Hoff, Chris Bare, Brian M Bot, Thomas Yu, Lara Magravite, Andrew D Trister, Thea Norman, Pablo Meyer, Julio Saez-Rodrigues, James C Costello, Justin Guinney, Gustavo Stolovitzky

Abstract: Over-fitting is a dreaded foe in challenge-based competitions. Because participants rely on public leaderboards to evaluate and refine their models, there is always the danger they might over-fit to the holdout data supporting the leaderboard. The recently published Ladder algorithm aims to address this problem by preventing the participants from exploiting willingly or inadvertently minor fluctua… ▽ More Over-fitting is a dreaded foe in challenge-based competitions. Because participants rely on public leaderboards to evaluate and refine their models, there is always the danger they might over-fit to the holdout data supporting the leaderboard. The recently published Ladder algorithm aims to address this problem by preventing the participants from exploiting willingly or inadvertently minor fluctuations in public leaderboard scores during model refinement. In this paper, we report a vulnerability of the Ladder that induces severe over-fitting of the leaderboard when the sample size is small. To circumvent this attack, we propose a variation of the Ladder that releases a bootstrapped estimate of the public leaderboard score instead of providing participants with a direct measure of performance. We also extend the scope of the Ladder to arbitrary performance metrics by relying on a more broadly applicable testing procedure based on the Bayesian bootstrap. Our method makes it possible to use a leaderboard, with the technical and social advantages that it provides, even in cases where data is scant. △ Less

Submitted 30 June, 2016; originally announced July 2016.

arXiv:1606.04896 [pdf, ps, other]

Using instrumental variables to disentangle treatment and placebo effects in blinded and unblinded randomized clinical trials influenced by unmeasured confounders

Authors: Elias Chaibub Neto

Abstract: Clinical trials traditionally employ blinding as a design mechanism to reduce the influence of placebo effects. In practice, however, it can be difficult or impossible to blind study participants and unblinded trials are common in medical research. Here we show how instrumental variables can be used to quantify and disentangle treatment and placebo effects in randomized clinical trials comparing c… ▽ More Clinical trials traditionally employ blinding as a design mechanism to reduce the influence of placebo effects. In practice, however, it can be difficult or impossible to blind study participants and unblinded trials are common in medical research. Here we show how instrumental variables can be used to quantify and disentangle treatment and placebo effects in randomized clinical trials comparing control and active treatments in the presence of confounders. The key idea is to use randomization to separately manipulate treatment assignment and psychological encouragement messages that increase the participants' desire for improved symptoms. The proposed approach is able to improve the estimation of treatment effects in blinded studies and, most importantly, opens the doors to account for placebo effects in unblinded trials. △ Less

Submitted 21 June, 2016; v1 submitted 15 June, 2016; originally announced June 2016.

Comments: 29 pages, 13 figures. Version 2 includes randomization confidence intervals and more references

arXiv:1604.01055 [pdf, ps, other]

Towards personalized causal inference of medication response in mobile health: an instrumental variable approach for randomized trials with imperfect compliance

Authors: Elias Chaibub Neto, Ross L Prentice, Brian M Bot, Mike Kellen, Stephen H Friend, Andrew D Trister, Larsson Omberg, Lara Mangravite

Abstract: Mobile health studies can leverage longitudinal sensor data from smartphones to guide the application of personalized medical interventions. In this paper, we propose that adoption of an instrumental variable approach for randomized trials with imperfect compliance provides a natural framework for personalized causal inference of medication response in mobile health studies. Randomized treatment s… ▽ More Mobile health studies can leverage longitudinal sensor data from smartphones to guide the application of personalized medical interventions. In this paper, we propose that adoption of an instrumental variable approach for randomized trials with imperfect compliance provides a natural framework for personalized causal inference of medication response in mobile health studies. Randomized treatment suggestions can be easily delivered to the study participants via electronic messages pop** up on the smart-phone screen. Under quite general assumptions we can identify the causal effect of the actual treatment on the response in the presence of unobserved confounders. We implement a personalized randomization test of the null hypothesis of no causal effect of treatment on response, and evaluate its performance in a large scale simulation study encompassing data generated from linear and non-linear time series models under several simulation conditions. In particular, we evaluate the empirical power of the proposed test under varying degrees of compliance between the suggested and actual treatment adopted by the participant. Our investigations provide encouraging results in terms of power and control of type I error rates. Finally, we compare the proposed instrumental variable approach to a simple intent-to-treat strategy, and develop randomization confidence intervals for the causal effects. △ Less

Submitted 31 July, 2017; v1 submitted 4 April, 2016; originally announced April 2016.

Comments: Main text, appendixes, and supplementary materials were re-organized

arXiv:1501.04709 [pdf]

doi 10.1038/srep16361

Identifying robust communities and multi-community nodes by combining top-down and bottom-up approaches to clustering

Authors: Chris Gaiteri, Mingming Chen, Boleslaw Szymanski, Konstantin Kuzmin, Jierui Xie, Changkyu Lee, Timothy Blanche, Elias Chaibub Neto, Su-Chun Huang, Thomas Grabowski, Tara Madhyastha, Vitalina Komashko

Abstract: Biological functions are carried out by groups of interacting molecules, cells or tissues, known as communities. Membership in these communities may overlap when biological components are involved in multiple functions. However, traditional clustering methods detect non-overlap** communities. These detected communities may also be unstable and difficult to replicate, because traditional methods… ▽ More Biological functions are carried out by groups of interacting molecules, cells or tissues, known as communities. Membership in these communities may overlap when biological components are involved in multiple functions. However, traditional clustering methods detect non-overlap** communities. These detected communities may also be unstable and difficult to replicate, because traditional methods are sensitive to noise and parameter settings. These aspects of traditional clustering methods limit our ability to detect biological communities, and therefore our ability to understand biological functions. To address these limitations and detect robust overlap** biological communities, we propose an unorthodox clustering method called SpeakEasy which identifies communities using top-down and bottom-up approaches simultaneously. Specifically, nodes join communities based on their local connections, as well as global information about the network structure. This method can quantify the stability of each community, automatically identify the number of communities, and quickly cluster networks with hundreds of thousands of nodes. SpeakEasy shows top performance on synthetic clustering benchmarks and accurately identifies meaningful biological communities in a range of datasets, including: gene microarrays, protein interactions, sorted cell populations, electrophysiology and fMRI brain imaging. △ Less

Submitted 25 February, 2015; v1 submitted 19 January, 2015; originally announced January 2015.

Journal ref: Scientific Reports 5, Article number: 16361 (2015)

arXiv:1412.1735 [pdf, ps, other]

Speeding up bootstrap computations: a vectorized implementation for statistics based on sample moments

Authors: E. Chaibub Neto

Abstract: In this note we propose a vectorized implementation of the non-parametric bootstrap for statistics based on sample moments. Basically, we adopt the multinomial sampling formulation of the non-parametric bootstrap, and compute bootstrap replications of sample moment statistics by simply weighting the observed data according to multinomial counts, instead of evaluating the statistic on a re-sampled… ▽ More In this note we propose a vectorized implementation of the non-parametric bootstrap for statistics based on sample moments. Basically, we adopt the multinomial sampling formulation of the non-parametric bootstrap, and compute bootstrap replications of sample moment statistics by simply weighting the observed data according to multinomial counts, instead of evaluating the statistic on a re-sampled version of the observed data. Using this formulation we can generate a matrix of bootstrap weights and compute the entire vector of bootstrap replications with a few matrix multiplications. Vectorization is particularly important for matrix-oriented programming languages such as R, where matrix/vector calculations tend to be faster than scalar operations implemented in a loop. We illustrate the gain in computational speed achieved by the vectorized implementation in real and simulated data sets, when bootstrap** Pearson's sample correlation coefficient. △ Less

Submitted 11 December, 2014; v1 submitted 4 December, 2014; originally announced December 2014.

Comments: 9 pages, 3 figures; changed the title

arXiv:1010.1402 [pdf, ps, other]

doi 10.1214/09-AOAS288

Causal graphical models in systems genetics: A unified framework for joint inference of causal network and genetic architecture for correlated phenotypes

Authors: Elias Chaibub Neto, Mark P. Keller, Alan D. Attie, Brian S. Yandell

Abstract: Causal inference approaches in systems genetics exploit quantitative trait loci (QTL) genotypes to infer causal relationships among phenotypes. The genetic architecture of each phenotype may be complex, and poorly estimated genetic architectures may compromise the inference of causal relationships among phenotypes. Existing methods assume QTLs are known or inferred without regard to the phenotype… ▽ More Causal inference approaches in systems genetics exploit quantitative trait loci (QTL) genotypes to infer causal relationships among phenotypes. The genetic architecture of each phenotype may be complex, and poorly estimated genetic architectures may compromise the inference of causal relationships among phenotypes. Existing methods assume QTLs are known or inferred without regard to the phenotype network structure. In this paper we develop a QTL-driven phenotype network method (QTLnet) to jointly infer a causal phenotype network and associated genetic architecture for sets of correlated phenotypes. Randomization of alleles during meiosis and the unidirectional influence of genotype on phenotype allow the inference of QTLs causal to phenotypes. Causal relationships among phenotypes can be inferred using these QTL nodes, enabling us to distinguish among phenotype networks that would otherwise be distribution equivalent. We jointly model phenotypes and QTLs using homogeneous conditional Gaussian regression models, and we derive a graphical criterion for distribution equivalence. We validate the QTLnet approach in a simulation study. Finally, we illustrate with simulated data and a real example how QTLnet can be used to infer both direct and indirect effects of QTLs and phenotypes that co-map to a genomic region. △ Less

Submitted 7 October, 2010; originally announced October 2010.

Comments: Published in at http://dx.doi.org/10.1214/09-AOAS288 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS288

Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 1, 320-339

arXiv:gr-qc/0606042 [pdf, ps, other]

Tidal fields on braneworlds

Authors: Edgard Casal de Rey Neto

Abstract: We write out the geodesic deviations that take place in a $d\geq 4$ dimensional brane world subspace of a higher dimensional spacetime by splitting out the brane and the extra space dynamical quantities from a global metric spacetime of dimension $D\geq 5$. The higher dimensional dynamical quantities are projected onto two orthogonal subspaces, where one of which is identified with a $(d-1)$-bra… ▽ More We write out the geodesic deviations that take place in a $d\geq 4$ dimensional brane world subspace of a higher dimensional spacetime by splitting out the brane and the extra space dynamical quantities from a global metric spacetime of dimension $D\geq 5$. The higher dimensional dynamical quantities are projected onto two orthogonal subspaces, where one of which is identified with a $(d-1)$-brane. This is done by using some technics of the conventional submanifold theory of the Riemannian geometry, applied to pseudo-Riemannian spaces. Using the splitting technic, we obtain the tidal field on $(d-1)$ branes with an arbitrary number of non compact extra dimensions. Later, we analise the geodesic deviations seen by an ordinary observer in a $d=4$ dimensional spacetime and show that deviations from general relativity tidal field due to the existence of the extra dimensions can appear because, (i) - the dependence of the indunced metric on the brane with the extra coordinates and (ii) - deviations of the higher dimensional spacetime metric from spherical symmetry. △ Less

Submitted 9 June, 2006; originally announced June 2006.

Comments: 12 pgs. no figures

arXiv:gr-qc/0511132 [pdf, ps, other]

Braneworld with Induced Axial Symmetry

Authors: Edgard Casal de Rey Neto

Abstract: We take arbitrary gravitational perturbations of a 5d spacetime and reduce it to the form an axially symmetric warped braneworld. Then, we write the filed equations for the linearized gravity perturbations. We obtain the equations that describes the graviton, gravivector and the graviscalar fluctuations and analyse the effects of the Schrödinger potentials that appear in these equations. We take arbitrary gravitational perturbations of a 5d spacetime and reduce it to the form an axially symmetric warped braneworld. Then, we write the filed equations for the linearized gravity perturbations. We obtain the equations that describes the graviton, gravivector and the graviscalar fluctuations and analyse the effects of the Schrödinger potentials that appear in these equations. △ Less

Submitted 5 December, 2005; v1 submitted 24 November, 2005; originally announced November 2005.

Comments: 3 pages, one figure, typos corrected, to apear in the special issue of Brazilian Journal of Physics dedicated to the conference 100 years of relativity, Sao Paulo, Brazil, 2005

Journal ref: Braz.J.Phys. 35 (2005) 1131-1132

arXiv:gr-qc/0311046 [pdf, ps, other]

doi 10.1088/0264-9381/21/5/023

Wave polarizations for a beam-like gravitational wave in quadratic curvature gravity

Authors: E. C. de Rey Neto, J. C. N. de Araujo, O. D. Aguiar

Abstract: We compute analytically the tidal field and polarizations of an exact gravitational wave generated by a cylindrical beam of null matter of finite width and length in quadratic curvature gravity. We propose that this wave can represent the gravitational wave that keep up with the high energy photons produced in a gamma ray burst (GRB) source. We compute analytically the tidal field and polarizations of an exact gravitational wave generated by a cylindrical beam of null matter of finite width and length in quadratic curvature gravity. We propose that this wave can represent the gravitational wave that keep up with the high energy photons produced in a gamma ray burst (GRB) source. △ Less

Submitted 20 November, 2003; v1 submitted 14 November, 2003; originally announced November 2003.

Comments: 5 pages, 3 figures, minor corrections, to appear in CQG

Journal ref: Class.Quant.Grav. 21 (2004) S541-S544

arXiv:gr-qc/0309128 [pdf, ps, other]

doi 10.1103/PhysRevD.68.124013

Geodesic deviation in pp-wave spacetimes of quadratic curvature gravity

Authors: Edgard C. de Rey Neto

Abstract: We write the equation of geodesic deviations in the spacetime of $pp$-waves in terms of the Newman-Penrose scalars and apply it to study gravitational waves in quadratic curvature gravity. We show that quadratic curvature gravity $pp$-waves can have a transverse helicity-0 polarization mode and two transverse helicity-2 general relativity-like wave polarizations. A concrete example is given in w… ▽ More We write the equation of geodesic deviations in the spacetime of $pp$-waves in terms of the Newman-Penrose scalars and apply it to study gravitational waves in quadratic curvature gravity. We show that quadratic curvature gravity $pp$-waves can have a transverse helicity-0 polarization mode and two transverse helicity-2 general relativity-like wave polarizations. A concrete example is given in which we analyze the wave polarizations of an exact impulsive gravitational wave solution to quadratic curvature gravity. △ Less

Submitted 26 September, 2003; originally announced September 2003.

Comments: 16 pages, no figures, accepted in Physical Review D

Journal ref: Phys.Rev. D68 (2003) 124013

arXiv:gr-qc/0304091 [pdf, ps, other]

doi 10.1088/0264-9381/20/11/305

A perturbative solution for gravitational waves in quadratic gravity

Authors: Edgard C. de Rey Neto, Odylio D. Aguiar, José C. N. de Araujo

Abstract: We find a gravitational wave solution to the linearized version of quadratic gravity by adding successive perturbations to the Einstein's linearized field equations. We show that only the Ricci squared quadratic invariant contributes to give a different solution of those found in Einstein's general relativity. The perturbative solution is written as a power series in the $β$ parameter, the coeff… ▽ More We find a gravitational wave solution to the linearized version of quadratic gravity by adding successive perturbations to the Einstein's linearized field equations. We show that only the Ricci squared quadratic invariant contributes to give a different solution of those found in Einstein's general relativity. The perturbative solution is written as a power series in the $β$ parameter, the coefficient of the Ricci squared term in the quadratic gravitational action. We also show that, for monochromatic waves of a given angular frequency $ω$, the perturbative solution can be summed out to give an exact solution to linearized version of quadratic gravity, for $0<ω<c/\midβ\mid^{1/2}$. This result may lead to implications to the predictions for gravitational wave backgrounds of cosmological origin. △ Less

Submitted 23 April, 2003; originally announced April 2003.

Comments: 9 pages, to appear in CQG

Journal ref: Class.Quant.Grav.20:2025-2031,2003

arXiv:gr-qc/0303105 [pdf, ps, other]

doi 10.1088/0264-9381/20/8/305

A gravitational shock wave generated by a beam of null matter in quadratic gravity

Authors: E. C. de Rey Neto, J. C. N. de Araujo, O. D. Aguiar

Abstract: In the present work we approximate an ultrarelativistic jet by a homogeneous beam of null matter with finite width. Then, we study the influence of this beam over the space-time metric in the framework of higher-derivative gravity. We find an exact shock wave solution of the quadratic gravity field equations and compare it with the solution to Einstein's gravity. We show that the effect of highe… ▽ More In the present work we approximate an ultrarelativistic jet by a homogeneous beam of null matter with finite width. Then, we study the influence of this beam over the space-time metric in the framework of higher-derivative gravity. We find an exact shock wave solution of the quadratic gravity field equations and compare it with the solution to Einstein's gravity. We show that the effect of higher-curvature gravity becomes negligible at large distances from the beam axis. We also observe that only the Ricci-squared term contribute to modify the Einstein's gravity prediction. Furthermore, we note that this higher-curvature term contribute to regularize the discontinuities associated to the solution to Einstein's general relativity. △ Less

Submitted 26 March, 2003; originally announced March 2003.

Comments: 11 pages, 3 figures

Journal ref: Class.Quant.Grav. 20 (2003) 1479-1488

arXiv:cond-mat/0111149 [pdf, ps, other]

Charge pairing and superconductivity in high-Tc cuprate superconductors

Authors: Eduardo C. Marino Marcello B. Silva Neto

Abstract: We propose a model for high-Tc superconductors that includes both the spin fluctuations of the Cu++ magnetic ions and of the spins of O-- doped holes (spinons). The charge of the dopants (holons) is associated to quantum skyrmion excitations of the Cu++ spin background. The quantum skyrmion effective interaction potential is evaluated as a function of do** and temperature, indicating that Coop… ▽ More We propose a model for high-Tc superconductors that includes both the spin fluctuations of the Cu++ magnetic ions and of the spins of O-- doped holes (spinons). The charge of the dopants (holons) is associated to quantum skyrmion excitations of the Cu++ spin background. The quantum skyrmion effective interaction potential is evaluated as a function of do** and temperature, indicating that Cooper pair formation is determined by the competition between these two types of spin fluctuations. The superconducting transition occurs when the effective potential allows for skyrmion bound states. Our theoretical predictions for the superconducting phase diagram of LSCO and YBCO are in good agreement with experiment. △ Less

Submitted 10 November, 2001; v1 submitted 9 November, 2001; originally announced November 2001.

Comments: 4 pages, 2 figures

arXiv:solv-int/9703009 [pdf, ps, other]

doi 10.1103/PhysRevE.58.2526

Linearizability of the Perturbed Burgers Equation

Authors: R. A. Kraenkel, J. G. Pereira, E. C. de Rey Neto

Abstract: We show in this letter that the perturbed Burgers equation $u_t = 2uu_x + u_{xx} + ε( 3 α_1 u^2 u_x + 3α_2 uu_{xx} + 3α_3 u_x^2 + α_4 u_{xxx} )$ is equivalent, through a near-identity transformation and up to order ε, to a linearizable equation if the condition $3α_1 - 3α_3 - 3/2 α_2 + 3/2 α_4 = 0$ is satisfied. In the case this condition is not fulfilled, a normal form for the equation under co… ▽ More We show in this letter that the perturbed Burgers equation $u_t = 2uu_x + u_{xx} + ε( 3 α_1 u^2 u_x + 3α_2 uu_{xx} + 3α_3 u_x^2 + α_4 u_{xxx} )$ is equivalent, through a near-identity transformation and up to order ε, to a linearizable equation if the condition $3α_1 - 3α_3 - 3/2 α_2 + 3/2 α_4 = 0$ is satisfied. In the case this condition is not fulfilled, a normal form for the equation under consideration is given. Then, to illustrate our results, we make a linearizability analysis of the equations governing the dynamics of a one-dimensional gas. △ Less

Submitted 19 March, 1997; originally announced March 1997.

Comments: 10 pages, RevTeX, no figures

Report number: IFT-P.020/97

Showing 1–35 of 35 results for author: Neto, E C