Search | arXiv e-print repository

Difference-in-Differences with Time-Varying Covariates in the Parallel Trends Assumption

Authors: Carolina Caetano, Brantly Callaway

Abstract: In this paper, we study difference-in-differences identification and estimation strategies where the parallel trends assumption holds after conditioning on time-varying covariates and/or time-invariant covariates. Our first main contribution is to point out a number of weaknesses of commonly used two-way fixed effects (TWFE) regressions in this context. In addition to issues related to multiple pe… ▽ More In this paper, we study difference-in-differences identification and estimation strategies where the parallel trends assumption holds after conditioning on time-varying covariates and/or time-invariant covariates. Our first main contribution is to point out a number of weaknesses of commonly used two-way fixed effects (TWFE) regressions in this context. In addition to issues related to multiple periods and variation in treatment timing that have been emphasized in the literature, we show that, even in the case with only two time periods, TWFE regressions are not generally robust to (i) paths of untreated potential outcomes depending on the level of time-varying covariates (as opposed to only the change in the covariates over time), (ii) paths of untreated potential outcomes depending on time-invariant covariates, and (iii) violations of linearity conditions for outcomes over time and/or the propensity score. Even in cases where none of the previous three issues hold, we show that TWFE regressions can suffer from negative weighting and weight-reversal issues. Thus, TWFE regressions can deliver misleading estimates of causal effect parameters in a number of empirically relevant cases. Second, we extend these arguments to the case of multiple periods and variation in treatment timing. Third, we provide simple diagnostics for assessing the extent of misspecification bias arising due to TWFE regressions. Finally, we propose alternative (and simple) estimation strategies that can circumvent these issues with two-way fixed regressions. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: This submission contains the same content as v2 of arXiv:2202.02903v2 but is now an independent project

Report number: arXiv:split-001

arXiv:2202.02903 [pdf, ps, other]

Difference in Differences with Time-Varying Covariates

Authors: Carolina Caetano, Brantly Callaway, Stroud Payne, Hugo Sant'Anna Rodrigues

Abstract: This paper considers identification and estimation of causal effect parameters from participating in a binary treatment in a difference in differences (DID) setup when the parallel trends assumption holds after conditioning on observed covariates. Relative to existing work in the econometrics literature, we consider the case where the value of covariates can change over time and, potentially, wher… ▽ More This paper considers identification and estimation of causal effect parameters from participating in a binary treatment in a difference in differences (DID) setup when the parallel trends assumption holds after conditioning on observed covariates. Relative to existing work in the econometrics literature, we consider the case where the value of covariates can change over time and, potentially, where participating in the treatment can affect the covariates themselves. We propose new empirical strategies in both cases. We also consider two-way fixed effects (TWFE) regressions that include time-varying regressors, which is the most common way that DID identification strategies are implemented under conditional parallel trends. We show that, even in the case with only two time periods, these TWFE regressions are not generally robust to (i) time-varying covariates being affected by the treatment, (ii) treatment effects and/or paths of untreated potential outcomes depending on the level of time-varying covariates in addition to only the change in the covariates over time, (iii) treatment effects and/or paths of untreated potential outcomes depending on time-invariant covariates, (iv) treatment effect heterogeneity with respect to observed covariates, and (v) violations of strong functional form assumptions, both for outcomes over time and the propensity score, that are unlikely to be plausible in most DID applications. Thus, TWFE regressions can deliver misleading estimates of causal effect parameters in a number of empirically relevant cases. We propose both doubly robust estimands and regression adjustment/imputation strategies that are robust to these issues while not being substantially more challenging to implement. △ Less

Submitted 23 June, 2024; v1 submitted 6 February, 2022; originally announced February 2022.

Comments: This version reverts to v1 of this paper (identical to arXiv:2202.02903v1, posted on Feb. 7, 2022). The project is now split into two parts. The first part is available at arXiv:2406.15288, mistakenly labeled as v2 of this submission. The second part will be posted separately. This paper is no longer active. Please see the two descendant papers for updates

arXiv:2007.00185 [pdf, other]

Regression Discontinuity Design with Multivalued Treatments

Authors: Carolina Caetano, Gregorio Caetano, Juan Carlos Escanciano

Abstract: We study identification and estimation in the Regression Discontinuity Design (RDD) with a multivalued treatment variable. We also allow for the inclusion of covariates. We show that without additional information, treatment effects are not identified. We give necessary and sufficient conditions that lead to identification of LATEs as well as of weighted averages of the conditional LATEs. We show… ▽ More We study identification and estimation in the Regression Discontinuity Design (RDD) with a multivalued treatment variable. We also allow for the inclusion of covariates. We show that without additional information, treatment effects are not identified. We give necessary and sufficient conditions that lead to identification of LATEs as well as of weighted averages of the conditional LATEs. We show that if the first stage discontinuities of the multiple treatments conditional on covariates are linearly independent, then it is possible to identify multivariate weighted averages of the treatment effects with convenient identifiable weights. If, moreover, treatment effects do not vary with some covariates or a flexible parametric structure can be assumed, it is possible to identify (in fact, over-identify) all the treatment effects. The over-identification can be used to test these assumptions. We propose a simple estimator, which can be programmed in packaged software as a Two-Stage Least Squares regression, and packaged standard errors and tests can also be used. Finally, we implement our approach to identify the effects of different types of insurance coverage on health care utilization, as in Card, Dobkin and Maestas (2008). △ Less

Submitted 30 June, 2020; originally announced July 2020.

arXiv:1909.05704 [pdf, other]

Skeleton Image Representation for 3D Action Recognition based on Tree Structure and Reference Joints

Authors: Carlos Caetano, François Brémond, William Robson Schwartz

Abstract: In the last years, the computer vision research community has studied on how to model temporal dynamics in videos to employ 3D human action recognition. To that end, two main baseline approaches have been researched: (i) Recurrent Neural Networks (RNNs) with Long-Short Term Memory (LSTM); and (ii) skeleton image representations used as input to a Convolutional Neural Network (CNN). Although RNN ap… ▽ More In the last years, the computer vision research community has studied on how to model temporal dynamics in videos to employ 3D human action recognition. To that end, two main baseline approaches have been researched: (i) Recurrent Neural Networks (RNNs) with Long-Short Term Memory (LSTM); and (ii) skeleton image representations used as input to a Convolutional Neural Network (CNN). Although RNN approaches present excellent results, such methods lack the ability to efficiently learn the spatial relations between the skeleton joints. On the other hand, the representations used to feed CNN approaches present the advantage of having the natural ability of learning structural information from 2D arrays (i.e., they learn spatial relations from the skeleton joints). To further improve such representations, we introduce the Tree Structure Reference Joints Image (TSRJI), a novel skeleton image representation to be used as input to CNNs. The proposed representation has the advantage of combining the use of reference joints and a tree structure skeleton. While the former incorporates different spatial relationships between the joints, the latter preserves important spatial relations by traversing a skeleton tree with a depth-first order algorithm. Experimental results demonstrate the effectiveness of the proposed representation for 3D action recognition on two datasets achieving state-of-the-art results on the recent NTU RGB+D~120 dataset. △ Less

Submitted 11 September, 2019; originally announced September 2019.

Comments: Conference on Graphics, Patterns and Images (SIBGRAPI2019). arXiv admin note: substantial text overlap with arXiv:1907.13025

arXiv:1907.13025 [pdf, other]

SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition

Authors: Carlos Caetano, Jessica Sena, François Brémond, Jefersson A. dos Santos, William Robson Schwartz

Abstract: Due to the availability of large-scale skeleton datasets, 3D human action recognition has recently called the attention of computer vision community. Many works have focused on encoding skeleton data as skeleton image representations based on spatial structure of the skeleton joints, in which the temporal dynamics of the sequence is encoded as variations in columns and the spatial structure of eac… ▽ More Due to the availability of large-scale skeleton datasets, 3D human action recognition has recently called the attention of computer vision community. Many works have focused on encoding skeleton data as skeleton image representations based on spatial structure of the skeleton joints, in which the temporal dynamics of the sequence is encoded as variations in columns and the spatial structure of each frame is represented as rows of a matrix. To further improve such representations, we introduce a novel skeleton image representation to be used as input of Convolutional Neural Networks (CNNs), named SkeleMotion. The proposed approach encodes the temporal dynamics by explicitly computing the magnitude and orientation values of the skeleton joints. Different temporal scales are employed to compute motion values to aggregate more temporal dynamics to the representation making it able to capture longrange joint interactions involved in actions as well as filtering noisy motion values. Experimental results demonstrate the effectiveness of the proposed representation on 3D action recognition outperforming the state-of-the-art on NTU RGB+D 120 dataset. △ Less

Submitted 30 July, 2019; originally announced July 2019.

Comments: 16-th IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS2019)

arXiv:1708.06637 [pdf, other]

Activity Recognition based on a Magnitude-Orientation Stream Network

Authors: Carlos Caetano, Victor H. C. de Melo, Jefersson A. dos Santos, William Robson Schwartz

Abstract: The temporal component of videos provides an important clue for activity recognition, as a number of activities can be reliably recognized based on the motion information. In view of that, this work proposes a novel temporal stream for two-stream convolutional networks based on images computed from the optical flow magnitude and orientation, named Magnitude-Orientation Stream (MOS), to learn the m… ▽ More The temporal component of videos provides an important clue for activity recognition, as a number of activities can be reliably recognized based on the motion information. In view of that, this work proposes a novel temporal stream for two-stream convolutional networks based on images computed from the optical flow magnitude and orientation, named Magnitude-Orientation Stream (MOS), to learn the motion in a better and richer manner. Our method applies simple nonlinear transformations on the vertical and horizontal components of the optical flow to generate input images for the temporal stream. Experimental results, carried on two well-known datasets (HMDB51 and UCF101), demonstrate that using our proposed temporal stream as input to existing neural network architectures can improve their performance for activity recognition. Results demonstrate that our temporal stream provides complementary information able to improve the classical two-stream methods, indicating the suitability of our approach to be used as a temporal video representation. △ Less

Submitted 22 August, 2017; originally announced August 2017.

Comments: 8 pages, SIBGRAPI 2017

arXiv:1608.03010 [pdf, other]

doi 10.1016/j.newast.2016.08.001

The OPD Photometric Survey of Open Clusters II. robust determination of the fundamental parameters of 24 open clusters

Authors: Hektor Monteiro, Wilton S. Dias, Gabriel R. Hickel, Thiago C. Caetano

Abstract: In the second paper of the series we continue the investigation of open cluster fundamental parameters using a robust global optimization method to fit model isochrones to photometric data. We present optical UBVRI CCD photometry (Johnsons-Cousins system) observations for 24 neglected open clusters, of which 14 have high quality data in the visible obtained for the first time, as a part of our ong… ▽ More In the second paper of the series we continue the investigation of open cluster fundamental parameters using a robust global optimization method to fit model isochrones to photometric data. We present optical UBVRI CCD photometry (Johnsons-Cousins system) observations for 24 neglected open clusters, of which 14 have high quality data in the visible obtained for the first time, as a part of our ongoing survey being carried out in the 0.6m telescope of the Pico dos Dias Observatory in Brazil. All objects were then analyzed with a global optimization tool developed by our group which estimates the membership likelihood of the observed stars and fits an isochrone from which a distance, age, reddening, total to selective extinction ratio $R_{V}$ (included in this work as a new free parameter) and metallicity are estimated. Based on those estimates and their associated errors we analyzed the status of each object as real clusters or not, finding that two are likely to be asterisms. We also identify important discrepancies between our results and previous ones obtained in the literature which were determined using 2MASS photometry. △ Less

Submitted 9 August, 2016; originally announced August 2016.

Comments: 17 pages, accepted for publication in New Astronomy

arXiv:1605.03804 [pdf, other]

doi 10.1016/j.neucom.2016.03.099

A Mid-level Video Representation based on Binary Descriptors: A Case Study for Pornography Detection

Authors: Carlos Caetano, Sandra Avila, William Robson Schwartz, Silvio Jamil F. Guimarães, Arnaldo de A. Araújo

Abstract: With the growing amount of inappropriate content on the Internet, such as pornography, arises the need to detect and filter such material. The reason for this is given by the fact that such content is often prohibited in certain environments (e.g., schools and workplaces) or for certain publics (e.g., children). In recent years, many works have been mainly focused on detecting pornographic images… ▽ More With the growing amount of inappropriate content on the Internet, such as pornography, arises the need to detect and filter such material. The reason for this is given by the fact that such content is often prohibited in certain environments (e.g., schools and workplaces) or for certain publics (e.g., children). In recent years, many works have been mainly focused on detecting pornographic images and videos based on visual content, particularly on the detection of skin color. Although these approaches provide good results, they generally have the disadvantage of a high false positive rate since not all images with large areas of skin exposure are necessarily pornographic images, such as people wearing swimsuits or images related to sports. Local feature based approaches with Bag-of-Words models (BoW) have been successfully applied to visual recognition tasks in the context of pornography detection. Even though existing methods provide promising results, they use local feature descriptors that require a high computational processing time yielding high-dimensional vectors. In this work, we propose an approach for pornography detection based on local binary feature extraction and BossaNova image representation, a BoW model extension that preserves more richly the visual information. Moreover, we propose two approaches for video description based on the combination of mid-level representations namely BossaNova Video Descriptor (BNVD) and BoW Video Descriptor (BoW-VD). The proposed techniques are promising, achieving an accuracy of 92.40%, thus reducing the classification error by 16% over the current state-of-the-art local features approach on the Pornography dataset. △ Less

Submitted 12 May, 2016; originally announced May 2016.

Comments: Manuscript accepted at Elsevier Neurocomputing

arXiv:1307.2182 [pdf, ps, other]

doi 10.1051/0004-6361/201321157

Fitting isochrones to open cluster photometric data III. Estimating metallicities from UBV photometry

Authors: A. F. Oliveira, H. Monteiro, W. S. Dias, T. C. Caetano

Abstract: The metallicity is a critical parameter that affects the correct determination fundamental characteristics stellar cluster and has important implications in Galactic and Stellar evolution research. Fewer than 10 % of the 2174 currently catalog open clusters have their metallicity determined in the literature. In this work we present a method for estimating the metallicity of open clusters via non-… ▽ More The metallicity is a critical parameter that affects the correct determination fundamental characteristics stellar cluster and has important implications in Galactic and Stellar evolution research. Fewer than 10 % of the 2174 currently catalog open clusters have their metallicity determined in the literature. In this work we present a method for estimating the metallicity of open clusters via non-subjective isochrone fitting using the cross-entropy global optimization algorithm applied to UBV photometric data. The free parameters distance, reddening, age, and metallicity simultaneously determined by the fitting method. The fitting procedure uses weights for the observational data based on the estimation of membership likelihood for each star, which considers the observational magnitude limit, the density profile of stars as a function of radius from the center of the cluster, and the density of stars in multi-dimensional magnitude space. We present results of [Fe/H] for nine well-studied open clusters based on 15 distinct UBV data sets. The [Fe/H] values obtained in the ten cases for which spectroscopic determinations were available in the literature agree, indicating that our method provides a good alternative to determining [Fe/H] by using an objective isochrone fitting. Our results show that the typical precision is about 0.1 dex. △ Less

Submitted 8 July, 2013; originally announced July 2013.

arXiv:1003.4230 [pdf, ps, other]

doi 10.1051/0004-6361/200913677

Fitting Isochrones to Open Cluster photometric data: A new global optimization tool

Authors: H. Monteiro, W. S. Dias, T. C. Caetano

Abstract: We present a new technique to fit color-magnitude diagrams of open clusters based on the Cross-Entropy global optimization algorithm. The method uses theoretical isochrones available in the literature and maximizes a weighted likelihood function based on distances measured in the color-magnitude space. The weights are obtained through a non parametric technique that takes into account the star dis… ▽ More We present a new technique to fit color-magnitude diagrams of open clusters based on the Cross-Entropy global optimization algorithm. The method uses theoretical isochrones available in the literature and maximizes a weighted likelihood function based on distances measured in the color-magnitude space. The weights are obtained through a non parametric technique that takes into account the star distance to the observed center of the cluster, observed magnitude uncertainties, the stellar density profile of the cluster among others. The parameters determined simultaneously are distance, reddening, age and metallicity. The method takes binary fraction into account and uses a Monte-Carlo approach to obtain uncertainties on the determined parameters for the cluster by running the fitting algorithm many times with a re-sampled data set through a bootstrap** procedure. We present results for 9 well studied open clusters, based on 15 distinct data sets, and show that the results are consistent with previous studies. The method is shown to be reliable and free of the subjectivity of most previous visual isochrone fitting techniques. △ Less

Submitted 22 March, 2010; originally announced March 2010.

Comments: 19 pages, 25 figures, accepted for publication in Astronomy&Astrophysics

arXiv:0902.4368 [pdf, other]

doi 10.1063/1.3154560

Anomalous Lattice Parameter of Magnetic Semiconductor Alloys

Authors: Clovis Caetano, Luiz G. Ferreira, Marcelo Marques, Lara K. Teles

Abstract: The addition of transition metals (TM) to III-V semiconductors radically changes their electronic, magnetic and structural properties. In contrast to the conventional semiconductor alloys, the lattice parameter in magnetic semiconductor alloys, including the ones with diluted concentration (the diluted magnetic semiconductors - DMS), cannot be determined uniquely from the composition. By using f… ▽ More The addition of transition metals (TM) to III-V semiconductors radically changes their electronic, magnetic and structural properties. In contrast to the conventional semiconductor alloys, the lattice parameter in magnetic semiconductor alloys, including the ones with diluted concentration (the diluted magnetic semiconductors - DMS), cannot be determined uniquely from the composition. By using first-principles calculations, we find a direct correlation between the magnetic moment and the anion-TM bond lengths. We derive a simple formula that determines the lattice parameter of a particular magnetic semiconductor by considering both the composition and magnetic moment. The formula makes accurate predictions of the lattice parameter behavior of AlMnN, AlCrN, GaMnN, GaCrN, GaCrAs and GaMnAs alloys. This new dependence can explain some of the hitherto puzzling experimentally observed anomalies, as well as, stimulate other kind of theoretical and experimental investigations. △ Less

Submitted 25 February, 2009; originally announced February 2009.

Comments: 3 figures

Showing 1–11 of 11 results for author: Caetano, C