
The Informativeness of Combined Experimental and Observational Data under Dynamic Selection

Abstract: This paper addresses the challenge of estimating the Average Treatment Effect on the Treated Survivors (ATETS; Vikstrom et al., 2018) in the absence of long-term experimental data, utilizing available long-term observational data instead. We establish two theoretical results. First, it is impossible to obtain informative bounds for the ATETS with no model restriction and no auxiliary data. Second, to overturn this negative result, we explore as a promising avenue the recent econometric developments in combining experimental and observational data (e.g., Athey et al., 2020, 2019); we indeed find that exploiting short-term experimental data can be informative without imposing classical model restrictions. Furthermore, building on Chesher and Rosen (2017), we explore how to systematically derive sharp identification bounds, exploiting both the novel data-combination principles and classical model restrictions. Applying the proposed method, we explore what can be learned about the long-run effects of job training programs on employment without long-term experimental data.

Submitted 24 March, 2024; originally announced March 2024.

arXiv:2403.01318 [pdf, other]

High-Dimensional Tail Index Regression: with An Application to Text Analyses of Viral Posts in Social Media

Authors: Yuya Sasaki, Jing Tao, Yulong Wang

Abstract: Motivated by the empirical power law of the distributions of credits (e.g., the number of "likes") of viral posts in social media, we introduce the high-dimensional tail index regression and methods of estimation and inference for its parameters. We propose a regularized estimator, establish its consistency, and derive its convergence rate. To conduct inference, we propose to debias the regularized estimate, and establish the asymptotic normality of the debiased estimator. Simulation studies support our theory. These methods are applied to text analyses of viral posts in X (formerly Twitter) concerning LGBTQ+.

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.19268 [pdf, ps, other]

Extremal quantiles of intermediate orders under two-way clustering

Authors: Harold D. Chiang, Ryutah Kato, Yuya Sasaki

Abstract: This paper investigates extremal quantiles under two-way cluster dependence. We demonstrate that the limiting distribution of the unconditional intermediate order quantiles in the tails converges to a Gaussian distribution. This is remarkable as two-way cluster dependence entails potential non-Gaussianity in general, but extremal quantiles do not suffer from this issue. Building upon this result, we extend our analysis to extremal quantile regressions of intermediate order.

Submitted 4 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

arXiv:2401.12050 [pdf, other]

A Bracketing Relationship for Long-Term Policy Evaluation with Combined Experimental and Observational Data

Authors: Yechan Park, Yuya Sasaki

Abstract: Combining short-term experimental data with observational data enables credible long-term policy evaluation. The literature offers two key but non-nested assumptions, namely the latent unconfoundedness (LU; Athey et al., 2020) and equi-confounding bias (ECB; Ghassami et al., 2022) conditions, to correct observational selection. Committing to the wrong assumption leads to biased estimation. To mitigate such risks, we provide a novel bracketing relationship (cf. Angrist and Pischke, 2009) repurposed for the setting with data combination: the LU-based estimand and the ECB-based estimand serve as the lower and upper bounds, respectively, with the true causal effect lying in between if either assumption holds. For researchers further seeking point estimates, our Lalonde-style exercise suggests the conservatively more robust LU-based lower bounds align closely with the hold-out experimental estimates for educational policy evaluation. We investigate the economic substantives of these findings through the lens of a nonparametric class of selection mechanisms and sensitivity analysis. We uncover as key the sub-martingale property and sufficient-statistics role (Chetty, 2009) of the potential outcomes of student test scores (Chetty et al., 2011, 2014).

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2308.10138 [pdf, other]

On the Inconsistency of Cluster-Robust Inference and How Subsampling Can Fix It

Authors: Harold D. Chiang, Yuya Sasaki, Yulong Wang

Abstract: Conventional methods of cluster-robust inference are inconsistent in the presence of unignorably large clusters. We formalize this claim by establishing a necessary and sufficient condition for the consistency of the conventional methods. We find that this condition for the consistency is rejected for a majority of empirical research papers. In this light, we propose a novel score subsampling method that achieves uniform size control over a broad class of data generating processes, covering that fails the conventional method. Simulation studies support these claims. With real data used by an empirical paper, we showcase that the conventional methods conclude significance while our proposed method concludes insignificance.

Submitted 23 March, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

Comments: keywords: cluster-robust inference, score subsampling, unignorably large cluster

arXiv:2304.08974 [pdf, ps, other]

Doubly Robust Estimators with Weak Overlap

Authors: Yukun Ma, Pedro H. C. Sant'Anna, Yuya Sasaki, Takuya Ura

Abstract: In this paper, we derive a new class of doubly robust estimators for treatment effect estimands that is also robust against weak covariate overlap. Our proposed estimator relies on trimming observations with extreme propensity scores and uses a bias correction device for trimming bias. Our framework accommodates many research designs, such as unconfoundedness, local treatment effects, and difference-in-differences. Simulation exercises illustrate that our proposed tools indeed have attractive finite sample properties, which are aligned with our theoretical asymptotic results.

Submitted 22 April, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

arXiv:2301.13775 [pdf, other]

On Using The Two-Way Cluster-Robust Standard Errors

Authors: Harold D Chiang, Yuya Sasaki

Abstract: Thousands of papers have reported two-way cluster-robust (TWCR) standard errors. However, the recent econometrics literature points out the potential non-gaussianity of two-way cluster sample means, and thus invalidity of the inference based on the TWCR standard errors. Fortunately, simulation studies nonetheless show that the gaussianity is rather common than exceptional. This paper provides theoretical support for this encouraging observation. Specifically, we derive a novel central limit theorem for two-way clustered triangular arrays that justifies the use of the TWCR under very mild and interpretable conditions. We, therefore, hope that this paper will provide a theoretical justification for the legitimacy of most, if not all, of the thousands of those empirical papers that have used the TWCR standard errors. We provide a guide in practice as to when a researcher can employ the TWCR standard errors.

Submitted 31 January, 2023; originally announced January 2023.

arXiv:2211.14870 [pdf, other]

Extreme Changes in Changes

Authors: Yuya Sasaki, Yulong Wang

Abstract: Policy analysts are often interested in treating the units with extreme outcomes, such as infants with extremely low birth weights. Existing changes-in-changes (CIC) estimators are tailored to middle quantiles and do not work well for such subpopulations. This paper proposes a new CIC estimator to accurately estimate treatment effects at extreme quantiles. With its asymptotic normality, we also propose a method of statistical inference, which is simple to implement. Based on simulation studies, we propose to use our extreme CIC estimator for extreme, such as below 5% and above 95%, quantiles, while the conventional CIC estimator should be used for intermediate quantiles. Applying the proposed method, we study the effects of income gains from the 1993 EITC reform on infant birth weights for those in the most critical conditions. This paper is accompanied by a Stata command.

Submitted 20 May, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

arXiv:2210.16991 [pdf, other]

Non-Robustness of the Cluster-Robust Inference: with a Proposal of a New Robust Method

Authors: Yuya Sasaki, Yulong Wang

Abstract: The conventional cluster-robust (CR) standard errors may not be robust. They are vulnerable to data that contain a small number of large clusters. When a researcher uses the 51 states in the U.S. as clusters, the largest cluster (California) consists of about 10% of the total sample. Such a case in fact violates the assumptions under which the widely used CR methods are guaranteed to work. We formally show that the conventional CR methods fail if the distribution of cluster sizes follows a power law with exponent less than two. Besides the example of 51 state clusters, some examples are drawn from a list of recent original research articles published in a top journal. In light of these negative results about the existing CR methods, we propose a weighted CR (WCR) method as a simple fix. Simulation studies support our arguments that the WCR method is robust while the conventional CR methods are not.

Submitted 11 December, 2022; v1 submitted 30 October, 2022; originally announced October 2022.

arXiv:2209.05914 [pdf, other]

Estimation of Average Derivatives of Latent Regressors: With an Application to Inference on Buffer-Stock Saving

Authors: Hao Dong, Yuya Sasaki

Abstract: This paper proposes a density-weighted average derivative estimator based on two noisy measures of a latent regressor. Both measures have classical errors with possibly asymmetric distributions. We show that the proposed estimator achieves the root-n rate of convergence, and derive its asymptotic normal distribution for statistical inference. Simulation studies demonstrate excellent small-sample performance supporting the root-n asymptotic normality. Based on the proposed estimator, we construct a formal test on the sub-unity of the marginal propensity to consume out of permanent income (MPCP) under a nonparametric consumption model and a permanent-transitory model of income dynamics with nonparametric distribution. Applying the test to four recent waves of U.S. Panel Study of Income Dynamics (PSID), we reject the null hypothesis of the unit MPCP in favor of a sub-unit MPCP, supporting the buffer-stock model of saving.

Submitted 13 September, 2022; originally announced September 2022.

arXiv:2206.04257 [pdf, other]

Capital and Labor Income Pareto Exponents in the United States, 1916-2019

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: Accurately estimating income Pareto exponents is challenging due to limitations in data availability and the applicability of statistical methods. Using tabulated summaries of incomes from tax authorities and a recent estimation method, we estimate income Pareto exponents in U.S. for 1916-2019. We find that during the past three decades, the capital and labor income Pareto exponents have been stable at around 1.2 and 2. Our findings suggest that the top tail income and wealth inequality is higher and wealthy agents have twice as large an impact on the aggregate economy than previously thought but there is no clear trend post-1985.

Submitted 9 June, 2022; originally announced June 2022.

arXiv:2204.05480 [pdf, other]

doi 10.1016/j.jeconom.2023.105568

Tuning Parameter-Free Nonparametric Density Estimation from Tabulated Summary Data

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: Administrative data are often easier to access as tabulated summaries than in the original format due to confidentiality concerns. Motivated by this practical feature, we propose a novel nonparametric density estimation method from tabulated summary data based on maximum entropy and prove its strong uniform consistency. Unlike existing kernel-based estimators, our estimator is free from tuning parameters and admits a closed-form density that is convenient for post-estimation analysis. We apply the proposed method to the tabulated summary data of the U.S. tax returns to estimate the income distribution.

Submitted 17 May, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

arXiv:2203.08014 [pdf, other]

Non-Existent Moments of Earnings Growth

Authors: Silvia Sarpietro, Yuya Sasaki, Yulong Wang

Abstract: The literature often employs moment-based earnings risk measures like variance, skewness, and kurtosis. However, under heavy-tailed distributions, these moments may not exist in the population. Our empirical analysis reveals that population kurtosis, skewness, and variance often do not exist for the conditional distribution of earnings growth. This challenges moment-based analyses. We propose robust conditional Pareto exponents as novel earnings risk measures, developing estimation and inference methods. Using the UK New Earnings Survey Panel Dataset (NESPD) and US Panel Study of Income Dynamics (PSID), we find: 1) Moments often fail to exist; 2) Earnings risk increases over the life cycle; 3) Job stayers face higher earnings risk; 4) These patterns persist during the 2007-2008 recession and the 2015-2016 positive growth period.

Submitted 17 February, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

arXiv:2201.11304 [pdf, other]

Standard errors for two-way clustering with serially correlated time effects

Authors: Harold D Chiang, Bruce E Hansen, Yuya Sasaki

Abstract: We propose improved standard errors and an asymptotic distribution theory for two-way clustered panels. Our proposed estimator and theory allow for arbitrary serial dependence in the common time effects, which is excluded by existing two-way methods, including the popular two-way cluster standard errors of Cameron, Gelbach, and Miller (2011) and the cluster bootstrap of Menzel (2021). Our asymptotic distribution theory is the first which allows for this level of inter-dependence among the observations. Under weak regularity conditions, we demonstrate that the least squares estimator is asymptotically normal, our proposed variance estimator is consistent, and t-ratios are asymptotically standard normal, permitting conventional inference. We present simulation evidence that confidence intervals constructed with our proposed standard errors obtain superior coverage performance relative to existing methods. We illustrate the relevance of the proposed method in an empirical application to a standard Fama-French three-factor regression.

Submitted 13 December, 2023; v1 submitted 26 January, 2022; originally announced January 2022.

arXiv:2110.12041 [pdf, other]

Slow Movers in Panel Data

Authors: Yuya Sasaki, Takuya Ura

Abstract: Panel data often contain stayers (units with no within-variations) and slow movers (units with little within-variations). In the presence of many slow movers, conventional econometric methods can fail to work. We propose a novel method of robust inference for the average partial effects in correlated random coefficient models robustly across various distributions of within-variations, including the cases with many stayers and/or many slow movers in a unified manner. In addition to this robustness property, our proposed method entails smaller biases and hence improves accuracy in inference compared to existing alternatives. Simulation studies demonstrate our theoretical claims about these properties: the conventional 95% confidence interval covers the true parameter value with 37-93% frequencies, whereas our proposed one achieves 93-96% coverage frequencies.

Submitted 22 October, 2021; originally announced October 2021.

arXiv:2110.04365 [pdf, ps, other]

Dyadic double/debiased machine learning for analyzing determinants of free trade agreements

Authors: Harold D Chiang, Yukun Ma, Joel Rodrigue, Yuya Sasaki

Abstract: This paper presents novel methods and theories for estimation and inference about parameters in econometric models using machine learning for nuisance parameters estimation when data are dyadic. We propose a dyadic cross fitting method to remove over-fitting biases under arbitrary dyadic dependence. Together with the use of Neyman orthogonal scores, this novel cross fitting method enables root-$n$ consistent estimation and inference robustly against dyadic dependence. We illustrate an application of our general framework to high-dimensional network link formation models. With this method applied to empirical data of international economic networks, we reexamine determinants of free trade agreements (FTA) viewed as links formed in the dyad composed of world economies. We document that standard methods may lead to misleading conclusions for numerous classic determinants of FTA formation due to biased point estimates or standard errors which are too small.

Submitted 19 December, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

arXiv:2108.09520 [pdf, ps, other]

Inference in high-dimensional regression models without the exact or $L^p$ sparsity

Authors: Jooyoung Cha, Harold D. Chiang, Yuya Sasaki

Abstract: This paper proposes a new method of inference in high-dimensional regression models and high-dimensional IV regression models. Estimation is based on a combined use of the orthogonal greedy algorithm, high-dimensional Akaike information criterion, and double/debiased machine learning. The method of inference for any low-dimensional subvector of high-dimensional parameters is based on a root-$N$ asymptotic normality, which is shown to hold without requiring the exact sparsity condition or the $L^p$ sparsity condition. Simulation studies demonstrate superior finite-sample performance of this proposed method over those based on the LASSO or the random forest, especially under less sparse models. We illustrate an application to production analysis with a panel of Chilean firms.

Submitted 31 December, 2022; v1 submitted 21 August, 2021; originally announced August 2021.

arXiv:2105.10007 [pdf, other]

Fixed-k Tail Regression: New Evidence on Tax and Wealth Inequality from Forbes 400

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: We develop a novel fixed-k tail regression method that accommodates the unique feature in the Forbes 400 data that observations are truncated from below at the 400th largest order statistic. Applying this method, we find that higher maximum marginal income tax rates induce higher wealth Pareto exponents. Setting the maximum tax rate to 30-40% (as in U.S. currently) leads to a Pareto exponent of 1.5-1.8, while counterfactually setting it to 80% (as suggested by Piketty, 2014) would lead to a Pareto exponent of 2.6. We present a simple economic model that explains these findings and discuss the welfare implications of taxation.

Submitted 14 September, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

arXiv:2104.14458 [pdf, other]

doi 10.1016/j.jeconom.2022.07.003

Nonparametric Difference-in-Differences in Repeated Cross-Sections with Continuous Treatments

Authors: Xavier D'Haultfoeuille, Stefan Hoderlein, Yuya Sasaki

Abstract: This paper studies the identification of causal effects of a continuous treatment using a new difference-in-difference strategy. Our approach allows for endogeneity of the treatment, and employs repeated cross-sections. It requires an exogenous change over time which affects the treatment in a heterogeneous way, stationarity of the distribution of unobservables and a rank invariance condition on the time trend. On the other hand, we do not impose any functional form restrictions or an additive time trend, and we are invariant to the scaling of the dependent variable. Under our conditions, the time trend can be identified using a control group, as in the binary difference-in-differences literature. In our scenario, however, this control group is defined by the data. We then identify average and quantile treatment effect parameters. We develop corresponding nonparametric estimators and study their asymptotic properties. Finally, we apply our results to the effect of disposable income on consumption.

Submitted 23 May, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

Comments: 50 pages, 20 figures

Journal ref: Journal of Econometrics 2023 (234)

arXiv:2103.00557 [pdf, ps, other]

Algorithmic subsampling under multiway clustering

Authors: Harold D. Chiang, Jiatong Li, Yuya Sasaki

Abstract: This paper proposes a novel method of algorithmic subsampling (data sketching) for multiway cluster dependent data. We establish a new uniform weak law of large numbers and a new central limit theorem for the multiway algorithmic subsample means. Consequently, we discover an additional advantage of the algorithmic subsampling that it allows for robustness against potential degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution under multiway clustering. Simulation studies support this novel result, and demonstrate that inference with the algorithmic subsampling entails more accuracy than that without the algorithmic subsampling. Applying these basic asymptotic theories, we derive the consistency and the asymptotic normality for the multiway algorithmic subsampling generalized method of moments estimator and for the multiway algorithmic subsampling M-estimator. We illustrate an application to scanner data.

Submitted 30 October, 2022; v1 submitted 28 February, 2021; originally announced March 2021.

arXiv:2102.06586 [pdf, ps, other]

Linear programming approach to nonparametric inference under shape restrictions: with an application to regression kink designs

Authors: Harold D. Chiang, Kengo Kato, Yuya Sasaki, Takuya Ura

Abstract: We develop a novel method of constructing confidence bands for nonparametric regression functions under shape constraints. This method can be implemented via a linear programming, and it is thus computationally appealing. We illustrate a usage of our proposed method with an application to the regression kink design (RKD). Econometric analyses based on the RKD often suffer from wide confidence intervals due to slow convergence rates of nonparametric derivative estimators. We demonstrate that economic models and structures motivate shape restrictions, which in turn contribute to shrinking the confidence interval for an analysis of the causal effects of unemployment insurance benefits on unemployment durations.

Submitted 12 February, 2021; originally announced February 2021.

arXiv:2012.07624 [pdf, ps, other]

Welfare Analysis via Marginal Treatment Effects

Authors: Yuya Sasaki, Takuya Ura

Abstract: Consider a causal structure with endogeneity (i.e., unobserved confoundedness) in empirical data, where an instrumental variable is available. In this setting, we show that the mean social welfare function can be identified and represented via the marginal treatment effect (MTE, Bjorklund and Moffitt, 1987) as the operator kernel. This representation result can be applied to a variety of statistical decision rules for treatment choice, including plug-in rules, Bayes rules, and empirical welfare maximization (EWM) rules as in Hirano and Porter (2020, Section 2.3). Focusing on the application to the EWM framework of Kitagawa and Tetenov (2018), we provide convergence rates of the worst case average welfare loss (regret) in the spirit of Manski (2004).

Submitted 14 December, 2020; originally announced December 2020.

arXiv:2009.05150 [pdf, other]

Inference for high-dimensional exchangeable arrays

Authors: Harold D. Chiang, Kengo Kato, Yuya Sasaki

Abstract: We consider inference for high-dimensional separately and jointly exchangeable arrays where the dimensions may be much larger than the sample sizes. For both exchangeable arrays, we first derive high-dimensional central limit theorems over the rectangles and subsequently develop novel multiplier bootstraps with theoretical guarantees. These theoretical results rely on new technical tools such as Hoeffding-type decomposition and maximal inequalities for the degenerate components in the Hoeffding-type decomposition for the exchangeable arrays. We exhibit applications of our methods to uniform confidence bands for density estimation under joint exchangeability and penalty choice for $\ell_1$-penalized regression under separate exchangeability. Extensive simulations demonstrate precise uniform coverage rates. We illustrate by constructing uniform confidence bands for international trade network densities.

Submitted 9 July, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

arXiv:2007.13659 [pdf, ps, other]

Unconditional Quantile Regression with High Dimensional Data

Authors: Yuya Sasaki, Takuya Ura, Yichong Zhang

Abstract: This paper considers estimation and inference for heterogeneous counterfactual effects with high-dimensional data. We propose a novel robust score for debiased estimation of the unconditional quantile regression (Firpo, Fortin, and Lemieux, 2009) as a measure of heterogeneous counterfactual marginal effects. We propose a multiplier bootstrap inference and develop asymptotic theories to guarantee the size control in large samples. Simulation studies support our theories. Applying the proposed method to Job Corps survey data, we find that a policy which counterfactually extends the duration of exposures to the Job Corps training program will be effective especially for the targeted subpopulations of lower potential wage earners.

Submitted 24 February, 2022; v1 submitted 27 July, 2020; originally announced July 2020.

arXiv:2006.02541 [pdf, other]

Testing Finite Moment Conditions for the Consistency and the Root-N Asymptotic Normality of the GMM and M Estimators

Authors: Yuya Sasaki, Yulong Wang

Abstract: Common approaches to inference for structural and reduced-form parameters in empirical economic analysis are based on the consistency and the root-n asymptotic normality of the GMM and M estimators. The canonical consistency (respectively, root-n asymptotic normality) for these classes of estimators requires at least the first (respectively, second) moment of the score to be finite. In this article, we present a method of testing these conditions for the consistency and the root-n asymptotic normality of the GMM and M estimators. The proposed test controls size nearly uniformly over the set of data generating processes that are compatible with the null hypothesis. Simulation studies support this theoretical result. Applying the proposed test to the market share data from the Dominick's Finer Foods retail chain, we find that a common ad hoc procedure to deal with zero market shares in analysis of differentiated products markets results in a failure to satisfy the conditions for both the consistency and the root-n asymptotic normality.

Submitted 2 September, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

arXiv:1909.03489 [pdf, ps, other]

Multiway Cluster Robust Double/Debiased Machine Learning

Authors: Harold D. Chiang, Kengo Kato, Yukun Ma, Yuya Sasaki

Abstract: This paper investigates double/debiased machine learning (DML) under multiway clustered sampling environments. We propose a novel multiway cross fitting algorithm and a multiway DML estimator based on this algorithm. We also develop a multiway cluster robust standard error formula. Simulations indicate that the proposed procedure has favorable finite sample performance. Applying the proposed method to market share data for demand analysis, we obtain larger two-way cluster robust standard errors than non-robust ones.

Submitted 4 March, 2020; v1 submitted 8 September, 2019; originally announced September 2019.

arXiv:1909.00294 [pdf, other]

Fixed-k Inference for Conditional Extremal Quantiles

Authors: Yuya Sasaki, Yulong Wang

Abstract: We develop a new extreme value theory for repeated cross-sectional and panel data to construct asymptotically valid confidence intervals (CIs) for conditional extremal quantiles from a fixed number $k$ of nearest-neighbor tail observations. As a by-product, we also construct CIs for extremal quantiles of coefficients in linear random coefficient models. For any fixed $k$, the CIs are uniformly valid without parametric assumptions over a set of nonparametric data generating processes associated with various tail indices. Simulation studies show that our CIs exhibit superior small-sample coverage and length properties than alternative nonparametric methods based on asymptotic normality. Applying the proposed method to Natality Vital Statistics, we study factors of extremely low birth weights. We find that signs of major effects are the same as those found in preceding studies based on parametric models, but with different magnitudes.

Submitted 19 July, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

arXiv:1905.02107 [pdf, ps, other]

Lasso under Multi-way Clustering: Estimation and Post-selection Inference

Authors: Harold D. Chiang, Yuya Sasaki

Abstract: This paper studies high-dimensional regression models with lasso when data is sampled under multi-way clustering. First, we establish convergence rates for the lasso and post-lasso estimators. Second, we propose a novel inference method based on a post-double-selection procedure and show its asymptotic validity. Our procedure can be easily implemented with existing statistical packages. Simulation results demonstrate that the proposed procedure works well in finite samples. We illustrate the proposed method with a couple of empirical applications to development and growth economics.

Submitted 21 August, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

arXiv:1904.00211 [pdf, ps, other]

Post-Selection Inference in Three-Dimensional Panel Data

Authors: Harold D. Chiang, Joel Rodrigue, Yuya Sasaki

Abstract: Three-dimensional panel models are widely used in empirical analysis. Researchers use various combinations of fixed effects for three-dimensional panels. When one imposes a parsimonious model and the true model is rich, then it incurs mis-specification biases. When one employs a rich model and the true model is parsimonious, then it incurs larger standard errors than necessary. It is therefore useful for researchers to know correct models. In this light, Lu, Miao, and Su (2018) propose methods of model selection. We advance this literature by proposing a method of post-selection inference for regression parameters. Despite our use of the lasso technique as a means of model selection, our assumptions allow for many and even all fixed effects to be nonzero. Simulation studies demonstrate that the proposed method is more precise than under-fitting fixed effect estimators, is more efficient than over-fitting fixed effect estimators, and allows for as accurate inference as the oracle estimator.

Submitted 30 April, 2019; v1 submitted 30 March, 2019; originally announced April 2019.

arXiv:1808.09375 [pdf, other]

Inference based on Kotlarski's Identity

Authors: Kengo Kato, Yuya Sasaki, Takuya Ura

Abstract: Kotlarski's identity has been widely used in applied economic research. However, how to conduct inference based on this popular identification approach has been an open question for two decades. This paper addresses this open problem by constructing a novel confidence band for the density function of a latent variable in a repeated measurement error model. The confidence band builds on our finding that we can rewrite Kotlarski's identity as a system of linear moment restrictions. The confidence band controls the asymptotic size uniformly over a class of data generating processes, and it is consistent against all fixed alternatives. Simulation studies support our theoretical results.

Submitted 8 September, 2019; v1 submitted 28 August, 2018; originally announced August 2018.

arXiv:1805.11503 [pdf, ps, other]

Estimation and Inference for Policy Relevant Treatment Effects

Authors: Yuya Sasaki, Takuya Ura

Abstract: The policy relevant treatment effect (PRTE) measures the average effect of switching from a status-quo policy to a counterfactual policy. Estimation of the PRTE involves estimation of multiple preliminary parameters, including propensity scores, conditional expectation functions of the outcome and covariates given the propensity score, and marginal treatment effects. These preliminary estimators can affect the asymptotic distribution of the PRTE estimator in complicated and intractable manners. In this light, we propose an orthogonal score for double debiased estimation of the PRTE, whereby the asymptotic distribution of the PRTE estimator is obtained without any influence of preliminary parameter estimators as long as they satisfy mild requirements of convergence rates. To our knowledge, this paper is the first to develop limit distribution theories for inference about the PRTE.

Submitted 16 July, 2020; v1 submitted 29 May, 2018; originally announced May 2018.

arXiv:1711.10031 [pdf, ps, other]

Constructive Identification of Heterogeneous Elasticities in the Cobb-Douglas Production Function

Authors: Tong Li, Yuya Sasaki

Abstract: This paper presents the identification of heterogeneous elasticities in the Cobb-Douglas production function. The identification is constructive with closed-form formulas for the elasticity with respect to each input for each firm. We propose that the flexible input cost ratio plays the role of a control function under "non-collinear heterogeneity" between elasticities with respect to two flexible inputs. The ex ante flexible input cost share can be used to identify the elasticities with respect to flexible inputs for each firm. The elasticities with respect to labor and capital can be subsequently identified for each firm under the timing assumption admitting the functional independence.

Submitted 27 November, 2017; originally announced November 2017.

Showing 1–32 of 32 results for author: Sasaki, Y