Search | arXiv e-print repository

MIDAS-QR with 2-Dimensional Structure

Authors: Tibor Szendrei, Arnab Bhattacharjee, Mark E. Schaffer

Abstract: Mixed frequency data has been shown to improve the performance of growth-at-risk models in the literature. Most of the research has focused on imposing structure on the high-frequency lags when estimating MIDAS-QR models akin to what is done in mean models. However, only imposing structure on the lag-dimension can potentially induce quantile variation that would otherwise not be there. In this paper we extend the framework by introducing structure on both the lag dimension and the quantile dimension. In this way we are able to shrink unnecessary quantile variation in the high-frequency variables. This leads to more gradual lag profiles in both dimensions compared to the MIDAS-QR and UMIDAS-QR. We show that this proposed method leads to further gains in nowcasting and forecasting on a pseudo-out-of-sample exercise on US data.

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2405.20468 [pdf, other]

MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis

Authors: Mathieu Ciancone, Imene Kerboua, Marion Schaeffer, Wissam Siblini

Abstract: Recently, numerous embedding models have been made available and widely used for various NLP tasks. The Massive Text Embedding Benchmark (MTEB) has primarily simplified the process of choosing a model that performs well for several tasks in English, but extensions to other languages remain challenging. This is why we expand MTEB to propose the first massive benchmark of sentence embeddings for French. We gather 15 existing datasets in an easy-to-use interface and create three new French datasets for a global evaluation of 8 task categories. We compare 51 carefully selected embedding models on a large scale, conduct comprehensive statistical tests, and analyze the correlation between model performance and many of their characteristics. We find out that even if no model is the best on all tasks, large multilingual models pre-trained on sentence similarity perform exceptionally well. Our work comes with open-source code, new datasets and a public leaderboard.

Submitted 17 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

arXiv:2403.14036 [pdf, other]

Fused LASSO as Non-Crossing Quantile Regression

Authors: Tibor Szendrei, Arnab Bhattacharjee, Mark E. Schaffer

Abstract: Quantile crossing has been an ever-present thorn in the side of quantile regression. This has spurred research into obtaining densities and coefficients that obey the quantile monotonicity property. While important contributions, these papers do not provide insight into how exactly these constraints influence the estimated coefficients. This paper extends non-crossing constraints and shows that by varying a single hyperparameter ($α$) one can obtain commonly used quantile estimators. Namely, we obtain the quantile regression estimator of Koenker and Bassett (1978) when $α=0$, the non crossing quantile regression estimator of Bondell et al. (2010) when $α=1$, and the composite quantile regression estimator of Koenker (1984) and Zou and Yuan (2008) when $α\rightarrow\infty$. As such, we show that non-crossing constraints are simply a special type of fused-shrinkage.

Submitted 20 March, 2024; originally announced March 2024.

Comments: 37 pages, 16 figures, 4 tables, 7348 words

arXiv:2401.01645 [pdf, other]

Model Averaging and Double Machine Learning

Authors: Achim Ahrens, Christian B. Hansen, Mark E. Schaffer, Thomas Wiemann

Abstract: This paper discusses pairing double/debiased machine learning (DDML) with stacking, a model averaging method for combining multiple candidate learners, to estimate structural parameters. We introduce two new stacking approaches for DDML: short-stacking exploits the cross-fitting step of DDML to substantially reduce the computational burden and pooled stacking enforces common stacking weights over cross-fitting folds. Using calibrated simulation studies and two applications estimating gender gaps in citations and wages, we show that DDML with stacking is more robust to partially unknown functional forms than common alternative approaches based on single pre-selected learners. We provide Stata and R software implementing our proposals.

Submitted 3 January, 2024; originally announced January 2024.

arXiv:2308.04187 [pdf, ps, other]

doi 10.1007/978-3-031-44070-0_13

Adding Why to What? Analyses of an Everyday Explanation

Authors: Lutz Terfloth, Michael Schaffer, Heike M. Buhl, Carsten Schulte

Abstract: In XAI it is important to consider that, in contrast to explanations for professional audiences, one cannot assume common expertise when explaining for laypeople. But such explanations between humans vary greatly, making it difficult to research commonalities across explanations. We used the dual nature theory, a techno-philosophical approach, to cope with these challenges. According to it, one can explain, for example, an XAI's decision by addressing its dual nature: by focusing on the Architecture (e.g., the logic of its algorithms) or the Relevance (e.g., the severity of a decision, the implications of a recommendation). We investigated 20 game explanations using the theory as an analytical framework. We elaborate how we used the theory to quickly structure and compare explanations of technological artifacts. We supplemented results from analyzing the explanation contents with results from a video recall to explore how explainers justified their explanation. We found that explainers were focusing on the physical aspects of the game first (Architecture) and only later on aspects of the Relevance. Reasoning in the video recalls indicated that EX regarded the focus on the Architecture as important for structuring the explanation initially by explaining the basic components before focusing on more complex, intangible aspects. Shifting between addressing the two sides was justified by explanation goals, emerging misunderstandings, and the knowledge needs of the explainee. We discovered several commonalities that inspire future research questions which, if further generalizable, provide first ideas for the construction of synthetic explanations.

Submitted 8 August, 2023; originally announced August 2023.

Comments: Paper accepted and presented at XAI World Conference 2023, Lisboa

arXiv:2301.09397 [pdf, other]

ddml: Double/debiased machine learning in Stata

Authors: Achim Ahrens, Christian B. Hansen, Mark E. Schaffer, Thomas Wiemann

Abstract: We introduce the package ddml for Double/Debiased Machine Learning (DDML) in Stata. Estimators of causal parameters for five different econometric models are supported, allowing for flexible estimation of causal effects of endogenous variables in settings with unknown functional forms and/or many exogenous variables. ddml is compatible with many existing supervised machine learning programs in Stata. We recommend using DDML in combination with stacking estimation which combines multiple machine learners into a final predictor. We provide Monte Carlo evidence to support our recommendation.

Submitted 6 January, 2024; v1 submitted 23 January, 2023; originally announced January 2023.

Comments: Tutorials and installations can be found at https://statalasso.github.io/

arXiv:2208.10896 [pdf, other]

pystacked: Stacking generalization and machine learning in Stata

Authors: Achim Ahrens, Christian B. Hansen, Mark E. Schaffer

Abstract: pystacked implements stacked generalization (Wolpert, 1992) for regression and binary classification via Python's scikit-learn. Stacking combines multiple supervised machine learners -- the "base" or "level-0" learners -- into a single learner. The currently supported base learners include regularized regression, random forest, gradient boosted trees, support vector machines, and feed-forward neural nets (multi-layer perceptron). pystacked can also be used as a `regular' machine learning program to fit a single base learner and, thus, provides an easy-to-use API for scikit-learn's machine learning algorithms.

Submitted 6 March, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

Comments: The pystacked package is available here: https://github.com/aahrens1/pystacked

arXiv:1901.05397 [pdf, other]

lassopack: Model selection and prediction with regularized regression in Stata

Authors: Achim Ahrens, Christian B. Hansen, Mark E. Schaffer

Abstract: This article introduces lassopack, a suite of programs for regularized regression in Stata. lassopack implements lasso, square-root lasso, elastic net, ridge regression, adaptive lasso and post-estimation OLS. The methods are suitable for the high-dimensional setting where the number of predictors $p$ may be large and possibly greater than the number of observations, $n$. We offer three different approaches for selecting the penalization (`tuning') parameters: information criteria (implemented in lasso2), $K$-fold cross-validation and $h$-step ahead rolling cross-validation for cross-section, panel and time-series data (cvlasso), and theory-driven (`rigorous') penalization for the lasso and square-root lasso for cross-section and panel data (rlasso). We discuss the theoretical framework and practical considerations for each approach. We also present Monte Carlo results to compare the performance of the penalization approaches.

Submitted 16 January, 2019; originally announced January 2019.

Comments: 52 pages, 6 figures, 6 tables; submitted to Stata Journal; for more information see https://statalasso.github.io/

arXiv:1609.08931 [pdf, ps, other]

Optical Measurement of In-plane Elastic Waves in Mechanical Metamaterials Through Digital Image Correlation

Authors: Marshall Schaeffer, Giuseppe Trainiti, Massimo Ruzzene

Abstract: We report on a Digital Image Correlation-based technique for the detection of in-plane elastic waves propagating in structural lattices. The experimental characterization of wave motion in lattice structures is currently of great interest due to its relevance to the design of novel mechanical metamaterials with unique/unusual properties such as strongly directional behavior, negative refractive indexes and topologically protected wave motion. Assessment of these functionalities often requires the detection of highly spatially resolved in-plane wavefields, which for reticulated or porous structural assemblies is an open challenge. A Digital Image Correlation approach is implemented that tracks small displacements of the lattice nodes by centering image subsets about the lattice intersections. A high speed camera records the motion of the points by properly interleaving subsequent frames thus artificially enhancing the available sampling rate. This, along with an imaging stitching procedure, enables the capturing of a field of view that is sufficiently large for subsequent processing. The transient response is recorded in the form of the full wavefields, which are processed to unveil features of wave motion in a hexagonal lattice. Time snapshots and frequency contours in the spatial Fourier domain are compared with numerical predictions to illustrate the accuracy of the recorded wavefields and demonstrate the suitability of this technique for the experimental characterization of wave properties.

Submitted 20 September, 2016; originally announced September 2016.

arXiv:1603.00958 [pdf]

doi 10.4155/bio.13.315

Tus-Ter-lock immuno-PCR assays for the sensitive detection of tropomyosin-specific IgE antibodies

Authors: Elecia B Johnston, Sandip D Kamath, Andreas L Lopata, Patrick M Schaeffer

Abstract: Background: The increasing prevalence of food allergies requires development of specific and sensitive tests capable of identifying the allergen responsible for the disease. The development of serologic tests that can detect specific IgE antibodies to allergenic proteins would therefore be highly received. Results: Here we present two new quantitative immuno-PCR assays for the sensitive detection of antibodies specific to the shrimp allergen tropomyosin. Both assays are based on the self-assembling Tus-Ter-lock protein-DNA conjugation system. Significantly elevated levels of tropomyosin-specific IgE were detected in sera from patients allergic to shrimp. Conclusions: This is the first time an allergenic protein has been fused with Tus to enable specific IgE antibody detection in human sera by quantitative immuno-PCR.

Submitted 2 March, 2016; originally announced March 2016.

Comments: Author's final version, 5 figures

Journal ref: Bioanalysis, 2014, Vol. 6, No. 4, Pages 465-476

arXiv:1511.07507 [pdf, other]

doi 10.1063/1.4942357

Helical edge states and topological phase transitions in phononic systems using bi-layered lattices

Authors: Raj Kumar Pal, Marshall Schaeffer, Massimo Ruzzene

Abstract: We propose a framework to realize helical edge states in phononic systems using two identical lattices with interlayer couplings between them. A methodology is presented to systematically transform a quantum mechanical lattice which exhibits edge states to a phononic lattice, thereby developing a family of lattices with edge states. Parameter spaces with topological phase boundaries in the vicinity of the transformed system are illustrated to demonstrate the robustness to mechanical imperfections. A potential realization in terms of fundamental mechanical building blocks is presented for the hexagonal and Lieb lattices. The lattices are composed of passive components and the building blocks are a set of disks and linear springs. Furthermore, by varying the spring stiffness, topological phase transitions are observed, illustrating the potential for tunability of our lattices.

Submitted 7 February, 2016; v1 submitted 23 November, 2015; originally announced November 2015.

Comments: 10 pages, 10 figures

arXiv:1406.4565 [pdf]

doi 10.1073/pnas.1412638112

High pressure superconducting phase diagram of 6Li: anomalous isotope effects in dense lithium

Authors: Anne Marie J. Schaeffer, Scott R. Temple, Jasmine K. Bishop, Shanti Deemyad

Abstract: We report the superconducting transition temperature of 6Li between 16-26 GPa, the lightest system to exhibit superconductivity to date. The superconducting phase diagram of 6Li is compared to that of 7Li through simultaneous measurement in a diamond anvil cell (DAC) 1, 2. Below 21.5 GPa, Li exhibits a direct, but unusually large isotope effect, while between 21.5-26 GPa, lithium shows an inverse superconducting isotope effect. The unusual dependence of the superconducting phase diagram of lithium on its atomic mass provides evidence that quantum solid effects dominate the low temperature properties of dense lithium leading to anomalous differences in the structures and/or in electronic properties of the isotopes through zero point effects.

Submitted 13 August, 2014; v1 submitted 17 June, 2014; originally announced June 2014.

Comments: 12 pages, 3 figures

arXiv:1206.6878 [pdf]

Efficient Selection of Disambiguating Actions for Stereo Vision

Authors: Monika Schaeffer, Ron Parr

Abstract: In many domains that involve the use of sensors, such as robotics or sensor networks, there are opportunities to use some form of active sensing to disambiguate data from noisy or unreliable sensors. These disambiguating actions typically take time and expend energy. One way to choose the next disambiguating action is to select the action with the greatest expected entropy reduction, or information gain. In this work, we consider active sensing in aid of stereo vision for robotics. Stereo vision is a powerful sensing technique for mobile robots, but it can fail in scenes that lack strong texture. In such cases, a structured light source, such as a vertical laser line, can be used for disambiguation. By treating the stereo matching problem as a specially structured HMM-like graphical model, we demonstrate that for a scan line with n columns and maximum stereo disparity d, the entropy minimizing aim point for the laser can be selected in O(nd) time - cost no greater than the stereo algorithm itself. In contrast, a typical HMM formulation would suggest at least O(nd^2) time for the entropy calculation alone.

Submitted 27 June, 2012; originally announced June 2012.

Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

Report number: UAI-P-2006-PG-418-427

arXiv:1001.1802 [pdf, ps, other]

Fusion Discrete Logarithm Problems

Authors: Martin Schaffer, Stefan Rass

Abstract: The Discrete Logarithm Problem is well-known among cryptographers, for its computational hardness that grants security to some of the most commonly used cryptosystems these days. Still, many of these are limited to a small number of candidate algebraic structures which permit implementing the algorithms. In order to extend the applicability of discrete-logarithm-based cryptosystems to a much richer class of algebraic structures, we present a generalized form of exponential function. Our extension relaxes some assumptions on the exponent, which is no longer required to be an integer. Using an axiomatic characterization of the exponential function, we show how to construct mappings that obey the same rules as exponentials, but can raise vectors to the power of other vectors in an algebraically sound manner. At the same time, computational hardness is not affected (in fact, the problem could possibly be strengthened). Setting up standard cryptosystems in terms of our generalized exponential function is simple and requires no change to the existing security proofs. This opens the field for building much more general schemes than the ones known so far.

Submitted 19 February, 2010; v1 submitted 12 January, 2010; originally announced January 2010.

Comments: 15 pages, 1 figure

arXiv:astro-ph/0508216 [pdf, ps, other]

doi 10.1086/432502

Galaxy Morphologies in the Hubble Ultra Deep Field: Dominance of Linear Structures at the Detection Limit

Authors: Debra Meloy Elmegreen, Bruce G. Elmegreen, Douglas S. Rubin, Meredith A. Schaffer

Abstract: Galaxies in the Hubble Ultra Deep Field (UDF) larger than 10 pixels (0.3 arcsec) have been classified according to morphology and their photometric properties are presented. There are 269 spirals, 100 ellipticals, 114 chains, 126 double-clump, 97 tadpole, and 178 clump-cluster galaxies. We also catalogued 30 B-band and 13 V-band drop-outs and calculated their star formation rates. Chains, doubles, and tadpoles dominate the other types at faint magnitudes. The fraction of obvious bars among spirals is ~10 percent, a factor of 2-3 lower than in other deep surveys. The distribution function of axial ratios for elliptical galaxies is similar to that seen locally, suggesting that ellipticals relaxed quickly to a standardized shape. The distribution of axial ratios for spiral galaxies is significantly different than locally, having a clear peak at ~0.55 instead of a nearly flat distribution. The fall-off at small axial ratio occurs at a higher value than locally, indicating thicker disks by a factor of ~2. The fall-off at high axial ratio could be from intrinsic triaxial shapes or selection effects. Inclined disks should be more highly sampled than face-on disks near the surface brightness limit of a survey. Simple models and data distributions demonstrate these effects. The decreased numbers of obvious spiral galaxies at high redshifts could be partly the result of surface brightness selection.

Submitted 9 August, 2005; originally announced August 2005.

Comments: 29 pages, 12 figures, ApJ in press, 20 Sept 2005, Vol 631

Journal ref: Astrophys.J.631:85-100,2005

Showing 1–15 of 15 results for author: Schaffer, M