-
The RSNA Abdominal Traumatic Injury CT (RATIC) Dataset
Authors:
Jeffrey D. Rudie,
Hui-Ming Lin,
Robyn L. Ball,
Sabeena Jalal,
Luciano M. Prevedello,
Savvas Nicolaou,
Brett S. Marinelli,
Adam E. Flanders,
Kirti Magudia,
George Shih,
Melissa A. Davis,
John Mongan,
Peter D. Chang,
Ferco H. Berger,
Sebastiaan Hermans,
Meng Law,
Tyler Richards,
Jan-Peter Grunz,
Andreas Steven Kunz,
Shobhit Mathur,
Sandro Galea-Soler,
Andrew D. Chung,
Saif Afat,
Chin-Chi Kuo,
Layal Aweidah
, et al. (15 additional authors not shown)
Abstract:
The RSNA Abdominal Traumatic Injury CT (RATIC) dataset is the largest publicly available collection of adult abdominal CT studies annotated for traumatic injuries. This dataset includes 4,274 studies from 23 institutions across 14 countries. The dataset is freely available for non-commercial use via Kaggle at https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection. Created for the…
▽ More
The RSNA Abdominal Traumatic Injury CT (RATIC) dataset is the largest publicly available collection of adult abdominal CT studies annotated for traumatic injuries. This dataset includes 4,274 studies from 23 institutions across 14 countries. The dataset is freely available for non-commercial use via Kaggle at https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection. Created for the RSNA 2023 Abdominal Trauma Detection competition, the dataset encourages the development of advanced machine learning models for detecting abdominal injuries on CT scans. The dataset encompasses detection and classification of traumatic injuries across multiple organs, including the liver, spleen, kidneys, bowel, and mesentery. Annotations were created by expert radiologists from the American Society of Emergency Radiology (ASER) and Society of Abdominal Radiology (SAR). The dataset is annotated at multiple levels, including the presence of injuries in three solid organs with injury grading, image-level annotations for active extravasations and bowel injury, and voxelwise segmentations of each of the potentially injured organs. With the release of this dataset, we hope to facilitate research and development in machine learning and abdominal trauma that can lead to improved patient care and outcomes.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Haro 5-2: A New Pre-Main Sequence Quadruple Stellar System
Authors:
Bo Reipurth,
C. Briceno,
T. R. Geballe,
C. Baranec,
S. Mikkola,
A. M. Cody,
M. S. Connelley,
C. Flores,
B. A. Skiff,
J. D. Armstrong,
N. M. Law,
R. Riddle
Abstract:
We have discovered that the Halpha emission line star Haro 5-2, located in the 3-6 Myr old Ori OB1b association, is a young quadruple system. The system has a 2+2 configuration with an outer separation of 2.6 arcseconds and with resolved subarcsecond inner binary components. The brightest component, Aa, dominates the A-binary, it is a weakline T Tauri star with spectral type M2.5pm1. The two stars…
▽ More
We have discovered that the Halpha emission line star Haro 5-2, located in the 3-6 Myr old Ori OB1b association, is a young quadruple system. The system has a 2+2 configuration with an outer separation of 2.6 arcseconds and with resolved subarcsecond inner binary components. The brightest component, Aa, dominates the A-binary, it is a weakline T Tauri star with spectral type M2.5pm1. The two stars of the B component are equally bright at J, but the Bb star is much redder. Optical spectroscopy of the combined B pair indicates a rich emission line spectrum with a M3pm1 spectral type. The spectrum is highly variable and switches back and forth between a classical and a weakline T Tauri star. In the near-infrared, the spectrum shows Paschen beta and Brackett gamma in emission, indicative of active accretion. A significant mid-infrared excess reveals the presence of circumstellar or circumbinary material in the system. Most multiple systems are likely formed during the protostellar phase, involving flybys of neighboring stars followed by an in-spiraling phase driven by accretion from circumbinary material and leading to compact sub-systems. However, Haro 5-2 stands out among young 2+2 quadruples as the two inner binaries are unusually wide relative to the separation of the A and B pair, allowing future studies of the individual components. Assuming the components are coeval, the system could potentially allow stringent tests of PMS evolutionary models.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Perivascular space Identification Nnunet for Generalised Usage (PINGU)
Authors:
Benjamin Sinclair,
Lucy Vivash,
Jasmine Moses,
Miranda Lynch,
William Pham,
Karina Dorfman,
Cassandra Marotta,
Shaun Koh,
Jacob Bunyamin,
Ella Rowsthorn,
Alex Jarema,
Himashi Peiris,
Zhaolin Chen,
Sandy R Shultz,
David K Wright,
Dexiao Kong,
Sharon L. Naismith,
Terence J. OBrien,
Meng Law
Abstract:
Perivascular spaces(PVSs) form a central component of the brainś waste clearance system, the glymphatic system. These structures are visible on MRI images, and their morphology is associated with aging and neurological disease. Manual quantification of PVS is time consuming and subjective. Numerous deep learning methods for PVS segmentation have been developed, however the majority have been devel…
▽ More
Perivascular spaces(PVSs) form a central component of the brainś waste clearance system, the glymphatic system. These structures are visible on MRI images, and their morphology is associated with aging and neurological disease. Manual quantification of PVS is time consuming and subjective. Numerous deep learning methods for PVS segmentation have been developed, however the majority have been developed and evaluated on homogenous datasets and high resolution scans, perhaps limiting their applicability for the wide range of image qualities acquired in clinic and research. In this work we train a nnUNet, a top-performing biomedical image segmentation algorithm, on a heterogenous training sample of manually segmented MRI images of a range of different qualities and resolutions from 6 different datasets. These are compared to publicly available deep learning methods for 3D segmentation of PVS. The resulting model, PINGU (Perivascular space Identification Nnunet for Generalised Usage), achieved voxel and cluster level dice scores of 0.50(SD=0.15), 0.63(0.17) in the white matter(WM), and 0.54(0.11), 0.66(0.17) in the basal ganglia(BG). Performance on data from unseen sites was substantially lower for both PINGU(0.20-0.38(WM, voxel), 0.29-0.58(WM, cluster), 0.22-0.36(BG, voxel), 0.46-0.60(BG, cluster)) and the publicly available algorithms(0.18-0.30(WM, voxel), 0.29-0.38(WM cluster), 0.10-0.20(BG, voxel), 0.15-0.37(BG, cluster)), but PINGU strongly outperformed the publicly available algorithms, particularly in the BG. Finally, training PINGU on manual segmentations from a single site with homogenous scan properties gave marginally lower performances on internal cross-validation, but in some cases gave higher performance on external validation. PINGU stands out as broad-use PVS segmentation tool, with particular strength in the BG, an area of PVS related to vascular disease and pathology.
△ Less
Submitted 17 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Concavity for elliptic and parabolic equations in complex projective space
Authors:
Shrey Aryan,
Michael B. Law
Abstract:
We establish a concavity principle for solutions to elliptic and parabolic equations on complex projective space, generalizing the results of Langford and Scheuer. To our knowledge, this is the first example of a general concavity principle outside the constant sectional curvature regime, and in particular, our result partially answers a question raised by Korevaar in 1985 regarding the concavity…
▽ More
We establish a concavity principle for solutions to elliptic and parabolic equations on complex projective space, generalizing the results of Langford and Scheuer. To our knowledge, this is the first example of a general concavity principle outside the constant sectional curvature regime, and in particular, our result partially answers a question raised by Korevaar in 1985 regarding the concavity of solutions to elliptic equations on manifolds with non-constant sectional curvature.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
The ArgusSpec Prototype: Autonomous Spectroscopic Follow-up of Flares Detected by Large Array Telescopes
Authors:
Nathan W. Galliher,
Thomas Procter,
Nicholas M. Law,
Hank Corbett,
Ward S. Howard,
Alan Vasquez Soto,
Ramses Gonzalez,
Lawrence Machia,
Jonathan Carney,
William J. Marshall
Abstract:
ArgusSpec is a prototype autonomous spectroscopic follow-up instrument designed to characterize flares detected by the Argus Pathfinder telescope array by taking short exposure (30 s) broadband spectra (370 - 750 nm) at low resolutions (R~150 at 500 nm). The instrument is built from consumer off-the-shelf astronomical equipment, assembled inside a ship** container, and deployed alongside the Arg…
▽ More
ArgusSpec is a prototype autonomous spectroscopic follow-up instrument designed to characterize flares detected by the Argus Pathfinder telescope array by taking short exposure (30 s) broadband spectra (370 - 750 nm) at low resolutions (R~150 at 500 nm). The instrument is built from consumer off-the-shelf astronomical equipment, assembled inside a ship** container, and deployed alongside the Argus Pathfinder at a dark sky observing site in Western North Carolina. The \$35k prototype ArgusSpec was designed, built, and deployed in under a year, largely from existing parts, and has been operating on-sky since March 2023. With current hardware and software, the system is capable of receiving an observation, slewing, performing autonomous slit acquisition, and beginning data acquisition within an average of 32 s. With Argus Pathfinder's 1-second-cadence survey reporting alerts of rising sources within 2 s of onset, ArgusSpec can reach new targets well within a minute of the start of the event. As built, ArgusSpec can observe targets down to a 20$σ$ limiting magnitude of $m_V$~13 at 30 s cadence with an optical resolution of R~150 (at 500 nm). With automated rapid acquisition demonstrated, later hardware upgrades will significantly improve the limiting magnitude, and potentially enable deep spectroscopy by the coaddition of data from an array of ArgusSpec systems. ArgusSpec's primary science driver is the characterization of the blackbody evolution of flares from nearby M-dwarfs. Large flares emitted by these stars could have significant impacts on the potential habitability of any orbiting exoplanets, but our current understanding of these events is in large part built on observations from a handful of active stars. ArgusSpec will characterize large numbers of flares, building a spectroscopic library of the most extreme events from a wide variety of stellar masses and ages.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts
Authors:
Anastasis Kratsios,
Haitz Sáez de Ocáriz Borde,
Takashi Furuya,
Marc T. Law
Abstract:
Mixture-of-Experts (MoEs) can scale up beyond traditional deep learning models by employing a routing strategy in which each input is processed by a single "expert" deep learning model. This strategy allows us to scale up the number of parameters defining the MoE while maintaining sparse activation, i.e., MoEs only load a small number of their total parameters into GPU VRAM for the forward pass de…
▽ More
Mixture-of-Experts (MoEs) can scale up beyond traditional deep learning models by employing a routing strategy in which each input is processed by a single "expert" deep learning model. This strategy allows us to scale up the number of parameters defining the MoE while maintaining sparse activation, i.e., MoEs only load a small number of their total parameters into GPU VRAM for the forward pass depending on the input. In this paper, we provide an approximation and learning-theoretic analysis of mixtures of expert MLPs with (P)ReLU activation functions. We first prove that for every error level $\varepsilon>0$ and every Lipschitz function $f:[0,1]^n\to \mathbb{R}$, one can construct a MoMLP model (a Mixture-of-Experts comprising of (P)ReLU MLPs) which uniformly approximates $f$ to $\varepsilon$ accuracy over $[0,1]^n$, while only requiring networks of $\mathcal{O}(\varepsilon^{-1})$ parameters to be loaded in memory. Additionally, we show that MoMLPs can generalize since the entire MoMLP model has a (finite) VC dimension of $\tilde{O}(L\max\{nL,JW\})$, if there are $L$ experts and each expert has a depth and width of $J$ and $W$, respectively.
△ Less
Submitted 25 May, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
The Role of Foundation Models in Neuro-Symbolic Learning and Reasoning
Authors:
Daniel Cunnington,
Mark Law,
Jorge Lobo,
Alessandra Russo
Abstract:
Neuro-Symbolic AI (NeSy) holds promise to ensure the safe deployment of AI systems, as interpretable symbolic techniques provide formal behaviour guarantees. The challenge is how to effectively integrate neural and symbolic computation, to enable learning and reasoning from raw data. Existing pipelines that train the neural and symbolic components sequentially require extensive labelling, whereas…
▽ More
Neuro-Symbolic AI (NeSy) holds promise to ensure the safe deployment of AI systems, as interpretable symbolic techniques provide formal behaviour guarantees. The challenge is how to effectively integrate neural and symbolic computation, to enable learning and reasoning from raw data. Existing pipelines that train the neural and symbolic components sequentially require extensive labelling, whereas end-to-end approaches are limited in terms of scalability, due to the combinatorial explosion in the symbol grounding problem. In this paper, we leverage the implicit knowledge within foundation models to enhance the performance in NeSy tasks, whilst reducing the amount of data labelling and manual engineering. We introduce a new architecture, called NeSyGPT, which fine-tunes a vision-language foundation model to extract symbolic features from raw data, before learning a highly expressive answer set program to solve a downstream task. Our comprehensive evaluation demonstrates that NeSyGPT has superior accuracy over various baselines, and can scale to complex NeSy tasks. Finally, we highlight the effective use of a large language model to generate the programmatic interface between the neural and symbolic components, significantly reducing the amount of manual engineering required.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Formation of Mn-rich interfacial phases in Co2FexMn1-xSi thin films
Authors:
Ka Ming Law,
Arashdeep S. Thind,
Mihir Pendharkar,
Sahil J. Patel,
Joshua J. Phillips,
Chris J. Palmstrom,
Jaume Gazquez,
Albina Borisevich,
Rohan Mishra,
Adam J. Hauser
Abstract:
We report the formation of Mn-rich regions at the interface of Co2FexMn1-xSi thin films grown on GaAs substrates by molecular beam epitaxy (MBE). Scanning transmission electron microscopy (STEM) with electron energy loss (EEL) spectrum imaging reveals that each interfacial region: (1) is 1-2 nm wide, (2) occurs irrespective of the Fe/Mn composition ratio and in both Co-rich and Co-poor films, and…
▽ More
We report the formation of Mn-rich regions at the interface of Co2FexMn1-xSi thin films grown on GaAs substrates by molecular beam epitaxy (MBE). Scanning transmission electron microscopy (STEM) with electron energy loss (EEL) spectrum imaging reveals that each interfacial region: (1) is 1-2 nm wide, (2) occurs irrespective of the Fe/Mn composition ratio and in both Co-rich and Co-poor films, and (3) displaces both Co and Fe indiscriminately. We also observe a Mn-depleted region in each film directly above each Mn-rich interfacial layer, roughly 3 nm in width in the x = 0 and x = 0.3 films, and 1 nm in the x = 0.7 (less Mn) film. We posit that growth energetics favor Mn diffusion to the interface even when there is no significant Ga interdiffusion into the epitaxial film. Element-specific X-ray magnetic circular dichroism (XMCD) measurements show larger Co, Fe, and Mn orbital to spin magnetic moment ratios compared to bulk values across the Co2FexMn1-xSi compositional range. The values lie between reported values for pure bulk and nanostructured Co, Fe, and Mn materials, corroborating the non-uniform, layered nature of the material on the nanoscale. Finally, SQUID magnetometry demonstrates that the films deviate from the Slater-Pauling rule for uniform films of both the expected and the measured composition. The results inform a need for care and increased scrutiny when forming Mn-based magnetic thin films on III-V semiconductors like GaAs, particularly when films are on the order of 5 nm or when interface composition is critical to spin transport or other device applications.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Positive mass and Dirac operators on weighted manifolds and smooth metric measure spaces
Authors:
Michael B. Law,
Isaac M. Lopez,
Daniel Santiago
Abstract:
We establish a weighted positive mass theorem which unifies and generalizes results of Baldauf--Ozuch and Chu--Zhu. Our result is in fact equivalent to the usual positive mass theorem, and can be regarded as a positive mass theorem for smooth metric measure spaces. We also study Dirac operators on certain warped product manifolds associated to smooth metric measure spaces. Applications of this inc…
▽ More
We establish a weighted positive mass theorem which unifies and generalizes results of Baldauf--Ozuch and Chu--Zhu. Our result is in fact equivalent to the usual positive mass theorem, and can be regarded as a positive mass theorem for smooth metric measure spaces. We also study Dirac operators on certain warped product manifolds associated to smooth metric measure spaces. Applications of this include, among others, an alternative proof for a special case of our positive mass theorem, eigenvalue bounds for the Dirac operator on closed spin manifolds, and a new way to understand the weighted Dirac operator using warped products.
△ Less
Submitted 29 February, 2024; v1 submitted 24 December, 2023;
originally announced December 2023.
-
Distributional Robustness and Transfer Learning Through Empirical Bayes
Authors:
Michael Law,
Peter Bühlmann,
Ya'acov Ritov
Abstract:
We consider the problem of statistical inference on parameters of a target population when auxiliary observations are available from related populations. We propose a flexible empirical Bayes approach that can be applied on top of any asymptotically linear estimator to incorporate information from related populations when constructing confidence regions. The proposed methodology is valid regardles…
▽ More
We consider the problem of statistical inference on parameters of a target population when auxiliary observations are available from related populations. We propose a flexible empirical Bayes approach that can be applied on top of any asymptotically linear estimator to incorporate information from related populations when constructing confidence regions. The proposed methodology is valid regardless of whether there are direct observations on the population of interest. We demonstrate the performance of the empirical Bayes confidence regions on synthetic data as well as on the Trends in International Mathematics and Sciences Study when using the debiased Lasso as the basic algorithm in high-dimensional regression.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Graph Metanetworks for Processing Diverse Neural Architectures
Authors:
Derek Lim,
Haggai Maron,
Marc T. Law,
Jonathan Lorraine,
James Lucas
Abstract:
Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs with…
▽ More
Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs without normalization layers, and generalizing such architectures to other types of networks can be challenging. In this work, we overcome these challenges by building new metanetworks - neural networks that take weights from other neural networks as input. Put simply, we carefully build graphs representing the input neural networks and process the graphs using graph neural networks. Our approach, Graph Metanetworks (GMNs), generalizes to neural architectures where competing methods struggle, such as multi-head attention layers, normalization layers, convolutional layers, ResNet blocks, and group-equivariant linear layers. We prove that GMNs are expressive and equivariant to parameter permutation symmetries that leave the input neural network functions unchanged. We validate the effectiveness of our method on several metanetwork tasks over diverse neural network architectures.
△ Less
Submitted 29 December, 2023; v1 submitted 7 December, 2023;
originally announced December 2023.
-
A Unifying Framework for Learning Argumentation Semantics
Authors:
Zlatina Mileva,
Antonis Bikakis,
Fabio Aurelio D'Asaro,
Mark Law,
Alessandra Russo
Abstract:
Argumentation is a very active research field of Artificial Intelligence concerned with the representation and evaluation of arguments used in dialogues between humans and/or artificial agents. Acceptability semantics of formal argumentation systems define the criteria for the acceptance or rejection of arguments. Several software systems, known as argumentation solvers, have been developed to com…
▽ More
Argumentation is a very active research field of Artificial Intelligence concerned with the representation and evaluation of arguments used in dialogues between humans and/or artificial agents. Acceptability semantics of formal argumentation systems define the criteria for the acceptance or rejection of arguments. Several software systems, known as argumentation solvers, have been developed to compute the accepted/rejected arguments using such criteria. These include systems that learn to identify the accepted arguments using non-interpretable methods. In this paper we present a novel framework, which uses an Inductive Logic Programming approach to learn the acceptability semantics for several abstract and structured argumentation frameworks in an interpretable way. Through an empirical evaluation we show that our framework outperforms existing argumentation solvers, thus opening up new future research directions in the area of formal argumentation and human-machine dialogues.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
GAPS contributions to the 38th International Cosmic Ray Conference (Nagoya 2023)
Authors:
T. Aramaki,
M. Boezio,
S. E. Boggs,
V. Bonvicini,
G. Bridges,
D. Campana,
W. W. Craig,
P. von Doetinchem,
E. Everson,
L. Fabris,
S. Feldman,
H. Fuke,
F. Gahbauer,
C. Gerrity,
L. Ghislotti,
C. J. Hailey,
T. Hayashi,
A. Kawachi,
M. Kozai,
P. Lazzaroni,
M. Law,
A. Lenni,
A. Lowell,
M. Manghisoni,
N. Marcelli
, et al. (33 additional authors not shown)
Abstract:
Compilation of papers presented by the GAPS Collaboration at the 38th International Cosmic Ray Conference (ICRC), held July 26 through August 3, 2023 in Nagoya, Japan.
Compilation of papers presented by the GAPS Collaboration at the 38th International Cosmic Ray Conference (ICRC), held July 26 through August 3, 2023 in Nagoya, Japan.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Invariant Probabilistic Prediction
Authors:
Alexander Henzi,
Xinwei Shen,
Michael Law,
Peter Bühlmann
Abstract:
In recent years, there has been a growing interest in statistical methods that exhibit robust performance under distribution changes between training and test data. While most of the related research focuses on point predictions with the squared error loss, this article turns the focus towards probabilistic predictions, which aim to comprehensively quantify the uncertainty of an outcome variable g…
▽ More
In recent years, there has been a growing interest in statistical methods that exhibit robust performance under distribution changes between training and test data. While most of the related research focuses on point predictions with the squared error loss, this article turns the focus towards probabilistic predictions, which aim to comprehensively quantify the uncertainty of an outcome variable given covariates. Within a causality-inspired framework, we investigate the invariance and robustness of probabilistic predictions with respect to proper scoring rules. We show that arbitrary distribution shifts do not, in general, admit invariant and robust probabilistic predictions, in contrast to the setting of point prediction. We illustrate how to choose evaluation metrics and restrict the class of distribution shifts to allow for identifiability and invariance in the prototypical Gaussian heteroscedastic linear model. Motivated by these findings, we propose a method to yield invariant probabilistic predictions, called IPP, and study the consistency of the underlying parameters. Finally, we demonstrate the empirical performance of our proposed procedure on simulated as well as on single-cell data.
△ Less
Submitted 16 June, 2024; v1 submitted 18 September, 2023;
originally announced September 2023.
-
Longitudinal Position and Cancer Risk in the United States Revisited
Authors:
** Niu,
Charlotte Brown,
Michael Law,
Justin Colacino,
Ya'acov Ritov
Abstract:
Background: The debate over daylight saving time has surged, with interests in the effects of sunlight exposure on health. \commentnj{Prior studies simulated daylight saving time and standard time conditions by analyzing different locations within time zones and neighboring areas across time zone borders.
Methods: We analyzed cancer incidence rates from various longitudinal positions within time…
▽ More
Background: The debate over daylight saving time has surged, with interests in the effects of sunlight exposure on health. \commentnj{Prior studies simulated daylight saving time and standard time conditions by analyzing different locations within time zones and neighboring areas across time zone borders.
Methods: We analyzed cancer incidence rates from various longitudinal positions within time zones and at time zone borders in the contiguous United States. Using data from State Cancer Profiles (2016-2020), we analyzed total cancer of 19 types and specific rates for eight cancers, adjusted for age and includes all demographics. Log-linear regression is used to replicate a previous study, and spatial regression models are employed to explore discontinuities at borders.
Results: Cancer rate differences lack statistical significance within time zones and near borders for total cancer and most individual cancers. Exceptions included breast, prostate, and liver \& bile duct cancers, which exhibited significant relationships with relative position at the 95\% significance level. Breast and liver and bile duct cancers saw decreases, while prostate cancer incidence increased from west to east within time zones.
Conclusions: Relative position does not have a significant impact on cancer incidence, hence cancer development in general. Isolated exceptions may warrant further investigation as more data becomes available.
Impact: Our findings challenge prior research, revealing numerous inconsistencies. These disparities urge a reconsideration of the potential disparities in human health associated with daylight saving time and standard time. They offer insights contribute to the ongoing discussion surrounding the retention or abandonment of DST.
△ Less
Submitted 28 November, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
A Rank-Based Sequential Test of Independence
Authors:
Alexander Henzi,
Michael Law
Abstract:
We consider the problem of independence testing for two univariate random variables in a sequential setting. By leveraging recent developments on safe, anytime-valid inference, we propose a test with time-uniform type I error control and derive explicit bounds on the finite sample performance of the test. We demonstrate the empirical performance of the procedure in comparison to existing sequentia…
▽ More
We consider the problem of independence testing for two univariate random variables in a sequential setting. By leveraging recent developments on safe, anytime-valid inference, we propose a test with time-uniform type I error control and derive explicit bounds on the finite sample performance of the test. We demonstrate the empirical performance of the procedure in comparison to existing sequential and non-sequential independence tests. Furthermore, since the proposed test is distribution free under the null hypothesis, we empirically simulate the gap due to Ville's inequality, the supermartingale analogue of Markov's inequality, that is commonly applied to control type I error in anytime-valid inference, and apply this to construct a truncated sequential test.
△ Less
Submitted 25 January, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Large-scale detector testing for the GAPS Si(Li) Tracker
Authors:
Mengjiao Xiao,
Achim Stoessl,
Brandon Roach,
Cory Gerrity,
Ian Bouche,
Gabriel Bridges,
Philip von Doetinchem,
Charles J. Hailey,
Derik Kraych,
Anika Katt,
Michael Law,
Alexander Lowell,
Evan Martinez,
Kerstin Perez,
Maggie Reed,
Chelsea Rodriguez,
Nathan Saffold,
Ceaser Stringfield,
Hershel Weiner,
Kelsey Yee
Abstract:
Lithium-drifted silicon [Si(Li)] has been used for decades as an ionizing radiation detector in nuclear, particle, and astrophysical experiments, though such detectors have frequently been limited to small sizes (few cm$^2$) and cryogenic operating temperatures. The 10-cm-diameter Si(Li) detectors developed for the General Antiparticle Spectrometer (GAPS) balloon-borne dark matter experiment are n…
▽ More
Lithium-drifted silicon [Si(Li)] has been used for decades as an ionizing radiation detector in nuclear, particle, and astrophysical experiments, though such detectors have frequently been limited to small sizes (few cm$^2$) and cryogenic operating temperatures. The 10-cm-diameter Si(Li) detectors developed for the General Antiparticle Spectrometer (GAPS) balloon-borne dark matter experiment are novel particularly for their requirements of low cost, large sensitive area (~10 m$^2$ for the full 1440-detector array), high temperatures (near -40$\,^\circ$C), and energy resolution below 4 keV FWHM for 20--100-keV x-rays. Previous works have discussed the manufacturing, passivation, and small-scale testing of prototype GAPS Si(Li) detectors. Here we show for the first time the results from detailed characterization of over 1100 flight detectors, illustrating the consistent intrinsic low-noise performance of a large sample of GAPS detectors. This work demonstrates the feasibility of large-area and low-cost Si(Li) detector arrays for next-generation astrophysics and nuclear physics applications.
△ Less
Submitted 7 September, 2023; v1 submitted 29 April, 2023;
originally announced May 2023.
-
Three Saturn-mass planets transiting F-type stars revealed with TESS and HARPS
Authors:
Angelica Psaridi,
François Bouchy,
Monika Lendl,
Babatunde Akinsanmi,
Keivan G. Stassun,
Barry Smalley,
David J. Armstrong,
Saburo Howard,
Solène Ulmer-Moll,
Nolan Grieves,
Khalid Barkaoui,
Joseph E. Rodriguez,
Edward M. Bryant,
Olga Suárez,
Tristan Guillot,
Phil Evans,
Omar Attia,
Robert A. Wittenmyer,
Samuel W. Yee,
Karen A. Collins,
George Zhou,
Franck Galland,
Léna Parc,
Stéphane Udry,
Pedro Figueira
, et al. (40 additional authors not shown)
Abstract:
While the sample of confirmed exoplanets continues to increase, the population of transiting exoplanets around early-type stars is still limited. These planets allow us to investigate the planet properties and formation pathways over a wide range of stellar masses and study the impact of high irradiation on hot Jupiters orbiting such stars. We report the discovery of TOI-615b, TOI-622b, and TOI-26…
▽ More
While the sample of confirmed exoplanets continues to increase, the population of transiting exoplanets around early-type stars is still limited. These planets allow us to investigate the planet properties and formation pathways over a wide range of stellar masses and study the impact of high irradiation on hot Jupiters orbiting such stars. We report the discovery of TOI-615b, TOI-622b, and TOI-2641b, three Saturn-mass planets transiting main sequence, F-type stars. The planets were identified by the Transiting Exoplanet Survey Satellite (TESS) and confirmed with complementary ground-based and radial velocity observations. TOI-615b is a highly irradiated ($\sim$1277 $F_{\oplus}$) and bloated Saturn-mass planet (1.69$^{+0.05}_{-0.06}$$R_{Jup}$ and 0.43$^{+0.09}_{-0.08}$$M_{Jup}$) in a 4.66 day orbit transiting a 6850 K star. TOI-622b has a radius of 0.82$^{+0.03}_{-0.03}$$R_{Jup}$ and a mass of 0.30$^{+0.07}_{-0.08}$~$M_{Jup}$ in a 6.40 day orbit. Despite its high insolation flux ($\sim$600 $F_{\oplus}$), TOI-622b does not show any evidence of radius inflation. TOI-2641b is a 0.39$^{+0.02}_{-0.04}$$M_{Jup}$ planet in a 4.88 day orbit with a grazing transit (b = 1.04$^{+0.05}_{-0.06 }$) that results in a poorly constrained radius of 1.61$^{+0.46}_{-0.64}$$R_{Jup}$. Additionally, TOI-615b is considered attractive for atmospheric studies via transmission spectroscopy with ground-based spectrographs and $\textit{JWST}$. Future atmospheric and spin-orbit alignment observations are essential since they can provide information on the atmospheric composition, formation and migration of exoplanets across various stellar types.
△ Less
Submitted 11 May, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
The Evryscope Fast Transient Engine: Real-Time Detection for Rapidly Evolving Transients
Authors:
Hank Corbett,
Jonathan Carney,
Ramses Gonzalez,
Octavi Fors,
Nathan Galliher,
Amy Glazier,
Ward S. Howard,
Nicholas M. Law,
Robert Quimby,
Jeffrey K. Ratzloff,
Alan Vasquez Soto
Abstract:
Astrophysical transients with rapid development on sub-hour timescales are intrinsically rare. Due to their short durations, events like stellar superflares, optical flashes from gamma-ray bursts, and shock breakouts from young supernovae are difficult to identify on timescales that enable spectroscopic followup. This paper presents the Evryscope Fast Transient Engine (EFTE), a new data reduction…
▽ More
Astrophysical transients with rapid development on sub-hour timescales are intrinsically rare. Due to their short durations, events like stellar superflares, optical flashes from gamma-ray bursts, and shock breakouts from young supernovae are difficult to identify on timescales that enable spectroscopic followup. This paper presents the Evryscope Fast Transient Engine (EFTE), a new data reduction pipeline designed to provide low-latency transient alerts from the Evryscopes, a North-South pair of ultra-wide-field telescopes with an instantaneous footprint covering 38% of the entire sky, and tools for building long-term light curves from Evryscope data. EFTE leverages the optical stability of the Evryscopes by using a simple direct image subtraction routine suited to continuously monitoring the transient sky at minute cadence. Candidates are produced within the base Evryscope two-minute cadence for 98.5% of images, and internally filtered using VetNet, a convolutional neural network real-bogus classifier. EFTE provides an extensible, robust architecture for transient surveys probing similar timescales, and serves as the software testbed for the real-time analysis pipelines and public data distribution systems for the Argus Array, a next generation all-sky observatory with a data rate 62x higher than Evryscope.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting
Authors:
Viraj Prabhu,
David Acuna,
Andrew Liao,
Rafid Mahmood,
Marc T. Law,
Judy Hoffman,
Sanja Fidler,
James Lucas
Abstract:
Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We st…
▽ More
Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We study this setting of supervised sim2real DA applied to 2D object detection. We propose Domain Translation via Conditional Alignment and Reweighting (CARE) a novel algorithm that systematically exploits target labels to explicitly close the sim2real appearance and content gaps. We present an analytical justification of our algorithm and demonstrate strong gains over competing methods on standard benchmarks.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
TESS Hunt for Young and Maturing Exoplanets (THYME) IX: a 27 Myr extended population of Lower-Centaurus Crux with a transiting two-planet system
Authors:
Mackenna L. Wood,
Andrew W. Mann,
Madyson G. Barber,
Jonathan L. Bush,
Adam L. Kraus,
Benjamin M. Tofflemire,
Andrew Vanderburg,
Elisabeth R. Newton,
Gregory A. Feiden,
George Zhou,
Luke G. Bouma,
Samuel N. Quinn,
David J. Armstrong,
Ares Osborn,
Vardan Adibekyan,
Elisa Delgado Mena,
Sergio G. Sousa,
Jonathan Gagné,
Matthew J. Fields,
Reilly P. Milburn,
Pa Chia Thao,
Stephen P. Schmidt,
Crystal L. Gnilka,
Steve B. Howell,
Nicholas M. Law
, et al. (13 additional authors not shown)
Abstract:
We report the discovery and characterization of a nearby (~ 85 pc), older (27 +/- 3 Myr), distributed stellar population near Lower-Centaurus-Crux (LCC), initially identified by searching for stars co-moving with a candidate transiting planet from TESS (HD 109833; TOI 1097). We determine the association membership using Gaia kinematics, color-magnitude information, and rotation periods of candidat…
▽ More
We report the discovery and characterization of a nearby (~ 85 pc), older (27 +/- 3 Myr), distributed stellar population near Lower-Centaurus-Crux (LCC), initially identified by searching for stars co-moving with a candidate transiting planet from TESS (HD 109833; TOI 1097). We determine the association membership using Gaia kinematics, color-magnitude information, and rotation periods of candidate members. We measure it's age using isochrones, gyrochronology, and Li depletion. While the association is near known populations of LCC, we find that it is older than any previously found LCC sub-group (10-16 Myr), and distinct in both position and velocity. In addition to the candidate planets around HD 109833 the association contains four directly-imaged planetary-mass companions around 3 stars, YSES-1, YSES-2, and HD 95086, all of which were previously assigned membership in the younger LCC. Using the Notch pipeline, we identify a second candidate transiting planet around HD 109833. We use a suite of ground-based follow-up observations to validate the two transit signals as planetary in nature. HD 109833 b and c join the small but growing population of <100 Myr transiting planets from TESS. HD 109833 has a rotation period and Li abundance indicative of a young age (< 100 Myr), but a position and velocity on the outskirts of the new population, lower Li levels than similar members, and a CMD position below model predictions for 27 Myr. So, we cannot reject the possibility that HD 109833 is a young field star coincidentally nearby the population.
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Optimal Patient Allocation in Multi-Arm Clinical Trials
Authors:
Martin Law
Abstract:
A multi-arm multi-stage trial is a multi-arm trial which includes interim analyses - analysing the data at certain specified points, generally discontinuing treatments which are concluded to not work and proceeding with the remainder.
It is possible that the advantages of multi-arm trials over single-arm trials may be enhanced further by considering the allocation ratio, R. For an R:1 allocation…
▽ More
A multi-arm multi-stage trial is a multi-arm trial which includes interim analyses - analysing the data at certain specified points, generally discontinuing treatments which are concluded to not work and proceeding with the remainder.
It is possible that the advantages of multi-arm trials over single-arm trials may be enhanced further by considering the allocation ratio, R. For an R:1 allocation ratio, Rn patients are allocated to the control arm and n patients allocated to each active treatment arm. In this study, the optimal allocation ratio will be defined as the allocation ratio which results in the smallest total sample size satisfying some required power and probability of type I error. This is an intuitive definition in the context of clinical trials, as a smaller trial will in general be more ethical and less expensive than a larger one satisfying the same error rates. The purpose of this paper is to investigate the optimal allocation ratio in the case of multiple active treatment arms.
The setup for a single stage trial with K active treatment arms is described in Section 2, along with a brief exposition of Dunnett's statement regarding the optimal allocation ratio in such circumstances. Equations for type I error and power are derived, and the methodology used to investigate how total sample size may be minimised using allocation ratio is described. A two-stage trial is then considered, using the same methodology. Figures and tables showing how total sample size changes with allocation ratio, for a range of type I error and power values, are given in Section 3. The possible ethical and financial benefits of changing allocation ratio, including a simple example, is also included in Section 3. The results, and what they could mean in practical terms, are discussed in Section 4.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
A Low-mass, Pre-main-sequence Eclipsing Binary in the 40 Myr Columba Association -- Fundamental Stellar Parameters and Modeling the Effect of Star Spots
Authors:
Benjamin M. Tofflemire,
Adam L. Kraus,
Andrew W. Mann,
Elisabeth R. Newton,
Michael A. Gully-Santiago,
Andrew Vanderburg,
William C. Waalkes,
Zachory K. Berta-Thompson,
Kevin I. Collins,
Karen A. Collins,
Louise D. Nielsen,
Francois Bouchy,
Carl Ziegler,
Cesar Briceno,
Nicholas M. Law
Abstract:
Young eclipsing binaries (EBs) are powerful probes of early stellar evolution. Current models are unable to simultaneously reproduce the measured and derived properties that are accessible for EB systems (e.g., mass, radius, temperature, luminosity). In this study we add a benchmark EB to the pre-main-sequence population with our characterization of TOI 450 (TIC 77951245). Using Gaia astrometry to…
▽ More
Young eclipsing binaries (EBs) are powerful probes of early stellar evolution. Current models are unable to simultaneously reproduce the measured and derived properties that are accessible for EB systems (e.g., mass, radius, temperature, luminosity). In this study we add a benchmark EB to the pre-main-sequence population with our characterization of TOI 450 (TIC 77951245). Using Gaia astrometry to identify its comoving, coeval companions, we confirm TOI 450 is a member of the $\sim$40 Myr Columba association. This eccentric ($e=0.2969$), equal-mass ($q=1.000$) system provides only one grazing eclipse. Despite this, our analysis achieves the precision of a double-eclipsing system by leveraging information in our high-resolution spectra to place priors on the surface-brightness and radius ratios. We also introduce a framework to include the effect of star spots on the observed eclipse depths. Multicolor eclipse light curves play a critical role in breaking degeneracies between the effects of star spots and limb-darkening. Including star spots reduces the derived radii by $\sim$2\% from an unspotted model ($>2σ$) and inflates the formal uncertainty in accordance with our lack of knowledge regarding the star spot orientation. We derive masses of 0.1768($\pm$0.0004) and 0.1767($\pm$0.0003) $M_\odot$, and radii of 0.345($\pm$0.006) and 0.346($\pm$0.006) $R_\odot$ for the primary and secondary, respectively. We compare these measurements to multiple stellar evolution isochones, finding good agreement with the association age. The MESA MIST and SPOTS ($f_{\rm s}=0.17$) isochrones perform the best across our comparisons, but detailed agreement depends heavily on the quantities being compared.
△ Less
Submitted 21 December, 2022; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Optimizing Data Collection for Machine Learning
Authors:
Rafid Mahmood,
James Lucas,
Jose M. Alvarez,
Sanja Fidler,
Marc T. Law
Abstract:
Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that…
▽ More
Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that allows designers to specify performance targets, collection costs, a time horizon, and penalties for failing to meet the targets. Additionally, this formulation generalizes to tasks requiring multiple data sources, such as labeled and unlabeled data used in semi-supervised learning. To solve our problem, we develop Learn-Optimize-Collect (LOC), which minimizes expected future collection costs. Finally, we numerically compare our framework to the conventional baseline of estimating data requirements by extrapolating from neural scaling laws. We significantly reduce the risks of failing to meet desired performance targets on several classification, segmentation, and detection tasks, while maintaining low total collection costs.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
Packing the sky: coverage optimization and evaluation for large telescope arrays
Authors:
Nathan W. Galliher,
Nicholas M. Law,
Hank Corbett,
Ramses Gonzalez,
Lawrence Machia,
Alan Vasquez Soto
Abstract:
Recent advancements in low-cost astronomical equipment, including high-quality medium-aperture telescopes and low-noise CMOS detectors, have made the deployment of large optical telescope arrays both financially feasible and scientifically interesting. The Argus Optical Array is one such system, composed of 900 eight-inch telescopes, which is planned to cover the entire night sky in each exposure…
▽ More
Recent advancements in low-cost astronomical equipment, including high-quality medium-aperture telescopes and low-noise CMOS detectors, have made the deployment of large optical telescope arrays both financially feasible and scientifically interesting. The Argus Optical Array is one such system, composed of 900 eight-inch telescopes, which is planned to cover the entire night sky in each exposure and is capable of being the deepest and fastest Northern Hemisphere sky survey. With this new class of telescope comes new challenges: determining optimal individual telescope pointings to achieve required sky coverage and overlaps for large numbers of telescopes, and realizing those pointings using either individual mounts, larger mounting structures containing telescope subarrays, or the full array on a single mount. In this paper, we describe a method for creating a pointing pattern, and an algorithm for rapidly evaluating sky coverage and overlaps given that pattern, and apply it to the Argus Array. Using this pattern, telescopes are placed into a hemispherical arrangement, which can be mounted as a single monolithic array or split into several smaller subarrays. These methods can be applied to other large arrays where sky packing is challenging and evenly spaced array subdivisions are necessary for mounting.
△ Less
Submitted 18 August, 2022;
originally announced August 2022.
-
The sky at one terabit per second: Architecture and implementation of the Argus Array Hierarchical Data Processing System
Authors:
Hank Corbett,
Alan Vasquez Soto,
Lawrence Machia,
Nathan Galliher,
Ramses Gonzalez,
Nicholas M. Law
Abstract:
The Argus Optical Array is a synoptic survey observatory, currently in development, that will have a total collecting area equivalent to a 5-meter monolithic telescope and an all-sky field of view, multiplexed from 900 commercial off-the-shelf telescopes. The Array will observe 7916 deg$^2$ every second during high-speed operations ($m_g\leq16.1$) and every 30 seconds at base cadence (…
▽ More
The Argus Optical Array is a synoptic survey observatory, currently in development, that will have a total collecting area equivalent to a 5-meter monolithic telescope and an all-sky field of view, multiplexed from 900 commercial off-the-shelf telescopes. The Array will observe 7916 deg$^2$ every second during high-speed operations ($m_g\leq16.1$) and every 30 seconds at base cadence ($m_g\leq19.1$), producing 4.3 PB and 145 TB respectively of data per night with its 55-gigapixel mosaic of cameras. The Argus Array Hierarchical Data Processing System (Argus-HDPS) is the instrument control and analysis pipeline for the Argus Array project, able to create fully-reduced data products in real time. We pair sub-arrays of cameras with co-located compute nodes, responsible for distilling the raw 11 Tbps data rate into transient alerts, full-resolution image segments around selected targets at 30-second cadence, and full-resolution coadds of the entire field of view at $15+$-min cadences. Production of long-term light curves and transient discovery in deep coadds out to 5-day cadence ($m_g\leq24.0$) will be scheduled for daytime operations. In this paper, we describe the data reduction strategy for the Argus Optical Array and demonstrate image segmentation, coaddition, and difference image analysis using the GPU-enabled Argus-HDPS pipelines on representative data from the Argus Array Technology Demonstrator.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks
Authors:
Rafid Mahmood,
James Lucas,
David Acuna,
Daiqing Li,
Jonah Philion,
Jose M. Alvarez,
Zhiding Yu,
Sanja Fidler,
Marc T. Law
Abstract:
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with…
▽ More
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with an adequate budget. Prior work on neural scaling laws suggest that the power-law function can fit the validation performance curve and extrapolate it to larger data set sizes. We find that this does not immediately translate to the more difficult downstream task of estimating the required data set size to meet a target performance. In this work, we consider a broad class of computer vision tasks and systematically investigate a family of functions that generalize the power-law function to allow for better estimation of data requirements. Finally, we show that incorporating a tuned correction factor and collecting over multiple rounds significantly improves the performance of the data estimators. Using our guidelines, practitioners can accurately estimate data requirements of machine learning systems to gain savings in both development time and data acquisition costs.
△ Less
Submitted 13 July, 2022; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Hierarchies of Reward Machines
Authors:
Daniel Furelos-Blanco,
Mark Law,
Anders Jonsson,
Krysia Broda,
Alessandra Russo
Abstract:
Reward machines (RMs) are a recent formalism for representing the reward function of a reinforcement learning task through a finite-state machine whose edges encode subgoals of the task using high-level events. The structure of RMs enables the decomposition of a task into simpler and independently solvable subtasks that help tackle long-horizon and/or sparse reward tasks. We propose a formalism fo…
▽ More
Reward machines (RMs) are a recent formalism for representing the reward function of a reinforcement learning task through a finite-state machine whose edges encode subgoals of the task using high-level events. The structure of RMs enables the decomposition of a task into simpler and independently solvable subtasks that help tackle long-horizon and/or sparse reward tasks. We propose a formalism for further abstracting the subtask structure by endowing an RM with the ability to call other RMs, thus composing a hierarchy of RMs (HRM). We exploit HRMs by treating each call to an RM as an independently solvable subtask using the options framework, and describe a curriculum-based method to learn HRMs from traces observed by the agent. Our experiments reveal that exploiting a handcrafted HRM leads to faster convergence than with a flat HRM, and that learning an HRM is feasible in cases where its equivalent flat representation is not.
△ Less
Submitted 4 June, 2023; v1 submitted 31 May, 2022;
originally announced May 2022.
-
Neuro-Symbolic Learning of Answer Set Programs from Raw Data
Authors:
Daniel Cunnington,
Mark Law,
Jorge Lobo,
Alessandra Russo
Abstract:
One of the ultimate goals of Artificial Intelligence is to assist humans in complex decision making. A promising direction for achieving this goal is Neuro-Symbolic AI, which aims to combine the interpretability of symbolic techniques with the ability of deep learning to learn from raw data. However, most current approaches require manually engineered symbolic knowledge, and where end-to-end train…
▽ More
One of the ultimate goals of Artificial Intelligence is to assist humans in complex decision making. A promising direction for achieving this goal is Neuro-Symbolic AI, which aims to combine the interpretability of symbolic techniques with the ability of deep learning to learn from raw data. However, most current approaches require manually engineered symbolic knowledge, and where end-to-end training is considered, such approaches are either restricted to learning definite programs, or are restricted to training binary neural networks. In this paper, we introduce Neuro-Symbolic Inductive Learner (NSIL), an approach that trains a general neural network to extract latent concepts from raw data, whilst learning symbolic knowledge that maps latent concepts to target labels. The novelty of our approach is a method for biasing the learning of symbolic knowledge, based on the in-training performance of both neural and symbolic components. We evaluate NSIL on three problem domains of different complexity, including an NP-complete problem. Our results demonstrate that NSIL learns expressive knowledge, solves computationally complex problems, and achieves state-of-the-art performance in terms of accuracy and data efficiency. Code and technical appendix: https://github.com/DanCunnington/NSIL
△ Less
Submitted 2 February, 2024; v1 submitted 25 May, 2022;
originally announced May 2022.
-
Efficient lifting of symmetry breaking constraints for complex combinatorial problems
Authors:
Alice Tarzariol,
Martin Gebser,
Mark Law,
Konstantin Schekotihin
Abstract:
Many industrial applications require finding solutions to challenging combinatorial problems. Efficient elimination of symmetric solution candidates is one of the key enablers for high-performance solving. However, existing model-based approaches for symmetry breaking are limited to problems for which a set of representative and easily-solvable instances is available, which is often not the case i…
▽ More
Many industrial applications require finding solutions to challenging combinatorial problems. Efficient elimination of symmetric solution candidates is one of the key enablers for high-performance solving. However, existing model-based approaches for symmetry breaking are limited to problems for which a set of representative and easily-solvable instances is available, which is often not the case in practical applications. This work extends the learning framework and implementation of a model-based approach for Answer Set Programming to overcome these limitations and address challenging problems, such as the Partner Units Problem. In particular, we incorporate a new conflict analysis algorithm in the Inductive Logic Programming system ILASP, redefine the learning task, and suggest a new example generation method to scale up the approach. The experiments conducted for different kinds of Partner Units Problem instances demonstrate the applicability of our approach and the computational benefits due to the first-order constraints learned.
△ Less
Submitted 14 May, 2022;
originally announced May 2022.
-
The Factory and the Beehive. IV. A Comprehensive Study of the Rotation X-ray Activity Relation in Praesepe and the Hyades
Authors:
Alejandro Núñez,
Marcel A. Agüeros,
Kevin R. Covey,
Stephanie T. Douglas,
Jeremy J. Drake,
Rayna Rampalli,
Emily C. Bowsher,
Phillip A. Cargile,
Adam L. Kraus,
Nicholas M. Law
Abstract:
X-ray observations of low-mass stars in open clusters are critical to understanding the dependence of magnetic activity on stellar properties and their evolution. Praesepe and the Hyades, two of the nearest, most-studied open clusters, are among the best available laboratories for examining the dependence of magnetic activity on rotation for stars with masses lower than $\approx 1\ M_{\odot}$. We…
▽ More
X-ray observations of low-mass stars in open clusters are critical to understanding the dependence of magnetic activity on stellar properties and their evolution. Praesepe and the Hyades, two of the nearest, most-studied open clusters, are among the best available laboratories for examining the dependence of magnetic activity on rotation for stars with masses lower than $\approx 1\ M_{\odot}$. We present an updated study of the rotation X-ray activity relation in the two clusters. We updated membership catalogs that combine pre-Gaia catalogs with new catalogs based on Gaia Data Release 2. The resulting catalogs are the most inclusive ones for both clusters: 1739 Praesepe and 1315 Hyades stars. We collected X-ray detections for cluster members, for which we analyzed, re-analyzed, or collated data from ROSAT, the Chandra X-ray Observatory, the Neil Gehrels Swift Observatory, and XMM-Newton. We have detections for 326 Praesepe and 462 Hyades members, of which 273 and 164, respectively, have rotation periods, an increase of 6$\times$ relative to what was previously available. We find that at $\approx$700 Myr, only M dwarfs remain saturated in X-rays, with only tentative evidence for supersaturation. We also find a tight relation between the Rossby number and fractional X-ray luminosity $L_\mathrm{X}/L_\mathrm{bol}$ in unsaturated single members, suggesting a power-law index between $-3.2$ and $-3.9$. Lastly, we find no difference in the coronal parameters between binary and single members. These results provide essential insight into the relative efficiency of magnetic heating of the stars' atmospheres, thereby informing the development of robust age-rotation-activity relations.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
An Adaptive Optics Census of Companions to Northern Stars Within 25 pc with Robo-AO
Authors:
Maissa Salama,
Carl Ziegler,
Christoph Baranec,
Michael C. Liu,
Nicholas M. Law,
Reed Riddle,
Todd J. Henry,
Jennifer G. Winters,
Wei-Chun Jao,
James Ou,
Arcelia Hermosillo Ruiz
Abstract:
In order to assess the multiplicity statistics of stars across spectral types and populations in a volume-limited sample, we censused nearby stars for companions with Robo-AO. We report on observations of 1157 stars of all spectral types within 25 pc with decl. $>-13^{\circ}$ searching for tight companions. We detected 154 companion candidates with separations ranging from $\sim$0.15$''$ to 4.0…
▽ More
In order to assess the multiplicity statistics of stars across spectral types and populations in a volume-limited sample, we censused nearby stars for companions with Robo-AO. We report on observations of 1157 stars of all spectral types within 25 pc with decl. $>-13^{\circ}$ searching for tight companions. We detected 154 companion candidates with separations ranging from $\sim$0.15$''$ to 4.0$''$ and magnitude differences up to $Δ$m$_{\textit{i'}}\le$7 using the robotic adaptive optics instrument Robo-AO. We confirmed physical association from Gaia EDR3 astrometry for 53 of the companion candidates, 99 remain to be confirmed, and 2 were ruled out as background objects. We complemented the high-resolution imaging companion search with a search for co-moving objects with separations out to 10,000 AU in Gaia EDR3, which resulted in an additional 147 companions registered. Of the 301 total companions reported in this study, 49 of them are new discoveries. Out of the 191 stars with significant acceleration measurements in the Hipparcos-Gaia catalog of accelerations, we detect companions around 115 of them, with the significance of the acceleration increasing as the companion separation decreases. From this survey, we report the following multiplicity fractions (compared to literature values): 40.9%$\pm$3.0% (44%) for FGK stars and 28.2%$\pm$2.3% (27%) for M stars, as well as higher-order fractions of 5.5%$\pm$1.1% (11%) and 3.9%$\pm$0.9% (5%) for FGK stars and M type stars, respectively.
△ Less
Submitted 21 March, 2022;
originally announced March 2022.
-
Domain Adversarial Training: A Game Perspective
Authors:
David Acuna,
Marc T Law,
Guojun Zhang,
Sanja Fidler
Abstract:
The dominant line of work in domain adaptation has focused on learning invariant representations using domain-adversarial training. In this paper, we interpret this approach from a game theoretical perspective. Defining optimal solutions in domain-adversarial training as a local Nash equilibrium, we show that gradient descent in domain-adversarial training can violate the asymptotic convergence gu…
▽ More
The dominant line of work in domain adaptation has focused on learning invariant representations using domain-adversarial training. In this paper, we interpret this approach from a game theoretical perspective. Defining optimal solutions in domain-adversarial training as a local Nash equilibrium, we show that gradient descent in domain-adversarial training can violate the asymptotic convergence guarantees of the optimizer, oftentimes hindering the transfer performance. Our analysis leads us to replace gradient descent with high-order ODE solvers (i.e., Runge-Kutta), for which we derive asymptotic convergence guarantees. This family of optimizers is significantly more stable and allows more aggressive learning rates, leading to high performance gains when used as a drop-in replacement over standard optimizers. Our experiments show that in conjunction with state-of-the-art domain-adversarial methods, we achieve up to 3.5% improvement with less than of half training iterations. Our optimizers are easy to implement, free of additional parameters, and can be plugged into any domain-adversarial framework.
△ Less
Submitted 10 February, 2022;
originally announced February 2022.
-
Rank-Constrained Least-Squares: Prediction and Inference
Authors:
Michael Law,
Ya'acov Ritov,
Ruixiang Zhang,
Ziwei Zhu
Abstract:
In this work, we focus on the high-dimensional trace regression model with a low-rank coefficient matrix. We establish a nearly optimal in-sample prediction risk bound for the rank-constrained least-squares estimator under no assumptions on the design matrix. Lying at the heart of the proof is a covering number bound for the family of projection operators corresponding to the subspaces spanned by…
▽ More
In this work, we focus on the high-dimensional trace regression model with a low-rank coefficient matrix. We establish a nearly optimal in-sample prediction risk bound for the rank-constrained least-squares estimator under no assumptions on the design matrix. Lying at the heart of the proof is a covering number bound for the family of projection operators corresponding to the subspaces spanned by the design. By leveraging this complexity result, we perform a power analysis for a permutation test on the existence of a low-rank signal under the high-dimensional trace regression model. We show that the permutation test based on the rank-constrained least-squares estimator achieves non-trivial power with no assumptions on the minimum (restricted) eigenvalue of the covariance matrix of the design. Finally, we use alternating minimization to approximately solve the rank-constrained least-squares problem to evaluate its empirical in-sample prediction risk and power of the resulting permutation test in our numerical study.
△ Less
Submitted 17 April, 2022; v1 submitted 28 November, 2021;
originally announced November 2021.
-
Statistical investigation of the large-area Si(Li) detectors mass-produced for the GAPS experiment
Authors:
M. Kozai,
K. Tokunaga,
H. Fuke,
M. Yamada,
C. J. Hailey,
C. Kato,
D. Kraych,
M. Law,
E. Martinez,
K. Munakata,
K. Perez,
F. Rogers,
N. Saffold,
Y. Shimizu,
K. Tokuda,
M. Xiao
Abstract:
The lithium-drifted silicon (Si(Li)) detector developed for the General Antiparticle Spectrometer (GAPS) experiment features a thick (~2.2 mm) sensitive layer, large (10 cm) diameter, and excellent energy resolution (~4 keV for 20-100 keV X-rays) at a relatively high operating temperature (approximately -40C). Mass production of GAPS Si(Li) detectors has been performed to construct a large-volume…
▽ More
The lithium-drifted silicon (Si(Li)) detector developed for the General Antiparticle Spectrometer (GAPS) experiment features a thick (~2.2 mm) sensitive layer, large (10 cm) diameter, and excellent energy resolution (~4 keV for 20-100 keV X-rays) at a relatively high operating temperature (approximately -40C). Mass production of GAPS Si(Li) detectors has been performed to construct a large-volume silicon tracker for GAPS. We achieved the first success of the mass production of large-area Si(Li) detectors with a high (~90%) yield rate. Valuable datasets related to detector fabrication, such as detector performance and manufacturing parameters, were recorded and collected during the mass production. This study analyzes the datasets using statistical methods with the aim of comprehensively examining the mass production and to gain valuable insight into the fabrication method. Sufficient uniformities of the performance parameters (leakage current and capacitance) between detectors and strips are found, demonstrating high-quality and stable mass production. We also search for correlations between detector performance and manufacturing parameters by using data-mining techniques. Conventional multivariate analysis (multiple regression analysis) and machine-learning techniques (regression tree analysis) are complementarily used, and it is found that the Li-drift process makes a significant contribution to the performance parameters of the finished detectors. Detailed investigation of the drift process is performed using environmental data, and physical interpretations are presented. Our results provide valuable insight into the fabrication methods for this kind of large-area Si(Li) detector, and encourages future projects that require large-volume silicon trackers.
△ Less
Submitted 1 May, 2022; v1 submitted 11 November, 2021;
originally announced November 2021.
-
Feature Generation for Long-tail Classification
Authors:
Rahul Vigneswaran,
Marc T. Law,
Vineeth N. Balasubramanian,
Makarand Tapaswi
Abstract:
The visual world naturally exhibits an imbalance in the number of object or scene instances resulting in a \emph{long-tailed distribution}. This imbalance poses significant challenges for classification models based on deep learning. Oversampling instances of the tail classes attempts to solve this imbalance. However, the limited visual diversity results in a network with poor representation abili…
▽ More
The visual world naturally exhibits an imbalance in the number of object or scene instances resulting in a \emph{long-tailed distribution}. This imbalance poses significant challenges for classification models based on deep learning. Oversampling instances of the tail classes attempts to solve this imbalance. However, the limited visual diversity results in a network with poor representation ability. A simple counter to this is decoupling the representation and classifier networks and using oversampling only to train the classifier. In this paper, instead of repeatedly re-sampling the same image (and thereby features), we explore a direction that attempts to generate meaningful features by estimating the tail category's distribution. Inspired by ideas from recent work on few-shot learning, we create calibrated distributions to sample additional features that are subsequently used to train the classifier. Through several experiments on the CIFAR-100-LT (long-tail) dataset with varying imbalance factors and on mini-ImageNet-LT (long-tail), we show the efficacy of our approach and establish a new state-of-the-art. We also present a qualitative analysis of generated features using t-SNE visualizations and analyze the nearest neighbors used to calibrate the tail class distributions. Our code is available at https://github.com/rahulvigneswaran/TailCalibX.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
High-Dimensional Varying Coefficient Models with Functional Random Effects
Authors:
Michael Law,
Ya'acov Ritov
Abstract:
We consider a sparse high-dimensional varying coefficients model with random effects, a flexible linear model allowing covariates and coefficients to have a functional dependence with time. For each individual, we observe discretely sampled responses and covariates as a function of time as well as time invariant covariates. Under sampling times that are either fixed and common or random and indepe…
▽ More
We consider a sparse high-dimensional varying coefficients model with random effects, a flexible linear model allowing covariates and coefficients to have a functional dependence with time. For each individual, we observe discretely sampled responses and covariates as a function of time as well as time invariant covariates. Under sampling times that are either fixed and common or random and independent amongst individuals, we propose a projection procedure for the empirical estimation of all varying coefficients. We extend this estimator to construct confidence bands for a fixed number of varying coefficients.
△ Less
Submitted 12 October, 2021;
originally announced October 2021.
-
EvryFlare IV: Detection of periodicity in flare occurrence from cool stars with TESS
Authors:
Ward S. Howard,
Nicholas M. Law
Abstract:
Phased flaring, or the periodic occurrence of stellar flares, may probe electromagnetic star-planet interaction (SPI), binary interaction, or magnetic conditions in spots. For the first time, we explore flare periodograms for a large sample of flare stars to identify periodicity due to magnetic interactions with orbiting companions, magnetic reservoirs, or rotational phase. Previous large surveys…
▽ More
Phased flaring, or the periodic occurrence of stellar flares, may probe electromagnetic star-planet interaction (SPI), binary interaction, or magnetic conditions in spots. For the first time, we explore flare periodograms for a large sample of flare stars to identify periodicity due to magnetic interactions with orbiting companions, magnetic reservoirs, or rotational phase. Previous large surveys have explored periodicity at the stellar rotation period, but we do not assume periods must correspond with rotation in this work. Two min TESS light curves of 284 cool stars are searched for periods from 1-10 d using two newly-developed periodograms. Because flares are discrete events in noisy and incomplete data, typical periodograms are not well-suited to detect phased flaring. We construct and test a new Bayesian likelihood periodogram and a modified Lomb-Scargle periodogram. We find 6 candidates with a false-alarm probability below 1%. Three targets are >3-sigma detections of flare periodicity; the others are plausible candidates which cannot be individually confirmed. Periods range from 1.35 to 6.7 d and some, but not all, correlate with the stellar rotation period or its 1/2 alias. Periodicity from 2 targets may persist from TESS Cycle 1 into Cycle 3. The periodicity does not appear to persist for the others. Long-term changes in periodicity may result from the spot evolution observed from each candidate, which suggests magnetic conditions play an important role in sustaining periodicity.
△ Less
Submitted 13 July, 2021;
originally announced July 2021.
-
Low-Cost Access to the Deep, High-Cadence Sky: the Argus Optical Array
Authors:
Nicholas M. Law,
Hank Corbett,
Nathan W. Galliher,
Ramses Gonzalez,
Alan Vasquez,
Glenn Walters,
Lawrence Machia,
Jeff Ratzloff,
Kendall Ackley,
Chris Bizon,
Christopher Clemens,
Steven Cox,
Steven Eikenberry,
Ward S. Howard,
Amy Glazier,
Andrew W. Mann,
Robert Quimby,
Daniel Reichart,
David Trilling
Abstract:
New mass-produced, wide-field, small-aperture telescopes have the potential to revolutionize ground-based astronomy by greatly reducing the cost of collecting area. In this paper, we introduce a new class of large telescope based on these advances: an all-sky, arcsecond-resolution, 1000-telescope array which builds a simultaneously high-cadence and deep survey by observing the entire sky all night…
▽ More
New mass-produced, wide-field, small-aperture telescopes have the potential to revolutionize ground-based astronomy by greatly reducing the cost of collecting area. In this paper, we introduce a new class of large telescope based on these advances: an all-sky, arcsecond-resolution, 1000-telescope array which builds a simultaneously high-cadence and deep survey by observing the entire sky all night. As a concrete example, we describe the Argus Array, a 5m-class telescope with an all-sky field of view and the ability to reach extremely high cadences using low-noise CMOS detectors. Each 55 GPix Argus exposure covers 20% of the entire sky to g=19.6 each minute and g=21.9 each hour; a high-speed mode will allow sub-second survey cadences for short times. Deep coadds will reach g=23.6 every five nights over 47% of the sky; a larger-aperture array telescope, with an étendue close to the Rubin Observatory, could reach g=24.3 in five nights. These arrays can build two-color, million-epoch movies of the sky, enabling sensitive and rapid searches for high-speed transients, fast-radio-burst counterparts, gravitational-wave counterparts, exoplanet microlensing events, occultations by distant solar system bodies, and myriad other phenomena. An array of O(1,000) telescopes, however, would be one of the most complex astronomical instruments yet built. Standard arrays with hundreds of tracking mounts entail thousands of moving parts and exposed optics, and maintenance costs would rapidly outpace the mass-produced-hardware cost savings compared to a monolithic large telescope. We discuss how to greatly reduce operations costs by placing all optics in a thermally controlled, sealed dome with a single moving part. Coupled with careful software scope control and use of existing pipelines, we show that the Argus Array could become the deepest and fastest Northern sky survey, with total costs below $20M.
△ Less
Submitted 1 July, 2021;
originally announced July 2021.
-
Rotation periods of TESS Objects of Interest from the Magellan-TESS Survey with multiband photometry from Evryscope and TESS
Authors:
Ward S. Howard,
Johanna Teske,
Hank Corbett,
Nicholas M. Law,
Sharon Xuesong Wang,
Jeffrey K. Ratzloff,
Nathan W. Galliher,
Ramses Gonzalez,
Alan Vasquez Soto,
Amy L. Glazier,
Joshua Haislip
Abstract:
Stellar RV jitter due to surface activity may bias the RV semi-amplitude and mass of rocky planets. The amplitude of the jitter may be estimated from the uncertainty in the rotation period, allowing the mass to be more accurately obtained. We find candidate rotation periods for 17 out of 35 TESS Objects of Interest (TOI) hosting <3 R_Earth planets as part of the Magellan-TESS Survey, which is the…
▽ More
Stellar RV jitter due to surface activity may bias the RV semi-amplitude and mass of rocky planets. The amplitude of the jitter may be estimated from the uncertainty in the rotation period, allowing the mass to be more accurately obtained. We find candidate rotation periods for 17 out of 35 TESS Objects of Interest (TOI) hosting <3 R_Earth planets as part of the Magellan-TESS Survey, which is the first-ever statistically robust study of exoplanet masses and radii across the photo-evaporation gap. Seven periods are 3+ sigma detections, two are 1.5+ sigma, and 8 show plausible variability but the periods remain unconfirmed. The other 18 TOIs are non-detections. Candidate rotators include the host stars of the confirmed planets L 168-9 b, the HD 21749 system, LTT 1445 A b, TOI 1062 b, and the L 98-59 system. 13 candidates have no counterpart in the 1000 TOI rotation catalog of Canto Martins et al. (2020). We find periods for G3-M3 dwarfs using combined light curves from TESS and the Evryscope all-sky array of small telescopes, sometimes with longer periods than would be possible with TESS alone. Secure periods range from 1.4 to 26 d with Evryscope-measured photometric amplitudes as small as 2.1 mmag in g'. We also apply Monte Carlo sampling and a Gaussian Process stellar activity model from the code exoplanet to the TESS light curves of 6 TOIs to confirm the Evryscope periods.
△ Less
Submitted 29 June, 2021;
originally announced June 2021.
-
Three K2 Campaigns Yield Rotation Periods for 1013 Stars in Praesepe
Authors:
Rayna Rampalli,
Marcel A. Agüeros,
Jason L. Curtis,
Stephanie T. Douglas,
Alejandro Núñez,
Phillip A. Cargile,
Kevin R. Covey,
Natalie M. Gosnell,
Adam L. Kraus,
Nicholas M. Law,
Andrew W. Mann
Abstract:
We use three campaigns of K2 observations to complete the census of rotation in low-mass members of the benchmark, $\approx$670-Myr-old open cluster Praesepe. We measure new rotation periods (\prot) for 220 $\lesssim$1.3~\Msun\ Praesepe members and recover periods for $97\%$ (793/812) of the stars with a \prot\ in the literature. Of the 19 stars for which we do not recover a \prot, 17 were not obs…
▽ More
We use three campaigns of K2 observations to complete the census of rotation in low-mass members of the benchmark, $\approx$670-Myr-old open cluster Praesepe. We measure new rotation periods (\prot) for 220 $\lesssim$1.3~\Msun\ Praesepe members and recover periods for $97\%$ (793/812) of the stars with a \prot\ in the literature. Of the 19 stars for which we do not recover a \prot, 17 were not observed by K2. As K2's three Praesepe campaigns took place over the course of three years, we test the stability of our measured \prot\ for stars observed in more than one campaign. We measure \prot\ consistent to within $10\%$ for $>95\%$ of the 331 likely single stars with $\geq$2 high-quality observations; the median difference in \prot\ is $0.3\%$, with a standard deviation of $2\%$. Nearly all of the exceptions are stars with discrepant \prot\ measurements in Campaign 18, K2's last, which was significantly shorter than the earlier two ($\approx$50~d rather than $\approx$75~d). This suggests that, despite the evident morphological evolution we observe in the light curves of $38\%$ of the stars, \prot\ measurements for low-mass stars in Praesepe are stable on timescales of several years. A \prot\ can therefore be taken to be representative even if measured only once.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
FF-NSL: Feed-Forward Neural-Symbolic Learner
Authors:
Daniel Cunnington,
Mark Law,
Alessandra Russo,
Jorge Lobo
Abstract:
Logic-based machine learning aims to learn general, interpretable knowledge in a data-efficient manner. However, labelled data must be specified in a structured logical form. To address this limitation, we propose a neural-symbolic learning framework, called Feed-Forward Neural-Symbolic Learner (FFNSL), that integrates a logic-based machine learning system capable of learning from noisy examples,…
▽ More
Logic-based machine learning aims to learn general, interpretable knowledge in a data-efficient manner. However, labelled data must be specified in a structured logical form. To address this limitation, we propose a neural-symbolic learning framework, called Feed-Forward Neural-Symbolic Learner (FFNSL), that integrates a logic-based machine learning system capable of learning from noisy examples, with neural networks, in order to learn interpretable knowledge from labelled unstructured data. We demonstrate the generality of FFNSL on four neural-symbolic classification problems, where different pre-trained neural network models and logic-based machine learning systems are integrated to learn interpretable knowledge from sequences of images. We evaluate the robustness of our framework by using images subject to distributional shifts, for which the pre-trained neural networks may predict incorrectly and with high confidence. We analyse the impact that these shifts have on the accuracy of the learned knowledge and run-time performance, comparing FFNSL to tree-based and pure neural approaches. Our experimental results show that FFNSL outperforms the baselines by learning more accurate and interpretable knowledge with fewer examples.
△ Less
Submitted 5 January, 2023; v1 submitted 24 June, 2021;
originally announced June 2021.
-
f-Domain-Adversarial Learning: Theory and Algorithms
Authors:
David Acuna,
Guojun Zhang,
Marc T. Law,
Sanja Fidler
Abstract:
Unsupervised domain adaptation is used in many machine learning applications where, during training, a model has access to unlabeled data in the target domain, and a related labeled dataset. In this paper, we introduce a novel and general domain-adversarial framework. Specifically, we derive a novel generalization bound for domain adaptation that exploits a new measure of discrepancy between distr…
▽ More
Unsupervised domain adaptation is used in many machine learning applications where, during training, a model has access to unlabeled data in the target domain, and a related labeled dataset. In this paper, we introduce a novel and general domain-adversarial framework. Specifically, we derive a novel generalization bound for domain adaptation that exploits a new measure of discrepancy between distributions based on a variational characterization of f-divergences. It recovers the theoretical results from Ben-David et al. (2010a) as a special case and supports divergences used in practice. Based on this bound, we derive a new algorithmic framework that introduces a key correction in the original adversarial training method of Ganin et al. (2016). We show that many regularizers and ad-hoc objectives introduced over the last years in this framework are then not required to achieve performance comparable to (if not better than) state-of-the-art domain-adversarial methods. Experimental analysis conducted on real-world natural language and computer vision datasets show that our framework outperforms existing baselines, and obtains the best results for f-divergences that were not considered previously in domain-adversarial learning.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
Critical Currents in Conventional Josephson Junctions With Grain Boundaries
Authors:
Miguel Antonio Sulangi,
Laetitia Bettmann,
T. A. Weingartner,
N. Pokhrel,
E. Patrick,
M. Law,
A. Kreisel,
P. J. Hirschfeld
Abstract:
It has been hypothesized that the variation of the critical currents in Nb/Al-AlO$_x$/Nb junctions is due to, among other effects, the presence of grain boundaries in the system. Motivated by this, we examine the effect of grain boundaries on the critical current of a Josephson junction. We assume that the hop** amplitudes are dependent on the interatomic distance, and derive a physically realis…
▽ More
It has been hypothesized that the variation of the critical currents in Nb/Al-AlO$_x$/Nb junctions is due to, among other effects, the presence of grain boundaries in the system. Motivated by this, we examine the effect of grain boundaries on the critical current of a Josephson junction. We assume that the hop** amplitudes are dependent on the interatomic distance, and derive a physically realistic model of distance-dependent hop** amplitudes. We find that the presence of a grain boundary and associated disorder is responsible for a very large drop in the critical current relative to a clean system. We also find that when a tunnel barrier is present, grain boundaries cause substantial variations in the critical currents due to the disordered hop**s near the tunnel barrier. We discuss the applicability of these results to Josephson junctions presently intended for use in superconducting electronics applications.
△ Less
Submitted 12 October, 2021; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Low Budget Active Learning via Wasserstein Distance: An Integer Programming Approach
Authors:
Rafid Mahmood,
Sanja Fidler,
Marc T. Law
Abstract:
Active learning is the process of training a model with limited labeled data by selecting a core subset of an unlabeled data pool to label. The large scale of data sets used in deep learning forces most sample selection strategies to employ efficient heuristics. This paper introduces an integer optimization problem for selecting a core set that minimizes the discrete Wasserstein distance from the…
▽ More
Active learning is the process of training a model with limited labeled data by selecting a core subset of an unlabeled data pool to label. The large scale of data sets used in deep learning forces most sample selection strategies to employ efficient heuristics. This paper introduces an integer optimization problem for selecting a core set that minimizes the discrete Wasserstein distance from the unlabeled pool. We demonstrate that this problem can be tractably solved with a Generalized Benders Decomposition algorithm. Our strategy uses high-quality latent features that can be obtained by unsupervised learning on the unlabeled pool. Numerical results on several data sets show that our optimization approach is competitive with baselines and particularly outperforms them in the low budget regime where less than one percent of the data set is labeled.
△ Less
Submitted 6 March, 2023; v1 submitted 5 June, 2021;
originally announced June 2021.
-
Large Adaptive Optics Survey for Substellar Objects (LASSO) Around Young, Nearby, Low-mass Stars with Robo-AO
Authors:
Maissa Salama,
James Ou,
Christoph Baranec,
Michael C. Liu,
Brendan P. Bowler,
Paul Barnes,
Morgan Bonnet,
Mark Chun,
Dmitry A. Duev,
Sean Goebel,
Don Hall,
Shane Jacobson,
Rebecca Jensen-Clem,
Nicholas M. Law,
Charles Lockhart,
Reed Riddle,
Heather Situ,
Eric Warmbier,
Zhoujian Zhang
Abstract:
We present results from the Large Adaptive optics Survey for Substellar Objects (LASSO), where the goal is to directly image new substellar companions (<70 M$_{Jup}$) at wide orbital separations ($\gtrsim$50 AU) around young ($\lesssim$300 Myrs), nearby (<100 pc), low-mass ($\approx$0.1-0.8 M$_{\odot}$) stars. We report on 427 young stars imaged in the visible (i') and near-infrared (J or H) simul…
▽ More
We present results from the Large Adaptive optics Survey for Substellar Objects (LASSO), where the goal is to directly image new substellar companions (<70 M$_{Jup}$) at wide orbital separations ($\gtrsim$50 AU) around young ($\lesssim$300 Myrs), nearby (<100 pc), low-mass ($\approx$0.1-0.8 M$_{\odot}$) stars. We report on 427 young stars imaged in the visible (i') and near-infrared (J or H) simultaneously with Robo-AO on the Kitt Peak 2.1-m telescope and later the Maunakea University of Hawaii 2.2-m telescope. To undertake the observations, we commissioned a new infrared camera for Robo-AO that uses a low-noise high-speed SAPHIRA avalanche photodiode detector. We detected 121 companion candidates around 111 stars, of which 62 companions are physically associated based on Gaia DR2 parallaxes and proper motions, another 45 require follow-up observations to confirm physical association, and 14 are background objects. The companion separations range from 2-1101 AU and reach contrast ratios of 7.7 magnitudes in the near infrared compared to the primary. The majority of confirmed and pending candidates are stellar companions, with ~5 being potentially substellar and requiring follow-up observations for confirmation. We also detected a 43$\pm$9 M$_{Jup}$ and an 81$\pm$5 M$_{Jup}$ companion that were previously reported. We found 34 of our targets have acceleration measurements detected using Hipparcos-Gaia proper motions. Of those, 58$^{+12}_{-14}$% of the 12 stars with imaged companion candidates have significant accelerations ($χ^2 >11.8$), while only 23$^{+11}_{-6}$% of the remaining 22 stars with no detected companion have significant accelerations. The significance of the acceleration decreases with increasing companion separation. These young accelerating low-mass stars with companions will eventually yield dynamical masses with future orbit monitoring.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
TIC 172900988: A Transiting Circumbinary Planet Detected in One Sector of TESS Data
Authors:
Veselin B. Kostov,
Brian P. Powell,
Jerome A. Orosz,
William F. Welsh,
William Cochran,
Karen A. Collins,
Michael Endl,
Coel Hellier,
David W. Latham,
Phillip MacQueen,
Joshua Pepper,
Billy Quarles,
Lalitha Sairam,
Guillermo Torres,
Robert F. Wilson,
Serge Bergeron,
Pat Boyce,
Allyson Bieryla,
Robert Buchheim,
Caleb Ben Christiansen,
David R. Ciardi,
Kevin I. Collins,
Dennis M. Conti,
Scott Dixon,
Pere Guerra
, et al. (64 additional authors not shown)
Abstract:
We report the first discovery of a transiting circumbinary planet detected from a single sector of TESS data. During Sector 21, the planet TIC 172900988b transited the primary star and then 5 days later it transited the secondary star. The binary is itself eclipsing, with a period of P = 19.7 days and an eccentricity of e = 0.45. Archival data from ASAS-SN, Evryscope, KELT, and SuperWASP reveal a…
▽ More
We report the first discovery of a transiting circumbinary planet detected from a single sector of TESS data. During Sector 21, the planet TIC 172900988b transited the primary star and then 5 days later it transited the secondary star. The binary is itself eclipsing, with a period of P = 19.7 days and an eccentricity of e = 0.45. Archival data from ASAS-SN, Evryscope, KELT, and SuperWASP reveal a prominent apsidal motion of the binary orbit, caused by the dynamical interactions between the binary and the planet. A comprehensive photodynamical analysis of the TESS, archival and follow-up data yields stellar masses and radii of M1 = 1.2384 +/- 0.0007 MSun and R1 = 1.3827 +/- 0.0016 RSun for the primary and M2 = 1.2019 +/- 0.0007 MSun and R2 = 1.3124 +/- 0.0012 RSun for the secondary. The radius of the planet is R3 = 11.25 +/- 0.44 REarth (1.004 +/- 0.039 RJup). The planet's mass and orbital properties are not uniquely determined - there are six solutions with nearly equal likelihood. Specifically, we find that the planet's mass is in the range of 824 < M3 < 981 MEarth (2.65 < M3 < 3.09 MJup), its orbital period could be 188.8, 190.4, 194.0, 199.0, 200.4, or 204.1 days, and the eccentricity is between 0.02 and 0.09. At a V = 10.141 mag, the system is accessible for high-resolution spectroscopic observations, e.g. Rossiter-McLaughlin effect and transit spectroscopy.
△ Less
Submitted 27 August, 2021; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Discovery of an Extremely Short Duration Flare from Proxima Centauri Using Millimeter through FUV Observations
Authors:
Meredith A. MacGregor,
Alycia J. Weinberger,
R. O. Parke Loyd,
Evgenya Shkolnik,
Thomas Barclay,
Ward S. Howard,
Andrew Zic,
Rachel A. Osten,
Steven R. Cranmer,
Adam F. Kowalski,
Emil Lenc,
Allison Youngblood,
Anna Estes,
David J. Wilner,
Jan Forbrich,
Anna Hughes,
Nicholas M. Law,
Tara Murphy,
Aaron Boley,
Jaymie Matthews
Abstract:
We present the discovery of an extreme flaring event from Proxima Cen by ASKAP, ALMA, HST, TESS, and the du Pont Telescope that occurred on 2019 May 1. In the millimeter and FUV, this flare is the brightest ever detected, brightening by a factor of >1000 and >14000 as seen by ALMA and HST, respectively. The millimeter and FUV continuum emission trace each other closely during the flare, suggesting…
▽ More
We present the discovery of an extreme flaring event from Proxima Cen by ASKAP, ALMA, HST, TESS, and the du Pont Telescope that occurred on 2019 May 1. In the millimeter and FUV, this flare is the brightest ever detected, brightening by a factor of >1000 and >14000 as seen by ALMA and HST, respectively. The millimeter and FUV continuum emission trace each other closely during the flare, suggesting that millimeter emission could serve as a proxy for FUV emission from stellar flares and become a powerful new tool to constrain the high-energy radiation environment of exoplanets. Surprisingly, optical emission associated with the event peaks at a much lower level with a time delay. The initial burst has an extremely short duration, lasting for <10 sec. Taken together with the growing sample of millimeter M dwarf flares, this event suggests that millimeter emission is actually common during stellar flares and often originates from short burst-like events.
△ Less
Submitted 19 April, 2021;
originally announced April 2021.
-
TESS Delivers Five New Hot Giant Planets Orbiting Bright Stars from the Full Frame Images
Authors:
Joseph E. Rodriguez,
Samuel N. Quinn,
George Zhou,
Andrew Vanderburg,
Louise D. Nielsen,
Robert A. Wittenmyer,
Rafael Brahm,
Phillip A. Reed,
Chelsea X. Huang,
Sydney Vach,
David R. Ciardi,
Ryan J. Oelkers,
Keivan G. Stassun,
Coel Hellier,
B. Scott Gaudi,
Jason D. Eastman,
Karen A. Collins,
Allyson Bieryla,
Sam Christian,
David W. Latham,
Ilaria Carleo,
Duncan J. Wright,
Elisabeth Matthews,
Erica J. Gonzales,
Carl Ziegler
, et al. (93 additional authors not shown)
Abstract:
We present the discovery and characterization of five hot and warm Jupiters -- TOI-628 b (TIC 281408474; HD 288842), TOI-640 b (TIC 147977348), TOI-1333 b (TIC 395171208, BD+47 3521A), TOI-1478 b (TIC 409794137), and TOI-1601 b (TIC 139375960) -- based on data from NASA's Transiting Exoplanet Survey Satellite (TESS). The five planets were identified from the full frame images and were confirmed th…
▽ More
We present the discovery and characterization of five hot and warm Jupiters -- TOI-628 b (TIC 281408474; HD 288842), TOI-640 b (TIC 147977348), TOI-1333 b (TIC 395171208, BD+47 3521A), TOI-1478 b (TIC 409794137), and TOI-1601 b (TIC 139375960) -- based on data from NASA's Transiting Exoplanet Survey Satellite (TESS). The five planets were identified from the full frame images and were confirmed through a series of photometric and spectroscopic follow-up observations by the $TESS$ Follow-up Observing Program (TFOP) Working Group. The planets are all Jovian size (R$_{\rm P}$ = 1.01-1.77 R$_{\rm J}$) and have masses that range from 0.85 to 6.33 M$_{\rm J}$. The host stars of these systems have F and G spectral types (5595 $\le$ T$_{\rm eff}$ $\le$ 6460 K) and are all relatively bright (9 $<V<$ 10.8, 8.2 $<K<$ 9.3) making them well-suited for future detailed characterization efforts. Three of the systems in our sample (TOI-640 b, TOI-1333 b, and TOI-1601 b) orbit subgiant host stars (log g$_*$ $<$4.1). TOI-640 b is one of only three known hot Jupiters to have a highly inflated radius (R$_{\rm P}$ > 1.7R$_{\rm J}$, possibly a result of its host star's evolution) and resides on an orbit with a period longer than 5 days. TOI-628 b is the most massive hot Jupiter discovered to date by $TESS$ with a measured mass of $6.31^{+0.28}_{-0.30}$ M$_{\rm J}$ and a statistically significant, non-zero orbital eccentricity of e = $0.074^{+0.021}_{-0.022}$. This planet would not have had enough time to circularize through tidal forces from our analysis, suggesting that it might be remnant eccentricity from its migration. The longest period planet in this sample, TOI-1478 b (P = 10.18 days), is a warm Jupiter in a circular orbit around a near-Solar analogue. NASA's $TESS$ mission is continuing to increase the sample of well-characterized hot and warm Jupiters, complementing its primary mission goals.
△ Less
Submitted 9 February, 2021; v1 submitted 5 January, 2021;
originally announced January 2021.
-
Conflict-driven Inductive Logic Programming
Authors:
Mark Law
Abstract:
The goal of Inductive Logic Programming (ILP) is to learn a program that explains a set of examples. Until recently, most research on ILP targeted learning Prolog programs. The ILASP system instead learns Answer Set Programs (ASP). Learning such expressive programs widens the applicability of ILP considerably; for example, enabling preference learning, learning common-sense knowledge, including de…
▽ More
The goal of Inductive Logic Programming (ILP) is to learn a program that explains a set of examples. Until recently, most research on ILP targeted learning Prolog programs. The ILASP system instead learns Answer Set Programs (ASP). Learning such expressive programs widens the applicability of ILP considerably; for example, enabling preference learning, learning common-sense knowledge, including defaults and exceptions, and learning non-deterministic theories.
Early versions of ILASP can be considered meta-level ILP approaches, which encode a learning task as a logic program and delegate the search to an ASP solver. More recently, ILASP has shifted towards a new method, inspired by conflict-driven SAT and ASP solvers. The fundamental idea of the approach, called Conflict-driven ILP (CDILP), is to iteratively interleave the search for a hypothesis with the generation of constraints which explain why the current hypothesis does not cover a particular example. These coverage constraints allow ILASP to rule out not just the current hypothesis, but an entire class of hypotheses that do not satisfy the coverage constraint.
This paper formalises the CDILP approach and presents the ILASP3 and ILASP4 systems for CDILP, which are demonstrated to be more scalable than previous ILASP systems, particularly in the presence of noise.
Under consideration in Theory and Practice of Logic Programming (TPLP).
△ Less
Submitted 14 January, 2022; v1 submitted 31 December, 2020;
originally announced January 2021.