-
An upgraded 0.4-meter telescope fleet for Las Cumbres Observatory's Educational and Science Programs
Authors:
Daniel-Rolf Harbeck,
Brook Taylor,
Annie Kirby,
Mark Bowman,
Steve Foale,
Kal Kadlec,
Curtis McCully,
Matthew Daily,
Jon DeVera,
Dave Douglass,
Mark Willis,
Ian Baker,
Nikolaus Volgenau,
Patrick Conway,
Brian Haworth,
Jesus Estrada,
Edward Gomez,
Sandy Seale,
Alice Hopkinson,
Fernando Rios,
Prerana Kotapali,
Lisa Storrie-Lombardi,
Wayne Rosing
Abstract:
Las Cumbres Observatory (LCOGT) operates a global network of robotic 0.4, 1.0, and 2.0-meter telescopes to facilitate scientific research and education in time-domain astronomy. LCOGT's flagship educational program, Global Sky Partners (GSP), awards up to 1500 hours per year of telescope time to individuals and organizations that run their own, fully supported, educational programs. The GSP has a presence in 40 countries and 45% of the Partners target under-served, under-represented, and developing world audiences. The degradation and obsolescence of the original 0.4-meter telescope network prompted LCOGT to update the fleet of 10 telescopes to a new system consisting of predominantly off-the-shelf products. New PlaneWave DeltaRho 350 telescopes with Gemini Focuser/Rotators, LCOGT filter wheels, and QHY600 CMOS cameras complement the original, custom-built mount. The deployment of all ten telescopes was completed in March 2024. We describe the design and performance of this new system and its components. We comment on modifications made to the QHY600 cameras, as well as on the treatment of random telegraph noise of their CMOS detectors within our data processing system BANZAI. The new telescope network supports the GSP program as well as multiple key science projects, including follow-up observations for the TESS satellite mission.
Submitted 16 May, 2024;
originally announced May 2024.
-
PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models
Authors:
Shashi Kant Gupta,
Aditya Basu,
Mauro Nievas,
Jerrin Thomas,
Nathan Wolfrath,
Adhitya Ramamurthi,
Bradley Taylor,
Anai N. Kothari,
Regina Schwind,
Therica M. Miller,
Sorena Nadaf-Rahrov,
Yanshan Wang,
Hrituraj Singh
Abstract:
Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients missing out on potential therapeutic options. Recent advancements in Large Language Models (LLMs) have made automating patient-trial matching possible, as shown in multiple concurrent research studies. However, the current approaches are confined to constrained, often synthetic datasets that do not adequately mirror the complexities encountered in real-world medical data. In this study, we present the first, end-to-end large-scale empirical evaluation of clinical trial matching using real-world EHRs. Our study showcases the capability of LLMs to accurately match patients with appropriate clinical trials. We perform experiments with proprietary LLMs, including GPT-4 and GPT-3.5, as well as our custom fine-tuned model called OncoLLM and show that OncoLLM, despite its significantly smaller size, not only outperforms GPT-3.5 but also matches the performance of qualified medical doctors. All experiments were carried out on real-world EHRs that include clinical notes and available clinical trials from a single cancer center in the United States.
Submitted 26 April, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry
Authors:
Gaia Collaboration,
P. Panuzzo,
T. Mazeh,
F. Arenou,
B. Holl,
E. Caffau,
A. Jorissen,
C. Babusiaux,
P. Gavras,
J. Sahlmann,
U. Bastian,
Ł. Wyrzykowski,
L. Eyer,
N. Leclerc,
N. Bauchet,
A. Bombrun,
N. Mowlavi,
G. M. Seabroke,
D. Teyssier,
E. Balbinot,
A. Helmi,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne
, et al. (390 additional authors not shown)
Abstract:
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is expected to uncover many Galactic wide-binary systems containing dormant BHs, which may not have been detected before. The study of this population will provide new information on the BH-mass distribution in binaries and shed light on their formation mechanisms and progenitors. As part of the validation efforts in preparation for the fourth Gaia data release (DR4), we analysed the preliminary astrometric binary solutions, obtained by the Gaia Non-Single Star pipeline, to verify their significance and to minimise false-detection rates in high-mass-function orbital solutions. The astrometric binary solution of one source, Gaia BH3, implies the presence of a $32.70 \pm 0.82\,M_\odot$ BH in a binary system with a period of 11.6 yr. Gaia radial velocities independently validate the astrometric orbit. Broad-band photometric and spectroscopic data show that the visible component is an old, very metal-poor giant of the Galactic halo, at a distance of 590 pc. The BH in the Gaia BH3 system is more massive than any other Galactic stellar-origin BH known thus far. The low metallicity of the star companion supports the scenario that metal-poor massive stars are progenitors of the high-mass BHs detected by gravitational-wave telescopes. The Galactic orbit of the system and its metallicity indicate that it might belong to the Sequoia halo substructure. Alternatively, and more plausibly, it could belong to the ED-2 stream, which likely originated from a globular cluster that had been disrupted by the Milky Way.
Submitted 19 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology
Authors:
Shashi Kant Gupta,
Aditya Basu,
Bradley Taylor,
Anai Kothari,
Hrituraj Singh
Abstract:
Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed retrieval-augmented generation (RAG) pipelines to answer any query. However, retrieving information from real-world clinical data contained within EHR systems to support downstream use cases is challenging because query-document support pairs are difficult to create. We provide a blueprint for creating such datasets in an affordable manner using large language models. Our method results in a retriever that is 30-50 F-1 points better than proprietary counterparts such as Ada and Mistral for oncology data elements. We further compare our model, called Onco-Retriever, against a fine-tuned PubMedBERT model. We conduct an extensive manual evaluation on real-world EHR data along with latency analysis of the different models and provide a path forward for healthcare organizations to build domain-specific retrievers.
Submitted 9 April, 2024;
originally announced April 2024.
-
Spatial Latent Gaussian Modelling with Change of Support
Authors:
Erick A. Chacón-Montalván,
Peter M. Atkinson,
Christopher Nemeth,
Benjamin M. Taylor,
Paula Moraga
Abstract:
Spatial data are often derived from multiple sources (e.g. satellites, in-situ sensors, survey samples) with different supports, but associated with the same properties of a spatial phenomenon of interest. It is common for predictors to also be measured on different spatial supports than the response variables. Although there is no standard way to work with spatial data with different supports, a prevalent approach used by practitioners has been to use downscaling or interpolation to project all the variables of analysis towards a common support, and then to apply standard spatial models. The main disadvantage of this approach is that simple interpolation can introduce biases and, more importantly, the uncertainty associated with the change of support is not taken into account in parameter estimation. In this article, we propose a Bayesian spatial latent Gaussian model that can handle data with different rectilinear supports in both the response variable and predictors. Our approach handles changes of support more naturally, according to the properties of the spatial stochastic process being used, and takes into account the uncertainty from the change of support in parameter estimation and prediction. We represent spatial stochastic processes as linear combinations of basis functions where Gaussian Markov random fields define the weights. Our hierarchical modelling approach can be described by the following steps: (i) define a latent model where response variables and predictors are considered as latent stochastic processes with continuous support, (ii) link the continuous-index set stochastic processes with their projections onto the support of the observed data, (iii) link the projected processes with the observed data. We show the applicability of our approach through simulation studies and by modelling land suitability for improved grassland in Rhondda Cynon Taf, a county borough in Wales.
Submitted 13 March, 2024;
originally announced March 2024.
-
Spatially Resolved Observations of Meteor Radio Afterglows with the OVRO-LWA
Authors:
S. S. Varghese,
J. Dowell,
K. S. Obenberger,
G. B. Taylor,
M. Anderson,
G. Hallinan
Abstract:
We conducted an all-sky imaging transient search with the Owens Valley Radio Observatory Long Wavelength Array (OVRO-LWA) data collected during the Perseid meteor shower in 2018. The data collection during the meteor shower was motivated by a search for intrinsic radio emission from meteors below 60 MHz, known as meteor radio afterglows (MRAs). The data collected were calibrated and imaged using the core array to obtain lower angular resolution images of the sky. These images were input to a pre-existing LWA transient search pipeline to search for MRAs as well as cosmic radio transients. This search detected 5 MRAs and did not find any cosmic transients. We further conducted peeling of bright sources, near-field correction, visibility differencing and higher angular resolution imaging using the full array for these 5 MRAs. These higher angular resolution images were used to study their plasma emission structures and monitor their evolution as a function of frequency and time. With higher angular resolution imaging, we resolved the radio emission size scales to less than 1 km physical size at 100 km heights. The spectral index mapping of one of the long duration events showed signs of diffusion of plasma within the meteor trails. The unpolarized emission from the resolved radio components suggests resonant transition radiation as the possible radiation mechanism of MRAs.
Submitted 8 March, 2024;
originally announced March 2024.
-
Sardinia Radio Telescope observations of the Coma Cluster
Authors:
M. Murgia,
F. Govoni,
V. Vacca,
F. Loi,
L. Feretti,
G. Giovannini,
A. Melis,
R. Concu,
E. Carretti,
S. Poppi,
G. Valente,
A. Bonafede,
G. Bernardi,
W. Boschin,
M. Brienza,
T. E. Clarke,
F. de Gasperin,
T. A. Ensslin,
C. Ferrari,
F. Gastaldello,
M. Girardi,
L. Gregorini,
M. Johnston-Hollitt,
E. Orru',
P. Parma
, et al. (3 additional authors not shown)
Abstract:
We present deep total intensity and polarization observations of the Coma cluster at 1.4 and 6.6 GHz performed with the Sardinia Radio Telescope. By combining the single-dish 1.4 GHz data with archival Very Large Array observations we obtain new images of the central radio halo and of the peripheral radio relic where we properly recover the brightness from the large scale structures. At 6.6 GHz we detect both the relic and the central part of the halo in total intensity and polarization. These are the highest frequency images available to date for these radio sources in this galaxy cluster. In the halo, we find a localized spot of polarized signal, with fractional polarization of about 45%. The polarized emission possibly extends along the north-east side of the diffuse emission. The relic is highly polarized, up to 55%, as usually found for these sources. We confirm the halo spectrum is curved, in agreement with previous single-dish results. The spectral index is alpha=1.48 +/- 0.07 at a reference frequency of 1 GHz and varies from alpha ~1.1, at 0.1 GHz, up to alpha ~ 1.8, at 10 GHz. We compare the Coma radio halo surface brightness profile at 1.4 GHz (central brightness and e-folding radius) with the same properties of the other halos, and we find that it has one of the lowest emissivities observed so far. Reanalyzing the relic's spectrum in the light of the new data, we obtain a refined radio Mach number of M=2.9 +/- 0.1.
Submitted 11 February, 2024;
originally announced February 2024.
-
The Central Kinematics and Black Hole Mass of 4C+37.11
Authors:
Tirth Surti,
Roger W. Romani,
Julia Scharwächter,
Alison Peck,
Greg B. Taylor
Abstract:
We report on IFU measurements of the host of the radio source 4C+37.11. This massive elliptical contains the only resolved double compact nucleus at pc-scale separation, likely a bound supermassive black hole binary (SMBHB). $i$-band photometry and GMOS-N IFU spectroscopy show that the galaxy has a large $r_b=1.5^{\prime\prime}$ core and that the stellar velocity dispersion increases inside of a radius of influence $r_{\rm SOI} \approx 1.3^{\prime\prime}$. Jeans Anisotropic Modeling analysis of the core infers a total SMBHB mass of $2.8^{+0.8}_{-0.8} \times 10^{10}M_\odot$, making this one of the most massive black hole systems known. Our data indicate that there has been significant scouring of the central kpc of the host galaxy.
Submitted 12 December, 2023;
originally announced December 2023.
-
Bayesian inference of a new Mallows model for characterising symptom sequences applied in primary progressive aphasia
Authors:
Beatrice Taylor,
Cameron Shand,
Chris J. D. Hardy,
Neil Oxtoby
Abstract:
Machine learning models offer the potential to understand diverse datasets in a data-driven way, powering insights into individual disease experiences and ensuring equitable healthcare. In this study, we explore Bayesian inference for characterising symptom sequences, and the associated modelling challenges. We adapted the Mallows model to account for partial rankings and right-censored data, employing custom MCMC fitting. Our evaluation, encompassing synthetic data and a primary progressive aphasia dataset, highlights the model's efficacy in revealing mean orderings and estimating ranking variance. This holds the potential to enhance clinical comprehension of symptom occurrence. However, our work encounters limitations concerning model scalability and small dataset sizes.
Submitted 22 November, 2023;
originally announced November 2023.
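The Mallows model named in the abstract above assigns each ordering a probability that decays exponentially with its distance from a central (mean) ordering. The sketch below shows only the textbook form for complete rankings with the Kendall tau distance; the paper's extensions to partial rankings and right-censored data, and its MCMC fitting, are not reproduced here. The symptom labels and the value of `theta` are hypothetical.

```python
import itertools
import math


def kendall_tau_distance(ranking, reference):
    """Count the item pairs that the two orderings place in opposite order."""
    position = {item: idx for idx, item in enumerate(reference)}
    mapped = [position[item] for item in ranking]
    return sum(
        1
        for i, j in itertools.combinations(range(len(mapped)), 2)
        if mapped[i] > mapped[j]
    )


def mallows_probability(ranking, reference, theta):
    """Probability of `ranking` under a Mallows model centred on `reference`.

    Requires theta > 0; larger theta concentrates mass on the reference.
    """
    n = len(reference)
    # Closed-form normalising constant for the Kendall tau distance.
    z = math.prod(
        (1.0 - math.exp(-theta * k)) / (1.0 - math.exp(-theta))
        for k in range(1, n + 1)
    )
    return math.exp(-theta * kendall_tau_distance(ranking, reference)) / z


# Hypothetical symptom labels and dispersion parameter, for illustration only.
reference = ["A", "B", "C", "D"]
theta = 0.8
for candidate in (["A", "B", "C", "D"], ["B", "A", "C", "D"], ["D", "C", "B", "A"]):
    print(candidate, mallows_probability(candidate, reference, theta))
```

Orderings that swap more pairs relative to the reference receive lower probability, which is the sense in which such a model can describe both a mean ordering and the spread around it.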
-
Gaia Focused Product Release: Sources from Service Interface Function image analysis -- Half a million new sources in omega Centauri
Authors:
Gaia Collaboration,
K. Weingrill,
A. Mints,
J. Castañeda,
Z. Kostrzewa-Rutkowska,
M. Davidson,
F. De Angeli,
J. Hernández,
F. Torra,
M. Ramos-Lerate,
C. Babusiaux,
M. Biermann,
C. Crowley,
D. W. Evans,
L. Lindegren,
J. M. Martín-Fleitas,
L. Palaversa,
D. Ruz Mieres,
K. Tisanić,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
A. Barbier
, et al. (378 additional authors not shown)
Abstract:
Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This new pipeline produced half a million additional Gaia sources in the region of the omega Centauri ($ω$ Cen) cluster, which are published with this Focused Product Release. We discuss the dedicated SIF CF data reduction pipeline, validate its data products, and introduce their Gaia archive table. Our aim is to improve the completeness of the {\it Gaia} source inventory in a very dense region in the sky, $ω$ Cen. An adapted version of {\it Gaia}'s Source Detection and Image Parameter Determination software located sources in the 2D SIF CF images. We validated the results by comparing them to the public {\it Gaia} DR3 catalogue and external Hubble Space Telescope data. With this Focused Product Release, 526\,587 new sources have been added to the {\it Gaia} catalogue in $ω$ Cen. Apart from positions and brightnesses, the additional catalogue contains parallaxes and proper motions, but no meaningful colour information. While SIF CF source parameters generally have a lower precision than nominal {\it Gaia} sources, in the cluster centre they increase the depth of the combined catalogue by three magnitudes and improve the source density by a factor of ten. This first SIF CF data publication already adds great value to the {\it Gaia} catalogue. It demonstrates what to expect for the fourth {\it Gaia} catalogue, which will contain additional sources for all nine SIF CF regions.
Submitted 8 November, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Gaia Focused Product Release: A catalogue of sources around quasars to search for strongly lensed quasars
Authors:
Gaia Collaboration,
A. Krone-Martins,
C. Ducourant,
L. Galluccio,
L. Delchambre,
I. Oreshina-Slezak,
R. Teixeira,
J. Braine,
J. -F. Le Campion,
F. Mignard,
W. Roux,
A. Blazere,
L. Pegoraro,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
A. Barbier,
M. Biermann,
O. L. Creevey,
D. W. Evans,
L. Eyer,
R. Guerra
, et al. (376 additional authors not shown)
Abstract:
Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those expected for most lenses. Aims. We present the Data Processing and Analysis Consortium GravLens pipeline, which was built to analyse all Gaia detections around quasars and to cluster them into sources, thus producing a catalogue of secondary sources around each quasar. We analysed the resulting catalogue to produce scores that indicate source configurations that are compatible with strongly lensed quasars. Methods. GravLens uses the DBSCAN unsupervised clustering algorithm to detect sources around quasars. The resulting catalogue of multiplets is then analysed with several methods to identify potential gravitational lenses. We developed and applied an outlier scoring method, a comparison between the average BP and RP spectra of the components, and we also used an extremely randomised tree algorithm. These methods produce scores to identify the most probable configurations and to establish a list of lens candidates. Results. We analysed the environment of 3 760 032 quasars. A total of 4 760 920 sources, including the quasars, were found within 6" of the quasar positions. This list is given in the Gaia archive. In 87% of cases, the quasar remains a single source, and in 501 385 cases neighbouring sources were detected. We propose a list of 381 lensed candidates, of which we identified 49 as the most promising. Beyond these candidates, the associated tables in this Focused Product Release allow the entire community to explore the unique Gaia data for strong lensing studies further.
Submitted 10 October, 2023;
originally announced October 2023.
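The abstract above states that GravLens uses DBSCAN to group individual detections into sources. As a rough illustration of that clustering step, the sketch below is a plain-Python textbook DBSCAN applied to 2D positional offsets. It is not the GravLens implementation: the `eps` and `min_samples` values, the Euclidean metric, and the sample offsets (imagined here as arcseconds) are all assumptions made for the example.

```python
import math


def dbscan(points, eps, min_samples):
    """Label each 2D point with a cluster id (0, 1, ...) or -1 for noise."""
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbours(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_samples:
            labels[i] = NOISE  # may later be relabelled as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            nbrs = neighbours(j)
            if len(nbrs) >= min_samples:
                queue.extend(nbrs)  # j is a core point, so keep expanding
    return labels


# Hypothetical detection offsets: two tight groups plus one isolated detection.
detections = [
    (0.00, 0.00), (0.02, 0.01), (0.01, -0.02),
    (1.50, 1.20), (1.52, 1.19), (1.49, 1.22),
    (4.00, -3.00),
]
print(dbscan(detections, eps=0.1, min_samples=2))
```

Repeated detections of the same object fall into one cluster, while an isolated detection is labelled as noise; in a real pipeline the neighbourhood radius would need to reflect the instrument's astrometric scatter.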
-
Gaia Focused Product Release: Radial velocity time series of long-period variables
Authors:
Gaia Collaboration,
M. Trabucchi,
N. Mowlavi,
T. Lebzelter,
I. Lecoeur-Taibi,
M. Audard,
L. Eyer,
P. García-Lario,
P. Gavras,
B. Holl,
G. Jevardat de Fombelle,
K. Nienartowicz,
L. Rimoldini,
P. Sartoretti,
R. Blomme,
Y. Frémat,
O. Marchal,
Y. Damerdji,
A. G. A. Brown,
A. Guerrier,
P. Panuzzo,
D. Katz,
G. M. Seabroke,
K. Benson
, et al. (382 additional authors not shown)
Abstract:
The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the methods used to compute variability parameters published in the Gaia FPR. Starting from the DR3 LPVs catalog, we applied filters to construct a sample of sources with high-quality RV measurements. We modeled their RV and photometric time series to derive their periods and amplitudes, and further refined the sample by requiring compatibility between the RV period and at least one of the $G$, $G_{\rm BP}$, or $G_{\rm RP}$ photometric periods. The catalog includes RV time series and variability parameters for 9\,614 sources in the magnitude range $6\lesssim G/{\rm mag}\lesssim 14$, including a flagged top-quality subsample of 6\,093 stars whose RV periods are fully compatible with the values derived from the $G$, $G_{\rm BP}$, and $G_{\rm RP}$ photometric time series. The RV time series contain a mean of 24 measurements per source taken unevenly over a duration of about three years. We identify the great majority of sources (88%) as genuine LPVs, with about half of them showing a pulsation period and the other half displaying a long secondary period. The remaining 12% consists of candidate ellipsoidal binaries. Quality checks against RVs available in the literature show excellent agreement. We provide illustrative examples and cautionary remarks. The publication of RV time series for almost 10\,000 LPVs constitutes, by far, the largest such database available to date in the literature. The availability of simultaneous photometric measurements gives a unique added value to the Gaia catalog (abridged)
Submitted 9 October, 2023;
originally announced October 2023.
-
On Fundamental Proof Structures in First-Order Optimization
Authors:
Baptiste Goujaud,
Aymeric Dieuleveut,
Adrien Taylor
Abstract:
First-order optimization methods have attracted a lot of attention due to their practical success in many applications, including in machine learning. Obtaining convergence guarantees and worst-case performance certificates for first-order methods has become crucial for understanding ingredients underlying efficient methods and for developing new ones. However, obtaining, verifying, and proving such guarantees is often a tedious task. Therefore, a few approaches were proposed for rendering this task more systematic, and even partially automated. In addition to helping researchers find convergence proofs, these tools provide insights on the general structures of such proofs. We aim at presenting those structures, showing how to build convergence guarantees for first-order optimization methods.
Submitted 3 October, 2023;
originally announced October 2023.
-
Cannonball or Bowling Ball: A Proper Motion and Parallax for PSR J0002+6216
Authors:
S. Bruzewski,
F. K. Schinzel,
G. B. Taylor,
P. Demorest,
D. A. Frail,
M. Kerr,
P. Kumar
Abstract:
We report the results of careful astrometric measurements of the Cannonball pulsar J0002+6216 carried out over three years using the High Sensitivity Array (HSA). We significantly refine the proper motion to $μ=35.3\pm0.6$ mas yr$^{-1}$ and place new constraints on the distance, with the overall effect of lowering the velocity and increasing the inferred age to $47.60\pm0.80$ kyr. Although the pulsar is brought more in line with the standard natal kick distribution, this new velocity has implications for the morphology of the pulsar wind nebula that surrounds it, the density of the interstellar medium through which it travels, and the age of the supernova remnant (CTB 1) from which it originates.
Submitted 30 October, 2023; v1 submitted 26 September, 2023;
originally announced September 2023.
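The abstract above links the refined proper motion to a lower velocity. The standard conversion from proper motion to transverse velocity is $v_\perp \approx 4.74\,\mu\,d$ km/s, with $\mu$ in arcsec/yr and $d$ in pc (equivalently mas/yr and kpc). The sketch below applies it to the quoted $\mu = 35.3$ mas/yr. The abstract does not state the distance, so the distances in the loop are placeholders rather than the paper's values.

```python
# Velocity in km/s corresponding to 1 au/yr, which is the usual 4.74 factor.
KM_S_PER_AU_YR = 4.74047


def transverse_velocity_km_s(mu_mas_per_yr, distance_kpc):
    """Transverse velocity from proper motion (mas/yr) and distance (kpc).

    mas/yr times kpc equals arcsec/yr times pc, so the 4.74 factor applies
    directly without further unit conversion.
    """
    return KM_S_PER_AU_YR * mu_mas_per_yr * distance_kpc


mu = 35.3  # mas/yr, proper motion quoted in the abstract
for d_kpc in (1.0, 2.0, 3.0):  # placeholder distances, not from the paper
    v = transverse_velocity_km_s(mu, d_kpc)
    print(f"d = {d_kpc:.1f} kpc -> v_perp = {v:.0f} km/s")
```

Because the velocity scales linearly with distance, the distance constraint matters as much as the proper motion when judging whether the pulsar fits the standard natal kick distribution.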
-
Rate-Induced Transitions in Networked Complex Adaptive Systems: Exploring Dynamics and Management Implications Across Ecological, Social, and Socioecological Systems
Authors:
Vítor V. Vasconcelos,
Flávia M. D. Marquitti,
Theresa Ong,
Lisa C. McManus,
Marcus Aguiar,
Amanda B. Campos,
Partha S. Dutta,
Kristen Jovanelly,
Victoria Junquera,
Jude Kong,
Elisabeth H. Krueger,
Simon A. Levin,
Wenying Liao,
Mingzhen Lu,
Dhruv Mittal,
Mercedes Pascual,
Flávio L. Pinheiro,
Juan Rocha,
Fernando P. Santos,
Peter Sloot,
Chenyang Su,
Benton Taylor,
Eden Tekwa,
Sjoerd Terpstra
, et al. (5 additional authors not shown)
Abstract:
Complex adaptive systems (CASs), from ecosystems to economies, are open systems and inherently dependent on external conditions. While a system can transition from one state to another based on the magnitude of change in external conditions, the rate of change -- irrespective of magnitude -- may also lead to system state changes due to a phenomenon known as a rate-induced transition (RIT). This study presents a novel framework that captures RITs in CASs through a local model and a network extension where each node contributes to the structural adaptability of others. Our findings reveal how RITs occur at a critical environmental change rate, with lower-degree nodes tipping first due to fewer connections and reduced adaptive capacity. High-degree nodes tip later as their adaptability sources (lower-degree nodes) collapse. This pattern persists across various network structures. Our study calls for an extended perspective when managing CASs, emphasizing the need to focus not only on thresholds of external conditions but also the rate at which those conditions change, particularly in the context of the collapse of surrounding systems that contribute to the focal system's resilience. Our analytical method opens a path to designing management policies that mitigate RIT impacts and enhance resilience in ecological, social, and socioecological systems. These policies could include controlling environmental change rates, fostering system adaptability, implementing adaptive management strategies, and building capacity and knowledge exchange. Our study contributes to the understanding of RIT dynamics and informs effective management strategies for complex adaptive systems in the face of rapid environmental change.
Submitted 14 September, 2023;
originally announced September 2023.
-
Analysis of Superconducting Qubit Layouts Using InductEx
Authors:
Sean Crowe,
Benjamin Taylor,
Nicholas Ferrante,
Brad Liu,
Susan Berggren
Abstract:
InductEx is a software tool used for the analysis of integrated circuit designs and extraction of design parameters by way of numerical electromagnetic field solving. This tool was originally developed with Rapid Single Flux Quantum (RSFQ) chips in mind, but it has broad applicability and can be extended to other processes. In this poster, we report a comprehensive analysis of a superconducting aluminum two-qubit chip. This analysis was performed with InductEx.
We report the design of a two-qubit chip which has the characteristics necessary to execute single- and two-qubit gates. Ahead of fabrication, several design characteristics have been extracted from this quantum chip design in order to verify that it satisfies basic design principles of transmon qubits. These characteristics are reported in this poster and they include the calculation of chip anharmonicities, qubit frequencies, resonator frequencies as well as g-factors and dispersive shifts. Design constraints which are satisfied by these extracted parameters are discussed. Additionally, qualitative aspects of the chip have been obtained from current density maps and are reported here.
Taken as a whole, this analysis demonstrates the broad applicability of InductEx to integrated circuit design and particularly to the problem of quantum circuit layout optimization.
Submitted 13 September, 2023;
originally announced September 2023.
-
Linking Symptom Inventories using Semantic Textual Similarity
Authors:
Eamonn Kennedy,
Shashank Vadlamani,
Hannah M Lindsey,
Kelly S Peterson,
Kristen Dams OConnor,
Kenton Murray,
Ronak Agarwal,
Houshang H Amiri,
Raeda K Andersen,
Talin Babikian,
David A Baron,
Erin D Bigler,
Karen Caeyenberghs,
Lisa Delano-Wood,
Seth G Disner,
Ekaterina Dobryakova,
Blessen C Eapen,
Rachel M Edelstein,
Carrie Esopenko,
Helen M Genova,
Elbert Geuze,
Naomi J Goodrich-Hunsaker,
Jordan Grafman,
Asta K Haberg,
Cooper B Hodges
, et al. (57 additional authors not shown)
Abstract:
An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores across previously incongruous symptom inventories. We tested the ability of four pre-trained STS models to screen thousands of symptom description pairs for related content - a challenging task typically requiring expert panels. Models were tasked to predict symptom severity across four different inventories for 6,607 participants drawn from 16 international data sources. The STS approach achieved 74.8% accuracy across five tasks, outperforming other models tested. This work suggests that incorporating contextual, semantic information can assist expert decision-making processes, yielding gains for both general and disease-specific clinical assessment.
Submitted 8 September, 2023;
originally announced September 2023.
-
Compact Symmetric Objects -- III Evolution of the High-Luminosity Branch and a Possible Connection with Tidal Disruption Events
Authors:
A. C. S. Readhead,
V. Ravi,
R. D. Blandford,
A. G. Sullivan,
J. Somalwar,
M. C. Begelman,
M. Birkinshaw,
I. Liodakis,
M. L. Lister,
T. J. Pearson,
G. B. Taylor,
P. N. Wilkinson,
N. Globus,
S. Kiehlmann,
C. R. Lawrence,
D. Murphy,
S. O'Neill,
V. Pavlidou,
E. Sheldahl,
A. Siemiginowska,
K. Tassis
Abstract:
We use a sample of 54 Compact Symmetric Objects (CSOs) to confirm that there are two unrelated CSO classes: an edge-dimmed, low-luminosity class (CSO~1), and an edge-brightened, high-luminosity class (CSO~2). Using blind tests, we show that CSO~2s consist of three sub-classes: CSO~2.0, having prominent hot-spots at the leading edges of narrow jets and/or narrow lobes; CSO~2.2, without prominent hot-spots, and with broad jets and/or lobes; and CSO~2.1, which exhibit mixed properties. Most CSO~2s do not evolve into larger jetted-AGN, but spend their whole life-cycle as CSOs of size $\lesssim$500 pc and age $\lesssim$5000 yr. The minimum energies needed to produce the radio luminosity and structure in CSO~2s range from $\sim10^{-4}\,M_\odot{c}^2$ to $\sim7\,M_\odot{c}^2$. We show that the transient nature of most CSO~2s, and their birthrate, can be explained through ignition in the tidal disruption events of giant stars. We also consider possibilities of tapping the spin energy of the supermassive black hole, and tapping the energy of the accretion disk. Our results demonstrate that CSOs constitute a large family of AGN in which we have thus far studied only the brightest. More comprehensive CSO studies, with higher sensitivity, resolution, and dynamic range, will revolutionize our understanding of AGN and the central engines that power them.
Submitted 26 November, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Compact Symmetric Objects -- II Confirmation of a Distinct Population of High-Luminosity Jetted Active Galaxies
Authors:
S. Kiehlmann,
A. C. S. Readhead,
S. O'Neill,
P. N. Wilkinson,
M. L. Lister,
I. Liodakis,
S. Bruzewski,
V. Pavlidou,
T. J. Pearson,
E. Sheldahl,
A. Siemiginowska,
K. Tassis,
G. B. Taylor
Abstract:
Compact Symmetric Objects (CSOs) are compact (<1 kpc), jetted Active Galactic Nuclei (AGN), whose jet axes are not aligned close to the line of sight, and whose observed emission is not predominantly relativistically boosted towards us. Two classes of CSOs have previously been identified: approximately one fifth are edge-dimmed and designated as CSO 1s, while the rest are edge-brightened and designated as CSO 2s. This paper focuses almost exclusively on CSO 2s. Using complete samples of CSO 2s we present three independent lines of evidence, based on their relative numbers, redshift distributions, and size distributions, which show conclusively that the vast majority (> 99%) of CSO 2s do not evolve into larger-scale radio sources. These CSO 2s belong to a distinct population of jetted-AGN, which should be characterized as ``short-lived'' compared to the classes of larger jetted-AGN, as opposed to ``young''. We show that there is a sharp upper cutoff in the CSO 2 size distribution at $\approx 500$ pc. The distinct differences between most CSO 2s and other jetted-AGN provide a crucial new time-domain window on the formation and evolution of relativistic jets in AGN and the supermassive black holes that drive them.
Submitted 26 November, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Compact Symmetric Objects -- I Towards a Comprehensive Bona Fide Catalog
Authors:
S. Kiehlmann,
M. L. Lister,
A. C. S. Readhead,
I. Liodakis,
S. O'Neill,
T. J. Pearson,
E. Sheldahl,
A. Siemiginowska,
K. Tassis,
G. B. Taylor,
P. N. Wilkinson
Abstract:
Compact Symmetric Objects (CSOs) are jetted Active Galactic Nuclei (AGN) with overall projected size <1 kpc. The classification was introduced to distinguish these objects from the majority of compact jetted-AGN in centimeter wavelength very long baseline interferometry observations, where the observed emission is relativistically boosted towards the observer. The original classification criteria for CSOs were: (i) evidence of emission on both sides of the center of activity, and (ii) overall size <1 kpc. However some relativistically boosted objects with jet axes close to the line of sight appear symmetric and have been mis-classified as CSOs, thereby undermining the CSO classification. This is because two essential CSO properties, pointed out in the original papers, have been neglected: (iii) low variability, and (iv) low apparent speeds along the jets. As a first step towards creating a comprehensive catalog of ``bona fide'' CSOs, we identify 79 bona fide CSOs, including 15 objects claimed as confirmed CSOs here for the first time, that match the CSO selection criteria. This sample of bona fide CSOs can be used for astrophysical studies of CSOs without contamination by mis-classified CSOs. We show that the fraction of CSOs in complete flux density limited AGN samples with S$_{\rm 5\,GHz}$ >700 mJy is between $(6.8\pm1.6)$% and $(8.5\pm1.8)$%.
Submitted 26 November, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Counter-examples in first-order optimization: a constructive approach
Authors:
Baptiste Goujaud,
Aymeric Dieuleveut,
Adrien Taylor
Abstract:
While many approaches were developed for obtaining worst-case complexity bounds for first-order optimization methods in the last years, there remain theoretical gaps in cases where no such bound can be found. In such cases, it is often unclear whether no such bound exists (e.g., because the algorithm might fail to systematically converge) or simply if the current techniques do not allow finding them.
In this work, we propose an approach to automate the search for cyclic trajectories generated by first-order methods. This provides a constructive approach to show that no appropriate complexity bound exists, thereby complementing the approaches providing sufficient conditions for convergence. Using this tool, we provide ranges of parameters for which some of the famous heavy-ball, Nesterov accelerated gradient, inexact gradient descent, and three-operator splitting algorithms fail to systematically converge, and show that it nicely complements existing tools searching for Lyapunov functions.
Submitted 30 June, 2023; v1 submitted 18 March, 2023;
originally announced March 2023.
-
Automated tight Lyapunov analysis for first-order methods
Authors:
Manu Upadhyaya,
Sebastian Banert,
Adrien B. Taylor,
Pontus Giselsson
Abstract:
We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider (i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, (ii) first-order methods that can be written as a linear system on state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and (iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality gap convergence in the Chambolle-Pock method when the linear operator is the identity mapping.
Submitted 27 February, 2024; v1 submitted 13 February, 2023;
originally announced February 2023.
-
Resolving the bow shock and tail of the cannonball pulsar PSR J0002+6216
Authors:
P. Kumar,
F. K. Schinzel,
G. B. Taylor,
M. Kerr,
D. Castro,
U. Rau,
S. Bhatnagar
Abstract:
We present X-ray and radio observations of the recently-discovered bow shock pulsar wind nebula associated with PSR J0002+6216, characterizing the PWN morphology, which was unresolved in previous studies. The multi-frequency, multi-epoch Very Large Array radio observations reveal a cometary tail trailing the pulsar and extending up to 5.3', with multiple kinks along the emission. The presented radio continuum images from multi-configuration broadband VLA observations are one of the first results from the application of multi-term multi-frequency synthesis deconvolution in combination with the awproject gridder implemented in the Common Astronomy Software Applications package (CASA). The X-ray emission observed with Chandra extends to only 21'', fades quickly, and has some hot spots present along the extended radio emission. These kinks could indicate the presence of density variation in the local ISM or turbulence. The bow shock standoff distance estimates a small bow shock region with a size 0.003-0.009 pc, consistent with the pulsar spin-down power of Edot=1.51x10^35 ergs/s estimated from timing. The high-resolution radio image reveals the presence of an asymmetry in the bow shock region which is also present in the X-ray image. The broadband radio image shows an unusually steep spectrum along with a flat-spectrum sheath, which could indicate varying opacity or energy injection into the region. Spatially-resolved X-ray spectra provide marginal evidence of synchrotron cooling along the extended tail. Our analysis of the X-ray data also shows that this pulsar has a low spin-down power and one of the lowest X-ray efficiencies observed in these objects.
Submitted 18 February, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses
Authors:
Shuvomoy Das Gupta,
Robert M. Freund,
Xu Andy Sun,
Adrien Taylor
Abstract:
We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization, while having relatively incomplete analyses. Using our computer-assisted approach, we establish novel complexity bounds for the Polak-Ribière-Polyak (PRP) and the Fletcher-Reeves (FR) NCGMs for smooth strongly convex minimization. In particular, we construct mathematical proofs that establish the first non-asymptotic convergence bound for FR (which is historically the first developed NCGM), and a much improved non-asymptotic convergence bound for PRP. Additionally, we provide simple adversarial examples on which these methods do not perform better than gradient descent with exact line search, leaving very little room for improvements on the same class of problems.
Submitted 18 April, 2024; v1 submitted 4 January, 2023;
originally announced January 2023.
-
Biologically Plausible Learning on Neuromorphic Hardware Architectures
Authors:
Christopher Wolters,
Brady Taylor,
Edward Hanson,
Xiaoxuan Yang,
Ulf Schlichtmann,
Yiran Chen
Abstract:
With an ever-growing number of parameters defining increasingly complex networks, Deep Learning has led to several breakthroughs surpassing human performance. As a result, data movement for these millions of model parameters causes a growing imbalance known as the memory wall. Neuromorphic computing is an emerging paradigm that confronts this imbalance by performing computations directly in analog memories. On the software side, the sequential Backpropagation algorithm prevents efficient parallelization and thus fast convergence. A novel method, Direct Feedback Alignment, resolves inherent layer dependencies by directly passing the error from the output to each layer. At the intersection of hardware/software co-design, there is a demand for developing algorithms that are tolerant of hardware nonidealities. Therefore, this work explores the interrelationship of implementing bio-plausible learning in-situ on neuromorphic hardware, emphasizing energy, area, and latency constraints. Using the benchmarking framework DNN+NeuroSim, we investigate the impact of hardware nonidealities and quantization on algorithm performance, as well as how network topologies and algorithm-level design choices can scale latency, energy and area consumption of a chip. To the best of our knowledge, this work is the first to compare the impact of different learning algorithms on Compute-In-Memory-based hardware and vice versa. The best results achieved for accuracy remain Backpropagation-based, notably when facing hardware imperfections. Direct Feedback Alignment, on the other hand, allows for significant speedup due to parallelization, reducing training time by a factor approaching N for N-layered networks.
Submitted 11 April, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
A Combined Radio Multi-Survey Catalog of Fermi Unassociated Sources
Authors:
S. Bruzewski,
F. K. Schinzel,
G. B. Taylor
Abstract:
Approximately one-third of existing $γ$-ray sources identified by the $\textit{Fermi Gamma-Ray Space Telescope}$ are considered to be unassociated, with no known counterpart at other frequencies/wavelengths. These sources have been the subject of intense scrutiny and observational effort during the observatory's mission lifetime, and here we present a method of leveraging existing radio catalogs to examine these sources without the need for specific dedicated observations, which can be costly and complex. Via the inclusion of many sensitive low-frequency catalogs we specifically target steep spectrum sources such as pulsars. This work has found steep-spectrum radio sources contained inside 591 $\textit{Fermi}$ unassociated fields, with at least 21 of them being notable for having pulsar-like $γ$-ray properties as well. We also identify a number of other fields of interest based on various radio and $γ$-ray selections.
Submitted 9 December, 2022;
originally announced December 2022.
-
Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity
Authors:
Eduard Gorbunov,
Adrien Taylor,
Samuel Horváth,
Gauthier Gidel
Abstract:
Algorithms for min-max optimization and variational inequalities are often studied under monotonicity assumptions. Motivated by non-monotone machine learning applications, we follow the line of works [Diakonikolas et al., 2021, Lee and Kim, 2021, Pethick et al., 2022, Böhm, 2022] aiming at going beyond monotonicity by considering the weaker negative comonotonicity assumption. In particular, we provide tight complexity analyses for the Proximal Point, Extragradient, and Optimistic Gradient methods in this setup, closing some questions on their working guarantees beyond monotonicity.
Submitted 18 July, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Quadratic minimization: from conjugate gradient to an adaptive Heavy-ball method with Polyak step-sizes
Authors:
Baptiste Goujaud,
Adrien Taylor,
Aymeric Dieuleveut
Abstract:
In this work, we propose an adaptive variation on the classical Heavy-ball method for convex quadratic minimization. The adaptivity crucially relies on so-called "Polyak step-sizes", which consist in using the knowledge of the optimal value of the optimization problem at hand instead of problem parameters such as a few eigenvalues of the Hessian of the problem. This method happens to also be equivalent to a variation of the classical conjugate gradient method, and thereby inherits many of its attractive features, including its finite-time convergence, instance optimality, and its worst-case convergence rates.
The classical gradient method with Polyak step-sizes is known to behave very well in situations in which it can be used, and the question of whether incorporating momentum in this method is possible and can improve the method itself appeared to be open. We provide a definitive answer to this question for minimizing convex quadratic functions, an arguably necessary first step for developing such methods in more general setups.
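For orientation, the classical gradient method with Polyak step-sizes mentioned above can be sketched as follows. This is a minimal illustration, not the adaptive Heavy-ball method proposed in the paper: it uses the standard Polyak rule, step = (f(x) - f*) / ||grad f(x)||^2, on a toy diagonal quadratic whose optimal value f* = 0 is known. The function names, the matrix entries, and the iteration count are all assumptions chosen for the example.

```python
def f(x, a):
    """Toy convex quadratic f(x) = 0.5 * sum(a_i * x_i**2); its optimal value is 0."""
    return 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))


def grad(x, a):
    """Gradient of the toy quadratic: component i is a_i * x_i."""
    return [ai * xi for ai, xi in zip(a, x)]


def polyak_gradient_descent(x0, a, f_star=0.0, iters=300):
    """Gradient descent with the classical Polyak step-size.

    The step uses the known optimal value f_star instead of
    curvature information such as eigenvalues of the Hessian.
    """
    x = list(x0)
    for _ in range(iters):
        g = grad(x, a)
        g2 = sum(gi * gi for gi in g)
        if g2 == 0.0:  # already at a stationary point
            break
        step = (f(x, a) - f_star) / g2
        x = [xi - step * gi for xi, gi in zip(x, g)]
    return x


# Hypothetical problem data: Hessian eigenvalues 1 and 4, starting point (1, 1).
a = [1.0, 4.0]
x_final = polyak_gradient_descent([1.0, 1.0], a)
final_value = f(x_final, a)
print(final_value)
```

The only problem-specific input is `f_star`; no step-size tuning or eigenvalue estimate is needed, which is the property the abstract highlights.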
Submitted 12 October, 2022;
originally announced October 2022.
-
The NLP Sandbox: an efficient model-to-data system to enable federated and unbiased evaluation of clinical NLP models
Authors:
Yao Yan,
Thomas Yu,
Kathleen Muenzen,
Sijia Liu,
Connor Boyle,
George Koslowski,
Jiaxin Zheng,
Nicholas Dobbins,
Clement Essien,
Hongfang Liu,
Larsson Omberg,
Meliha Yestigen,
Bradley Taylor,
James A Eddy,
Justin Guinney,
Sean Mooney,
Thomas Schaffter
Abstract:
Objective: The evaluation of natural language processing (NLP) models for clinical text de-identification relies on the availability of clinical notes, which is often restricted due to privacy concerns. The NLP Sandbox is an approach for alleviating the lack of data and evaluation frameworks for NLP models by adopting a federated, model-to-data approach. This enables unbiased federated model evaluation without the need for sharing sensitive data from multiple institutions. Materials and Methods: We leveraged the Synapse collaborative framework, containerization software, and OpenAPI generator to build the NLP Sandbox (nlpsandbox.io). We evaluated two state-of-the-art NLP de-identification focused annotation models, Philter and NeuroNER, using data from three institutions. We further validated model performance using data from an external validation site. Results: We demonstrated the usefulness of the NLP Sandbox through de-identification clinical model evaluation. The external developer was able to incorporate their model into the NLP Sandbox template and provide user experience feedback. Discussion: We demonstrated the feasibility of using the NLP Sandbox to conduct a multi-site evaluation of clinical text de-identification models without the sharing of data. Standardized model and data schemas enable smooth model transfer and implementation. To generalize the NLP Sandbox, work is required on the part of data owners and model developers to develop suitable and standardized schemas and to adapt their data or model to fit the schemas. Conclusions: The NLP Sandbox lowers the barrier to utilizing clinical data for NLP model evaluation and facilitates federated, multi-site, unbiased evaluation of NLP models.
Submitted 28 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Reflectance spectra of Solar System small bodies
Authors:
Gaia Collaboration,
L. Galluccio,
M. Delbo,
F. De Angeli,
T. Pauwels,
P. Tanga,
F. Mignard,
A. Cellino,
A. G. A. Brown,
K. Muinonen,
A. Penttila,
S. Jordan,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi
, et al. (422 additional authors not shown)
Abstract:
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was derived from measurements obtained by means of the Blue and Red photometers (BP/RP), which were binned in 16 discrete wavelength bands. We describe the processing of the Gaia spectral data of SSOs, explaining both the criteria used to select the subset of asteroid spectra published in Gaia DR3, and the different steps of our internal validation procedures. In order to further assess the quality of Gaia SSO reflectance spectra, we carried out external validation against SSO reflectance spectra obtained from ground-based and space-borne telescopes and available in the literature. For each selected SSO, an epoch reflectance was computed by dividing the calibrated spectrum observed by the BP/RP at each transit on the focal plane by the mean spectrum of a solar analogue. The latter was obtained by averaging the Gaia spectral measurements of a selected sample of stars known to have very similar spectra to that of the Sun. Finally, a mean of the epoch reflectance spectra was calculated in 16 spectral bands for each SSO. The agreement between Gaia mean reflectance spectra and those available in the literature is good for bright SSOs, regardless of their taxonomic spectral class. We identify an increase in the spectral slope of S-type SSOs with increasing phase angle. Moreover, we show that the spectral slope increases and the depth of the 1 um absorption band decreases for increasing ages of S-type asteroid families.
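The reflectance computation described above (divide each epoch spectrum by the mean solar-analogue spectrum, then average the epoch reflectances per band) can be illustrated with a small sketch. The numbers below are made-up toy values in three bands rather than the 16 used in Gaia DR3, the function names are hypothetical, and the plain unweighted mean is an assumption; the abstract does not state how the epochs are weighted.

```python
def epoch_reflectance(sso_spectrum, solar_analogue):
    """Band-by-band ratio of one transit's calibrated spectrum to the solar analogue."""
    return [flux / sun for flux, sun in zip(sso_spectrum, solar_analogue)]


def mean_reflectance(epoch_spectra, solar_analogue):
    """Unweighted per-band mean of the epoch reflectances (weighting is an assumption)."""
    epochs = [epoch_reflectance(spec, solar_analogue) for spec in epoch_spectra]
    n_epochs = len(epochs)
    # zip(*epochs) regroups the values so that each tuple holds one band across all epochs.
    return [sum(band) / n_epochs for band in zip(*epochs)]


# Toy data: mean solar-analogue spectrum and two transits of one object, three bands each.
solar = [2.0, 4.0, 8.0]
transits = [
    [2.0, 4.0, 8.0],   # epoch reflectance 1.0 in every band
    [4.0, 8.0, 16.0],  # epoch reflectance 2.0 in every band
]
mean_refl = mean_reflectance(transits, solar)
print(mean_refl)
```

Dividing by the solar analogue removes the shape of the incident sunlight, so the remaining band-to-band variation reflects the surface of the object rather than the Sun's spectrum.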
Submitted 24 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Pulsations in main sequence OBAF-type stars
Authors:
Gaia Collaboration,
J. De Ridder,
V. Ripepi,
C. Aerts,
L. Palaversa,
L. Eyer,
B. Holl,
M. Audard,
L. Rimoldini,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren
, et al. (423 additional authors not shown)
Abstract:
The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators (M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), delta Sct, and gamma Dor stars. These stars are often multi-periodic and display low amplitudes, making them challenging targets to analyse with sparse time series. All datasets used in this analysis are part of the Gaia DR3 data release. The photometric time series were used to perform a Fourier analysis, while the global astrophysical parameters necessary for the empirical instability strips were taken from the Gaia DR3 gspphot tables, and the vsini data were taken from the Gaia DR3 esphs tables. We show that for nearby OBAF-type pulsators, the Gaia DR3 data are precise and accurate enough to pinpoint them in the Hertzsprung-Russell diagram. We find empirical instability strips covering broader regions than theoretically predicted. In particular, our study reveals the presence of fast rotating gravity-mode pulsators outside the strips, as well as the co-existence of rotationally modulated variables inside the strips as reported before in the literature. We derive an extensive period-luminosity relation for delta Sct stars and provide evidence that the relation features different regimes depending on the oscillation period. Finally, we demonstrate how stellar rotation attenuates the amplitude of the dominant oscillation mode of delta Sct stars.
Submitted 16 August, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: A Golden Sample of Astrophysical Parameters
Authors:
Gaia Collaboration,
O. L. Creevey,
L. M. Sarro,
A. Lobel,
E. Pancino,
R. Andrae,
R. L. Smart,
G. Clementini,
U. Heiter,
A. J. Korn,
M. Fouesneau,
Y. Frémat,
F. De Angeli,
A. Vallenari,
D. L. Harrison,
F. Thévenin,
C. Reylé,
R. Sordo,
A. Garofalo,
A. G. A. Brown,
L. Eyer,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (423 additional authors not shown)
Abstract:
Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples of the stars of interest. We validate our results by using the Gaia catalogue itself and by comparison with external data. We have produced six homogeneous samples of stars with high quality astrophysical parameters across the HR diagram for the community to exploit. We first focus on three samples that span a large parameter space: young massive disk stars (~3M), FGKM spectral type stars (~3M), and UCDs (~20K). We provide these sources along with additional information (either a flag or complementary parameters) as tables that are made available in the Gaia archive. We furthermore identify 15740 bona fide carbon stars, 5863 solar-analogues, and provide the first homogeneous set of stellar parameters of the Spectro Photometric Standard Stars. We use a subset of the OBA sample to illustrate its usefulness to analyse the Milky Way rotation curve. We then use the properties of the FGKM stars to analyse known exoplanet systems. We also analyse the ages of some unseen UCD-companions to the FGKM stars. We additionally predict the colours of the Sun in various passbands (Gaia, 2MASS, WISE) using the solar-analogue sample.
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: The extragalactic content
Authors:
Gaia Collaboration,
C. A. L. Bailer-Jones,
D. Teyssier,
L. Delchambre,
C. Ducourant,
D. Garabato,
D. Hatzidimitriou,
S. A. Klioner,
L. Rimoldini,
I. Bellas-Velidis,
R. Carballo,
M. I. Carnerero,
C. Diener,
M. Fouesneau,
L. Galluccio,
P. Gavras,
A. Krone-Martins,
C. M. Raiteri,
R. Teixeira,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (422 additional authors not shown)
Abstract:
The Gaia Galactic survey mission is designed and optimized to obtain astrometry, photometry, and spectroscopy of nearly two billion stars in our Galaxy. Yet as an all-sky multi-epoch survey, Gaia also observes several million extragalactic objects down to a magnitude of G~21 mag. Due to the nature of the Gaia onboard selection algorithms, these are mostly point-source-like objects. Using data provided by the satellite, we have identified quasar and galaxy candidates via supervised machine learning methods, and estimate their redshifts using the low resolution BP/RP spectra. We further characterise the surface brightness profiles of host galaxies of quasars and of galaxies from pre-defined input lists. Here we give an overview of the processing of extragalactic objects, describe the data products in Gaia DR3, and analyse their properties. Two integrated tables contain the main results for a high completeness, but low purity (50-70%), set of 6.6 million candidate quasars and 4.8 million candidate galaxies. We provide queries that select purer sub-samples of these containing 1.9 million probable quasars and 2.9 million probable galaxies (both 95% purity). We also use high quality BP/RP spectra of 43 thousand high probability quasars over the redshift range 0.05-4.36 to construct a composite quasar spectrum spanning restframe wavelengths from 72-100 nm.
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Stellar multiplicity, a teaser for the hidden treasure
Authors:
Gaia Collaboration,
F. Arenou,
C. Babusiaux,
M. A. Barstow,
S. Faigler,
A. Jorissen,
P. Kervella,
T. Mazeh,
N. Mowlavi,
P. Panuzzo,
J. Sahlmann,
S. Shahaf,
A. Sozzetti,
N. Bauchet,
Y. Damerdji,
P. Gavras,
P. Giacobbe,
E. Gosset,
J. -L. Halbwachs,
B. Holl,
M. G. Lattanzi,
N. Leclerc,
T. Morel,
D. Pourbaix,
P. Re Fiorentin
, et al. (425 additional authors not shown)
Abstract:
The Gaia DR3 Catalogue contains for the first time about eight hundred thousand solutions with either orbital elements or trend parameters for astrometric, spectroscopic and eclipsing binaries, and combinations of them. This paper aims to illustrate the huge potential of this large non-single star catalogue. Using the orbital solutions together with models of the binaries, a catalogue of tens of thousands of stellar masses, or lower limits, partly together with consistent flux ratios, has been built. Properties concerning the completeness of the binary catalogues are discussed, statistical features of the orbital elements are explained and a comparison with other catalogues is performed. Illustrative applications are proposed for binaries across the H-R diagram. The binarity is studied in the RGB/AGB and a search for genuine SB1 among long-period variables is performed. The discovery of new EL CVn systems illustrates the potential of combining variability and binarity catalogues. Potential compact object companions are presented, mainly white dwarf companions or double degenerates, but one candidate neutron star is also presented. Towards the bottom of the main sequence, the orbits of previously-suspected binary ultracool dwarfs are determined and new candidate binaries are discovered. The long awaited contribution of Gaia to the analysis of the substellar regime shows the brown dwarf desert around solar-type stars using true, rather than minimum, masses, and provides new important constraints on the occurrence rates of substellar companions to M dwarfs. Several dozen new exoplanets are proposed, including two with validated orbital solutions and one super-Jupiter orbiting a white dwarf, all being candidates requiring confirmation. Beside binarity, higher order multiple systems are also found.
Submitted 11 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Chemical cartography of the Milky Way
Authors:
Gaia Collaboration,
A. Recio-Blanco,
G. Kordopatis,
P. de Laverny,
P. A. Palicio,
A. Spagna,
L. Spina,
D. Katz,
P. Re Fiorentin,
E. Poggio,
P. J. McMillan,
A. Vallenari,
M. G. Lattanzi,
G. M. Seabroke,
L. Casamiquela,
A. Bragaglia,
T. Antoja,
C. A. L. Bailer-Jones,
R. Andrae,
M. Fouesneau,
M. Cropper,
T. Cantat-Gaudin,
U. Heiter,
A. Bijaoui,
A. G. A. Brown
, et al. (425 additional authors not shown)
Abstract:
Gaia DR3 opens a new era of all-sky spectral analysis of stellar populations thanks to the nearly 5.6 million stars observed by the RVS and parametrised by the GSP-spec module. The all-sky Gaia chemical cartography allows a powerful and precise chemo-dynamical view of the Milky Way with unprecedented spatial coverage and statistical robustness. First, it reveals the strong vertical symmetry of the Galaxy and the flared structure of the disc. Second, the observed kinematic disturbances of the disc -- seen as phase space correlations -- and kinematic or orbital substructures are associated with chemical patterns that favour stars with enhanced metallicities and lower [alpha/Fe] abundance ratios compared to the median values in the radial distributions. This is detected both for young objects that trace the spiral arms and older populations. Several alpha, iron-peak elements and at least one heavy element trace the thin and thick disc properties in the solar cylinder. Third, young disc stars show a recent chemical impoverishment in several elements. Fourth, the largest chemo-dynamical sample of open clusters analysed so far shows a steepening of the radial metallicity gradient with age, which is also observed in the young field population. Finally, the Gaia chemical data have the required coverage and precision to unveil galaxy accretion debris and heated disc stars on halo orbits through their [alpha/Fe] ratio, and to allow the study of the chemo-dynamical properties of globular clusters. Gaia DR3 chemo-dynamical diagnostics open new horizons before the era of ground-based wide-field spectroscopic surveys. They unveil a complex Milky Way that is the outcome of an eventful evolution, shaping it to the present day (abridged).
Submitted 11 June, 2022;
originally announced June 2022.
-
Optimal first-order methods for convex functions with a quadratic upper bound
Authors:
Baptiste Goujaud,
Adrien Taylor,
Aymeric Dieuleveut
Abstract:
We analyze worst-case convergence guarantees of first-order optimization methods over a function class extending that of smooth and convex functions. This class contains convex functions that admit a simple quadratic upper bound. Its study is motivated by its stability under minor perturbations. We provide a thorough analysis of first-order methods, including worst-case convergence guarantees for several algorithms, and demonstrate that some of them achieve the optimal worst-case guarantee over the class. We support our analysis by numerical validation of worst-case guarantees using performance estimation problems. A few observations can be drawn from this analysis, particularly regarding the optimality (resp. and adaptivity) of the heavy-ball method (resp. heavy-ball with line-search). Finally, we show how our analysis can be leveraged to obtain convergence guarantees over more complex classes of functions. Overall, this study brings insights on the choice of function classes over which standard first-order methods have working worst-case guarantees.
Submitted 30 May, 2022;
originally announced May 2022.
-
Deep VLBI Observations Challenge Previous Evidence of a Binary Supermassive Black Hole Residing in the Seyfert Galaxy NGC 7674
Authors:
Peter Breiding,
Sarah Burke-Spolaor,
Tao An,
Karishma Bansal,
Prashanth Mohan,
Gregory B. Taylor,
Yingkang Zhang
Abstract:
Previous Ku-band (15 GHz) imaging with data obtained from the Very Long Baseline Array (VLBA) had shown two compact, sub-pc components at the location of a presumed kpc-scale radio core in the Seyfert galaxy NGC 7674. It was then presumed that these two unresolved and compact components were dual radio cores corresponding to two supermassive black holes (SMBHs) accreting surrounding gas and launching radio-bright relativistic jets. However, utilizing the original VLBA dataset used to claim the detection of a binary SMBH, in addition to later multi-epoch/multi-frequency datasets obtained from both the VLBA and the European VLBI Network, we find no evidence to support the presence of a binary SMBH. We place stringent upper limits to the flux densities of any sub-pc-scale radio cores which are at least an order of magnitude lower than the original VLBI radio-core detections, directly challenging the original binary SMBH detection claim. With this in mind, we discuss the possible reasons for the non-detection of any VLBI radio cores in our imaging, the possibility of a binary SMBH still residing in NGC 7674, and the prospect of future observations shedding further light on the true nature of this active galactic nucleus.
Submitted 28 May, 2022;
originally announced May 2022.
-
A systematic approach to Lyapunov analyses of continuous-time models in convex optimization
Authors:
Céline Moucer,
Adrien Taylor,
Francis Bach
Abstract:
First-order methods are often analyzed via their continuous-time models, where their worst-case convergence properties are usually approached via Lyapunov functions. In this work, we provide a systematic and principled approach to find and verify Lyapunov functions for classes of ordinary and stochastic differential equations. More precisely, we extend the performance estimation framework, originally proposed by Drori and Teboulle [10], to continuous-time models. We retrieve convergence results comparable to those of discrete methods using fewer assumptions and convexity inequalities, and provide new results for stochastic accelerated gradient flows.
Submitted 11 March, 2024; v1 submitted 25 May, 2022;
originally announced May 2022.
-
Fast Stochastic Composite Minimization and an Accelerated Frank-Wolfe Algorithm under Parallelization
Authors:
Benjamin Dubois-Taine,
Francis Bach,
Quentin Berthet,
Adrien Taylor
Abstract:
We consider the problem of minimizing the sum of two convex functions. One of those functions has Lipschitz-continuous gradients, and can be accessed via stochastic oracles, whereas the other is "simple". We provide a Bregman-type algorithm with accelerated convergence in function values to a ball containing the minimum. The radius of this ball depends on problem-dependent constants, including the variance of the stochastic oracle. We further show that this algorithmic setup naturally leads to a variant of Frank-Wolfe achieving acceleration under parallelization. More precisely, when minimizing a smooth convex function on a bounded domain, we show that one can achieve an $ε$ primal-dual gap (in expectation) in $\tilde{O}(1/\sqrt{ε})$ iterations, by only accessing gradients of the original function and a linear maximization oracle with $O(1/\sqrt{ε})$ computing units in parallel. We illustrate this fast convergence on synthetic numerical experiments.
Submitted 12 October, 2022; v1 submitted 25 May, 2022;
originally announced May 2022.
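For readers unfamiliar with the oracle model mentioned in the abstract, the following is a minimal sketch of the classical (non-accelerated, sequential) Frank-Wolfe iteration, not the parallel variant proposed in the paper. It assumes a toy problem chosen purely for illustration: minimizing half the squared distance to a hypothetical vector `TARGET` over the probability simplex, where the linear oracle reduces to picking the vertex with the smallest gradient coordinate. All function and variable names are illustrative.

```python
def frank_wolfe_simplex(grad, dim, iters):
    """Classical Frank-Wolfe over the probability simplex (illustrative sketch)."""
    x = [1.0 / dim] * dim  # start at the barycentre, which is feasible
    for k in range(iters):
        g = grad(x)
        # Linear oracle: over the simplex, <g, s> is minimised at the vertex
        # whose gradient coordinate is smallest.
        i = min(range(dim), key=lambda j: g[j])
        step = 2.0 / (k + 2)  # classical open-loop step size
        # Convex combination of the current point and the selected vertex.
        x = [(1.0 - step) * xj for xj in x]
        x[i] += step
    return x


TARGET = [0.2, 0.5, 0.9]  # hypothetical data, assumed for this example only


def objective(x):
    """f(x) = 0.5 * ||x - TARGET||^2, a smooth convex function."""
    return 0.5 * sum((xj - tj) ** 2 for xj, tj in zip(x, TARGET))


def gradient(x):
    """Gradient of the objective above."""
    return [xj - tj for xj, tj in zip(x, TARGET)]


x_fw = frank_wolfe_simplex(gradient, len(TARGET), 2000)
print(x_fw, objective(x_fw))
```

Each iteration needs only one gradient and one call to the linear oracle, and the iterate stays feasible without any projection, which is the property that makes oracle-parallel variants such as the one in the abstract attractive.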
-
Last-Iterate Convergence of Optimistic Gradient Method for Monotone Variational Inequalities
Authors:
Eduard Gorbunov,
Adrien Taylor,
Gauthier Gidel
Abstract:
The Past Extragradient (PEG) [Popov, 1980] method, also known as the Optimistic Gradient method, has known a recent gain in interest in the optimization community with the emergence of variational inequality formulations for machine learning. Recently, in the unconstrained case, Golowich et al. [2020] proved that a $O(1/N)$ last-iterate convergence rate in terms of the squared norm of the operator can be achieved for Lipschitz and monotone operators with a Lipschitz Jacobian. In this work, by introducing a novel analysis through potential functions, we show that (i) this $O(1/N)$ last-iterate convergence can be achieved without any assumption on the Jacobian of the operator, and (ii) it can be extended to the constrained case, which was not derived before even under Lipschitzness of the Jacobian. The proof is significantly different from the one known from Golowich et al. [2020], and its discovery was computer-aided. Those results close the open question of the last iterate convergence of PEG for monotone variational inequalities.
Submitted 31 October, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
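As a small illustration of the method named in the abstract, the sketch below runs the unconstrained Past Extragradient update on an assumed toy problem, the bilinear saddle point min_x max_y xy, whose monotone operator is F(x, y) = (y, -x). The step size, starting point, and function names are illustrative assumptions. On this easy instance the last iterate converges faster than the worst-case $O(1/N)$ rate discussed above; plain simultaneous gradient descent-ascent is included only as a contrast.

```python
import math


def operator(z):
    """Monotone operator of the toy bilinear saddle problem min_x max_y x*y."""
    x, y = z
    return (y, -x)


def norm(z):
    """Euclidean norm of a 2-vector."""
    return math.hypot(z[0], z[1])


def past_extragradient(z0, step, iters):
    """Past Extragradient (optimistic gradient), unconstrained case."""
    z = z0
    f_prev = operator(z0)  # operator value at the previous extrapolated point
    for _ in range(iters):
        # Extrapolate using the *stored* operator value (no fresh evaluation).
        z_tilde = (z[0] - step * f_prev[0], z[1] - step * f_prev[1])
        # One new operator evaluation per iteration, at the extrapolated point.
        f_prev = operator(z_tilde)
        z = (z[0] - step * f_prev[0], z[1] - step * f_prev[1])
    return z


def gradient_descent_ascent(z0, step, iters):
    """Plain simultaneous gradient descent-ascent, shown for contrast."""
    z = z0
    for _ in range(iters):
        f = operator(z)
        z = (z[0] - step * f[0], z[1] - step * f[1])
    return z


z_start = (1.0, 1.0)  # assumed starting point
z_peg = past_extragradient(z_start, step=0.3, iters=200)
z_gda = gradient_descent_ascent(z_start, step=0.3, iters=200)
print("PEG operator norm:", norm(operator(z_peg)))
print("GDA operator norm:", norm(operator(z_gda)))
```

The quantity printed is the operator norm at the last iterate, the same optimality measure (up to squaring) used in the abstract. The method reuses the previous operator value for the extrapolation step, so it costs one operator evaluation per iteration rather than the two required by the standard extragradient method.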
-
Gaia Early Data Release 3: The celestial reference frame (Gaia-CRF3)
Authors:
Gaia Collaboration,
S. A. Klioner,
L. Lindegren,
F. Mignard,
J. Hernández,
M. Ramos-Lerate,
U. Bastian,
M. Biermann,
A. Bombrun,
A. de Torres,
E. Gerlach,
R. Geyer,
T. Hilger,
D. Hobbs,
U. L. Lammers,
P. J. McMillan,
H. Steidelmüller,
D. Teyssier,
C. M. Raiteri,
S. Bartolomé,
M. Bernet,
J. Castañeda,
M. Clotet,
M. Davidson,
C. Fabricius
, et al. (426 additional authors not shown)
Abstract:
Gaia-CRF3 is the celestial reference frame for positions and proper motions in the third release of data from the Gaia mission, Gaia DR3 (and for the early third release, Gaia EDR3, which contains identical astrometric results). The reference frame is defined by the positions and proper motions at epoch 2016.0 for a specific set of extragalactic sources in the (E)DR3 catalogue.
We describe the construction of Gaia-CRF3, and its properties in terms of the distributions in magnitude, colour, and astrometric quality.
Compact extragalactic sources in Gaia DR3 were identified by positional cross-matching with 17 external catalogues of quasars (QSO) and active galactic nuclei (AGN), followed by astrometric filtering designed to remove stellar contaminants. Selecting a clean sample was favoured over including a higher number of extragalactic sources. For the final sample, the random and systematic errors in the proper motions are analysed, as well as the radio-optical offsets in position for sources in the third realisation of the International Celestial Reference Frame (ICRF3).
The Gaia-CRF3 comprises about 1.6 million QSO-like sources, of which 1.2 million have five-parameter astrometric solutions in Gaia DR3 and 0.4 million have six-parameter solutions. The sources span the magnitude range G = 13 to 21 with a peak density at 20.6 mag, at which the typical positional uncertainty is about 1 mas. The proper motions show systematic errors on the level of 12 $μ$as yr${}^{-1}$ on angular scales greater than 15 deg. For the 3142 optical counterparts of ICRF3 sources in the S/X frequency bands, the median offset from the radio positions is about 0.5 mas, but exceeds 4 mas in either coordinate for 127 sources. We outline the future of the Gaia-CRF in the next Gaia data releases.
Submitted 30 October, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Pulsar Observations at Low Frequencies: Applications to Pulsar Timing and Solar Wind Models
Authors:
P. Kumar,
S. M. White,
K. Stovall,
J. Dowell,
G. B. Taylor
Abstract:
Efforts are underway to use high-precision timing of pulsars in order to detect low-frequency gravitational waves. A limit to this technique is the timing noise generated by dispersion in the plasma along the line of sight to the pulsar, including the solar wind. The effects due to the solar wind vary with time, influenced by the change in solar activity on different time scales, ranging up to $\sim 11$ years for a solar cycle. The solar wind contribution depends strongly on the angle between the pulsar line of sight and the solar disk, and is a dominant effect at small separations. Although solar wind models to mitigate these effects do exist, they do not account for all the effects of the solar wind and its temporal changes. Since low-frequency pulsar observations are most sensitive to these dispersive delays, they are most suited to test the efficacy of these models and identify alternative approaches. Here, we investigate the efficacy of some solar wind models commonly used in pulsar timing using long-term, high-cadence data on 6 pulsars taken with the Long Wavelength Array, and compare them with an operational solar wind model. Our results show that stationary models of the solar wind correction are insufficient to achieve the timing noise desired by pulsar timing experiments, and we need to use non-stationary models, which are informed by other solar wind observations, to obtain accurate timing residuals.
Submitted 2 February, 2022;
originally announced February 2022.
-
PEPit: computer-assisted worst-case analyses of first-order optimization methods in Python
Authors:
Baptiste Goujaud,
Céline Moucer,
François Glineur,
Julien Hendrickx,
Adrien Taylor,
Aymeric Dieuleveut
Abstract:
PEPit is a Python package aiming at simplifying the access to worst-case analyses of a large family of first-order optimization methods possibly involving gradient, projection, proximal, or linear optimization oracles, along with their approximate, or Bregman variants. In short, PEPit is a package enabling computer-assisted worst-case analyses of first-order optimization methods. The key underlying idea is to cast the problem of performing a worst-case analysis, often referred to as a performance estimation problem (PEP), as a semidefinite program (SDP) which can be solved numerically. To do that, the package users are only required to write first-order methods nearly as they would have implemented them. The package then takes care of the SDP modeling parts, and the worst-case analysis is performed numerically via a standard solver.
Submitted 17 June, 2024; v1 submitted 11 January, 2022;
originally announced January 2022.
-
New Tests of Millilensing in the Blazar PKS 1413+135
Authors:
A. L. Peirson,
I. Liodakis,
A. C. S. Readhead,
M. L. Lister,
E. S. Perlman,
M. F. Aller,
R. D. Blandford,
K. J. B. Grainge,
D. A. Green,
M. A. Gurwell,
M. W. Hodges,
T. Hovatta,
S. Kiehlmann,
A. Lähteenmäki,
W. Max-Moerbeck,
T. Mcaloone,
S. O'Neill,
V. Pavlidou,
T. J. Pearson,
V. Ravi,
R. A. Reeves,
P. F. Scott,
G. B. Taylor,
D. J. Titterington,
M. Tornikoski
, et al. (4 additional authors not shown)
Abstract:
Symmetric Achromatic Variability (SAV) is a rare form of radio variability in blazars that has been attributed to gravitational millilensing by a ~$10^2 - 10^5$ $M_\odot$ mass condensate. Four SAVs have been identified between 1980 and 2020 in the long-term radio monitoring data of the blazar PKS 1413+135. We show that all four can be fitted with the same, unchanging, gravitational lens model. If SAV is due to gravitational millilensing, PKS 1413+135 provides a unique system for studying active galactic nuclei with unprecedented microarcsecond resolution, as well as for studying the nature of the millilens itself. We discuss two possible candidates for the putative millilens: a giant molecular cloud hosted in the intervening edge-on spiral galaxy, and an undetected dwarf galaxy with a massive black hole. We find a significant dependence of SAV crossing time on frequency, which could indicate a fast shock moving in a slower underlying flow. We also find tentative evidence for a 989-day periodicity in the SAVs, which, if real, makes possible the prediction of future SAVs: the next three windows for possible SAVs begin in August 2022, May 2025, and February 2028.
Submitted 8 January, 2022; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Improvements to the Search for Cosmic Dawn Using the Long Wavelength Array
Authors:
C. DiLullo,
J. Dowell,
G. B. Taylor
Abstract:
We present recent improvements to the search for the global Cosmic Dawn signature using the Long Wavelength Array station located on the Sevilleta National Wildlife Refuge in New Mexico, USA (LWA-SV). These improvements are both in the methodology of the experiment and the hardware of the station. An improved observing strategy along with more sophisticated temperature calibration and foreground modelling schemes have led to improved residual RMS limits. A large improvement over previous work using LWA-SV is the use of a novel achromatic beamforming technique which has been developed for LWA-SV. We present results from an observing campaign which contains 29 days of observations between March $10^{\rm{th}}$, 2021 and April $10^{\rm{th}}$ 2021. The reported residual RMS limits are 6 times above the amplitude of the potential signal reported by the Experiment to Detect the Global EoR Signature (EDGES) collaboration.
Submitted 27 December, 2021;
originally announced December 2021.
-
What defines a compact symmetric object? A carefully vetted sample of CSOs
Authors:
Anthony C. S. Readhead,
Sebastian Kiehlmann,
Matthew L. Lister,
Sandra O'Neill,
Timothy J. Pearson,
Evan Sheldahl,
Aneta Siemiginowska,
Gregory B. Taylor,
Peter N. Wilkinson
Abstract:
Compact Symmetric Objects (CSOs), young jetted-AGN of overall projected size <1 kpc, are of great interest due to their youth and evolution. The classification was introduced to distinguish between ~95% of powerful compact extragalactic radio sources in flux density limited samples that are dominated by asymmetric emission due to relativistic beaming from jets aligned close to the line of sight, and ~5% of objects that are not. The original classification criteria were: (i) overall projected diameter smaller than ~1 kpc, (ii) identified center of activity, and (iii) symmetric jet structure about the center. There is confusion and erosion of the value of the CSO classification due to misclassifications. Many jets contain compact bright features outside the core, resulting in a GPS total spectrum and a "compact double" appearance, and some objects with jet axes aligned close to the line of sight appear symmetric because the approaching jet is projected on both sides of the core. To eliminate the confusion, we propose adding (iv) slow radio variability and (v) low apparent velocity of bright features moving along the jets to the above CSO criteria. We are compiling a catalog of CSOs using these five criteria to eliminate the confusion of Doppler boosting.
Submitted 16 November, 2021;
originally announced November 2021.
-
A note on approximate accelerated forward-backward methods with absolute and relative errors, and possibly strongly convex objectives
Authors:
Mathieu Barré,
Adrien Taylor,
Francis Bach
Abstract:
In this short note, we provide a simple version of an accelerated forward-backward method (a.k.a. Nesterov's accelerated proximal gradient method) that may rely on approximate proximal operators and can exploit strong convexity of the objective function. The method supports both relative and absolute errors, and its behavior is illustrated on a set of standard numerical experiments. Using the same developments, we further provide a version of the accelerated proximal hybrid extragradient method of Monteiro and Svaiter (2013) that can also exploit strong convexity of the objective function.
Submitted 21 January, 2022; v1 submitted 29 June, 2021;
originally announced June 2021.
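For readers unfamiliar with the accelerated forward-backward method named in the entry above, the following is a minimal sketch of the textbook exact-proximal variant (often called FISTA) applied to a lasso problem. It is not the note's method: the approximate proximal operators, the absolute and relative error criteria, and the strong-convexity parameters are all omitted. The function names, the lasso objective, and the choice of step size 1/L are assumptions made for illustration only.

```python
import numpy as np


def soft_threshold(v, tau):
    """Exact proximal operator of tau * ||.||_1 (the 'backward' step)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)


def accelerated_forward_backward(A, b, lam, n_iter=500):
    """Illustrative FISTA for min_x 0.5 * ||A x - b||^2 + lam * ||x||_1.

    Hypothetical helper: exact prox, no strong-convexity exploitation.
    """
    # Lipschitz constant of the gradient of the smooth term.
    L = np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    y = x.copy()
    t = 1.0
    for _ in range(n_iter):
        # Forward (gradient) step on the smooth term, taken at the extrapolated point y.
        grad = A.T @ (A @ y - b)
        # Backward (proximal) step on the nonsmooth term.
        x_new = soft_threshold(y - grad / L, lam / L)
        # Nesterov momentum sequence and extrapolation.
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        y = x_new + ((t - 1.0) / t_new) * (x_new - x)
        x, t = x_new, t_new
    return x
```

With `A` equal to the identity, the minimizer is simply `soft_threshold(b, lam)`, which gives a quick sanity check on the implementation.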
-
Malaria Risk Mapping Using Routine Health System Incidence Data in Zambia
Authors:
Benjamin M. Taylor,
Ricardo Andrade-Pacheco,
Hugh Sturrock,
Busiku Hamainza,
Kafula Silumbe,
John Miller,
Thomas P. Eisele,
Francois Rerolle,
Hannah Slater,
Adam Bennett
Abstract:
Improvements to Zambia's malaria surveillance system allow better monitoring of incidence and targeting of responses at refined spatial scales. As transmission decreases, understanding heterogeneity in risk at fine spatial scales becomes increasingly important. However, there are challenges in using health system data for high-resolution risk mapping: health facilities have undefined and overlapping catchment areas, and report on an inconsistent basis. We propose a novel inferential framework for risk mapping of malaria incidence data based on formal down-scaling of confirmed case data reported through the health system in Zambia. We combine data from large community intervention trials in 2011-2016 and model health facility catchments based upon treatment-seeking behaviours; our model for monthly incidence is an aggregated log-Gaussian Cox process, which allows us to predict incidence at fine scale. We predicted monthly malaria incidence at 5km$^2$ resolution nationally: whereas 4.8 million malaria cases were reported through the health system in 2016, we estimated that the number of cases occurring at the community level was closer to 10 million. As Zambia continues to scale up community-based reporting of malaria incidence, these outputs provide realistic estimates of community-level malaria burden as well as high-resolution risk maps for targeting interventions at the sub-catchment level.
Submitted 28 June, 2021;
originally announced June 2021.
-
Super-Acceleration with Cyclical Step-sizes
Authors:
Baptiste Goujaud,
Damien Scieur,
Aymeric Dieuleveut,
Adrien Taylor,
Fabian Pedregosa
Abstract:
We develop a convergence-rate analysis of momentum with cyclical step-sizes. We show that under some assumption on the spectral gap of Hessians in machine learning, cyclical step-sizes are provably faster than constant step-sizes. More precisely, we develop a convergence rate analysis for quadratic objectives that provides optimal parameters and shows that cyclical learning rates can improve upon traditional lower complexity bounds. We further propose a systematic approach to design optimal first order methods for quadratic minimization with a given spectral structure. Finally, we provide a local convergence rate analysis beyond quadratic minimization for the proposed methods and illustrate our findings through benchmarks on least squares and logistic regression problems.
Submitted 9 May, 2022; v1 submitted 17 June, 2021;
originally announced June 2021.
-
A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip
Authors:
Mathieu Even,
Raphaël Berthier,
Francis Bach,
Nicolas Flammarion,
Pierre Gaillard,
Hadrien Hendrikx,
Laurent Massoulié,
Adrien Taylor
Abstract:
We introduce the continuized Nesterov acceleration, a close variant of Nesterov acceleration whose variables are indexed by a continuous time parameter. The two variables continuously mix following a linear ordinary differential equation and take gradient steps at random times. This continuized variant benefits from the best of the continuous and the discrete frameworks: as a continuous process, one can use differential calculus to analyze convergence and obtain analytical expressions for the parameters; and a discretization of the continuized process can be computed exactly with convergence rates similar to those of Nesterov's original acceleration. We show that the discretization has the same structure as Nesterov acceleration, but with random parameters. We provide continuized Nesterov acceleration under deterministic as well as stochastic gradients, with either additive or multiplicative noise. Finally, using our continuized framework and expressing the gossip averaging problem as the stochastic minimization of a certain energy function, we provide the first rigorous acceleration of asynchronous gossip algorithms.
Submitted 27 October, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.