-
Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges
Authors:
Emilio Ferrara
Abstract:
The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of this data present substantial challenges in data modeling and analysis, which have been tamed with approaches spanning time series modeling to…
▽ More
The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of this data present substantial challenges in data modeling and analysis, which have been tamed with approaches spanning time series modeling to deep learning techniques. The latest frontier in this domain is the adoption of Large Language Models (LLMs), such as GPT-4 and Llama, for data analysis, modeling, understanding, and generation of human behavior through the lens of wearable sensor data. This survey explores current trends and challenges in applying LLMs for sensor-based human activity recognition and behavior modeling. We discuss the nature of wearable sensors data, the capabilities and limitations of LLMs to model them and their integration with traditional machine learning techniques. We also identify key challenges, including data quality, computational requirements, interpretability, and privacy concerns. By examining case studies and successful applications, we highlight the potential of LLMs in enhancing the analysis and interpretation of wearable sensors data. Finally, we propose future directions for research, emphasizing the need for improved preprocessing techniques, more efficient and scalable models, and interdisciplinary collaboration. This survey aims to provide a comprehensive overview of the intersection between wearable sensors data and LLMs, offering insights into the current state and future prospects of this emerging field.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
The Anomalous Acceleration of PSR J2043+1711: Long-Period Orbital Companion or Stellar Flyby?
Authors:
Thomas Donlon II,
Sukanya Chakrabarti,
Michael T. Lam,
Daniel Huber,
Daniel Hey,
Enrico Ramirez-Ruiz,
Benjamin Shappee,
David L. Kaplan,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Paul R. Brook,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch,
Elizabeth C. Ferrara,
William Fiore,
Emmanuel Fonseca,
Gabriel E. Freedman,
Nate Garver-Daniels,
Peter A. Gentile
, et al. (31 additional authors not shown)
Abstract:
Based on the rate of change of its orbital period, PSR J2043+1711 has a substantial peculiar acceleration of 3.5 $\pm$ 0.8 mm/s/yr, which deviates from the acceleration predicted by equilibrium Milky Way models at a $4σ$ level. The magnitude of the peculiar acceleration is too large to be explained by disequilibrium effects of the Milky Way interacting with orbiting dwarf galaxies ($\sim$1 mm/s/yr…
▽ More
Based on the rate of change of its orbital period, PSR J2043+1711 has a substantial peculiar acceleration of 3.5 $\pm$ 0.8 mm/s/yr, which deviates from the acceleration predicted by equilibrium Milky Way models at a $4σ$ level. The magnitude of the peculiar acceleration is too large to be explained by disequilibrium effects of the Milky Way interacting with orbiting dwarf galaxies ($\sim$1 mm/s/yr), and too small to be caused by period variations due to the pulsar being a redback. We identify and examine two plausible causes for the anomalous acceleration: a stellar flyby, and a long-period orbital companion. We identify a main-sequence star in \textit{Gaia} DR3 and Pan-STARRS DR2 with the correct mass, distance, and on-sky position to potentially explain the observed peculiar acceleration. However, the star and the pulsar system have substantially different proper motions, indicating that they are not gravitationally bound. However, it is possible that this is an unrelated star that just happens to be located near J2043+1711 along our line of sight (chance probability of 1.6\%). Therefore, we also constrain possible orbital parameters for a circumbinary companion in a hierarchical triple system with J2043+1711; the changes in the spindown rate of the pulsar are consistent with an outer object that has an orbital period of 80 kyr, a companion mass of 0.3 $M_\odot$ (indicative of a white dwarf or low-mass star), and a semi-major axis of 2000 AU. Continued timing and/or future faint optical observations of J2043+1711 may eventually allow us to differentiate between these scenarios.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Tracking the 2024 US Presidential Election Chatter on Tiktok: A Public Multimodal Dataset
Authors:
Gabriela Pinto,
Charles Bickham,
Tanishq Salkar,
Luca Luceri,
Emilio Ferrara
Abstract:
This paper documents our release of a large-scale data collection of TikTok posts related to the upcoming 2024 U.S. Presidential Election. Our current data comprises 1.8 million videos published between November 1, 2023, and May 26, 2024. Its exploratory analysis identifies the most common keywords, hashtags, and bigrams in both Spanish and English posts, focusing on the election and the two main…
▽ More
This paper documents our release of a large-scale data collection of TikTok posts related to the upcoming 2024 U.S. Presidential Election. Our current data comprises 1.8 million videos published between November 1, 2023, and May 26, 2024. Its exploratory analysis identifies the most common keywords, hashtags, and bigrams in both Spanish and English posts, focusing on the election and the two main Presidential candidates, President Joe Biden and Donald Trump.
We utilized the TikTok Research API, incorporating various election-related keywords and hashtags, to capture the full scope of relevant content. To address the limitations of the TikTok Research API, we also employed third-party scrapers to expand our dataset. The dataset is publicly available at https://github.com/gabbypinto/US2024PresElectionTikToks
△ Less
Submitted 2 July, 2024; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Exploring pulsar timing precision: A comparative study of polarization calibration methods for NANOGrav data from the Green Bank Telescope
Authors:
Lankeswar Dey,
Maura A. McLaughlin,
Haley M. Wahl,
Paul B. Demorest,
Zaven Arzoumanian,
Harsha Blumer,
Paul R. Brook,
Sarah Burke-Spolaor,
H. Thankful Cromartie,
Megan E. DeCesar,
Timothy Dolch,
Justin A. Ellis,
Robert D. Ferdman,
Elizabeth C. Ferrara,
William Fiore,
Emmanuel Fonseca,
Nate Garver-Daniels,
Peter A. Gentile,
Joseph Glaser,
Deborah C. Good,
Ross J. Jennings,
Megan L. Jones,
Michael T. Lam,
Duncan R. Lorimer,
**g Luo
, et al. (10 additional authors not shown)
Abstract:
Pulsar timing array experiments have recently uncovered evidence for a nanohertz gravitational wave background by precisely timing an ensemble of millisecond pulsars. The next significant milestones for these experiments include characterizing the detected background with greater precision, identifying its source(s), and detecting continuous gravitational waves from individual supermassive black h…
▽ More
Pulsar timing array experiments have recently uncovered evidence for a nanohertz gravitational wave background by precisely timing an ensemble of millisecond pulsars. The next significant milestones for these experiments include characterizing the detected background with greater precision, identifying its source(s), and detecting continuous gravitational waves from individual supermassive black hole binaries. To achieve these objectives, generating accurate and precise times of arrival of pulses from pulsar observations is crucial. Incorrect polarization calibration of the observed pulsar profiles may introduce errors in the measured times of arrival. Further, previous studies (e.g., van Straten 2013; Manchester et al. 2013) have demonstrated that robust polarization calibration of pulsar profiles can reduce noise in the pulsar timing data and improve timing solutions. In this paper, we investigate and compare the impact of different polarization calibration methods on pulsar timing precision using three distinct calibration techniques: the Ideal Feed Assumption (IFA), Measurement Equation Modeling (MEM), and Measurement Equation Template Matching (METM). Three NANOGrav pulsars-PSRs J1643$-$1224, J1744$-$1134, and J1909$-$3744-observed with the 800 MHz and 1.5 GHz receivers at the Green Bank Telescope (GBT) are utilized for our analysis. Our findings reveal that all three calibration methods enhance timing precision compared to scenarios where no polarization calibration is performed. Additionally, among the three calibration methods, the IFA approach generally provides the best results for timing analysis of pulsars observed with the GBT receiver system. We attribute the comparatively poorer performance of the MEM and METM methods to potential instabilities in the reference noise diode coupled to the receiver and temporal variations in the profile of the reference pulsar, respectively.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Tracing the Unseen: Uncovering Human Trafficking Patterns in Job Listings
Authors:
Siyi Zhou,
Jiankun Peng,
Emilio Ferrara
Abstract:
In the shadow of the digital revolution, the insidious issue of human trafficking has found new breeding grounds within the realms of social media and online job boards. Previous research efforts have predominantly centered on identifying victims via the analysis of escort advertisements. However, our work shifts the focus towards enabling a proactive approach: pinpointing potential traffickers be…
▽ More
In the shadow of the digital revolution, the insidious issue of human trafficking has found new breeding grounds within the realms of social media and online job boards. Previous research efforts have predominantly centered on identifying victims via the analysis of escort advertisements. However, our work shifts the focus towards enabling a proactive approach: pinpointing potential traffickers before they lure their preys through false job opportunities. In this study, we collect and analyze a vast dataset comprising over a quarter million job postings collected from eight relevant regions across the United States, spanning nearly two decades (2006-2024). The job boards we considered are specifically catered towards Chinese-speaking immigrants in the US. We classify the job posts into distinct groups based on the self-reported information of the posting user. Our investigation into the types of advertised opportunities, the modes of preferred contact, and the frequency of postings uncovers the patterns characterizing suspicious ads. Additionally, we highlight how external events such as health emergencies and conflicts appear to strongly correlate with increased volume of suspicious job posts: traffickers are more likely to prey upon vulnerable populations in times of crises. This research underscores the imperative for a deeper dive into how online job boards and communication platforms could be unwitting facilitators of human trafficking. More importantly, it calls for the urgent formulation of targeted strategies to dismantle these digital conduits of exploitation.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
The Susceptibility Paradox in Online Social Influence
Authors:
Luca Luceri,
**yi Ye,
Julie Jiang,
Emilio Ferrara
Abstract:
Understanding susceptibility to online influence is crucial for mitigating the spread of misinformation and protecting vulnerable audiences. This paper investigates susceptibility to influence within social networks, focusing on the differential effects of influence-driven versus spontaneous behaviors on user content adoption. Our analysis reveals that influence-driven adoption exhibits high homop…
▽ More
Understanding susceptibility to online influence is crucial for mitigating the spread of misinformation and protecting vulnerable audiences. This paper investigates susceptibility to influence within social networks, focusing on the differential effects of influence-driven versus spontaneous behaviors on user content adoption. Our analysis reveals that influence-driven adoption exhibits high homophily, indicating that individuals prone to influence often connect with similarly susceptible peers, thereby reinforcing peer influence dynamics. Conversely, spontaneous adoption shows significant but lower homophily. Additionally, we extend the Generalized Friendship Paradox to influence-driven behaviors, demonstrating that users' friends are generally more susceptible to influence than the users themselves, de facto establishing the notion of Susceptibility Paradox in online social influence. This pattern does not hold for spontaneous behaviors, where friends exhibit fewer spontaneous adoptions. We find that susceptibility to influence can be accurately predicted using friends' susceptibility alone, while predicting spontaneous adoption requires additional features, such as user metadata. These findings highlight the complex interplay between user engagement and preferences in spontaneous content adoption. Our results provide new insights into social influence mechanisms and offer implications for designing more effective moderation strategies to protect vulnerable audiences.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Charting the Landscape of Nefarious Uses of Generative Artificial Intelligence for Online Election Interference
Authors:
Emilio Ferrara
Abstract:
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) pose significant risks, particularly in the realm of online election interference. This paper explores the nefarious applications of GenAI, highlighting their potential to disrupt democratic processes through deepfakes, botnets, targeted misinformation campaigns, and synthetic identities.
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) pose significant risks, particularly in the realm of online election interference. This paper explores the nefarious applications of GenAI, highlighting their potential to disrupt democratic processes through deepfakes, botnets, targeted misinformation campaigns, and synthetic identities.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
The NANOGrav 15 yr Data Set: Chromatic Gaussian Process Noise Models for Six Pulsars
Authors:
Bjorn Larsen,
Chiara M. F. Mingarelli,
Jeffrey S. Hazboun,
Aurelien Chalumeau,
Deborah C. Good,
Joseph Simon,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Paul R. Brook,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch,
Elizabeth C. Ferrara,
William Fiore,
Emmanuel Fonseca,
Gabriel E. Freedman,
Nate Garver-Daniels,
Peter A. Gentile,
Joseph Glaser,
Ross J. Jennings
, et al. (39 additional authors not shown)
Abstract:
Pulsar timing arrays (PTAs) are designed to detect low-frequency gravitational waves (GWs). GWs induce achromatic signals in PTA data, meaning that the timing delays do not depend on radio-frequency. However, pulse arrival times are also affected by radio-frequency dependent "chromatic" noise from sources such as dispersion measure (DM) and scattering delay variations. Furthermore, the characteriz…
▽ More
Pulsar timing arrays (PTAs) are designed to detect low-frequency gravitational waves (GWs). GWs induce achromatic signals in PTA data, meaning that the timing delays do not depend on radio-frequency. However, pulse arrival times are also affected by radio-frequency dependent "chromatic" noise from sources such as dispersion measure (DM) and scattering delay variations. Furthermore, the characterization of GW signals may be influenced by the choice of chromatic noise model for each pulsar. To better understand this effect, we assess if and how different chromatic noise models affect achromatic noise properties in each pulsar. The models we compare include existing DM models used by NANOGrav and noise models used for the European PTA Data Release 2 (EPTA DR2). We perform this comparison using a subsample of six pulsars from the NANOGrav 15 yr data set, selecting the same six pulsars as from the EPTA DR2 six-pulsar dataset. We find that the choice of chromatic noise model noticeably affects the achromatic noise properties of several pulsars. This is most dramatic for PSR J1713+0747, where the amplitude of its achromatic red noise lowers from $\log_{10}A_{\text{RN}} = -14.1^{+0.1}_{-0.1}$ to $-14.7^{+0.3}_{-0.5}$, and the spectral index broadens from $γ_{\text{RN}} = 2.6^{+0.5}_{-0.4}$ to $γ_{\text{RN}} = 3.5^{+1.2}_{-0.9}$. We also compare each pulsar's noise properties with those inferred from the EPTA DR2, using the same models. From the discrepancies, we identify potential areas where the noise models could be improved. These results highlight the potential for custom chromatic noise models to improve PTA sensitivity to GWs.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
NICER Discovery that SRGA J144459.2-604207 is an Accreting Millisecond X-ray Pulsar
Authors:
Mason Ng,
Paul S. Ray,
Andrea Sanna,
Tod E. Strohmayer,
Alessandro Papitto,
Giulia Illiano,
Arianna C. Albayati,
Diego Altamirano,
Tuğba Boztepe,
Tolga Güver,
Deepto Chakrabarty,
Zaven Arzoumanian,
D. J. K. Buisson,
Elizabeth C. Ferrara,
Keith C. Gendreau,
Sebastien Guillot,
Jeremy Hare,
Gaurava K. Jaisawal,
Christian Malacaria,
Michael T. Wolff
Abstract:
We present the discovery, with the Neutron Star Interior Composition Explorer (NICER), that SRGA J144459.2-604207 is a 447.9 Hz accreting millisecond X-ray pulsar (AMXP), which underwent a four-week long outburst starting on 2024 February 15. The AMXP resides in a 5.22 hr binary, orbiting a low-mass companion donor with $M_d>0.1M_\odot$. We report on the temporal and spectral properties from NICER…
▽ More
We present the discovery, with the Neutron Star Interior Composition Explorer (NICER), that SRGA J144459.2-604207 is a 447.9 Hz accreting millisecond X-ray pulsar (AMXP), which underwent a four-week long outburst starting on 2024 February 15. The AMXP resides in a 5.22 hr binary, orbiting a low-mass companion donor with $M_d>0.1M_\odot$. We report on the temporal and spectral properties from NICER observations during the early days of the outburst, from 2024 February 21 through 2024 February 23, during which NICER also detected a type-I X-ray burst that exhibited a plateau lasting ~6 s. The spectra of the persistent emission were well described by an absorbed thermal blackbody and power-law model, with blackbody temperature $kT\approx0.9{\rm\,keV}$ and power-law photon index $Γ\approx1.9$. Time-resolved burst spectroscopy confirmed the thermonuclear nature of the burst, where an additional blackbody component reached a maximum temperature of nearly $kT\approx3{\rm\,keV}$ at the peak of the burst. We discuss the nature of the companion as well as the type-I X-ray burst.
△ Less
Submitted 14 May, 2024; v1 submitted 30 April, 2024;
originally announced May 2024.
-
Hidden in Plain Sight: Exploring the Intersections of Mental Health, Eating Disorders, and Content Moderation on TikTok
Authors:
Charles Bickham,
Kia Kazemi-Nia,
Luca Luceri,
Kristina Lerman,
Emilio Ferrara
Abstract:
Social media platforms actively moderate content glorifying harmful behaviors like eating disorders, which include anorexia and bulimia. However, users have adapted to evade moderation by using coded hashtags. Our study investigates the prevalence of moderation evaders on the popular social media platform TikTok and contrasts their use and emotional valence with mainstream hashtags. We notice that…
▽ More
Social media platforms actively moderate content glorifying harmful behaviors like eating disorders, which include anorexia and bulimia. However, users have adapted to evade moderation by using coded hashtags. Our study investigates the prevalence of moderation evaders on the popular social media platform TikTok and contrasts their use and emotional valence with mainstream hashtags. We notice that moderation evaders and mainstream hashtags appear together, indicating that vulnerable users might inadvertently encounter harmful content even when searching for mainstream terms. Additionally, through an analysis of emotional expressions in video descriptions and comments, we find that mainstream hashtags generally promote positive engagement, while moderation evaders evoke a wider range of emotions, including heightened negativity. These findings provide valuable insights for content creators, platform moderation efforts, and interventions aimed at cultivating a supportive online environment for discussions on mental health and eating disorders.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
The NANOGrav 15 yr Data Set: Looking for Signs of Discreteness in the Gravitational-wave Background
Authors:
Gabriella Agazie,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Lucas Brown,
Sarah Burke-Spolaor,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Megan E. DeCesar,
Paul B. Demorest,
Heling Deng,
Timothy Dolch,
Elizabeth C. Ferrara,
William Fiore,
Emmanuel Fonseca,
Gabriel E. Freedman,
Nate Garver-Daniels
, et al. (58 additional authors not shown)
Abstract:
The cosmic merger history of supermassive black hole binaries (SMBHBs) is expected to produce a low-frequency gravitational wave background (GWB). Here we investigate how signs of the discrete nature of this GWB can manifest in pulsar timing arrays through excursions from, and breaks in, the expected $f_{\mathrm{GW}}^{-2/3}$ power-law of the GWB strain spectrum. To do this, we create a semi-analyt…
▽ More
The cosmic merger history of supermassive black hole binaries (SMBHBs) is expected to produce a low-frequency gravitational wave background (GWB). Here we investigate how signs of the discrete nature of this GWB can manifest in pulsar timing arrays through excursions from, and breaks in, the expected $f_{\mathrm{GW}}^{-2/3}$ power-law of the GWB strain spectrum. To do this, we create a semi-analytic SMBHB population model, fit to NANOGrav's 15 yr GWB amplitude, and with 1,000 realizations we study the populations' characteristic strain and residual spectra. Comparing our models to the NANOGrav 15 yr spectrum, we find two interesting excursions from the power-law. The first, at $2 \; \mathrm{nHz}$, is below our GWB realizations with $p$-value significance $p = 0.05$ to $0.06$ ($\approx 1.8 σ- 1.9 σ$). The second, at $16 \; \mathrm{nHz}$, is above our GWB realizations with $p = 0.04$ to $0.15$ ($\approx 1.4 σ- 2.1 σ$). We explore the properties of a loud SMBHB which could cause such an excursion. Our simulations also show that the expected number of SMBHBs decreases by three orders of magnitude, from $\sim 10^6$ to $\sim 10^3$, between $2\; \mathrm{nHz}$ and $20 \; \mathrm{nHz}$. This causes a break in the strain spectrum as the stochasticity of the background breaks down at $26^{+28}_{-19} \; \mathrm{nHz}$, consistent with predictions pre-dating GWB measurements. The diminished GWB signal from SMBHBs at frequencies above the $26$~nHz break opens a window for PTAs to detect continuous GWs from individual SMBHBs or GWs from the early universe.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
A Case for a Binary Black Hole System Revealed via Quasi-Periodic Outflows
Authors:
Dheeraj R. Pasham,
Francesco Tombesi,
Petra Sukova,
Michal Zajacek,
Suvendu Rakshit,
Eric Coughlin,
Peter Kosec,
Vladimir Karas,
Megan Masterson,
Andrew Mummery,
Thomas W. -S. Holoien,
Muryel Guolo,
Jason Hinkle,
Bart Ripperda,
Vojtech Witzany,
Ben Shappee,
Erin Kara,
Assaf Horesh,
Sjoert van Velzen,
Itai Sfaradi,
David L. Kaplan,
Noam Burger,
Tara Murphy,
Ronald Remillard,
James F. Steiner
, et al. (11 additional authors not shown)
Abstract:
Binaries containing a compact object orbiting a supermassive black hole are thought to be precursors of gravitational wave events, but their identification has been extremely challenging. Here, we report quasi-periodic variability in X-ray absorption which we interpret as quasi-periodic outflows (QPOuts) from a previously low-luminosity active galactic nucleus after an outburst, likely caused by a…
▽ More
Binaries containing a compact object orbiting a supermassive black hole are thought to be precursors of gravitational wave events, but their identification has been extremely challenging. Here, we report quasi-periodic variability in X-ray absorption which we interpret as quasi-periodic outflows (QPOuts) from a previously low-luminosity active galactic nucleus after an outburst, likely caused by a stellar tidal disruption. We rule out several models based on observed properties and instead show using general relativistic magnetohydrodynamic simulations that QPOuts, separated by roughly 8.3 days, can be explained with an intermediate-mass black hole secondary on a mildly eccentric orbit at a mean distance of about 100 gravitational radii from the primary. Our work suggests that QPOuts could be a new way to identify intermediate/extreme-mass ratio binary candidates.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
A 350-MHz Green Bank Telescope Survey of Unassociated Fermi LAT Sources: Discovery and Timing of Ten Millisecond Pulsars
Authors:
P. Bangale,
B. Bhattacharyya,
F. Camilo,
C. J. Clark,
I. Cognard,
M. E. DeCesar,
E. C. Ferrara,
P. Gentile,
L. Guillemot,
J. W. T. Hessels,
T. J. Johnson,
M. Kerr,
M. A. McLaughlin,
L. Nieder,
S. M. Ransom,
P. S. Ray,
M. S. E. Roberts,
J. Roy,
S. Sanpa-Arsa,
G. Theureau,
M. T. Wolff
Abstract:
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were d…
▽ More
We have searched for radio pulsations towards 49 Fermi Large Area Telescope (LAT) 1FGL Catalog $γ$-ray sources using the Green Bank Telescope at 350 MHz. We detected 18 millisecond pulsars (MSPs) in blind searches of the data; 10 of these were discoveries unique to our survey. Sixteen are binaries, with eight having short orbital periods $P_B < 1$ day. No radio pulsations from young pulsars were detected, although three targets are coincident with apparently radio-quiet $γ$-ray pulsars discovered in LAT data. Here, we give an overview of the survey and present radio and $γ$-ray timing results for the 10 MSPs discovered. These include the only isolated MSP discovered in our survey and six short-$P_B$ binary MSPs. Of these, three have very low-mass companions ($M_c$ $\ll$ 0.1M$_{\odot}$) and hence belong to the class of black widow pulsars. Two have more massive, non-degenerate companions with extensive radio eclipses and orbitally modulated X-ray emission consistent with the redback class. Significant $γ$-ray pulsations have been detected from nine of the discoveries. This survey and similar efforts suggest that the majority of Galactic $γ$-ray sources at high Galactic latitudes are either MSPs or relatively nearby non-recycled pulsars, with the latter having on average a much smaller radio/$γ$-ray beaming ratio as compared to MSPs. It also confirms that past surveys suffered from an observational bias against finding short-$P_B$ MSP systems.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs
Authors:
Eun Cheol Choi,
Emilio Ferrara
Abstract:
Our society is facing rampant misinformation harming public health and trust. To address the societal challenge, we introduce FACT-GPT, a system leveraging Large Language Models (LLMs) to automate the claim matching stage of fact-checking. FACT-GPT, trained on a synthetic dataset, identifies social media content that aligns with, contradicts, or is irrelevant to previously debunked claims. Our eva…
▽ More
Our society is facing rampant misinformation harming public health and trust. To address the societal challenge, we introduce FACT-GPT, a system leveraging Large Language Models (LLMs) to automate the claim matching stage of fact-checking. FACT-GPT, trained on a synthetic dataset, identifies social media content that aligns with, contradicts, or is irrelevant to previously debunked claims. Our evaluation shows that our specialized LLMs can match the accuracy of larger models in identifying related claims, closely mirroring human judgment. This research provides an automated solution for efficient claim matching, demonstrates the potential of LLMs in supporting fact-checkers, and offers valuable resources for further research in the field.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru
Authors:
Gabriela Pinto,
Keith Burghardt,
Kristina Lerman,
Emilio Ferrara
Abstract:
TikTok is one of the largest and fastest-growing social media sites in the world. TikTok features, however, such as voice transcripts, are often missing and other important features, such as OCR or video descriptions, do not exist. We introduce the Generative AI Enriched TikTok (GET-Tok) data, a pipeline for collecting TikTok videos and enriched data by augmenting the TikTok Research API with gene…
▽ More
TikTok is one of the largest and fastest-growing social media sites in the world. TikTok features, however, such as voice transcripts, are often missing and other important features, such as OCR or video descriptions, do not exist. We introduce the Generative AI Enriched TikTok (GET-Tok) data, a pipeline for collecting TikTok videos and enriched data by augmenting the TikTok Research API with generative AI models. As a case study, we collect videos about the attempted coup in Peru initiated by its former President, Pedro Castillo, and its accompanying protests. The data includes information on 43,697 videos published from November 20, 2022 to March 1, 2023 (102 days). Generative AI augments the collected data via transcripts of TikTok videos, text descriptions of what is shown in the videos, what text is displayed within the video, and the stances expressed in the video. Overall, this pipeline will contribute to a better understanding of online discussion in a multimodal setting with applications of Generative AI, especially outlining the utility of this pipeline in non-English-language social media. Our code used to produce the pipeline is in a public Github repository: https://github.com/gabbypinto/GET-Tok-Peru.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Coordinated Activity Modulates the Behavior and Emotions of Organic Users: A Case Study on Tweets about the Gaza Conflict
Authors:
Priyanka Dey,
Luca Luceri,
Emilio Ferrara
Abstract:
Social media has become a crucial conduit for the swift dissemination of information during global crises. However, this also paves the way for the manipulation of narratives by malicious actors. This research delves into the interaction dynamics between coordinated (malicious) entities and organic (regular) users on Twitter amidst the Gaza conflict. Through the analysis of approximately 3.5 milli…
▽ More
Social media has become a crucial conduit for the swift dissemination of information during global crises. However, this also paves the way for the manipulation of narratives by malicious actors. This research delves into the interaction dynamics between coordinated (malicious) entities and organic (regular) users on Twitter amidst the Gaza conflict. Through the analysis of approximately 3.5 million tweets from over 1.3 million users, our study uncovers that coordinated users significantly impact the information landscape, successfully disseminating their content across the network: a substantial fraction of their messages is adopted and shared by organic users. Furthermore, the study documents a progressive increase in organic users' engagement with coordinated content, which is paralleled by a discernible shift towards more emotionally polarized expressions in their subsequent communications. These results highlight the critical need for vigilance and a nuanced understanding of information manipulation on social media platforms.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
"Can You Play Anything Else?" Understanding Play Style Flexibility in League of Legends
Authors:
Emily Chen,
Alexander Bisberg,
Emilio Ferrara
Abstract:
This study investigates the concept of flexibility within League of Legends, a popular online multiplayer game, focusing on the relationship between user adaptability and team success. Utilizing a dataset encompassing players of varying skill levels and play styles, we calculate two measures of flexibility for each player: overall flexibility and temporal flexibility. Our findings suggest that the…
▽ More
This study investigates the concept of flexibility within League of Legends, a popular online multiplayer game, focusing on the relationship between user adaptability and team success. Utilizing a dataset encompassing players of varying skill levels and play styles, we calculate two measures of flexibility for each player: overall flexibility and temporal flexibility. Our findings suggest that the flexibility of a user is dependent upon a user's preferred play style, and flexibility does impact match outcome. This work also shows that skill level not only indicates how willing a player is to adapt their play style but also how their adaptability changes over time. This paper highlights the duality and balance of specialization versus flexibility, providing insights that can inform strategic planning, collaboration and resource allocation in competitive environments.
△ Less
Submitted 10 July, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Mass estimates from optical modelling of the new TRAPUM redback PSR J1910-5320
Authors:
O. G. Dodge,
R. P. Breton,
C. J. Clark,
M. Burgay,
J. Strader,
K. -Y. Au,
E. D. Barr,
S. Buchner,
V. S. Dhillon,
E. C. Ferrara,
P. C. C. Freire,
J. -M. Griessmeier,
M. R. Kennedy,
M. Kramer,
K. -L. Li,
P. V. Padmanabh,
A. Phosrisom,
B. W. Stappers,
S. J. Swihart,
T. Thongmeearkom
Abstract:
Spider pulsars continue to provide promising candidates for neutron star mass measurements. Here we present the discovery of PSR~J1910$-$5320, a new millisecond pulsar discovered in a MeerKAT observation of an unidentified \textit{Fermi}-LAT gamma-ray source. This pulsar is coincident with a recently identified candidate redback binary, independently discovered through its periodic optical flux an…
▽ More
Spider pulsars continue to provide promising candidates for neutron star mass measurements. Here we present the discovery of PSR~J1910$-$5320, a new millisecond pulsar discovered in a MeerKAT observation of an unidentified \textit{Fermi}-LAT gamma-ray source. This pulsar is coincident with a recently identified candidate redback binary, independently discovered through its periodic optical flux and radial velocity. New multi-color optical light curves obtained with ULTRACAM/NTT in combination with MeerKAT timing and updated SOAR/Goodman spectroscopic radial velocity measurements allow a mass constraint for PSR~J1910$-$5320. \texttt{Icarus} optical light curve modelling, with streamlined radial velocity fitting, constrains the orbital inclination and companion velocity, unlocking the binary mass function given the precise radio ephemeris. Our modelling aims to unite the photometric and spectroscopic measurements available by fitting each simultaneously to the same underlying physical model, ensuring self-consistency. This targets centre-of-light radial velocity corrections necessitated by the irradiation endemic to spider systems. Depending on the gravity darkening prescription used, we find a moderate neutron star mass of either $1.6\pm0.2$ or $1.4\pm0.2$ $M_\odot$. The companion mass of either $0.45\pm0.04$ or $0.43^{+0.04}_{-0.03}$ $M_\odot$ also further confirms PSR~J1910$-$5320 as an irradiated redback spider pulsar.radiated redback spider pulsar.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Moral Values Underpinning COVID-19 Online Communication Patterns
Authors:
Julie Jiang,
Luca Luceri,
Emilio Ferrara
Abstract:
The COVID-19 pandemic has triggered profound societal changes, extending beyond its health impacts to the moralization of behaviors. Leveraging insights from moral psychology, this study delves into the moral fabric sha** online discussions surrounding COVID-19 over a span of nearly two years. Our investigation identifies four distinct user groups characterized by differences in morality, politi…
▽ More
The COVID-19 pandemic has triggered profound societal changes, extending beyond its health impacts to the moralization of behaviors. Leveraging insights from moral psychology, this study delves into the moral fabric sha** online discussions surrounding COVID-19 over a span of nearly two years. Our investigation identifies four distinct user groups characterized by differences in morality, political ideology, and communication styles. We underscore the intricate relationship between moral differences and political ideologies, revealing a nuanced picture where moral orientations do not rigidly separate users politically. Furthermore, we uncover patterns of moral homophily within the social network, highlighting the existence of one potential moral echo chamber. Analyzing the moral themes embedded in messages, we observe that messages featuring moral foundations not typically favored by their authors, as well as those incorporating multiple moral foundations, resonate more effectively with out-group members. This research contributes valuable insights into the complex interplay between moral foundations, communication dynamics, and network structures on Twitter.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Social-LLM: Modeling User Behavior at Scale using Language Models and Social Network Data
Authors:
Julie Jiang,
Emilio Ferrara
Abstract:
The proliferation of social network data has unlocked unprecedented opportunities for extensive, data-driven exploration of human behavior. The structural intricacies of social networks offer insights into various computational social science issues, particularly concerning social influence and information diffusion. However, modeling large-scale social network data comes with computational challe…
▽ More
The proliferation of social network data has unlocked unprecedented opportunities for extensive, data-driven exploration of human behavior. The structural intricacies of social networks offer insights into various computational social science issues, particularly concerning social influence and information diffusion. However, modeling large-scale social network data comes with computational challenges. Though large language models make it easier than ever to model textual content, any advanced network representation methods struggle with scalability and efficient deployment to out-of-sample users. In response, we introduce a novel approach tailored for modeling social network data in user detection tasks. This innovative method integrates localized social network interactions with the capabilities of large language models. Operating under the premise of social network homophily, which posits that socially connected users share similarities, our approach is designed to address these challenges. We conduct a thorough evaluation of our method across seven real-world social network datasets, spanning a diverse range of topics and detection tasks, showcasing its applicability to advance research in computational social science.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Social Bots: Detection and Challenges
Authors:
Kai-Cheng Yang,
Onur Varol,
Alexander C. Nwala,
Mohsen Sayyadiharikandeh,
Emilio Ferrara,
Alessandro Flammini,
Filippo Menczer
Abstract:
While social media are a key source of data for computational social science, their ease of manipulation by malicious actors threatens the integrity of online information exchanges and their analysis. In this Chapter, we focus on malicious social bots, a prominent vehicle for such manipulation. We start by discussing recent studies about the presence and actions of social bots in various online di…
▽ More
While social media are a key source of data for computational social science, their ease of manipulation by malicious actors threatens the integrity of online information exchanges and their analysis. In this Chapter, we focus on malicious social bots, a prominent vehicle for such manipulation. We start by discussing recent studies about the presence and actions of social bots in various online discussions to show their real-world implications and the need for detection methods. Then we discuss the challenges of bot detection methods and use Botometer, a publicly available bot detection tool, as a case study to describe recent developments in this area. We close with a practical guide on how to handle social bots in social media research.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Evidence for a dynamic corona in the short-term time lags of black hole X-ray binary MAXI J1820+070
Authors:
Niek Bollemeijer,
Phil Uttley,
Arkadip Basak,
Adam Ingram,
Jakob van den Eijnden,
Kevin Alabarta,
Diego Altamirano,
Zaven Arzoumanian,
Douglas J. K. Buisson,
Andrew C. Fabian,
Elizabeth Ferrara,
Keith Gendreau,
Jeroen Homan,
Erin Kara,
Craig Markwardt,
Ronald A. Remillard,
Andrea Sanna,
James F. Steiner,
Francesco Tombesi,
**gyi Wang,
Yanan Wang,
Abderahmen Zoghbi
Abstract:
In X-ray observations of hard state black hole X-ray binaries, rapid variations in accretion disc and coronal power-law emission are correlated and show Fourier-frequency-dependent time lags. On short (~0.1 s) time-scales, these lags are thought to be due to reverberation and therefore may depend strongly on the geometry of the corona. Low-frequency quasi-periodic oscillations (QPOs) are variation…
▽ More
In X-ray observations of hard state black hole X-ray binaries, rapid variations in accretion disc and coronal power-law emission are correlated and show Fourier-frequency-dependent time lags. On short (~0.1 s) time-scales, these lags are thought to be due to reverberation and therefore may depend strongly on the geometry of the corona. Low-frequency quasi-periodic oscillations (QPOs) are variations in X-ray flux that have been suggested to arise because of geometric changes in the corona, possibly due to General Relativistic Lense-Thirring precession. Therefore one might expect the short-term time lags to vary on the QPO time-scale. We performed novel spectral-timing analyses on NICER observations of the black hole X-ray binary MAXI J1820+070 during the hard state of its outburst in 2018 to investigate how the short-term time lags between a disc-dominated and a coronal power-law-dominated energy band vary on different time-scales. Our method can distinguish between variability due to the QPO and broadband noise, and we find a linear correlation between the power-law flux and lag amplitude that is strongest at the QPO frequency. We also introduce a new method to resolve the QPO signal and determine the QPO-phase-dependence of the flux and lag variations, finding that both are very similar. Our results are consistent with a geometric origin of QPOs, but also provide evidence for a dynamic corona with a geometry varying in a similar way over a broad range of time-scales, not just the QPO time-scale.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Can Language Model Moderators Improve the Health of Online Discourse?
Authors:
Hyundong Cho,
Shuai Liu,
Taiwei Shi,
Darpan Jain,
Basem Rizk,
Yuyang Huang,
Zixun Lu,
Nuan Wen,
Jonathan Gratch,
Emilio Ferrara,
Jonathan May
Abstract:
Conversational moderation of online communities is crucial to maintaining civility for a constructive environment, but it is challenging to scale and harmful to moderators. The inclusion of sophisticated natural language generation modules as a force multiplier to aid human moderators is a tantalizing prospect, but adequate evaluation approaches have so far been elusive. In this paper, we establis…
▽ More
Conversational moderation of online communities is crucial to maintaining civility for a constructive environment, but it is challenging to scale and harmful to moderators. The inclusion of sophisticated natural language generation modules as a force multiplier to aid human moderators is a tantalizing prospect, but adequate evaluation approaches have so far been elusive. In this paper, we establish a systematic definition of conversational moderation effectiveness grounded on moderation literature and establish design criteria for conducting realistic yet safe evaluation. We then propose a comprehensive evaluation framework to assess models' moderation capabilities independently of human intervention. With our framework, we conduct the first known study of language models as conversational moderators, finding that appropriately prompted models that incorporate insights from social science can provide specific and fair feedback on toxic behavior but struggle to influence users to increase their levels of respect and cooperation.
△ Less
Submitted 6 May, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Tracking the Newsworthiness of Public Documents
Authors:
Alexander Spangher,
Emilio Ferrara,
Ben Welsh,
Nanyun Peng,
Serdar Tumgoren,
Jonathan May
Abstract:
Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and help us build assistive tools. Yet, this is challenging because very few labelled links exist, language use between corpora is very different, and text may be covered for a variety of reasons…
▽ More
Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and help us build assistive tools. Yet, this is challenging because very few labelled links exist, language use between corpora is very different, and text may be covered for a variety of reasons. In this work we focus on news coverage of local public policy in the San Francisco Bay Area by the San Francisco Chronicle. First, we gather news articles, public policy documents and meeting recordings and link them using probabilistic relational modeling, which we show is a low-annotation linking methodology that outperforms other retrieval-based baselines. Second, we define a new task: newsworthiness prediction, to predict if a policy item will get covered. We show that different aspects of public policy discussion yield different newsworthiness signals. Finally we perform human evaluation with expert journalists and show our systems identify policies they consider newsworthy with 68% F1 and our coverage recommendations are helpful with an 84% win-rate.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Leveraging Large Language Models to Detect Influence Campaigns in Social Media
Authors:
Luca Luceri,
Eric Boniardi,
Emilio Ferrara
Abstract:
Social media influence campaigns pose significant challenges to public discourse and democracy. Traditional detection methods fall short due to the complexity and dynamic nature of social media. Addressing this, we propose a novel detection method using Large Language Models (LLMs) that incorporates both user metadata and network structures. By converting these elements into a text format, our app…
▽ More
Social media influence campaigns pose significant challenges to public discourse and democracy. Traditional detection methods fall short due to the complexity and dynamic nature of social media. Addressing this, we propose a novel detection method using Large Language Models (LLMs) that incorporates both user metadata and network structures. By converting these elements into a text format, our approach effectively processes multilingual content and adapts to the shifting tactics of malicious campaign actors. We validate our model through rigorous testing on multiple datasets, showcasing its superior performance in identifying influence efforts. This research not only offers a powerful tool for detecting campaigns, but also sets the stage for future enhancements to keep up with the fast-paced evolution of social media-based influence tactics.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Susceptibility to Unreliable Information Sources: Swift Adoption with Minimal Exposure
Authors:
**yi Ye,
Luca Luceri,
Julie Jiang,
Emilio Ferrara
Abstract:
Misinformation proliferation on social media platforms is a pervasive threat to the integrity of online public discourse. Genuine users, susceptible to others' influence, often unknowingly engage with, endorse, and re-share questionable pieces of information, collectively amplifying the spread of misinformation. In this study, we introduce an empirical framework to investigate users' susceptibilit…
▽ More
Misinformation proliferation on social media platforms is a pervasive threat to the integrity of online public discourse. Genuine users, susceptible to others' influence, often unknowingly engage with, endorse, and re-share questionable pieces of information, collectively amplifying the spread of misinformation. In this study, we introduce an empirical framework to investigate users' susceptibility to influence when exposed to unreliable and reliable information sources. Leveraging two datasets on political and public health discussions on Twitter, we analyze the impact of exposure on the adoption of information sources, examining how the reliability of the source modulates this relationship. Our findings provide evidence that increased exposure augments the likelihood of adoption. Users tend to adopt low-credibility sources with fewer exposures than high-credibility sources, a trend that persists even among non-partisan users. Furthermore, the number of exposures needed for adoption varies based on the source credibility, with extreme ends of the spectrum (very high or low credibility) requiring fewer exposures for adoption. Additionally, we reveal that the adoption of information sources often mirrors users' prior exposure to sources with comparable credibility levels. Our research offers critical insights for mitigating the endorsement of misinformation by vulnerable users, offering a framework to study the dynamics of content exposure and adoption on social media platforms.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
The NANOGrav 15-year data set: Search for Transverse Polarization Modes in the Gravitational-Wave Background
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Jeremy Baier,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Dallas DeGan,
Paul B. Demorest
, et al. (74 additional authors not shown)
Abstract:
Recently we found compelling evidence for a gravitational wave background with Hellings and Downs (HD) correlations in our 15-year data set. These correlations describe gravitational waves as predicted by general relativity, which has two transverse polarization modes. However, more general metric theories of gravity can have additional polarization modes which produce different interpulsar correl…
▽ More
Recently we found compelling evidence for a gravitational wave background with Hellings and Downs (HD) correlations in our 15-year data set. These correlations describe gravitational waves as predicted by general relativity, which has two transverse polarization modes. However, more general metric theories of gravity can have additional polarization modes which produce different interpulsar correlations. In this work we search the NANOGrav 15-year data set for evidence of a gravitational wave background with quadrupolar Hellings and Downs (HD) and Scalar Transverse (ST) correlations. We find that HD correlations are the best fit to the data, and no significant evidence in favor of ST correlations. While Bayes factors show strong evidence for a correlated signal, the data does not strongly prefer either correlation signature, with Bayes factors $\sim 2$ when comparing HD to ST correlations, and $\sim 1$ for HD plus ST correlations to HD correlations alone. However, when modeled alongside HD correlations, the amplitude and spectral index posteriors for ST correlations are uninformative, with the HD process accounting for the vast majority of the total signal. Using the optimal statistic, a frequentist technique that focuses on the pulsar-pair cross-correlations, we find median signal-to-noise-ratios of 5.0 for HD and 4.6 for ST correlations when fit for separately, and median signal-to-noise-ratios of 3.5 for HD and 3.0 for ST correlations when fit for simultaneously. While the signal-to-noise-ratios for each of the correlations are comparable, the estimated amplitude and spectral index for HD are a significantly better fit to the total signal, in agreement with our Bayesian analysis.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Unmasking the Web of Deceit: Uncovering Coordinated Activity to Expose Information Operations on Twitter
Authors:
Luca Luceri,
Valeria Pantè,
Keith Burghardt,
Emilio Ferrara
Abstract:
Social media platforms, particularly Twitter, have become pivotal arenas for influence campaigns, often orchestrated by state-sponsored information operations (IOs). This paper delves into the detection of key players driving IOs by employing similarity graphs constructed from behavioral pattern data. We unveil that well-known, yet underutilized network properties can help accurately identify coor…
▽ More
Social media platforms, particularly Twitter, have become pivotal arenas for influence campaigns, often orchestrated by state-sponsored information operations (IOs). This paper delves into the detection of key players driving IOs by employing similarity graphs constructed from behavioral pattern data. We unveil that well-known, yet underutilized network properties can help accurately identify coordinated IO drivers. Drawing from a comprehensive dataset of 49 million tweets from six countries, which includes multiple verified IOs, our study reveals that traditional network filtering techniques do not consistently pinpoint IO drivers across campaigns. We first propose a framework based on node pruning that emerges superior, particularly when combining multiple behavioral indicators across different networks. Then, we introduce a supervised machine learning model that harnesses a vector representation of the fused similarity network. This model, which boasts a precision exceeding 0.95, adeptly classifies IO drivers on a global scale and reliably forecasts their temporal engagements. Our findings are crucial in the fight against deceptive influence campaigns on social media, hel** us better understand and detect them.
△ Less
Submitted 15 October, 2023;
originally announced October 2023.
-
Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation
Authors:
Eun Cheol Choi,
Emilio Ferrara
Abstract:
In today's digital era, the rapid spread of misinformation poses threats to public well-being and societal trust. As online misinformation proliferates, manual verification by fact checkers becomes increasingly challenging. We introduce FACT-GPT (Fact-checking Augmentation with Claim matching Task-oriented Generative Pre-trained Transformer), a framework designed to automate the claim matching pha…
▽ More
In today's digital era, the rapid spread of misinformation poses threats to public well-being and societal trust. As online misinformation proliferates, manual verification by fact checkers becomes increasingly challenging. We introduce FACT-GPT (Fact-checking Augmentation with Claim matching Task-oriented Generative Pre-trained Transformer), a framework designed to automate the claim matching phase of fact-checking using Large Language Models (LLMs). This framework identifies new social media content that either supports or contradicts claims previously debunked by fact-checkers. Our approach employs GPT-4 to generate a labeled dataset consisting of simulated social media posts. This data set serves as a training ground for fine-tuning more specialized LLMs. We evaluated FACT-GPT on an extensive dataset of social media content related to public health. The results indicate that our fine-tuned LLMs rival the performance of larger pre-trained LLMs in claim matching tasks, aligning closely with human annotations. This study achieves three key milestones: it provides an automated framework for enhanced fact-checking; demonstrates the potential of LLMs to complement human expertise; offers public resources, including datasets and models, to further research and applications in the fact-checking domain.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Social Approval and Network Homophily as Motivators of Online Toxicity
Authors:
Julie Jiang,
Luca Luceri,
Joseph B. Walther,
Emilio Ferrara
Abstract:
Online hate messaging is a pervasive issue plaguing the well-being of social media users. This research empirically investigates a novel theory positing that online hate may be driven primarily by the pursuit of social approval rather than a direct desire to harm the targets. Results show that toxicity is homophilous in users' social networks and that a user's propensity for hostility can be predi…
▽ More
Online hate messaging is a pervasive issue plaguing the well-being of social media users. This research empirically investigates a novel theory positing that online hate may be driven primarily by the pursuit of social approval rather than a direct desire to harm the targets. Results show that toxicity is homophilous in users' social networks and that a user's propensity for hostility can be predicted by their social networks. We also illustrate how receiving greater or fewer social engagements in the form of likes, retweets, quotes, and replies affects a user's subsequent toxicity. We establish a clear connection between receiving social approval signals and increases in subsequent toxicity. Being retweeted plays a particularly prominent role in escalating toxicity. Results also show that not receiving expected levels of social approval leads to decreased toxicity. We discuss the important implications of our research and opportunities to combat online hate.
△ Less
Submitted 29 February, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Factuality Challenges in the Era of Large Language Models
Authors:
Isabelle Augenstein,
Timothy Baldwin,
Meeyoung Cha,
Tanmoy Chakraborty,
Giovanni Luca Ciampaglia,
David Corney,
Renee DiResta,
Emilio Ferrara,
Scott Hale,
Alon Halevy,
Eduard Hovy,
Heng Ji,
Filippo Menczer,
Ruben Miguez,
Preslav Nakov,
Dietram Scheufele,
Shivam Sharma,
Giovanni Zagni
Abstract:
The emergence of tools based on Large Language Models (LLMs), such as OpenAI's ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention. These incredibly useful, natural-sounding tools mark significant advances in natural language generation, yet they exhibit a propensity to generate false, erroneous, or misleading content -- commonly referred to as "hallucinations.…
▽ More
The emergence of tools based on Large Language Models (LLMs), such as OpenAI's ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention. These incredibly useful, natural-sounding tools mark significant advances in natural language generation, yet they exhibit a propensity to generate false, erroneous, or misleading content -- commonly referred to as "hallucinations." Moreover, LLMs can be exploited for malicious applications, such as generating false but credible-sounding content and profiles at scale. This poses a significant challenge to society in terms of the potential deception of users and the increasing dissemination of inaccurate information. In light of these risks, we explore the kinds of technological innovations, regulatory reforms, and AI literacy initiatives needed from fact-checkers, news organizations, and the broader research and policy communities. By identifying the risks, the imminent threats, and some viable solutions, we seek to shed light on navigating various aspects of veracity in the era of generative AI.
△ Less
Submitted 9 October, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
GenAI Against Humanity: Nefarious Applications of Generative Artificial Intelligence and Large Language Models
Authors:
Emilio Ferrara
Abstract:
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) are marvels of technology; celebrated for their prowess in natural language processing and multimodal content generation, they promise a transformative future. But as with all powerful tools, they come with their shadows. Picture living in a world where deepfakes are indistinguishable from reality, where synthetic identiti…
▽ More
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) are marvels of technology; celebrated for their prowess in natural language processing and multimodal content generation, they promise a transformative future. But as with all powerful tools, they come with their shadows. Picture living in a world where deepfakes are indistinguishable from reality, where synthetic identities orchestrate malicious campaigns, and where targeted misinformation or scams are crafted with unparalleled precision. Welcome to the darker side of GenAI applications. This article is not just a journey through the meanders of potential misuse of GenAI and LLMs, but also a call to recognize the urgency of the challenges ahead. As we navigate the seas of misinformation campaigns, malicious content generation, and the eerie creation of sophisticated malware, we'll uncover the societal implications that ripple through the GenAI revolution we are witnessing. From AI-powered botnets on social media platforms to the unnerving potential of AI to generate fabricated identities, or alibis made of synthetic realities, the stakes have never been higher. The lines between the virtual and the real worlds are blurring, and the consequences of potential GenAI's nefarious applications impact us all. This article serves both as a synthesis of rigorous research presented on the risks of GenAI and misuse of LLMs and as a thought-provoking vision of the different types of harmful GenAI applications we might encounter in the near future, and some ways we can prepare for them.
△ Less
Submitted 22 January, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
The NANOGrav 12.5-year data set: A computationally efficient eccentric binary search pipeline and constraints on an eccentric supermassive binary candidate in 3C 66B
Authors:
Gabriella Agazie,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Harsha Blumer,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Belinda D. Cheeseboro,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Megan E. DeCesar,
Paul B. Demorest,
Lankeswar Dey,
Timothy Dolch,
Justin A. Ellis,
Robert D. Ferdman,
Elizabeth C. Ferrara
, et al. (63 additional authors not shown)
Abstract:
The radio galaxy 3C 66B has been hypothesized to host a supermassive black hole binary (SMBHB) at its center based on electromagnetic observations. Its apparent 1.05-year period and low redshift ($\sim0.02$) make it an interesting testbed to search for low-frequency gravitational waves (GWs) using Pulsar Timing Array (PTA) experiments. This source has been subjected to multiple searches for contin…
▽ More
The radio galaxy 3C 66B has been hypothesized to host a supermassive black hole binary (SMBHB) at its center based on electromagnetic observations. Its apparent 1.05-year period and low redshift ($\sim0.02$) make it an interesting testbed to search for low-frequency gravitational waves (GWs) using Pulsar Timing Array (PTA) experiments. This source has been subjected to multiple searches for continuous GWs from a circular SMBHB, resulting in progressively more stringent constraints on its GW amplitude and chirp mass. In this paper, we develop a pipeline for performing Bayesian targeted searches for eccentric SMBHBs in PTA data sets, and test its efficacy by applying it on simulated data sets with varying injected signal strengths. We also search for a realistic eccentric SMBHB source in 3C 66B using the NANOGrav 12.5-year data set employing PTA signal models containing Earth term-only as well as Earth+Pulsar term contributions using this pipeline. Due to limitations in our PTA signal model, we get meaningful results only when the initial eccentricity $e_0<0.5$ and the symmetric mass ratio $η>0.1$. We find no evidence for an eccentric SMBHB signal in our data, and therefore place 95% upper limits on the PTA signal amplitude of $88.1\pm3.7$ ns for the Earth term-only and $81.74\pm0.86$ ns for the Earth+Pulsar term searches for $e_0<0.5$ and $η>0.1$. Similar 95% upper limits on the chirp mass are $(1.98 \pm 0.05) \times 10^9\,M_{\odot}$ and $(1.81 \pm 0.01) \times 10^9\,M_{\odot}$. These upper limits, while less stringent than those calculated from a circular binary search in the NANOGrav 12.5-year data set, are consistent with the SMBHB model of 3C 66B developed from electromagnetic observations.
△ Less
Submitted 15 January, 2024; v1 submitted 29 September, 2023;
originally announced September 2023.
-
How to Detect an Astrophysical Nanohertz Gravitational-Wave Background
Authors:
Bence Bécsy,
Neil J. Cornish,
Patrick M. Meyers,
Luke Zoltan Kelley,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Katerina Chatziioannou,
Tyler Cohen,
James M. Cordes,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch
, et al. (71 additional authors not shown)
Abstract:
Analysis of pulsar timing data have provided evidence for a stochastic gravitational wave background in the nHz frequency band. The most plausible source of such a background is the superposition of signals from millions of supermassive black hole binaries. The standard statistical techniques used to search for such a background and assess its significance make several simplifying assumptions, nam…
▽ More
Analysis of pulsar timing data have provided evidence for a stochastic gravitational wave background in the nHz frequency band. The most plausible source of such a background is the superposition of signals from millions of supermassive black hole binaries. The standard statistical techniques used to search for such a background and assess its significance make several simplifying assumptions, namely: i) Gaussianity; ii) isotropy; and most often iii) a power-law spectrum. However, a stochastic background from a finite collection of binaries does not exactly satisfy any of these assumptions. To understand the effect of these assumptions, we test standard analysis techniques on a large collection of realistic simulated datasets. The dataset length, observing schedule, and noise levels were chosen to emulate the NANOGrav 15-year dataset. Simulated signals from millions of binaries drawn from models based on the Illustris cosmological hydrodynamical simulation were added to the data. We find that the standard statistical methods perform remarkably well on these simulated datasets, despite their fundamental assumptions not being strictly met. They are able to achieve a confident detection of the background. However, even for a fixed set of astrophysical parameters, different realizations of the universe result in a large variance in the significance and recovered parameters of the background. We also find that the presence of loud individual binaries can bias the spectral recovery of the background if we do not account for them.
△ Less
Submitted 1 December, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
X-ray eruptions every 22 days from the nucleus of a nearby galaxy
Authors:
Muryel Guolo,
Dheeraj R. Pasham,
Michal Zajaček,
Eric R. Coughlin,
Suvi Gezari,
Petra Suková,
Thomas Wevers,
Vojtěch Witzany,
Francesco Tombesi,
Sjoert van Velzen,
Kate D. Alexander,
Yuhan Yao,
Riccardo Arcodia,
Vladimır Karas,
James Miller-Jones,
Ronald Remillard,
Keith Gendreau,
Elizabeth C. Ferrara
Abstract:
Galactic nuclei showing recurrent phases of activity and quiescence have recently been discovered, with recurrence times as short as a few hours to a day -- known as quasi-periodic X-ray eruption (QPE) sources -- to as long as hundreds to a thousand days for repeating nuclear transients (RNTs). Here we present a multi-wavelength overview of Swift J023017.0+283603 (hereafter Swift J0230+28), a sour…
▽ More
Galactic nuclei showing recurrent phases of activity and quiescence have recently been discovered, with recurrence times as short as a few hours to a day -- known as quasi-periodic X-ray eruption (QPE) sources -- to as long as hundreds to a thousand days for repeating nuclear transients (RNTs). Here we present a multi-wavelength overview of Swift J023017.0+283603 (hereafter Swift J0230+28), a source that exhibits repeating and quasi-periodic X-ray flares from the nucleus of a previously unremarkable galaxy at $\sim$ 165 Mpc, with a recurrence time of approximately 22 days, an intermediary timescale between known RNTs and QPE sources. The source also shows transient radio emission, likely associated with the X-ray emission. Such recurrent soft X-ray eruptions, with no accompanying UV/optical emission, are strikingly similar to QPE sources. However, in addition to having a recurrence time that is $\sim 25$ times longer than the longest-known QPE source, Swift J0230+28's eruptions exhibit somewhat distinct shapes and temperature evolution than the known QPE sources. Scenarios involving extreme mass ratio inspirals are favored over disk instability models. The source reveals an unexplored timescale for repeating extragalactic transients and highlights the need for a wide-field, time-domain X-ray mission to explore the parameter space of recurring X-ray transients.
△ Less
Submitted 15 January, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
Comparing recent PTA results on the nanohertz stochastic gravitational wave background
Authors:
The International Pulsar Timing Array Collaboration,
G. Agazie,
J. Antoniadis,
A. Anumarlapudi,
A. M. Archibald,
P. Arumugam,
S. Arumugam,
Z. Arzoumanian,
J. Askew,
S. Babak,
M. Bagchi,
M. Bailes,
A. -S. Bak Nielsen,
P. T. Baker,
C. G. Bassa,
A. Bathula,
B. Bécsy,
A. Berthereau,
N. D. R. Bhat,
L. Blecha,
M. Bonetti,
E. Bortolas,
A. Brazier,
P. R. Brook,
M. Burgay
, et al. (220 additional authors not shown)
Abstract:
The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTA…
▽ More
The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTAs that constitute the International Pulsar Timing Array (IPTA). We show that despite making different modeling choices, there is no significant difference in the GWB parameters that are measured by the different PTAs, agreeing within $1σ$. The pulsar noise parameters are also consistent between different PTAs for the majority of the pulsars included in these analyses. We bridge the differences in modeling choices by adopting a standardized noise model for all pulsars and PTAs, finding that under this model there is a reduction in the tension in the pulsar noise parameters. As part of this reanalysis, we "extended" each PTA's data set by adding extra pulsars that were not timed by that PTA. Under these extensions, we find better constraints on the GWB amplitude and a higher signal-to-noise ratio for the Hellings and Downs correlations. These extensions serve as a prelude to the benefits offered by a full combination of data across all pulsars in the IPTA, i.e., the IPTA's Data Release 3, which will involve not just adding in additional pulsars, but also including data from all three PTAs where any given pulsar is timed by more than as single PTA.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
The NANOGrav 12.5-year Data Set: Search for Gravitational Wave Memory
Authors:
Gabriella Agazie,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Harsha Blumer,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Megan E. DeCesar,
Dallas DeGan,
Paul B. Demorest,
Timothy Dolch,
Brendan Drachler,
Justin A. Ellis
, et al. (65 additional authors not shown)
Abstract:
We present the results of a Bayesian search for gravitational wave (GW) memory in the NANOGrav 12.5-yr data set. We find no convincing evidence for any gravitational wave memory signals in this data set (Bayes factor = 2.8). As such, we go on to place upper limits on the strain amplitude of GW memory events as a function of sky location and event epoch. These upper limits are computed using a sign…
▽ More
We present the results of a Bayesian search for gravitational wave (GW) memory in the NANOGrav 12.5-yr data set. We find no convincing evidence for any gravitational wave memory signals in this data set (Bayes factor = 2.8). As such, we go on to place upper limits on the strain amplitude of GW memory events as a function of sky location and event epoch. These upper limits are computed using a signal model that assumes the existence of a common, spatially uncorrelated red noise in addition to a GW memory signal. The median strain upper limit as a function of sky position is approximately $3.3 \times 10^{-14}$. We also find that there are some differences in the upper limits as a function of sky position centered around PSR J0613$-$0200. This suggests that this pulsar has some excess noise which can be confounded with GW memory. Finally, the upper limits as a function of burst epoch continue to improve at later epochs. This improvement is attributable to the continued growth of the pulsar timing array.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
The NANOGrav 12.5-Year Data Set: Dispersion Measure Mis-Estimation with Varying Bandwidths
Authors:
Sofia Valentina Sosa Fiscella,
Michael T. Lam,
Zaven Arzoumanian,
Harsha Blumer,
Paul R. Brook,
H. Thankful Cromartie,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch,
Justin A. Ellis,
Robert D. Ferdman,
Elizabeth C. Ferrara,
Emmanuel Fonseca,
Nate Garver-Daniels,
Peter A. Gentile,
Deborah C. Good,
Megan L. Jones,
Duncan R. Lorimer,
**g Luo,
Ryan S. Lynch,
Maura A. McLaughlin,
Cherry Ng,
David J. Nice,
Timothy T. Pennucci,
Nihan S. Pol
, et al. (6 additional authors not shown)
Abstract:
Noise characterization for pulsar-timing applications accounts for interstellar dispersion by assuming a known frequency-dependence of the delay it introduces in the times of arrival (TOAs). However, calculations of this delay suffer from mis-estimations due to other chromatic effects in the observations. The precision in modeling dispersion is dependent on the observed bandwidth. In this work, we…
▽ More
Noise characterization for pulsar-timing applications accounts for interstellar dispersion by assuming a known frequency-dependence of the delay it introduces in the times of arrival (TOAs). However, calculations of this delay suffer from mis-estimations due to other chromatic effects in the observations. The precision in modeling dispersion is dependent on the observed bandwidth. In this work, we calculate the offsets in infinite-frequency TOAs due to mis-estimations in the modeling of dispersion when using varying bandwidths at the Green Bank Telescope. We use a set of broadband observations of PSR J1643-1224, a pulsar with an excess of chromatic noise in its timing residuals. We artificially restricted these observations to a narrowband frequency range, then used both data sets to calculate residuals with a timing model that does not include short-scale dispersion variations. By fitting the resulting residuals to a dispersion model, and comparing the ensuing fitted parameters, we quantify the dispersion mis-estimations. Moreover, by calculating the autocovariance function of the parameters we obtained a characteristic timescale over which the dispersion mis-estimations are correlated. For PSR J1643-1224, which has one of the highest dispersion measures (DM) in the NANOGrav pulsar timing array, we find that the infinite-frequency TOAs suffer from a systematic offset of ~22 microseconds due to DM mis-estimations, with correlations over ~1 month. For lower-DM pulsars, the offset is ~7 microseconds. This error quantification can be used to provide more robust noise modeling in NANOGrav's data, thereby increasing sensitivity and improving parameter estimation in gravitational wave searches.
△ Less
Submitted 30 July, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
The RS Oph outburst of 2021 monitored in X-rays with NICER
Authors:
Marina Orio,
Keith Gendreau,
Morgan Giese,
Gerardo Juna M. Luna,
Jozef Magdolen,
Tod E. Strohmayer,
Andy E. Zhang,
Diego Altamirano,
Andrej Dobrotka,
Teruaki Enoto,
Elizabeth C. Ferrara,
Richard Ignace,
Sebastian heinz,
Craig Markwardt,
Joy S. Nichols,
Micahel L. Parker,
Dheerajay R. Pasham,
Songpeng Pei,
Pragati Pradhan,
Ron Remillard,
James F. Steiner,
Francesco Tombesi
Abstract:
The 2021 outburst of the symbiotic recurrent nova RS Oph was monitored with the Neutron Star Interior Composition Explorer Mission (NICER) in the 0.2-12 keV range from day one after the optical maximum, until day 88, producing an unprecedented, detailed view of the outburst development. The X-ray flux preceding the supersoft X-ray phase peaked almost 5 days after optical maximum and originated onl…
▽ More
The 2021 outburst of the symbiotic recurrent nova RS Oph was monitored with the Neutron Star Interior Composition Explorer Mission (NICER) in the 0.2-12 keV range from day one after the optical maximum, until day 88, producing an unprecedented, detailed view of the outburst development. The X-ray flux preceding the supersoft X-ray phase peaked almost 5 days after optical maximum and originated only in shocked ejecta for 21 to 25 days. The emission was thermal; in the first 5 days only a non-collisional-ionization equilibrium model fits the spectrum, and a transition to equilibrium occurred between days 6 and 12. The ratio of peak X-rays flux measured in the NICER range to that measured with Fermi in the 60 MeV-500 GeV range was about 0.1, and the ratio to the peak flux measured with H.E.S.S. in the 250 GeV-2.5 TeV range was about 100. The central supersoft X-ray source (SSS), namely the shell hydrogen burning white dwarf (WD), became visible in the fourth week, initially with short flares. A huge increase in flux occurred on day 41, but the SSS flux remained variable. A quasi-periodic oscillation every ~35 s was always observed during the SSS phase, with variations in amplitude and a period drift that appeared to decrease in the end. The SSS has characteristics of a WD of mass >1 M(solar). Thermonuclear burning switched off shortly after day 75, earlier than in 2006 outburst. We discuss implications for the nova physics.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
The Third Fermi Large Area Telescope Catalog of Gamma-ray Pulsars
Authors:
David A. Smith,
Philippe Bruel,
Colin J. Clark,
Lucas Guillemot,
Matthew T. Kerr,
Paul Ray,
Soheila Abdollahi,
Marco Ajello,
Luca Baldini,
Jean Ballet,
Matthew Baring,
Cees Bassa,
Josefa Becerra Gonzalez,
Ronaldo Bellazzini,
Alessandra Berretta,
Bhaswati Bhattacharyya,
Elisabetta Bissaldi,
Raffaella Bonino,
Eugenio Bottacini,
Johan Bregeon,
Marta Burgay,
Toby Burnett,
Rob Cameron,
Fernando Camilo,
Regina Caputo
, et al. (134 additional authors not shown)
Abstract:
We present 294 pulsars found in GeV data from the Large Area Telescope (LAT) on the Fermi Gamma-ray Space Telescope. Another 33 millisecond pulsars (MSPs) discovered in deep radio searches of LAT sources will likely reveal pulsations once phase-connected rotation ephemerides are achieved. A further dozen optical and/or X-ray binary systems co-located with LAT sources also likely harbor gamma-ray M…
▽ More
We present 294 pulsars found in GeV data from the Large Area Telescope (LAT) on the Fermi Gamma-ray Space Telescope. Another 33 millisecond pulsars (MSPs) discovered in deep radio searches of LAT sources will likely reveal pulsations once phase-connected rotation ephemerides are achieved. A further dozen optical and/or X-ray binary systems co-located with LAT sources also likely harbor gamma-ray MSPs. This catalog thus reports roughly 340 gamma-ray pulsars and candidates, 10% of all known pulsars, compared to $\leq 11$ known before Fermi. Half of the gamma-ray pulsars are young. Of these, the half that are undetected in radio have a broader Galactic latitude distribution than the young radio-loud pulsars. The others are MSPs, with 6 undetected in radio. Overall, >235 are bright enough above 50 MeV to fit the pulse profile, the energy spectrum, or both. For the common two-peaked profiles, the gamma-ray peak closest to the magnetic pole crossing generally has a softer spectrum. The spectral energy distributions tend to narrow as the spindown power $\dot E$ decreases to its observed minimum near $10^{33}$ erg s$^{-1}$, approaching the shape for synchrotron radiation from monoenergetic electrons. We calculate gamma-ray luminosities when distances are available. Our all-sky gamma-ray sensitivity map is useful for population syntheses. The electronic catalog version provides gamma-ray pulsar ephemerides, properties and fit results to guide and be compared with modeling results.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
The Butterfly Effect in Artificial Intelligence Systems: Implications for AI Bias and Fairness
Authors:
Emilio Ferrara
Abstract:
The Butterfly Effect, a concept originating from chaos theory, underscores how small changes can have significant and unpredictable impacts on complex systems. In the context of AI fairness and bias, the Butterfly Effect can stem from a variety of sources, such as small biases or skewed data inputs during algorithm development, saddle points in training, or distribution shifts in data between trai…
▽ More
The Butterfly Effect, a concept originating from chaos theory, underscores how small changes can have significant and unpredictable impacts on complex systems. In the context of AI fairness and bias, the Butterfly Effect can stem from a variety of sources, such as small biases or skewed data inputs during algorithm development, saddle points in training, or distribution shifts in data between training and testing phases. These seemingly minor alterations can lead to unexpected and substantial unfair outcomes, disproportionately affecting underrepresented individuals or groups and perpetuating pre-existing inequalities. Moreover, the Butterfly Effect can amplify inherent biases within data or algorithms, exacerbate feedback loops, and create vulnerabilities for adversarial attacks. Given the intricate nature of AI systems and their societal implications, it is crucial to thoroughly examine any changes to algorithms or input data for potential unintended consequences. In this paper, we envision both algorithmic and empirical strategies to detect, quantify, and mitigate the Butterfly Effect in AI systems, emphasizing the importance of addressing these challenges to promote fairness and ensure responsible AI development.
△ Less
Submitted 2 February, 2024; v1 submitted 11 July, 2023;
originally announced July 2023.
-
The NANOGrav 15-year Gravitational-Wave Background Analysis Pipeline
Authors:
Aaron D. Johnson,
Patrick M. Meyers,
Paul T. Baker,
Neil J. Cornish,
Jeffrey S. Hazboun,
Tyson B. Littenberg,
Joseph D. Romano,
Stephen R. Taylor,
Michele Vallisneri,
Sarah J. Vigeland,
Ken D. Olum,
Xavier Siemens,
Justin A. Ellis,
Rutger van Haasteren,
Sophie Hourihane,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Bence Bécsy,
J. Andrew Casey-Clyde
, et al. (71 additional authors not shown)
Abstract:
This paper presents rigorous tests of pulsar timing array methods and software, examining their consistency across a wide range of injected parameters and signal strength. We discuss updates to the 15-year isotropic gravitational-wave background analyses and their corresponding code representations. Descriptions of the internal structure of the flagship algorithms \texttt{Enterprise} and \texttt{P…
▽ More
This paper presents rigorous tests of pulsar timing array methods and software, examining their consistency across a wide range of injected parameters and signal strength. We discuss updates to the 15-year isotropic gravitational-wave background analyses and their corresponding code representations. Descriptions of the internal structure of the flagship algorithms \texttt{Enterprise} and \texttt{PTMCMCSampler} are given to facilitate understanding of the PTA likelihood structure, how models are built, and what methods are currently used in sampling the high-dimensional PTA parameter space. We introduce a novel version of the PTA likelihood that uses a two-step marginalization procedure that performs much faster when the white noise parameters remain fixed. We perform stringent tests of consistency and correctness of the Bayesian and frequentist analysis software. For the Bayesian analysis, we test prior recovery, injection recovery, and Bayes factors. For the frequentist analysis, we test that the cross-correlation-based optimal statistic, when modified to account for a non-negligible gravitational-wave background, accurately recovers the amplitude of the background. We also summarize recent advances and tests performed on the optimal statistic in the literature from both GWB detection and parameter estimation perspectives. The tests presented here validate current and future analyses of PTA data.
△ Less
Submitted 7 July, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Bayesian Limits on Gravitational Waves from Individual Supermassive Black Hole Binaries
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Robin Case,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan DeCesar,
Paul B. Demorest,
Matthew C. Digman,
Timothy Dolch,
Brendan Drachler
, et al. (74 additional authors not shown)
Abstract:
Evidence for a low-frequency stochastic gravitational wave background has recently been reported based on analyses of pulsar timing array data. The most likely source of such a background is a population of supermassive black hole binaries, the loudest of which may be individually detected in these datasets. Here we present the search for individual supermassive black hole binaries in the NANOGrav…
▽ More
Evidence for a low-frequency stochastic gravitational wave background has recently been reported based on analyses of pulsar timing array data. The most likely source of such a background is a population of supermassive black hole binaries, the loudest of which may be individually detected in these datasets. Here we present the search for individual supermassive black hole binaries in the NANOGrav 15-year dataset. We introduce several new techniques, which enhance the efficiency and modeling accuracy of the analysis. The search uncovered weak evidence for two candidate signals, one with a gravitational-wave frequency of $\sim$4 nHz, and another at $\sim$170 nHz. The significance of the low-frequency candidate was greatly diminished when Hellings-Downs correlations were included in the background model. The high-frequency candidate was discounted due to the lack of a plausible host galaxy, the unlikely astrophysical prior odds of finding such a source, and since most of its support comes from a single pulsar with a commensurate binary period. Finding no compelling evidence for signals from individual binary systems, we place upper limits on the strain amplitude of gravitational waves emitted by such systems.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Search for Anisotropy in the Gravitational-Wave Background
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch,
Brendan Drachler,
Elizabeth C. Ferrara,
William Fiore
, et al. (68 additional authors not shown)
Abstract:
The North American Nanohertz Observatory for Gravitational Waves (NANOGrav) has reported evidence for the presence of an isotropic nanohertz gravitational wave background (GWB) in its 15 yr dataset. However, if the GWB is produced by a population of inspiraling supermassive black hole binary (SMBHB) systems, then the background is predicted to be anisotropic, depending on the distribution of these…
▽ More
The North American Nanohertz Observatory for Gravitational Waves (NANOGrav) has reported evidence for the presence of an isotropic nanohertz gravitational wave background (GWB) in its 15 yr dataset. However, if the GWB is produced by a population of inspiraling supermassive black hole binary (SMBHB) systems, then the background is predicted to be anisotropic, depending on the distribution of these systems in the local Universe and the statistical properties of the SMBHB population. In this work, we search for anisotropy in the GWB using multiple methods and bases to describe the distribution of the GWB power on the sky. We do not find significant evidence of anisotropy, and place a Bayesian $95\%$ upper limit on the level of broadband anisotropy such that $(C_{l>0} / C_{l=0}) < 20\%$. We also derive conservative estimates on the anisotropy expected from a random distribution of SMBHB systems using astrophysical simulations conditioned on the isotropic GWB inferred in the 15-yr dataset, and show that this dataset has sufficient sensitivity to probe a large fraction of the predicted level of anisotropy. We end by highlighting the opportunities and challenges in searching for anisotropy in pulsar timing array data.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Constraints on Supermassive Black Hole Binaries from the Gravitational Wave Background
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Alexander Bonilla,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
J. Andrew Casey-Clyde,
Maria Charisi,
Shami Chatterjee,
Katerina Chatziioannou,
Belinda D. Cheeseboro,
Siyuan Chen,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Curt J. Cutler
, et al. (89 additional authors not shown)
Abstract:
The NANOGrav 15-year data set shows evidence for the presence of a low-frequency gravitational-wave background (GWB). While many physical processes can source such low-frequency gravitational waves, here we analyze the signal as coming from a population of supermassive black hole (SMBH) binaries distributed throughout the Universe. We show that astrophysically motivated models of SMBH binary popul…
▽ More
The NANOGrav 15-year data set shows evidence for the presence of a low-frequency gravitational-wave background (GWB). While many physical processes can source such low-frequency gravitational waves, here we analyze the signal as coming from a population of supermassive black hole (SMBH) binaries distributed throughout the Universe. We show that astrophysically motivated models of SMBH binary populations are able to reproduce both the amplitude and shape of the observed low-frequency gravitational-wave spectrum. While multiple model variations are able to reproduce the GWB spectrum at our current measurement precision, our results highlight the importance of accurately modeling binary evolution for producing realistic GWB spectra. Additionally, while reasonable parameters are able to reproduce the 15-year observations, the implied GWB amplitude necessitates either a large number of parameters to be at the edges of expected values, or a small number of parameters to be notably different from standard expectations. While we are not yet able to definitively establish the origin of the inferred GWB signal, the consistency of the signal with astrophysical expectations offers a tantalizing prospect for confirming that SMBH binaries are able to form, reach sub-parsec separations, and eventually coalesce. As the significance grows over time, higher-order features of the GWB spectrum will definitively determine the nature of the GWB and allow for novel constraints on SMBH populations.
△ Less
Submitted 18 July, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Search for Signals from New Physics
Authors:
Adeela Afzal,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Jose Juan Blanco-Pillado,
Laura Blecha,
Kimberly K. Boddy,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
Maria Charisi,
Shami Chatterjee,
Katerina Chatziioannou,
Belinda D. Cheeseboro,
Siyuan Chen,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie
, et al. (98 additional authors not shown)
Abstract:
The 15-year pulsar timing data set collected by the North American Nanohertz Observatory for Gravitational Waves (NANOGrav) shows positive evidence for the presence of a low-frequency gravitational-wave (GW) background. In this paper, we investigate potential cosmological interpretations of this signal, specifically cosmic inflation, scalar-induced GWs, first-order phase transitions, cosmic string…
▽ More
The 15-year pulsar timing data set collected by the North American Nanohertz Observatory for Gravitational Waves (NANOGrav) shows positive evidence for the presence of a low-frequency gravitational-wave (GW) background. In this paper, we investigate potential cosmological interpretations of this signal, specifically cosmic inflation, scalar-induced GWs, first-order phase transitions, cosmic strings, and domain walls. We find that, with the exception of stable cosmic strings of field theory origin, all these models can reproduce the observed signal. When compared to the standard interpretation in terms of inspiraling supermassive black hole binaries (SMBHBs), many cosmological models seem to provide a better fit resulting in Bayes factors in the range from 10 to 100. However, these results strongly depend on modeling assumptions about the cosmic SMBHB population and, at this stage, should not be regarded as evidence for new physics. Furthermore, we identify excluded parameter regions where the predicted GW signal from cosmological sources significantly exceeds the NANOGrav signal. These parameter constraints are independent of the origin of the NANOGrav signal and illustrate how pulsar timing data provide a new way to constrain the parameter space of these models. Finally, we search for deterministic signals produced by models of ultralight dark matter (ULDM) and dark matter substructures in the Milky Way. We find no evidence for either of these signals and thus report updated constraints on these models. In the case of ULDM, these constraints outperform torsion balance and atomic clock constraints for ULDM coupled to electrons, muons, or gluons.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-Year Data Set: Detector Characterization and Noise Budget
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. Decesar,
Paul B. Demorest,
Timothy Dolch,
Brendan Drachler,
Elizabeth C. Ferrara,
William Fiore,
Emmanuel Fonseca
, et al. (66 additional authors not shown)
Abstract:
Pulsar timing arrays (PTAs) are galactic-scale gravitational wave detectors. Each individual arm, composed of a millisecond pulsar, a radio telescope, and a kiloparsecs-long path, differs in its properties but, in aggregate, can be used to extract low-frequency gravitational wave (GW) signals. We present a noise and sensitivity analysis to accompany the NANOGrav 15-year data release and associated…
▽ More
Pulsar timing arrays (PTAs) are galactic-scale gravitational wave detectors. Each individual arm, composed of a millisecond pulsar, a radio telescope, and a kiloparsecs-long path, differs in its properties but, in aggregate, can be used to extract low-frequency gravitational wave (GW) signals. We present a noise and sensitivity analysis to accompany the NANOGrav 15-year data release and associated papers, along with an in-depth introduction to PTA noise models. As a first step in our analysis, we characterize each individual pulsar data set with three types of white noise parameters and two red noise parameters. These parameters, along with the timing model and, particularly, a piecewise-constant model for the time-variable dispersion measure, determine the sensitivity curve over the low-frequency GW band we are searching. We tabulate information for all of the pulsars in this data release and present some representative sensitivity curves. We then combine the individual pulsar sensitivities using a signal-to-noise-ratio statistic to calculate the global sensitivity of the PTA to a stochastic background of GWs, obtaining a minimum noise characteristic strain of $7\times 10^{-15}$ at 5 nHz. A power law-integrated analysis shows rough agreement with the amplitudes recovered in NANOGrav's 15-year GW background analysis. While our phenomenological noise model does not model all known physical effects explicitly, it provides an accurate characterization of the noise in the data while preserving sensitivity to multiple classes of GW signals.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Observations and Timing of 68 Millisecond Pulsars
Authors:
Gabriella Agazie,
Md Faisal Alam,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Laura Blecha,
Victoria Bonidie,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Bence Bécsy,
Christopher Chapman,
Maria Charisi,
Shami Chatterjee,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Megan E. DeCesar,
Paul B. Demorest,
Timothy Dolch,
Brendan Drachler
, et al. (75 additional authors not shown)
Abstract:
We present observations and timing analyses of 68 millisecond pulsars (MSPs) comprising the 15-year data set of the North American Nanohertz Observatory for Gravitational Waves (NANOGrav). NANOGrav is a pulsar timing array (PTA) experiment that is sensitive to low-frequency gravitational waves. This is NANOGrav's fifth public data release, including both "narrowband" and "wideband" time-of-arrival…
▽ More
We present observations and timing analyses of 68 millisecond pulsars (MSPs) comprising the 15-year data set of the North American Nanohertz Observatory for Gravitational Waves (NANOGrav). NANOGrav is a pulsar timing array (PTA) experiment that is sensitive to low-frequency gravitational waves. This is NANOGrav's fifth public data release, including both "narrowband" and "wideband" time-of-arrival (TOA) measurements and corresponding pulsar timing models. We have added 21 MSPs and extended our timing baselines by three years, now spanning nearly 16 years for some of our sources. The data were collected using the Arecibo Observatory, the Green Bank Telescope, and the Very Large Array between frequencies of 327 MHz and 3 GHz, with most sources observed approximately monthly. A number of notable methodological and procedural changes were made compared to our previous data sets. These improve the overall quality of the TOA data set and are part of the transition to new pulsar timing and PTA analysis software packages. For the first time, our data products are accompanied by a full suite of software to reproduce data reduction, analysis, and results. Our timing models include a variety of newly detected astrometric and binary pulsar parameters, including several significant improvements to pulsar mass constraints. We find that the time series of 23 pulsars contain detectable levels of red noise, 10 of which are new measurements. In this data set, we find evidence for a stochastic gravitational-wave background.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
The NANOGrav 15-year Data Set: Evidence for a Gravitational-Wave Background
Authors:
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Becsy,
Laura Blecha,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
Maria Charisi,
Shami Chatterjee,
Katerina Chatziioannou,
Belinda D. Cheeseboro,
Siyuan Chen,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie,
Kathryn Crowter,
Curt J. Cutler,
Megan E. DeCesar
, et al. (89 additional authors not shown)
Abstract:
We report multiple lines of evidence for a stochastic signal that is correlated among 67 pulsars from the 15-year pulsar-timing data set collected by the North American Nanohertz Observatory for Gravitational Waves. The correlations follow the Hellings-Downs pattern expected for a stochastic gravitational-wave background. The presence of such a gravitational-wave background with a power-law-spectr…
▽ More
We report multiple lines of evidence for a stochastic signal that is correlated among 67 pulsars from the 15-year pulsar-timing data set collected by the North American Nanohertz Observatory for Gravitational Waves. The correlations follow the Hellings-Downs pattern expected for a stochastic gravitational-wave background. The presence of such a gravitational-wave background with a power-law-spectrum is favored over a model with only independent pulsar noises with a Bayes factor in excess of $10^{14}$, and this same model is favored over an uncorrelated common power-law-spectrum model with Bayes factors of 200-1000, depending on spectral modeling choices. We have built a statistical background distribution for these latter Bayes factors using a method that removes inter-pulsar correlations from our data set, finding $p = 10^{-3}$ (approx. $3σ$) for the observed Bayes factors in the null no-correlation scenario. A frequentist test statistic built directly as a weighted sum of inter-pulsar correlations yields $p = 5 \times 10^{-5} - 1.9 \times 10^{-4}$ (approx. $3.5 - 4σ$). Assuming a fiducial $f^{-2/3}$ characteristic-strain spectrum, as appropriate for an ensemble of binary supermassive black-hole inspirals, the strain amplitude is $2.4^{+0.7}_{-0.6} \times 10^{-15}$ (median + 90% credible interval) at a reference frequency of 1/(1 yr). The inferred gravitational-wave background amplitude and spectrum are consistent with astrophysical expectations for a signal from a population of supermassive black-hole binaries, although more exotic cosmological and astrophysical sources cannot be excluded. The observation of Hellings-Downs correlations points to the gravitational-wave origin of this signal.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Probing spectral and timing properties of the X-ray pulsar RX J0440.9+4431 in the giant outburst of 2022-2023
Authors:
Manoj Mandal,
Rahul Sharma,
Sabyasachi Pal,
G. K. Jaisawal,
Keith C. Gendreau,
Mason Ng,
Andrea Sanna,
Christian Malacaria,
Francesco Tombesi,
E. C. Ferrara,
Craig B. Markwardt,
Michael T. Wolff,
Joel B. Coley
Abstract:
The X-ray pulsar RX J0440.9+4431 went through a giant outburst in 2022 and reached a record-high flux of 2.3 Crab, as observed by Swift/BAT. We study the evolution of different spectral and timing properties of the source using NICER observations. The pulse period is found to decrease from 208 s to 205 s, and the pulse profile evolves significantly with energy and luminosity. The hardness ratio an…
▽ More
The X-ray pulsar RX J0440.9+4431 went through a giant outburst in 2022 and reached a record-high flux of 2.3 Crab, as observed by Swift/BAT. We study the evolution of different spectral and timing properties of the source using NICER observations. The pulse period is found to decrease from 208 s to 205 s, and the pulse profile evolves significantly with energy and luminosity. The hardness ratio and hardness intensity diagram (HID) show remarkable evolution during the outburst. The HID turns towards the diagonal branch from the horizontal branch above a transition (critical) luminosity, suggesting the presence of two accretion modes. Each NICER spectrum can be described using a cutoff power law with a blackbody component and a Gaussian at 6.4 keV. At higher luminosities, an additional Gaussian at 6.67 keV is used. The observed photon index shows negative and positive correlations with X-ray flux below and above the critical luminosity, respectively. The evolution of spectral and timing parameters suggests a possible change in the emission mechanism and beaming pattern of the pulsar depending on the spectral transition to sub- and super-critical accretion regimes. Based on the critical luminosity, the magnetic field of the neutron star can be estimated in the order of 10$^{12}$ or 10$^{13}$ G, assuming different theoretical models. Moreover, the observed iron emission line evolves from a narrow to a broad feature with luminosity. Two emission lines originating from neutral and highly ionized Fe atoms were evident in the spectra around 6.4 keV and 6.67 keV (higher luminosities).
△ Less
Submitted 14 September, 2023; v1 submitted 31 May, 2023;
originally announced June 2023.