Search | arXiv e-print repository

scores: A Python package for verifying and evaluating models and predictions with xarray

Authors: Tennessee Leeuwenburg, Nicholas Loveday, Elizabeth E. Ebert, Harrison Cook, Mohammadreza Khanarmuei, Robert J. Taggart, Nikeeth Ramanathan, Maree Carroll, Stephanie Chong, Aidan Griffiths, John Sharples

Abstract: `scores` is a Python package containing mathematical functions for the verification, evaluation and optimisation of forecasts, predictions or models. It supports labelled n-dimensional (multidimensional) data, which is used in many scientific fields and in machine learning. At present, `scores` primarily supports the geoscience communities; in particular, the meteorological, climatological and oce… ▽ More `scores` is a Python package containing mathematical functions for the verification, evaluation and optimisation of forecasts, predictions or models. It supports labelled n-dimensional (multidimensional) data, which is used in many scientific fields and in machine learning. At present, `scores` primarily supports the geoscience communities; in particular, the meteorological, climatological and oceanographic communities. `scores` not only includes common scores (e.g., Mean Absolute Error), it also includes novel scores not commonly found elsewhere (e.g., FIxed Risk Multicategorical (FIRM) score, Flip-Flop Index), complex scores (e.g., threshold-weighted continuous ranked probability score), and statistical tests (such as the Diebold Mariano test). It also contains isotonic regression which is becoming an increasingly important tool in forecast verification and can be used to generate stable reliability diagrams. Additionally, it provides pre-processing tools for preparing data for scores in a variety of formats including cumulative distribution functions (CDF). At the time of writing, `scores` includes over 50 metrics, statistical techniques and data processing tools. All of the scores and statistical techniques in this package have undergone a thorough scientific and software review. Every score has a companion Jupyter Notebook tutorial that demonstrates its use in practice. `scores` supports `xarray` datatypes, allowing it to work with Earth system data in a range of formats including NetCDF4, HDF5, Zarr and GRIB among others. `scores` uses Dask for scaling and performance. Support for `pandas` is being introduced. The `scores` software repository can be found at https://github.com/nci/scores/ △ Less

Submitted 3 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

Comments: Minor revisions to text and table. Updated title. 6 pages, 1 table. Software repository at https://github.com/nci/scores/

arXiv:2405.17713 [pdf, other]

AI Alignment with Changing and Influenceable Reward Functions

Authors: Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan

Abstract: Existing AI alignment approaches assume that preferences are static, which is unrealistic: our preferences change, and may even be influenced by our interactions with AI systems themselves. To clarify the consequences of incorrectly assuming static preferences, we introduce Dynamic Reward Markov Decision Processes (DR-MDPs), which explicitly model preference changes and the AI's influence on them.… ▽ More Existing AI alignment approaches assume that preferences are static, which is unrealistic: our preferences change, and may even be influenced by our interactions with AI systems themselves. To clarify the consequences of incorrectly assuming static preferences, we introduce Dynamic Reward Markov Decision Processes (DR-MDPs), which explicitly model preference changes and the AI's influence on them. We show that despite its convenience, the static-preference assumption may undermine the soundness of existing alignment techniques, leading them to implicitly reward AI systems for influencing user preferences in ways users may not truly want. We then explore potential solutions. First, we offer a unifying perspective on how an agent's optimization horizon may partially help reduce undesirable AI influence. Then, we formalize different notions of AI alignment that account for preference change from the outset. Comparing the strengths and limitations of 8 such notions of alignment, we find that they all either err towards causing undesirable AI influence, or are overly risk-averse, suggesting that a straightforward solution to the problems of changing preferences may not exist. As there is no avoiding grappling with changing preferences in real-world settings, this makes it all the more important to handle these issues with care, balancing risks and capabilities. We hope our work can provide conceptual clarity and constitute a first step towards AI alignment practices which explicitly account for (and contend with) the changing and influenceable nature of human preferences. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: Accepted to ICML 2024

arXiv:2404.18429 [pdf]

The Jive Verification System and its Transformative Impact on Weather Forecasting Operations

Authors: Nicholas Loveday, Deryn Griffiths, Tennessee Leeuwenburg, Robert Taggart, Thomas C. Pagano, George Cheng, Kevin Plastow, Elizabeth Ebert, Cassandra Templeton, Maree Carroll, Mohammadreza Khanarmuei, Isha Nagpal

Abstract: Forecast verification is critical for continuous improvement in meteorological organizations. The Jive verification system was originally developed to assess the accuracy of public weather forecasts issued by the Australian Bureau of Meteorology. It started as a research project in 2015 and gradually evolved into the operational verification system that went live in 2022. The system includes daily… ▽ More Forecast verification is critical for continuous improvement in meteorological organizations. The Jive verification system was originally developed to assess the accuracy of public weather forecasts issued by the Australian Bureau of Meteorology. It started as a research project in 2015 and gradually evolved into the operational verification system that went live in 2022. The system includes daily verification dashboards for forecasters to visualize recent forecast performance and "Evidence Targeted Automation" dashboards for exploring the performance of competing forecast systems. Additionally, there is a Jupyter Notebook server with the Jive Python library which supports research experiments, case studies, and the development of new verification metrics and tools. This paper shows how the Jive verification project helped bring verification to the forefront at the Bureau of Meteorology, leading to more accurate, streamlined forecasts. Jive has been used to provide evidence for forecast automation decisions and has helped to understand the evolving role of meteorologists in the forecast process. It has given operational meteorologists tools for evaluating forecast processes, including identifying when and how manual interventions lead to superior predictions. The project also led to new verification science, including novel metrics that are decision-focused, including for extreme conditions. Additionally, Jive has provided the Bureau with an enterprise-wide data analysis environment and has prompted a clarification of forecast definitions. These collective impacts have resulted in more accurate forecasts, ultimately benefiting society, and building trust with forecast users. These positive outcomes highlight the importance of meteorological organizations investing in verification science and technology. △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.14305 [pdf, other]

"I Upload...All Types of Different Things to Say, the World of Blindness Is More Than What They Think It Is": A Study of Blind TikTokers' Identity Work from a Flourishing Perspective

Authors: Yao Lyu, Jie Cai, Bryan Dosono, Davis Yadav, John M. Carroll

Abstract: Identity work in Human-Computer Interaction (HCI) has focused on the marginalized group to explore designs to support their asset (what they have). However, little has been explored specifically on the identity work of people with disabilities, specifically, visual impairments. In this study, we interviewed 45 BlindTokers (blind users on TikTok) from various backgrounds to understand their identit… ▽ More Identity work in Human-Computer Interaction (HCI) has focused on the marginalized group to explore designs to support their asset (what they have). However, little has been explored specifically on the identity work of people with disabilities, specifically, visual impairments. In this study, we interviewed 45 BlindTokers (blind users on TikTok) from various backgrounds to understand their identity work from a positive design perspective. We found that BlindTokers leverage the affordance of the platform to create positive content, share their identities, and build the community with the desire to flourish. We proposed flourishing labor to present the work conducted by BlindTokers for their community's flourishing with implications to support the flourishing labor. This work contributes to understanding blind users' experience in short video platforms and highlights that flourishing is not just an activity for any single Blind user but also a job that needs all stakeholders, including all user groups and the TikTok platform, serious and committed contribution. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: ACM CSCW

arXiv:2402.11016 [pdf, other]

Holographic phenomenology via overlap** degrees of freedom

Authors: Oliver Friedrich, ChunJun Cao, Sean M. Carroll, Gong Cheng, Ashmeet Singh

Abstract: The holographic principle suggests that regions of space contain fewer physical degrees of freedom than would be implied by conventional quantum field theory. Meanwhile, in Hilbert spaces of large dimension $2^n$, it is possible to define $N \gg n$ Pauli algebras that are nearly anti-commuting (but not quite) and which can be thought of as "overlap** degrees of freedom". We propose to model the… ▽ More The holographic principle suggests that regions of space contain fewer physical degrees of freedom than would be implied by conventional quantum field theory. Meanwhile, in Hilbert spaces of large dimension $2^n$, it is possible to define $N \gg n$ Pauli algebras that are nearly anti-commuting (but not quite) and which can be thought of as "overlap** degrees of freedom". We propose to model the phenomenology of holographic theories by allowing field-theory modes to be overlap**, and derive potential observational consequences. In particular, we build a Fermionic quantum field whose effective degrees of freedom approximately obey area scaling and satisfy a cosmic Bekenstein bound, and compare predictions of that model to cosmic neutrino observations. Our implementation of holography implies a finite lifetime of plane waves, which depends on the overall UV cutoff of the theory. To allow for neutrino flux from blazar TXS 0506+056 to be observable, our model needs to have a cutoff $k_{\mathrm{UV}} \lesssim 500\, k_{\mathrm{LHC}}\,$. This is broadly consistent with current bounds on the energy spectrum of cosmic neutrinos from IceCube, but high energy neutrinos are a potential challenge for our model of holography. We motivate our construction via quantum mereology, i.e. using the idea that EFT degrees of freedom should emerge from an abstract theory of quantum gravity by finding quasi-classical Hilbert space decompositions. We also discuss how to extend the framework to Bosons. Finally, using results from random matrix theory we derive an analytical understanding of the energy spectrum of our theory. The numerical tools used in this work are publicly available within the GPUniverse package, https://github.com/OliverFHD/GPUniverse . △ Less

Submitted 5 March, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

Comments: 46 pages + appendix; code and data available at https://github.com/OliverFHD/GPUniverse

arXiv:2402.08203 [pdf, other]

Subsystem surface and compass code sensitivities to non-identical infidelity distributions on heavy-hex lattice

Authors: Malcolm S. Carroll, James R. Wootton, Andrew W. Cross

Abstract: Logical qubits encoded into a quantum code exhibit improved error rates when the physical error rates are sufficiently low, below the pseudothreshold. Logical error rates and pseudothresholds can be estimated for specific circuits and noise models, and these estimates provide approximate goals for qubit performance. However, estimates often assume uniform error rates, while real devices have stati… ▽ More Logical qubits encoded into a quantum code exhibit improved error rates when the physical error rates are sufficiently low, below the pseudothreshold. Logical error rates and pseudothresholds can be estimated for specific circuits and noise models, and these estimates provide approximate goals for qubit performance. However, estimates often assume uniform error rates, while real devices have static and/or dynamic distributions of non-identical error rates and may exhibit outliers. These distributions make it more challenging to evaluate, compare, and rank the expected performance of quantum processors. We numerically investigate how the logical error rate depends on parameters of the noise distribution for the subsystem surface code and the compass code on a subdivided hexagonal lattice. Three notable observations are found: (1) the average logical error rate depends on the average of the physical qubit infidelity distribution without sensitivity to higher moments (e.g., variance or outliers) for a wide parameter range; (2) the logical error rate saturates as errors increase at one or a few "bad" locations; and (3) a decoder that is aware of location specific error rates modestly improves the logical error rate. We discuss the implications of these results in the context of several different practical sources of outliers and non-uniform qubit error rates. △ Less

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2401.15222 [pdf, other]

Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection

Authors: Abdullateef I. Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M. Carroll, Estera Crisan, William Bradford, Lauren Walter, Eaton Ellen, Sue S. Feldman, John D. Osborne

Abstract: Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier. Methods: We develop and evaluate a multi-task tr… ▽ More Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier. Methods: We develop and evaluate a multi-task transformer architecture design where modifiers are learned and predicted jointly using the publicly available SemEval 2015 Task 14 corpus and a new Opioid Use Disorder (OUD) data set that contains modifiers shared with SemEval as well as novel modifiers specific for OUD. We evaluate the effectiveness of our multi-task learning approach versus previously published systems and assess the feasibility of transfer learning for clinical entity modifiers when only a portion of clinical modifiers are shared. Results: Our approach achieved state-of-the-art results on the ShARe corpus from SemEval 2015 Task 14, showing an increase of 1.1% on weighted accuracy, 1.7% on unweighted accuracy, and 10% on micro F1 scores. Conclusions: We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers △ Less

Submitted 5 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 18 pages, 2 figures, 6 tables. To be submitted to the Journal of Biomedical Semantics

arXiv:2401.12521 [pdf, ps, other]

doi 10.1007/978-3-031-57860-1_6

Exploring Virtual Reality through Ihde's Instrumental Realism

Authors: He Zhang, John M. Carroll

Abstract: Based on Ihde's theory, this paper explores the relationship between virtual reality (VR) as an instrument and phenomenology. It reviews the "technological revolution" spurred by the development of VR technology and discusses how VR has been used to study subjective experience, explore perception and embodiment, enhance empathy and perspective, and investigate altered states of consciousness. The… ▽ More Based on Ihde's theory, this paper explores the relationship between virtual reality (VR) as an instrument and phenomenology. It reviews the "technological revolution" spurred by the development of VR technology and discusses how VR has been used to study subjective experience, explore perception and embodiment, enhance empathy and perspective, and investigate altered states of consciousness. The paper emphasizes the role of VR as an instrumental technology, particularly its ability to expand human perception and cognition. Reflecting on this in conjunction with the work of Husserl and Ihde, among others, it revisits the potential of VR to provide new avenues for scientific inquiry and experience and to transform our understanding of the world through VR. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: Accepted to iConference 2024 as a short paper

arXiv:2401.12133 [pdf, other]

VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games

Authors: He Zhang, Xinyang Li, Yuanxi Sun, Xinyi Fu, Christine Qiu, John M. Carroll

Abstract: Understanding and recognizing emotions are important and challenging issues in the metaverse era. Understanding, identifying, and predicting fear, which is one of the fundamental human emotions, in virtual reality (VR) environments plays an essential role in immersive game development, scene development, and next-generation virtual human-computer interaction applications. In this article, we used… ▽ More Understanding and recognizing emotions are important and challenging issues in the metaverse era. Understanding, identifying, and predicting fear, which is one of the fundamental human emotions, in virtual reality (VR) environments plays an essential role in immersive game development, scene development, and next-generation virtual human-computer interaction applications. In this article, we used VR horror games as a medium to analyze fear emotions by collecting multi-modal data (posture, audio, and physiological signals) from 23 players. We used an LSTM-based model to predict fear with accuracies of 65.31% and 90.47% under 6-level classification (no fear and five different levels of fear) and 2-level classification (no fear and fear), respectively. We constructed a multi-modal natural behavior dataset of immersive human fear responses (VRMN-bD) and compared it with existing relevant advanced datasets. The results show that our dataset has fewer limitations in terms of collection method, data scale and audience scope. We are unique and advanced in targeting multi-modal datasets of fear and behavior in VR stand-up interactive environments. Moreover, we discussed the implications of this work for communities and applications. The dataset and pre-trained model are available at https://github.com/KindOPSTAR/VRMN-bD. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: Accepted to IEEE VR 2024

arXiv:2401.11663 [pdf, other]

doi 10.1145/3613904.3642148

"I Got Flagged for Supposed Bullying, Even Though It Was in Response to Someone Harassing Me About My Disability.": A Study of Blind TikTokers' Content Moderation Experiences

Authors: Yao Lyu, Jie Cai, Anisa Callis, Kelley Cotter, John M. Carroll

Abstract: The Human-Computer Interaction (HCI) community has consistently focused on the experiences of users moderated by social media platforms. Recently, scholars have noticed that moderation practices could perpetuate biases, resulting in the marginalization of user groups undergoing moderation. However, most studies have primarily addressed marginalization related to issues such as racism or sexism, wi… ▽ More The Human-Computer Interaction (HCI) community has consistently focused on the experiences of users moderated by social media platforms. Recently, scholars have noticed that moderation practices could perpetuate biases, resulting in the marginalization of user groups undergoing moderation. However, most studies have primarily addressed marginalization related to issues such as racism or sexism, with little attention given to the experiences of people with disabilities. In this paper, we present a study on the moderation experiences of blind users on TikTok, also known as "BlindToker," to address this gap. We conducted semi-structured interviews with 20 BlindTokers and used thematic analysis to analyze the data. Two main themes emerged: BlindTokers' situated content moderation experiences and their reactions to content moderation. We reported on the lack of accessibility on TikTok's platform, contributing to the moderation and marginalization of BlindTokers. Additionally, we discovered instances of harassment from trolls that prompted BlindTokers to respond with harsh language, triggering further moderation. We discussed these findings in the context of the literature on moderation, marginalization, and transformative justice, seeking solutions to address such issues. △ Less

Submitted 21 January, 2024; originally announced January 2024.

Comments: 24 paged, 1 Figure, accepted by CHI'24

arXiv:2401.11317 [pdf, other]

doi 10.1145/3613904.3642787

Third-Party Developers and Tool Development For Community Management on Live Streaming Platform Twitch

Authors: Jie Cai, Ya-Fang Lin, He Zhang, John M. Carroll

Abstract: Community management is critical for stakeholders to collaboratively build and sustain communities with socio-technical support. However, most of the existing research has mainly focused on the community members and the platform, with little attention given to the developers who act as intermediaries between the platform and community members and develop tools to support community management. This… ▽ More Community management is critical for stakeholders to collaboratively build and sustain communities with socio-technical support. However, most of the existing research has mainly focused on the community members and the platform, with little attention given to the developers who act as intermediaries between the platform and community members and develop tools to support community management. This study focuses on third-party developers (TPDs) for the live streaming platform Twitch and explores their tool development practices. Using a mixed method with in-depth qualitative analysis, we found that TPDs maintain complex relationships with different stakeholders (streamers, viewers, platform, professional developers), and the multi-layered policy restricts their agency regarding idea innovation and tool development. We argue that HCI research should shift its focus from tool users to tool developers with regard to community management. We propose designs to support closer collaboration between TPDS and the platform and professional developers and streamline TPDs' development process with unified toolkits and policy documentation. △ Less

Submitted 17 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

Comments: Accepted by ACM CHI 2024

arXiv:2312.16697 [pdf, other]

Multi-channel Sensor Network Construction, Data Fusion and Challenges for Smart Home

Authors: He Zhang, Robin Ananda, Xinyi Fu, Zhe Sun, Xiaoyu Wang, Keqi Chen, John M. Carroll

Abstract: Both sensor networks and data fusion are essential foundations for develo** the smart home Internet of Things (IoT) and related fields. We proposed a multi-channel sensor network construction method involving hardware, acquisition, and synchronization in the smart home environment and a smart home data fusion method (SHDFM) for multi-modal data (position, gait, voice, pose, facial expression, te… ▽ More Both sensor networks and data fusion are essential foundations for develo** the smart home Internet of Things (IoT) and related fields. We proposed a multi-channel sensor network construction method involving hardware, acquisition, and synchronization in the smart home environment and a smart home data fusion method (SHDFM) for multi-modal data (position, gait, voice, pose, facial expression, temperature, and humidity) generated in the smart home environment to address the configuration of a multi-channel sensor network, improve the quality and efficiency of various human activities and environmental data collection, and reduce the difficulty of multi-modal data fusion in the smart home. SHDFM contains 5 levels, with inputs and outputs as criteria to provide recommendations for multi-modal data fusion strategies in the smart home. We built a real experimental environment using the proposed method in this paper. To validate our method, we created a real experimental environment - a physical setup in a home-like scenario where the multi-channel sensor network and data fusion techniques were deployed and evaluated. The acceptance and testing results show that the proposed construction and data fusion methods can be applied to the examples with high robustness, replicability, and scalability. Besides, we discuss how smart homes with multi-channel sensor networks can support digital twins. △ Less

Submitted 27 December, 2023; originally announced December 2023.

Comments: 8 pages, accepted by CHCHI2023

arXiv:2312.12338 [pdf, other]

Smart Connected Farms and Networked Farmers to Tackle Climate Challenges Impacting Agricultural Production

Authors: Behzad J. Balabaygloo, Barituka Bekee, Samuel W. Blair, Suzanne Fey, Fateme Fotouhi, Ashish Gupta, Kevin Menke, Anusha Vangala, Jorge C. M. Palomares, Aaron Prestholt, Vishesh K. Tanwar, Xu Tao, Matthew E. Carroll, Sajal Das, Gil Depaula, Peter Kyveryga, Soumik Sarkar, Michelle Segovia, Simone Sylvestri, Corinne Valdivia, Asheesh K. Singh

Abstract: To meet the grand challenges of agricultural production including climate change impacts on crop production, a tight integration of social science, technology and agriculture experts including farmers are needed. There are rapid advances in information and communication technology, precision agriculture and data analytics, which are creating a fertile field for the creation of smart connected farm… ▽ More To meet the grand challenges of agricultural production including climate change impacts on crop production, a tight integration of social science, technology and agriculture experts including farmers are needed. There are rapid advances in information and communication technology, precision agriculture and data analytics, which are creating a fertile field for the creation of smart connected farms (SCF) and networked farmers. A network and coordinated farmer network provides unique advantages to farmers to enhance farm production and profitability, while tackling adverse climate events. The aim of this article is to provide a comprehensive overview of the state of the art in SCF including the advances in engineering, computer sciences, data sciences, social sciences and economics including data privacy, sharing and technology adoption. △ Less

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2311.05933 [pdf, other]

Benchmarking Quantum Processor Performance at Scale

Authors: David C. McKay, Ian Hincks, Emily J. Pritchett, Malcolm Carroll, Luke C. G. Govia, Seth T. Merkel

Abstract: As quantum processors grow, new performance benchmarks are required to capture the full quality of the devices at scale. While quantum volume is an excellent benchmark, it focuses on the highest quality subset of the device and so is unable to indicate the average performance over a large number of connected qubits. Furthermore, it is a discrete pass/fail and so is not reflective of continuous imp… ▽ More As quantum processors grow, new performance benchmarks are required to capture the full quality of the devices at scale. While quantum volume is an excellent benchmark, it focuses on the highest quality subset of the device and so is unable to indicate the average performance over a large number of connected qubits. Furthermore, it is a discrete pass/fail and so is not reflective of continuous improvements in hardware nor does it provide quantitative direction to large-scale algorithms. For example, there may be value in error mitigated Hamiltonian simulation at scale with devices unable to pass strict quantum volume tests. Here we discuss a scalable benchmark which measures the fidelity of a connecting set of two-qubit gates over $N$ qubits by measuring gate errors using simultaneous direct randomized benchmarking in disjoint layers. Our layer fidelity can be easily related to algorithmic run time, via $γ$ defined in Ref.\cite{berg2022probabilistic} that can be used to estimate the number of circuits required for error mitigation. The protocol is efficient and obtains all the pair rates in the layered structure. Compared to regular (isolated) RB this approach is sensitive to crosstalk. As an example we measure a $N=80~(100)$ qubit layer fidelity on a 127 qubit fixed-coupling "Eagle" processor (ibm\_sherbrooke) of 0.26(0.19) and on the 133 qubit tunable-coupling "Heron" processor (ibm\_montecarlo) of 0.61(0.26). This can easily be expressed as a layer size independent quantity, error per layered gate (EPLG), which is here $1.7\times10^{-2}(1.7\times10^{-2})$ for ibm\_sherbrooke and $6.2\times10^{-3}(1.2\times10^{-2})$ for ibm\_montecarlo. △ Less

Submitted 10 November, 2023; originally announced November 2023.

Comments: 15 pages, 8 figures (including appendices)

arXiv:2310.07154 [pdf, other]

doi 10.1145/3637297

"Because Some Sighted People, They Don't Know What the Heck You're Talking About:" A Study of Blind TikTokers' Infrastructuring Work to Build Independence

Authors: Yao Lyu, John M. Carroll

Abstract: There has been extensive research on the experiences of individuals with visual impairments on text- and image-based social media platforms, such as Facebook and Twitter. However, little is known about the experiences of visually impaired users on short-video platforms like TikTok. To bridge this gap, we conducted an interview study with 30 BlindTokers (the nickname of blind TikTokers). Our study… ▽ More There has been extensive research on the experiences of individuals with visual impairments on text- and image-based social media platforms, such as Facebook and Twitter. However, little is known about the experiences of visually impaired users on short-video platforms like TikTok. To bridge this gap, we conducted an interview study with 30 BlindTokers (the nickname of blind TikTokers). Our study aimed to explore the various activities of BlindTokers on TikTok, including everyday entertainment, professional development, and community engagement. The widespread usage of TikTok among participants demonstrated that they considered TikTok and its associated experiences as the infrastructure for their activities. Additionally, participants reported experiencing breakdowns in this infrastructure due to accessibility issues. They had to carry out infrastructuring work to resolve the breakdowns. Blind users' various practices on TikTok also foregrounded their perceptions of independence. We then discussed blind users' nuanced understanding of the TikTok-mediated independence; we also critically examined BlindTokers' infrastructuring work for such independence. △ Less

Submitted 11 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: Accepted at CSCW'24, 29 pages, 2 figures, and 2 tables

arXiv:2310.07061 [pdf, other]

QualiGPT: GPT as an easy-to-use tool for qualitative coding

Authors: He Zhang, Chuhao Wu, **gyi Xie, ChanMin Kim, John M. Carroll

Abstract: Qualitative research delves deeply into individual complex perspectives on technology and various phenomena. However, a meticulous analysis of qualitative data often requires a significant amount of time, especially during the crucial coding stage. Although there is software specifically designed for qualitative evaluation, many of these platforms fall short in terms of automatic coding, intuitive… ▽ More Qualitative research delves deeply into individual complex perspectives on technology and various phenomena. However, a meticulous analysis of qualitative data often requires a significant amount of time, especially during the crucial coding stage. Although there is software specifically designed for qualitative evaluation, many of these platforms fall short in terms of automatic coding, intuitive usability, and cost-effectiveness. With the rise of Large Language Models (LLMs) such as GPT-3 and its successors, we are at the forefront of a transformative era for enhancing qualitative analysis. In this paper, we introduce QualiGPT, a specialized tool designed after considering challenges associated with ChatGPT and qualitative analysis. It harnesses the capabilities of the Generative Pretrained Transformer (GPT) and its API for thematic analysis of qualitative data. By comparing traditional manual coding with QualiGPT's analysis on both simulated and actual datasets, we verify that QualiGPT not only refines the qualitative analysis process but also elevates its transparency, credibility, and accessibility. Notably, compared to existing analytical platforms, QualiGPT stands out with its intuitive design, significantly reducing the learning curve and operational barriers for users. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 25 pages, 7 figures, 1 table, under review

arXiv:2309.10771 [pdf, other]

Redefining Qualitative Analysis in the AI Era: Utilizing ChatGPT for Efficient Thematic Analysis

Authors: He Zhang, Chuhao Wu, **gyi Xie, Yao Lyu, Jie Cai, John M. Carroll

Abstract: AI tools, particularly large-scale language model (LLM) based applications such as ChatGPT, have the potential to simplify qualitative research. Through semi-structured interviews with seventeen participants, we identified challenges and concerns in integrating ChatGPT into the qualitative analysis process. Collaborating with thirteen qualitative researchers, we developed a framework for designing… ▽ More AI tools, particularly large-scale language model (LLM) based applications such as ChatGPT, have the potential to simplify qualitative research. Through semi-structured interviews with seventeen participants, we identified challenges and concerns in integrating ChatGPT into the qualitative analysis process. Collaborating with thirteen qualitative researchers, we developed a framework for designing prompts to enhance the effectiveness of ChatGPT in thematic analysis. Our findings indicate that improving transparency, providing guidance on prompts, and strengthening users' understanding of LLMs' capabilities significantly enhance the users' ability to interact with ChatGPT. We also discovered and revealed the reasons behind researchers' shift in attitude towards ChatGPT from negative to positive. This research not only highlights the importance of well-designed prompts in LLM applications but also offers reflections for qualitative researchers on the perception of AI's role. Finally, we emphasize the potential ethical risks and the impact of constructing AI ethical expectations by researchers, particularly those who are novices, on future research and AI development. △ Less

Submitted 27 May, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

arXiv:2308.14014 [pdf, other]

Reconnecting An International Travel Network: The Personal Infrastructuring Work of International Travelers in A Multi-facet Crisis

Authors: Yao Lyu, He Zhang, John M. Carroll

Abstract: In times of crisis, international travel becomes tenuous and anxiety provoking. The crisis informatics and Human-Computer Interaction (HCI) community has paid increasing attention to the use of Information and Communication Technologies (ICTs) in various crisis settings. However, little is known about the travelers' actual experiences in whole trips in crises. In this paper, we bridge the gap by p… ▽ More In times of crisis, international travel becomes tenuous and anxiety provoking. The crisis informatics and Human-Computer Interaction (HCI) community has paid increasing attention to the use of Information and Communication Technologies (ICTs) in various crisis settings. However, little is known about the travelers' actual experiences in whole trips in crises. In this paper, we bridge the gap by presenting a study on Chinese travelers' encounters in their international journeys to the US during a multifacet crisis and their use of ICTs to overcome difficulties in the journeys. We interviewed 22 Chinese travelers who had successfully come to the US during the crisis. The findings showed how travelers improvised to reconnect the broken international travel infrastructure. We also discuss the findings with the literature on infrastructure, and crisis informatics, and provide design implications for travel authorities and agencies. △ Less

Submitted 27 August, 2023; originally announced August 2023.

arXiv:2307.15217 [pdf, other]

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Authors: Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen , et al. (7 additional authors not shown)

Abstract: Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and rel… ▽ More Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and related methods; (2) overview techniques to understand, improve, and complement RLHF in practice; and (3) propose auditing and disclosure standards to improve societal oversight of RLHF systems. Our work emphasizes the limitations of RLHF and highlights the importance of a multi-faceted approach to the development of safer AI systems. △ Less

Submitted 11 September, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

arXiv:2307.11927 [pdf, other]

Completely Discretized, Finite Quantum Mechanics

Authors: Sean M. Carroll

Abstract: I propose a version of quantum mechanics featuring a discrete and finite number of states that is plausibly a model of the real world. The model is based on standard unitary quantum theory of a closed system with a finite-dimensional Hilbert space. Given certain simple conditions on the spectrum of the Hamiltonian, Schrödinger evolution is periodic, and it is straightforward to replace continuous… ▽ More I propose a version of quantum mechanics featuring a discrete and finite number of states that is plausibly a model of the real world. The model is based on standard unitary quantum theory of a closed system with a finite-dimensional Hilbert space. Given certain simple conditions on the spectrum of the Hamiltonian, Schrödinger evolution is periodic, and it is straightforward to replace continuous time with a discrete version, with the result that the system only visits a discrete and finite set of state vectors. The biggest challenges to the viability of such a model come from cosmological considerations. The theory may have implications for questions of mathematical realism and finitism. △ Less

Submitted 1 November, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

arXiv:2307.05425 [pdf, other]

Axions and Cosmic Magnetic Fields

Authors: George B. Field, Sean M. Carroll

Abstract: We argue that if axions are the dark matter, their coupling to electromagnetism results in exponential growth of a helical magnetic field when the axion field first rolls down its potential. After an inverse cascade, the relevant length scales to day are of order 10-100 kpc, of astrophysical interest. Our mechanism for allowing the field to grow relies on a nuance of MHD. Faraday's Law says that a… ▽ More We argue that if axions are the dark matter, their coupling to electromagnetism results in exponential growth of a helical magnetic field when the axion field first rolls down its potential. After an inverse cascade, the relevant length scales to day are of order 10-100 kpc, of astrophysical interest. Our mechanism for allowing the field to grow relies on a nuance of MHD. Faraday's Law says that an electric field is needed to create a magnetic field. Previous authors relied on conventional Ohm's law to calculate E, but the resistivity is negligible and therefore they assume E is as well. We use a modified Ohm's Law that includes the effects of self-induction in limiting the current driven by a given E, which allows a magnetic field to grow. △ Less

Submitted 14 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

arXiv:2306.09309 [pdf, other]

Who Needs to Know? Minimal Knowledge for Optimal Coordination

Authors: Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell

Abstract: To optimally coordinate with others in cooperative games, it is often crucial to have information about one's collaborators: successful driving requires understanding which side of the road to drive on. However, not every feature of collaborators is strategically relevant: the fine-grained acceleration of drivers may be ignored while maintaining optimal coordination. We show that there is a well-d… ▽ More To optimally coordinate with others in cooperative games, it is often crucial to have information about one's collaborators: successful driving requires understanding which side of the road to drive on. However, not every feature of collaborators is strategically relevant: the fine-grained acceleration of drivers may be ignored while maintaining optimal coordination. We show that there is a well-defined dichotomy between strategically relevant and irrelevant information. Moreover, we show that, in dynamic games, this dichotomy has a compact representation that can be efficiently computed via a Bellman backup operator. We apply this algorithm to analyze the strategically relevant information for tasks in both a standard and a partially observable version of the Overcooked environment. Theoretical and empirical results show that our algorithms are significantly more efficient than baselines. Videos are available at https://minknowledge.github.io. △ Less

Submitted 13 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: To be published at ICML 2023

ACM Class: I.2.6; I.2.11

arXiv:2305.16941 [pdf, other]

Engagement, User Satisfaction, and the Amplification of Divisive Content on Social Media

Authors: Smitha Milli, Micah Carroll, Yike Wang, Sashrika Pandey, Sebastian Zhao, Anca D. Dragan

Abstract: In a pre-registered randomized experiment, we found that, relative to a reverse-chronological baseline, Twitter's engagement-based ranking algorithm amplifies emotionally charged, out-group hostile content that users say makes them feel worse about their political out-group. Furthermore, we find that users do not prefer the political tweets selected by the algorithm, suggesting that the engagement… ▽ More In a pre-registered randomized experiment, we found that, relative to a reverse-chronological baseline, Twitter's engagement-based ranking algorithm amplifies emotionally charged, out-group hostile content that users say makes them feel worse about their political out-group. Furthermore, we find that users do not prefer the political tweets selected by the algorithm, suggesting that the engagement-based algorithm underperforms in satisfying users' stated preferences. Finally, we explore the implications of an alternative approach that ranks content based on users' stated preferences and find a reduction in angry, partisan, and out-group hostile content but also a potential reinforcement of echo chambers. The evidence underscores the necessity for a more nuanced approach to content ranking that balances engagement, users' stated preferences, and sociopolitical outcomes. △ Less

Submitted 22 December, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.09933 [pdf, other]

Impact of ROS 2 Node Composition in Robotic Systems

Authors: Steve Macenski, Alberto Soragna, Michael Carroll, Zhenpeng Ge

Abstract: The Robot Operating System 2 (ROS 2) is the second generation of ROS representing a step forward in the robotic framework. Several new types of nodes and executor models are integral to control where, how, and when information is processed in the computational graph. This paper explores and benchmarks one of these new node types -- the Component node -- which allows nodes to be composed manually o… ▽ More The Robot Operating System 2 (ROS 2) is the second generation of ROS representing a step forward in the robotic framework. Several new types of nodes and executor models are integral to control where, how, and when information is processed in the computational graph. This paper explores and benchmarks one of these new node types -- the Component node -- which allows nodes to be composed manually or dynamically into processes while retaining separation of concerns in a codebase for distributed development. Composition is shown to achieve a high degree of performance optimization, particularly valuable for resource-constrained systems and sensor processing pipelines, enabling distributed tasks that would not be otherwise possible in ROS 2. In this work, we briefly introduce the significance and design of node composition, then our contribution of benchmarking is provided to analyze its impact on robotic systems. Its compelling influence on performance is shown through several experiments on the latest Long Term Support (LTS) ROS 2 distribution, Humble Hawksbill. △ Less

Submitted 16 May, 2023; originally announced May 2023.

Comments: IEEE Robotics and Automation Letters, 2023

arXiv:2303.09387 [pdf, other]

Characterizing Manipulation from AI Systems

Authors: Micah Carroll, Alan Chan, Henry Ashton, David Krueger

Abstract: Manipulation is a common concern in many domains, such as social media, advertising, and chatbots. As AI systems mediate more of our interactions with the world, it is important to understand the degree to which AI systems might manipulate humans without the intent of the system designers. Our work clarifies challenges in defining and measuring manipulation in the context of AI systems. Firstly, w… ▽ More Manipulation is a common concern in many domains, such as social media, advertising, and chatbots. As AI systems mediate more of our interactions with the world, it is important to understand the degree to which AI systems might manipulate humans without the intent of the system designers. Our work clarifies challenges in defining and measuring manipulation in the context of AI systems. Firstly, we build upon prior literature on manipulation from other fields and characterize the space of possible notions of manipulation, which we find to depend upon the concepts of incentives, intent, harm, and covertness. We review proposals on how to operationalize each factor. Second, we propose a definition of manipulation based on our characterization: a system is manipulative if it acts as if it were pursuing an incentive to change a human (or another agent) intentionally and covertly. Third, we discuss the connections between manipulation and related concepts, such as deception and coercion. Finally, we contextualize our operationalization of manipulation in some applications. Our overall assessment is that while some progress has been made in defining and measuring manipulation from AI systems, many gaps remain. In the absence of a consensus definition and reliable tools for measurement, we cannot rule out the possibility that AI systems learn to manipulate humans without the intent of the system designers. We argue that such manipulation poses a significant threat to human autonomy, suggesting that precautionary actions to mitigate it are warranted. △ Less

Submitted 30 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: Presented at EAAMO 2023; The first two authors contributed equally; author order was decided with a coin flip

arXiv:2303.08813 [pdf, other]

doi 10.3847/1538-4357/acc402

A High Fraction of Heavily X-ray Obscured Active Galactic Nuclei

Authors: Christopher M. Carroll, Tonima T. Ananna, Ryan C. Hickox, Alberto Masini, Roberto J. Assef, Daniel Stern, Chien-Ting J. Chen, Lauranne Lanz

Abstract: We present new estimates on the fraction of heavily X-ray obscured, Compton-thick (CT) active galactic nuclei (AGNs) out to a redshift of $z \leq$ 0.8. From a sample of 540 AGNs selected by mid-IR (MIR) properties in observed X-ray survey fields, we forward model the observed-to-intrinsic X-ray luminosity ratio ($R_{L_{\text{X}}}$) with a Markov chain Monte Carlo (MCMC) simulation to estimate the… ▽ More We present new estimates on the fraction of heavily X-ray obscured, Compton-thick (CT) active galactic nuclei (AGNs) out to a redshift of $z \leq$ 0.8. From a sample of 540 AGNs selected by mid-IR (MIR) properties in observed X-ray survey fields, we forward model the observed-to-intrinsic X-ray luminosity ratio ($R_{L_{\text{X}}}$) with a Markov chain Monte Carlo (MCMC) simulation to estimate the total fraction of CT AGNs ($f_{\text{CT}}$), many of which are missed in typical X-ray observations. We create model $N_{\text{H}}$ distributions and convert these to $R_{L_{\text{X}}}$ using a set of X-ray spectral models. We probe the posterior distribution of our models to infer the population of X-ray non-detected sources. From our simulation we estimate a CT fraction of $f_{\text{CT}}$ = $\text{0.555}^{+\text{0.037}}_{-\text{0.032}}$. We perform an X-ray stacking analysis for sources in Chandra X-ray Observatory fields and find that the expected soft (0.5-2 keV) and hard (2-7 keV) observed fluxes drawn from our model to be within 0.48 and 0.12 dex of our stacked fluxes, respectively. Our results suggests at least 50% of all MIR-selected AGNs, possibly more, are Compton-thick ($N_{\text{H}} \gtrsim$ 10$^{\text{24}}$ cm$^{-\text{2}}$), which is in excellent agreement with other recent work using independent methods. This work indicates that the total number of AGNs is higher than can be identified using X-ray observations alone, highlighting the importance of a multiwavelength approach. A high $f_{\text{CT}}$ also has implications for black hole (BH) accretion physics and supports models of BH and galaxy co-evolution that include periods of heavy obscuration. △ Less

Submitted 15 March, 2023; originally announced March 2023.

Comments: 14 pages, 6 figures, 1 table, plus appendix figures. Accepted for publication in ApJ

arXiv:2302.10329 [pdf, other]

doi 10.1145/3593013.3594033

Harms from Increasingly Agentic Algorithmic Systems

Authors: Alan Chan, Rebecca Salganik, Alva Markelius, Chris Pang, Nitarshan Rajkumar, Dmitrii Krasheninnikov, Lauro Langosco, Zhonghao He, Yawen Duan, Micah Carroll, Michelle Lin, Alex Mayhew, Katherine Collins, Maryam Molamohammadi, John Burden, Wanru Zhao, Shalaleh Rismani, Konstantinos Voudouris, Umang Bhatt, Adrian Weller, David Krueger, Tegan Maharaj

Abstract: Research in Fairness, Accountability, Transparency, and Ethics (FATE) has established many sources and forms of algorithmic harm, in domains as diverse as health care, finance, policing, and recommendations. Much work remains to be done to mitigate the serious harms of these systems, particularly those disproportionately affecting marginalized communities. Despite these ongoing harms, new systems… ▽ More Research in Fairness, Accountability, Transparency, and Ethics (FATE) has established many sources and forms of algorithmic harm, in domains as diverse as health care, finance, policing, and recommendations. Much work remains to be done to mitigate the serious harms of these systems, particularly those disproportionately affecting marginalized communities. Despite these ongoing harms, new systems are being developed and deployed which threaten the perpetuation of the same harms and the creation of novel ones. In response, the FATE community has emphasized the importance of anticipating harms. Our work focuses on the anticipation of harms from increasingly agentic systems. Rather than providing a definition of agency as a binary property, we identify 4 key characteristics which, particularly in combination, tend to increase the agency of a given algorithmic system: underspecification, directness of impact, goal-directedness, and long-term planning. We also discuss important harms which arise from increasing agency -- notably, these include systemic and/or long-range impacts, often on marginalized stakeholders. We emphasize that recognizing agency of algorithmic systems does not absolve or shift the human responsibility for algorithmic harms. Rather, we use the term agency to highlight the increasingly evident fact that ML systems are not fully under human control. Our work explores increasingly agentic algorithmic systems in three parts. First, we explain the notion of an increase in agency for algorithmic systems in the context of diverse perspectives on agency across disciplines. Second, we argue for the need to anticipate harms from increasingly agentic systems. Third, we discuss important harms from increasingly agentic systems and ways forward for addressing them. We conclude by reflecting on implications of our work for anticipating algorithmic harms from emerging systems. △ Less

Submitted 11 May, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

Comments: Accepted at FAccT 2023

arXiv:2302.08824 [pdf, other]

doi 10.1103/PhysRevA.107.063710

Coherence build up and laser thresholds from nanolasers to macroscopic lasers

Authors: Mark Anthony Carroll, Giampaolo D'Alessandro, Gian Luca Lippi, Gian-Luca Oppo, Francesco Papoff

Abstract: We detail the derivation of nanolaser models that include coherent and incoherent variables and predict the existence of a laser threshold, irrespective of cavity size and emitter number, for both single- and multi-electron systems. The growth in photon number in the lasing mode is driven by an increase in correlation between absorption and emission processes, leading to the onset of self-sustaine… ▽ More We detail the derivation of nanolaser models that include coherent and incoherent variables and predict the existence of a laser threshold, irrespective of cavity size and emitter number, for both single- and multi-electron systems. The growth in photon number in the lasing mode is driven by an increase in correlation between absorption and emission processes, leading to the onset of self-sustained stimulated emission (laser threshold), followed, in turn, by a correlation decrease and ending with the dominance of coherent emission. The first-order coherence $g^{(1)}$ steadily increases, as the pump grows towards the laser threshold value, and reaches unity at or beyond threshold. The transition toward coherent emission becomes increasingly sharp as the number of emitters and of the coupled electromagnetic cavity modes increase, continuously connecting, in the thermodynamic limit, the physics of nano- and macroscopic lasers at threshold. Our predictions are in remarkable agreement with experiments whose first-order coherence measurements have so far been explained only phenomenologically. A consistent evaluation of different threshold indicators provides a tool for a correct interpretation of experimental measurements at the onset of laser action. △ Less

Submitted 17 February, 2023; originally announced February 2023.

Comments: 11 pages, 5 figures

arXiv:2302.07343 [pdf, other]

Agile and Versatile Robot Locomotion via Kernel-based Residual Learning

Authors: Milo Carroll, Zhaocheng Liu, Mohammadreza Kasaei, Zhibin Li

Abstract: This work developed a kernel-based residual learning framework for quadrupedal robotic locomotion. Initially, a kernel neural network is trained with data collected from an MPC controller. Alongside a frozen kernel network, a residual controller network is trained via reinforcement learning to acquire generalized locomotion skills and resilience against external perturbations. With this proposed f… ▽ More This work developed a kernel-based residual learning framework for quadrupedal robotic locomotion. Initially, a kernel neural network is trained with data collected from an MPC controller. Alongside a frozen kernel network, a residual controller network is trained via reinforcement learning to acquire generalized locomotion skills and resilience against external perturbations. With this proposed framework, a robust quadrupedal locomotion controller is learned with high sample efficiency and controllability, providing omnidirectional locomotion at continuous velocities. Its versatility and robustness are validated on unseen terrains that the expert MPC controller fails to traverse. Furthermore, the learned kernel can produce a range of functional locomotion behaviors and can generalize to unseen gaits. △ Less

Submitted 14 February, 2023; originally announced February 2023.

arXiv:2212.00169 [pdf, other]

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

Authors: David Zhang, Micah Carroll, Andreea Bobu, Anca Dragan

Abstract: One of the most successful paradigms for reward learning uses human feedback in the form of comparisons. Although these methods hold promise, human comparison labeling is expensive and time consuming, constituting a major bottleneck to their broader applicability. Our insight is that we can greatly improve how effectively human time is used in these approaches by batching comparisons together, rat… ▽ More One of the most successful paradigms for reward learning uses human feedback in the form of comparisons. Although these methods hold promise, human comparison labeling is expensive and time consuming, constituting a major bottleneck to their broader applicability. Our insight is that we can greatly improve how effectively human time is used in these approaches by batching comparisons together, rather than having the human label each comparison individually. To do so, we leverage data dimensionality-reduction and visualization techniques to provide the human with a interactive GUI displaying the state space, in which the user can label subportions of the state space. Across some simple Mujoco tasks, we show that this high-level approach holds promise and is able to greatly increase the performance of the resulting agents, provided the same amount of human labeling time. △ Less

Submitted 30 November, 2022; originally announced December 2022.

Comments: Presented at the NeurIPS 2022 Human in the Loop Learning (HiLL) Workshop

arXiv:2211.10869 [pdf, other]

UniMASK: Unified Inference in Sequential Decision Problems

Authors: Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Abstract: Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision-making, where many well-studied tasks like behavior cloning, offline reinforcement learning, inverse dynamics, and waypoint conditioning correspond to different sequenc… ▽ More Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision-making, where many well-studied tasks like behavior cloning, offline reinforcement learning, inverse dynamics, and waypoint conditioning correspond to different sequence maskings over a sequence of states, actions, and returns. We introduce the UniMASK framework, which provides a unified way to specify models which can be trained on many different sequential decision-making tasks. We show that a single UniMASK model is often capable of carrying out many tasks with performance similar to or better than single-task models. Additionally, after fine-tuning, our UniMASK models consistently outperform comparable single-task models. Our code is publicly available at https://github.com/micahcarroll/uniMASK. △ Less

Submitted 19 November, 2022; originally announced November 2022.

Comments: NeurIPS 2022 (Oral). A prior version was published at an ICML Workshop, available at arXiv:2204.13326

arXiv:2211.07128 [pdf, other]

doi 10.3847/1538-4365/acbfba

The Young Supernova Experiment Data Release 1 (YSE DR1): Light Curves and Photometric Classification of 1975 Supernovae

Authors: P. D. Aleo, K. Malanchev, S. Sharief, D. O. Jones, G. Narayan, R. J. Foley, V. A. Villar, C. R. Angus, V. F. Baldassare, M. J. Bustamante-Rosell, D. Chatterjee, C. Cold, D. A. Coulter, K. W. Davis, S. Dhawan, M. R. Drout, A. Engel, K. D. French, A. Gagliano, C. Gall, J. Hjorth, M. E. Huber, W. V. Jacobson-Galán, C. D. Kilpatrick, D. Langeroodi , et al. (58 additional authors not shown)

Abstract: We present the Young Supernova Experiment Data Release 1 (YSE DR1), comprised of processed multi-color Pan-STARRS1 (PS1) griz and Zwicky Transient Facility (ZTF) gr photometry of 1975 transients with host-galaxy associations, redshifts, spectroscopic/photometric classifications, and additional data products from 2019 November 24 to 2021 December 20. YSE DR1 spans discoveries and observations from… ▽ More We present the Young Supernova Experiment Data Release 1 (YSE DR1), comprised of processed multi-color Pan-STARRS1 (PS1) griz and Zwicky Transient Facility (ZTF) gr photometry of 1975 transients with host-galaxy associations, redshifts, spectroscopic/photometric classifications, and additional data products from 2019 November 24 to 2021 December 20. YSE DR1 spans discoveries and observations from young and fast-rising supernovae (SNe) to transients that persist for over a year, with a redshift distribution reaching z~0.5. We present relative SN rates from YSE's magnitude- and volume-limited surveys, which are consistent with previously published values within estimated uncertainties for untargeted surveys. We combine YSE and ZTF data, and create multi-survey SN simulations to train the ParSNIP and SuperRAENN photometric classification algorithms; when validating our ParSNIP classifier on 472 spectroscopically classified YSE DR1 SNe, we achieve 82% accuracy across three SN classes (SNe Ia, II, Ib/Ic) and 90% accuracy across two SN classes (SNe Ia, core-collapse SNe). Our classifier performs particularly well on SNe Ia, with high (>90%) individual completeness and purity, which will help build an anchor photometric SNe Ia sample for cosmology. We then use our photometric classifier to characterize our photometric sample of 1483 SNe, labeling 1048 (~71%) SNe Ia, 339 (~23%) SNe II, and 96 (~6%) SNe Ib/Ic. YSE DR1 provides a training ground for building discovery, anomaly detection, and classification algorithms, performing cosmological analyses, understanding the nature of red and rare transients, exploring tidal disruption events and nuclear variability, and preparing for the forthcoming Vera C. Rubin Observatory Legacy Survey of Space and Time. △ Less

Submitted 21 February, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

Comments: Accepted to ApJS; 64 pages; 35 figures; 10 tables

arXiv:2211.01602 [pdf, other]

Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

Authors: Mesut Yang, Micah Carroll, Anca Dragan

Abstract: AI agents designed to collaborate with people benefit from models that enable them to anticipate human behavior. However, realistic models tend to require vast amounts of human data, which is often hard to collect. A good prior or initialization could make for more data-efficient training, but what makes for a good prior on human behavior? Our work leverages a very simple assumption: people genera… ▽ More AI agents designed to collaborate with people benefit from models that enable them to anticipate human behavior. However, realistic models tend to require vast amounts of human data, which is often hard to collect. A good prior or initialization could make for more data-efficient training, but what makes for a good prior on human behavior? Our work leverages a very simple assumption: people generally act closer to optimal than to random chance. We show that using optimal behavior as a prior for human models makes these models vastly more data-efficient and able to generalize to new environments. Our intuition is that such a prior enables the training to focus one's precious real-world data on capturing the subtle nuances of human suboptimality, instead of on the basics of how to do the task in the first place. We also show that using these improved human models often leads to better human-AI collaboration performance compared to using models based on real human data alone. △ Less

Submitted 19 November, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

Comments: Presented at the NeurIPS 2022 Human in the Loop Learning (HiLL) Workshop

arXiv:2210.16381 [pdf, other]

Not Another Day Zero: Design Hackathons for Community-Based Water Quality Monitoring

Authors: Srishti Gupta, Chun-Hua Tsai, John M. Carroll

Abstract: This study looks at water quality monitoring and management as a new form of community engagement. Through a series of a unique research method called `design hackathons', we engaged with a hyperlocal community of citizens who are actively involved in monitoring and management of their local watershed. These design hackathons sought to understand the motivation, practices, collaboration and experi… ▽ More This study looks at water quality monitoring and management as a new form of community engagement. Through a series of a unique research method called `design hackathons', we engaged with a hyperlocal community of citizens who are actively involved in monitoring and management of their local watershed. These design hackathons sought to understand the motivation, practices, collaboration and experiences of these citizens. Qualitative analysis of data revealed the nature of the complex stakeholder network, workflow practices, initiatives to engage with a larger community, current state of technological infrastructure being used, and innovative design scenarios proposed by the hackathon participants. Based on this comprehensive analysis, we conceptualize water quality monitoring and management as community-based monitoring and management, and water data as community data. Such a conceptualization sheds light on how these practices can help in preempting water crisis by empowering citizens through increased awareness, active participation and informal learning of water data and resources. △ Less

Submitted 28 October, 2022; originally announced October 2022.

Comments: 21 pages, 3 figures, 3 tables

arXiv:2210.04780 [pdf, other]

doi 10.1103/PRXQuantum.4.020356

TLS Dynamics in a Superconducting Qubit Due to Background Ionizing Radiation

Authors: Ted Thorbeck, Andrew Eddins, Isaac Lauer, Douglas T. McClure, Malcolm Carroll

Abstract: Superconducting qubit lifetimes must be both long and stable to provide an adequate foundation for quantum computing. This stability is imperiled by two-level systems (TLSs), currently a dominant loss mechanism, which exhibit slow spectral dynamics that destabilize qubit lifetimes on hour timescales. Stability is also threatened at millisecond timescales, where ionizing radiation has recently been… ▽ More Superconducting qubit lifetimes must be both long and stable to provide an adequate foundation for quantum computing. This stability is imperiled by two-level systems (TLSs), currently a dominant loss mechanism, which exhibit slow spectral dynamics that destabilize qubit lifetimes on hour timescales. Stability is also threatened at millisecond timescales, where ionizing radiation has recently been found to cause bursts of correlated multi-qubit decays, complicating quantum error correction. Here we study both ionizing radiation and TLS dynamics on a 27-qubit processor, repurposing the standard transmon qubits as sensors of both radiation impacts and TLS dynamics. Unlike prior literature, we observe resilience of the qubit lifetimes to the transient quasiparticles generated by the impact of radiation. However, we also observe a new interaction between these two processes, "TLS scrambling," in which a radiation impact causes multiple TLSs to jump in frequency, which we suggest is due to the same charge rearrangement sensed by qubits near a radiation impact. As TLS scrambling brings TLSs out of or in to resonance with the qubit, the lifetime of the qubit increases or decreases. Our findings thus identify radiation as a new contribution to fluctuations in qubit lifetimes, with implications for efforts to characterize and improve device stability △ Less

Submitted 10 October, 2022; originally announced October 2022.

Comments: 14 pages, 10 figures

Journal ref: PRX Quantum 4, 020356, 2023

arXiv:2210.01647 [pdf, other]

Codeless App Development: Evaluating A Cloud-Native Domain-Specific Functions Approach

Authors: Chuhao Wu, Jose Miguel Perez-Alvarez, Adrian Mos, John M. Carroll

Abstract: Mobile applications play an important role in the economy today and there is an increasing trend for app enablement on multiple platforms. However, creating, distributing, and maintaining an application remain expert tasks. Even for software developers, the process can be error-prone and resource-consuming, especially when targeting different platforms simultaneously. Researchers have proposed sev… ▽ More Mobile applications play an important role in the economy today and there is an increasing trend for app enablement on multiple platforms. However, creating, distributing, and maintaining an application remain expert tasks. Even for software developers, the process can be error-prone and resource-consuming, especially when targeting different platforms simultaneously. Researchers have proposed several frameworks to facilitate cross-platform app development, but little attention has been paid to non-technical users. In this paper, we described the Flow framework, which takes the advantage of domain-specific languages to enable no-code specification for app modeling. The cloud-native coordination mechanism further supports non-technical users to execute, monitor, and maintain apps for any target platforms. User evaluations were conducted to assess the usability and user experience with the system. The results indicated that users can develop apps in Flow with ease, but the prototype could be optimized to reduce learning time and workload. △ Less

Submitted 4 October, 2022; originally announced October 2022.

arXiv:2209.00018 [pdf, other]

A fast rising tidal disruption event from a candidate intermediate mass black hole

Authors: C. R. Angus, V. F. Baldassare, B. Mockler, R. J. Foley, E. Ramirez-Ruiz, S. I. Raimundo, K. D. French, K. Auchettl, H. Pfister, C. Gall, J. Hjorth, M. R. Drout, K. D. Alexander, G. Dimitriadis, T. Hung, D. O. Jones, A. Rest, M. R. Siebert, K. Taggart, G. Terreran, S. Tinyanont, C. M. Carroll, L. DeMarchi, N. Earl, A. Gagliano , et al. (14 additional authors not shown)

Abstract: Massive black holes (BHs) at the centres of massive galaxies are ubiquitous. The population of BHs within dwarf galaxies, on the other hand, is evasive. Dwarf galaxies are thought to harbour BHs with proportionally small masses, including intermediate mass BHs, with masses $10^{2} < M_{BH} < 10^{6} M_{\odot}$. Identification of these systems has historically relied upon the detection of light emit… ▽ More Massive black holes (BHs) at the centres of massive galaxies are ubiquitous. The population of BHs within dwarf galaxies, on the other hand, is evasive. Dwarf galaxies are thought to harbour BHs with proportionally small masses, including intermediate mass BHs, with masses $10^{2} < M_{BH} < 10^{6} M_{\odot}$. Identification of these systems has historically relied upon the detection of light emitted from accreting gaseous discs close to the BHs. Without this light, they are difficult to detect. Tidal disruption events (TDEs), the luminous flares produced when a star strays close to a BH and is shredded, are a direct way to probe massive BHs. The rise times of these flares theoretically correlate with the BH mass. Here we present AT2020neh, a fast rising TDE candidate, hosted by a dwarf galaxy. AT2020neh can be described by the tidal disruption of a main sequence star by a 10$^{4.7} - 10^{5.9} M_{\odot}$ BH. We find the observable rate of fast rising nuclear transients like AT2020neh to be rare, at $\lesssim 2 \times 10^{-8}$ events Mpc$^{-3}$ yr$^{-1}$. Finding non-accreting BHs in dwarf galaxies is important to determine how prevalent BHs are within these galaxies, and constrain models of BH formation. AT2020neh-like events may provide a galaxy-independent method of measuring IMBH masses. △ Less

Submitted 5 September, 2022; v1 submitted 31 August, 2022; originally announced September 2022.

Comments: Accepted for publication in Nature Astronomy

arXiv:2207.10631 [pdf, other]

Robust incorporation in multi-donor patches created using atomic-precision advanced manufacturing

Authors: Quinn Campbell, Justine C. Koepke, Jeffrey A. Ivie, Andrew M. Mounce, Daniel R. Ward, Malcolm S. Carroll, Shashank Misra, Andrew D. Baczewski, Ezra Bussmann

Abstract: Atomic-precision advanced manufacturing enables the placement of dopant atoms within $\pm$1 lattice site in crystalline Si. However, it has recently been shown that reaction kinetics can introduce uncertainty in whether a single donor will incorporate at all in a minimal 3-dimer lithographic window. In this work, we explore the combined impact of lithographic variation and stochastic kinetics on P… ▽ More Atomic-precision advanced manufacturing enables the placement of dopant atoms within $\pm$1 lattice site in crystalline Si. However, it has recently been shown that reaction kinetics can introduce uncertainty in whether a single donor will incorporate at all in a minimal 3-dimer lithographic window. In this work, we explore the combined impact of lithographic variation and stochastic kinetics on P incorporation as the size of such a window is increased. We augment a kinetic model for PH$_3$ dissociation leading to P incorporation on Si(100)-2$\times$1 to include barriers for reactions across distinct dimer rows. Using this model, we demonstrate that even for a window consisting of 2$\times$3 silicon dimers, the probability that at least one donor incorporates is nearly unity. We also examine the impact of size of the lithographic window, finding that the incorporation fraction saturates to $δ$-layer like coverage as the circumference-to-area ratio approaches zero. We predict that this incorporation fraction depends strongly on the dosage of the precursor, and that the standard deviation of the number of incorporations scales as $\sim \sqrt{n}$, as would be expected for a series of largely independent incorporation events. Finally, we characterize an array of experimentally prepared multi-donor lithographic windows and use our kinetic model to study variability due to the observed lithographic roughness, predicting a negligible impact on incorporation statistics. We find good agreement between our model and the inferred incorporation in these windows from scanning tunneling microscope measurements, indicating the robustness of atomic-precision advanced manufacturing to errors in patterning for multi-donor patches. △ Less

Submitted 21 July, 2022; originally announced July 2022.

Comments: Main text 24 pages, 5 figures + Appendecies 8 pages, 3 figures

arXiv:2206.07760 [pdf, other]

doi 10.3390/e24081116

Multiscale methods for signal selection in single-cell data

Authors: Renee S. Hoekzema, Lewis Marsh, Otto Sumray, Thomas M. Carroll, Xin Lu, Helen M. Byrne, Heather A. Harrington

Abstract: Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters. These discrete analyses successfully determine cell types and markers; however, continuous variation within and between cell types may not be detected. We propose three topologically motivated mathematical methods for un… ▽ More Analysis of single-cell transcriptomics often relies on clustering cells and then performing differential gene expression (DGE) to identify genes that vary between these clusters. These discrete analyses successfully determine cell types and markers; however, continuous variation within and between cell types may not be detected. We propose three topologically motivated mathematical methods for unsupervised feature selection that consider discrete and continuous transcriptional patterns on an equal footing across multiple scales simultaneously. Eigenscores ($\text{eig}_i$) rank signals or genes based on their correspondence to low-frequency intrinsic patterning in the data using the spectral decomposition of the Laplacian graph. The multiscale Laplacian score (MLS) is an unsupervised method for locating relevant scales in data and selecting the genes that are coherently expressed at these respective scales. The persistent Rayleigh quotient (PRQ) takes data equipped with a filtration, allowing the separation of genes with different roles in a bifurcation process (e.g., pseudo-time). We demonstrate the utility of these techniques by applying them to published single-cell transcriptomics data sets. The methods validate previously identified genes and detect additional biologically meaningful genes with coherent expression patterns. By studying the interaction between gene signals and the geometry of the underlying space, the three methods give multidimensional rankings of the genes and visualisation of relationships between them. △ Less

Submitted 6 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: 32 pages, 15 figures, 1 table. Revised and published in Entropy, special issue Applications of Topological Data Analysis in the Life Sciences

Journal ref: Entropy 2022, 24(8), 1116

arXiv:2205.00547 [pdf, other]

doi 10.3847/1538-4357/ac715b

Variable AGN in the GALEX Time Domain Survey

Authors: Erik J. Wasleske, Vivienne F. Baldassare, Christopher M. Carroll

Abstract: We searched the Northern Hemisphere Fields of the GALEX Time-Domain Survey (TDS) for galaxies with UV variability indicative of active galactic nuclei (AGNs). We identified 48 high-probability candidate AGNs from a parent sample of 1819 galaxies in the NASA Sloan Atlas (NSA) catalog. We further characterized these systems using optical spectroscopic diagnostics, WISE IR color selection criteria, a… ▽ More We searched the Northern Hemisphere Fields of the GALEX Time-Domain Survey (TDS) for galaxies with UV variability indicative of active galactic nuclei (AGNs). We identified 48 high-probability candidate AGNs from a parent sample of 1819 galaxies in the NASA Sloan Atlas (NSA) catalog. We further characterized these systems using optical spectroscopic diagnostics, WISE IR color selection criteria, and spectral energy distribution (SED) modeling. Of the 48 candidates, eight were identified as AGNs from optical emission lines, two were identified by their IR colors, and 28 were identified through spectral energy decomposition. Observational biases of each selection method are discussed in connecting these AGNs subsamples to another. By selecting AGNs based on UV variability, we also identified six low-mass AGNs candidates, all of which would have been missed by spectroscopic selection. △ Less

Submitted 1 May, 2022; originally announced May 2022.

Comments: Resubmitted to The Astrophysical Journal

arXiv:2204.13326 [pdf, other]

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Authors: Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Abstract: Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision making, where many well-studied tasks like behavior cloning, offline RL, inverse dynamics, and waypoint conditioning correspond to different sequence maskings over a se… ▽ More Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision making, where many well-studied tasks like behavior cloning, offline RL, inverse dynamics, and waypoint conditioning correspond to different sequence maskings over a sequence of states, actions, and returns. We introduce the FlexiBiT framework, which provides a unified way to specify models which can be trained on many different sequential decision making tasks. We show that a single FlexiBiT model is simultaneously capable of carrying out many tasks with performance similar to or better than specialized models. Additionally, we show that performance can be further improved by fine-tuning our general model on specific tasks of interest. △ Less

Submitted 9 December, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

Comments: Superseded by arXiv:2211.10869

arXiv:2204.11966 [pdf, other]

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Authors: Micah Carroll, Anca Dragan, Stuart Russell, Dylan Hadfield-Menell

Abstract: The content that a recommender system (RS) shows to users influences them. Therefore, when choosing a recommender to deploy, one is implicitly also choosing to induce specific internal states in users. Even more, systems trained via long-horizon optimization will have direct incentives to manipulate users: in this work, we focus on the incentive to shift user preferences so they are easier to sati… ▽ More The content that a recommender system (RS) shows to users influences them. Therefore, when choosing a recommender to deploy, one is implicitly also choosing to induce specific internal states in users. Even more, systems trained via long-horizon optimization will have direct incentives to manipulate users: in this work, we focus on the incentive to shift user preferences so they are easier to satisfy. We argue that - before deployment - system designers should: estimate the shifts a recommender would induce; evaluate whether such shifts would be undesirable; and perhaps even actively optimize to avoid problematic shifts. These steps involve two challenging ingredients: estimation requires anticipating how hypothetical algorithms would influence user preferences if deployed - we do this by using historical user interaction data to train a predictive user model which implicitly contains their preference dynamics; evaluation and optimization additionally require metrics to assess whether such influences are manipulative or otherwise unwanted - we use the notion of "safe shifts", that define a trust region within which behavior is safe: for instance, the natural way in which users would shift without interference from the system could be deemed "safe". In simulated experiments, we show that our learned preference dynamics model is effective in estimating user preferences and how they would respond to new recommenders. Additionally, we show that recommenders that optimize for staying in the trust region can avoid manipulative behaviors while still generating engagement. △ Less

Submitted 14 July, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

Comments: Accepted to ICML 2022 (Spotlight)

Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:2686-2708, 2022

arXiv:2202.01365 [pdf, other]

Feasibility of Interactive 3D Map for Remote Sighted Assistance

Authors: **gyi Xie, Rui Yu, Sooyeon Lee, Yao Lyu, Syed Masum Billah, John M. Carroll

Abstract: Remote sighted assistance (RSA) has emerged as a conversational assistive technology, where remote sighted workers, i.e., agents, provide real-time assistance to users with vision impairments via video-chat-like communication. Researchers found that agents' lack of environmental knowledge, the difficulty of orienting users in their surroundings, and the inability to estimate distances from users'… ▽ More Remote sighted assistance (RSA) has emerged as a conversational assistive technology, where remote sighted workers, i.e., agents, provide real-time assistance to users with vision impairments via video-chat-like communication. Researchers found that agents' lack of environmental knowledge, the difficulty of orienting users in their surroundings, and the inability to estimate distances from users' camera feeds are key challenges to sighted agents. To address these challenges, researchers have suggested assisting agents with computer vision technologies, especially 3D reconstruction. This paper presents a high-fidelity prototype of such an RSA, where agents use interactive 3D maps with localization capability. We conducted a walkthrough study with thirteen agents and one user with simulated vision impairment using this prototype. The study revealed that, compared to baseline RSA, the agents were significantly faster in providing navigational assistance to users, and their mental workload was significantly reduced -- all indicate the feasibility and prospect of 3D maps in RSA. △ Less

Submitted 2 February, 2022; originally announced February 2022.

arXiv:2201.02468 [pdf, other]

Reply to the Comment on "Thermal, quantum antibunching and lasing thresholds from single emitters to macroscopic devices"

Authors: Mark Anthony Carroll, Giampaolo D'Alessandro, Gian Luca Lippi, Gian-Luca Oppo, Francesco Papoff

Abstract: We deconstruct and address a comment to Carroll et al. [Phys Rev Lett 126, 063902 (2021)] (PRL) that has been posted on arXiv appearing as two versions [arXiv:2106.15242v1] and [arXiv:2106.15242v2]. This comment claimed that a term in the model presented in the PRL had been incorrectly omitted and that, hence, the laser threshold predicted by the model in the PRL is unattainable. We show that the… ▽ More We deconstruct and address a comment to Carroll et al. [Phys Rev Lett 126, 063902 (2021)] (PRL) that has been posted on arXiv appearing as two versions [arXiv:2106.15242v1] and [arXiv:2106.15242v2]. This comment claimed that a term in the model presented in the PRL had been incorrectly omitted and that, hence, the laser threshold predicted by the model in the PRL is unattainable. We show that the term in question was correctly neglected because it represents collective effects that are not observable in the devices modelled in the PRL. Moreover, even if this term were to be included, the laser threshold would still be present, contrary to what was claimed in the comment. We conclude that the model presented in PRL is correct and that its results are innovative and of wide application in laser physics and quantum optics. △ Less

Submitted 7 January, 2022; originally announced January 2022.

Comments: 1 Figure

arXiv:2111.08741 [pdf, other]

Practical Guidance on Modeling Choices for the Virtual Twins Method

Authors: Chuyu Deng, David M. Vock, Dana M. Carroll, Jeffrey A. Boatman, Dorothy K. Hatsukami, Ning Leng, Joseph S. Koopmeiners

Abstract: Individuals can vary drastically in their response to the same treatment, and this heterogeneity has driven the push for more personalized medicine. Accurate and interpretable methods to identify subgroups that respond to the treatment differently from the population average are necessary to achieving this goal. The Virtual Twins (VT) method by Foster et al. \cite{Foster} is a highly cited and imp… ▽ More Individuals can vary drastically in their response to the same treatment, and this heterogeneity has driven the push for more personalized medicine. Accurate and interpretable methods to identify subgroups that respond to the treatment differently from the population average are necessary to achieving this goal. The Virtual Twins (VT) method by Foster et al. \cite{Foster} is a highly cited and implemented method for subgroup identification because of its intuitive framework. However, since its initial publication, many researchers still rely heavily on the authors' initial modeling suggestions without examining newer and more powerful alternatives. This leaves much of the potential of the method untapped. We comprehensively evaluate the performance of VT with different combinations of methods in each of its component steps, under a collection of linear and nonlinear problem settings. Our simulations show that the method choice for step 1 of VT is highly influential in the overall accuracy of the method, and Superlearner is a promising choice. We illustrate our findings by using VT to identify subgroups with heterogeneous treatment effects in a randomized, double-blind nicotine reduction trial. △ Less

Submitted 16 November, 2021; originally announced November 2021.

arXiv:2105.15201 [pdf, other]

Dynamics of superconducting qubit relaxation times

Authors: Malcolm Carroll, Sami Rosenblatt, Petar Jurcevic, Isaac Lauer, Abhinav Kandala

Abstract: Superconducting qubits are a leading candidate for quantum computing but display temporal fluctuations in their energy relaxation times T1. This introduces instabilities in multi-qubit device performance. Furthermore, autocorrelation in these time fluctuations introduces challenges for obtaining representative measures of T1 for process optimization and device screening. These T1 fluctuations are… ▽ More Superconducting qubits are a leading candidate for quantum computing but display temporal fluctuations in their energy relaxation times T1. This introduces instabilities in multi-qubit device performance. Furthermore, autocorrelation in these time fluctuations introduces challenges for obtaining representative measures of T1 for process optimization and device screening. These T1 fluctuations are often attributed to time varying coupling of the qubit to defects, putative two level systems (TLSs). In this work, we develop a technique to probe the spectral and temporal dynamics of T1 in single junction transmons by repeated T1 measurements in the frequency vicinity of the bare qubit transition, via the AC-Stark effect. Across 10 qubits, we observe strong correlations between the mean T1 averaged over approximately nine months and a snapshot of an equally weighted T1 average over the Stark shifted frequency range. These observations are suggestive of an ergodic-like spectral diffusion of TLSs dominating T1, and offer a promising path to more rapid T1 characterization for device screening and process optimization. △ Less

Submitted 28 June, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

arXiv:2105.14031 [pdf, other]

doi 10.3847/1538-3881/ac0530

Where Do Obscured AGN Fit in A Galaxy's Timeline?

Authors: Cassandra Hatcher, Allison Kirkpatrick, Francesca Fornasini, Francesca Civano, Erini Lambrides, Dale Kocesvski, Christopher M. Carroll, Mauro Giavalisco, Ryan Hickox, Zhiyuan Ji

Abstract: Many X-ray bright active galactic nuclei (AGN) are predicted to follow an extended stage of obscured black hole growth. In support of this picture we examine the X-ray undetected AGNs in the COSMOS field and compare their host galaxies with X-ray bright AGNs. We examine galaxies with M_\ast>10^{9.5}M_\odot for the presence of AGNs at redshifts $z=0.5-3$. We select AGNs in the infrared using \texti… ▽ More Many X-ray bright active galactic nuclei (AGN) are predicted to follow an extended stage of obscured black hole growth. In support of this picture we examine the X-ray undetected AGNs in the COSMOS field and compare their host galaxies with X-ray bright AGNs. We examine galaxies with M_\ast>10^{9.5}M_\odot for the presence of AGNs at redshifts $z=0.5-3$. We select AGNs in the infrared using \textit{Spitzer} and \textit{Herschel} detections and use color selection techniques to select AGNs within strongly star forming hosts. We stack \textit{Chandra} X-ray data of galaxies with an IR detection but lacking an X-ray detection to obtain soft and hard fluxes, allowing us to measure the energetics of these AGNs. We find a clear correlation between X-ray luminosity and IR AGN luminosity in the stacked galaxies. We also find that X-ray undetected AGNs all lie on the main sequence -- the tight correlation between SFR and $M_\ast$ that holds for the majority of galaxies, regardless of mass or redshift. This work demonstrates that there is a higher population of obscured AGNs than previously thought. △ Less

Submitted 28 May, 2021; originally announced May 2021.

arXiv:2105.12074 [pdf, other]

doi 10.1103/PhysRevApplied.16.054037

The impact of stochastic incorporation on atomic-precision Si:P arrays

Authors: Jeffrey A. Ivie, Quinn Campbell, Justin C. Koepke, Mitchell I. Brickson, Peter A. Schultz, Richard P. Muller, Andrew M. Mounce, Daniel R. Ward, Malcom S. Carroll, Ezra Bussmann, Andrew D. Baczewski, Shashank Misra

Abstract: Scanning tunneling microscope lithography can be used to create nanoelectronic devices in which dopant atoms are precisely positioned in a Si lattice within $\sim$1 nm of a target position. This exquisite precision is promising for realizing various quantum technologies. However, a potentially impactful form of disorder is due to incorporation kinetics, in which the number of P atoms that incorpor… ▽ More Scanning tunneling microscope lithography can be used to create nanoelectronic devices in which dopant atoms are precisely positioned in a Si lattice within $\sim$1 nm of a target position. This exquisite precision is promising for realizing various quantum technologies. However, a potentially impactful form of disorder is due to incorporation kinetics, in which the number of P atoms that incorporate into a single lithographic window is manifestly uncertain. We present experimental results indicating that the likelihood of incorporating into an ideally written three-dimer single-donor window is $63 \pm 10\%$ for room-temperature dosing, and corroborate these results with a model for the incorporation kinetics. Nevertheless, further analysis of this model suggests conditions that might raise the incorporation rate to near-deterministic levels. We simulate bias spectroscopy on a chain of comparable dimensions to the array in our yield study, indicating that such an experiment may help confirm the inferred incorporation rate. △ Less

Submitted 25 May, 2021; originally announced May 2021.

Comments: 20 pages, 13 figures

Journal ref: Phys. Rev. Applied 16, 054037 (2021)

arXiv:2105.10754 [pdf]

doi 10.1109/GEM.2019.8811554

Effects of VR Gaming and Game Genre on Player Experience

Authors: Michael Carroll, Ethan Osborne, Caglar Yildirim

Abstract: With the increasing availability of modern virtual reality (VR) headsets, the use and applications of VR technology for gaming purposes have become more pervasive than ever. Despite the growing popularity of VR gaming, user studies into how it might affect the player experience (PX) during the gameplay are scarce. Accordingly, the current study investigated the effects of VR gaming and game genre… ▽ More With the increasing availability of modern virtual reality (VR) headsets, the use and applications of VR technology for gaming purposes have become more pervasive than ever. Despite the growing popularity of VR gaming, user studies into how it might affect the player experience (PX) during the gameplay are scarce. Accordingly, the current study investigated the effects of VR gaming and game genre on PX. We compared PX metrics for two game genres, strategy (more interactive) and racing (less interactive), across two gaming platforms, VR and traditional desktop gaming. Participants were randomly assigned to one of the gaming platforms, played both a strategy and racing game on their corresponding platform, and provided PX ratings. Results revealed that, regardless of the game genre, participants in the VR gaming condition experienced a greater level of sense of presence than did those in the desktop gaming condition. That said, results showed that the two gaming platforms did not significantly differ from one another in PX ratings. As for the effect of game genre, participants provided greater PX ratings for the strategy game than for the racing game, regardless of whether the game was played on a VR headset or desktop computer. Collectively, these results indicate that although VR gaming affords a greater sense of presence in the game environment, this increase in presence does not seem to translate into a more satisfactory PX when playing either a strategy or racing game. △ Less

Submitted 22 May, 2021; originally announced May 2021.

Comments: 2019 IEEE Games, Entertainment, Media Conference (GEM)

arXiv:2103.13921 [pdf, ps, other]

The Resh Programming Language for Multirobot Orchestration

Authors: Martin Carroll, Kedar S. Namjoshi, Itai Segall

Abstract: This paper describes Resh, a new, statically typed, interpreted programming language and associated runtime for orchestrating multirobot systems. The main features of Resh are: (1) It offloads much of the tedious work of programming such systems away from the programmer and into the language runtime; (2) It is based on a small set of temporal and locational operators; and (3) It is not restricted… ▽ More This paper describes Resh, a new, statically typed, interpreted programming language and associated runtime for orchestrating multirobot systems. The main features of Resh are: (1) It offloads much of the tedious work of programming such systems away from the programmer and into the language runtime; (2) It is based on a small set of temporal and locational operators; and (3) It is not restricted to specific robot types or tasks. The Resh runtime consists of three engines that collaborate to run a Resh program using the available robots in their current environment. This paper describes both Resh and its runtime and gives examples of its use. △ Less

Submitted 25 March, 2021; originally announced March 2021.

Comments: Accepted for publication at ICRA'21

Showing 1–50 of 222 results for author: Carroll, M