-
Paying to Do Better: Games with Payments between Learning Agents
Authors:
Yoav Kolumbus,
Joe Halpern,
Éva Tardos
Abstract:
In repeated games, such as auctions, players typically use learning algorithms to choose their actions. The use of such autonomous learning agents has become widespread on online platforms. In this paper, we explore the impact of players incorporating monetary transfers into their agents' algorithms, aiming to incentivize behavior in their favor. Our focus is on understanding when players have inc…
▽ More
In repeated games, such as auctions, players typically use learning algorithms to choose their actions. The use of such autonomous learning agents has become widespread on online platforms. In this paper, we explore the impact of players incorporating monetary transfers into their agents' algorithms, aiming to incentivize behavior in their favor. Our focus is on understanding when players have incentives to make use of monetary transfers, how these payments affect learning dynamics, and what the implications are for welfare and its distribution among the players. We propose a simple game-theoretic model to capture such scenarios. Our results on general games show that in a broad class of games, players benefit from letting their learning agents make payments to other learners during the game dynamics, and that in many cases, this kind of behavior improves welfare for all players. Our results on first- and second-price auctions show that in equilibria of the ``payment policy game,'' the agents' dynamics can reach strong collusive outcomes with low revenue for the auctioneer. These results highlight a challenge for mechanism design in systems where automated learning agents can benefit from interacting with their peers outside the boundaries of the mechanism.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Intervention and Conditioning in Causal Bayesian Networks
Authors:
Sainyam Galhotra,
Joseph Y. Halpern
Abstract:
Causal models are crucial for understanding complex systems and identifying causal relationships among variables. Even though causal models are extremely popular, conditional probability calculation of formulas involving interventions pose significant challenges. In case of Causal Bayesian Networks (CBNs), Pearl assumes autonomy of mechanisms that determine interventions to calculate a range of pr…
▽ More
Causal models are crucial for understanding complex systems and identifying causal relationships among variables. Even though causal models are extremely popular, conditional probability calculation of formulas involving interventions pose significant challenges. In case of Causal Bayesian Networks (CBNs), Pearl assumes autonomy of mechanisms that determine interventions to calculate a range of probabilities. We show that by making simple yet often realistic independence assumptions, it is possible to uniquely estimate the probability of an interventional formula (including the well-studied notions of probability of sufficiency and necessity). We discuss when these assumptions are appropriate. Importantly, in many cases of interest, when the assumptions are appropriate, these probability estimates can be evaluated using observational data, which carries immense significance in scenarios where conducting experiments is impractical or unfeasible.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems
Authors:
David "davidad" Dalrymple,
Joar Skalse,
Yoshua Bengio,
Stuart Russell,
Max Tegmark,
Sanjit Seshia,
Steve Omohundro,
Christian Szegedy,
Ben Goldhaber,
Nora Ammann,
Alessandro Abate,
Joe Halpern,
Clark Barrett,
Ding Zhao,
Tan Zhi-Xuan,
Jeannette Wing,
Joshua Tenenbaum
Abstract:
Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro…
▽ More
Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these approaches is that they aim to produce AI systems which are equipped with high-assurance quantitative safety guarantees. This is achieved by the interplay of three core components: a world model (which provides a mathematical description of how the AI system affects the outside world), a safety specification (which is a mathematical description of what effects are acceptable), and a verifier (which provides an auditable proof certificate that the AI satisfies the safety specification relative to the world model). We outline a number of approaches for creating each of these three core components, describe the main technical challenges, and suggest a number of potential solutions to them. We also argue for the necessity of this approach to AI safety, and for the inadequacy of the main alternative approaches.
△ Less
Submitted 17 May, 2024; v1 submitted 10 May, 2024;
originally announced May 2024.
-
X-ray measurement of a high-mass white dwarf and its spin for the intermediate polar IGR J18434-0508
Authors:
Julian Gerber,
Jeremy Hare,
John A. Tomsick,
Benjamin M. Coughenour,
Aarran W. Shaw,
Maïca Clavel,
Francesca Fornasini,
Jules Halpern,
Alyson Joens,
Roman Krivonos,
Koji Mukai
Abstract:
IGR J18434-0508 is a Galactic Intermediate Polar (IP) type Cataclysmic Variable (CV) previously classified through optical spectroscopy. The source is already known to have a hard Chandra spectrum. In this paper, we have used follow-up XMM-Newton and NuSTAR observations to measure the white dwarf (WD) mass and spin period. We measure a spin period of P = 304.4 +/- 0.3 s based on the combined MOS1,…
▽ More
IGR J18434-0508 is a Galactic Intermediate Polar (IP) type Cataclysmic Variable (CV) previously classified through optical spectroscopy. The source is already known to have a hard Chandra spectrum. In this paper, we have used follow-up XMM-Newton and NuSTAR observations to measure the white dwarf (WD) mass and spin period. We measure a spin period of P = 304.4 +/- 0.3 s based on the combined MOS1, MOS2, and pn light curve. Although this is twice the optical period found previously, we interpret this value to be the true spin period of the WD. The source has an 8 +/- 2% pulsed fraction in the 0.5-10 keV XMM-Newton data and shows strong dips in the soft energy band (0.5-2 keV). The XMM-Newton and NuSTAR joint spectrum is consistent with a thermal bremsstrahlung continuum model with an additional partial covering factor, reflection, and Fe line Gaussian components. Furthermore, we fit the joint spectrum with the post-shock region "ipolar" model which indicates a high WD mass $>$ $\sim$ 1.36 Msun, approaching the Chandrasekhar limit.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Mathematical Explanations
Authors:
Joseph Y. Halpern
Abstract:
A definition of what counts as an explanation of mathematical statement, and when one explanation is better than another, is given. Since all mathematical facts must be true in all causal models, and hence known by an agent, mathematical facts cannot be part of an explanation (under the standard notion of explanation). This problem is solved using impossible possible worlds.
A definition of what counts as an explanation of mathematical statement, and when one explanation is better than another, is given. Since all mathematical facts must be true in all causal models, and hence known by an agent, mathematical facts cannot be part of an explanation (under the standard notion of explanation). This problem is solved using impossible possible worlds.
△ Less
Submitted 31 December, 2023;
originally announced February 2024.
-
Resolving the Periods of the Asynchronous Polar 1RXS J083842.1$-$282723
Authors:
J. P. Halpern
Abstract:
1RXS J083842.1$-$282723 is a nearly synchronous magnetic cataclysmic variable with a simple X-ray light curve. While its orbital period was fairly well established at $P_{\rm orb}=98.4$ minutes from optical spectroscopy, indirect estimates of $P_{\rm spin}/P_{\rm orb}$ ranged from 0.90 to 0.96 because the short X-ray light curves could not determine the beat period to a factor of 2. We analyze a r…
▽ More
1RXS J083842.1$-$282723 is a nearly synchronous magnetic cataclysmic variable with a simple X-ray light curve. While its orbital period was fairly well established at $P_{\rm orb}=98.4$ minutes from optical spectroscopy, indirect estimates of $P_{\rm spin}/P_{\rm orb}$ ranged from 0.90 to 0.96 because the short X-ray light curves could not determine the beat period to a factor of 2. We analyze a recent 50 day TESS observation, and ground-based optical time-series photometry spanning 9 years, that together measure precise beat, orbit, and spin periods and enable the X-ray and optical modulations to be phase aligned. Although the X-ray light curves do not distinguish between a beat period of 16.11 or 32.22 hours, all of the optical evidence favors the longer value, with complete pole switching of accretion every half beat cycle. This would require $P_{\rm spin}/P_{\rm orb}=0.952$. Long-term optical monitoring also shows a decline in accretion rate, and a change in the beat-folded light curve. It would be useful to obtain a new X-ray/optical observation of at least 32 hours duration to examine any associated change in accretion structure, and confirm the spin and beat periods.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Explaining Image Classifiers
Authors:
Hana Chockler,
Joseph Y. Halpern
Abstract:
We focus on explaining image classifiers, taking the work of Mothilal et al. [2021] (MMTS) as our point of departure. We observe that, although MMTS claim to be using the definition of explanation proposed by Halpern [2016], they do not quite do so. Roughly speaking, Halpern's definition has a necessity clause and a sufficiency clause. MMTS replace the necessity clause by a requirement that, as we…
▽ More
We focus on explaining image classifiers, taking the work of Mothilal et al. [2021] (MMTS) as our point of departure. We observe that, although MMTS claim to be using the definition of explanation proposed by Halpern [2016], they do not quite do so. Roughly speaking, Halpern's definition has a necessity clause and a sufficiency clause. MMTS replace the necessity clause by a requirement that, as we show, implies it. Halpern's definition also allows agents to restrict the set of options considered. While these difference may seem minor, as we show, they can have a nontrivial impact on explanations. We also show that, essentially without change, Halpern's definition can handle two issues that have proved difficult for other approaches: explanations of absence (when, for example, an image classifier for tumors outputs "no tumor") and explanations of rare events (such as tumors).
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Subjective Causality
Authors:
Joseph Y. Halpern,
Evan Piermont
Abstract:
We show that it is possible to understand and identify a decision maker's subjective causal judgements by observing her preferences over interventions. Following Pearl [2000], we represent causality using causal models (also called structural equations models), where the world is described by a collection of variables, related by equations. We show that if a preference relation over interventions…
▽ More
We show that it is possible to understand and identify a decision maker's subjective causal judgements by observing her preferences over interventions. Following Pearl [2000], we represent causality using causal models (also called structural equations models), where the world is described by a collection of variables, related by equations. We show that if a preference relation over interventions satisfies certain axioms (related to standard axioms regarding counterfactuals), then we can define (i) a causal model, (ii) a probability capturing the decision-maker's uncertainty regarding the external factors in the world and (iii) a utility on outcomes such that each intervention is associated with an expected utility and such that intervention $A$ is preferred to $B$ iff the expected utility of $A$ is greater than that of $B$. In addition, we characterize when the causal model is unique. Thus, our results allow a modeler to test the hypothesis that a decision maker's preferences are consistent with some causal model and to identify causal judgements from observed behavior.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Bounding the Communication Complexity of Fault-Tolerant Common Coin Tossing
Authors:
Ivan Geffner,
Joseph Y. Halpern
Abstract:
Protocols for tossing a common coin play a key role in the vast majority of implementations of consensus. Even though the common coins in the literature are usually \emph{fair} (they have equal chance of landing heads or tails), we focus on the problem of implementing a \emph{biased} common coin such that the probability of landing heads is $p \in [0,1]$. Even though biased common coins can be imp…
▽ More
Protocols for tossing a common coin play a key role in the vast majority of implementations of consensus. Even though the common coins in the literature are usually \emph{fair} (they have equal chance of landing heads or tails), we focus on the problem of implementing a \emph{biased} common coin such that the probability of landing heads is $p \in [0,1]$. Even though biased common coins can be implemented using fair common coins, we show that this can require significant inter-party communication. In fact, we show that there is no bound on the number of messages needed to generate a common coin of bias $p$ in a way that tolerates even one malicious agent, even if we restrict $p$ to an arbitrary infinite subset of $[0,1]$ (e.g., rational numbers of the form $1/2^n$) and assume that the system is synchronous. By way of contrast, if we do not require the protocol to tolerate a faulty agent, we can do this. Thus, the cause of the message complexity is the requirement of fault tolerance.
△ Less
Submitted 24 December, 2023; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Inference for Probabilistic Dependency Graphs
Authors:
Oliver E. Richardson,
Joseph Y. Halpern,
Christopher De Sa
Abstract:
Probabilistic dependency graphs (PDGs) are a flexible class of probabilistic graphical models, subsuming Bayesian Networks and Factor Graphs. They can also capture inconsistent beliefs, and provide a way of measuring the degree of this inconsistency. We present the first tractable inference algorithm for PDGs with discrete variables, making the asymptotic complexity of PDG inference similar that o…
▽ More
Probabilistic dependency graphs (PDGs) are a flexible class of probabilistic graphical models, subsuming Bayesian Networks and Factor Graphs. They can also capture inconsistent beliefs, and provide a way of measuring the degree of this inconsistency. We present the first tractable inference algorithm for PDGs with discrete variables, making the asymptotic complexity of PDG inference similar that of the graphical models they generalize. The key components are: (1) the observation that, in many cases, the distribution a PDG specifies can be formulated as a convex optimization problem (with exponential cone constraints), (2) a construction that allows us to express these problems compactly for PDGs of boundeed treewidth, (3) contributions to the theory of PDGs that justify the construction, and (4) an appeal to interior point methods that can solve such problems in polynomial time. We verify the correctness and complexity of our approach, and provide an implementation of it. We then evaluate our implementation, and demonstrate that it outperforms baseline approaches. Our code is available at http://github.com/orichardson/pdg-infer-uai.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Communication games, sequential equilibrium, and mediators
Authors:
Ivan Geffner,
Joseph Y. Halpern
Abstract:
We consider $k$-resilient sequential equilibria, strategy profiles where no player in a coalition of at most $k$ players believes that it can increase its utility by deviating, regardless of its local state. We prove that all $k$-resilient sequential equilibria that can be implemented with a trusted mediator can also be implemented without the mediator in a synchronous system of $n$ players if…
▽ More
We consider $k$-resilient sequential equilibria, strategy profiles where no player in a coalition of at most $k$ players believes that it can increase its utility by deviating, regardless of its local state. We prove that all $k$-resilient sequential equilibria that can be implemented with a trusted mediator can also be implemented without the mediator in a synchronous system of $n$ players if $n >3k$. In asynchronous systems, where there is no global notion of time and messages may take arbitrarily long to get to their recipient, we prove that a $k$-resilient sequential equilibrium with a mediator can be implemented without the mediator if $n > 4k$. These results match the lower bounds given by Abraham, Dolev, and Halpern (2008) and Geffner and Halpern (2023) for implementing a Nash equilibrium without a mediator (which are easily seen to apply to implementing a sequential equilibrium) and improve the results of Gerardi, who showed that, in the case that $k=1$, a sequential equilibrium can be implemented in synchronous systems if $n \ge 5$.
△ Less
Submitted 9 January, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Chunking Tasks for Present-Biased Agents
Authors:
Joe Halpern,
Aditya Saraf
Abstract:
Everyone puts things off sometimes. How can we combat this tendency to procrastinate? A well-known technique used by instructors is to break up a large project into more manageable chunks. But how should this be done best? Here we study the process of chunking using the graph-theoretic model of present bias introduced by Kleinberg and Oren (2014). We first analyze how to optimally chunk single edg…
▽ More
Everyone puts things off sometimes. How can we combat this tendency to procrastinate? A well-known technique used by instructors is to break up a large project into more manageable chunks. But how should this be done best? Here we study the process of chunking using the graph-theoretic model of present bias introduced by Kleinberg and Oren (2014). We first analyze how to optimally chunk single edges within a task graph, given a limited number of chunks. We show that for edges on the shortest path, the optimal chunking makes initial chunks easy and later chunks progressively harder. For edges not on the shortest path, optimal chunking is significantly more complex, but we provide an efficient algorithm that chunks the edge optimally. We then use our optimal edge-chunking algorithm to optimally chunk task graphs. We show that with a linear number of chunks on each edge, the biased agent's cost can be exponentially lowered, to within a constant factor of the true cheapest path. Finally, we extend our model to the case where a task designer must chunk a graph for multiple types of agents simultaneously. The problem grows significantly more complex with even two types of agents, but we provide optimal graph chunking algorithms for two types. Our work highlights the efficacy of chunking as a means to combat present bias.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Colordag: An Incentive-Compatible Blockchain
Authors:
Ittai Abraham,
Danny Dolev,
Ittay Eyal,
Joseph Y. Halpern
Abstract:
We present Colordag, a blockchain protocol where following the prescribed strategy is, with high probability, a best response as long as all miners have less than 1/2 of the mining power. We prove the correctness of Colordag even if there is an extremely powerful adversary who knows future actions of the scheduler: specifically, when agents will generate blocks and when messages will arrive. The s…
▽ More
We present Colordag, a blockchain protocol where following the prescribed strategy is, with high probability, a best response as long as all miners have less than 1/2 of the mining power. We prove the correctness of Colordag even if there is an extremely powerful adversary who knows future actions of the scheduler: specifically, when agents will generate blocks and when messages will arrive. The state-of-the-art protocol, Fruitchain, is an epsilon-Nash equilibrium as long as all miners have less than 1/2 of the mining power. However, there is a simple deviation that guarantees that deviators are never worse off than they would be by following Fruitchain, and can sometimes do better. Thus, agents are motivated to deviate. Colordag implements a solution concept that we call epsilon-sure Nash equilibrium and does not suffer from this problem. Because it is an epsilon-sure Nash equilibrium, Colordag is an epsilon Nash equilibrium and with probability (1 - epsilon) is a best response.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
A Surprising Periodicity Detected During a Super-outburst of V844 Herculis by TESS
Authors:
A. Greiveldinger,
P. Garnavich,
C. Littlefield,
M. R. Kennedy,
J. P. Halpern,
J. R. Thorstensen,
P. Szkody,
A. Oksanen,
R. S. Boyle
Abstract:
We identify a previously undetected periodicity at a frequency of 49.08$\pm$0.01 d$^{-1}$ (period of 29.34$\pm$0.01 minutes) during a super-outburst of V844 Her observed by TESS. V844 Her is an SU UMa type cataclysmic variable with an orbital period of 78.69 minutes, near the period minimum. The frequency of this new signal is constant in contrast to the superhump oscillations commonly seen in SU…
▽ More
We identify a previously undetected periodicity at a frequency of 49.08$\pm$0.01 d$^{-1}$ (period of 29.34$\pm$0.01 minutes) during a super-outburst of V844 Her observed by TESS. V844 Her is an SU UMa type cataclysmic variable with an orbital period of 78.69 minutes, near the period minimum. The frequency of this new signal is constant in contrast to the superhump oscillations commonly seen in SU UMa outbursts. We searched without success for oscillations during quiescence using MDM, TESS, and XMM-Newton data. The lack of a periodic signal in the XMM light curve and the relatively low X-ray luminosity of V844 Her suggests that it is not a typical IP. We consider the possibility that the 29 min signal is the result of super-Nyquist sampling of a Dwarf Nova Oscillation with a period near the 2-minute cadence of the TESS data. Our analysis of archival AAVSO photometry from a 2006 super-outburst supports the existence of a 29 min oscillation, although a published study of an earlier superoutburst did not detect the signal. We compare the X-ray properties of V844 Her with short orbital period intermediate polars (IP), V1025 Cen and DW Cnc. We conclude that the new signal is a real photometric oscillation coming from the V844 Her system and that it is unlikely to be an aliased high-frequency oscillation. The steady frequency of the new signal suggests that its origin is related to an asynchronously rotating white dwarf in V844 Her, although the precise mechanism producing the flux variations remains unclear.
△ Less
Submitted 20 August, 2023;
originally announced August 2023.
-
Strategic Play By Resource-Bounded Agents in Security Games
Authors:
Xinming Liu,
Joseph Y. Halpern
Abstract:
Many studies have shown that humans are "predictably irrational": they do not act in a fully rational way, but their deviations from rational behavior are quite systematic. Our goal is to see the extent to which we can explain and justify these deviations as the outcome of rational but resource-bounded agents doing as well as they can, given their limitations. We focus on the well-studied ranger-p…
▽ More
Many studies have shown that humans are "predictably irrational": they do not act in a fully rational way, but their deviations from rational behavior are quite systematic. Our goal is to see the extent to which we can explain and justify these deviations as the outcome of rational but resource-bounded agents doing as well as they can, given their limitations. We focus on the well-studied ranger-poacher game, where rangers are trying to protect a number of sites from poaching. We capture the computational limitations by modeling the poacher and the ranger as probabilistic finite automata (PFAs). We show that, with sufficiently large memory, PFAs learn to play the Nash equilibrium (NE) strategies of the game and achieve the NE utility. However, if we restrict the memory, we get more "human-like" behaviors, such as probability matching (i.e., visiting sites in proportion to the probability of a rhino being there), and avoiding sites where there was a bad outcome (e.g., the poacher was caught by the ranger), that we also observed in experiments conducted on Amazon Mechanical Turk. Interestingly, we find that adding human-like behaviors such as probability matching and overweighting significant events (like getting caught) actually improves performance, showing that this seemingly irrational behavior can be quite rational.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Sequential Language-based Decisions
Authors:
Adam Bjorndahl,
Joseph Y. Halpern
Abstract:
In earlier work, we introduced the framework of language-based decisions, the core idea of which was to modify Savage's classical decision-theoretic framework by taking actions to be descriptions in some language, rather than functions from states to outcomes, as they are defined classically. Actions had the form "if psi then do(phi)", where psi and phi were formulas in some underlying language,…
▽ More
In earlier work, we introduced the framework of language-based decisions, the core idea of which was to modify Savage's classical decision-theoretic framework by taking actions to be descriptions in some language, rather than functions from states to outcomes, as they are defined classically. Actions had the form "if psi then do(phi)", where psi and phi were formulas in some underlying language, specifying what effects would be brought about under what circumstances. The earlier work allowed only one-step actions. But, in practice, plans are typically composed of a sequence of steps. Here, we extend the earlier framework to sequential actions, making it much more broadly applicable. Our technical contribution is a representation theorem in the classical spirit: agents whose preferences over actions satisfy certain constraints can be modeled as if they are expected utility maximizers. As in the earlier work, due to the language-based specification of the actions, the representation theorem requires a construction not only of the probability and utility functions representing the agent's beliefs and preferences, but also the state and outcomes spaces over which these are defined, as well as a "selection function" which intuitively captures how agents disambiguate coarse descriptions. The (unbounded) depth of action sequencing adds substantial interest (and complexity!) to the proof.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Green Bank Telescope Discovery of the Redback Binary Millisecond Pulsar PSR J0212+5321
Authors:
Karen I. Perez,
Slavko Bogdanov,
Jules P. Halpern,
Vishal Gajjar
Abstract:
We report the discovery of a 2.11 ms binary millisecond pulsar during a targeted search of the redback optical candidate coincident with the $γ$-ray source 3FGL J0212.5+5320 using the Robert C. Byrd Green Bank Telescope (GBT) with the Breakthrough Listen backend at L-band. Over a seven month period, five pointings were made near inferior conjunction of the pulsar in its 20.9 hr orbit, resulting in…
▽ More
We report the discovery of a 2.11 ms binary millisecond pulsar during a targeted search of the redback optical candidate coincident with the $γ$-ray source 3FGL J0212.5+5320 using the Robert C. Byrd Green Bank Telescope (GBT) with the Breakthrough Listen backend at L-band. Over a seven month period, five pointings were made near inferior conjunction of the pulsar in its 20.9 hr orbit, resulting in two detections, lasting 12 and 42 minutes. The pulsar dispersion measure (DM) of 25.7 pc cm$^{-3}$ corresponds to a distance of 1.15 kpc in the NE2001 Galactic electron density model, consistent with the Gaia parallax distance of $1.16\pm0.03$ kpc for the companion star. We suspect the pulsar experiences wide-orbit eclipses, similar to other redbacks, as well as scintillation and DM delays caused by its interaction with its companion and surroundings. Although the pulsar was only detected over $\approx3.7\%$ of the orbit, its measured acceleration is consistent with published binary parameters from optical radial velocity spectroscopy and light-curve modeling of the companion star, and it provides a more precise mass ratio and a projected semi-major axis for the pulsar orbit. We also obtained a refined optical photometric orbit ephemeris, and observed variability of the tidally distorted companion over 7 years. A hard X-ray light curve from NuSTAR shows expected orbit-modulated emission from the intrabinary shock. The pulsar parameters and photometric ephemeris greatly restrict the parameter space required to search for a coherent timing solution including pulsar spin-down rate, either using Fermi $γ$-rays, or further radio pulse detections.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Optimal Eventual Byzantine Agreement Protocols with Omission Failures
Authors:
Kaya Alpturer,
Joseph Y. Halpern,
Ron van der Meyden
Abstract:
Work on \emph{optimal} protocols for \emph{Eventual Byzantine Agreement} (EBA) -- protocols that, in a precise sense, decide as soon as possible in every run and guarantee that all nonfaulty agents decide on the same value -- has focused on emph{full-information protocols} (FIPs), where agents repeatedly send messages that completely describe their past observations to every other agent. While it…
▽ More
Work on \emph{optimal} protocols for \emph{Eventual Byzantine Agreement} (EBA) -- protocols that, in a precise sense, decide as soon as possible in every run and guarantee that all nonfaulty agents decide on the same value -- has focused on emph{full-information protocols} (FIPs), where agents repeatedly send messages that completely describe their past observations to every other agent. While it can be shown that, without loss of generality, we can take an optimal protocol to be an FIP, full information exchange is impractical to implement for many applications due to the required message size. We separate protocols into two parts, the \emph{information-exchange protocol} and the \emph{action protocol}, so as to be able to examine the effects of more limited information exchange. We then define a notion of optimality with respect to an information-exchange protocol. Roughly speaking, an action protocol $P$ is optimal with respect to an information-exchange protocol $\mathcal{E}$ if, with $P$, agents decide as soon as possible among action protocols that exchange information according to $\mathcal{E}$. We present a knowledge-based EBA program for omission failures all of whose implementations are guaranteed to be correct and are optimal if the information exchange satisfies a certain safety condition. We then construct concrete programs that implement this knowledge-based program in two settings of interest that are shown to satisfy the safety condition. Finally, we show that a small modification of our program results in an FIP that is both optimal and efficiently implementable, settling an open problem posed by Halpern, Moses, and Waarts (SIAM J. Comput., 2001).
△ Less
Submitted 10 May, 2023;
originally announced May 2023.
-
Joint Behavior and Common Belief
Authors:
Meir Friedenberg,
Joseph Y. Halpern
Abstract:
For over 25 years, common belief has been widely viewed as necessary for joint behavior. But this is not quite correct. We show by example that what can naturally be thought of as joint behavior can occur without common belief. We then present two variants of common belief that can lead to joint behavior, even without standard common belief ever being achieved, and show that one of them, action-st…
▽ More
For over 25 years, common belief has been widely viewed as necessary for joint behavior. But this is not quite correct. We show by example that what can naturally be thought of as joint behavior can occur without common belief. We then present two variants of common belief that can lead to joint behavior, even without standard common belief ever being achieved, and show that one of them, action-stamped common belief, is in a sense necessary and sufficient for joint behavior. These observations are significant because, as is well known, common belief is quite difficult to achieve in practice, whereas these variants are more easily achievable.
△ Less
Submitted 11 July, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Do Central Compact Objects have Carbon Atmospheres?
Authors:
J. A. J. Alford,
J. P. Halpern
Abstract:
Only three of the dozen central compact objects (CCOs) in supernova remnants (SNRs) show thermal X-ray pulsations due to non-uniform surface temperature (hot-spots). The absence of X-ray pulsations from several unpulsed CCOs has motivated suggestions that they have uniform-temperature carbon atmospheres (UTCAs), which adequately fit their spectra with appropriate neutron star (NS) surface areas. T…
▽ More
Only three of the dozen central compact objects (CCOs) in supernova remnants (SNRs) show thermal X-ray pulsations due to non-uniform surface temperature (hot-spots). The absence of X-ray pulsations from several unpulsed CCOs has motivated suggestions that they have uniform-temperature carbon atmospheres (UTCAs), which adequately fit their spectra with appropriate neutron star (NS) surface areas. This is in contrast to the two-temperature blackbody or hydrogen atmospheres that also fit well. Here we investigate the applicability of UTCAs to CCOs. We show the following: (i) The phase-averaged spectra of the three pulsed CCOs can also be fitted with a UTCA of the appropriate NS area, despite pulsed CCOs manifestly having non-uniform surface temperature. A good spectral fit is therefore not strong support for the UTCA model of unpulsed CCOs. (ii) An improved spectrum of one unpulsed CCO, previously analyzed with a UTCA, does not allow an acceptable fit. (iii) For two unpulsed CCOs, the UTCA does not allow a distance compatible with the SNR distance. These results imply that, in general, CCOs must have hot, localized regions on the NS surface. We derive new X-ray pulse modulation upper limits on the unpulsed CCOs, and constrain their hot spot sizes and locations. We develop an alternative model that accounts for both the pulsed and unpulsed CCOs: a range of angles between hot spot and rotation axes consistent with an exponential distribution with scale factor $λ\sim 20^{\circ}$. We discuss physical mechanisms that could produce such small angles and small hot-spots.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
Causal Models with Constraints
Authors:
Sander Beckers,
Joseph Y. Halpern,
Christopher Hitchcock
Abstract:
Causal models have proven extremely useful in offering formal representations of causal relationships between a set of variables. Yet in many situations, there are non-causal relationships among variables. For example, we may want variables $LDL$, $HDL$, and $TOT$ that represent the level of low-density lipoprotein cholesterol, the level of lipoprotein high-density lipoprotein cholesterol, and tot…
▽ More
Causal models have proven extremely useful in offering formal representations of causal relationships between a set of variables. Yet in many situations, there are non-causal relationships among variables. For example, we may want variables $LDL$, $HDL$, and $TOT$ that represent the level of low-density lipoprotein cholesterol, the level of lipoprotein high-density lipoprotein cholesterol, and total cholesterol level, with the relation $LDL+HDL=TOT$. This cannot be done in standard causal models, because we can intervene simultaneously on all three variables. The goal of this paper is to extend standard causal models to allow for constraints on settings of variables. Although the extension is relatively straightforward, to make it useful we have to define a new intervention operation that $disconnects$ a variable from a causal equation. We give examples showing the usefulness of this extension, and provide a sound and complete axiomatization for causal models with constraints.
△ Less
Submitted 17 January, 2023;
originally announced January 2023.
-
A Causal Analysis of Harm
Authors:
Sander Beckers,
Hana Chockler,
Joseph Y. Halpern
Abstract:
As autonomous systems rapidly become ubiquitous, there is a growing need for a legal and regulatory framework to address when and how such a system harms someone. There have been several attempts within the philosophy literature to define harm, but none of them has proven capable of dealing with with the many examples that have been presented, leading some to suggest that the notion of harm should…
▽ More
As autonomous systems rapidly become ubiquitous, there is a growing need for a legal and regulatory framework to address when and how such a system harms someone. There have been several attempts within the philosophy literature to define harm, but none of them has proven capable of dealing with with the many examples that have been presented, leading some to suggest that the notion of harm should be abandoned and "replaced by more well-behaved notions". As harm is generally something that is caused, most of these definitions have involved causality at some level. Yet surprisingly, none of them makes use of causal models and the definitions of actual causality that they can express. In this paper we formally define a qualitative notion of harm that uses causal models and is based on a well-known definition of actual causality (Halpern, 2016). The key novelty of our definition is that it is based on contrastive causation and uses a default utility to which the utility of actual outcomes is compared. We show that our definition is able to handle the examples from the literature, and illustrate its importance for reasoning about situations involving autonomous systems.
△ Less
Submitted 19 January, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Quantifying Harm
Authors:
Sander Beckers,
Hana Chockler,
Joseph Y. Halpern
Abstract:
In a companion paper (Beckers et al. 2022), we defined a qualitative notion of harm: either harm is caused, or it is not. For practical applications, we often need to quantify harm; for example, we may want to choose the lest harmful of a set of possible interventions. We first present a quantitative definition of harm in a deterministic context involving a single individual, then we consider the…
▽ More
In a companion paper (Beckers et al. 2022), we defined a qualitative notion of harm: either harm is caused, or it is not. For practical applications, we often need to quantify harm; for example, we may want to choose the lest harmful of a set of possible interventions. We first present a quantitative definition of harm in a deterministic context involving a single individual, then we consider the issues involved in dealing with uncertainty regarding the context and going from a notion of harm for a single individual to a notion of "societal harm", which involves aggregating the harm to individuals. We show that the "obvious" way of doing this (just taking the expected harm for an individual and then summing the expected harm over all individuals can lead to counterintuitive or inappropriate answers, and discuss alternatives, drawing on work from the decision-theory literature.
△ Less
Submitted 6 October, 2022; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Luminous Optical and X-ray Flaring of the Putative Redback Millisecond Pulsar 1FGL J0523.5$-$2529
Authors:
Jules P. Halpern,
Karen I. Perez,
Slavko Bogdanov
Abstract:
Several redback and black widow millisecond pulsar binaries have episodes of flaring in X-rays and optical. We initially detected such behavior from the Fermi selected redback candidate 1FGL J0523.5$-$2529 during optical time-series monitoring. Triggered observations with the Neil Gehrels Swift Observatory over the next $\approx100$ days showed episodic flaring in X-rays with luminosity up to…
▽ More
Several redback and black widow millisecond pulsar binaries have episodes of flaring in X-rays and optical. We initially detected such behavior from the Fermi selected redback candidate 1FGL J0523.5$-$2529 during optical time-series monitoring. Triggered observations with the Neil Gehrels Swift Observatory over the next $\approx100$ days showed episodic flaring in X-rays with luminosity up to $8\times10^{33}$ erg s$^{-1}$ ($\sim100$ times the minimum), and a comparable luminosity in the optical/UV, with similar power-law spectra of $f_ν\proptoν^{-0.7}$. These are the most luminous flares seen in any non-accreting "spider" pulsar system, which may be related to the large size of the companion through the fraction of the pulsar wind that it or its ablated wind intercepts. Simultaneously with an optical flare, we see Balmer-line and He I emission, not previously known in this object, which is evidence of a stellar wind that may also inhibit detection of radio pulsations. The quiescent optical light curves, while dominated by ellipsoidal modulation, show evidence of variable non-uniform temperature that could be due either to large starspots or asymmetric heating of the companion by the pulsar. This may explain a previous measurement of unusual non-zero orbital eccentricity as, alternatively, distortion of the radial-velocity curve by the surface temperature distribution of the large companion.
△ Less
Submitted 17 July, 2022;
originally announced July 2022.
-
Swift J0503.7-2819: A Short-Period Asynchronous Polar or Stream-Fed Intermediate Polar
Authors:
J. P. Halpern
Abstract:
We analyze a 7.4 hr XMM-Newton light curve of the cataclysmic variable Swift J0503.7-2819, previously classified using optical periods as an intermediate polar (IP) with an orbital period of 0.0567 days. A photometric signal at 975 s, previously suggested to be the spin period, is not present in X-rays and is readily understood as a quasi-periodic oscillation. The X-ray light curve instead shows c…
▽ More
We analyze a 7.4 hr XMM-Newton light curve of the cataclysmic variable Swift J0503.7-2819, previously classified using optical periods as an intermediate polar (IP) with an orbital period of 0.0567 days. A photometric signal at 975 s, previously suggested to be the spin period, is not present in X-rays and is readily understood as a quasi-periodic oscillation. The X-ray light curve instead shows clear behavior of a highly asynchronous polar (AP) or stream-fed IP. It can be described by either of two scenarios: one which switches between one-pole and two-pole accretion, and another in which accretion alternates fully between two poles. The spin periods in these two models are 0.0455 days and 0.0505 days, respectively. The spin frequency $ω$ is thus either 24% faster or 12% faster than the orbital frequency $Ω$, and the corresponding beat period between spin and orbit is 0.231 days or 0.462 days. Brief absorption events seen in light curve are spaced in a way that may favor the longer spin and beat periods. These periods are confirmed and refined using data from the Transiting Exoplanet Survey Satellite (TESS) and the Asteroid Terrestrial-impact Last Alert System (ATLAS). The short beat cycle of Swift J0503.7-2819 makes it well-suited to resolving this common dilemma, which amounts to deciding whether the main signal in the power spectrum is $ω$ or $2ω-Ω$.
△ Less
Submitted 29 June, 2022;
originally announced June 2022.
-
From Outcome-Based to Language-Based Preferences
Authors:
Valerio Capraro,
Joseph Y. Halpern,
Matjaz Perc
Abstract:
We review the literature on models that try to explain human behavior in social interactions described by normal-form games with monetary payoffs. We start by covering social and moral preferences. We then focus on the growing body of research showing that people react to the language in which actions are described, especially when it activates moral concerns. We conclude by arguing that behaviora…
▽ More
We review the literature on models that try to explain human behavior in social interactions described by normal-form games with monetary payoffs. We start by covering social and moral preferences. We then focus on the growing body of research showing that people react to the language in which actions are described, especially when it activates moral concerns. We conclude by arguing that behavioral economics is in the midst of a paradigm shift towards language-based preferences, which will require an exploration of new models and experimental setups.
△ Less
Submitted 15 June, 2022;
originally announced June 2022.
-
Optical Light Curve of 4FGL J0935.3+0901: A Flaring Black Widow Candidate
Authors:
J. P. Halpern
Abstract:
I obtained time-series photometry of the compact binary candidate for the Fermi source 4FGL J0935.3+0901. Superposed on the 2.44 hr orbital modulation are day-to-day variations and frequent flaring as seen in several redback and black widow millisecond pulsars (MSPs). The short orbital period favors a black widow. While the modulation of $\leq 1$ mag is smaller than that of most black widows, it c…
▽ More
I obtained time-series photometry of the compact binary candidate for the Fermi source 4FGL J0935.3+0901. Superposed on the 2.44 hr orbital modulation are day-to-day variations and frequent flaring as seen in several redback and black widow millisecond pulsars (MSPs). The short orbital period favors a black widow. While the modulation of $\leq 1$ mag is smaller than that of most black widows, it could indicate a low orbital inclination. Although a published optical spectrum shows strong emission lines, the light curve evinces pulsar heating of the companion star rather than accretion-disk emission of a transitional MSP. Emission lines and flaring occur in the same objects, probably powered by shocks between the relativistic pulsar wind and a wind driven off the companion star. I also recovered the period in photometry from the Zwicky Transient Facility. A phase-connected ephemeris derived from MDM Observatory and ZTF data spanning 4 years yields a period of 0.10153276(36) days and an epoch for the ascending node of the putative pulsar.
△ Less
Submitted 17 July, 2022; v1 submitted 29 May, 2022;
originally announced May 2022.
-
Space-charge-limited current density for nonplanar diodes with monoenergetic emission using Lie-point symmetries
Authors:
N. R. Sree Harsha,
Jacob M. Halpern,
Adam M. Darr,
Allen L. Garner
Abstract:
Understanding space-charge limited current density (SCLCD) is fundamentally and practically important for characterizing many high-power and high-current vacuum devices. Despite this, no analytic equations for SCLCD with nonzero monoenergetic initial velocity have been derived for nonplanar diodes from first principles. Obtaining analytic equations for SCLCD for nonplanar geometries is often compl…
▽ More
Understanding space-charge limited current density (SCLCD) is fundamentally and practically important for characterizing many high-power and high-current vacuum devices. Despite this, no analytic equations for SCLCD with nonzero monoenergetic initial velocity have been derived for nonplanar diodes from first principles. Obtaining analytic equations for SCLCD for nonplanar geometries is often complicated by the nonlinearity of the problem and over constrained boundary conditions. In this letter, we use the canonical coordinates obtained by identifying Lie-point symmetries to linearize the governing differential equations to derive SCLCD for any orthogonal diode. Using this method, we derive exact analytic equations for SCLCD with a monoenergetic injection velocity for one-dimensional cylindrical, spherical, tip-to-tip (t-t), and tip-to-plate (t-p) diodes. We specifically demonstrate that the correction factor from zero initial velocity to monoenergetic emission depends only on the initial kinetic and electric potential energies and not on the diode geometry and that SCLCD is universal when plotted as a function of the canonical gap size. We also show that SCLCD for a t-p diode is a factor of four larger than a t-t diode independent of injection velocity. The results reduce to previously derived results for zero initial velocity using variational calculus and conformal map**.
△ Less
Submitted 3 March, 2022;
originally announced April 2022.
-
Measuring the mass of the black widow PSR J1555-2908
Authors:
M. R. Kennedy,
R. P. Breton,
C. J. Clark,
D. Mata-Sanchez,
G. Voisin,
V. S. Dhillon,
J. P. Halpern,
T. R. Marsh,
L. Nieder,
P. S. Ray,
M. H. van Kerkwijk
Abstract:
Accurate measurements of the masses of neutron stars are necessary to test binary evolution models, and to constrain the neutron star equation of state. In pulsar binaries with no measurable post-Keplerian parameters, this requires an accurate estimate of the binary system's inclination and the radial velocity of the companion star by other means than pulsar timing. In this paper, we present the r…
▽ More
Accurate measurements of the masses of neutron stars are necessary to test binary evolution models, and to constrain the neutron star equation of state. In pulsar binaries with no measurable post-Keplerian parameters, this requires an accurate estimate of the binary system's inclination and the radial velocity of the companion star by other means than pulsar timing. In this paper, we present the results of a new method for measuring this radial velocity using the binary synthesis code Icarus. This method relies on constructing a model spectrum of a tidally distorted, irradiated star as viewed for a given binary configuration. This method is applied to optical spectra of the newly discovered black widow PSR J1555-2908. By modelling the optical spectroscopy alongside optical photometry, we find that the radial velocity of the companion star is $397\pm4$ km s$^{-1}$ (errors quoted at 95\% confidence interval), as well as a binary inclination of $>75^{\rm o}$. Combined with $γ$-ray pulsation timing information, this gives a neutron star mass of 1.67$^{+0.15}_{-0.09}$ M$_\odot$ and a companion mass of 0.060$^{+0.005}_{-0.003}$ M$_\odot$, placing PSR J1555-2908 at the observed upper limit of what is considered a black widow system.
△ Less
Submitted 9 February, 2022;
originally announced February 2022.
-
Discovery, Timing, and Multiwavelength Observations of the Black Widow Millisecond Pulsar PSR J1555-2908
Authors:
Paul S. Ray,
Lars Nieder,
Colin J. Clark,
Scott M. Ransom,
H. Thankful Cromartie,
Dale A. Frail,
Kunal P. Mooley,
Huib Intema,
Preshanth Jagannathan,
Paul Demorest,
Kevin Stovall,
Jules P. Halpern,
Julia Deneva,
Sebastien Guillot,
Matthew Kerr,
Samuel J. Swihart,
Philippe Bruel,
Ben W. Stappers,
Andrew Lyne,
Mitch Mickaliger,
Fernando Camilo,
Elizabeth C. Ferrara,
Michael T. Wolff,
P. F. Michelson
Abstract:
We report the discovery of PSR J1555-2908, a 1.79 ms radio and gamma-ray pulsar in a 5.6 hr binary system with a minimum companion mass of 0.052 $M_\odot$. This fast and energetic ($\dot E = 3 \times 10^{35}$ erg/s) millisecond pulsar was first detected as a gamma-ray point source in Fermi LAT sky survey observations. Guided by a steep spectrum radio point source in the Fermi error region, we perf…
▽ More
We report the discovery of PSR J1555-2908, a 1.79 ms radio and gamma-ray pulsar in a 5.6 hr binary system with a minimum companion mass of 0.052 $M_\odot$. This fast and energetic ($\dot E = 3 \times 10^{35}$ erg/s) millisecond pulsar was first detected as a gamma-ray point source in Fermi LAT sky survey observations. Guided by a steep spectrum radio point source in the Fermi error region, we performed a search at 820 MHz with the Green Bank Telescope that first discovered the pulsations. The initial radio pulse timing observations provided enough information to seed a search for gamma-ray pulsations in the LAT data, from which we derive a timing solution valid for the full Fermi mission. In addition to the radio and gamma-ray pulsation discovery and timing, we searched for X-ray pulsations using NICER but no significant pulsations were detected. We also obtained time-series r-band photometry that indicates strong heating of the companion star by the pulsar wind. Material blown off the heated companion eclipses the 820 MHz radio pulse during inferior conjunction of the companion for ~10% of the orbit, which is twice the angle subtended by its Roche lobe in an edge-on system.
△ Less
Submitted 9 February, 2022;
originally announced February 2022.
-
The Benefits of Coarse Preferences
Authors:
Joseph Y. Halpern,
Yuval Heller,
Eyal Winter
Abstract:
We study the strategic advantages of coarsening one's utility by clustering nearby payoffs together (i.e., classifying them the same way). Our solution concept, coarse-utility equilibrium (CUE) requires that (1) each player maximizes her coarse utility, given the opponent's strategy, and (2) the classifications form best replies to one another. We characterize CUEs in various games. In particular,…
▽ More
We study the strategic advantages of coarsening one's utility by clustering nearby payoffs together (i.e., classifying them the same way). Our solution concept, coarse-utility equilibrium (CUE) requires that (1) each player maximizes her coarse utility, given the opponent's strategy, and (2) the classifications form best replies to one another. We characterize CUEs in various games. In particular, we show that there is a qualitative difference between CUEs in which only one of the players clusters payoffs, and those in which all players cluster their payoffs, and that the latter type induce players to treat co-players better than in Nash equilibria in the large class of games with monotone externalities.
△ Less
Submitted 14 June, 2023; v1 submitted 25 January, 2022;
originally announced January 2022.
-
The brain as a probabilistic transducer: an evolutionarily plausible network architecture for knowledge representation, computation, and behavior
Authors:
Joseph Y. Halpern,
Arnon Lotem
Abstract:
We offer a general theoretical framework for brain and behavior that is evolutionarily and computationally plausible. The brain in our abstract model is a network of nodes and edges. Although it has some similarities to standard neural network models, as we show, there are some significant differences. Both nodes and edges in our network have weights and activation levels. They act as probabilisti…
▽ More
We offer a general theoretical framework for brain and behavior that is evolutionarily and computationally plausible. The brain in our abstract model is a network of nodes and edges. Although it has some similarities to standard neural network models, as we show, there are some significant differences. Both nodes and edges in our network have weights and activation levels. They act as probabilistic transducers that use a set of relatively simple rules to determine how activation levels and weights are affected by input, generate output, and affect each other. We show that these simple rules enable a learning process that allows the network to represent increasingly complex knowledge, and simultaneously to act as a computing device that facilitates planning, decision-making, and the execution of behavior. By specifying the innate (genetic) components of the network, we show how evolution could endow the network with initial adaptive rules and goals that are then enriched through learning. We demonstrate how the develo** structure of the network (which determines what the brain can do and how well) is critically affected by the co-evolved coordination between the mechanisms affecting the distribution of data input and those determining the learning parameters (used in the programs run by nodes and edges). Finally, we consider how the model accounts for various findings in the field of learning and decision making, how it can address some challenging problems in mind and behavior, such as those related to setting goals and self-control, and how it can help understand some cognitive disorders.
△ Less
Submitted 11 April, 2022; v1 submitted 26 December, 2021;
originally announced December 2021.
-
Measuring the Non-Axially-Symmetric Surface Temperature Distribution of the Central Compact Object in Puppis A
Authors:
J. A. J. Alford,
E. V. Gotthelf,
R. Perna,
J. P. Halpern
Abstract:
The surface temperature distributions of central compact objects (CCOs) are powerful probes of their crustal magnetic field strengths and geometries. Here we model the surface temperature distribution of RX J0822$-$4300, the CCO in the Puppis A supernova remnant (SNR), using $471$ ks of XMM-Newton data. We compute the energy-dependent pulse profiles in sixteen energy bands, fully including the gen…
▽ More
The surface temperature distributions of central compact objects (CCOs) are powerful probes of their crustal magnetic field strengths and geometries. Here we model the surface temperature distribution of RX J0822$-$4300, the CCO in the Puppis A supernova remnant (SNR), using $471$ ks of XMM-Newton data. We compute the energy-dependent pulse profiles in sixteen energy bands, fully including the general relativistic effects of gravitational redshift and light bending, to accurately model the two heated surface regions of different temperatures and areas, in addition to constraining the viewing geometry. This results in precise measurements of the two temperatures: $kT_{\rm warm} = (1+z) \times 0.222_{-0.019}^{+0.018}$ keV and $kT_{\rm hot} = (1+z) \times 0.411\pm0.011$ keV. For the first time, we are able to measure a deviation from a pure antipodal hot-spot geometry, with a minimum value of $1.\!^{\circ}1 \pm 0.\!^{\circ}2$, and an expectation value of $9.\!^{\circ}35 \pm 0.\!^{\circ}17$ among the most probable geometries. The discovery of this asymmetry, along with the factor of $\approx2$ temperature difference between the two emitting regions, may indicate that RX J0822$-$4300 was born with a strong, tangled crustal magnetic field.
△ Less
Submitted 22 December, 2021;
originally announced December 2021.
-
Reasoning About Causal Models With Infinitely Many Variables
Authors:
Joseph Y. Halpern,
Spencer Peters
Abstract:
Generalized structural equations models (GSEMs) [Peters and Halpern 2021], are, as the name suggests, a generalization of structural equations models (SEMs). They can deal with (among other things) infinitely many variables with infinite ranges, which is critical for capturing dynamical systems. We provide a sound and complete axiomatization of causal reasoning in GSEMs that is an extension of the…
▽ More
Generalized structural equations models (GSEMs) [Peters and Halpern 2021], are, as the name suggests, a generalization of structural equations models (SEMs). They can deal with (among other things) infinitely many variables with infinite ranges, which is critical for capturing dynamical systems. We provide a sound and complete axiomatization of causal reasoning in GSEMs that is an extension of the sound and complete axiomatization provided by Halpern [2000] for SEMs. Considering GSEMs helps clarify what properties Halpern's axioms capture.
△ Less
Submitted 21 December, 2021;
originally announced December 2021.
-
Causal Modeling With Infinitely Many Variables
Authors:
Spencer Peters,
Joseph Y. Halpern
Abstract:
Structural-equations models (SEMs) are perhaps the most commonly used framework for modeling causality. However, as we show, naively extending this framework to infinitely many variables, which is necessary, for example, to model dynamical systems, runs into several problems. We introduce GSEMs (generalized SEMs), a flexible generalization of SEMs that directly specify the results of interventions…
▽ More
Structural-equations models (SEMs) are perhaps the most commonly used framework for modeling causality. However, as we show, naively extending this framework to infinitely many variables, which is necessary, for example, to model dynamical systems, runs into several problems. We introduce GSEMs (generalized SEMs), a flexible generalization of SEMs that directly specify the results of interventions, in which (1) systems of differential equations can be represented in a natural and intuitive manner, (2) certain natural situations, which cannot be represented by SEMs at all, can be represented easily, (3) the definition of actual causality in SEMs carries over essentially without change.
△ Less
Submitted 16 December, 2021;
originally announced December 2021.
-
Optical Studies of Ten Hard X-ray Selected Cataclysmic Binaries
Authors:
J. P. Halpern,
J. R. Thorstensen
Abstract:
We conducted time-resolved optical spectroscopy and/or photometry of ten cataclysmic binaries that were discovered in hard X-ray surveys, with the goal of measuring their orbital periods and searching for evidence that they are magnetic. Four of the objects in this study are new optical identifications: IGR J18017$-$3542, PBC J1841.1+0138, IGR J18434$-$0508, and Swift J1909.3+0124. A 311.8 s, cohe…
▽ More
We conducted time-resolved optical spectroscopy and/or photometry of ten cataclysmic binaries that were discovered in hard X-ray surveys, with the goal of measuring their orbital periods and searching for evidence that they are magnetic. Four of the objects in this study are new optical identifications: IGR J18017$-$3542, PBC J1841.1+0138, IGR J18434$-$0508, and Swift J1909.3+0124. A 311.8 s, coherent optical pulsation is detected from PBC J1841.1+0138, as well as eclipses with a period of 0.221909 days. A 152.49 s coherent period is detected from IGR J18434$-$0508. A probable period of 389 s is seen in IGR J18151$-$1052, in agreement with a known X-ray spin period. We also detect a period of 803.5 s in an archival X-ray observation of Swift J0717.8$-$2156. The latter four objects are thus confirmed magnetic CVs of the intermediate polar class. An optical period of 1554 s in AX J1832.3$-$0840 also confirms the known X-ray spin period, but a stronger signal at 2303 s is present whose interpretation is not obvious. We also studied the candidate intermediate polar Swift J0820.6$-$2805, which has low and high states differing by $\approx4$ mag, and optical periods or QPOs not in agreement with proposed X-ray periods. Of note is an unusually long 2.06 day orbital period for Swift J1909.3+0124, manifest in the radial velocity variation of photospheric absorption lines of an early K-type companion star. The star must be somewhat evolved if it is to fill its Roche lobe.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Multi-Wavelength Observation Campaign of the TeV Gamma-Ray Binary HESS J0632+057 with NuSTAR, VERITAS, MDM, and Swift
Authors:
Y. M. Tokayer,
H. An,
J. P. Halpern,
J. Kim,
K. Mori,
C. J. Hailey,
C. B. Adams,
W. Benbow,
A. Brill,
J. H. Buckley,
M. Capasso,
M. Errando,
A. Falcone,
K. A Farrell,
G. M Foote,
L. Fortson,
A. Furniss,
A. Gent,
C. Giuri,
D. Hanna,
T. Hassan,
O. Hervet,
J. Holder,
B. Hona,
T. B. Humensky
, et al. (31 additional authors not shown)
Abstract:
HESS J0632+057 belongs to a rare subclass of binary systems which emits gamma-rays above 100 GeV. It stands out for its distinctive high-energy light curve, which features a sharp ``primary'' peak and broader ``secondary'' peak. We present the results of contemporaneous observations by NuSTAR and VERITAS during the secondary peak between Dec. 2019 and Feb. 2020, when the orbital phase ($φ$) is bet…
▽ More
HESS J0632+057 belongs to a rare subclass of binary systems which emits gamma-rays above 100 GeV. It stands out for its distinctive high-energy light curve, which features a sharp ``primary'' peak and broader ``secondary'' peak. We present the results of contemporaneous observations by NuSTAR and VERITAS during the secondary peak between Dec. 2019 and Feb. 2020, when the orbital phase ($φ$) is between 0.55 and 0.75. NuSTAR detected X-ray spectral evolution, while VERITAS detected TeV emission. We fit a leptonic wind-collision model to the multi-wavelength spectra data obtained over the four NuSTAR and VERITAS observations, constraining the pulsar spin-down luminosity and the magnetization parameter at the shock. Despite long-term monitoring of the source from Oct. 2019 to Mar. 2020, the MDM observatory did not detect significant variation in H$α$ and H$β$ line equivalent widths, an expected signature of Be-disk interaction with the pulsar. Furthermore, fitting folded Swift-XRT light curve data with an intra-binary shock model constrained the orbital parameters, suggesting two orbital phases (at $φ_D = 0.13$ and 0.37) where the pulsar crosses the Be-disk, as well as phases for the periastron ($φ_0 = 0.30$) and inferior conjunction ($φ_{\text{IFC}} = 0.75$). The broad-band X-ray spectra with Swift-XRT and NuSTAR allowed us to measure a higher neutral hydrogen column density at one of the predicted disk-passing phases.
△ Less
Submitted 3 October, 2021;
originally announced October 2021.
-
In Defense of Liquid Democracy
Authors:
Daniel Halpern,
Joseph Y. Halpern,
Ali Jadbabaie,
Elchanan Mossel,
Ariel D. Procaccia,
Manon Revel
Abstract:
Fluid democracy is a voting paradigm that allows voters to choose between directly voting and transitively delegating their votes to other voters. While fluid democracy has been viewed as a system that can combine the best aspects of direct and representative democracy, it can also result in situations where few voters amass a large amount of influence. To analyze the impact of this shortcoming, w…
▽ More
Fluid democracy is a voting paradigm that allows voters to choose between directly voting and transitively delegating their votes to other voters. While fluid democracy has been viewed as a system that can combine the best aspects of direct and representative democracy, it can also result in situations where few voters amass a large amount of influence. To analyze the impact of this shortcoming, we consider what has been called an epistemic setting, where voters decide on a binary issue for which there is a ground truth. Previous work has shown that under certain assumptions on the delegation mechanism, the concentration of power is so severe that fluid democracy is less likely to identify the ground truth than direct voting. We examine different, arguably more realistic, classes of mechanisms, and prove they behave well by ensuring that (with high probability) there is a limit on concentration of power. Our proofs demonstrate that delegations can be treated as stochastic processes and that they can be compared to well-known processes from the literature -- such as preferential attachment and multi-types branching process -- that are sufficiently bounded for our purposes. Our results suggest that the concerns raised about fluid democracy can be overcome, thereby bolstering the case for this emerging paradigm.
△ Less
Submitted 29 March, 2022; v1 submitted 25 July, 2021;
originally announced July 2021.
-
Language-based Decisions
Authors:
Adam Bjorndahl,
Joseph Y. Halpern
Abstract:
In Savage's classic decision-theoretic framework, actions are formally defined as functions from states to outcomes. But where do the state space and outcome space come from? Expanding on recent work by Blume, Easley, and Halpern (BEH), we consider a language-based framework in which actions are identified with (conditional) descriptions in a simple underlying language, while states and outcomes (…
▽ More
In Savage's classic decision-theoretic framework, actions are formally defined as functions from states to outcomes. But where do the state space and outcome space come from? Expanding on recent work by Blume, Easley, and Halpern (BEH), we consider a language-based framework in which actions are identified with (conditional) descriptions in a simple underlying language, while states and outcomes (along with probabilities and utilities) are constructed as part of a representation theorem. Our work expands the role of language from that of BEH by using it not only for the conditions that determine which actions are taken, but also the effects. More precisely, we take the set of actions to be built from those of the form "do(phi)", for formulas phi in the underlying language. This presents a problem: how do we interpret the result of do(phi) when phi is underspecified (i.e., compatible with multiple states)? We answer this using tools familiar from the semantics of counterfactuals: roughly speaking, do(phi) maps each state to the "closest" phi-state. This notion of "closest" is also something we construct as part of the representation theorem; in effect, then, we prove that (under appropriate assumptions) the agent is acting as if each underspecified action is first made definite and then evaluated (i.e., by maximizing expected utility). Of course, actions in the real world are often not presented in a fully precise manner, yet agents reason about and form preferences among them all the same. Our work brings the abstract tools of decision theory into closer contact with such real-world scenarios.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
Proceedings Eighteenth Conference on Theoretical Aspects of Rationality and Knowledge
Authors:
Joseph Halpern,
Andrés Perea
Abstract:
The TARK conference (Theoretical Aspects of Rationality and Knowledge) is a biannual conference that aims to bring together researchers from a wide variety of fields, including computer science, artificial intelligence, game theory, decision theory, philosophy, logic, linguistics, and cognitive science. Its goal is to further our understanding of interdisciplinary issues involving reasoning about…
▽ More
The TARK conference (Theoretical Aspects of Rationality and Knowledge) is a biannual conference that aims to bring together researchers from a wide variety of fields, including computer science, artificial intelligence, game theory, decision theory, philosophy, logic, linguistics, and cognitive science. Its goal is to further our understanding of interdisciplinary issues involving reasoning about rationality and knowledge.
Topics of interest include, but are not limited to, semantic models for knowledge, belief, awareness and uncertainty, bounded rationality and resource-bounded reasoning, commonsense epistemic reasoning, epistemic logic, epistemic game theory, knowledge and action, applications of reasoning about knowledge and other mental states, belief revision, and foundations of multi-agent systems.
These proceedings contain the papers that have been accepted for presentation at the Eighteenth Conference on Theoretical Aspects of Rationality and Knowledge (TARK 2021), held between June 25 and June 27, 2021, at Tsinghua University at Bei**g, China.
△ Less
Submitted 21 June, 2021;
originally announced June 2021.
-
Radio Detection of PSR J1813-1749 in HESS J1813-178: The Most Scattered Pulsar Known
Authors:
F. Camilo,
S. M. Ransom,
J. P. Halpern,
D. A. Roshi
Abstract:
The 44.7 ms X-ray pulsar in the supernova remnant G12.82-0.02/HESS J1813-178 has the second highest spin-down luminosity of known pulsars in the Galaxy, with E-dot=5.6e37 erg/s. Using the Green Bank Telescope, we have detected radio pulsations from PSR J1813-1749 at 4.4-10.2 GHz. The pulse is highly scattered, with an exponential decay timescale τlonger than that of any other pulsar at these frequ…
▽ More
The 44.7 ms X-ray pulsar in the supernova remnant G12.82-0.02/HESS J1813-178 has the second highest spin-down luminosity of known pulsars in the Galaxy, with E-dot=5.6e37 erg/s. Using the Green Bank Telescope, we have detected radio pulsations from PSR J1813-1749 at 4.4-10.2 GHz. The pulse is highly scattered, with an exponential decay timescale τlonger than that of any other pulsar at these frequencies. A point source detected at this position by Dzib et al. in several observations with the Jansky Very Large Array can be attributed to the pulsed emission. The steep dependence of τon observing frequency explains why all previous pulsation searches at lower frequencies failed (τ~0.25 s at 2 GHz). The large dispersion measure, DM=1087 pc/cc, indicates a distance of either 6.2 or 12 kpc according to two widely used models of the electron density distribution in the Galaxy. These disfavor a previously suggested association with a young stellar cluster at the closer distance of 4.8 kpc. The high X-ray measured column density of ~1e23/cm^2 also supports a large distance. If at d~12 kpc, HESS J1813-178 would be one of the most luminous TeV sources in the Galaxy.
△ Less
Submitted 1 June, 2021;
originally announced June 2021.
-
Chandra, NuSTAR, and Optical Observations of the Cataclysmic Variables IGR J17528-2022 and IGR J20063+3641
Authors:
Jeremy Hare,
Jules P. Halpern,
John A. Tomsick,
John R. Thorstensen,
Arash Bodaghee,
Maica Clavel,
Roman Krivonos,
Kaya Mori
Abstract:
We report on Chandra, NuSTAR, and MDM observations of two INTEGRAL sources, namely IGR J17528-2022 and IGR J20063+3641. IGR J17528-2022 is an unidentified INTEGRAL source, while IGR J20063+3641 was recently identified as a magnetic cataclysmic variable (mCV) by Halpern et al. (2018). The Chandra observation of IGR J17528-2022 has allowed us to locate the optical counterpart to the source and to ob…
▽ More
We report on Chandra, NuSTAR, and MDM observations of two INTEGRAL sources, namely IGR J17528-2022 and IGR J20063+3641. IGR J17528-2022 is an unidentified INTEGRAL source, while IGR J20063+3641 was recently identified as a magnetic cataclysmic variable (mCV) by Halpern et al. (2018). The Chandra observation of IGR J17528-2022 has allowed us to locate the optical counterpart to the source and to obtain its optical spectrum, which shows a strong H$α$ emission line. The optical spectrum and flickering observed in the optical time-series photometry in combination with the X-ray spectrum, which is well fit by an absorbed partially covered thermal bremsstrahlung model, suggests that this source is a strong mCV candidate. The X-ray observations of IGR J20063+3641 reveal a clear modulation with a period of 172.46$\pm0.01$ s, which we attribute to the white dwarf spin period. Additional MDM spectroscopy of the source has also allowed for a clear determination of the orbital period at 0.731$\pm0.015$ d. The X-ray spectrum of this source is also well fit by an absorbed partially covered thermal bremsstrahlung model. The X-ray spectrum, spin periodicity, and orbital periodicity allow this source to be further classified as an intermediate polar.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Lower Bounds Implementing Mediators in Asynchronous Systems
Authors:
Ivan Geffner,
Joseph Y. Halpern
Abstract:
Abraham, Dolev, Geffner, and Halpern proved that, in asynchronous systems, a $(k,t)$-robust equilibrium for $n$ players and a trusted mediator can be implemented without the mediator as long as $n > 4(k+t)$, where an equilibrium is $(k,t)$-robust if, roughly speaking, no coalition of $t$ players can decrease the payoff of any of the other players, and no coalition of $k$ players can increase their…
▽ More
Abraham, Dolev, Geffner, and Halpern proved that, in asynchronous systems, a $(k,t)$-robust equilibrium for $n$ players and a trusted mediator can be implemented without the mediator as long as $n > 4(k+t)$, where an equilibrium is $(k,t)$-robust if, roughly speaking, no coalition of $t$ players can decrease the payoff of any of the other players, and no coalition of $k$ players can increase their payoff by deviating. We prove that this bound is tight, in the sense that if $n \le 4(k+t)$ there exist $(k,t)$-robust equilibria with a mediator that cannot be implemented by the players alone. Even though implementing $(k,t)$-robust mediators seems closely related to implementing asynchronous multiparty $(k+t)$-secure computation \cite{BCG93}, to the best of our knowledge there is no known straightforward reduction from one problem to another. Nevertheless, we show that there is a non-trivial reduction from a slightly weaker notion of $(k+t)$-secure computation, which we call $(k+t)$-strict secure computation, to implementing $(k,t)$-robust mediators. We prove the desired lower bound by showing that there are functions on $n$ variables that cannot be $(k+t)$-strictly securely computed if $n \le 4(k+t)$. This also provides a simple alternative proof for the well-known lower bound of $4t+1$ on asynchronous secure computation in the presence of up to $t$ malicious agents.
△ Less
Submitted 6 April, 2021;
originally announced April 2021.
-
Security Properties as Nested Causal Statements
Authors:
Matvey Soloviev,
Joseph Y. Halpern
Abstract:
Thinking in terms of causality helps us structure how different parts of a system depend on each other, and how interventions on one part of a system may result in changes to other parts. Therefore, formal models of causality are an attractive tool for reasoning about security, which concerns itself with safeguarding properties of a system against interventions that may be malicious. As we show, m…
▽ More
Thinking in terms of causality helps us structure how different parts of a system depend on each other, and how interventions on one part of a system may result in changes to other parts. Therefore, formal models of causality are an attractive tool for reasoning about security, which concerns itself with safeguarding properties of a system against interventions that may be malicious. As we show, many security properties are naturally expressed as nested causal statements: not only do we consider what caused a particular undesirable effect, but we also consider what caused this causal relationship itself to hold. We present a natural way to extend the Halpern-Pearl (HP) framework for causality to capture such nested causal statements. This extension adds expressivity, enabling the HP framework to distinguish between causal scenarios that it could not previously naturally tell apart. We moreover revisit some design decisions of the HP framework that were made with non-nested causal statements in mind, such as the choice to treat specific values of causal variables as opposed to the variables themselves as causes, and may no longer be appropriate for nested ones.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Probabilistic Dependency Graphs
Authors:
Oliver Richardson,
Joseph Y Halpern
Abstract:
We introduce Probabilistic Dependency Graphs (PDGs), a new class of directed graphical models. PDGs can capture inconsistent beliefs in a natural way and are more modular than Bayesian Networks (BNs), in that they make it easier to incorporate new information and restructure the representation. We show by example how PDGs are an especially natural modeling tool. We provide three semantics for PDGs…
▽ More
We introduce Probabilistic Dependency Graphs (PDGs), a new class of directed graphical models. PDGs can capture inconsistent beliefs in a natural way and are more modular than Bayesian Networks (BNs), in that they make it easier to incorporate new information and restructure the representation. We show by example how PDGs are an especially natural modeling tool. We provide three semantics for PDGs, each of which can be derived from a scoring function (on joint distributions over the variables in the network) that can be viewed as representing a distribution's incompatibility with the PDG. For the PDG corresponding to a BN, this function is uniquely minimized by the distribution the BN represents, showing that PDG semantics extend BN semantics. We show further that factor graphs and their exponential families can also be faithfully represented as PDGs, while there are significant barriers to modeling a PDG with a factor graph.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
The Timing Behavior of the Central Compact Object Pulsar 1E 1207.4-5209
Authors:
E. V. Gotthelf,
J. P. Halpern
Abstract:
We present 20 years of timing observations for 1E 1207.4-5209, the central compact object in supernova remnant PKS 1209-51/52, to follow up on our detection of an unexpected timing glitch in its spin-down. Using new XMM-Newton and NICER observations of 1E 1207.4-5209, we now find that the phase ephemeris can be well modelled by either two small glitches, or extreme timing noise. The implied magnit…
▽ More
We present 20 years of timing observations for 1E 1207.4-5209, the central compact object in supernova remnant PKS 1209-51/52, to follow up on our detection of an unexpected timing glitch in its spin-down. Using new XMM-Newton and NICER observations of 1E 1207.4-5209, we now find that the phase ephemeris can be well modelled by either two small glitches, or extreme timing noise. The implied magnitudes of the frequency glitches are Delta f/f = (9+\-2)E-10 and Delta f/f = (3.7+/-0.7)E-10, at epochs 2010.9 and 2014.4, respectively. The updated timing solutions also rule out our previous suggestion of a large glitch in the frequency derivative fdot. No other canonical pulsar with such a small spin-down rate (fdot = -1.2E-16 Hz/s) or surface dipole magnetic field strength (B_s = 9.8E10 G) has been observed to glitch; the glitch activity parameter of 1E 1207.4-5209 is larger than that of more energetic pulsars. Alternative parameterizations that do not involve glitches can fit the data, but they have timing residuals or a second frequency derivative fddot that are orders of magnitude larger than in pulsars with similar spin-down parameters. These timing properties of 1E 1207.4-5209 further motivate the leading theory of central compact objects, that an initial B-field of normal strength was buried in the neutron star crust by fallback of supernova ejecta, suppressing the surface dipole field. The slow reemergence of the buried field may be involved in triggering glitches or excess timing noise.
△ Less
Submitted 20 July, 2020;
originally announced July 2020.
-
Dynamic Awareness
Authors:
Joseph Y. Halpern,
Evan Piermont
Abstract:
We investigate how to model the beliefs of an agent who becomes more aware. We use the framework of Halpern and Rego (2013) by adding probability, and define a notion of a model transition that describes constraints on how, if an agent becomes aware of a new formula $φ$ in state $s$ of a model $M$, she transitions to state $s^*$ in a model $M^*$. We then discuss how such a model can be applied to…
▽ More
We investigate how to model the beliefs of an agent who becomes more aware. We use the framework of Halpern and Rego (2013) by adding probability, and define a notion of a model transition that describes constraints on how, if an agent becomes aware of a new formula $φ$ in state $s$ of a model $M$, she transitions to state $s^*$ in a model $M^*$. We then discuss how such a model can be applied to information disclosure.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits
Authors:
Xinming Liu,
Joseph Y. Halpern
Abstract:
While traditional economics assumes that humans are fully rational agents who always maximize their expected utility, in practice, we constantly observe apparently irrational behavior. One explanation is that people have limited computational power, so that they are, quite rationally, making the best decisions they can, given their computational limitations. To test this hypothesis, we consider th…
▽ More
While traditional economics assumes that humans are fully rational agents who always maximize their expected utility, in practice, we constantly observe apparently irrational behavior. One explanation is that people have limited computational power, so that they are, quite rationally, making the best decisions they can, given their computational limitations. To test this hypothesis, we consider the multi-armed bandit (MAB) problem. We examine a simple strategy for playing an MAB that can be implemented easily by a probabilistic finite automaton (PFA). Roughly speaking, the PFA sets certain expectations, and plays an arm as long as it meets them. If the PFA has sufficiently many states, it performs near-optimally. Its performance degrades gracefully as the number of states decreases. Moreover, the PFA acts in a "human-like" way, exhibiting a number of standard human biases, like an optimism bias and a negativity bias.
△ Less
Submitted 30 June, 2020;
originally announced June 2020.
-
Information Acquisition Under Resource Limitations in a Noisy Environment
Authors:
Matvey Soloviev,
Joseph Y. Halpern
Abstract:
We introduce a theoretical model of information acquisition under resource limitations in a noisy environment. An agent must guess the truth value of a given Boolean formula $\varphi$ after performing a bounded number of noisy tests of the truth values of variables in the formula. We observe that, in general, the problem of finding an optimal testing strategy for $φ$ is hard, but we suggest a usef…
▽ More
We introduce a theoretical model of information acquisition under resource limitations in a noisy environment. An agent must guess the truth value of a given Boolean formula $\varphi$ after performing a bounded number of noisy tests of the truth values of variables in the formula. We observe that, in general, the problem of finding an optimal testing strategy for $φ$ is hard, but we suggest a useful heuristic. The techniques we use also give insight into two apparently unrelated, but well-studied problems: (1) \emph{rational inattention}, that is, when it is rational to ignore pertinent information (the optimal strategy may involve hardly ever testing variables that are clearly relevant to $φ$), and (2) what makes a formula hard to learn/remember.
△ Less
Submitted 20 May, 2020;
originally announced May 2020.
-
MDPs with Unawareness in Robotics
Authors:
Nan Rong,
Joseph Y. Halpern,
Ashutosh Saxena
Abstract:
We formalize decision-making problems in robotics and automated control using continuous MDPs and actions that take place over continuous time intervals. We then approximate the continuous MDP using finer and finer discretizations. Doing this results in a family of systems, each of which has an extremely large action space, although only a few actions are "interesting". We can view the decision ma…
▽ More
We formalize decision-making problems in robotics and automated control using continuous MDPs and actions that take place over continuous time intervals. We then approximate the continuous MDP using finer and finer discretizations. Doing this results in a family of systems, each of which has an extremely large action space, although only a few actions are "interesting". We can view the decision maker as being unaware of which actions are "interesting". We can model this using MDPUs, MDPs with unawareness, where the action space is much smaller. As we show, MDPUs can be used as a general framework for learning tasks in robotic problems. We prove results on the difficulty of learning a near-optimal policy in an an MDPU for a continuous task. We apply these ideas to the problem of having a humanoid robot learn on its own how to walk.
△ Less
Submitted 20 May, 2020;
originally announced May 2020.