-
First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data
Authors:
Super-Kamiokande,
T2K collaborations,
:,
S. Abe,
K. Abe,
N. Akhlaq,
R. Akutsu,
H. Alarakia-Charles,
A. Ali,
Y. I. Alj Hakim,
S. Alonso Monsalve,
S. Amanai,
C. Andreopoulos,
L. H. V. Anthony,
M. Antonova,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
Y. Asada,
R. Asaka,
Y. Ashida,
E. T. Atkin,
N. Babu
, et al. (524 additional authors not shown)
Abstract:
The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlap** in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of…
▽ More
The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlap** in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Combined Pre-Supernova Alert System with Kamland and Super-Kamiokande
Authors:
KamLAND,
Super-Kamiokande Collaborations,
:,
Seisho Abe,
Minori Eizuka,
Sawako Futagi,
Azusa Gando,
Yoshihito Gando,
Shun Goto,
Takahiko Hachiya,
Kazumi Hata,
Koichi Ichimura,
Sei Ieki,
Haruo Ikeda,
Kunio Inoue,
Koji Ishidoshiro,
Yuto Kamei,
Nanami Kawada,
Yasuhiro Kishimoto,
Masayuki Koga,
Maho Kurasawa,
Tadao Mitsui,
Haruhiko Miyake,
Daisuke Morita,
Takeshi Nakahata
, et al. (290 additional authors not shown)
Abstract:
Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are ob…
▽ More
Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are observed, an early warning of the upcoming core-collapse supernova can be provided. In light of this, KamLAND and Super-Kamiokande, both located in the Kamioka mine in Japan, have been monitoring pre-supernova neutrinos since 2015 and 2021, respectively. Recently, we performed a joint study between KamLAND and Super-Kamiokande on pre-supernova neutrino detection. A pre-supernova alert system combining the KamLAND detector and the Super-Kamiokande detector was developed and put into operation, which can provide a supernova alert to the astrophysics community. Fully leveraging the complementary properties of these two detectors, the combined alert is expected to resolve a pre-supernova neutrino signal from a 15 M$_{\odot}$ star within 510 pc of the Earth, at a significance level corresponding to a false alarm rate of no more than 1 per century. For a Betelgeuse-like model with optimistic parameters, it can provide early warnings up to 12 hours in advance.
△ Less
Submitted 1 July, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Development of a data overflow protection system for Super-Kamiokande to maximize data from nearby supernovae
Authors:
M. Mori,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu
, et al. (230 additional authors not shown)
Abstract:
Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem,…
▽ More
Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem, two new DAQ modules were developed to aid in the observation of very nearby supernovae. The first of these, the SN module, is designed to save only the number of hit PMTs during a supernova burst and the second, the Veto module, prescales the high rate neutrino events to prevent the QBEE from overflowing based on information from the SN module. In the event of a very nearby supernova, these modules allow SK to reconstruct the time evolution of the neutrino event rate from beginning to end using both QBEE and SN module data. This paper presents the development and testing of these modules together with an analysis of supernova-like data generated with a flashing laser diode. We demonstrate that the Veto module successfully prevents DAQ overflows for Betelgeuse-like supernovae as well as the long-term stability of the new modules. During normal running the Veto module is found to issue DAQ vetos a few times per month resulting in a total dead time less than 1\,ms, and does not influence ordinary operations. Additionally, using simulation data we find that supernovae closer than 800~pc will trigger Veto module resulting in a prescaling of the observed neutrino data.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Authors:
David Raposo,
Sam Ritter,
Blake Richards,
Timothy Lillicrap,
Peter Conway Humphreys,
Adam Santoro
Abstract:
Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate FLOPs (or compute) to specific positions in a sequence, optimising the allocation along the sequence for different layers across the model depth. Our method enforces a total compute budget by cap** the number of tokens ($k$) that…
▽ More
Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate FLOPs (or compute) to specific positions in a sequence, optimising the allocation along the sequence for different layers across the model depth. Our method enforces a total compute budget by cap** the number of tokens ($k$) that can participate in the self-attention and MLP computations at a given layer. The tokens to be processed are determined by the network using a top-$k$ routing mechanism. Since $k$ is defined a priori, this simple procedure uses a static computation graph with known tensor sizes, unlike other conditional computation techniques. Nevertheless, since the identities of the $k$ tokens are fluid, this method can expend FLOPs non-uniformly across the time and model depth dimensions. Thus, compute expenditure is entirely predictable in sum total, but dynamic and context-sensitive at the token-level. Not only do models trained in this way learn to dynamically allocate compute, they do so efficiently. These models match baseline performance for equivalent FLOPS and wall-clock times to train, but require a fraction of the FLOPs per forward pass, and can be upwards of 50\% faster to step during post-training sampling.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Measurements of the charge ratio and polarization of cosmic-ray muons with the Super-Kamiokande detector
Authors:
H. Kitagawa,
T. Tada,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya
, et al. (231 additional authors not shown)
Abstract:
We present the results of the charge ratio ($R$) and polarization ($P^μ_{0}$) measurements using the decay electron events collected from 2008 September to 2022 June by the Super-Kamiokande detector. Because of its underground location and long operation, we performed high precision measurements by accumulating cosmic-ray muons. We measured the muon charge ratio to be $R=1.32 \pm 0.02$…
▽ More
We present the results of the charge ratio ($R$) and polarization ($P^μ_{0}$) measurements using the decay electron events collected from 2008 September to 2022 June by the Super-Kamiokande detector. Because of its underground location and long operation, we performed high precision measurements by accumulating cosmic-ray muons. We measured the muon charge ratio to be $R=1.32 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at $E_μ\cos θ_{\mathrm{Zenith}}=0.7^{+0.3}_{-0.2}$ $\mathrm{TeV}$, where $E_μ$ is the muon energy and $θ_{\mathrm{Zenith}}$ is the zenith angle of incoming cosmic-ray muons. This result is consistent with the Honda flux model while this suggests a tension with the $πK$ model of $1.9σ$. We also measured the muon polarization at the production location to be $P^μ_{0}=0.52 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at the muon momentum of $0.9^{+0.6}_{-0.1}$ $\mathrm{TeV}/c$ at the surface of the mountain; this also suggests a tension with the Honda flux model of $1.5σ$. This is the most precise measurement ever to experimentally determine the cosmic-ray muon polarization near $1~\mathrm{TeV}/c$. These measurement results are useful to improve the atmospheric neutrino simulations.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Second gadolinium loading to Super-Kamiokande
Authors:
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu,
M. Shiozawa
, et al. (225 additional authors not shown)
Abstract:
The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was do…
▽ More
The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was doubled compared to the first loading, the capacity of the powder dissolving system was doubled. We also developed new batches of gadolinium sulfate with even further reduced radioactive impurities. In addition, a more efficient screening method was devised and implemented to evaluate these new batches of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$. Following the second loading, the Gd concentration in SK was measured to be $333.5\pm2.5$ ppm via an Atomic Absorption Spectrometer (AAS). From the mean neutron capture time constant of neutrons from an Am/Be calibration source, the Gd concentration was independently measured to be 332.7 $\pm$ 6.8(sys.) $\pm$ 1.1(stat.) ppm, consistent with the AAS result. Furthermore, during the loading the Gd concentration was monitored continually using the capture time constant of each spallation neutron produced by cosmic-ray muons,and the final neutron capture efficiency was shown to become 1.5 times higher than that of the first loaded phase, as expected.
△ Less
Submitted 18 June, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Performance of SK-Gd's Upgraded Real-time Supernova Monitoring System
Authors:
Y. Kashiwagi,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu,
M. Shiozawa
, et al. (214 additional authors not shown)
Abstract:
Among multi-messenger observations of the next galactic core-collapse supernova, Super-Kamiokande (SK) plays a critical role in detecting the emitted supernova neutrinos, determining the direction to the supernova (SN), and notifying the astronomical community of these observations in advance of the optical signal. On 2022, SK has increased the gadolinium dissolved in its water target (SK-Gd) and…
▽ More
Among multi-messenger observations of the next galactic core-collapse supernova, Super-Kamiokande (SK) plays a critical role in detecting the emitted supernova neutrinos, determining the direction to the supernova (SN), and notifying the astronomical community of these observations in advance of the optical signal. On 2022, SK has increased the gadolinium dissolved in its water target (SK-Gd) and has achieved a Gd concentration of 0.033%, resulting in enhanced neutron detection capability, which in turn enables more accurate determination of the supernova direction. Accordingly, SK-Gd's real-time supernova monitoring system (Abe te al. 2016b) has been upgraded. SK_SN Notice, a warning system that works together with this monitoring system, was released on December 13, 2021, and is available through GCN Notices (Barthelmy et al. 2000). When the monitoring system detects an SN-like burst of events, SK_SN Notice will automatically distribute an alarm with the reconstructed direction to the supernova candidate within a few minutes. In this paper, we present a systematic study of SK-Gd's response to a simulated galactic SN. Assuming a supernova situated at 10 kpc, neutrino fluxes from six supernova models are used to characterize SK-Gd's pointing accuracy using the same tools as the online monitoring system. The pointing accuracy is found to vary from 3-7$^\circ$ depending on the models. However, if the supernova is closer than 10 kpc, SK_SN Notice can issue an alarm with three-degree accuracy, which will benefit follow-up observations by optical telescopes with large fields of view.
△ Less
Submitted 13 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Solar neutrino measurements using the full data period of Super-Kamiokande-IV
Authors:
Super-Kamiokande Collaboration,
:,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
S. Imaizumi,
K. Iyogi,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
Y. Kato,
Y. Kishimoto,
S. Miki,
S. Mine,
M. Miura,
T. Mochizuki,
S. Moriyama,
Y. Nagao,
M. Nakahata
, et al. (305 additional authors not shown)
Abstract:
An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering th…
▽ More
An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering the data acquisition threshold in May 2015, further reduction of the spallation background using neutron clustering events, precise energy reconstruction considering the time variation of the PMT gain. The observed number of solar neutrino events in $3.49$--$19.49$ MeV electron kinetic energy region during SK-IV is $65,443^{+390}_{-388}\,(\mathrm{stat.})\pm 925\,(\mathrm{syst.})$ events. Corresponding $\mathrm{^{8}B}$ solar neutrino flux is $(2.314 \pm 0.014\, \rm{(stat.)} \pm 0.040 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$, assuming a pure electron-neutrino flavor component without neutrino oscillations. The flux combined with all SK phases up to SK-IV is $(2.336 \pm 0.011\, \rm{(stat.)} \pm 0.043 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$. Based on the neutrino oscillation analysis from all solar experiments, including the SK $5805$~days data set, the best-fit neutrino oscillation parameters are $\rm{sin^{2} θ_{12,\,solar}} = 0.306 \pm 0.013 $ and $Δm^{2}_{21,\,\mathrm{solar}} = (6.10^{+ 0.95}_{-0.81}) \times 10^{-5}~\rm{eV}^{2}$, with a deviation of about 1.5$σ$ from the $Δm^{2}_{21}$ parameter obtained by KamLAND. The best-fit neutrino oscillation parameters obtained from all solar experiments and KamLAND are $\sin^{2} θ_{12,\,\mathrm{global}} = 0.307 \pm 0.012 $ and $Δm^{2}_{21,\,\mathrm{global}} = (7.50^{+ 0.19}_{-0.18}) \times 10^{-5}~\rm{eV}^{2}$.
△ Less
Submitted 20 February, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Addressing Sample Inefficiency in Multi-View Representation Learning
Authors:
Kumar Krishna Agrawal,
Arna Ghosh,
Adam Oberman,
Blake Richards
Abstract:
Non-contrastive self-supervised learning (NC-SSL) methods like BarlowTwins and VICReg have shown great promise for label-free representation learning in computer vision. Despite the apparent simplicity of these techniques, researchers must rely on several empirical heuristics to achieve competitive performance, most notably using high-dimensional projector heads and two augmentations of the same i…
▽ More
Non-contrastive self-supervised learning (NC-SSL) methods like BarlowTwins and VICReg have shown great promise for label-free representation learning in computer vision. Despite the apparent simplicity of these techniques, researchers must rely on several empirical heuristics to achieve competitive performance, most notably using high-dimensional projector heads and two augmentations of the same image. In this work, we provide theoretical insights on the implicit bias of the BarlowTwins and VICReg loss that can explain these heuristics and guide the development of more principled recommendations. Our first insight is that the orthogonality of the features is more critical than projector dimensionality for learning good representations. Based on this, we empirically demonstrate that low-dimensional projector heads are sufficient with appropriate regularization, contrary to the existing heuristic. Our second theoretical insight suggests that using multiple data augmentations better represents the desiderata of the SSL objective. Based on this, we demonstrate that leveraging more augmentations per sample improves representation quality and trainability. In particular, it improves optimization convergence, leading to better features emerging earlier in the training. Remarkably, we demonstrate that we can reduce the pretraining dataset size by up to 4x while maintaining accuracy and improving convergence simply by using more data augmentations. Combining these insights, we present practical pretraining recommendations that improve wall-clock time by 2x and improve performance on CIFAR-10/STL-10 datasets using a ResNet-50 backbone. Thus, this work provides a theoretical insight into NC-SSL and produces practical recommendations for enhancing its sample and compute efficiency.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Deployment of Water-based Liquid Scintillator in the Accelerator Neutrino Neutron Interaction Experiment
Authors:
ANNIE Collaboration,
M. Ascencio-Sosa,
Z. Bagdasarian,
J. Beacom,
M. Bergevin,
M. Breisch,
G. Caceres Vera,
S. Dazeley,
S. Doran,
E. Drakopoulou,
S. Edayath,
R. Edwards,
J. Eisch,
Y. Feng,
V. Fischer,
R. Foster,
S. Gardiner,
S. Gokhale,
P. Hackspacher,
C. Hagner,
J. He,
B. Kaiser,
F. Krennrich,
T. Lachenmaier,
F. Lemmons
, et al. (30 additional authors not shown)
Abstract:
The Accelerator Neutrino Neutron Interaction Experiment (ANNIE) is a 26-ton water Cherenkov neutrino detector installed on the Booster Neutrino Beam (BNB) at Fermilab. Its main physics goals are to perform a measurement of the neutron yield from neutrino-nucleus interactions, as well as a measurement of the charged-current cross section of muon neutrinos. An equally important focus is placed on th…
▽ More
The Accelerator Neutrino Neutron Interaction Experiment (ANNIE) is a 26-ton water Cherenkov neutrino detector installed on the Booster Neutrino Beam (BNB) at Fermilab. Its main physics goals are to perform a measurement of the neutron yield from neutrino-nucleus interactions, as well as a measurement of the charged-current cross section of muon neutrinos. An equally important focus is placed on the research and development of new detector technologies and target media. Specifically water-based liquid scintillator (WbLS) is of interest as a novel detector medium, as it allows for the simultaneous detection of scintillation and Cherenkov light. This paper presents the deployment of a 366L WbLS vessel in ANNIE in March 2023 and the subsequent detection of both Cherenkov light and scintillation from the WbLS. This proof-of-concept allows for the future development of reconstruction and particle identification algorithms in ANNIE, as well as dedicated analyses, such as the search for neutral current events and the hadronic scintillation component within the WbLS volume.
△ Less
Submitted 6 March, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Learning to combine top-down context and feed-forward representations under ambiguity with apical and basal dendrites
Authors:
Nizar Islah,
Guillaume Etter,
Mashbayar Tugsbayar,
Tugce Gurbuz,
Blake Richards,
Eilif Muller
Abstract:
One of the most striking features of neocortical anatomy is the presence of extensive top-down projections into primary sensory areas. Notably, many of these top-down projections im**e on the distal apical dendrites of pyramidal neurons, where they exert a modulatory effect, altering the gain of responses. It is thought that these top-down projections carry contextual information that can help a…
▽ More
One of the most striking features of neocortical anatomy is the presence of extensive top-down projections into primary sensory areas. Notably, many of these top-down projections im**e on the distal apical dendrites of pyramidal neurons, where they exert a modulatory effect, altering the gain of responses. It is thought that these top-down projections carry contextual information that can help animals to resolve ambiguities in sensory data. However, it has yet to be demonstrated how such modulatory connections to the distal apical dendrites can serve this computational function. Here, we develop a computational model of pyramidal cells that integrates contextual information from top-down projections to apical compartments with sensory representations driven by bottom-up projections to basal compartments. When input stimuli are ambiguous and relevant contextual information is available, the apical feedback modulates the basal signals to recover unambiguous sensory representations. Importantly, when stimuli are unambiguous, contextual information which is irrelevant or opposes sensory evidence is appropriately ignored by the model. By generalizing the task to temporal sequences, we further show that our model can learn to integrate contextual information across time. Using layer-wise relevance propagation, we extract the importance of individual neurons to the prediction of each category, revealing that neurons that are most relevant for the overlap of categories receive the largest magnitude of top-down signals, and are necessary for solving the task. This work thus provides a proof-of-concept demonstrating how the top-down modulatory inputs to apical dendrites in sensory regions could be used by the cortex to handle the ambiguities that animals encounter in the real world.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Atmospheric neutrino oscillation analysis with neutron tagging and an expanded fiducial volume in Super-Kamiokande I-V
Authors:
Super-Kamiokande Collaboration,
:,
T. Wester,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya
, et al. (212 additional authors not shown)
Abstract:
We present a measurement of neutrino oscillation parameters with the Super-Kamiokande detector using atmospheric neutrinos from the complete pure-water SK I-V (April 1996-July 2020) data set, including events from an expanded fiducial volume. The data set corresponds to 6511.3 live days and an exposure of 484.2 kiloton-years. Measurements of the neutrino oscillation parameters $Δm^2_{32}$,…
▽ More
We present a measurement of neutrino oscillation parameters with the Super-Kamiokande detector using atmospheric neutrinos from the complete pure-water SK I-V (April 1996-July 2020) data set, including events from an expanded fiducial volume. The data set corresponds to 6511.3 live days and an exposure of 484.2 kiloton-years. Measurements of the neutrino oscillation parameters $Δm^2_{32}$, $\sin^2θ_{23}$, $\sin^2 θ_{13}$, $δ_{CP}$, and the preference for the neutrino mass ordering are presented with atmospheric neutrino data alone, and with constraints on $\sin^2 θ_{13}$ from reactor neutrino experiments. Our analysis including constraints on $\sin^2 θ_{13}$ favors the normal mass ordering at the 92.3% level.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Measurement of the neutrino-oxygen neutral-current quasielastic cross section using atmospheric neutrinos in the SK-Gd experiment
Authors:
S. Sakai,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu
, et al. (211 additional authors not shown)
Abstract:
We report the first measurement of the atmospheric neutrino-oxygen neutral-current quasielastic (NCQE) cross section in the gadolinium-loaded Super-Kamiokande (SK) water Cherenkov detector. In June 2020, SK began a new experimental phase, named SK-Gd, by loading 0.011% by mass of gadolinium into the ultrapure water of the SK detector. The introduction of gadolinium to ultrapure water has the effec…
▽ More
We report the first measurement of the atmospheric neutrino-oxygen neutral-current quasielastic (NCQE) cross section in the gadolinium-loaded Super-Kamiokande (SK) water Cherenkov detector. In June 2020, SK began a new experimental phase, named SK-Gd, by loading 0.011% by mass of gadolinium into the ultrapure water of the SK detector. The introduction of gadolinium to ultrapure water has the effect of improving the neutron-tagging efficiency. Using a 552.2 day data set from August 2020 to June 2022, we measure the NCQE cross section to be 0.74 $\pm$ 0.22(stat.) $^{+0.85}_{-0.15}$ (syst.) $\times$ 10$^{-38}$ cm$^{2}$/oxygen in the energy range from 160 MeV to 10 GeV, which is consistent with the atmospheric neutrino-flux-averaged theoretical NCQE cross section and the measurement in the SK pure-water phase within the uncertainties. Furthermore, we compare the models of the nucleon-nucleus interactions in water and find that the Binary Cascade model and the Liege Intranuclear Cascade model provide a somewhat better fit to the observed data than the Bertini Cascade model. Since the atmospheric neutrino-oxygen NCQE reactions are one of the main backgrounds in the search for diffuse supernova neutrino background (DSNB), these new results will contribute to future studies - and the potential discovery - of the DSNB in SK.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Search for Periodic Time Variations of the Solar $^8$B Neutrino Flux between 1996 and 2018 in Super-Kamiokande
Authors:
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu,
M. Shiozawa
, et al. (211 additional authors not shown)
Abstract:
We report a search for time variations of the solar $^8$B neutrino flux using 5804 live days of Super-Kamiokande data collected between May 31, 1996, and May 30, 2018. Super-Kamiokande measured the precise time of each solar neutrino interaction over 22 calendar years to search for solar neutrino flux modulations with unprecedented precision. Periodic modulations are searched for in a dataset comp…
▽ More
We report a search for time variations of the solar $^8$B neutrino flux using 5804 live days of Super-Kamiokande data collected between May 31, 1996, and May 30, 2018. Super-Kamiokande measured the precise time of each solar neutrino interaction over 22 calendar years to search for solar neutrino flux modulations with unprecedented precision. Periodic modulations are searched for in a dataset comprising five-day interval solar neutrino flux measurements with a maximum likelihood method. We also applied the Lomb-Scargle method to this dataset to compare it with previous reports. The only significant modulation found is due to the elliptic orbit of the Earth around the Sun. The observed modulation is consistent with astronomical data: we measured an eccentricity of (1.53$\pm$0.35)\%, and a perihelion shift of ($-$1.5$\pm$13.5) days.
△ Less
Submitted 6 June, 2024; v1 submitted 2 November, 2023;
originally announced November 2023.
-
A Unified, Scalable Framework for Neural Population Decoding
Authors:
Mehdi Azabou,
Vinam Arora,
Venkataramana Ganesh,
Ximeng Mao,
Santosh Nachimuthu,
Michael J. Mendelson,
Blake Richards,
Matthew G. Perich,
Guillaume Lajoie,
Eva L. Dyer
Abstract:
Our ability to use deep learning approaches to decipher neural activity would likely benefit from greater scale, in terms of both model size and datasets. However, the integration of many neural recordings into one unified model is challenging, as each recording contains the activity of different neurons from different individual animals. In this paper, we introduce a training framework and archit…
▽ More
Our ability to use deep learning approaches to decipher neural activity would likely benefit from greater scale, in terms of both model size and datasets. However, the integration of many neural recordings into one unified model is challenging, as each recording contains the activity of different neurons from different individual animals. In this paper, we introduce a training framework and architecture designed to model the population dynamics of neural activity across diverse, large-scale neural recordings. Our method first tokenizes individual spikes within the dataset to build an efficient representation of neural events that captures the fine temporal structure of neural activity. We then employ cross-attention and a PerceiverIO backbone to further construct a latent tokenization of neural population activities. Utilizing this architecture and training framework, we construct a large-scale multi-session model trained on large datasets from seven nonhuman primates, spanning over 158 different sessions of recording from over 27,373 neural units and over 100 hours of recordings. In a number of different tasks, we demonstrate that our pretrained model can be rapidly adapted to new, unseen sessions with unspecified neuron correspondence, enabling few-shot performance with minimal labels. This work presents a powerful new approach for building deep learning tools to analyze neural data and stakes out a clear path to training at scale.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Super-resolution diamond magnetic microscopy of superparamagnetic nanoparticles
Authors:
Nazanin Mosavian,
Forrest Hubert,
Janis Smits,
Pauli Kehayias,
Yaser Silani,
Bryan A. Richards,
Victor M. Acosta
Abstract:
Scanning-probe and wide-field magnetic microscopes based on Nitrogen-Vacancy (NV) centers in diamond have enabled remarkable advances in the study of biology and materials, but each method has drawbacks. Here, we implement an alternative method for nanoscale magnetic microscopy based on optical control of the charge state of NV centers in a dense layer near the diamond surface. By combining a donu…
▽ More
Scanning-probe and wide-field magnetic microscopes based on Nitrogen-Vacancy (NV) centers in diamond have enabled remarkable advances in the study of biology and materials, but each method has drawbacks. Here, we implement an alternative method for nanoscale magnetic microscopy based on optical control of the charge state of NV centers in a dense layer near the diamond surface. By combining a donut-beam super-resolution technique with optically detected magnetic resonance spectroscopy, we imaged the magnetic fields produced by single 30-nm iron-oxide nanoparticles. The magnetic microscope has a lateral spatial resolution of ~100 nm, and it resolves the individual magnetic dipole features from clusters of nanoparticles with interparticle spacings down to ~190 nm. The magnetic feature amplitudes are more than an order of magnitude larger than those obtained by confocal magnetic microscopy due to the smaller characteristic NV-nanoparticle distance within nearby sensing voxels. We analyze the magnetic point-spread function and sensitivity as a function of the microscope's spatial resolution and identify sources of background fluorescence that limit the present performance, including diamond second-order Raman emission and imperfect NV charge-state control. Our method, which uses less than 10 mW laser power and can be parallelized by patterned illumination, introduces a new format for nanoscale magnetic imaging.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Convex Ancient Solutions to Anisotropic Curve Shortening Flow
Authors:
Theodora Bourni,
Benjamin Richards
Abstract:
We construct a translating solution to anisotropic curve shortening flow and show that for a given anisotropic factor $g:S^1\to\mathbb{R}_+$, and a given direction and speed, this translator is unique. We then construct an ancient compact solution to anisotropic curve shortening flow, and show that this solution, along with the appropriate translating solution, are the unique solutions to anisotro…
▽ More
We construct a translating solution to anisotropic curve shortening flow and show that for a given anisotropic factor $g:S^1\to\mathbb{R}_+$, and a given direction and speed, this translator is unique. We then construct an ancient compact solution to anisotropic curve shortening flow, and show that this solution, along with the appropriate translating solution, are the unique solutions to anisotropic curve shortening flow that lie in a slab of a given width and no smaller.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Synaptic Weight Distributions Depend on the Geometry of Plasticity
Authors:
Roman Pogodin,
Jonathan Cornford,
Arna Ghosh,
Gauthier Gidel,
Guillaume Lajoie,
Blake Richards
Abstract:
A growing literature in computational neuroscience leverages gradient descent and learning algorithms that approximate it to study synaptic plasticity in the brain. However, the vast majority of this work ignores a critical underlying assumption: the choice of distance for synaptic changes - i.e. the geometry of synaptic plasticity. Gradient descent assumes that the distance is Euclidean, but many…
▽ More
A growing literature in computational neuroscience leverages gradient descent and learning algorithms that approximate it to study synaptic plasticity in the brain. However, the vast majority of this work ignores a critical underlying assumption: the choice of distance for synaptic changes - i.e. the geometry of synaptic plasticity. Gradient descent assumes that the distance is Euclidean, but many other distances are possible, and there is no reason that biology necessarily uses Euclidean geometry. Here, using the theoretical tools provided by mirror descent, we show that the distribution of synaptic weights will depend on the geometry of synaptic plasticity. We use these results to show that experimentally-observed log-normal weight distributions found in several brain areas are not consistent with standard gradient descent (i.e. a Euclidean geometry), but rather with non-Euclidean distances. Finally, we show that it should be possible to experimentally test for different synaptic geometries by comparing synaptic weight distributions before and after learning. Overall, our work shows that the current paradigm in theoretical work on synaptic plasticity that assumes Euclidean synaptic geometry may be misguided and that it should be possible to experimentally determine the true geometry of synaptic plasticity in the brain.
△ Less
Submitted 4 March, 2024; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Search for astrophysical electron antineutrinos in Super-Kamiokande with 0.01wt% gadolinium-loaded water
Authors:
M. Harada,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya,
H. Shiba
, et al. (216 additional authors not shown)
Abstract:
We report the first search result for the flux of astrophysical electron antineutrinos for energies O(10) MeV in the gadolinium-loaded Super-Kamiokande (SK) detector. In June 2020, gadolinium was introduced to the ultra-pure water of the SK detector in order to detect neutrons more efficiently. In this new experimental phase, SK-Gd, we can search for electron antineutrinos via inverse beta decay w…
▽ More
We report the first search result for the flux of astrophysical electron antineutrinos for energies O(10) MeV in the gadolinium-loaded Super-Kamiokande (SK) detector. In June 2020, gadolinium was introduced to the ultra-pure water of the SK detector in order to detect neutrons more efficiently. In this new experimental phase, SK-Gd, we can search for electron antineutrinos via inverse beta decay with efficient background rejection and higher signal efficiency thanks to the high efficiency of the neutron tagging technique. In this paper, we report the result for the initial stage of SK-Gd with a $22.5\times552$ $\rm kton\cdot day$ exposure at 0.01% Gd mass concentration. No significant excess over the expected background in the observed events is found for the neutrino energies below 31.3 MeV. Thus, the flux upper limits are placed at the 90% confidence level. The limits and sensitivities are already comparable with the previous SK result with pure-water ($22.5 \times 2970 \rm kton\cdot day$) owing to the enhanced neutron tagging.
△ Less
Submitted 30 May, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Nuclear quadrupole resonance spectroscopy with a femtotesla diamond magnetometer
Authors:
Yaser Silani,
Janis Smits,
Ilja Fescenko,
Michael W. Malone,
Andrew F. McDowell,
Andrey Jarmola,
Pauli Kehayias,
Bryan Richards,
Nazanin Mosavian,
Nathaniel Ristoff,
Victor M. Acosta
Abstract:
Sensitive Radio-Frequency (RF) magnetometers that can detect oscillating magnetic fields at the femtotesla level are needed for demanding applications such as Nuclear Quadrupole Resonance (NQR) spectroscopy. RF magnetometers based on Nitrogen-Vacancy (NV) centers in diamond have been predicted to offer femtotesla sensitivity, but published experiments have largely been limited to the picotesla lev…
▽ More
Sensitive Radio-Frequency (RF) magnetometers that can detect oscillating magnetic fields at the femtotesla level are needed for demanding applications such as Nuclear Quadrupole Resonance (NQR) spectroscopy. RF magnetometers based on Nitrogen-Vacancy (NV) centers in diamond have been predicted to offer femtotesla sensitivity, but published experiments have largely been limited to the picotesla level. Here, we demonstrate a femtotesla RF magnetometer based on an NV-doped diamond membrane inserted between two ferrite flux concentrators. The device operates in bias magnetic fields of 2-10 microtesla and provides a ~300-fold amplitude enhancement within the diamond for RF magnetic fields in the 0.07-3.6 MHz range. The magnetometer's sensitivity is ~70 fT s^{1/2} at 0.35 MHz, and the noise floor decreases to below 2 fT after 1 hour of acquisition. We used this sensor to detect the 3.6 MHz NQR signal of 14N in sodium nitrite powder at room temperature. NQR signals are amplified by a resonant RF coil wrapped around the sample, allowing for higher signal-to-noise ratio detection. The diamond RF magnetometer's recovery time after a strong RF pulse is ~35 us, limited by the coil ring-down time. The sodium-nitrite NQR frequency shifts linearly with temperature as -1.00 +/- 0.02 kHz/K, the magnetization dephasing time is T2* = 887 +/- 51 us, and a spin-lock spin-echo pulse sequence extends the signal lifetime to 332 +/- 23 ms, all consistent with coil-based NQR studies. Our results expand the sensitivity frontier of diamond magnetometers to the femtotesla range, with potential applications in security, medical imaging, and materials science.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Measurement of the cosmogenic neutron yield in Super-Kamiokande with gadolinium loaded water
Authors:
Super-Kamiokande Collaboration,
:,
M. Shinoki,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya
, et al. (217 additional authors not shown)
Abstract:
Cosmic-ray muons that enter the Super-Kamiokande detector cause hadronic showers due to spallation in water, producing neutrons and radioactive isotopes. Those are a major background source for studies of MeV-scale neutrinos and searches for rare events. Since 2020, gadolinium was introduced in the ultra-pure water in the Super-Kamiokande detector to improve the detection efficiency of neutrons. I…
▽ More
Cosmic-ray muons that enter the Super-Kamiokande detector cause hadronic showers due to spallation in water, producing neutrons and radioactive isotopes. Those are a major background source for studies of MeV-scale neutrinos and searches for rare events. Since 2020, gadolinium was introduced in the ultra-pure water in the Super-Kamiokande detector to improve the detection efficiency of neutrons. In this study, the cosmogenic neutron yield was measured using data acquired during the period after the gadolinium loading. The yield was found to be $(2.76 \pm 0.02\,\mathrm{(stat.) \pm 0.19\,\mathrm{(syst.)}}) \times 10^{-4}\,μ^{-1} \mathrm{g^{-1} cm^{2}}$ at 259 GeV of average muon energy at the Super-Kamiokande detector.
△ Less
Submitted 25 October, 2023; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer
Authors:
Damjan Kalajdzievski,
Ximeng Mao,
Pascal Fortier-Poisson,
Guillaume Lajoie,
Blake Richards
Abstract:
When presented with a data stream of two statistically dependent variables, predicting the future of one of the variables (the target stream) can benefit from information about both its history and the history of the other variable (the source stream). For example, fluctuations in temperature at a weather station can be predicted using both temperatures and barometric readings. However, a challeng…
▽ More
When presented with a data stream of two statistically dependent variables, predicting the future of one of the variables (the target stream) can benefit from information about both its history and the history of the other variable (the source stream). For example, fluctuations in temperature at a weather station can be predicted using both temperatures and barometric readings. However, a challenge when modelling such data is that it is easy for a neural network to rely on the greatest joint correlations within the target stream, which may ignore a crucial but small information transfer from the source to the target stream. As well, there are often situations where the target stream may have previously been modelled independently and it would be useful to use that model to inform a new joint model. Here, we develop an information bottleneck approach for conditional learning on two dependent streams of data. Our method, which we call Transfer Entropy Bottleneck (TEB), allows one to learn a model that bottlenecks the directed information transferred from the source variable to the target variable, while quantifying this information transfer within the model. As such, TEB provides a useful new information bottleneck approach for modelling two statistically dependent streams of data in order to make predictions about one of them.
△ Less
Submitted 8 March, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Searching for neutrinos from solar flares across solar cycles 23 and 24 with the Super-Kamiokande detector
Authors:
K. Okamoto,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
Y. Kaneshima,
Y. Kataoka,
Y. Kashiwagi,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
Y. Nakano,
S. Nakayama,
Y. Noguchi,
K. Sato,
H. Sekiya,
K. Shimizu,
M. Shiozawa
, et al. (220 additional authors not shown)
Abstract:
Neutrinos associated with solar flares (solar-flare neutrinos) provide information on particle acceleration mechanisms during the impulsive phase of solar flares. We searched using the Super-Kamiokande detector for neutrinos from solar flares that occurred during solar cycles $23$ and $24$, including the largest solar flare (X28.0) on November 4th, 2003. In order to minimize the background rate we…
▽ More
Neutrinos associated with solar flares (solar-flare neutrinos) provide information on particle acceleration mechanisms during the impulsive phase of solar flares. We searched using the Super-Kamiokande detector for neutrinos from solar flares that occurred during solar cycles $23$ and $24$, including the largest solar flare (X28.0) on November 4th, 2003. In order to minimize the background rate we searched for neutrino interactions within narrow time windows coincident with $γ$-rays and soft X-rays recorded by satellites. In addition, we performed the first attempt to search for solar-flare neutrinos from solar flares on the invisible side of the Sun by using the emission time of coronal mass ejections (CMEs). By selecting twenty powerful solar flares above X5.0 on the visible side and eight CMEs whose emission speed exceeds $2000$ $\mathrm{km \, s^{-1}}$ on the invisible side from 1996 to 2018, we found two (six) neutrino events coincident with solar flares occurring on the visible (invisible) side of the Sun, with a typical background rate of $0.10$ ($0.62$) events per flare in the MeV-GeV energy range. No significant solar-flare neutrino signal above the estimated background rate was observed. As a result we set the following upper limit on neutrino fluence at the Earth $\mathitΦ<1.1\times10^{6}$ $\mathrm{cm^{-2}}$ at the $90\%$ confidence level for the largest solar flare. The resulting fluence limits allow us to constrain some of the theoretical models for solar-flare neutrino emission.
△ Less
Submitted 26 October, 2022; v1 submitted 24 October, 2022;
originally announced October 2022.
-
Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution
Authors:
Anthony Zador,
Sean Escola,
Blake Richards,
Bence Ölveczky,
Yoshua Bengio,
Kwabena Boahen,
Matthew Botvinick,
Dmitri Chklovskii,
Anne Churchland,
Claudia Clopath,
James DiCarlo,
Surya Ganguli,
Jeff Hawkins,
Konrad Koerding,
Alexei Koulakov,
Yann LeCun,
Timothy Lillicrap,
Adam Marblestone,
Bruno Olshausen,
Alexandre Pouget,
Cristina Savin,
Terrence Sejnowski,
Eero Simoncelli,
Sara Solla,
David Sussillo
, et al. (2 additional authors not shown)
Abstract:
Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts…
▽ More
Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts the focus from those capabilities like game playing and language that are especially well-developed or uniquely human to those capabilities, inherited from over 500 million years of evolution, that are shared with all animals. Building models that can pass the embodied Turing test will provide a roadmap for the next generation of AI.
△ Less
Submitted 22 February, 2023; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Authors:
Chen Sun,
Wannan Yang,
Thomas Jiralerspong,
Dane Malenfant,
Benjamin Alsbury-Nealy,
Yoshua Bengio,
Blake Richards
Abstract:
In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critica…
▽ More
In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critical steps. This algorithm, which we call Contrastive Retrospection (ConSpec), can be added to any existing RL algorithm. ConSpec learns a set of prototypes for the critical steps in a task by a novel contrastive loss and delivers an intrinsic reward when the current state matches one of the prototypes. The prototypes in ConSpec provide two key benefits for credit assignment: (i) They enable rapid identification of all the critical steps. (ii) They do so in a readily interpretable manner, enabling out-of-distribution generalization when sensory features are altered. Distinct from other contemporary RL approaches to credit assignment, ConSpec takes advantage of the fact that it is easier to retrospectively identify the small set of steps that success is contingent upon (and ignoring other states) than it is to prospectively predict reward at every taken step. ConSpec greatly improves learning in a diverse set of RL tasks. The code is available at the link: https://github.com/sunchipsster1/ConSpec
△ Less
Submitted 27 October, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Search for Cosmic-ray Boosted Sub-GeV Dark Matter using Recoil Protons at Super-Kamiokande
Authors:
The Super-Kamiokande Collaboration,
:,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya,
H. Shiba,
K. Shimizu
, et al. (197 additional authors not shown)
Abstract:
We report a search for cosmic-ray boosted dark matter with protons using the 0.37 megaton$\times$years data collected at Super-Kamiokande experiment during the 1996-2018 period (SKI-IV phase). We searched for an excess of proton recoils above the atmospheric neutrino background from the vicinity of the Galactic Center. No such excess is observed, and limits are calculated for two reference models…
▽ More
We report a search for cosmic-ray boosted dark matter with protons using the 0.37 megaton$\times$years data collected at Super-Kamiokande experiment during the 1996-2018 period (SKI-IV phase). We searched for an excess of proton recoils above the atmospheric neutrino background from the vicinity of the Galactic Center. No such excess is observed, and limits are calculated for two reference models of dark matter with either a constant interaction cross-section or through a scalar mediator. This is the first experimental search for boosted dark matter with hadrons using directional information. The results present the most stringent limits on cosmic-ray boosted dark matter and exclude the dark matter-nucleon elastic scattering cross-section between $10^{-33}\text{ cm}^{2}$ and $10^{-27}\text{ cm}^{2}$ for dark matter mass from 10 MeV/$c^2$ to 1 GeV/$c^2$.
△ Less
Submitted 30 August, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Neutron Tagging following Atmospheric Neutrino Events in a Water Cherenkov Detector
Authors:
K. Abe,
Y. Haga,
Y. Hayato,
K. Hiraide,
K. Ieki,
M. Ikeda,
S. Imaizumi,
K. Iyogi,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
Y. Kato,
Y. Kishimoto,
S. Miki,
S. Mine,
M. Miura,
T. Mochizuki,
S. Moriyama,
Y. Nagao,
M. Nakahata,
T. Nakajima,
Y. Nakano,
S. Nakayama,
T. Okada,
K. Okamoto
, et al. (281 additional authors not shown)
Abstract:
We present the development of neutron-tagging techniques in Super-Kamiokande IV using a neural network analysis. The detection efficiency of neutron capture on hydrogen is estimated to be 26%, with a mis-tag rate of 0.016 per neutrino event. The uncertainty of the tagging efficiency is estimated to be 9.0%. Measurement of the tagging efficiency with data from an Americium-Beryllium calibration agr…
▽ More
We present the development of neutron-tagging techniques in Super-Kamiokande IV using a neural network analysis. The detection efficiency of neutron capture on hydrogen is estimated to be 26%, with a mis-tag rate of 0.016 per neutrino event. The uncertainty of the tagging efficiency is estimated to be 9.0%. Measurement of the tagging efficiency with data from an Americium-Beryllium calibration agrees with this value within 10%. The tagging procedure was performed on 3,244.4 days of SK-IV atmospheric neutrino data, identifying 18,091 neutrons in 26,473 neutrino events. The fitted neutron capture lifetime was measured as 218 \pm 9 μs.
△ Less
Submitted 20 September, 2022; v1 submitted 18 September, 2022;
originally announced September 2022.
-
Responsible AI Implementation: A Human-centered Framework for Accelerating the Innovation Process
Authors:
Dian Tjondronegoro,
Elizabeth Yuwono,
Brent Richards,
Damian Green,
Siiri Hatakka
Abstract:
There is still a significant gap between expectations and the successful adoption of AI to innovate and improve businesses. Due to the emergence of deep learning, AI adoption is more complex as it often incorporates big data and the internet of things, affecting data privacy. Existing frameworks have identified the need to focus on human-centered design, combining technical and business/organizati…
▽ More
There is still a significant gap between expectations and the successful adoption of AI to innovate and improve businesses. Due to the emergence of deep learning, AI adoption is more complex as it often incorporates big data and the internet of things, affecting data privacy. Existing frameworks have identified the need to focus on human-centered design, combining technical and business/organizational perspectives. However, trust remains a critical issue that needs to be designed from the beginning. The proposed framework expands from the human-centered design approach, emphasizing and maintaining the trust that underpins the process. This paper proposes a theoretical framework for responsible artificial intelligence (AI) implementation. The proposed framework emphasizes a synergistic business technology approach for the agile co-creation process. The aim is to streamline the adoption process of AI to innovate and improve business by involving all stakeholders throughout the project so that the AI technology is designed, developed, and deployed in conjunction with people and not in isolation. The framework presents a fresh viewpoint on responsible AI implementation based on analytical literature review, conceptual framework design, and practitioners' mediating expertise. The framework emphasizes establishing and maintaining trust throughout the human-centered design and agile development of AI. This human-centered approach is aligned with and enabled by the privacy by design principle. The creators of the technology and the end-users are working together to tailor the AI solution specifically for the business requirements and human characteristics. An illustrative case study on adopting AI for assisting planning in a hospital will demonstrate that the proposed framework applies to real-life applications.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
The neuroconnectionist research programme
Authors:
Adrien Doerig,
Rowan Sommers,
Katja Seeliger,
Blake Richards,
Jenann Ismael,
Grace Lindsay,
Konrad Kording,
Talia Konkle,
Marcel A. J. Van Gerven,
Nikolaus Kriegeskorte,
Tim C. Kietzmann
Abstract:
Artificial Neural Networks (ANNs) inspired by biology are beginning to be widely used to model behavioral and neural data, an approach we call neuroconnectionism. ANNs have been lauded as the current best models of information processing in the brain, but also criticized for failing to account for basic cognitive functions. We propose that arguing about the successes and failures of a restricted s…
▽ More
Artificial Neural Networks (ANNs) inspired by biology are beginning to be widely used to model behavioral and neural data, an approach we call neuroconnectionism. ANNs have been lauded as the current best models of information processing in the brain, but also criticized for failing to account for basic cognitive functions. We propose that arguing about the successes and failures of a restricted set of current ANNs is the wrong approach to assess the promise of neuroconnectionism. Instead, we take inspiration from the philosophy of science, and in particular from Lakatos, who showed that the core of scientific research programmes is often not directly falsifiable, but should be assessed by its capacity to generate novel insights. Following this view, we present neuroconnectionism as a cohesive large-scale research programme centered around ANNs as a computational language for expressing falsifiable theories about brain computation. We describe the core of the programme, the underlying computational framework and its tools for testing specific neuroscientific hypotheses. Taking a longitudinal view, we review past and present neuroconnectionist projects and their responses to challenges, and argue that the research programme is highly progressive, generating new and otherwise unreachable insights into the workings of the brain.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
Search for proton decay via $p\rightarrow μ^+K^0$ in 0.37 megaton-years exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
R. Matsumoto,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya,
H. Shiba
, et al. (208 additional authors not shown)
Abstract:
We searched for proton decay via $p\toμ^+K^0$ in 0.37\,Mton$\cdot$years of data collected between 1996 and 2018 from the Super-Kamiokande water Cherenkov experiment. The selection criteria were defined separately for $K^0_S$ and $K^0_L$ channels. No significant event excess has been observed. As a result of this analysis, which extends the previous search by an additional 0.2\,Mton$\cdot$years of…
▽ More
We searched for proton decay via $p\toμ^+K^0$ in 0.37\,Mton$\cdot$years of data collected between 1996 and 2018 from the Super-Kamiokande water Cherenkov experiment. The selection criteria were defined separately for $K^0_S$ and $K^0_L$ channels. No significant event excess has been observed. As a result of this analysis, which extends the previous search by an additional 0.2\,Mton$\cdot$years of exposure and uses an improved event reconstruction, we set a lower limit of $3.6\times10^{33}$ years on the proton lifetime.
△ Less
Submitted 28 August, 2022;
originally announced August 2022.
-
On Neural Architecture Inductive Biases for Relational Tasks
Authors:
Giancarlo Kerg,
Sarthak Mittal,
David Rolnick,
Yoshua Bengio,
Blake Richards,
Guillaume Lajoie
Abstract:
Current deep learning approaches have shown good in-distribution generalization performance, but struggle with out-of-distribution generalization. This is especially true in the case of tasks involving abstract relations like recognizing rules in sequences, as we find in many intelligence tests. Recent work has explored how forcing relational representations to remain distinct from sensory represe…
▽ More
Current deep learning approaches have shown good in-distribution generalization performance, but struggle with out-of-distribution generalization. This is especially true in the case of tasks involving abstract relations like recognizing rules in sequences, as we find in many intelligence tests. Recent work has explored how forcing relational representations to remain distinct from sensory representations, as it seems to be the case in the brain, can help artificial systems. Building on this work, we further explore and formalize the advantages afforded by 'partitioned' representations of relations and sensory details, and how this inductive bias can help recompose learned relational structure in newly encountered settings. We introduce a simple architecture based on similarity scores which we name Compositional Relational Network (CoRelNet). Using this model, we investigate a series of inductive biases that ensure abstract relations are learned and represented distinctly from sensory data, and explore their effects on out-of-distribution generalization for a series of relational psychophysics tasks. We find that simple architectural choices can outperform existing models in out-of-distribution generalization. Together, these results show that partitioning relational representations from other information streams may be a simple way to augment existing network architectures' robustness when performing out-of-distribution relational computations.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
Search for supernova bursts in Super-Kamiokande IV
Authors:
The Super-Kamiokande collaboration,
:,
M. Mori,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Ieki,
M. Ikeda,
S. Imaizumi,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
Y. Nakano,
S. Nakayama,
Y. Noguchi,
T. Okada,
K. Okamoto
, et al. (223 additional authors not shown)
Abstract:
Super-Kamiokande has been searching for neutrino bursts characteristic of core-collapse supernovae continuously, in real time, since the start of operations in 1996. The present work focuses on detecting more distant supernovae whose event rate may be too small to trigger in real time, but may be identified using an offline approach. The analysis of data collected from 2008 to 2018 found no eviden…
▽ More
Super-Kamiokande has been searching for neutrino bursts characteristic of core-collapse supernovae continuously, in real time, since the start of operations in 1996. The present work focuses on detecting more distant supernovae whose event rate may be too small to trigger in real time, but may be identified using an offline approach. The analysis of data collected from 2008 to 2018 found no evidence of distant supernovae bursts. This establishes an upper limit of 0.29 year$^{-1}$ on the rate of core-collapse supernovae out to 100 kpc at 90% C.L.. For supernovae that fail to explode and collapse directly to black holes the limit reaches to 300 kpc.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules
Authors:
Yuhan Helena Liu,
Arna Ghosh,
Blake A. Richards,
Eric Shea-Brown,
Guillaume Lajoie
Abstract:
To unveil how the brain learns, ongoing work seeks biologically-plausible approximations of gradient descent algorithms for training recurrent neural networks (RNNs). Yet, beyond task accuracy, it is unclear if such learning rules converge to solutions that exhibit different levels of generalization than their nonbiologically-plausible counterparts. Leveraging results from deep learning theory bas…
▽ More
To unveil how the brain learns, ongoing work seeks biologically-plausible approximations of gradient descent algorithms for training recurrent neural networks (RNNs). Yet, beyond task accuracy, it is unclear if such learning rules converge to solutions that exhibit different levels of generalization than their nonbiologically-plausible counterparts. Leveraging results from deep learning theory based on loss landscape curvature, we ask: how do biologically-plausible gradient approximations affect generalization? We first demonstrate that state-of-the-art biologically-plausible learning rules for training RNNs exhibit worse and more variable generalization performance compared to their machine learning counterparts that follow the true gradient more closely. Next, we verify that such generalization performance is correlated significantly with loss landscape curvature, and we show that biologically-plausible learning rules tend to approach high-curvature regions in synaptic weight space. Using tools from dynamical systems, we derive theoretical arguments and present a theorem explaining this phenomenon. This predicts our numerical results, and explains why biologically-plausible rules lead to worse and more variable generalization properties. Finally, we suggest potential remedies that could be used by the brain to mitigate this effect. To our knowledge, our analysis is the first to identify the reason for this generalization gap between artificial and biologically-plausible learning rules, which can help guide future investigations into how the brain learns solutions that generalize.
△ Less
Submitted 13 January, 2023; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Evaluating Multimodal Interactive Agents
Authors:
Josh Abramson,
Arun Ahuja,
Federico Carnevale,
Petko Georgiev,
Alex Goldin,
Alden Hung,
Jessica Landon,
Timothy Lillicrap,
Alistair Muldal,
Blake Richards,
Adam Santoro,
Tamara von Glehn,
Greg Wayne,
Nathaniel Wong,
Chen Yan
Abstract:
Creating agents that can interact naturally with humans is a common goal in artificial intelligence (AI) research. However, evaluating these interactions is challenging: collecting online human-agent interactions is slow and expensive, yet faster proxy metrics often do not correlate well with interactive evaluation. In this paper, we assess the merits of these existing evaluation metrics and prese…
▽ More
Creating agents that can interact naturally with humans is a common goal in artificial intelligence (AI) research. However, evaluating these interactions is challenging: collecting online human-agent interactions is slow and expensive, yet faster proxy metrics often do not correlate well with interactive evaluation. In this paper, we assess the merits of these existing evaluation metrics and present a novel approach to evaluation called the Standardised Test Suite (STS). The STS uses behavioural scenarios mined from real human interaction data. Agents see replayed scenario context, receive an instruction, and are then given control to complete the interaction offline. These agent continuations are recorded and sent to human annotators to mark as success or failure, and agents are ranked according to the proportion of continuations in which they succeed. The resulting STS is fast, controlled, interpretable, and representative of naturalistic interactions. Altogether, the STS consolidates much of what is desirable across many of our standard evaluation metrics, allowing us to accelerate research progress towards producing agents that can interact naturally with humans. A video may be found at https://youtu.be/YR1TngGORGQ.
△ Less
Submitted 14 July, 2022; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Pre-Supernova Alert System for Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
L. N. Machado,
K. Abe,
Y. Hayato,
K. Hiraide,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
Y. Nakano,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto,
K. Sato,
H. Sekiya,
H. Shiba
, et al. (202 additional authors not shown)
Abstract:
In 2020, the Super-Kamiokande (SK) experiment moved to a new stage (SK-Gd) in which gadolinium (Gd) sulfate octahydrate was added to the water in the detector, enhancing the efficiency to detect thermal neutrons and consequently improving the sensitivity to low energy electron anti-neutrinos from inverse beta decay (IBD) interactions. SK-Gd has the potential to provide early alerts of incipient co…
▽ More
In 2020, the Super-Kamiokande (SK) experiment moved to a new stage (SK-Gd) in which gadolinium (Gd) sulfate octahydrate was added to the water in the detector, enhancing the efficiency to detect thermal neutrons and consequently improving the sensitivity to low energy electron anti-neutrinos from inverse beta decay (IBD) interactions. SK-Gd has the potential to provide early alerts of incipient core-collapse supernovae through detection of electron anti-neutrinos from thermal and nuclear processes responsible for the cooling of massive stars before the gravitational collapse of their cores. These pre-supernova neutrinos emitted during the silicon burning phase can exceed the energy threshold for IBD reactions. We present the sensitivity of SK-Gd to pre-supernova stars and the techniques used for the development of a pre-supernova alarm based on the detection of these neutrinos in SK, as well as prospects for future SK-Gd phases with higher concentrations of Gd. For the current SK-Gd phase, high-confidence alerts for Betelgeuse could be issued up to nine hours in advance of the core-collapse itself.
△ Less
Submitted 17 August, 2022; v1 submitted 19 May, 2022;
originally announced May 2022.
-
Testing Non-Standard Interactions Between Solar Neutrinos and Quarks with Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
P. Weatherly,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
M. Ikeda,
K. Iyogi,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
Y. Kato,
Y. Kishimoto,
S. Miki,
M. Miura,
S. Moriyama,
T. Mochizuki,
M. Nakahata,
Y. Nakano,
S. Nakayama,
T. Okada,
K. Okamoto,
A. Orii,
G. Pronost
, et al. (248 additional authors not shown)
Abstract:
Non-Standard Interactions (NSI) between neutrinos and matter affect the neutrino flavor oscillations. Due to the high matter density in the core of the Sun, solar neutrinos are suited to probe these interactions. Using the $277$ kton-yr exposure of Super-Kamiokande to $^{8}$B solar neutrinos, we search for the presence of NSI. Our data favors the presence of NSI with down quarks at 1.8$σ$, and wit…
▽ More
Non-Standard Interactions (NSI) between neutrinos and matter affect the neutrino flavor oscillations. Due to the high matter density in the core of the Sun, solar neutrinos are suited to probe these interactions. Using the $277$ kton-yr exposure of Super-Kamiokande to $^{8}$B solar neutrinos, we search for the presence of NSI. Our data favors the presence of NSI with down quarks at 1.8$σ$, and with up quarks at 1.6$σ$, with the best fit NSI parameters being ($ε_{11}^{d},ε_{12}^{d}$) = (-3.3, -3.1) for $d$-quarks and ($ε_{11}^{u},ε_{12}^{u}$) = (-2.5, -3.1) for $u$-quarks. After combining with data from the Sudbury Neutrino Observatory and Borexino, the significance increases by 0.1$σ$.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Investigating Power laws in Deep Representation Learning
Authors:
Arna Ghosh,
Arnab Kumar Mondal,
Kumar Krishna Agrawal,
Blake Richards
Abstract:
Representation learning that leverages large-scale labelled datasets, is central to recent progress in machine learning. Access to task relevant labels at scale is often scarce or expensive, motivating the need to learn from unlabelled datasets with self-supervised learning (SSL). Such large unlabelled datasets (with data augmentations) often provide a good coverage of the underlying input distrib…
▽ More
Representation learning that leverages large-scale labelled datasets, is central to recent progress in machine learning. Access to task relevant labels at scale is often scarce or expensive, motivating the need to learn from unlabelled datasets with self-supervised learning (SSL). Such large unlabelled datasets (with data augmentations) often provide a good coverage of the underlying input distribution. However evaluating the representations learned by SSL algorithms still requires task-specific labelled samples in the training pipeline. Additionally, the generalization of task-specific encoding is often sensitive to potential distribution shift. Inspired by recent advances in theoretical machine learning and vision neuroscience, we observe that the eigenspectrum of the empirical feature covariance matrix often follows a power law. For visual representations, we estimate the coefficient of the power law, $α$, across three key attributes which influence representation learning: learning objective (supervised, SimCLR, Barlow Twins and BYOL), network architecture (VGG, ResNet and Vision Transformer), and tasks (object and scene recognition). We observe that under mild conditions, proximity of $α$ to 1, is strongly correlated to the downstream generalization performance. Furthermore, $α\approx 1$ is a strong indicator of robustness to label noise during fine-tuning. Notably, $α$ is computable from the representations without knowledge of any labels, thereby offering a framework to evaluate the quality of representations in unlabelled datasets.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
Towards Scaling Difference Target Propagation by Learning Backprop Targets
Authors:
Maxence Ernoult,
Fabrice Normandin,
Abhinav Moudgil,
Sean Spinney,
Eugene Belilovsky,
Irina Rish,
Blake Richards,
Yoshua Bengio
Abstract:
The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on c…
▽ More
The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on complex tasks. One such algorithm is Difference Target Propagation (DTP), a biologically-plausible learning algorithm whose close relation with Gauss-Newton (GN) optimization has been recently established. However, the conditions under which this connection rigorously holds preclude layer-wise training of the feedback pathway synaptic weights (which is more biologically plausible). Moreover, good alignment between DTP weight updates and loss gradients is only loosely guaranteed and under very specific conditions for the architecture being trained. In this paper, we propose a novel feedback weight training scheme that ensures both that DTP approximates BP and that layer-wise feedback weight training can be restored without sacrificing any theoretical guarantees. Our theory is corroborated by experimental results and we report the best performance ever achieved by DTP on CIFAR-10 and ImageNet 32$\times$32
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Authors:
Anthony GX-Chen,
Veronica Chelu,
Blake A. Richards,
Joelle Pineau
Abstract:
Estimating value functions is a core component of reinforcement learning algorithms. Temporal difference (TD) learning algorithms use bootstrap**, i.e. they update the value function toward a learning target using value estimates at subsequent time-steps. Alternatively, the value function can be updated toward a learning target constructed by separately predicting successor features (SF)--a poli…
▽ More
Estimating value functions is a core component of reinforcement learning algorithms. Temporal difference (TD) learning algorithms use bootstrap**, i.e. they update the value function toward a learning target using value estimates at subsequent time-steps. Alternatively, the value function can be updated toward a learning target constructed by separately predicting successor features (SF)--a policy-dependent model--and linearly combining them with instantaneous rewards. We focus on bootstrap** targets used when estimating value functions, and propose a new backup target, the $η$-return mixture, which implicitly combines value-predictive knowledge (used by TD methods) with (successor) feature-predictive knowledge--with a parameter $η$ capturing how much to rely on each. We illustrate that incorporating predictive knowledge through an $ηγ$-discounted SF model makes more efficient use of sampled experience, compared to either extreme, i.e. bootstrap** entirely on the value function estimate, or bootstrap** on the product of separately estimated successor features and instantaneous reward models. We empirically show this approach leads to faster policy evaluation and better control performance, for tabular and nonlinear function approximations, indicating scalability and generality.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.
-
Single crystal monolithic up-converter solar cell device tandems with integrated optics
Authors:
Georgios E. Arnaoutakis,
Elena Favilla,
Mauro Tonelli,
Bryce S. Richards
Abstract:
Solar photons possessing energy less than the band-gap of a single-junction solar cell can be utilized via the up-conversion (UC) of two or more photons, resulting in the emission of a single above-bandgap photon. Due to the non-linear nature of UC, highly concentrated light is required, which is typically much greater than the practical concentration limits of a solar cell. It has been proposed t…
▽ More
Solar photons possessing energy less than the band-gap of a single-junction solar cell can be utilized via the up-conversion (UC) of two or more photons, resulting in the emission of a single above-bandgap photon. Due to the non-linear nature of UC, highly concentrated light is required, which is typically much greater than the practical concentration limits of a solar cell. It has been proposed that concentrating up-conversion solar cells (UC-SC) with optical elements integrated into the device could help realize the high solar irradiance required. To avoid scattering problems arising from common UC materials based on micro-crystalline powders, in this work concentrators are investigated with mono-crystalline up-converters in silicon-based tandem devices. An external quantum efficiency (EQE) of 6% with 1493 nm infrared illumination at $876 W/m^2$ was obtained in upconverter device with concave integrated optics. At an irradiance higher than $90 W/m^2$ (equivalent to 2.95x in the 1450-1600 nm range), the non-concentrating UC-SC exhibited 1.5x higher EQE than the UC-SC with CPC, while below $90 W/m^2$ the CPC UC-SC exhibited 1.95x higher EQE than the non-concentrating reference device. Due to the negligible scattering of the UC layer, the distribution of localized irradiance is revealed along with its effect on the performance of devices. It is found that irradiance is accumulated within the first 1 mm of the UC layer with peaks at variable depths according to the concentrating scheme. These results suggest ample space for improved up-conversion devices by using integrated optics.
△ Less
Submitted 7 December, 2021;
originally announced December 2021.
-
New Methods and Simulations for Cosmogenic Induced Spallation Removal in Super-Kamiokande-IV
Authors:
Super-Kamiokande Collaboration,
:,
S. Locke,
A. Coffani,
K. Abe,
C. Bronner,
Y. Hayato,
M. Ikeda,
S. Imaizumi,
H. Ito,
J. Kameda,
Y. Kataoka,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
Y. Nakajima,
S. Nakayama,
T. Okada,
K. Okamoto,
A. Orii,
G. Pronost,
H. Sekiya,
M. Shiozawa,
Y. Sonoda
, et al. (196 additional authors not shown)
Abstract:
Radioactivity induced by cosmic muon spallation is a dominant source of backgrounds for $\mathcal{O}(10)~$MeV neutrino interactions in water Cherenkov detectors. In particular, it is crucial to reduce backgrounds to measure the solar neutrino spectrum and find neutrino interactions from distant supernovae. In this paper we introduce new techniques to locate muon-induced hadronic showers and effici…
▽ More
Radioactivity induced by cosmic muon spallation is a dominant source of backgrounds for $\mathcal{O}(10)~$MeV neutrino interactions in water Cherenkov detectors. In particular, it is crucial to reduce backgrounds to measure the solar neutrino spectrum and find neutrino interactions from distant supernovae. In this paper we introduce new techniques to locate muon-induced hadronic showers and efficiently reject spallation backgrounds. Applying these techniques to the solar neutrino analysis with an exposure of $2790\times22.5$~kton.day increases the signal efficiency by $12.6\%$, approximately corresponding to an additional year of detector running. Furthermore, we present the first spallation simulation at SK, where we model hadronic interactions using FLUKA. The agreement between the isotope yields and shower pattern in this simulation and in the data gives confidence in the accuracy of this simulation, and thus opens the door to use it to optimize muon spallation removal in new data with gadolinium-enhanced neutron capture detection.
△ Less
Submitted 30 November, 2021;
originally announced December 2021.
-
From Machine Learning to Robotics: Challenges and Opportunities for Embodied Intelligence
Authors:
Nicholas Roy,
Ingmar Posner,
Tim Barfoot,
Philippe Beaudoin,
Yoshua Bengio,
Jeannette Bohg,
Oliver Brock,
Isabelle Depatie,
Dieter Fox,
Dan Koditschek,
Tomas Lozano-Perez,
Vikash Mansinghka,
Christopher Pal,
Blake Richards,
Dorsa Sadigh,
Stefan Schaal,
Gaurav Sukhatme,
Denis Therien,
Marc Toussaint,
Michiel Van de Panne
Abstract:
Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extended to robotics -- or to…
▽ More
Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extended to robotics -- or to embodied intelligence more generally: systems which engage in a purposeful exchange of energy and information with a physical environment. In particular, the purview of embodied intelligent agents extends significantly beyond the typical considerations of main-stream machine learning approaches, which typically (i) do not consider operation under conditions significantly different from those encountered during training; (ii) do not consider the often substantial, long-lasting and potentially safety-critical nature of interactions during learning and deployment; (iii) do not require ready adaptation to novel tasks while at the same time (iv) effectively and efficiently curating and extending their models of the world through targeted and deliberate actions. In reality, therefore, these limitations result in learning-based systems which suffer from many of the same operational shortcomings as more traditional, engineering-based approaches when deployed on a robot outside a well defined, and often narrow operating envelope. Contrary to viewing embodied intelligence as another application domain for machine learning, here we argue that it is in fact a key driver for the advancement of machine learning technology. In this article our goal is to highlight challenges and opportunities that are specific to embodied intelligence and to propose research directions which may significantly advance the state-of-the-art in robot learning.
△ Less
Submitted 28 October, 2021;
originally announced October 2021.
-
Diffuse Supernova Neutrino Background Search at Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
M. Ikeda,
S. Imaizumi,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
S. Miki,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
S. Nakayama,
T. Okada,
K. Okamoto,
A. Orii,
G. Pronost,
H. Sekiya,
M. Shiozawa,
Y. Sonoda,
Y. Suzuki
, et al. (197 additional authors not shown)
Abstract:
A new search for the diffuse supernova neutrino background (DSNB) flux has been conducted at Super-Kamiokande (SK), with a $22.5\times2970$-kton$\cdot$day exposure from its fourth operational phase IV. The new analysis improves on the existing background reduction techniques and systematic uncertainties and takes advantage of an improved neutron tagging algorithm to lower the energy threshold comp…
▽ More
A new search for the diffuse supernova neutrino background (DSNB) flux has been conducted at Super-Kamiokande (SK), with a $22.5\times2970$-kton$\cdot$day exposure from its fourth operational phase IV. The new analysis improves on the existing background reduction techniques and systematic uncertainties and takes advantage of an improved neutron tagging algorithm to lower the energy threshold compared to the previous phases of SK. This allows for setting the world's most stringent upper limit on the extraterrestrial $\barν_e$ flux, for neutrino energies below 31.3 MeV. The SK-IV results are combined with the ones from the first three phases of SK to perform a joint analysis using $22.5\times5823$ kton$\cdot$days of data. This analysis has the world's best sensitivity to the DSNB $\barν_e$ flux, comparable to the predictions from various models. For neutrino energies larger than 17.3 MeV, the new combined $90\%$ C.L. upper limits on the DSNB $\barν_e$ flux lie around $2.7$ cm$^{-2}$$\cdot$$\text{sec}^{-1}$, strongly disfavoring the most optimistic predictions. Finally, potentialities of the gadolinium phase of SK and the future Hyper-Kamiokande experiment are discussed.
△ Less
Submitted 2 November, 2021; v1 submitted 23 September, 2021;
originally announced September 2021.
-
First Gadolinium Loading to Super-Kamiokande
Authors:
K. Abe,
C. Bronner,
Y. Hayato,
K. Hiraide,
M. Ikeda,
S. Imaizumi,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
S. Miki,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
S. Nakayama,
T. Okada,
K. Okamoto,
A. Orii,
G. Pronost,
H. Sekiya,
M. Shiozawa,
Y. Sonoda,
Y. Suzuki,
A. Takeda,
Y. Takemoto
, et al. (192 additional authors not shown)
Abstract:
In order to improve Super-Kamiokande's neutron detection efficiency and to thereby increase its sensitivity to the diffuse supernova neutrino background flux, 13 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ (gadolinium sulfate octahydrate) was dissolved into the detector's otherwise ultrapure water from July 14 to August 17, 2020, marking the start of the SK-Gd phase of operations. During the loa…
▽ More
In order to improve Super-Kamiokande's neutron detection efficiency and to thereby increase its sensitivity to the diffuse supernova neutrino background flux, 13 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ (gadolinium sulfate octahydrate) was dissolved into the detector's otherwise ultrapure water from July 14 to August 17, 2020, marking the start of the SK-Gd phase of operations. During the loading, water was continuously recirculated at a rate of 60 m$^3$/h, extracting water from the top of the detector and mixing it with concentrated $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ solution to create a 0.02% solution of the Gd compound before injecting it into the bottom of the detector. A clear boundary between the Gd-loaded and pure water was maintained through the loading, enabling monitoring of the loading itself and the spatial uniformity of the Gd concentration over the 35 days it took to reach the top of the detector. During the subsequent commissioning the recirculation rate was increased to 120 m$^3$/h, resulting in a constant and uniform distribution of Gd throughout the detector and water transparency equivalent to that of previous pure-water operation periods. Using an Am-Be neutron calibration source the mean neutron capture time was measured to be $115\pm1$ $μ$s, which corresponds to a Gd concentration of $111\pm2$ ppm, as expected for this level of Gd loading. This paper describes changes made to the water circulation system for this detector upgrade, the Gd loading procedure, detector commissioning, and the first neutron calibration measurements in SK-Gd.
△ Less
Submitted 15 December, 2021; v1 submitted 1 September, 2021;
originally announced September 2021.
-
Unclonable anti-counterfeiting labels based on microlens arrays and luminescent microparticles
Authors:
Vinay Kumar,
Stephan Dottermusch,
Ngei Katumo,
Bryce S. Richards,
Ian A. Howard
Abstract:
Micron-scale randomness during manufacturing can ensure anti-counterfeiting labels are unclonable. However, this security typically comes at the expense of complex hardware being needed for authentication (e.g., microscopy systems). We demonstrate unclonable labels that can be authenticated using a standard light-emitting diode and smartphone camera. The labels consist of a microlens array laminat…
▽ More
Micron-scale randomness during manufacturing can ensure anti-counterfeiting labels are unclonable. However, this security typically comes at the expense of complex hardware being needed for authentication (e.g., microscopy systems). We demonstrate unclonable labels that can be authenticated using a standard light-emitting diode and smartphone camera. The labels consist of a microlens array laminated to a polymer film that is doped with luminescent microparticles. The micron-scale random overlap of focal volumes and microparticles leads to a pattern of bright points of visible light emission that can be easily imaged by a smartphone camera. 10 000 comparisons of images demonstrate that the labels can be robustly authenticated, and that the probability of a false authentication is on the order of $10^{-15}$. The ability for microlens arrays to simplify the hardware needed for authentication of unclonable labels is generalizable, and attractive for the implementation of unclonable labels in anti-counterfeiting systems.
△ Less
Submitted 12 July, 2021;
originally announced July 2021.
-
Current State and Future Directions for Learning in Biological Recurrent Neural Networks: A Perspective Piece
Authors:
Luke Y. Prince,
Roy Henha Eyono,
Ellen Boven,
Arna Ghosh,
Joe Pemberton,
Franz Scherr,
Claudia Clopath,
Rui Ponte Costa,
Wolfgang Maass,
Blake A. Richards,
Cristina Savin,
Katharina Anna Wilmes
Abstract:
We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both th…
▽ More
We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both theoretical and experimental neuroscientists when designing new studies that could help bring clarity to these issues.
△ Less
Submitted 5 January, 2022; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Search for neutrinos in coincidence with gravitational wave events from the LIGO-Virgo O3a Observing Run with the Super-Kamiokande detector
Authors:
The Super-Kamiokande collaboration,
:,
K. Abe,
C. Bronner,
Y. Hayato,
M. Ikeda,
S. Imaizumi,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
S. Miki,
M. Miura,
S. Moriyama,
Y. Nagao,
M. Nakahata,
S. Nakayama,
T. Okada,
K. Okamoto,
A. Orii,
G. Pronost,
H. Sekiya,
M. Shiozawa,
Y. Sonoda,
Y. Suzuki,
A. Takeda
, et al. (189 additional authors not shown)
Abstract:
The Super-Kamiokande detector can be used to search for neutrinos in time coincidence with gravitational waves detected by the LIGO-Virgo Collaboration (LVC). Both low-energy ($7-100$ MeV) and high-energy ($0.1-10^5$ GeV) samples were analyzed in order to cover a very wide neutrino spectrum. Follow-ups of 36 (out of 39) gravitational waves reported in the GWTC-2 catalog were examined; no significa…
▽ More
The Super-Kamiokande detector can be used to search for neutrinos in time coincidence with gravitational waves detected by the LIGO-Virgo Collaboration (LVC). Both low-energy ($7-100$ MeV) and high-energy ($0.1-10^5$ GeV) samples were analyzed in order to cover a very wide neutrino spectrum. Follow-ups of 36 (out of 39) gravitational waves reported in the GWTC-2 catalog were examined; no significant excess above the background was observed, with 10 (24) observed neutrinos compared with 4.8 (25.0) expected events in the high-energy (low-energy) samples. A statistical approach was used to compute the significance of potential coincidences. For each observation, p-values were estimated using neutrino direction and LVC sky map ; the most significant event (GW190602_175927) is associated with a post-trial p-value of $7.8\%$ ($1.4σ$). Additionally, flux limits were computed independently for each sample and by combining the samples. The energy emitted as neutrinos by the identified gravitational wave sources was constrained, both for given flavors and for all-flavors assuming equipartition between the different flavors, independently for each trigger and by combining sources of the same nature.
△ Less
Submitted 13 September, 2021; v1 submitted 19 April, 2021;
originally announced April 2021.
-
Accretion onto a small black hole at the center of a neutron star
Authors:
Chloe B. Richards,
Thomas W. Baumgarte,
Stuart L. Shapiro
Abstract:
We revisit the system consisting of a neutron star that harbors a small, possibly primordial, black hole at its center, focusing on a nonspinning black hole embedded in a nonrotating neutron star. Extending earlier treatments, we provide an analytical treatment describing the rate of secular accretion of the neutron star matter onto the black hole, adopting the relativistic Bondi accretion formali…
▽ More
We revisit the system consisting of a neutron star that harbors a small, possibly primordial, black hole at its center, focusing on a nonspinning black hole embedded in a nonrotating neutron star. Extending earlier treatments, we provide an analytical treatment describing the rate of secular accretion of the neutron star matter onto the black hole, adopting the relativistic Bondi accretion formalism for stiff equations of state that we presented elsewhere. We use these accretion rates to sketch the evolution of the system analytically until the neutron star is completely consumed. We also perform numerical simulations in full general relativity for black holes with masses up to nine orders of magnitude smaller than the neutron star mass, including a simulation of the entire evolution through collapse for the largest black hole mass. We construct relativistic initial data for these simulations by generalizing the black hole puncture method to allow for the presence of matter, and evolve these data with a code that is optimally designed to resolve the vastly different length scales present in this problem. We compare our analytic and numerical results, and provide expressions for the lifetime of neutron stars harboring such endoparasitic black holes.
△ Less
Submitted 18 May, 2021; v1 submitted 18 February, 2021;
originally announced February 2021.
-
Relativistic Bondi accretion for stiff equations of state
Authors:
Chloe B. Richards,
Thomas W. Baumgarte,
Stuart L. Shapiro
Abstract:
We revisit Bondi accretion - steady-state, adiabatic, spherical gas flow onto a Schwarzschild black hole at rest in an asymptotically homogeneous medium - for stiff polytropic equations of state (EOSs) with adiabatic indices $Γ> 5/3$. A general relativistic treatment is required to determine their accretion rates, for which we provide exact expressions. We discuss several qualitative differences b…
▽ More
We revisit Bondi accretion - steady-state, adiabatic, spherical gas flow onto a Schwarzschild black hole at rest in an asymptotically homogeneous medium - for stiff polytropic equations of state (EOSs) with adiabatic indices $Γ> 5/3$. A general relativistic treatment is required to determine their accretion rates, for which we provide exact expressions. We discuss several qualitative differences between results for soft and stiff EOSs - including the appearance of a minimum steady-state accretion rate for EOSs with $Γ\geq 5/3$ - and explore limiting cases in order to examine these differences. As an example we highlight results for $Γ= 2$, which is often used in numerical simulations to model the EOS of neutron stars. We also discuss a special case with this index, the ultra-relativistic `causal' EOS, $P = ρ$. The latter serves as a useful limit for the still undetermined neutron-star EOS above nuclear density. The results are useful, for example, to estimate the accretion rate onto a mini-black hole residing at the center of a neutron star.
△ Less
Submitted 7 July, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
Supernova Model Discrimination with Hyper-Kamiokande
Authors:
Hyper-Kamiokande Collaboration,
:,
K. Abe,
P. Adrich,
H. Aihara,
R. Akutsu,
I. Alekseev,
A. Ali,
F. Ameli,
I. Anghel,
L. H. V. Anthony,
M. Antonova,
A. Araya,
Y. Asaoka,
Y. Ashida,
V. Aushev,
F. Ballester,
I. Bandac,
M. Barbi,
G. J. Barker,
G. Barr,
M. Batkiewicz-Kwasniak,
M. Bellato,
V. Berardi,
M. Bergevin
, et al. (478 additional authors not shown)
Abstract:
Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-colla…
▽ More
Core-collapse supernovae are among the most magnificent events in the observable universe. They produce many of the chemical elements necessary for life to exist and their remnants -- neutron stars and black holes -- are interesting astrophysical objects in their own right. However, despite millennia of observations and almost a century of astrophysical study, the explosion mechanism of core-collapse supernovae is not yet well understood. Hyper-Kamiokande is a next-generation neutrino detector that will be able to observe the neutrino flux from the next galactic core-collapse supernova in unprecedented detail. We focus on the first 500 ms of the neutrino burst, corresponding to the accretion phase, and use a newly-developed, high-precision supernova event generator to simulate Hyper-Kamiokande's response to five different supernova models. We show that Hyper-Kamiokande will be able to distinguish between these models with high accuracy for a supernova at a distance of up to 100 kpc. Once the next galactic supernova happens, this ability will be a powerful tool for guiding simulations towards a precise reproduction of the explosion mechanism observed in nature.
△ Less
Submitted 20 July, 2021; v1 submitted 13 January, 2021;
originally announced January 2021.