Search | arXiv e-print repository

arXiv:1906.04675 [pdf, other]

doi 10.1109/ACCESS.2021.3108545

Taxonomy of Saliency Metrics for Channel Pruning

Authors: Kaveena Persand, Andrew Anderson, David Gregg

Abstract: Pruning unimportant parameters can allow deep neural networks (DNNs) to reduce their heavy computation and memory requirements. A saliency metric estimates which parameters can be safely pruned with little impact on the classification performance of the DNN. Many saliency metrics have been proposed, each within the context of a wider pruning algorithm. The result is that it is difficult to separate the effectiveness of the saliency metric from the wider pruning algorithm that surrounds it. Similar-looking saliency metrics can yield very different results because of apparently minor design choices. We propose a taxonomy of saliency metrics based on four mostly-orthogonal principal components. We show that a broad range of metrics from the pruning literature can be grouped according to these components. Our taxonomy not only serves as a guide to prior work, but allows us to construct new saliency metrics by exploring novel combinations of our taxonomic components. We perform an in-depth experimental investigation of more than 300 saliency metrics. Our results provide decisive answers to open research questions, and demonstrate the importance of reduction and scaling when pruning groups of weights. We find that some of our constructed metrics can outperform the best existing state-of-the-art metrics for convolutional neural network channel pruning.

Submitted 4 July, 2021; v1 submitted 11 June, 2019; originally announced June 2019.

Journal ref: IEEE Access, vol. 9, pp. 120110-120126, 2021
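
The abstract of arXiv:1906.04675 above stresses reduction and scaling when a saliency score is computed for a group of weights. The following is a minimal sketch of that idea only, not the paper's taxonomy or any of its metrics: the pointwise measure (absolute weight), the reduction (sum), the scaling (division by group size), and the names `channel_saliency` and `least_salient` are all assumptions chosen for illustration.

```python
def channel_saliency(channels, scale_by_size=True):
    """Score each channel from its weights.

    channels: list of channels, each a flat list of float weights.
    The pointwise measure |w| and the sum reduction are illustrative
    assumptions, not the metrics studied in the paper.
    """
    scores = []
    for weights in channels:
        total = sum(abs(w) for w in weights)  # reduce pointwise |w| by summation
        if scale_by_size and weights:
            total /= len(weights)             # scale by the number of weights
        scores.append(total)
    return scores


def least_salient(channels, k, scale_by_size=True):
    """Return the indices of the k lowest-scoring channels (pruning candidates)."""
    scores = channel_saliency(channels, scale_by_size)
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    return order[:k]


# Two channels of different sizes: a small one with larger weights,
# and a large one with many small weights.
channels = [[0.3, 0.3], [0.1] * 8]
print(least_salient(channels, 1, scale_by_size=False))  # unscaled sum
print(least_salient(channels, 1, scale_by_size=True))   # size-scaled mean
```

In this toy case the unscaled sum selects the small channel for pruning while the size-scaled score selects the large one, which shows how an apparently minor scaling choice can change the pruning decision.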

arXiv:1905.11712 [pdf, other]

doi 10.1007/s00285-019-01441-5

Evolutionary dynamics of competing phenotype-structured populations in periodically fluctuating environments

Authors: Aleksandra Ardaševa, Robert A. Gatenby, Alexander R. A. Anderson, Helen M. Byrne, Philip K. Maini, Tommaso Lorenzi

Abstract: Living species, ranging from bacteria to animals, exist in environmental conditions that exhibit spatial and temporal heterogeneity which requires them to adapt. Risk-spreading through spontaneous phenotypic variations is a known concept in ecology, which is used to explain how species may survive when faced with the evolutionary risks associated with temporally varying environments. In order to support a deeper understanding of the adaptive role of spontaneous phenotypic variations in fluctuating environments, we consider a system of non-local partial differential equations modelling the evolutionary dynamics of two competing phenotype-structured populations in the presence of periodically oscillating nutrient levels. The two populations undergo spontaneous phenotypic variations at different rates. The phenotypic state of each individual is represented by a continuous variable, and the phenotypic landscape of the populations evolves in time due to variations in the nutrient level. Exploiting the analytical tractability of our model, we study the long-time behaviour of the solutions to obtain a detailed mathematical depiction of evolutionary dynamics. The results suggest that when nutrient levels undergo small and slow oscillations, it is evolutionarily more convenient to rarely undergo spontaneous phenotypic variations. Conversely, under relatively large and fast periodic oscillations in the nutrient levels, which bring about alternating cycles of starvation and nutrient abundance, higher rates of spontaneous phenotypic variations confer a competitive advantage. We discuss the implications of our results in the context of cancer metabolism.

Submitted 23 August, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: 33 pages, 8 figures

Journal ref: Journal of Mathematical Biology, 80, 775-807, 2020

arXiv:1905.09925 [pdf, ps, other]

doi 10.1103/PhysRevA.100.063427

Electromagnetically-induced transparency, absorption, and microwave field sensing in a Rb vapor cell with a three-color all-infrared laser system

Authors: N. Thaicharoen, K. R. Moore, D. A. Anderson, R. C. Powel, E. Peterson, G. Raithel

Abstract: A comprehensive study of three-photon electromagnetically-induced transparency (EIT) and absorption (EIA) on the rubidium cascade $5S_{1/2} \rightarrow 5P_{3/2}$ (laser wavelength 780~nm), $5P_{3/2} \rightarrow 5D_{5/2}$ (776~nm), and $5D_{5/2}\rightarrow 28F_{7/2}$ (1260~nm) is performed. The 780-nm probe and 776-nm dressing beams are counter-aligned through a Rb room-temperature vapor cell, and the 1260-nm coupler beam is co- or counter-aligned with the probe beam. Several cases of EIT and EIA, measured over a range of detunings of the 776-nm beam, are studied. The observed phenomena are modeled by numerically solving the Lindblad equation, and the results are interpreted in terms of the probe-beam absorption behavior of velocity- and detuning-dependent dressed states. To explore the utility of three-photon Rydberg EIA/EIT for microwave electric-field diagnostics, a sub-THz field generated by a signal source and a frequency quadrupler is applied to the Rb cell. The 100.633-GHz field resonantly drives the $28F_{7/2}\leftrightarrow29D_{5/2}$ transition and causes Autler-Townes splittings in the Rydberg EIA/EIT spectra, which are measured and employed to characterize the performance of the microwave quadrupler.

Submitted 23 May, 2019; originally announced May 2019.

Journal ref: Phys. Rev. A 100, 063427 (2019)

arXiv:1905.05777 [pdf, other]

doi 10.3847/1538-4357/ab4186

A Measurement of the Cosmic Microwave Background Lensing Potential and Power Spectrum from 500 deg$^2$ of SPTpol Temperature and Polarization Data

Authors: W. L. K. Wu, L. M. Mocanu, P. A. R. Ade, A. J. Anderson, J. E. Austermann, J. S. Avva, J. A. Beall, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, J. E. Carlstrom, C. L. Chang, H. C. Chiang, R. Citron, C. Corbett Moran, T. M. Crawford, A. T. Crites, T. de Haan, M. A. Dobbs, W. Everett, J. Gallicchio, E. M. George, A. Gilbert, N. Gupta, et al. (46 additional authors not shown)

Abstract: We present a measurement of the cosmic microwave background (CMB) lensing potential using 500 deg$^2$ of 150 GHz data from the SPTpol receiver on the South Pole Telescope. The lensing potential is reconstructed with signal-to-noise per mode greater than unity at lensing multipoles $L \lesssim 250$, using a quadratic estimator on a combination of CMB temperature and polarization maps. We report measurements of the lensing potential power spectrum in the multipole range of $100< L < 2000$ from sets of temperature-only, polarization-only, and minimum-variance estimators. We measure the lensing amplitude by taking the ratio of the measured spectrum to the expected spectrum from the best-fit $Λ$CDM model to the $\textit{Planck}$ 2015 TT+lowP+lensing dataset. For the minimum-variance estimator, we find $A_{\rm{MV}} = 0.944 \pm 0.058{\rm (Stat.)}\pm0.025{\rm (Sys.)}$; restricting to only polarization data, we find $A_{\rm{POL}} = 0.906 \pm 0.090 {\rm (Stat.)} \pm 0.040 {\rm (Sys.)}$. Considering statistical uncertainties alone, this is the most precise polarization-only lensing amplitude constraint to date (10.1 $σ$), and is more precise than our temperature-only constraint. We perform null tests and consistency checks and find no evidence for significant contamination.

Submitted 22 October, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

Comments: 18 pages, 8 figures; updated to match published version

Journal ref: ApJ 884 70 (2019)
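
The 10.1 $σ$ figure quoted in the abstract of arXiv:1905.05777 above is stated to consider statistical uncertainties alone. As a small arithmetic check, the sketch below divides each quoted amplitude by its statistical error. Reading the significance as this simple ratio is an assumption made here, and the function name `detection_significance` is hypothetical; the numbers are taken from the abstract.

```python
def detection_significance(amplitude, stat_error):
    """Amplitude divided by its statistical uncertainty, in units of sigma.

    Assumption: the quoted significance is this plain ratio; the paper
    itself may define it with more care.
    """
    return amplitude / stat_error


# Values quoted in the abstract (statistical errors only).
pol_sigma = detection_significance(0.906, 0.090)  # polarization-only amplitude
mv_sigma = detection_significance(0.944, 0.058)   # minimum-variance amplitude

print(f"polarization-only: {pol_sigma:.1f} sigma")
print(f"minimum-variance:  {mv_sigma:.1f} sigma")
```

Under this reading the polarization-only ratio rounds to the quoted 10.1, which suggests the assumption matches how the abstract reports it.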

arXiv:1903.09708 [pdf, other]

Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Authors: Andrew Anderson, Jonathan Dodge, Amrita Sadarangani, Zoe Juozapaitis, Evan Newman, Jed Irvine, Souti Chattopadhyay, Alan Fern, Margaret Burnett

Abstract: We present a user study to investigate the impact of explanations on non-experts' understanding of reinforcement learning (RL) agents. We investigate both a common RL visualization, saliency maps (the focus of attention), and a more recent explanation type, reward-decomposition bars (predictions of future types of rewards). We designed a 124-participant, four-treatment experiment to compare participants' mental models of an RL agent in a simple Real-Time Strategy (RTS) game. Our results show that the combination of both saliency and reward bars was needed to achieve a statistically significant improvement in mental model score over the control. In addition, our qualitative analysis of the data reveals a number of effects for further study.

Submitted 18 June, 2019; v1 submitted 22 March, 2019; originally announced March 2019.

Comments: 7 pages

arXiv:1903.03308 [pdf, other]

Thanks for Stopping By: A Study of "Thanks" Usage on Wikimedia

Authors: Swati Goel, Ashton Anderson, Leila Zia

Abstract: The Thanks feature on Wikipedia, also known as "Thanks", is a tool with which editors can quickly and easily send one another positive feedback. The aim of this project is to better understand this feature: its scope, the characteristics of a typical "Thanks" interaction, and the effects of receiving a thank on individual editors. We study the motivational impacts of "Thanks" because maintaining editor engagement is a central problem for crowdsourced repositories of knowledge such as Wikimedia. Our main findings are that most editors have not been exposed to the Thanks feature (meaning they have never given nor received a thank), thanks are typically sent upwards (from less experienced to more experienced editors), and receiving a thank is correlated with having high levels of editor engagement. Though the prevalence of "Thanks" usage varies by editor experience, the impact of receiving a thank seems mostly consistent for all users. We empirically demonstrate that receiving a thank has a strong positive effect on short-term editor activity across the board and provide preliminary evidence that thanks could compound to have long-term effects as well.

Submitted 8 March, 2019; originally announced March 2019.

arXiv:1902.09640 [pdf, other]

doi 10.1007/s10909-018-2057-2

Design and Bolometer Characterization of the SPT-3G First-year Focal Plane

Authors: W. Everett, P. A. R. Ade, Z. Ahmed, A. J. Anderson, J. E. Austermann, J. S. Avva, R. Basu Thakur, A. N. Bender, B. A. Benson, J. E. Carlstrom, F. W. Carter, T. Cecil, C. L. Chang, J. F. Cliche, A. Cukierman, E. V. Denison, T. de Haan, J. Ding, M. A. Dobbs, D. Dutcher, A. Foster, R. N. Gannon, A. Gilbert, J. C. Groh, N. W. Halverson, et al. (51 additional authors not shown)

Abstract: During the austral summer of 2016-17, the third-generation camera, SPT-3G, was installed on the South Pole Telescope, increasing the detector count in the focal plane by an order of magnitude relative to the previous generation. Designed to map the polarization of the cosmic microwave background, SPT-3G contains ten 6-in-hexagonal modules of detectors, each with 269 trichroic and dual-polarization pixels, read out using 68x frequency-domain multiplexing. Here we discuss design, assembly, and layout of the modules, as well as early performance characterization of the first-year array, including yield and detector properties.

Submitted 25 February, 2019; originally announced February 2019.

Comments: Conference proceeding for Low Temperature Detectors 2017. Accepted for publication: 27 August 2018

arXiv:1901.05049 [pdf, other]

doi 10.1145/3403572

Bonseyes AI Pipeline -- bringing AI to you. End-to-end integration of data, algorithms and deployment tools

Authors: Miguel de Prado, Jing Su, Rabia Saeed, Lorenzo Keller, Noelia Vallez, Andrew Anderson, David Gregg, Luca Benini, Tim Llewellynn, Nabil Ouerhani, Rozenn Dahyot, Nuria Pazos

Abstract: The next generation of embedded Information and Communication Technology (ICT) systems are collaborative systems able to perform autonomous tasks. The remarkable expansion of the embedded ICT market, together with the rise and breakthroughs of Artificial Intelligence (AI), have put the focus on the Edge as it stands as one of the keys for the next technological revolution: the seamless integration of AI in our daily life. However, training and deployment of custom AI solutions on embedded devices require a fine-grained integration of data, algorithms, and tools to achieve high accuracy. Such integration requires a high level of expertise that becomes a real bottleneck for small and medium enterprises wanting to deploy AI solutions on the Edge which, ultimately, slows down the adoption of AI in daily-life applications. In this work, we present a modular AI pipeline as an integrating framework to bring data, algorithms, and deployment tools together. By removing the integration barriers and lowering the required expertise, we can interconnect the different stages of tools and provide a modular end-to-end development of AI products for embedded devices. Our AI pipeline consists of four modular main steps: i) data ingestion, ii) model training, iii) deployment optimization, and iv) the IoT hub integration. To show the effectiveness of our pipeline, we provide examples of different AI applications during each of the steps. Besides, we integrate our deployment framework, LPDNN, into the AI pipeline and present its lightweight architecture and deployment capabilities for embedded devices. Finally, we demonstrate the results of the AI pipeline by showing the deployment of several AI applications such as keyword spotting, image classification and object detection on a set of well-known embedded platforms, where LPDNN consistently outperforms all other popular deployment frameworks.

Submitted 11 June, 2020; v1 submitted 15 January, 2019; originally announced January 2019.

arXiv:1811.06981 [pdf, other]

Learned Video Compression

Authors: Oren Rippel, Sanjay Nair, Carissa Lew, Steve Branson, Alexander G. Anderson, Lubomir Bourdev

Abstract: We present a new algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all existing video codecs across nearly the entire bitrate range. To our knowledge, this is the first ML-based method to do so. We evaluate our approach on standard video compression test sets of varying resolutions, and benchmark against all mainstream commercial codecs, in the low-latency mode. On standard-definition videos, relative to our algorithm, HEVC/H.265, AVC/H.264 and VP9 typically produce codes up to 60% larger. On high-definition 1080p videos, H.265 and VP9 typically produce codes up to 20% larger, and H.264 up to 35% larger. Furthermore, our approach does not suffer from blocking artifacts and pixelation, and thus produces videos that are more visually pleasing. We propose two main contributions. The first is a novel architecture for video compression, which (1) generalizes motion estimation to perform any learned compensation beyond simple translations, (2) rather than strictly relying on previously transmitted reference frames, maintains a state of arbitrary information learned by the model, and (3) enables jointly compressing all transmitted signals (such as optical flow and residual). Secondly, we present a framework for ML-based spatial rate control: namely, a mechanism for assigning variable bitrates across space for each frame. This is a critical component for video coding, which to our knowledge had not been developed within a machine learning setting.

Submitted 16 November, 2018; originally announced November 2018.

arXiv:1811.06111 [pdf, ps, other]

doi 10.1016/j.jmmm.2018.11.007

Intercalated Rare-Earth Metals under Graphene on SiC

Authors: Nathaniel A. Anderson, Myron Hupalo, David Keavney, Michael Tringides, David Vaknin

Abstract: Intercalation of rare earth metals ($RE$ = Eu, Dy, and Gd) is achieved by depositing the $RE$ metal on graphene that is grown on silicon-carbide (SiC) and by subsequent annealing at high temperatures to promote intercalation. STM images of the films reveal that the graphene layer is defect free and that each of the intercalated metals has a distinct nucleation pattern. Intercalated Eu forms nano-clusters that are situated on the vertices of a Moiré pattern, while Dy and Gd form randomly distributed nano-clusters. X-ray magnetic circular dichroism (XMCD) measurements of intercalated films reveal the magnetic properties of these $RE$'s nano-clusters. Furthermore, field dependence and temperature dependence of the magnetic moments extracted from the XMCD show paramagnetic-like behaviors with moments that are generally smaller than those predicted by the Brillouin function. XMCD measurements of $RE$-oxides compared with those of the intercalated $RE$'s under graphene after exposure to air for months indicate that the graphene membranes protect these intercalants against oxidation.

Submitted 14 November, 2018; originally announced November 2018.

Comments: 9 pages, 7 figures

arXiv:1810.10998 [pdf, other]

doi 10.3847/1538-4357/ab01ca

Mass Calibration of Optically Selected DES clusters using a Measurement of CMB-Cluster Lensing with SPTpol Data

Authors: S. Raghunathan, S. Patil, E. Baxter, B. A. Benson, L. E. Bleem, T. L. Chou, T. M. Crawford, G. P. Holder, T. McClintock, C. L. Reichardt, E. Rozo, T. N. Varga, T. M. C. Abbott, P. A. R. Ade, S. Allam, A. J. Anderson, J. Annis, J. E. Austermann, S. Avila, J. A. Beall, K. Bechtol, A. N. Bender, G. Bernstein, E. Bertin, F. Bianchini, et al. (107 additional authors not shown)

Abstract: We use cosmic microwave background (CMB) temperature maps from the 500 deg$^{2}$ SPTpol survey to measure the stacked lensing convergence of galaxy clusters from the Dark Energy Survey (DES) Year-3 redMaPPer (RM) cluster catalog. The lensing signal is extracted through a modified quadratic estimator designed to be unbiased by the thermal Sunyaev-Zel'dovich (tSZ) effect. The modified estimator uses a tSZ-free map, constructed from the SPTpol 95 and 150 GHz datasets, to estimate the background CMB gradient. For lensing reconstruction, we employ two versions of the RM catalog: a flux-limited sample containing 4003 clusters and a volume-limited sample with 1741 clusters. We detect lensing at a significance of 8.7$σ$ (6.7$σ$) with the flux (volume)-limited sample. By modeling the reconstructed convergence using the Navarro-Frenk-White profile, we find the average lensing masses to be $M_{200m}$ = ($1.62^{+0.32}_{-0.25}$ [stat.] $\pm$ 0.04 [sys.]) and ($1.28^{+0.14}_{-0.18}$ [stat.] $\pm$ 0.03 [sys.]) $\times\ 10^{14}\ M_{\odot}$ for the volume- and flux-limited samples respectively. The systematic error budget is much smaller than the statistical uncertainty and is dominated by the uncertainties in the RM cluster centroids. We use the volume-limited sample to calibrate the normalization of the mass-richness scaling relation, and find a result consistent with the galaxy weak-lensing measurements from DES (McClintock et al. 2018).

Submitted 20 February, 2019; v1 submitted 25 October, 2018; originally announced October 2018.

Comments: 19 pages, 6 figures, published in ApJ

arXiv:1810.04260 [pdf]

Inter-Scanner Harmonization of High Angular Resolution DW-MRI using Null Space Deep Learning

Authors: Vishwesh Nath, Prasanna Parvathaneni, Colin B. Hansen, Allison E. Hainline, Camilo Bermudez, Samuel Remedios, Justin A. Blaber, Kurt G. Schilling, Ilwoo Lyu, Vaibhav Janve, Yurui Gao, Iwona Stepniewska, Baxter P. Rogers, Allen T. Newton, L. Taylor Davis, Jeff Luci, Adam W. Anderson, Bennett A. Landman

Abstract: Diffusion-weighted magnetic resonance imaging (DW-MRI) allows for non-invasive imaging of the local fiber architecture of the human brain at a millimetric scale. Multiple classical approaches have been proposed to detect both single (e.g., tensors) and multiple (e.g., constrained spherical deconvolution, CSD) fiber population orientations per voxel. However, existing techniques generally exhibit low reproducibility across MRI scanners. Herein, we propose a data-driven technique using a neural network design which exploits two categories of data. First, training data were acquired on three squirrel monkey brains using ex-vivo DW-MRI and histology of the brain. Second, repeated scans of human subjects were acquired on two different scanners to augment the learning of the network proposed. To use these data, we propose a new network architecture, the null space deep network (NSDN), to simultaneously learn on traditional observed/truth pairs (e.g., MRI-histology voxels) along with repeated observations without a known truth (e.g., scan-rescan MRI). The NSDN was tested on twenty percent of the histology voxels that were kept completely blind to the network. NSDN significantly improved absolute performance relative to histology by 3.87% over CSD and 1.42% over a recently proposed deep neural network approach. Moreover, it improved reproducibility on the paired data by 21.19% over CSD and 10.09% over a recently proposed deep approach. Finally, NSDN improved generalizability of the model to a third in vivo human scanner (which was not used in training) by 16.08% over CSD and 10.41% over a recently proposed deep learning approach. This work suggests that data-driven approaches for local fiber reconstruction are more reproducible, informative and precise and offers a novel, practical method for determining these models.

Submitted 9 October, 2018; originally announced October 2018.

Comments: 10 pages, 5 figures

arXiv:1810.03611 [pdf, other]

Understanding the Origins of Bias in Word Embeddings

Authors: Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, Richard Zemel

Abstract: The power of machine learning systems not only promises great technical progress, but risks societal harm. As a recent example, researchers have shown that popular word embedding algorithms exhibit stereotypical biases, such as gender bias. The widespread use of these algorithms in machine learning systems, from automated translation services to curriculum vitae scanners, can amplify stereotypes in important contexts. Although methods have been developed to measure these biases and alter word embeddings to mitigate their biased representations, there is a lack of understanding in how word embedding bias depends on the training data. In this work, we develop a technique for understanding the origins of bias in word embeddings. Given a word embedding trained on a corpus, our method identifies how perturbing the corpus will affect the bias of the resulting embedding. This can be used to trace the origins of word embedding bias back to the original training documents. Using our method, one can investigate trends in the bias of the underlying corpus and identify subsets of documents whose removal would most reduce bias. We demonstrate our techniques on both a New York Times and Wikipedia corpus and find that our influence function-based approximations are very accurate.

Submitted 7 June, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

arXiv:1809.10572 [pdf, other]

doi 10.1109/ARITH.2019.00018

Scalar Arithmetic Multiple Data: Customizable Precision for Deep Neural Networks

Authors: Andrew Anderson, David Gregg

Abstract: Quantization of weights and activations in Deep Neural Networks (DNNs) is a powerful technique for network compression, and has enjoyed significant attention and success. However, much of the inference-time benefit of quantization is accessible only through the use of customized hardware accelerators or by providing an FPGA implementation of quantized arithmetic. Building on prior work, we show how to construct arbitrary bit-precise signed and unsigned integer operations using a software technique which logically \emph{embeds} a vector architecture with custom bit-width lanes in universally available fixed-width scalar arithmetic. We evaluate our approach on a high-end Intel Haswell processor, and an embedded ARM processor. Our approach yields very fast implementations of bit-precise custom DNN operations, which often match or exceed the performance of operations quantized to the sizes supported in native arithmetic. At the strongest level of quantization, our approach yields a maximum speedup of $\thicksim6\times$ on the Intel platform, and $\thicksim10\times$ on the ARM platform versus quantization to native 8-bit integers.

Submitted 12 December, 2019; v1 submitted 27 September, 2018; originally announced September 2018.
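
The abstract of arXiv:1809.10572 above describes embedding vector lanes of custom bit width inside ordinary fixed-width scalar arithmetic. The sketch below illustrates that general idea with the classic "SIMD within a register" masked-addition trick for unsigned lanes. It is not the paper's implementation: the 64-bit word size, the 5-bit lane width in the example, and the helper names `pack`, `unpack`, and `swar_add` are assumptions made for illustration, and Python integers are masked to emulate a fixed-width register.

```python
WORD_BITS = 64  # assumed scalar register width


def high_bit_mask(lane_bits):
    """Mask with the top bit of every complete lane set."""
    mask = 0
    for lane in range(WORD_BITS // lane_bits):
        mask |= 1 << (lane * lane_bits + lane_bits - 1)
    return mask


def pack(values, lane_bits):
    """Pack unsigned values into adjacent lanes of one word (lane 0 is lowest)."""
    assert len(values) <= WORD_BITS // lane_bits
    lane_mask = (1 << lane_bits) - 1
    word = 0
    for lane, value in enumerate(values):
        word |= (value & lane_mask) << (lane * lane_bits)
    return word


def unpack(word, lane_bits, count):
    """Extract the first `count` lanes from a packed word."""
    lane_mask = (1 << lane_bits) - 1
    return [(word >> (lane * lane_bits)) & lane_mask for lane in range(count)]


def swar_add(x, y, lane_bits):
    """Add all lanes at once, modulo 2**lane_bits, with one scalar addition.

    Clearing each lane's top bit before adding leaves room for the carry
    out of the lower bits, so no carry crosses into the next lane. The top
    bits are then restored with XOR, which is addition without carry.
    """
    high = high_bit_mask(lane_bits)
    full = (1 << WORD_BITS) - 1
    low = full & ~high
    partial = (x & low) + (y & low)
    return (partial ^ ((x ^ y) & high)) & full


a = pack([3, 31, 16, 7], lane_bits=5)
b = pack([5, 1, 16, 30], lane_bits=5)
print(unpack(swar_add(a, b, 5), 5, 4))  # lane-wise sums modulo 32
```

With 5-bit lanes, twelve values fit in one 64-bit word and are added by a single scalar addition plus a few bitwise operations, which is the kind of saving over native 8-bit lanes that the abstract refers to.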

arXiv:1809.00036 [pdf, other]

doi 10.1117/12.2312426

Year two instrument status of the SPT-3G cosmic microwave background receiver

Authors: A. N. Bender, P. A. R. Ade, Z. Ahmed, A. J. Anderson, J. S. Avva, K. Aylor, P. S. Barry, R. Basu Thakur, B. A. Benson, L. S. Bleem, S. Bocquet, K. Byrum, J. E. Carlstrom, F. W. Carter, T. W. Cecil, C. L. Chang, H. -M. Cho, J. F. Cliche, T. M. Crawford, A. Cukierman, T. de Haan, E. V. Denison, J. Ding, M. A. Dobbs, S. Dodelson, et al. (64 additional authors not shown)

Abstract: The South Pole Telescope (SPT) is a millimeter-wavelength telescope designed for high-precision measurements of the cosmic microwave background (CMB). The SPT measures both the temperature and polarization of the CMB with a large aperture, resulting in high resolution maps sensitive to signals across a wide range of angular scales on the sky. With these data, the SPT has the potential to make a broad range of cosmological measurements. These include constraining the effect of massive neutrinos on large-scale structure formation as well as cleaning galactic and cosmological foregrounds from CMB polarization data in future searches for inflationary gravitational waves. The SPT began observing in January 2017 with a new receiver (SPT-3G) containing $\sim$16,000 polarization-sensitive transition-edge sensor bolometers. Several key technology developments have enabled this large-format focal plane, including advances in detectors, readout electronics, and large millimeter-wavelength optics. We discuss the implementation of these technologies in the SPT-3G receiver as well as the challenges they presented. In late 2017 the implementations of all three of these technologies were modified to optimize total performance. Here, we present the current instrument status of the SPT-3G receiver.

Submitted 31 August, 2018; originally announced September 2018.

Comments: 21 pages, 9 Figures, Presented at SPIE Astronomical Telescopes + Instrumentation 2018

Journal ref: Proc. SPIE 10708, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy IX, 1070803 (1 August 2018)

arXiv:1809.00033 [pdf, other]

doi 10.1117/12.2312451

Characterization and performance of the second-year SPT-3G focal plane

Authors: D. Dutcher, P. A. R. Ade, Z. Ahmed, A. J. Anderson, J. S. Avva, R. Basu Thakur, A. N. Bender, B. A. Benson, J. E. Carlstrom, F. W. Carter, T. W. Cecil, C. L. Chang, J. F. Cliche, A. Cukierman, T. de Haan, J. Ding, M. A. Dobbs, W. Everett, A. Foster, J. Gallicchio, A. Gilbert, J. C. Groh, S. T. Guns, N. W. Halverson, A. H. Harke-Hosemann, et al. (46 additional authors not shown)

Abstract: The third-generation instrument for the 10-meter South Pole Telescope, SPT-3G, was first installed in January 2017. In addition to completely new cryostats, secondary telescope optics, and readout electronics, the number of detectors in the focal plane has increased by an order of magnitude from previous instruments to ~16,000. The SPT-3G focal plane consists of ten detector modules, each with an array of 269 trichroic, polarization-sensitive pixels on a six-inch silicon wafer. Within each pixel is a broadband, dual-polarization sinuous antenna; the signal from each orthogonal linear polarization is divided into three frequency bands centered at 95, 150, and 220 GHz by in-line lumped element filters and transmitted via superconducting microstrip to Ti/Au transition-edge sensor (TES) bolometers. Properties of the TES film, microstrip filters, and bolometer island must be tightly controlled to achieve optimal performance. For the second year of SPT-3G operation, we have replaced all ten wafers in the focal plane with new detector arrays tuned to increase mapping speed and improve overall performance. Here we discuss the TES superconducting transition temperature and normal resistance, detector saturation power, bandpasses, optical efficiency, and full array yield for the 2018 focal plane.

Submitted 31 August, 2018; originally announced September 2018.

Comments: 13 pages, 11 figures

Journal ref: Proc. SPIE Volume 10708, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy IX; 107081Z (2018)

arXiv:1809.00032 [pdf, other]

doi 10.1117/12.2314366

Design and characterization of the SPT-3G receiver

Authors: J. A. Sobrin, P. A. R. Ade, Z. Ahmed, A. J. Anderson, J. S. Avva, R. Basu Thakur, A. N. Bender, B. A. Benson, J. E. Carlstrom, F. W. Carter, T. W. Cecil, C. L. Chang, J. F. Cliche, A. Cukierman, T. de Haan, J. Ding, M. A. Dobbs, D. Dutcher, W. Everett, A. Foster, J. Gallichio, A. Gilbert, J. C. Groh, S. T. Guns, N. W. Halverson, et al. (46 additional authors not shown)

Abstract: The SPT-3G receiver was commissioned in early 2017 on the 10-meter South Pole Telescope (SPT) to map anisotropies in the cosmic microwave background (CMB). New optics, detector, and readout technologies have yielded a multichroic, high-resolution, low-noise camera with impressive throughput and sensitivity, offering the potential to improve our understanding of inflationary physics, astroparticle physics, and growth of structure. We highlight several key features and design principles of the new receiver, and summarize its performance to date.

Submitted 31 August, 2018; originally announced September 2018.

Comments: Conference Presentation at SPIE Astronomical Telescopes + Instrumentation 2018, conference 10708

Journal ref: Proc. SPIE 10708, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy IX, 107081H (24 August 2018)

arXiv:1809.00030 [pdf, other]

doi 10.1117/12.2315674

Broadband anti-reflective coatings for cosmic microwave background experiments

Authors: A. Nadolski, A. M. Kofman, J. D. Vieira, P. A. R. Ade, Z. Ahmed, A. J. Anderson, J. S. Avva, R. Basu Thakur, A. N. Bender, B. A. Benson, J. E. Carlstrom, F. W. Carter, T. W. Cecil, C. L. Chang, J. F. Cliche, A. Cukierman, T. de Haan, J. Ding, M. A. Dobbs, D. Dutcher, W. Everett, A. Foster, J. Fu, J. Gallicchio, A. Gilbert, et al. (49 additional authors not shown)

Abstract: The desire for higher sensitivity has driven ground-based cosmic microwave background (CMB) experiments to employ ever larger focal planes, which in turn require larger reimaging optics. Practical limits to the maximum size of these optics motivate the development of quasi-optically-coupled (lenslet-coupled), multi-chroic detectors. These detectors can be sensitive across a broader bandwidth compared to waveguide-coupled detectors. However, the increase in bandwidth comes at a cost: the lenses (up to $\sim$700 mm diameter) and lenslets ($\sim$5 mm diameter, hemispherical lenses on the focal plane) used in these systems are made from high-refractive-index materials (such as silicon or amorphous aluminum oxide) that reflect nearly a third of the incident radiation. In order to maximize the faint CMB signal that reaches the detectors, the lenses and lenslets must be coated with an anti-reflective (AR) material. The AR coating must maximize radiation transmission in scientifically interesting bands and be cryogenically stable. Such a coating was developed for the third generation camera, SPT-3G, of the South Pole Telescope (SPT) experiment, but the materials and techniques used in the development are general to AR coatings for mm-wave optics. The three-layer polytetrafluoroethylene-based AR coating is broadband, inexpensive, and can be manufactured with simple tools. The coating is field tested; AR coated focal plane elements were deployed in the 2016-2017 austral summer and AR coated reimaging optics were deployed in 2017-2018.

Submitted 31 August, 2018; originally announced September 2018.

Comments: 13 pages, 5 figures

Journal ref: Proceedings of SPIE Volume 10708, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy IX; 1070843 (2018)

arXiv:1808.08589 [pdf, other]

doi 10.1109/TAP.2020.2987112

An atomic receiver for AM and FM radio communication

Authors: David A. Anderson, Rachel E. Sapiro, Georg Raithel

Abstract: Radio reception relies on antennas for the collection of electromagnetic fields carrying information, and receiver elements for demodulation and retrieval of the transmitted information. Here we demonstrate an atom-based receiver for AM and FM microwave communication with a 3-dB bandwidth in the baseband of $\sim$100~kHz that provides optical circuit-free field pickup, multi-band carrier capability, and inherently high field sensitivity. The quantum receiver exploits field-sensitive cesium Rydberg vapors in a centimeter-sized glass cell, and quantum-optical readout of baseband signals modulated onto carriers with frequencies ranging over four octaves, from C-band to Q-band. Receiver bandwidth, dynamic range and sideband suppression are characterized, and acquisition of audio waveforms of human vocals demonstrated. The atomic radio receiver is a valuable receiver technology because it does not require antenna structures and is resilient against electromagnetic interference, while affording multi-band operation in a single compact receiving element.

Submitted 26 August, 2018; originally announced August 2018.

arXiv:1805.00412 [pdf, other]

doi 10.1063/1.5038550

A hybrid polarization-selective atomic sensor for radio-frequency field detection with a passive resonant-cavity field amplifier

Authors: David A. Anderson, Eric G. Paradis, Georg Raithel

Abstract: We present a hybrid atomic sensor that realizes radio-frequency electric field detection with intrinsic field amplification and polarization selectivity for robust high-sensitivity field measurement. The hybrid sensor incorporates a passive resonator element integrated with an atomic vapor cell that provides amplification and polarization selectivity for detection of incident radio-frequency fields. The amplified intra-cavity radio-frequency field is measured by atoms using a quantum-optical readout of AC level shifts of field-sensitive atomic Rydberg states. In our experimental demonstration, we employ a split field-enhancement resonator embedded in a rubidium vapor cell to amplify and detect C-band radio-frequency fields. We observe a field amplification equivalent to a 24 dB gain in intensity sensitivity. The spatial profile of the resonant field mode inside the field-enhancement cavity is characterized. The resonant field modes only couple with a well-defined polarization component of the incident field, allowing us to measure the polarization of the incident field in a robust fashion. Measured field enhancement factors, polarization-selectivity performance, and field distributions for the hybrid sensor are in good agreement with simulations. Applications of hybrid atomic sensors in ultra-weak radio-frequency detection and advanced measurement capabilities are discussed.

Submitted 1 May, 2018; originally announced May 2018.

arXiv:1804.09789 [pdf, other]

doi 10.1109/GSMM.2018.8439437

High-resolution antenna near-field imaging and sub-THz measurements with a small atomic vapor-cell sensing element

Authors: David A. Anderson, Eric Paradis, Georg Raithel, Rachel E. Sapiro, Christopher L. Holloway

Abstract: Atomic sensing and measurement of millimeter-wave (mmW) and THz electric fields using quantum-optical EIT spectroscopy of Rydberg states in atomic vapors has garnered significant interest in recent years towards the development of atomic electric-field standards and sensor technologies. Here we describe recent work employing small atomic vapor cell sensing elements for near-field imaging of the radiation pattern of a K$_u$-band horn antenna at 13.49 GHz. We image fields at a spatial resolution of $λ/10$ and measure over a 72 to 240 V/m field range using off-resonance AC-Stark shifts of a Rydberg resonance. The same atomic sensing element is used to measure sub-THz electric fields at 255 GHz, an increase in mmW-frequency by more than one order of magnitude. The sub-THz field is measured over a continuous $\pm$100 MHz frequency band using a near-resonant mmW atomic transition.

Submitted 25 April, 2018; originally announced April 2018.

arXiv:1804.02318 [pdf, other]

How Constraints Affect Content: The Case of Twitter's Switch from 140 to 280 Characters

Authors: Kristina Gligorić, Ashton Anderson, Robert West

Abstract: It is often said that constraints affect creative production, both in terms of form and quality. Online social media platforms frequently impose constraints on the content that users can produce, limiting the range of possible contributions. Do these restrictions tend to push creators towards producing more or less successful content? How do creators adapt their contributions to fit the limits imposed by social media platforms? To answer these questions, we conduct an observational study of a recent event: on November 7, 2017, Twitter changed the maximum allowable length of a tweet from 140 to 280 characters, thereby significantly altering its signature constraint. In the first study of this switch, we compare tweets with nearly or exactly 140 characters before the change to tweets of the same length posted after the change. This setup enables us to characterize how users alter their tweets to fit the constraint and how this affects their tweets' success. We find that in response to a length constraint, users write more tersely, use more abbreviations and contracted forms, and use fewer definite articles. Also, although in general tweet success increases with length, we find initial evidence that tweets made to fit the 140-character constraint tend to be more successful than similar-length tweets written when the constraint was removed, suggesting that the length constraint improved tweet quality.

Submitted 10 April, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

Comments: To appear in the Proceedings of AAAI ICWSM 2018

arXiv:1803.10986 [pdf, other]

Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

Authors: Barbara Barabasz, Andrew Anderson, Kirk M. Soodhalter, David Gregg

Abstract: Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is present in many DNN software frameworks. However, the performance gain is at the expense of a reduction in floating point (FP) numerical accuracy. In this paper, we analyse the worst case FP error and prove the estimation of norm and conditioning of the algorithm. We show that the bound grows exponentially with the size of the convolution, but the error bound of the \textit{modified} algorithm is smaller than the original one. We propose several methods for reducing FP error. We propose a canonical evaluation ordering based on Huffman coding that reduces summation error. We study the selection of sampling "points" experimentally and find empirically good points for the most important sizes. We identify the main factors associated with good points. In addition, we explore other methods to reduce FP error, including mixed-precision convolution, and pairwise summation across DNN channels. Using our methods we can significantly reduce FP error for a given block size, which allows larger block sizes and reduced computation.

Submitted 1 May, 2019; v1 submitted 29 March, 2018; originally announced March 2018.

arXiv:1803.02903 [pdf, other]

doi 10.1016/j.nima.2018.07.028

Nuclear-recoil energy scale in CDMS II silicon dark-matter detectors

Authors: R. Agnese, A. J. Anderson, T. Aramaki, W. Baker, D. Balakishiyeva, S. Banik, D. Barker, R. Basu Thakur, D. A. Bauer, T. Binder, A. Borgland, M. A. Bowles, P. L. Brink, R. Bunker, B. Cabrera, D. O. Caldwell, R. Calkins, C. Cartaro, D. G. Cerdeno, H. Chagani, Y. -Y. Chang, Y. Chen, J. Cooley, B. Cornell, P. Cushman, et al. (84 additional authors not shown)

Abstract: The Cryogenic Dark Matter Search (CDMS II) experiment aims to detect dark matter particles that elastically scatter from nuclei in semiconductor detectors. The resulting nuclear-recoil energy depositions are detected by ionization and phonon sensors. Neutrons produce a similar spectrum of low-energy nuclear recoils in such detectors, while most other backgrounds produce electron recoils. The absolute energy scale for nuclear recoils is necessary to interpret results correctly. The energy scale can be determined in CDMS II silicon detectors using neutrons incident from a broad-spectrum $^{252}$Cf source, taking advantage of a prominent resonance in the neutron elastic scattering cross section of silicon at a recoil (neutron) energy near 20 (182) keV. Results indicate that the phonon collection efficiency for nuclear recoils is $4.8^{+0.7}_{-0.9}$% lower than for electron recoils of the same energy. Comparisons of the ionization signals for nuclear recoils to those measured previously by other groups at higher electric fields indicate that the ionization collection efficiency for CDMS II silicon detectors operated at $\sim$4 V/cm is consistent with 100% for nuclear recoils below 20 keV and gradually decreases for larger energies to $\sim$75% at 100 keV. The impact of these measurements on previously published CDMS II silicon results is small.

Submitted 27 July, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

Comments: 22 pages, 17 figures, 1 table, 1 appendix

arXiv:1802.09597 [pdf, other]

doi 10.1145/3178876.3186129

Mapping the Invocation Structure of Online Political Interaction

Authors: Manish Raghavan, Ashton Anderson, Jon Kleinberg

Abstract: The surge in political information, discourse, and interaction has been one of the most important developments in social media over the past several years. There is rich structure in the interaction among different viewpoints on the ideological spectrum. However, we still have only a limited analytical vocabulary for expressing the ways in which these viewpoints interact. In this paper, we develop network-based methods that operate on the ways in which users share content; we construct \emph{invocation graphs} on Web domains showing the extent to which pages from one domain are invoked by users to reply to posts containing pages from other domains. When we locate the domains on a political spectrum induced from the data, we obtain an embedded graph showing how these interaction links span different distances on the spectrum. The structure of this embedded network, and its evolution over time, helps us derive macro-level insights about how political interaction unfolded through 2016, leading up to the US Presidential election. In particular, we find that the domains invoked in replies spanned increasing distances on the spectrum over the months approaching the election, and that there was clear asymmetry between the left-to-right and right-to-left patterns of linkage.

Submitted 26 February, 2018; originally announced February 2018.

Comments: The Web Conference 2018 (WWW 2018)

arXiv:1712.08719 [pdf, other]

doi 10.1063/1.4996234

Continuous-frequency measurements of high-intensity microwave electric fields with atomic vapor cells

Authors: David A. Anderson, Georg Raithel

Abstract: We demonstrate continuous-frequency electric field measurements of high-intensity microwaves via optical spectroscopy in a small atomic vapor cell. The spectroscopic response of a room-temperature rubidium atomic vapor in a glass cell is investigated and employed for absolute measurements of K$_a$-band microwave electric fields from $\sim$200 V/m to $>$1 kV/m over a continuous frequency range of $\pm $1 GHz (15% band coverage). It is established that in strong microwave fields frequency-specific spectral features allow for electric field measurements over a large continuous frequency range.

Submitted 23 December, 2017; originally announced December 2017.

Journal ref: Applied Physics Letters, Vol. 111, No. 5 (2017)

arXiv:1712.08717 [pdf, other]

Quantum-optical spectroscopy for plasma electric field measurements and diagnostics

Authors: David A. Anderson, Georg Raithel, Matthew Simons, Christopher L. Holloway

Abstract: Measurements of plasma electric fields are essential to the advancement of plasma science and applications. Methods for non-invasive in situ measurements of plasma fields on sub-millimeter length scales with high sensitivity over a large field range remain an outstanding challenge. Here, we introduce and demonstrate a new method for plasma electric field measurement that employs electromagnetically induced transparency as a high-resolution quantum-optical probe for the Stark energy level shifts of plasma-embedded Rydberg atoms, which serve as highly-sensitive field sensors with a large dynamic range. The method is applied in diagnostics of plasmas photo-excited out of a cesium vapor. The plasma electric fields are extracted from spatially-resolved measurements of field-induced shape changes and shifts of Rydberg resonances in rubidium tracer atoms. Measurement capabilities over a range of plasma densities and temperatures are exploited to characterize plasmas in applied magnetic fields and to image electric-field distributions in cyclotron-heated plasmas.

Submitted 23 December, 2017; originally announced December 2017.

arXiv:1712.08671 [pdf, other]

doi 10.1063/1.5020173

Electromagnetically Induced Transparency (EIT) and Autler-Townes (AT) splitting in the Presence of Band-Limited White Gaussian Noise

Authors: Christopher L. Holloway, Matthew T. Simons, Marcus D. Kautz, David A. Anderson, Georg Raithel, Daniel Stack, Marc C. St. John, Wansheng Su

Abstract: We investigate the effect of band-limited white Gaussian noise (BLWGN) on electromagnetically induced transparency (EIT) and Autler-Townes (AT) splitting, when performing atom-based continuous-wave (CW) radio-frequency (RF) electric (E) field strength measurements with Rydberg atoms in an atomic vapor. This EIT/AT-based E-field measurement approach is currently being investigated by several groups around the world as a means to develop a new SI traceable RF E-field measurement technique. For this to be a useful technique, it is important to understand the influence of BLWGN. We perform EIT/AT based E-field experiments with BLWGN centered on the RF transition frequency and for the BLWGN blue-shifted and red-shifted relative to the RF transition frequency. The EIT signal can be severely distorted for certain noise conditions (band-width, center-frequency, and noise power), hence altering the ability to accurately measure a CW RF E-field strength. We present a model to predict the changes in the EIT signal in the presence of noise. This model includes AC Stark shifts and on resonance transitions associated with the noise source. The results of this model are compared to the experimental data and we find very good agreement between the two.

Submitted 22 December, 2017; originally announced December 2017.

Comments: 14 pages, 15 figures, 1 table

arXiv:1712.04431 [pdf, ps, other]

The Fraïssé limit of matrix algebras with the rank metric

Authors: Aaron Anderson

Abstract: We realize the $\mathbb{F}_q$-algebra $M(\mathbb{F}_q)$ studied by von Neumann and Halperin as the Fraïssé limit of the class of finite-dimensional matrix algebras over a finite field $\mathbb{F}_q$ equipped with the rank metric. We then provide a new Fraïssé-theoretic proof of uniqueness of such an object. Using the results of Carderi and Thom, we show that the automorphism group of $\mathrm{Aut}(\mathbb{F}_q )$ is extremely amenable. We deduce a Ramsey-theoretic property for the class of algebras $M(\mathbb{F}_q)$, and provide an explicit bound for the quantities involved.

Submitted 21 May, 2021; v1 submitted 12 December, 2017; originally announced December 2017.

Comments: 11 pages

MSC Class: 16E50; 03C30 (Primary) 03C98 (Secondary)

arXiv:1711.08019 [pdf, other]

Toward Foraging for Understanding of StarCraft Agents: An Empirical Study

Authors: Sean Penney, Jonathan Dodge, Claudia Hilderbrand, Andrew Anderson, Logan Simpson, Margaret Burnett

Abstract: Assessing and understanding intelligent agents is a difficult task for users that lack an AI background. A relatively new area, called "Explainable AI," is emerging to help address this problem, but little is known about how users would forage through information an explanation system might offer. To inform the development of Explainable AI systems, we conducted a formative study, using the lens of Information Foraging Theory, into how experienced users foraged in the domain of StarCraft to assess an agent. Our results showed that participants faced difficult foraging problems. These foraging problems caused participants to entirely miss events that were important to them, reluctantly choose to ignore actions they did not want to ignore, and bear high cognitive, navigation, and information costs to access the information they needed.

Submitted 26 December, 2017; v1 submitted 21 November, 2017; originally announced November 2017.

Comments: 13 pages, 10 figures, to appear in ACM IUI 2018

arXiv:1711.06953 [pdf, other]

How the Experts Do It: Assessing and Explaining Agent Behaviors in Real-Time Strategy Games

Authors: Jonathan Dodge, Sean Penney, Claudia Hilderbrand, Andrew Anderson, Margaret Burnett

Abstract: How should an AI-based explanation system explain an agent's complex behavior to ordinary end users who have no background in AI? Answering this question is an active research area, for if an AI-based explanation system could effectively explain intelligent agents' behavior, it could enable the end users to understand, assess, and appropriately trust (or distrust) the agents attempting to help them. To provide insights into this question, we turned to human expert explainers in the real-time strategy domain, "shoutcaster", to understand (1) how they foraged in an evolving strategy game in real time, (2) how they assessed the players' behaviors, and (3) how they constructed pertinent and timely explanations out of their insights and delivered them to their audience. The results provided insights into shoutcasters' foraging strategies for gleaning information necessary to assess and explain the players; a characterization of the types of implicit questions shoutcasters answered; and implications for creating explanations by using the patterns

Submitted 18 November, 2017; originally announced November 2017.

Comments: 12 pages, 11 figures, submitted to CHI 2017

arXiv:1710.07396 [pdf, other]

doi 10.1103/PhysRevMaterials.1.054005

Intercalated europium metal in epitaxial graphene on SiC

Authors: Nathaniel A. Anderson, Myron Hupalo, David Keavney, Michael C. Tringides, David Vaknin

Abstract: X-ray magnetic circular dichroism (XMCD) reveals the magnetic properties of intercalated europium metal under graphene on SiC(0001). Intercalation of Eu nano-clusters (average size 2.5 nm) between graphene and SiC substrate are formed by deposition of Eu on epitaxially grown graphene that is subsequently annealed at various temperatures while keeping the integrity of the graphene layer. Using sum-rules analysis of the XMCD of Eu M$_{4,5}$ edges at $T = 15$ K, our samples show paramagnetic-like behavior with a distinct anomaly at T $\approx$ 90 K which may be related to the Néel transition, T$_N$ = 91 K, of bulk metal Eu. We find no evidence of ferromagnetism due to EuO or antiferromagnetism due to Eu$_2$O$_3$ indicating that the graphene layer protects the intercalated metallic Eu against oxidation over months of exposure to atmospheric environment.

Submitted 19 October, 2017; originally announced October 2017.

Comments: 6 pages, 5 figures

Journal ref: Phys. Rev. Materials 1, 054005 (2017)

arXiv:1710.01079 [pdf, other]

Optimal DNN Primitive Selection with Partitioned Boolean Quadratic Programming

Authors: Andrew Anderson, David Gregg

Abstract: Deep Neural Networks (DNNs) require very large amounts of computation both for training and for inference when deployed in the field. Many different algorithms have been proposed to implement the most computationally expensive layers of DNNs. Further, each of these algorithms has a large number of variants, which offer different trade-offs of parallelism, data locality, memory footprint, and execution time. In addition, specific algorithms operate much more efficiently on specialized data layouts and formats. We state the problem of optimal primitive selection in the presence of data format transformations, and show that it is NP-hard by demonstrating an embedding in the Partitioned Boolean Quadratic Assignment problem (PBQP). We propose an analytic solution via a PBQP solver, and evaluate our approach experimentally by optimizing several popular DNNs using a library of more than 70 DNN primitives, on an embedded platform and a general purpose platform. We show experimentally that significant gains are possible versus the state of the art vendor libraries by using a principled analytic solution to the problem of layout selection in the presence of data format transformations.

Submitted 2 November, 2018; v1 submitted 3 October, 2017; originally announced October 2017.

arXiv:1710.00802 [pdf, other]

doi 10.1088/1748-0221/13/02/P02004

Unfolding Neutron Spectrum with Markov Chain Monte Carlo at MIT Research Reactor with He-3 Neutral Current Detectors

Authors: A. F. Leder, A. J. Anderson, J. Billard, E. Figueroa-Feliciano, J. A. Formaggio, C. Hasselkus, E. Newman, K. Palladino, M. Phuthi, L. Winslow, L. Zhang

Abstract: The Ricochet experiment seeks to measure Coherent (neutral-current) Elastic Neutrino-Nucleus Scattering using dark-matter-style detectors with sub-keV thresholds placed near a neutrino source, such as the MIT (research) Reactor (MITR), which operates at 5.5 MW generating approximately 2.2e18 neutrinos/second in its core. Currently, Ricochet is characterizing the backgrounds at MITR, the main component of which comes in the form of neutrons emitted from the core simultaneous with the neutrino signal. To characterize this background, we wrapped Bonner cylinders around a He-3 thermal neutron detector, whose data was then unfolded via a Markov Chain Monte Carlo (MCMC) to produce a neutron energy spectrum across several orders of magnitude. We discuss the resulting spectrum and its implications for deploying Ricochet at the MITR site as well as the feasibility of reducing this background level via the addition of polyethylene shielding around the detector setup.

Submitted 7 February, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

Comments: 14 pages, 9 figures, 1 table, published in JINST

Journal ref: JINST 13 (2018) no.02, P02004

arXiv:1709.03395 [pdf, other]

Low-memory GEMM-based convolution algorithms for deep neural networks

Authors: Andrew Anderson, Aravind Vasudevan, Cormac Keane, David Gregg

Abstract: Deep neural networks (DNNs) require very large amounts of computation both for training and for inference when deployed in the field. A common approach to implementing DNNs is to recast the most computationally expensive operations as general matrix multiplication (GEMM). However, as we demonstrate in this paper, there are a great many different ways to express DNN convolution operations using GEMM. Although different approaches all perform the same number of operations, the size of temporary data structures differs significantly. Convolution of an input matrix with dimensions $C \times H \times W$ requires $O(K^2CHW)$ additional space using the classical im2col approach. More recently, memory-efficient approaches requiring just $O(KCHW)$ auxiliary space have been proposed. We present two novel GEMM-based algorithms that require just $O(MHW)$ and $O(KW)$ additional space respectively, where $M$ is the number of channels in the result of the convolution. These algorithms dramatically reduce the space overhead of DNN convolution, making it much more suitable for memory-limited embedded systems. Experimental evaluation shows that our low-memory algorithms are just as fast as the best patch-building approaches despite requiring just a fraction of the amount of additional memory. Our low-memory algorithms have excellent data locality which gives them a further edge over patch-building algorithms when multiple cores are used. As a result, our low memory algorithms often outperform the best patch-building algorithms using multiple threads.

Submitted 8 September, 2017; originally announced September 2017.

Comments: 13 pages, 16 figures and 3 tables. arXiv admin note: text overlap with arXiv:1704.04428

arXiv:1707.09353 [pdf, other]

doi 10.3847/1538-4357/aa9ff4

Measurements of the Temperature and E-Mode Polarization of the CMB from 500 Square Degrees of SPTpol Data

Authors: J. W. Henning, J. T. Sayre, C. L. Reichardt, P. A. R. Ade, A. J. Anderson, J. E. Austermann, J. A. Beall, A. N. Bender, B. A. Benson, L. E. Bleem, J. E. Carlstrom, C. L. Chang, H. C. Chiang, H-M. Cho, R. Citron, C. Corbett Moran, T. M. Crawford, A. T. Crites, T. de Haan, M. A. Dobbs, W. Everett, J. Gallicchio, E. M. George, A. Gilbert, N. W. Halverson, et al. (42 additional authors not shown)

Abstract: We present measurements of the $E$-mode polarization angular auto-power spectrum ($EE$) and temperature-$E$-mode cross-power spectrum ($TE$) of the cosmic microwave background (CMB) using 150 GHz data from three seasons of SPTpol observations. We report the power spectra over the spherical harmonic multipole range $50 < \ell \leq 8000$, and detect nine acoustic peaks in the $EE$ spectrum with high signal-to-noise ratio. These measurements are the most sensitive to date of the $EE$ and $TE$ power spectra at $\ell > 1050$ and $\ell > 1475$, respectively. The observations cover 500 deg$^2$, a fivefold increase in area compared to previous SPTpol analyses, which increases our sensitivity to the photon diffusion damping tail of the CMB power spectra enabling tighter constraints on $Λ$CDM model extensions. After masking all sources with unpolarized flux $>50$ mJy we place a 95% confidence upper limit on residual polarized point-source power of $D_\ell = \ell(\ell+1)C_\ell/2π<0.107\,μ{\rm K}^2$ at $\ell=3000$, suggesting that the $EE$ damping tail dominates foregrounds to at least $\ell = 4050$ with modest source masking. We find that the SPTpol dataset is in mild tension with the $Λ$CDM model ($2.1\,σ$), and different data splits prefer parameter values that differ at the $\sim 1\,σ$ level. When fitting SPTpol data at $\ell < 1000$ we find cosmological parameter constraints consistent with those for $Planck$ temperature. Including SPTpol data at $\ell > 1000$ results in a preference for a higher value of the expansion rate ($H_0 = 71.3 \pm 2.1\,\mbox{km}\,s^{-1}\mbox{Mpc}^{-1}$) and a lower value for present-day density fluctuations ($σ_8 = 0.77 \pm 0.02$).

Submitted 11 April, 2018; v1 submitted 28 July, 2017; originally announced July 2017.

Comments: Updated to match version accepted to ApJ. 34 pages, 17 figures, 6 tables

Journal ref: The Astrophysical Journal, Volume 852, Issue 2, article id. 97, 31 pp. (2018)

arXiv:1707.01632 [pdf, other]

doi 10.1103/PhysRevD.97.022002

Low-Mass Dark Matter Search with CDMSlite

Authors: SuperCDMS Collaboration, R. Agnese, A. J. Anderson, T. Aralis, T. Aramaki, I. J. Arnquist, W. Baker, D. Balakishiyeva, D. Barker, R. Basu Thakur, D. A. Bauer, T. Binder, M. A. Bowles, P. L. Brink, R. Bunker, B. Cabrera, D. O. Caldwell, R. Calkins, C. Cartaro, D. G. Cerdeno, Y. Chang, H. Chagani, Y. Chen, J. Cooley, B. Cornell, et al. (83 additional authors not shown)

Abstract: The SuperCDMS experiment is designed to directly detect weakly interacting massive particles (WIMPs) that may constitute the dark matter in our Galaxy. During its operation at the Soudan Underground Laboratory, germanium detectors were run in the CDMSlite mode to gather data sets with sensitivity specifically for WIMPs with masses ${<}$10 GeV/$c^2$. In this mode, a higher detector-bias voltage is applied to amplify the phonon signals produced by drifting charges. This paper presents studies of the experimental noise and its effect on the achievable energy threshold, which is demonstrated to be as low as 56 eV$_{\text{ee}}$ (electron equivalent energy). The detector-biasing configuration is described in detail, with analysis corrections for voltage variations to the level of a few percent. Detailed studies of the electric-field geometry, and the resulting successful development of a fiducial parameter, eliminate poorly measured events, yielding an energy resolution ranging from ${\sim}$9 eV$_{\text{ee}}$ at 0 keV to 101 eV$_{\text{ee}}$ at ${\sim}$10 keV$_{\text{ee}}$. New results are derived for astrophysical uncertainties relevant to the WIMP-search limits, specifically examining how they are affected by variations in the most probable WIMP velocity and the Galactic escape velocity. These variations become more important for WIMP masses below 10 GeV/$c^2$. Finally, new limits on spin-dependent low-mass WIMP-nucleon interactions are derived, with new parameter space excluded for WIMP masses $\lesssim$3 GeV/$c^2$.

Submitted 18 January, 2018; v1 submitted 6 July, 2017; originally announced July 2017.

Comments: 30 pages, 33 figures. v3 to match published version in PRD. v2 contains public release (and documentation) of the CDMSlite Run 2 data as ancillary files

Journal ref: Phys. Rev. D 97, 022002 (2018)

arXiv:1705.10689 [pdf, other]

doi 10.1145/3041021.3054197

Auditing Search Engines for Differential Satisfaction Across Demographics

Authors: Rishabh Mehrotra, Ashton Anderson, Fernando Diaz, Amit Sharma, Hanna Wallach, Emine Yilmaz

Abstract: Many online services, such as search engines, social media platforms, and digital marketplaces, are advertised as being available to any user, regardless of their age, gender, or other demographic factors. However, there are growing concerns that these services may systematically underserve some groups of users. In this paper, we present a framework for internally auditing such services for differences in user satisfaction across demographic groups, using search engines as a case study. We first explain the pitfalls of naïvely comparing the behavioral metrics that are commonly used to evaluate search engines. We then propose three methods for measuring latent differences in user satisfaction from observed differences in evaluation metrics. To develop these methods, we drew on ideas from the causal inference literature and the multilevel modeling literature. Our framework is broadly applicable to other online services, and provides general insight into interpreting their evaluation metrics.

Submitted 24 May, 2017; originally announced May 2017.

Comments: 8 pages. Accepted at WWW 2017

arXiv:1705.07199 [pdf, other]

The High-Dimensional Geometry of Binary Neural Networks

Authors: Alexander G. Anderson, Cory P. Berg

Abstract: Recent research has shown that one can train a neural network with binary weights and activations at train time by augmenting the weights with a high-precision continuous latent variable that accumulates small changes from stochastic gradient descent. However, there is a dearth of theoretical analysis to explain why we can effectively capture the features in our data with binary weights and activations. Our main result is that the neural networks with binary weights and activations trained using the method of Courbariaux, Hubara et al. (2016) work because of the high-dimensional geometry of binary vectors. In particular, the ideal continuous vectors that extract out features in the intermediate representations of these BNNs are well-approximated by binary vectors in the sense that dot products are approximately preserved. Compared to previous research that demonstrated the viability of such BNNs, our work explains why these BNNs work in terms of the HD geometry. Our theory serves as a foundation for understanding not only BNNs but a variety of methods that seek to compress traditional neural networks. Furthermore, a better understanding of multilayer binary neural networks serves as a starting point for generalizing BNNs to other neural network architectures such as recurrent neural networks.

Submitted 19 May, 2017; originally announced May 2017.

Comments: 12 pages, 4 Figures

arXiv:1705.01618 [pdf, ps, other]

doi 10.1016/j.jmmm.2017.04.007

Magnetic properties of Dy nano-islands on graphene

Authors: Nathaniel A. Anderson, Qiang Zhang, Myron Hupalo, Richard A. Rosenberg, John W. Freeland, Michael C. Tringides, David Vaknin

Abstract: We have determined the magnetic properties of epitaxially grown Dy islands on graphene/SiC(0001) that are passivated by a gold film (deposited in the ultra-high vacuum growth chamber) for {\it ex-situ} X-ray magnetic circular dichroism (XMCD). Our sum-rule analysis of the Dy $M_{4,5}$ XMCD spectra at low temperatures ($T=15$ K) as a function of magnetic field, assuming Dy$^{3+}$ (spin configuration $^6H_{15/2}$), indicates that the projection of the magnetic moment along an applied magnetic field of 5 T is 3.5(3) $μ_B$. Temperature dependence of the magnetic moment (extracted from the $M_5$ XMCD spectra) shows an onset of a change in magnetic moment at about 175 K, in proximity to the transition from paramagnetic to helical magnetic structure at $T_{\rm H} =179$ K in bulk Dy. No feature in the vicinity of the ferromagnetic transition of hcp bulk Dy at $T_{\rm c}$ = 88 K is observed. However, below $\sim$130 K, the inverse magnetic moment (extracted from the XMCD) is linear in temperature, as commonly expected from a paramagnetic system, suggesting different behavior of Dy nano-islands than bulk Dy.

Submitted 3 May, 2017; originally announced May 2017.

Journal ref: J. Magn. Magn. Mater. 435, 212 (2017)

arXiv:1704.04428 [pdf, other]

Parallel Multi Channel Convolution using General Matrix Multiplication

Authors: Aravind Vasudevan, Andrew Anderson, David Gregg

Abstract: Convolutional neural networks (CNNs) have emerged as one of the most successful machine learning technologies for image and video processing. The most computationally intensive parts of CNNs are the convolutional layers, which convolve multi-channel images with multiple kernels. A common approach to implementing convolutional layers is to expand the image into a column matrix (im2col) and perform Multiple Channel Multiple Kernel (MCMK) convolution using an existing parallel General Matrix Multiplication (GEMM) library. This im2col conversion greatly increases the memory footprint of the input matrix and reduces data locality. In this paper we propose a new approach to MCMK convolution that is based on General Matrix Multiplication (GEMM), but not on im2col. Our algorithm eliminates the need for data replication on the input thereby enabling us to apply the convolution kernels on the input images directly. We have implemented several variants of our algorithm on a CPU processor and an embedded ARM processor. On the CPU, our algorithm is faster than im2col in most cases.

Submitted 3 July, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

Comments: Camera ready version to be published at ASAP 2017 - The 28th Annual IEEE International Conference on Application-specific Systems, Architectures and Processors. 6 pages

arXiv:1704.04208 [pdf, other]

Cardiac re-entry dynamics & self-termination in DT-MRI based model of Human Foetal Heart

Authors: Irina V. Biktasheva, Richard A. Anderson, Arun V. Holden, Eleftheria Pervolaraki, Fengcai Wen

Abstract: The effect of heart geometry and anisotropy on cardiac re-entry dynamics and self-termination is studied here in anatomically realistic computer simulations of the human foetal heart. Isotropic and anisotropic anatomy models of a human foetal heart at 20 weeks of gestational age, built from diffusion tensor MRI data sets, are used in the computer simulations. The fibre orientation angles of the heart were obtained from the DT-MRI primary eigenvalues. In a spatially homogeneous electrophysiological monodomain model with the DT-MRI based heart geometries, we initiate simplified FitzHugh-Nagumo kinetics cardiac re-entry at a prescribed location in a 2D slice, and in the full 3D anatomy model. In a slice of the heart, the MRI based fibre anisotropy changes the re-entry dynamics from pinned to anatomical re-entry. In the full 3D MRI based model, the foetal heart fibre anisotropy changes the re-entry dynamics from a persistent re-entry to re-entry self-termination.

Submitted 19 August, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

Comments: submitted to Chaos: An Interdisciplinary Journal of Nonlinear Science, Focus Issue on the topic of Complex Cardiac Dynamics

arXiv:1704.02341 [pdf, other]

HiFrames: High Performance Data Frames in a Scripting Language

Authors: Ehsan Totoni, Wajih Ul Hassan, Todd A. Anderson, Tatiana Shpeisman

Abstract: Data frames in scripting languages are essential abstractions for processing structured data. However, existing data frame solutions are either not distributed (e.g., Pandas in Python) and therefore have limited scalability, or they are not tightly integrated with array computations (e.g., Spark SQL). This paper proposes a novel compiler-based approach where we integrate data frames into the High Performance Analytics Toolkit (HPAT) to build HiFrames. It provides expressive and flexible data frame APIs which are tightly integrated with array operations. HiFrames then automatically parallelizes and compiles relational operations along with other array computations in end-to-end data analytics programs, and generates efficient MPI/C++ code. We demonstrate that HiFrames is significantly faster than alternatives such as Spark SQL on clusters, without forcing the programmer to switch to embedded SQL for part of the program. HiFrames is 3.6x to 70x faster than Spark SQL for basic relational operations, and can be up to 20,000x faster for advanced analytics operations, such as weighted moving averages (WMA), that the map-reduce paradigm cannot handle effectively. HiFrames is also 5x faster than Spark SQL for TPCx-BB Q26 on 64 nodes of the Cori supercomputer.

Submitted 7 April, 2017; originally announced April 2017.

arXiv:1702.05556 [pdf, ps, other]

doi 10.1103/PhysRevA.95.061804

Paschen-Back effect and Rydberg-state diamagnetism in vapor-cell electromagnetically induced transparency

Authors: L. Ma, D. A. Anderson, G. Raithel

Abstract: We report on rubidium vapor-cell Rydberg electromagnetically induced transparency (EIT) in a 0.7 T magnetic field where all involved levels are in the hyperfine Paschen-Back regime, and the Rydberg state exhibits a strong diamagnetic interaction with the magnetic field. Signals from both $^{85}\mathrm{Rb}$ and $^{87}\mathrm{Rb}$ are present in the EIT spectra. This feature of isotope-mixed Rb cells allows us to measure the field strength to within a $\pm 0.12\%$ relative uncertainty. The measured spectra are in excellent agreement with the results of a Monte Carlo calculation and indicate unexpectedly large Rydberg-level dephasing rates. Line shifts and broadenings due to small inhomogeneities of the magnetic field are included in the model.

Submitted 21 February, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

Comments: 6 pages, 5 figures

Journal ref: Phys. Rev. A 95, 061804 (2017)

arXiv:1702.02092 [pdf, ps, other]

Characterisation of speech diversity using self-organising maps

Authors: Tom A. F. Anderson, David M. W. Powers

Abstract: We report investigations into speaker classification of larger quantities of unlabelled speech data using small sets of manually phonemically annotated speech. The Kohonen speech typewriter is a semi-supervised method comprised of self-organising maps (SOMs) that achieves low phoneme error rates. A SOM is a 2D array of cells that learn vector representations of the data based on neighbourhoods. In this paper, we report a method to evaluate pronunciation using multilevel SOMs with /hVd/ single syllable utterances for the study of vowels, for Australian pronunciation.

Submitted 23 January, 2017; originally announced February 2017.

Comments: 16th Speech Science and Technology Conference (SST2016)

arXiv:1701.00861 [pdf, other]

doi 10.1063/1.4973571

Magnetite nano-islands on silicon-carbide with graphene

Authors: Nathaniel A. Anderson, Qiang Zhang, Myron Hupalo, Richard A. Rosenberg, Michael C. Tringides, David Vaknin

Abstract: X-ray magnetic circular dichroism (XMCD) measurements of iron nano-islands grown on graphene and covered with a Au film for passivation reveal that oxidation through defects in the Au film spontaneously leads to the formation of magnetite nano-particles (i.e., Fe$_3$O$_4$). The Fe nano-islands (20 and 75 monolayers; MLs) are grown on epitaxial graphene formed by thermally annealing 6H-SiC(0001) and subsequently covered, in the growth chamber, with nominally 20 layers of Au. Our X-ray absorption spectroscopy and XMCD measurements at applied magnetic fields show that the thin film (20 ML) is totally converted to magnetite whereas the thicker film (75 ML) exhibits properties of magnetite but also those of pure metallic iron. Temperature dependence of the XMCD signal (of both samples) shows a clear transition at $T_{\rm V}\approx 120$ K consistent with the Verwey transition of bulk magnetite. These results have implications for the synthesis of magnetite nano-crystals and also for their regular arrangement on functional substrates such as graphene.

Submitted 3 January, 2017; originally announced January 2017.

Comments: 5 pages, 6 figures

Journal ref: J. Appl. Phys. 121, 014310 (2017)

arXiv:1611.10003 [pdf]

Vocabulary and the Brain: Evidence from Neuroimaging Studies

Authors: Tom A. F. Anderson, C. -H. Ruan

Abstract: In summary of the research findings presented in this paper, various brain regions are correlated with vocabulary and vocabulary acquisition. Semantic associations for vocabulary seem to be located near brain areas that vary according to the type of vocabulary, e.g. ventral temporal regions important for words for things that can be seen. Semantic processing is believed to be strongly associated with the ANG. Phonological ability has been closely related to the anterior surfaces of the SMG. Pathways through the posterior SMG are thought to link the anterior SMG and the ANG. In vocabulary tasks, mediotemporal structures may be related to long-term memory processing, with left hippocampal and parahippocampal regions related to long-term and working memory, respectively. Precentral structures are associated with phonological retrieval. Furthermore, many more regions of the brain are of interest in vocabulary tasks, particularly in areas important for visual and auditory processing. Furthermore, differences between brain anatomies can be attributed to vocabulary demands of different languages.

Submitted 30 November, 2016; originally announced November 2016.

arXiv:1611.04934 [pdf, other]

HPAT: High Performance Analytics with Scripting Ease-of-Use

Authors: Ehsan Totoni, Todd A. Anderson, Tatiana Shpeisman

Abstract: Big data analytics requires high programmer productivity and high performance simultaneously on large-scale clusters. However, current big data analytics frameworks (e.g. Apache Spark) have prohibitive runtime overheads since they are library-based. We introduce a novel auto-parallelizing compiler approach that exploits the characteristics of the data analytics domain such as the map/reduce parallel pattern and is robust, unlike previous auto-parallelization methods. Using this approach, we build High Performance Analytics Toolkit (HPAT), which parallelizes high-level scripting (Julia) programs automatically, generates efficient MPI/C++ code, and provides resiliency. Furthermore, it provides automatic optimizations for scripting programs, such as fusion of array operations. Thus, HPAT is 369x to 2033x faster than Spark on the Cori supercomputer and 20x to 256x faster on Amazon AWS.

Submitted 10 April, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

arXiv:1611.01714 [pdf, other]

Beyond Fine Tuning: A Modular Approach to Learning on Small Data

Authors: Ark Anderson, Kyle Shaffer, Artem Yankov, Court D. Corley, Nathan O. Hodas

Abstract: In this paper we present a technique to train neural network models on small amounts of data. Current methods for training neural networks on small amounts of rich data typically rely on strategies such as fine-tuning a pre-trained neural network or the use of domain-specific hand-engineered features. Here we take the approach of treating network layers, or entire networks, as modules and combine pre-trained modules with untrained modules, to learn the shift in distributions between data sets. The central impact of using a modular approach comes from adding new representations to a network, as opposed to replacing representations via fine-tuning. Using this technique, we are able to surpass results using standard fine-tuning transfer learning approaches, and we are also able to significantly increase performance over such approaches when using smaller amounts of data.

Submitted 5 November, 2016; originally announced November 2016.

arXiv:1610.00006 [pdf, other]

doi 10.1103/PhysRevD.95.082002

Projected Sensitivity of the SuperCDMS SNOLAB experiment

Authors: R. Agnese, A. J. Anderson, T. Aramaki, I. Arnquist, W. Baker, D. Barker, R. Basu Thakur, D. A. Bauer, A. Borgland, M. A. Bowles, P. L. Brink, R. Bunker, B. Cabrera, D. O. Caldwell, R. Calkins, C. Cartaro, D. G. Cerdeño, H. Chagani, Y. Chen, J. Cooley, B. Cornell, P. Cushman, M. Daal, P. C. F. Di Stefano, T. Doughty , et al. (71 additional authors not shown)

Abstract: SuperCDMS SNOLAB will be a next-generation experiment aimed at directly detecting low-mass (< 10 GeV/c$^2$) particles that may constitute dark matter by using cryogenic detectors of two types (HV and iZIP) and two target materials (germanium and silicon). The experiment is being designed with an initial sensitivity to nuclear recoil cross sections ~ 1 x 10$^{-43}$ cm$^2$ for a dark matter particle mass of 1 GeV/c$^2$, and with capacity to continue exploration to both smaller masses and better sensitivities. The phonon sensitivity of the HV detectors will be sufficient to detect nuclear recoils from sub-GeV dark matter. A detailed calibration of the detector response to low energy recoils will be needed to optimize running conditions of the HV detectors and to interpret their data for dark matter searches. Low-activity shielding, and the depth of SNOLAB, will reduce most backgrounds, but cosmogenically produced $^{3}$H and naturally occurring $^{32}$Si will be present in the detectors at some level. Even if these backgrounds are x10 higher than expected, the science reach of the HV detectors would be over three orders of magnitude beyond current results for a dark matter mass of 1 GeV/c$^2$. The iZIP detectors are relatively insensitive to variations in detector response and backgrounds, and will provide better sensitivity for dark matter particle masses (> 5 GeV/c$^2$).
The mix of detector types (HV and iZIP), and targets (germanium and silicon), planned for the experiment, as well as flexibility in how the detectors are operated, will allow us to maximize the low-mass reach, and understand the backgrounds that the experiment will encounter. Upgrades to the experiment, perhaps with a variety of ultra-low-background cryogenic detectors, will extend dark matter sensitivity down to the "neutrino floor", where coherent scatters of solar neutrinos become a limiting background.

Submitted 30 September, 2016; originally announced October 2016.

Comments: SuperCDMS SNOLAB Projected sensitivity reach

Journal ref: Phys. Rev. D 95, 082002 (2017)

Showing 151–200 of 337 results for author: Anderson, A