Search | arXiv e-print repository

TensorFlow: A system for large-scale machine learning

Authors: Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, Xiaoqiang Zheng

Abstract: TensorFlow is a machine learning system that operates at large scale and in heterogeneous environments. TensorFlow uses dataflow graphs to represent computation, shared state, and the operations that mutate that state. It maps the nodes of a dataflow graph across many machines in a cluster, and within a machine across multiple computational devices, including multicore CPUs, general-purpose GPUs,… ▽ More TensorFlow is a machine learning system that operates at large scale and in heterogeneous environments. TensorFlow uses dataflow graphs to represent computation, shared state, and the operations that mutate that state. It maps the nodes of a dataflow graph across many machines in a cluster, and within a machine across multiple computational devices, including multicore CPUs, general-purpose GPUs, and custom designed ASICs known as Tensor Processing Units (TPUs). This architecture gives flexibility to the application developer: whereas in previous "parameter server" designs the management of shared state is built into the system, TensorFlow enables developers to experiment with novel optimizations and training algorithms. TensorFlow supports a variety of applications, with particularly strong support for training and inference on deep neural networks. Several Google services use TensorFlow in production, we have released it as an open-source project, and it has become widely used for machine learning research. In this paper, we describe the TensorFlow dataflow model in contrast to existing systems, and demonstrate the compelling performance that TensorFlow achieves for several real-world applications. △ Less

Submitted 31 May, 2016; v1 submitted 27 May, 2016; originally announced May 2016.

Comments: 18 pages, 9 figures; v2 has a spelling correction in the metadata

arXiv:1603.04467 [pdf, other]

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Authors: Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah , et al. (15 additional authors not shown)

Abstract: TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational de… ▽ More TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational devices such as GPU cards. The system is flexible and can be used to express a wide variety of algorithms, including training and inference algorithms for deep neural network models, and it has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields, including speech recognition, computer vision, robotics, information retrieval, natural language processing, geographic information extraction, and computational drug discovery. This paper describes the TensorFlow interface and an implementation of that interface that we have built at Google. The TensorFlow API and a reference implementation were released as an open-source package under the Apache 2.0 license in November, 2015 and are available at www.tensorflow.org. △ Less

Submitted 16 March, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

Comments: Version 2 updates only the metadata, to correct the formatting of Martín Abadi's name

arXiv:1405.6632 [pdf, other]

Postselected quantum circuits

Authors: Michael Devin

Abstract: The purpose of this paper is to show the unusual behavior of a number of simple circuits under the effects of post-selection. A useful duality exists between post-selected ensembles and a consistent picture of acausal physics embodying the application of Novikov's consistency postulate to the wave-function of a time machine. A competing view applies this postulate to density matrices instead, but… ▽ More The purpose of this paper is to show the unusual behavior of a number of simple circuits under the effects of post-selection. A useful duality exists between post-selected ensembles and a consistent picture of acausal physics embodying the application of Novikov's consistency postulate to the wave-function of a time machine. A competing view applies this postulate to density matrices instead, but that lies outside the scope of this paper. To find out the result of a measurement on a particular circuit, we consider a weighted distribution of histories between some initial and final time, such that each individually obeys ordinary quantum mechanics. The weight of a particular history is given by a joint distribution over the values of a part of the system dubbed the time machine, at two chosen times. In the dual post-selection picture the weight is simply the probability for the 'periodic bit' to satisfy the constraint, plus a small noise part, often assumed constant, representing the error rate of the time machine's channel. The emerging bit is one of a Bell pair, its partner kept in reserve to test that the bit entering the time machine later matches the one which emerged. This is done by interfering the incoming bit with the reserve member of the pair and kee** the experimental runs that interfere constructively. Noise can be effectively simulated by kee** a small proportion of runs that interfere destructively, or by perturbing the reserve bit to decohere it by a similar amount. Paradoxes remain but are tractable. △ Less

Submitted 26 May, 2014; originally announced May 2014.

arXiv:1403.3706 [pdf, other]

Experimental Test of the Final State Hypothesis

Authors: Michael Devin

Abstract: The black hole final state projection model, also known as the Horowitz-Maldacena model has garnished new interest due to the current debate over black hole firewalls. The nonlinear quantum mechanics of post-selection preserves information and avoids the AMPS argument by relaxing monogamy of entanglement. While these are promising features there are also potentially observable predictions to be ma… ▽ More The black hole final state projection model, also known as the Horowitz-Maldacena model has garnished new interest due to the current debate over black hole firewalls. The nonlinear quantum mechanics of post-selection preserves information and avoids the AMPS argument by relaxing monogamy of entanglement. While these are promising features there are also potentially observable predictions to be made. △ Less

Submitted 13 March, 2014; originally announced March 2014.

arXiv:1401.0588 [pdf, other]

Musings on Firewalls and the Information Paradox

Authors: Michael Devin

Abstract: The past year has seen an explosion of new and old ideas about black hole physics. Prior to the firewall paper, the dominant picture was the thermofield model apparently implied by ADS/CFT duality\cite{mal2}. While some seek a narrow responce to Almheiri, Marolf, Polchinski, and Sully,(AMPS)\cite{amps}, there are a number of competing models. One problem in the field is the ambiguity of the compet… ▽ More The past year has seen an explosion of new and old ideas about black hole physics. Prior to the firewall paper, the dominant picture was the thermofield model apparently implied by ADS/CFT duality\cite{mal2}. While some seek a narrow responce to Almheiri, Marolf, Polchinski, and Sully,(AMPS)\cite{amps}, there are a number of competing models. One problem in the field is the ambiguity of the competing proposals. Some are equivalent while others incompatible. This paper will attempt to define and classify a few models representative of the current discussions. △ Less

Submitted 13 March, 2014; v1 submitted 3 January, 2014; originally announced January 2014.

Comments: 4 figures

arXiv:1302.3298 [pdf, other]

Thermodynamics of Time Machines

Authors: Michael Devin

Abstract: In this note, a brief review of the consistent state approach to systems containing closed timelike curves or similar devices is given, and applied to the well known thermodynamic problem of Maxwell's demon. The 'third party paradox' for acausal systems is defined and applied to CTC censorship and black hole evaporation. Some traditional arguments for chronology protection are re-examined. In this note, a brief review of the consistent state approach to systems containing closed timelike curves or similar devices is given, and applied to the well known thermodynamic problem of Maxwell's demon. The 'third party paradox' for acausal systems is defined and applied to CTC censorship and black hole evaporation. Some traditional arguments for chronology protection are re-examined. △ Less

Submitted 8 February, 2013; originally announced February 2013.

arXiv:1112.6209 [pdf, other]

Building high-level features using large scale unsupervised learning

Authors: Quoc V. Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg S. Corrado, Jeff Dean, Andrew Y. Ng

Abstract: We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 milli… ▽ More We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200x200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting with these learned features, we trained our network to obtain 15.8% accuracy in recognizing 20,000 object categories from ImageNet, a leap of 70% relative improvement over the previous state-of-the-art. △ Less

Submitted 12 July, 2012; v1 submitted 28 December, 2011; originally announced December 2011.

arXiv:0907.1756 [pdf, ps, other]

doi 10.1088/1126-6708/2009/10/095

Orbifold Branes in the $M_D \times M_{d^+} \times M_{d^-}$ Compactification of type II string on $S^1/Z_2$ and their cosmological applications

Authors: Michael Devin, Tibra Ali, Gerald Cleaver, Anzhong Wang, Qiang Wu

Abstract: In this paper, we study the implementation of brane worlds in type II string theory. Starting with the NS/NS sector of type II string, we first compactify the $(D+d_{+} + d_{-})$-dimensional spacetime, and reduce the corresponding action to a D-dimensional effective action, where the topologies of $M_{d_{+}}$ and $M_{d_{-}}$ are arbitrary. We further compactify one of the $(D-1)$ spatial dimensi… ▽ More In this paper, we study the implementation of brane worlds in type II string theory. Starting with the NS/NS sector of type II string, we first compactify the $(D+d_{+} + d_{-})$-dimensional spacetime, and reduce the corresponding action to a D-dimensional effective action, where the topologies of $M_{d_{+}}$ and $M_{d_{-}}$ are arbitrary. We further compactify one of the $(D-1)$ spatial dimensions on an $S^{1}/Z_{2}$ orbifold, and derive the gravitational and matter field equations both in the bulk and on the branes. Then, we investigate two key issues in such a setup: (i) the radion stability and radion mass; and (ii) the localization of gravity, and the corresponding Kaluza-Klein (KK) modes. We show explicitly that the radion is stable and its mass can be in the order of $GeV$. In addition, the gravity is localized on the visible brane, and its spectrum of the gravitational KK towers is discrete and can have a mass gap of $TeV$, too. The high order Yukawa corrections to the 4-dimensional Newtonian potential is exponentially suppressed, and can be negligible. Applying such a setup to cosmology, we obtain explicitly the field equations in the bulk and the generalized Friedmann equations on the branes. △ Less

Submitted 10 July, 2009; originally announced July 2009.

Comments: revtex4, 16 pages, 8 figures

Journal ref: JHEP 0910:095,2009

Showing 1–8 of 8 results for author: Devin, M