Search | arXiv e-print repository

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

Authors: Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai, Saurav Kadavath, Ben Mann, Ethan Perez, Nicholas Schiefer, Kamal Ndousse, Andy Jones, Sam Bowman, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Nelson Elhage, Sheer El-Showk, Stanislav Fort, Zac Hatfield-Dodds, Tom Henighan, Danny Hernandez, Tristan Hume, Josh Jacobson, Scott Johnston , et al. (11 additional authors not shown)

Abstract: We describe our early efforts to red team language models in order to simultaneously discover, measure, and attempt to reduce their potentially harmful outputs. We make three main contributions. First, we investigate scaling behaviors for red teaming across 3 model sizes (2.7B, 13B, and 52B parameters) and 4 model types: a plain language model (LM); an LM prompted to be helpful, honest, and harmle… ▽ More We describe our early efforts to red team language models in order to simultaneously discover, measure, and attempt to reduce their potentially harmful outputs. We make three main contributions. First, we investigate scaling behaviors for red teaming across 3 model sizes (2.7B, 13B, and 52B parameters) and 4 model types: a plain language model (LM); an LM prompted to be helpful, honest, and harmless; an LM with rejection sampling; and a model trained to be helpful and harmless using reinforcement learning from human feedback (RLHF). We find that the RLHF models are increasingly difficult to red team as they scale, and we find a flat trend with scale for the other model types. Second, we release our dataset of 38,961 red team attacks for others to analyze and learn from. We provide our own analysis of the data and find a variety of harmful outputs, which range from offensive language to more subtly harmful non-violent unethical outputs. Third, we exhaustively describe our instructions, processes, statistical methodologies, and uncertainty about red teaming. We hope that this transparency accelerates our ability to work together as a community in order to develop shared norms, practices, and technical standards for how to red team language models. △ Less

Submitted 22 November, 2022; v1 submitted 23 August, 2022; originally announced September 2022.

arXiv:2207.05221 [pdf, other]

Language Models (Mostly) Know What They Know

Authors: Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El-Showk, Andy Jones, Nelson Elhage, Tristan Hume, Anna Chen, Yuntao Bai, Sam Bowman, Stanislav Fort, Deep Ganguli, Danny Hernandez, Josh Jacobson, Jackson Kernion, Shauna Kravec, Liane Lovitt , et al. (11 additional authors not shown)

Abstract: We study whether language models can evaluate the validity of their own claims and predict which questions they will be able to answer correctly. We first show that larger models are well-calibrated on diverse multiple choice and true/false questions when they are provided in the right format. Thus we can approach self-evaluation on open-ended sampling tasks by asking models to first propose answe… ▽ More We study whether language models can evaluate the validity of their own claims and predict which questions they will be able to answer correctly. We first show that larger models are well-calibrated on diverse multiple choice and true/false questions when they are provided in the right format. Thus we can approach self-evaluation on open-ended sampling tasks by asking models to first propose answers, and then to evaluate the probability "P(True)" that their answers are correct. We find encouraging performance, calibration, and scaling for P(True) on a diverse array of tasks. Performance at self-evaluation further improves when we allow models to consider many of their own samples before predicting the validity of one specific possibility. Next, we investigate whether models can be trained to predict "P(IK)", the probability that "I know" the answer to a question, without reference to any particular proposed answer. Models perform well at predicting P(IK) and partially generalize across tasks, though they struggle with calibration of P(IK) on new tasks. The predicted P(IK) probabilities also increase appropriately in the presence of relevant source materials in the context, and in the presence of hints towards the solution of mathematical word problems. We hope these observations lay the groundwork for training more honest models, and for investigating how honesty generalizes to cases where models are trained on objectives other than the imitation of human writing. △ Less

Submitted 21 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

Comments: 23+17 pages; refs added, typos fixed

arXiv:2205.10487 [pdf, other]

Scaling Laws and Interpretability of Learning from Repeated Data

Authors: Danny Hernandez, Tom Brown, Tom Conerly, Nova DasSarma, Dawn Drain, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Tom Henighan, Tristan Hume, Scott Johnston, Ben Mann, Chris Olah, Catherine Olsson, Dario Amodei, Nicholas Joseph, Jared Kaplan, Sam McCandlish

Abstract: Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repea… ▽ More Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repeated data. In this paper we attempt to study repeated data systematically and to understand its effects mechanistically. To do this, we train a family of models where most of the data is unique but a small fraction of it is repeated many times. We find a strong double descent phenomenon, in which repeated data can lead test loss to increase midway through training. A predictable range of repetition frequency leads to surprisingly severe degradation in performance. For instance, performance of an 800M parameter model can be degraded to that of a 2x smaller model (400M params) by repeating 0.1% of the data 100 times, despite the other 90% of the training tokens remaining unique. We suspect there is a range in the middle where the data can be memorized and doing so consumes a large fraction of the model's capacity, and this may be where the peak of degradation occurs. Finally, we connect these observations to recent mechanistic interpretability work - attempting to reverse engineer the detailed computations performed by the model - by showing that data repetition disproportionately damages copying and internal structures associated with generalization, such as induction heads, providing a possible mechanism for the shift from generalization to memorization. Taken together, these results provide a hypothesis for why repeating a relatively small fraction of data in large language models could lead to disproportionately large harms to performance. △ Less

Submitted 20 May, 2022; originally announced May 2022.

Comments: 23 pages, 22 figures

arXiv:2204.05862 [pdf, other]

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Authors: Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, Nicholas Joseph, Saurav Kadavath, Jackson Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei , et al. (6 additional authors not shown)

Abstract: We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants. We find this alignment training improves performance on almost all NLP evaluations, and is fully compatible with training for specialized skills such as python coding and summarization. We explore an iterated online mode of training, where prefer… ▽ More We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants. We find this alignment training improves performance on almost all NLP evaluations, and is fully compatible with training for specialized skills such as python coding and summarization. We explore an iterated online mode of training, where preference models and RL policies are updated on a weekly cadence with fresh human feedback data, efficiently improving our datasets and models. Finally, we investigate the robustness of RLHF training, and identify a roughly linear relation between the RL reward and the square root of the KL divergence between the policy and its initialization. Alongside our main results, we perform peripheral analyses on calibration, competing objectives, and the use of OOD detection, compare our models with human writers, and provide samples from our models using prompts appearing in recent related work. △ Less

Submitted 12 April, 2022; originally announced April 2022.

Comments: Data available at https://github.com/anthropics/hh-rlhf

arXiv:1912.03949 [pdf]

doi 10.1038/s41477-019-0429-5

Sphingolipid biosynthesis modulates plasmodesmal ultrastructure and phloem unloading

Authors: Dawei Yan, Shri Yadav, Andrea Paterlini, William Nicolas, Ilya Belevich, Magali Grison, Anne Vaten, Leila Karami, Sedeer El-Showk, Jung-Youn Lee, Gosia Murawska, Jenny Mortimer, Michael Knoblauch, Eija Jokitalo, Jonathan Markham, Emmanuelle Bayer, Ykä Helariutta

Abstract: During phloem unloading, multiple cell-to-cell transport events move organic substances to the root meristem. Although the primary unloading event from the sieve elements to the phloem pole pericycle has been characterized to some extent, little is known about post-sieve element unloading. Here, we report a novel gene, PHLOEM UNLOADING MODULATOR (PLM), in the absence of which plasmodesmata-mediate… ▽ More During phloem unloading, multiple cell-to-cell transport events move organic substances to the root meristem. Although the primary unloading event from the sieve elements to the phloem pole pericycle has been characterized to some extent, little is known about post-sieve element unloading. Here, we report a novel gene, PHLOEM UNLOADING MODULATOR (PLM), in the absence of which plasmodesmata-mediated symplastic transport through the phloem pole pericycle--endodermis interface is specifically enhanced. Increased unloading is attributable to a defect in the formation of the endoplasmic reticulum--plasma membrane tethers during plasmodesmal morphogenesis, resulting in the majority of pores lacking a visible cytoplasmic sleeve. PLM encodes a putative enzyme required for the biosynthesis of sphingolipids with very-long-chain fatty acid. Taken together, our results indicate that post-sieve element unloading involves sphingolipid metabolism, which affects plasmodesmal ultrastructure. They also raise the question of how and why plasmodesmata with no cytoplasmic sleeve facilitate molecular trafficking. △ Less

Submitted 9 December, 2019; originally announced December 2019.

Comments: Nature Plants, Nature Publishing Group, In press

arXiv:1605.08087 [pdf, other]

Extremal bootstrap**: go with the flow

Authors: Sheer El-Showk, Miguel F. Paulos

Abstract: The extremal functional method determines approximate solutions to the constraints of crossing symmetry, which saturate bounds on the space of unitary CFTs. We show that such solutions are characterized by extremality conditions, which may be used to flow continuously along the boundaries of parameter space. Along the flow there is generically no further need for optimization, which dramatically r… ▽ More The extremal functional method determines approximate solutions to the constraints of crossing symmetry, which saturate bounds on the space of unitary CFTs. We show that such solutions are characterized by extremality conditions, which may be used to flow continuously along the boundaries of parameter space. Along the flow there is generically no further need for optimization, which dramatically reduces computational requirements, bringing calculations from the realm of computing clusters to laptops. Conceptually, extremality sheds light on possible ways to bootstrap without positivity, extending the method to non-unitary theories, and implies that theories saturating bounds, and especially those sitting at kinks, have unusually sparse spectra. We discuss several applications, including the first high-precision bootstrap of a non-unitary CFT. △ Less

Submitted 25 May, 2016; originally announced May 2016.

Comments: 35+5 pages and lots of nice figures

Report number: CERN-TH-2016-125

arXiv:1503.02081 [pdf, other]

doi 10.1007/JHEP08(2015)142

Bootstrap** SCFTs with Four Supercharges

Authors: Nikolay Bobev, Sheer El-Showk, Dalimil Mazac, Miguel F. Paulos

Abstract: We study the constraints imposed by superconformal symmetry, crossing symmetry, and unitarity for theories with four supercharges in spacetime dimension $2\leq d\leq 4$. We show how superconformal algebras with four Poincaré supercharges can be treated in a formalism applicable to any, in principle continuous, value of $d$ and use this to construct the superconformal blocks for any $d\leq 4$. We t… ▽ More We study the constraints imposed by superconformal symmetry, crossing symmetry, and unitarity for theories with four supercharges in spacetime dimension $2\leq d\leq 4$. We show how superconformal algebras with four Poincaré supercharges can be treated in a formalism applicable to any, in principle continuous, value of $d$ and use this to construct the superconformal blocks for any $d\leq 4$. We then use numerical bootstrap techniques to derive upper bounds on the conformal dimension of the first unprotected operator appearing in the OPE of a chiral and an anti-chiral superconformal primary. We obtain an intriguing structure of three distinct kinks. We argue that one of the kinks smoothly interpolates between the $d=2$, $\mathcal N=(2,2)$ minimal model with central charge $c=1$ and the theory of a free chiral multiplet in $d=4$, passing through the critical Wess-Zumino model with cubic superpotential in intermediate dimensions. △ Less

Submitted 28 August, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

Comments: 43+16 pages, 13 figures. Comments welcome. v2: Minor changes, published version

Journal ref: JHEP 1508 (2015) 142

arXiv:1502.04124 [pdf, other]

doi 10.1103/PhysRevLett.115.051601

Bootstrap** the Three-Dimensional Supersymmetric Ising Model

Authors: Nikolay Bobev, Sheer El-Showk, Dalimil Mazac, Miguel F. Paulos

Abstract: We implement the conformal bootstrap program for three-dimensional CFTs with $\mathcal{N}=2$ supersymmetry and find universal constraints on the spectrum of operator dimensions in these theories. By studying the bounds on the dimension of the first scalar appearing in the OPE of a chiral and an anti-chiral primary, we find a kink at the expected location of the critical three-dimensional… ▽ More We implement the conformal bootstrap program for three-dimensional CFTs with $\mathcal{N}=2$ supersymmetry and find universal constraints on the spectrum of operator dimensions in these theories. By studying the bounds on the dimension of the first scalar appearing in the OPE of a chiral and an anti-chiral primary, we find a kink at the expected location of the critical three-dimensional $\mathcal{N}=2$ Wess-Zumino model, which can be thought of as a supersymmetric analog of the critical Ising model. Focusing on this kink, we determine, to high accuracy, the low-lying spectrum of operator dimensions of the theory. △ Less

Submitted 9 August, 2015; v1 submitted 13 February, 2015; originally announced February 2015.

Comments: 5 pages, 6 figures

Journal ref: Phys. Rev. Lett. 115, 051601 (2015)

arXiv:1403.4545 [pdf, other]

doi 10.1007/s10955-014-1042-7

Solving the 3d Ising Model with the Conformal Bootstrap II. c-Minimization and Precise Critical Exponents

Authors: Sheer El-Showk, Miguel F. Paulos, David Poland, Slava Rychkov, David Simmons-Duffin, Alessandro Vichi

Abstract: We use the conformal bootstrap to perform a precision study of the operator spectrum of the critical 3d Ising model. We conjecture that the 3d Ising spectrum minimizes the central charge c in the space of unitary solutions to crossing symmetry. Because extremal solutions to crossing symmetry are uniquely determined, we are able to precisely reconstruct the first several Z2-even operator dimensions… ▽ More We use the conformal bootstrap to perform a precision study of the operator spectrum of the critical 3d Ising model. We conjecture that the 3d Ising spectrum minimizes the central charge c in the space of unitary solutions to crossing symmetry. Because extremal solutions to crossing symmetry are uniquely determined, we are able to precisely reconstruct the first several Z2-even operator dimensions and their OPE coefficients. We observe that a sharp transition in the operator spectrum occurs at the 3d Ising dimension Delta_sigma=0.518154(15), and find strong numerical evidence that operators decouple from the spectrum as one approaches the 3d Ising point. We compare this behavior to the analogous situation in 2d, where the disappearance of operators can be understood in terms of degenerate Virasoro representations. △ Less

Submitted 4 June, 2014; v1 submitted 18 March, 2014; originally announced March 2014.

Comments: 55 pages, many figures; v2 - refs and comments added, to appear in a special issue of J.Stat.Phys. in memory of Kenneth Wilson

Report number: CERN-PH-TH/2014-038, NSF-KITP-14-022

Journal ref: J. Stat. Phys. 157, 869-914 (2014)

arXiv:1309.5089 [pdf, other]

doi 10.1103/PhysRevLett.112.141601

Conformal Field Theories in Fractional Dimensions

Authors: S. El-Showk, M. Paulos, D. Poland, S. Rychkov, D. Simmons-Duffin, A. Vichi

Abstract: We study the conformal bootstrap in fractional space-time dimensions, obtaining rigorous bounds on operator dimensions. Our results show strong evidence that there is a family of unitary CFTs connecting the 2D Ising model, the 3D Ising model, and the free scalar theory in 4D. We give numerical predictions for the leading operator dimensions and central charge in this family at different values of… ▽ More We study the conformal bootstrap in fractional space-time dimensions, obtaining rigorous bounds on operator dimensions. Our results show strong evidence that there is a family of unitary CFTs connecting the 2D Ising model, the 3D Ising model, and the free scalar theory in 4D. We give numerical predictions for the leading operator dimensions and central charge in this family at different values of D and compare these to calculations of phi^4 theory in the epsilon-expansion. △ Less

Submitted 12 October, 2015; v1 submitted 19 September, 2013; originally announced September 2013.

Comments: 11 pages, 4 figures - references updated - one affiliation modified

Report number: CERN-PH-TH/2013-219

Journal ref: Phys. Rev. Lett. 112, 141601 (2014)

arXiv:1211.2810 [pdf, other]

doi 10.1103/PhysRevLett.111.241601

Bootstrap** Conformal Field Theories with the Extremal Functional Method

Authors: Sheer El-Showk, Miguel F. Paulos

Abstract: The existence of a positive linear functional acting on the space of (differences between) conformal blocks has been shown to rule out regions in the parameter space of conformal field theories (CFTs). We argue that at the boundary of the allowed region the extremal functional contains, in principle, enough information to determine the dimensions and OPE coefficients of an infinite number of opera… ▽ More The existence of a positive linear functional acting on the space of (differences between) conformal blocks has been shown to rule out regions in the parameter space of conformal field theories (CFTs). We argue that at the boundary of the allowed region the extremal functional contains, in principle, enough information to determine the dimensions and OPE coefficients of an infinite number of operators appearing in the correlator under analysis. Based on this idea we develop the Extremal Functional Method (EFM), a numerical procedure for deriving the spectrum and OPE coefficients of CFTs lying on the boundary (of solution space). We test the EFM by using it to rederive the low lying spectrum and OPE coefficients of the 2d Ising model based solely on the dimension of a single scalar quasi-primary -- no Virasoro algebra required. Our work serves as a benchmark for applications to more interesting, less known CFTs in the near future. △ Less

Submitted 12 November, 2012; originally announced November 2012.

Comments: 28 pages, 9 figures, 3 tables

arXiv:1205.5023 [pdf, other]

doi 10.1007/JHEP11(2012)171

Scaling BPS Solutions and pure-Higgs States

Authors: Iosif Bena, Micha Berkooz, Jan de Boer, Sheer El-Showk, Dieter Van den Bleeken

Abstract: Depending on the value of the coupling, BPS states of type II string theory compactified on a Calabi-Yau manifold can be described as multicenter supergravity solutions or as states on the Coulomb or the Higgs branch of a quiver gauge theory. While the Coulomb-branch states can be mapped one-to-one to supergravity states, this is not automatically so for Higgs-branch states. In this paper we expli… ▽ More Depending on the value of the coupling, BPS states of type II string theory compactified on a Calabi-Yau manifold can be described as multicenter supergravity solutions or as states on the Coulomb or the Higgs branch of a quiver gauge theory. While the Coulomb-branch states can be mapped one-to-one to supergravity states, this is not automatically so for Higgs-branch states. In this paper we explicitly compute the BPS spectrum of the Higgs branch of a three-center quiver with a closed loop, and identify the subset of states that are in one-to-one correspondence with Coulomb/supergravity multicenter states. We also show that there exist additional "pure-Higgs" states, that exist if and only if the charges of the centers can form a scaling solution. Using generating function techniques we compute the large charge degeneracy of the "pure-Higgs" sector and show that it is always exponential. We also construct the map between Higgs- and Coulomb-branch states, discuss its relation to the Higgs-Coulomb map of one of the authors and Verlinde, and argue that the pure Higgs states live in the kernel of this map. Given that these states have no obvious description on the Coulomb branch or in supergravity, we discuss whether they can correspond to a single-center black hole or can be related to more complicated horizonless configurations. △ Less

Submitted 22 May, 2012; originally announced May 2012.

Comments: 37 pages, 4 figures

Report number: IPhT-T12/041

arXiv:1203.6064 [pdf, other]

doi 10.1103/PhysRevD.86.025022

Solving the 3D Ising Model with the Conformal Bootstrap

Authors: Sheer El-Showk, Miguel F. Paulos, David Poland, Slava Rychkov, David Simmons-Duffin, Alessandro Vichi

Abstract: We study the constraints of crossing symmetry and unitarity in general 3D Conformal Field Theories. In doing so we derive new results for conformal blocks appearing in four-point functions of scalars and present an efficient method for their computation in arbitrary space-time dimension. Comparing the resulting bounds on operator dimensions and OPE coefficients in 3D to known results, we find that… ▽ More We study the constraints of crossing symmetry and unitarity in general 3D Conformal Field Theories. In doing so we derive new results for conformal blocks appearing in four-point functions of scalars and present an efficient method for their computation in arbitrary space-time dimension. Comparing the resulting bounds on operator dimensions and OPE coefficients in 3D to known results, we find that the 3D Ising model lies at a corner point on the boundary of the allowed parameter space. We also derive general upper bounds on the dimensions of higher spin operators, relevant in the context of theories with weakly broken higher spin symmetries. △ Less

Submitted 1 August, 2012; v1 submitted 27 March, 2012; originally announced March 2012.

Comments: 32 pages, 11 figures; v2: refs added, small changes in Section 5.3, Fig. 7 replaced; v3: ref added, fits redone in Section 5.4

Report number: LPTENS-12/07

Journal ref: Phys. Rev. D 86, 025022 (2012)

arXiv:1108.6091 [pdf, ps, other]

doi 10.1007/JHEP12(2012)009

Kerr/CFT, dipole theories and nonrelativistic CFTs

Authors: Sheer El-Showk, Monica Guica

Abstract: We study solutions of type IIB supergravity which are SL(2,R) x SU(2) x U(1)^2 invariant deformations of AdS_3 x S^3 x K3 and take the form of products of self-dual spacelike warped AdS_3 and a deformed three-sphere. One of these backgrounds has been recently argued to be relevant for a derivation of Kerr/CFT from string theory, whereas the remaining ones are holographic duals of two-dimensional d… ▽ More We study solutions of type IIB supergravity which are SL(2,R) x SU(2) x U(1)^2 invariant deformations of AdS_3 x S^3 x K3 and take the form of products of self-dual spacelike warped AdS_3 and a deformed three-sphere. One of these backgrounds has been recently argued to be relevant for a derivation of Kerr/CFT from string theory, whereas the remaining ones are holographic duals of two-dimensional dipole theories and their S-duals. We show that each of these backgrounds is holographically dual to a deformation of the DLCQ of the D1-D5 CFT by a specific supersymmetric (1,2) operator, which we write down explicitly in terms of twist operators at the free orbifold point. The deforming operator is argued to be exactly marginal with respect to the zero-dimensional nonrelativistic conformal (or Schroedinger) group - which is simply SL(2,R)_L x U(1)_R. Moreover, in the supergravity limit of large N and strong coupling, no other single-trace operators are turned on. We thus propose that the field theory duals to the backgrounds of interest are nonrelativistic CFTs defined by adding the single Schroedinger-invariant (1,2) operator mentioned above to the original CFT action. Our analysis indicates that the rotating extremal black holes we study are best thought of as finite right-moving temperature (non-supersymmetric) states in the above-defined supersymmetric nonrelativistic CFT and hints towards a more general connection between Kerr/CFT and two-dimensional non-relativistic CFTs. △ Less

Submitted 30 December, 2012; v1 submitted 30 August, 2011; originally announced August 2011.

Comments: 48+8 pages, 4 figures; minor corrections and references added

arXiv:1108.0411 [pdf, ps, other]

doi 10.1007/JHEP03(2012)094

Moulting Black Holes

Authors: Iosif Bena, Borun D. Chowdhury, Jan de Boer, Sheer El-Showk, Masaki Shigemori

Abstract: We find a family of novel supersymmetric phases of the D1-D5 CFT, which in certain ranges of charges have more entropy than all known ensembles. We also find bulk BPS configurations that exist in the same range of parameters as these phases, and have more entropy than a BMPV black hole; they can be thought of as coming from a BMPV black hole shedding a "hair" condensate outside of the horizon. The… ▽ More We find a family of novel supersymmetric phases of the D1-D5 CFT, which in certain ranges of charges have more entropy than all known ensembles. We also find bulk BPS configurations that exist in the same range of parameters as these phases, and have more entropy than a BMPV black hole; they can be thought of as coming from a BMPV black hole shedding a "hair" condensate outside of the horizon. The entropy of the bulk configurations is smaller than that of the CFT phases, which indicates that some of the CFT states are lifted at strong coupling. Neither the bulk nor the boundary phases are captured by the elliptic genus, which makes the coincidence of the phase boundaries particularly remarkable. Our configurations are supersymmetric, have non-Cardy-like entropy, and are the first instance of a black hole entropy enigma with a controlled CFT dual. Furthermore, contrary to common lore, these objects exist in a region of parameter space (between the "cosmic censorship bound" and the "unitarity bound") where no black holes were thought to exist. △ Less

Submitted 29 March, 2012; v1 submitted 1 August, 2011; originally announced August 2011.

Comments: 51 pages, 15 figures. Print in color to enjoy. v2: References added, clarifications in Introduction, and a new appendix added to explain units and conventions. v3: the spectral flow argument in section 3 improved

Report number: IPhT-T11/164, ITFA11-11

Journal ref: JHEP 03, 094 (2012)

arXiv:1101.5385 [pdf, ps, other]

doi 10.1016/j.nuclphysb.2011.03.008

What Maxwell Theory in D<>4 teaches us about scale and conformal invariance

Authors: Sheer El-Showk, Yu Nakayama, Slava Rychkov

Abstract: The free Maxwell theory in D<>4 dimensions provides a physical example of a unitary, scale invariant theory which is NOT conformally invariant. The easiest way to see this is that the field strength operator F_mn is neither a primary nor a descendant. We show how conformal multiplets can be completed, and conformality restored, by adding new local operators to the theory. In D>=5, this can only be… ▽ More The free Maxwell theory in D<>4 dimensions provides a physical example of a unitary, scale invariant theory which is NOT conformally invariant. The easiest way to see this is that the field strength operator F_mn is neither a primary nor a descendant. We show how conformal multiplets can be completed, and conformality restored, by adding new local operators to the theory. In D>=5, this can only be done by sacrificing unitarity of the extended Hilbert space. We analyze the full symmetry structure of the extended theory, which turns out to be related to the OSp(D,2|2) superalgebra. △ Less

Submitted 12 February, 2011; v1 submitted 27 January, 2011; originally announced January 2011.

Comments: 20 pages; v2: minor corrections, refs added

Report number: LPTENS-11/05; CALT 68-2819

arXiv:1101.4163 [pdf, ps, other]

doi 10.1007/JHEP10(2012)106

Emergent Spacetime and Holographic CFTs

Authors: Sheer El-Showk, Kyriakos Papadodimas

Abstract: We discuss universal properties of conformal field theories with holographic duals. A central feature of these theories is the existence of a low-lying sector of operators whose correlators factorize. We demonstrate that factorization can only hold in the large central charge limit. Using conformal invariance and factorization we argue that these operators are naturally represented as fields in Ad… ▽ More We discuss universal properties of conformal field theories with holographic duals. A central feature of these theories is the existence of a low-lying sector of operators whose correlators factorize. We demonstrate that factorization can only hold in the large central charge limit. Using conformal invariance and factorization we argue that these operators are naturally represented as fields in AdS as this makes the underlying linearity of the system manifest. In this class of CFTs the solution of the conformal bootstrap conditions can be naturally organized in structures which coincide with Witten diagrams in the bulk. The large value of the central charge suggests that the theory must include a large number of new operators not captured by the factorized sector. Consequently we may think of the AdS hologram as an effective representation of a small sector of the CFT, which is embedded inside a much larger Hilbert space corresponding to the black hole microstates. △ Less

Submitted 16 November, 2012; v1 submitted 21 January, 2011; originally announced January 2011.

Comments: 89 pages, 8 figures, typos corrected

Journal ref: Journal of High Energy Physics, Volume 2012, Issue 10:106

arXiv:0906.0011 [pdf, other]

doi 10.1007/JHEP02(2010)062

A bound on the entropy of supergravity?

Authors: Jan de Boer, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

Abstract: We determine, in two independent ways, the number of BPS quantum states arising from supergravity degrees of freedom in a system with fixed total D4D0 charge. First, we count states generated by quantizing the spacetime degrees of freedom of 'entropyless' multicentered solutions consisting of anti-D0-branes bound to a D6-anti-D6 pair. Second, we determine the number of free supergravity excitati… ▽ More We determine, in two independent ways, the number of BPS quantum states arising from supergravity degrees of freedom in a system with fixed total D4D0 charge. First, we count states generated by quantizing the spacetime degrees of freedom of 'entropyless' multicentered solutions consisting of anti-D0-branes bound to a D6-anti-D6 pair. Second, we determine the number of free supergravity excitations of the corresponding AdS_3 geometry with the same total charge. We find that, although these two approaches yield a priori different sets of states, the leading degeneracies in a large charge expansion are equal to each other and that, furthermore, the number of such states is parametrically smaller than that arising from the D4D0 black hole's entropy. This strongly suggests that supergravity alone is not sufficient to capture all degrees of freedom of large supersymmetric black holes. Comparing the free supergravity calculation to that of the D6-anti-D6-D0 system we find that the bound on the free spectrum imposed by the stringy exclusion principle (a unitarity bound in the dual CFT) seems to be captured in the dynamics of the fully interacting but classcial supergravity equations of motion. △ Less

Submitted 1 June, 2009; originally announced June 2009.

Comments: 33 pages, 5 figures

Journal ref: JHEP 1002:062,2010

arXiv:0811.0263 [pdf, ps, other]

doi 10.1088/0264-9381/25/21/214004

Black Holes as Effective Geometries

Authors: Vijay Balasubramanian, Jan de Boer, Sheer El-Showk, Ilies Messamah

Abstract: Gravitational entropy arises in string theory via coarse graining over an underlying space of microstates. In this review we would like to address the question of how the classical black hole geometry itself arises as an effective or approximate description of a pure state, in a closed string theory, which semiclassical observers are unable to distinguish from the "naive" geometry. In cases with… ▽ More Gravitational entropy arises in string theory via coarse graining over an underlying space of microstates. In this review we would like to address the question of how the classical black hole geometry itself arises as an effective or approximate description of a pure state, in a closed string theory, which semiclassical observers are unable to distinguish from the "naive" geometry. In cases with enough supersymmetry it has been possible to explicitly construct these microstates in spacetime, and understand how coarse-graining of non-singular, horizon-free objects can lead to an effective description as an extremal black hole. We discuss how these results arise for examples in Type II string theory on AdS_5 x S^5 and on AdS_3 x S^3 x T^4 that preserve 16 and 8 supercharges respectively. For such a picture of black holes as effective geometries to extend to cases with finite horizon area the scale of quantum effects in gravity would have to extend well beyond the vicinity of the singularities in the effective theory. By studying examples in M-theory on AdS_3 x S^2 x CY that preserve 4 supersymmetries we show how this can happen. △ Less

Submitted 17 November, 2008; v1 submitted 3 November, 2008; originally announced November 2008.

Comments: Review based on lectures of JdB at CERN RTN Winter School and of VB at PIMS Summer School. 68 pages. Added references

Journal ref: Class.Quant.Grav.25:214004,2008

arXiv:0807.4556 [pdf, other]

doi 10.1088/1126-6708/2009/05/002

Quantizing N=2 Multicenter Solutions

Authors: Jan de Boer, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

Abstract: N=2 supergravity in four dimensions, or equivalently N=1 supergravity in five dimensions, has an interesting set of BPS solutions that each correspond to a number of charged centers. This set contains black holes, black rings and their bound states, as well as many smooth solutions. Moduli spaces of such solutions carry a natural symplectic form which we determine, and which allows us to study t… ▽ More N=2 supergravity in four dimensions, or equivalently N=1 supergravity in five dimensions, has an interesting set of BPS solutions that each correspond to a number of charged centers. This set contains black holes, black rings and their bound states, as well as many smooth solutions. Moduli spaces of such solutions carry a natural symplectic form which we determine, and which allows us to study their quantization. By counting the resulting wavefunctions we come to an independent derivation of some of the wall-crossing formulae. Knowledge of the explicit form of these wavefunctions allows us to find quantum resolutions to some apparent classical paradoxes such as solutions with barely bound centers and those with an infinitely deep throat. We show that quantum effects seem to cap off the throat at a finite depth and we give an estimate for the corresponding mass gap in the dual CFT. This is an interesting example of a system where quantum effects cannot be neglected at macroscopic scales even though the curvature is everywhere small. △ Less

Submitted 29 July, 2008; originally announced July 2008.

Comments: 49 pages + appendices

Report number: ITFA-2008-28, KUL-TF-08/18

arXiv:0807.0892 [pdf]

The driving force behind genomic diversity

Authors: Salla Jaakkola, Sedeer El-Showk, Arto Annila

Abstract: Eukaryote genomes contain excessively introns, inter-genic and other non-genic sequences that appear to have no vital functional role or phenotype manifestation. Their existence, a long-standing puzzle, is viewed from the principle of increasing entropy. According to thermodynamics of open systems, genomes evolve toward diversity by various mechanisms that increase, decrease and distribute genom… ▽ More Eukaryote genomes contain excessively introns, inter-genic and other non-genic sequences that appear to have no vital functional role or phenotype manifestation. Their existence, a long-standing puzzle, is viewed from the principle of increasing entropy. According to thermodynamics of open systems, genomes evolve toward diversity by various mechanisms that increase, decrease and distribute genomic material in response to thermodynamic driving forces. Evolution results in an excessive genome, a high-entropy ecosystem of its own, where copious non-coding segments associate with low-level functions and conserved sequences code coordinated activities. The rate of entropy increase, equivalent to the rate of free energy decrease, is identified with the universal fitness criterion of natural selection that governs populations of genomic entities as well as other species. △ Less

Submitted 6 July, 2008; originally announced July 2008.

Comments: 8 pages, 3 figures

Journal ref: Biophys Chem 134 (2008) 232-238

arXiv:0802.2257 [pdf, other]

doi 10.1088/1126-6708/2008/11/050

Black hole bound states in AdS_3 x S^2

Authors: Jan de Boer, Frederik Denef, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

Abstract: We systematically construct the geometries dual to the 1+1 dimensional (0,4) conformal field theories that arise in the low-energy description of wrapped M5-branes in S^1 x CY_3 compactifications of M-theory. This includes a large number of multicentered black hole bound states asymptotic to AdS_3 x S^2. In addition, we find many geometries that develop multiple, mutually decoupled AdS_3 x S^2 t… ▽ More We systematically construct the geometries dual to the 1+1 dimensional (0,4) conformal field theories that arise in the low-energy description of wrapped M5-branes in S^1 x CY_3 compactifications of M-theory. This includes a large number of multicentered black hole bound states asymptotic to AdS_3 x S^2. In addition, we find many geometries that develop multiple, mutually decoupled AdS_3 x S^2 throats. We argue there is a useful one to one correspondence between the connected components of the space of solutions and particular limits of type IIA attractor flow trees. We point out that there is a thermodynamic instability of small supersymmetric BTZ black holes to localization on the S^2, a supersymmetric and exactly solvable analog of the well known AdS-Schwarzschild localization instability, and identify this with the ``Entropy Enigma'' in four dimensions. We discuss the phase transition this suggests, and initiate the CFT interpretation of these results. △ Less

Submitted 15 February, 2008; originally announced February 2008.

Journal ref: JHEP 0811:050,2008

arXiv:0706.3119 [pdf, ps, other]

doi 10.1088/0264-9381/25/7/075006

G2 Hitchin functionals at one loop

Authors: Jan de Boer, Paul de Medeiros, Sheer El-Showk, Annamaria Sinkovics

Abstract: We consider the quantization of the effective target space description of topological M-theory in terms of the Hitchin functional whose critical points describe seven-manifolds with G2 structure. The one-loop partition function for this theory is calculated and an extended version of it, that is related to generalized G2 geometry, is compared with the topological G2 string. We relate the reducti… ▽ More We consider the quantization of the effective target space description of topological M-theory in terms of the Hitchin functional whose critical points describe seven-manifolds with G2 structure. The one-loop partition function for this theory is calculated and an extended version of it, that is related to generalized G2 geometry, is compared with the topological G2 string. We relate the reduction of the effective action for the extended G2 theory to the Hitchin functional description of the topological string in six dimensions. The dependence of the partition functions on the choice of background G2 metric is also determined. △ Less

Submitted 5 July, 2007; v1 submitted 21 June, 2007; originally announced June 2007.

Comments: 58 pages, LaTeX; v2: Acknowledgments added

Report number: DAMTP-2007-54, EMPG-07-10, ITFA-07-25

Journal ref: Class.Quant.Grav.25:075006,2008

arXiv:hep-th/0611080 [pdf, ps, other]

doi 10.1088/1126-6708/2008/02/012

Open G2 Strings

Authors: Jan de Boer, Paul de Medeiros, Sheer El-Showk, Annamaria Sinkovics

Abstract: We consider an open string version of the topological twist previously proposed for sigma-models with G2 target spaces. We determine the cohomology of open strings states and relate these to geometric deformations of calibrated submanifolds and to flat or anti-self-dual connections on such submanifolds. On associative three-cycles we show that the worldvolume theory is a gauge-fixed Chern-Simons… ▽ More We consider an open string version of the topological twist previously proposed for sigma-models with G2 target spaces. We determine the cohomology of open strings states and relate these to geometric deformations of calibrated submanifolds and to flat or anti-self-dual connections on such submanifolds. On associative three-cycles we show that the worldvolume theory is a gauge-fixed Chern-Simons theory coupled to normal deformations of the cycle. For coassociative four-cycles we find a functional that extremizes on anti-self-dual gauge fields. A brane wrap** the whole G2 induces a seven-dimensional associative Chern-Simons theory on the manifold. This theory has already been proposed by Donaldson and Thomas as the higher-dimensional generalization of real Chern-Simons theory. When the G2 manifold has the structure of a Calabi-Yau times a circle, these theories reduce to a combination of the open A-model on special Lagrangians and the open B+\bar{B}-model on holomorphic submanifolds. We also comment on possible applications of our results. △ Less

Submitted 7 November, 2006; originally announced November 2006.

Comments: 55 pages, no figures

Report number: DAMTP-2006-99, EMPG-06-10, ITFA-06-41, MCTP-06-28

Journal ref: JHEP0802:012,2008

Showing 1–24 of 24 results for author: El-Showk, S