Skip to main content

Showing 1–24 of 24 results for author: El-Showk, S

.
  1. arXiv:2209.07858  [pdf, other

    cs.CL cs.AI cs.CY

    Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

    Authors: Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai, Saurav Kadavath, Ben Mann, Ethan Perez, Nicholas Schiefer, Kamal Ndousse, Andy Jones, Sam Bowman, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Nelson Elhage, Sheer El-Showk, Stanislav Fort, Zac Hatfield-Dodds, Tom Henighan, Danny Hernandez, Tristan Hume, Josh Jacobson, Scott Johnston , et al. (11 additional authors not shown)

    Abstract: We describe our early efforts to red team language models in order to simultaneously discover, measure, and attempt to reduce their potentially harmful outputs. We make three main contributions. First, we investigate scaling behaviors for red teaming across 3 model sizes (2.7B, 13B, and 52B parameters) and 4 model types: a plain language model (LM); an LM prompted to be helpful, honest, and harmle… ▽ More

    Submitted 22 November, 2022; v1 submitted 23 August, 2022; originally announced September 2022.

  2. arXiv:2207.05221  [pdf, other

    cs.CL cs.AI cs.LG

    Language Models (Mostly) Know What They Know

    Authors: Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El-Showk, Andy Jones, Nelson Elhage, Tristan Hume, Anna Chen, Yuntao Bai, Sam Bowman, Stanislav Fort, Deep Ganguli, Danny Hernandez, Josh Jacobson, Jackson Kernion, Shauna Kravec, Liane Lovitt , et al. (11 additional authors not shown)

    Abstract: We study whether language models can evaluate the validity of their own claims and predict which questions they will be able to answer correctly. We first show that larger models are well-calibrated on diverse multiple choice and true/false questions when they are provided in the right format. Thus we can approach self-evaluation on open-ended sampling tasks by asking models to first propose answe… ▽ More

    Submitted 21 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 23+17 pages; refs added, typos fixed

  3. arXiv:2205.10487  [pdf, other

    cs.LG cs.AI

    Scaling Laws and Interpretability of Learning from Repeated Data

    Authors: Danny Hernandez, Tom Brown, Tom Conerly, Nova DasSarma, Dawn Drain, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Tom Henighan, Tristan Hume, Scott Johnston, Ben Mann, Chris Olah, Catherine Olsson, Dario Amodei, Nicholas Joseph, Jared Kaplan, Sam McCandlish

    Abstract: Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repea… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 23 pages, 22 figures

  4. arXiv:2204.05862  [pdf, other

    cs.CL cs.LG

    Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

    Authors: Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, Nicholas Joseph, Saurav Kadavath, Jackson Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei , et al. (6 additional authors not shown)

    Abstract: We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants. We find this alignment training improves performance on almost all NLP evaluations, and is fully compatible with training for specialized skills such as python coding and summarization. We explore an iterated online mode of training, where prefer… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Data available at https://github.com/anthropics/hh-rlhf

  5. Sphingolipid biosynthesis modulates plasmodesmal ultrastructure and phloem unloading

    Authors: Dawei Yan, Shri Yadav, Andrea Paterlini, William Nicolas, Ilya Belevich, Magali Grison, Anne Vaten, Leila Karami, Sedeer El-Showk, Jung-Youn Lee, Gosia Murawska, Jenny Mortimer, Michael Knoblauch, Eija Jokitalo, Jonathan Markham, Emmanuelle Bayer, Ykä Helariutta

    Abstract: During phloem unloading, multiple cell-to-cell transport events move organic substances to the root meristem. Although the primary unloading event from the sieve elements to the phloem pole pericycle has been characterized to some extent, little is known about post-sieve element unloading. Here, we report a novel gene, PHLOEM UNLOADING MODULATOR (PLM), in the absence of which plasmodesmata-mediate… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

    Comments: Nature Plants, Nature Publishing Group, In press

  6. arXiv:1605.08087  [pdf, other

    hep-th

    Extremal bootstrap**: go with the flow

    Authors: Sheer El-Showk, Miguel F. Paulos

    Abstract: The extremal functional method determines approximate solutions to the constraints of crossing symmetry, which saturate bounds on the space of unitary CFTs. We show that such solutions are characterized by extremality conditions, which may be used to flow continuously along the boundaries of parameter space. Along the flow there is generically no further need for optimization, which dramatically r… ▽ More

    Submitted 25 May, 2016; originally announced May 2016.

    Comments: 35+5 pages and lots of nice figures

    Report number: CERN-TH-2016-125

  7. arXiv:1503.02081  [pdf, other

    hep-th cond-mat.stat-mech cond-mat.str-el

    Bootstrap** SCFTs with Four Supercharges

    Authors: Nikolay Bobev, Sheer El-Showk, Dalimil Mazac, Miguel F. Paulos

    Abstract: We study the constraints imposed by superconformal symmetry, crossing symmetry, and unitarity for theories with four supercharges in spacetime dimension $2\leq d\leq 4$. We show how superconformal algebras with four Poincaré supercharges can be treated in a formalism applicable to any, in principle continuous, value of $d$ and use this to construct the superconformal blocks for any $d\leq 4$. We t… ▽ More

    Submitted 28 August, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

    Comments: 43+16 pages, 13 figures. Comments welcome. v2: Minor changes, published version

    Journal ref: JHEP 1508 (2015) 142

  8. arXiv:1502.04124  [pdf, other

    hep-th cond-mat.stat-mech cond-mat.str-el

    Bootstrap** the Three-Dimensional Supersymmetric Ising Model

    Authors: Nikolay Bobev, Sheer El-Showk, Dalimil Mazac, Miguel F. Paulos

    Abstract: We implement the conformal bootstrap program for three-dimensional CFTs with $\mathcal{N}=2$ supersymmetry and find universal constraints on the spectrum of operator dimensions in these theories. By studying the bounds on the dimension of the first scalar appearing in the OPE of a chiral and an anti-chiral primary, we find a kink at the expected location of the critical three-dimensional… ▽ More

    Submitted 9 August, 2015; v1 submitted 13 February, 2015; originally announced February 2015.

    Comments: 5 pages, 6 figures

    Journal ref: Phys. Rev. Lett. 115, 051601 (2015)

  9. arXiv:1403.4545  [pdf, other

    hep-th cond-mat.stat-mech cond-mat.str-el hep-lat math-ph

    Solving the 3d Ising Model with the Conformal Bootstrap II. c-Minimization and Precise Critical Exponents

    Authors: Sheer El-Showk, Miguel F. Paulos, David Poland, Slava Rychkov, David Simmons-Duffin, Alessandro Vichi

    Abstract: We use the conformal bootstrap to perform a precision study of the operator spectrum of the critical 3d Ising model. We conjecture that the 3d Ising spectrum minimizes the central charge c in the space of unitary solutions to crossing symmetry. Because extremal solutions to crossing symmetry are uniquely determined, we are able to precisely reconstruct the first several Z2-even operator dimensions… ▽ More

    Submitted 4 June, 2014; v1 submitted 18 March, 2014; originally announced March 2014.

    Comments: 55 pages, many figures; v2 - refs and comments added, to appear in a special issue of J.Stat.Phys. in memory of Kenneth Wilson

    Report number: CERN-PH-TH/2014-038, NSF-KITP-14-022

    Journal ref: J. Stat. Phys. 157, 869-914 (2014)

  10. arXiv:1309.5089  [pdf, other

    hep-th cond-mat.stat-mech

    Conformal Field Theories in Fractional Dimensions

    Authors: S. El-Showk, M. Paulos, D. Poland, S. Rychkov, D. Simmons-Duffin, A. Vichi

    Abstract: We study the conformal bootstrap in fractional space-time dimensions, obtaining rigorous bounds on operator dimensions. Our results show strong evidence that there is a family of unitary CFTs connecting the 2D Ising model, the 3D Ising model, and the free scalar theory in 4D. We give numerical predictions for the leading operator dimensions and central charge in this family at different values of… ▽ More

    Submitted 12 October, 2015; v1 submitted 19 September, 2013; originally announced September 2013.

    Comments: 11 pages, 4 figures - references updated - one affiliation modified

    Report number: CERN-PH-TH/2013-219

    Journal ref: Phys. Rev. Lett. 112, 141601 (2014)

  11. arXiv:1211.2810  [pdf, other

    hep-th cond-mat.stat-mech

    Bootstrap** Conformal Field Theories with the Extremal Functional Method

    Authors: Sheer El-Showk, Miguel F. Paulos

    Abstract: The existence of a positive linear functional acting on the space of (differences between) conformal blocks has been shown to rule out regions in the parameter space of conformal field theories (CFTs). We argue that at the boundary of the allowed region the extremal functional contains, in principle, enough information to determine the dimensions and OPE coefficients of an infinite number of opera… ▽ More

    Submitted 12 November, 2012; originally announced November 2012.

    Comments: 28 pages, 9 figures, 3 tables

  12. Scaling BPS Solutions and pure-Higgs States

    Authors: Iosif Bena, Micha Berkooz, Jan de Boer, Sheer El-Showk, Dieter Van den Bleeken

    Abstract: Depending on the value of the coupling, BPS states of type II string theory compactified on a Calabi-Yau manifold can be described as multicenter supergravity solutions or as states on the Coulomb or the Higgs branch of a quiver gauge theory. While the Coulomb-branch states can be mapped one-to-one to supergravity states, this is not automatically so for Higgs-branch states. In this paper we expli… ▽ More

    Submitted 22 May, 2012; originally announced May 2012.

    Comments: 37 pages, 4 figures

    Report number: IPhT-T12/041

  13. arXiv:1203.6064  [pdf, other

    hep-th cond-mat.stat-mech

    Solving the 3D Ising Model with the Conformal Bootstrap

    Authors: Sheer El-Showk, Miguel F. Paulos, David Poland, Slava Rychkov, David Simmons-Duffin, Alessandro Vichi

    Abstract: We study the constraints of crossing symmetry and unitarity in general 3D Conformal Field Theories. In doing so we derive new results for conformal blocks appearing in four-point functions of scalars and present an efficient method for their computation in arbitrary space-time dimension. Comparing the resulting bounds on operator dimensions and OPE coefficients in 3D to known results, we find that… ▽ More

    Submitted 1 August, 2012; v1 submitted 27 March, 2012; originally announced March 2012.

    Comments: 32 pages, 11 figures; v2: refs added, small changes in Section 5.3, Fig. 7 replaced; v3: ref added, fits redone in Section 5.4

    Report number: LPTENS-12/07

    Journal ref: Phys. Rev. D 86, 025022 (2012)

  14. Kerr/CFT, dipole theories and nonrelativistic CFTs

    Authors: Sheer El-Showk, Monica Guica

    Abstract: We study solutions of type IIB supergravity which are SL(2,R) x SU(2) x U(1)^2 invariant deformations of AdS_3 x S^3 x K3 and take the form of products of self-dual spacelike warped AdS_3 and a deformed three-sphere. One of these backgrounds has been recently argued to be relevant for a derivation of Kerr/CFT from string theory, whereas the remaining ones are holographic duals of two-dimensional d… ▽ More

    Submitted 30 December, 2012; v1 submitted 30 August, 2011; originally announced August 2011.

    Comments: 48+8 pages, 4 figures; minor corrections and references added

  15. Moulting Black Holes

    Authors: Iosif Bena, Borun D. Chowdhury, Jan de Boer, Sheer El-Showk, Masaki Shigemori

    Abstract: We find a family of novel supersymmetric phases of the D1-D5 CFT, which in certain ranges of charges have more entropy than all known ensembles. We also find bulk BPS configurations that exist in the same range of parameters as these phases, and have more entropy than a BMPV black hole; they can be thought of as coming from a BMPV black hole shedding a "hair" condensate outside of the horizon. The… ▽ More

    Submitted 29 March, 2012; v1 submitted 1 August, 2011; originally announced August 2011.

    Comments: 51 pages, 15 figures. Print in color to enjoy. v2: References added, clarifications in Introduction, and a new appendix added to explain units and conventions. v3: the spectral flow argument in section 3 improved

    Report number: IPhT-T11/164, ITFA11-11

    Journal ref: JHEP 03, 094 (2012)

  16. What Maxwell Theory in D<>4 teaches us about scale and conformal invariance

    Authors: Sheer El-Showk, Yu Nakayama, Slava Rychkov

    Abstract: The free Maxwell theory in D<>4 dimensions provides a physical example of a unitary, scale invariant theory which is NOT conformally invariant. The easiest way to see this is that the field strength operator F_mn is neither a primary nor a descendant. We show how conformal multiplets can be completed, and conformality restored, by adding new local operators to the theory. In D>=5, this can only be… ▽ More

    Submitted 12 February, 2011; v1 submitted 27 January, 2011; originally announced January 2011.

    Comments: 20 pages; v2: minor corrections, refs added

    Report number: LPTENS-11/05; CALT 68-2819

  17. Emergent Spacetime and Holographic CFTs

    Authors: Sheer El-Showk, Kyriakos Papadodimas

    Abstract: We discuss universal properties of conformal field theories with holographic duals. A central feature of these theories is the existence of a low-lying sector of operators whose correlators factorize. We demonstrate that factorization can only hold in the large central charge limit. Using conformal invariance and factorization we argue that these operators are naturally represented as fields in Ad… ▽ More

    Submitted 16 November, 2012; v1 submitted 21 January, 2011; originally announced January 2011.

    Comments: 89 pages, 8 figures, typos corrected

    Journal ref: Journal of High Energy Physics, Volume 2012, Issue 10:106

  18. A bound on the entropy of supergravity?

    Authors: Jan de Boer, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

    Abstract: We determine, in two independent ways, the number of BPS quantum states arising from supergravity degrees of freedom in a system with fixed total D4D0 charge. First, we count states generated by quantizing the spacetime degrees of freedom of 'entropyless' multicentered solutions consisting of anti-D0-branes bound to a D6-anti-D6 pair. Second, we determine the number of free supergravity excitati… ▽ More

    Submitted 1 June, 2009; originally announced June 2009.

    Comments: 33 pages, 5 figures

    Journal ref: JHEP 1002:062,2010

  19. Black Holes as Effective Geometries

    Authors: Vijay Balasubramanian, Jan de Boer, Sheer El-Showk, Ilies Messamah

    Abstract: Gravitational entropy arises in string theory via coarse graining over an underlying space of microstates. In this review we would like to address the question of how the classical black hole geometry itself arises as an effective or approximate description of a pure state, in a closed string theory, which semiclassical observers are unable to distinguish from the "naive" geometry. In cases with… ▽ More

    Submitted 17 November, 2008; v1 submitted 3 November, 2008; originally announced November 2008.

    Comments: Review based on lectures of JdB at CERN RTN Winter School and of VB at PIMS Summer School. 68 pages. Added references

    Journal ref: Class.Quant.Grav.25:214004,2008

  20. Quantizing N=2 Multicenter Solutions

    Authors: Jan de Boer, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

    Abstract: N=2 supergravity in four dimensions, or equivalently N=1 supergravity in five dimensions, has an interesting set of BPS solutions that each correspond to a number of charged centers. This set contains black holes, black rings and their bound states, as well as many smooth solutions. Moduli spaces of such solutions carry a natural symplectic form which we determine, and which allows us to study t… ▽ More

    Submitted 29 July, 2008; originally announced July 2008.

    Comments: 49 pages + appendices

    Report number: ITFA-2008-28, KUL-TF-08/18

  21. arXiv:0807.0892  [pdf

    q-bio.PE q-bio.BM

    The driving force behind genomic diversity

    Authors: Salla Jaakkola, Sedeer El-Showk, Arto Annila

    Abstract: Eukaryote genomes contain excessively introns, inter-genic and other non-genic sequences that appear to have no vital functional role or phenotype manifestation. Their existence, a long-standing puzzle, is viewed from the principle of increasing entropy. According to thermodynamics of open systems, genomes evolve toward diversity by various mechanisms that increase, decrease and distribute genom… ▽ More

    Submitted 6 July, 2008; originally announced July 2008.

    Comments: 8 pages, 3 figures

    Journal ref: Biophys Chem 134 (2008) 232-238

  22. Black hole bound states in AdS_3 x S^2

    Authors: Jan de Boer, Frederik Denef, Sheer El-Showk, Ilies Messamah, Dieter Van den Bleeken

    Abstract: We systematically construct the geometries dual to the 1+1 dimensional (0,4) conformal field theories that arise in the low-energy description of wrapped M5-branes in S^1 x CY_3 compactifications of M-theory. This includes a large number of multicentered black hole bound states asymptotic to AdS_3 x S^2. In addition, we find many geometries that develop multiple, mutually decoupled AdS_3 x S^2 t… ▽ More

    Submitted 15 February, 2008; originally announced February 2008.

    Journal ref: JHEP 0811:050,2008

  23. G2 Hitchin functionals at one loop

    Authors: Jan de Boer, Paul de Medeiros, Sheer El-Showk, Annamaria Sinkovics

    Abstract: We consider the quantization of the effective target space description of topological M-theory in terms of the Hitchin functional whose critical points describe seven-manifolds with G2 structure. The one-loop partition function for this theory is calculated and an extended version of it, that is related to generalized G2 geometry, is compared with the topological G2 string. We relate the reducti… ▽ More

    Submitted 5 July, 2007; v1 submitted 21 June, 2007; originally announced June 2007.

    Comments: 58 pages, LaTeX; v2: Acknowledgments added

    Report number: DAMTP-2007-54, EMPG-07-10, ITFA-07-25

    Journal ref: Class.Quant.Grav.25:075006,2008

  24. Open G2 Strings

    Authors: Jan de Boer, Paul de Medeiros, Sheer El-Showk, Annamaria Sinkovics

    Abstract: We consider an open string version of the topological twist previously proposed for sigma-models with G2 target spaces. We determine the cohomology of open strings states and relate these to geometric deformations of calibrated submanifolds and to flat or anti-self-dual connections on such submanifolds. On associative three-cycles we show that the worldvolume theory is a gauge-fixed Chern-Simons… ▽ More

    Submitted 7 November, 2006; originally announced November 2006.

    Comments: 55 pages, no figures

    Report number: DAMTP-2006-99, EMPG-06-10, ITFA-06-41, MCTP-06-28

    Journal ref: JHEP0802:012,2008