Skip to main content

Showing 1–13 of 13 results for author: Soklaski, R

.
  1. arXiv:2406.10162  [pdf, other

    cs.AI cs.CL

    Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

    Authors: Carson Denison, Monte MacDiarmid, Fazl Barez, David Duvenaud, Shauna Kravec, Samuel Marks, Nicholas Schiefer, Ryan Soklaski, Alex Tamkin, Jared Kaplan, Buck Shlegeris, Samuel R. Bowman, Ethan Perez, Evan Hubinger

    Abstract: In reinforcement learning, specification gaming occurs when AI systems learn undesired behaviors that are highly rewarded due to misspecified training goals. Specification gaming can range from simple behaviors like sycophancy to sophisticated and pernicious behaviors like reward-tampering, where a model directly modifies its own reward mechanism. However, these more pernicious behaviors may be to… ▽ More

    Submitted 28 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Make it easier to find samples from the model, and highlight that our operational definition of reward tampering has false positives where the model attempts to complete the task honestly but edits the reward. Add paragraph to conclusion to this effect, and add sentence to figure 1 to this effect

  2. arXiv:2202.12412  [pdf, other

    cs.CV cs.LG

    Fourier-Based Augmentations for Improved Robustness and Uncertainty Calibration

    Authors: Ryan Soklaski, Michael Yee, Theodoros Tsiligkaridis

    Abstract: Diverse data augmentation strategies are a natural approach to improving robustness in computer vision models against unforeseen shifts in data distribution. However, the ability to tailor such strategies to inoculate a model against specific classes of corruptions or attacks -- without incurring substantial losses in robustness against other classes of corruptions -- remains elusive. In this work… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

  3. arXiv:2202.03188  [pdf

    cs.AI

    Knowledge-Integrated Informed AI for National Security

    Authors: Anu K. Myne, Kevin J. Leahy, Ryan J. Soklaski

    Abstract: The state of artificial intelligence technology has a rich history that dates back decades and includes two fall-outs before the explosive resurgence of today, which is credited largely to data-driven techniques. While AI technology has and continues to become increasingly mainstream with impact across domains and industries, it's not without several drawbacks, weaknesses, and potential to cause u… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Report number: Technical Report TR-1272

  4. arXiv:2201.05647  [pdf, other

    cs.LG cs.AI cs.SE

    Tools and Practices for Responsible AI Engineering

    Authors: Ryan Soklaski, Justin Goodwin, Olivia Brown, Michael Yee, Jason Matterer

    Abstract: Responsible Artificial Intelligence (AI) - the practice of develo**, evaluating, and maintaining accurate AI systems that also exhibit essential properties such as robustness and explainability - represents a multifaceted challenge that often stretches standard machine learning tooling, frameworks, and testing methods beyond their limits. In this paper, we present two new software libraries - hy… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  5. Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning

    Authors: David Mascharka, Philip Tran, Ryan Soklaski, Arjun Majumdar

    Abstract: Visual question answering requires high-order reasoning about an image, which is a fundamental capability needed by machine systems to follow complex directives. Recently, modular networks have been shown to be an effective framework for performing visual reasoning tasks. While modular networks were initially designed with a degree of model transparency, their performance on complex visual reasoni… ▽ More

    Submitted 2 July, 2018; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: CVPR 2018 pre-print

  6. arXiv:1508.02736  [pdf, other

    cond-mat.mtrl-sci cond-mat.dis-nn cond-mat.soft

    A Dramatically Growing Shear Rigidity Length Scale in a Supercooled Glass Former ($NiZr_2$)

    Authors: Nicholas B. Weingartner, Ryan Soklaski, K. F. Kelton, Zohar Nussinov

    Abstract: Finding a suitably growing length scale that increases in tandem with the immense viscous slowdown of supercooled liquids is an open problem associated with the glass transition. Here, we define and demonstrate the existence of one such length scale which may be experimentally verifiable. This is the length scale over which external shear perturbations appreciably penetrate into a liquid as the gl… ▽ More

    Submitted 6 April, 2016; v1 submitted 11 August, 2015; originally announced August 2015.

    Comments: 10 pages, 12 figures;Renamed, Massive Revisions, PRB Accepted

    Journal ref: Phys. Rev. B 93, 214201 (2016)

  7. arXiv:1502.01739  [pdf, ps, other

    cond-mat.dis-nn cond-mat.mtrl-sci cond-mat.soft physics.comp-ph

    A locally preferred structure characterises all dynamical regimes of a supercooled liquid

    Authors: Ryan Soklaski, Vy Tran, Zohar Nussinov, K. F. Kelton, Li Yang

    Abstract: Recent experimental results suggest that metallic liquids universally exhibit a high-temperature dynamical crossover, which is correlated with the glass transition temperature ($T_{g}$). We demonstrate, using molecular dynamics results for Cu64Zr36, that this temperature, $T_{A} \approx 2 \times T_{g}$, is linked with cooperative atomic rearrangements that produce domains of connected icosahedra.… ▽ More

    Submitted 23 March, 2016; v1 submitted 5 February, 2015; originally announced February 2015.

    Comments: 21 pages with 9 figures, Philosophical Magazine, 2016

  8. arXiv:1405.2836  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.comp-ph

    Enhanced Thermoelectric Efficiency via Orthogonal Electrical and Thermal Conductances in Phosphorene

    Authors: Ruixiang Fei, Alireza Faghaninia, Ryan Soklaski, Jia-An Yan, Cynthia Lo, Li Yang

    Abstract: Thermoelectric devices that utilize the Seebeck effect convert heat flow into electrical energy and are highly desirable for the development of portable, solid state, passively-powered electronic systems. The conversion efficiencies of such devices are quantified by the dimensionless thermoelectric figure of merit (ZT), which is proportional to the ratio of a device's electrical conductance to its… ▽ More

    Submitted 12 May, 2014; originally announced May 2014.

    Comments: 22 pages with 6 figures

    Journal ref: Nano Lett, 14, 6393 (2014)

  9. arXiv:1402.4192  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Tunable Band Gap and Anisotropic Optical Response in Few-layer Black Phosphorus

    Authors: Vy Tran, Ryan Soklaski, Yufeng Liang, Li Yang

    Abstract: We report the quasiparticle band gap, excitons, and highly anisotropic optical responses of few-layer black phosphorous (phosphorene). It is shown that these new materials exhibit unique many-electron effects; the electronic structures are dispersive essentially along one dimension, leading to particularly enhanced self-energy corrections and excitonic effects. Additionally, within a wide energy r… ▽ More

    Submitted 15 April, 2014; v1 submitted 17 February, 2014; originally announced February 2014.

    Comments: 12 pages with 5 figures and 1 table

    Journal ref: Phys. Rev. B 89, 235319 (2014)

  10. arXiv:1401.6663  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    New Mechanism for Strongly Bound Excitons in Gapless Two-Dimensional Structures

    Authors: Yufeng Liang, Ryan Soklaski, Shouting Huang, Matthew W. Graham, Robin Havener, Jiwoong Park, Li Yang

    Abstract: Common wisdom asserts that bound excitons cannot form in high-dimensional (d>1) metallic structures because of their overwhelming screening and unavoidable resonance with nearby continuous bands. Strikingly, here we illustrate that this prevalent assumption is not quite true. A key ingredient that has been overlooked is that of viable decoherence that thwarts the formation of resonances. As an exa… ▽ More

    Submitted 26 January, 2014; originally announced January 2014.

    Comments: 12 pages and 5 figures

    Journal ref: Phys. Rev. B 90, 115418 (2014)

  11. arXiv:1401.5732  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Temperature Renormalization of Optical Spectra of Monolayer MoS2

    Authors: Ryan Soklaski, Yufeng Liang, Changjian Zhang, Haining Wang, Farhan Rana, Li Yang

    Abstract: Newly measured optical absorption and photoluminescence spectra reveal substantial frequency shifts of both exciton and trion peaks as monolayer MoS2 is cooled from 363 K to 4 K. First-principles simulations using the GW-Bethe-Salpeter Equation approach satisfactorily reproduce these frequency shifts by incorporating many-electron interactions and the thermal expansion of the in-plane lattice cons… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.

    Comments: 12 pages and 4 figures

    Journal ref: Appl. Phys. Lett. 104, 193110 (2014)

  12. arXiv:1306.0620  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Quasiparticle band-edge energy and band offsets of monolayer of molybdenum and tungsten chalcogenide

    Authors: Yufeng Liang, Shouting Huang, Ryan Soklaski, Li Yang

    Abstract: We report the quasiparticle energy of monolayer of molybdenum and tungsten dichalcogenides, MX2 (M=Mo, W; X=S, Se, Te). Beyond calculating bandgaps, we have achieved converged absolute band energies relative to the vacuum level. Compared with the results from other approaches, the GW calculation reveals substantially larger bandgaps and different absolute band energies because of enhanced many-ele… ▽ More

    Submitted 26 July, 2013; v1 submitted 3 June, 2013; originally announced June 2013.

    Journal ref: Appl. Phys. Lett. 103, 042106 (2013)

  13. arXiv:1302.1895  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.dis-nn

    Connectivity of the Icosahedral Network and a Dramatically Growing Static Length Scale in Cu-Zr Binary Metallic Glasses

    Authors: Ryan Soklaski, Zohar Nussinov, Zachary Markow, K. F. Kelton, Li Yang

    Abstract: We report on and characterize, via molecular dynamics (MD) studies, the evolution of the structure of Cu50Zr50 and Cu64Zr36 metallic glasses (MGs) as temperature is varied. Interestingly, a percolating icosahedral network appears in the Cu64Zr36 system as it is supercooled. This leads us to introduce a static length scale, which grows dramatically as this three dimensional system approaches the gl… ▽ More

    Submitted 7 February, 2013; originally announced February 2013.

    Comments: 9 pages and 8 figures

    Journal ref: Phys. Rev. B 87, 184203 (2013)