Showing 1–18 of 18 results for author: Wright, L

Searching in archive cs.

  1. arXiv:2406.03372 [pdf, other]

    physics.app-ph cs.LG

    Training of Physical Neural Networks

    Authors: Ali Momeni, Babak Rahmani, Benjamin Scellier, Logan G. Wright, Peter L. McMahon, Clara C. Wanjura, Yuhang Li, Anas Skalli, Natalia G. Berloff, Tatsuhiro Onodera, Ilker Oguz, Francesco Morichetti, Philipp del Hougne, Manuel Le Gallo, Abu Sebastian, Azalia Mirhoseini, Cheng Zhang, Danijela Marković, Daniel Brunner, Christophe Moser, Sylvain Gigan, Florian Marquardt, Aydogan Ozcan, Julie Grollier, Andrea J. Liu, et al. (3 additional authors not shown)

    Abstract: Physical neural networks (PNNs) are a class of neural-like networks that leverage the properties of physical systems to perform computation. While PNNs are so far a niche research area with small-scale laboratory demonstrations, they are arguably one of the most underappreciated important opportunities in modern AI. Could we train AI models 1000x larger than current ones? Could we do this and also…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 29 pages, 4 figures

  2. Null Compliance: NYC Local Law 144 and the Challenges of Algorithm Accountability

    Authors: Lucas Wright, Roxana Mike Muenster, Briana Vecchione, Tianyao Qu, Pika, Cai, COMM/INFO 2450 Student Investigators, Jacob Metcalf, J. Nathan Matias

    Abstract: In July 2023, New York City became the first jurisdiction globally to mandate bias audits for commercial algorithmic systems, specifically for automated employment decisions systems (AEDTs) used in hiring and promotion. Local Law 144 (LL 144) requires AEDTs to be independently audited annually for race and gender bias, and the audit report must be publicly posted. Additionally, employers are oblig…

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2402.17750 [pdf, other]

    physics.optics cs.ET cs.LG

    Scaling on-chip photonic neural processors using arbitrarily programmable wave propagation

    Authors: Tatsuhiro Onodera, Martin M. Stein, Benjamin A. Ash, Mandar M. Sohoni, Melissa Bosch, Ryotatsu Yanagimoto, Marc Jankowski, Timothy P. McKenna, Tianyu Wang, Gennady Shvets, Maxim R. Shcherbakov, Logan G. Wright, Peter L. McMahon

    Abstract: On-chip photonic processors for neural networks have potential benefits in both speed and energy efficiency but have not yet reached the scale at which they can outperform electronic processors. The dominant paradigm for designing on-chip photonics is to make networks of relatively bulky discrete components connected by one-dimensional waveguides. A far more compact alternative is to avoid explici…

    Submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2402.00025 [pdf, other]

    cs.DC cs.AI

    Accelerating a Triton Fused Kernel for W4A16 Quantized Inference with SplitK work decomposition

    Authors: Adnan Hoque, Less Wright, Chih-Chieh Yang, Mudhakar Srivatsa, Raghu Ganti

    Abstract: We propose an implementation of an efficient fused matrix multiplication kernel for W4A16 quantized inference, where we perform dequantization and GEMM in a fused kernel using a SplitK work decomposition. Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads. In particular, this paper surveys the type of matrix multi…

    Submitted 22 February, 2024; v1 submitted 5 January, 2024; originally announced February 2024.

  5. arXiv:2310.18335 [pdf, other]

    cs.ET cs.NE q-bio.NC

    The hardware is the software

    Authors: Jeremie Laydevant, Logan G. Wright, Tianyu Wang, Peter L. McMahon

    Abstract: Human brains and bodies are not hardware running software: the hardware is the software. We reason that because the microscopic physics of artificial-intelligence hardware and of human biological "hardware" is distinct, neuromorphic engineers need to be cautious (and yet also creative) in how we take inspiration from biological intelligence. We should focus primarily on principles and design ideas…

    Submitted 20 October, 2023; originally announced October 2023.

  6. arXiv:2308.15265 [pdf, other]

    cs.IR

    A Multi-Perspective Learning to Rank Approach to Support Children's Information Seeking in the Classroom

    Authors: Garrett Allen, Katherine Landau Wright, Jerry Alan Fails, Casey Kennington, Maria Soledad Pera

    Abstract: We introduce a novel re-ranking model that aims to augment the functionality of standard search engines to support classroom search activities for children (ages 6 to 11). This model extends the known listwise learning-to-rank framework by balancing risk and reward. Doing so enables the model to prioritize Web resources of high educational alignment, appropriateness, and adequate readability by an…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Extended version of the manuscript to appear in proceedings of the 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology

  7. arXiv:2307.15712 [pdf, other]

    physics.optics cs.ET cs.LG cs.NE quant-ph

    Quantum-noise-limited optical neural networks operating at a few quanta per activation

    Authors: Shi-Yuan Ma, Tianyu Wang, Jérémie Laydevant, Logan G. Wright, Peter L. McMahon

    Abstract: Analog physical neural networks, which hold promise for improved energy efficiency and speed compared to digital electronic neural networks, are nevertheless typically operated in a relatively high-power regime so that the signal-to-noise ratio (SNR) is large (>10). What happens if an analog system is instead operated in an ultra-low-power regime, in which the behavior of the system becomes highly…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 55 pages, 27 figures

  8. arXiv:2304.11277 [pdf, other]

    cs.DC cs.AI cs.LG cs.PF

    PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

    Authors: Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li

    Abstract: It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine learning systems research, which has enabled the development and exploration of large models, such abilities remain confined to a small group of advanced users and industry leaders, resulting in an implicit tech…

    Submitted 12 September, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  9. arXiv:2302.12043 [pdf, ps, other]

    cs.CL

    Conversational Agents and Children: Let Children Learn

    Authors: Casey Kennington, Jerry Alan Fails, Katherine Landau Wright, Maria Soledad Pera

    Abstract: Using online information discovery as a case study, in this position paper we discuss the need to design, develop, and deploy (conversational) agents that can -- non-intrusively -- guide children in their quest for online resources rather than simply finding resources for them. We argue that agents should "let children learn" and should be built to take on a teacher-facilitator function, allowing…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 6 pages

  10. arXiv:2302.10360 [pdf, other]

    cs.ET cs.LG cs.NE physics.app-ph physics.optics

    Optical Transformers

    Authors: Maxwell G. Anderson, Shi-Yuan Ma, Tianyu Wang, Logan G. Wright, Peter L. McMahon

    Abstract: The rapidly increasing size of deep-learning models has caused renewed and growing interest in alternatives to digital computers to dramatically reduce the energy cost of running state-of-the-art neural networks. Optical matrix-vector multipliers are best suited to performing computations with very large operands, which suggests that large Transformer models could be a good target for optical comp…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 27 pages, 13 figures

    Journal ref: Transactions on Machine Learning Research, 03/2024, https://openreview.net/forum?id=Xxw0edFFQC

  11. arXiv:2207.14293 [pdf, other]

    physics.optics cs.ET cs.LG

    Image sensing with multilayer, nonlinear optical neural networks

    Authors: Tianyu Wang, Mandar M. Sohoni, Logan G. Wright, Martin M. Stein, Shi-Yuan Ma, Tatsuhiro Onodera, Maxwell G. Anderson, Peter L. McMahon

    Abstract: Optical imaging is commonly used for both scientific and technological applications across industry and academia. In image sensing, a measurement, such as of an object's position, is performed by computational analysis of a digitized image. An emerging image-sensing paradigm breaks this delineation between data collection and analysis by designing optical components to perform not imaging, but enc…

    Submitted 27 July, 2022; originally announced July 2022.

    Journal ref: Nat. Photon. 18, 1-8 (2023)

  12. arXiv:2203.03366 [pdf, other]

    cs.LG cond-mat.str-el quant-ph

    Improvements to Gradient Descent Methods for Quantum Tensor Network Machine Learning

    Authors: Fergus Barratt, James Dborin, Lewis Wright

    Abstract: Tensor networks have demonstrated significant value for machine learning in a myriad of different applications. However, optimizing tensor networks using standard gradient descent has proven to be difficult in practice. Tensor networks suffer from initialization problems resulting in exploding or vanishing gradients and require extensive hyperparameter tuning. Efforts to overcome these problems us…

    Submitted 3 March, 2022; originally announced March 2022.

    Journal ref: Second Workshop on Quantum Tensor Networks in Machine Learning, 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  13. arXiv:2106.13731 [pdf, other]

    cs.LG

    Ranger21: a synergistic deep learning optimizer

    Authors: Less Wright, Nestor Demeure

    Abstract: As optimizers are critical to the performances of neural networks, every year a large number of papers innovating on the subject are published. However, while most of these publications provide incremental improvements to existing algorithms, they tend to be presented as new optimizers rather than composable algorithms. Thus, many worthwhile improvements are rarely seen out of their initial public…

    Submitted 6 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: for associated code, see https://github.com/lessw2020/Ranger21

    ACM Class: I.2.6

  14. arXiv:2105.03456 [pdf, other]

    cs.CY cs.HC cs.IR

    CASTing a Net: Supporting Teachers with Search Technology

    Authors: Garrett Allen, Katherine Landau Wright, Jerry Alan Fails, Casey Kennington, Maria Soledad Pera

    Abstract: Past and current research has typically focused on ensuring that search technology for the classroom serves children. In this paper, we argue for the need to broaden the research focus to include teachers and how search technology can aid them. In particular, we share how furnishing a behind-the-scenes portal for teachers can empower them by providing a window into the spelling, writing, and conce…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: KidRec '21: 5th International and Interdisciplinary Perspectives on Children & Recommender and Information Retrieval Systems (KidRec) Search and Recommendation Technology through the Lens of a Teacher- Co-located with ACM IDC 2021

  15. arXiv:2104.13467 [pdf, other]

    physics.optics cs.ET cs.LG cs.NE

    An optical neural network using less than 1 photon per multiplication

    Authors: Tianyu Wang, Shi-Yuan Ma, Logan G. Wright, Tatsuhiro Onodera, Brian Richard, Peter L. McMahon

    Abstract: Deep learning has rapidly become a widespread tool in both scientific and commercial endeavors. Milestones of deep learning exceeding human performance have been achieved for a growing number of tasks over the past several years, across areas as diverse as game-playing, natural-language translation, and medical-image analysis. However, continued progress is increasingly hampered by the high energy…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: 42 pages, 21 figures

    Journal ref: Nature Communications 13, 123 (2022)

  16. arXiv:2104.13386 [pdf, other]

    cs.LG cond-mat.dis-nn cs.ET physics.optics

    Deep physical neural networks enabled by a backpropagation algorithm for arbitrary physical systems

    Authors: Logan G. Wright, Tatsuhiro Onodera, Martin M. Stein, Tianyu Wang, Darren T. Schachter, Zoey Hu, Peter L. McMahon

    Abstract: Deep neural networks have become a pervasive tool in science and engineering. However, modern deep neural networks' growing energy requirements now increasingly limit their scaling and broader use. We propose a radical alternative for implementing deep neural network models: Physical Neural Networks. We introduce a hybrid physical-digital algorithm called Physics-Aware Training to efficiently trai…

    Submitted 27 April, 2021; originally announced April 2021.

    Journal ref: Nature 601, 549-555 (2022)

  17. arXiv:2102.00645 [pdf, other]

    cs.CV

    An End-to-End Food Image Analysis System

    Authors: Jiangpeng He, Runyu Mao, Zeman Shao, Janine L. Wright, Deborah A. Kerr, Carol J. Boushey, Fengqing Zhu

    Abstract: Modern deep learning techniques have enabled advances in image-based dietary assessment such as food recognition and food portion size estimation. Valuable information on the types of foods and the amount consumed are crucial for prevention of many chronic diseases. However, existing methods for automated image-based food analysis are neither end-to-end nor are capable of processing multiple tasks…

    Submitted 1 February, 2021; originally announced February 2021.

  18. arXiv:1807.04599 [pdf, other]

    cs.DS physics.comp-ph quant-ph

    Benchmarking treewidth as a practical component of tensor-network--based quantum simulation

    Authors: Eugene F. Dumitrescu, Allison L. Fisher, Timothy D. Goodrich, Travis S. Humble, Blair D. Sullivan, Andrew L. Wright

    Abstract: Tensor networks are powerful factorization techniques which reduce resource requirements for numerically simulating principal quantum many-body systems and algorithms. The computational complexity of a tensor network simulation depends on the tensor ranks and the order in which they are contracted. Unfortunately, computing optimal contraction sequences (orderings) in general is known to be a compu…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: Open source code available