
Showing 1–7 of 7 results for author: Wu, C M

  1. arXiv:2406.06262  [pdf, other]

    cs.NE cs.AI

    Modular Growth of Hierarchical Networks: Efficient, General, and Robust Curriculum Learning

    Authors: Mani Hamidi, Sina Khajehabdollahi, Emmanouil Giannakakis, Tim Schäfer, Anna Levina, Charley M. Wu

    Abstract: Structural modularity is a pervasive feature of biological neural networks, which have been linked to several functional and computational advantages. Yet, the use of modular architectures in artificial neural networks has been relatively limited despite early successes. Here, we explore the performance and functional dynamics of a modular network trained on a memory task via an iterative growth c…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2405.05294  [pdf, other]

    cs.HC cs.CL cs.IT cs.LG cs.SC stat.ML

    Harmonizing Program Induction with Rate-Distortion Theory

    Authors: Hanqi Zhou, David G. Nagy, Charley M. Wu

    Abstract: Many aspects of human learning have been proposed as a process of constructing mental programs: from acquiring symbolic number representations to intuitive theories about the world. In parallel, there is a long tradition of using information processing to model human cognition through Rate Distortion Theory (RDT). Yet, it is still poorly understood how to apply RDT when mental representations take…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: CogSci 2024

  3. arXiv:2403.13179  [pdf, other]

    cs.LG cs.CY stat.ML

    Predictive, scalable and interpretable knowledge tracing on structured domains

    Authors: Hanqi Zhou, Robert Bamler, Charley M. Wu, Álvaro Tejero-Cantero

    Abstract: Intelligent tutoring systems optimize the selection and timing of learning materials to enhance understanding and long-term retention. This requires estimates of both the learner's progress ("knowledge tracing"; KT), and the prerequisite structure of the learning domain ("knowledge mapping"). While recent deep learning models achieve high KT accuracy, they do so at the expense of the interpret…

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2312.10343  [pdf, other]

    eess.SP cs.AR cs.LG cs.NE

    In-Sensor Radio Frequency Computing for Energy-Efficient Intelligent Radar

    Authors: Yang Sui, Minning Zhu, Lingyi Huang, Chung-Tse Michael Wu, Bo Yuan

    Abstract: Radio Frequency Neural Networks (RFNNs) have demonstrated advantages in realizing intelligent applications across various domains. However, as the model size of deep neural networks rapidly increases, implementing large-scale RFNN in practice requires an extensive number of RF interferometers and consumes a substantial amount of energy. To address this challenge, we propose to utilize low-rank dec…

    Submitted 16 December, 2023; originally announced December 2023.

  5. arXiv:2310.00066  [pdf]

    cond-mat.dis-nn cond-mat.mtrl-sci cs.AR cs.NE

    Temporal credit assignment for one-shot learning utilizing a phase transition material

    Authors: Alessandro R. Galloni, Yifan Yuan, Minning Zhu, Haoming Yu, Ravindra S. Bisht, Chung-Tse Michael Wu, Christine Grienberger, Shriram Ramanathan, Aaron D. Milstein

    Abstract: Design of hardware based on biological principles of neuronal computation and plasticity in the brain is a leading approach to realizing energy- and sample-efficient artificial intelligence and learning machines. An important factor in selection of the hardware building blocks is the identification of candidate materials with physical properties suitable to emulate the large dynamic ranges and var…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 37 pages, 5 figures, 6 supplementary figures

  6. A Reconfigurable Linear RF Analog Processor for Realizing Microwave Artificial Neural Network

    Authors: Minning Zhu, Tzu-Wei Kuo, Chung-Tse Michael Wu

    Abstract: Owing to the data explosion and rapid development of artificial intelligence (AI), particularly deep neural networks (DNNs), the ever-increasing demand for large-scale matrix-vector multiplication has become one of the major issues in machine learning (ML). Training and evaluating such neural networks rely on heavy computational resources, resulting in significant system latency and power consumpt…

    Submitted 24 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 11 pages, 16 figures

  7. arXiv:2304.02351  [pdf, other]

    cs.MA

    Constructing and deconstructing bias: modeling privilege and mentorship in agent-based simulations

    Authors: Andria L. Smith, Simon Heuschkel, Ksenia Keplinger, Charley M. Wu

    Abstract: Bias exists in how we pick leaders, who we perceive as being influential, and who we interact with, not only in society, but in organizational contexts. Drawing from leadership emergence and social influence theories, we investigate potential interventions that support diverse leaders. Using agent-based simulations, we model a collective search process on a fitness landscape. Agents combine indivi…

    Submitted 5 April, 2023; originally announced April 2023.