Skip to main content

Showing 1–5 of 5 results for author: Terzić, A

.
  1. arXiv:2407.02060  [pdf, other

    cs.LG cs.AI cs.SC

    Terminating Differentiable Tree Experts

    Authors: Jonathan Thomm, Michael Hersche, Giacomo Camposampiero, Aleksandar Terzić, Bernhard Schölkopf, Abbas Rahimi

    Abstract: We advance the recently proposed neuro-symbolic Differentiable Tree Machine, which learns tree operations using a combination of transformers and Tensor Product Representations. We investigate the architecture and propose two key components. We first remove a series of different transformer layers that are used in every step by introducing a mixture of experts. This results in a Differentiable Tre… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024

  2. arXiv:2406.19121  [pdf, other

    cs.LG cs.AI cs.SC

    Towards Learning Abductive Reasoning using VSA Distributed Representations

    Authors: Giacomo Camposampiero, Michael Hersche, Aleksandar Terzić, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

    Abstract: We introduce the Abductive Rule Learner with Context-awareness (ARLC), a model that solves abstract reasoning tasks based on Learn-VRF. ARLC features a novel and more broadly applicable training objective for abductive reasoning, resulting in better interpretability and higher accuracy when solving Raven's progressive matrices (RPM). ARLC allows both programming domain knowledge and learning the r… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024

  3. arXiv:2402.05785  [pdf, other

    cs.LG cs.AI cs.CL

    Limits of Transformer Language Models on Learning to Compose Algorithms

    Authors: Jonathan Thomm, Aleksandar Terzic, Giacomo Camposampiero, Michael Hersche, Bernhard Schölkopf, Abbas Rahimi

    Abstract: We analyze the capabilities of Transformer language models in learning compositional discrete tasks. To this end, we evaluate training LLaMA models and prompting GPT-4 and Gemini on four tasks demanding to learn a composition of several discrete sub-tasks. On both training LLaMA models from scratch and prompting on GPT-4 and Gemini, we measure how well these models can reuse primitives observable… ▽ More

    Submitted 25 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2312.05605  [pdf, other

    cs.LG cs.CV

    TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing

    Authors: Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: MEGA is a recent transformer-based architecture, which utilizes a linear recurrent operator whose parallel computation, based on the FFT, scales as $O(LlogL)$, with $L$ being the sequence length. We build upon their approach by replacing the linear recurrence with a special temporal convolutional network which permits larger receptive field size with shallower networks, and reduces the computation… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  5. arXiv:2303.13957  [pdf, other

    cs.CV cs.LG cs.NE

    Factorizers for Distributed Sparse Block Codes

    Authors: Michael Hersche, Aleksandar Terzic, Geethan Karunaratne, Jovin Langenegger, Angéline Pouget, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Distributed sparse block codes (SBCs) exhibit compact representations for encoding and manipulating symbolic data structures using fixed-width vectors. One major challenge however is to disentangle, or factorize, the distributed representation of data structures into their constituent elements without having to search through all possible combinations. This factorization becomes more challenging w… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted at Neurosymbolic Artificial Intelligence