Skip to main content

Showing 1–2 of 2 results for author: Behjati, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.17284  [pdf, other

    cs.CL

    Learning to Abstract with Nonparametric Variational Information Bottleneck

    Authors: Melika Behjati, Fabio Fehr, James Henderson

    Abstract: Learned representations at the level of characters, sub-words, words and sentences, have each contributed to advances in understanding different NLP tasks and linguistic phenomena. However, learning textual embeddings is costly as they are tokenization specific and require different models to be trained for each level of abstraction. We introduce a novel language representation model which can lea… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

  2. arXiv:2102.01223  [pdf, other

    cs.CL cs.LG

    Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention

    Authors: Melika Behjati, James Henderson

    Abstract: Characters do not convey meaning, but sequences of characters do. We propose an unsupervised distributional method to learn the abstract meaningful units in a sequence of characters. Rather than segmenting the sequence, our Dynamic Capacity Slot Attention model discovers continuous representations of the objects in the sequence, extending an architecture for object discovery in images. We train ou… ▽ More

    Submitted 16 January, 2024; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted to TMLR 2023