Skip to main content

Showing 1–5 of 5 results for author: Rethmeier, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.02763  [pdf, other

    cs.CY cs.CL cs.CR cs.LG

    VendorLink: An NLP approach for Identifying & Linking Vendor Migrants & Potential Aliases on Darknet Markets

    Authors: Vageesh Saxena, Nils Rethmeier, Gijs Van Dijck, Gerasimos Spanakis

    Abstract: The anonymity on the Darknet allows vendors to stay undetected by using multiple vendor aliases or frequently migrating between markets. Consequently, illegal markets and their connections are challenging to uncover on the Darknet. To identify relationships between illegal markets and their vendors, we propose VendorLink, an NLP-based approach that examines writing patterns to verify, identify, an… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  2. arXiv:2202.06671  [pdf, other

    cs.CL

    Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

    Authors: Malte Ostendorff, Nils Rethmeier, Isabelle Augenstein, Bela Gipp, Georg Rehm

    Abstract: Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics. Prior work relies on discrete citation relations to generate contrast samples. However, discrete citations enforce a hard cut-off to similarity. This is counter-i… ▽ More

    Submitted 19 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to EMNLP 2022

  3. arXiv:2102.12982  [pdf, other

    cs.CL cs.AI cs.CV

    A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives

    Authors: Nils Rethmeier, Isabelle Augenstein

    Abstract: Modern natural language processing (NLP) methods employ self-supervised pretraining objectives such as masked language modeling to boost the performance of various application tasks. These pretraining methods are frequently extended with recurrence, adversarial or linguistic property masking, and more recently with contrastive learning objectives. Contrastive self-supervised training objectives en… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  4. arXiv:2010.01061  [pdf, other

    cs.CL cs.LG

    Data-Efficient Pretraining via Contrastive Self-Supervision

    Authors: Nils Rethmeier, Isabelle Augenstein

    Abstract: For natural language processing `text-to-text' tasks, the prevailing approaches heavily rely on pretraining large self-supervised models on increasingly larger `task-external' data. Transfer learning from high-resource pretraining works well, but research has focused on settings with very large data and compute requirements, while the potential of efficient low-resource learning, without large `ta… ▽ More

    Submitted 15 April, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Majorly reworked version. Comparison to a large-scale RoBERTa model added. Focus on learning efficiency comparison to self and RoBERTa

  5. arXiv:1912.00982  [pdf, other

    cs.LG cs.CL stat.ML

    TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

    Authors: Nils Rethmeier, Vageesh Kumar Saxena, Isabelle Augenstein

    Abstract: While state-of-the-art NLP explainability (XAI) methods focus on explaining per-sample decisions in supervised end or probing tasks, this is insufficient to explain and quantify model knowledge transfer during (un-)supervised training. Thus, for TX-Ray, we modify the established computer vision explainability principle of 'visualizing preferred inputs of neurons' to make it usable transfer analysi… ▽ More

    Submitted 19 June, 2020; v1 submitted 2 December, 2019; originally announced December 2019.