Search | arXiv e-print repository

arXiv:2401.14569 [pdf, other]

Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis

Authors: Hale Sirin, Sabrina Li, Tom Lippincott

Abstract: In this study, we present a generalizable workflow to identify documents in a historic language with a nonstandard language and script combination, Armeno-Turkish. We introduce the task of detecting distinct patterns of multilinguality based on the frequency of structured language alternations within a document.

Submitted 25 January, 2024; originally announced January 2024.

Comments: Accepted to LaTeCH@EACL2024

arXiv:2401.13905 [pdf, other]

Dynamic embedded topic models and change-point detection for exploring literary-historical hypotheses

Authors: Hale Sirin, Tom Lippincott

Abstract: We present a novel combination of dynamic embedded topic models and change-point detection to explore diachronic change of lexical semantic modality in classical and early Christian Latin. We demonstrate several methods for finding and characterizing patterns in the output, and relating them to traditional scholarship in Comparative Literature and Classics. This simple approach to unsupervised models of semantic change can be applied to any suitable corpus, and we conclude with future directions and refinements aiming to allow noisier, less-curated materials to meet that threshold.

Submitted 24 January, 2024; originally announced January 2024.

Comments: Accepted to LaTeCH@EACL2024

arXiv:2110.14038 [pdf, ps, other]

Robustness of Graph Neural Networks at Scale

Authors: Simon Geisler, Tobias Schmidt, Hakan Şirin, Daniel Zügner, Aleksandar Bojchevski, Stephan Günnemann

Abstract: Graph Neural Networks (GNNs) are increasingly important given their popularity and the diversity of applications. Yet, existing studies of their vulnerability to adversarial attacks rely on relatively small graphs. We address this gap and study how to attack and defend GNNs at scale. We propose two sparsity-aware first-order optimization attacks that maintain an efficient representation despite optimizing over a number of parameters which is quadratic in the number of nodes. We show that common surrogate losses are not well-suited for global attacks on GNNs. Our alternatives can double the attack strength. Moreover, to improve GNNs' reliability we design a robust aggregation function, Soft Median, resulting in an effective defense at all scales. We evaluate our attacks and defense with standard GNNs on graphs more than 100 times larger compared to previous work. We even scale one order of magnitude further by extending our techniques to a scalable GNN.

Submitted 30 April, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

Comments: 39 pages, 22 figures, 17 tables; NeurIPS 2021

Showing 1–3 of 3 results for author: Şirin, H