Skip to main content

Showing 1–13 of 13 results for author: Stöckel, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11549  [pdf, other

    cs.CL cs.AI

    Syntactic Language Change in English and German: Metrics, Parsers, and Convergences

    Authors: Yanran Chen, Wei Zhao, Anne Breitbarth, Manuel Stoeckel, Alexander Mehler, Steffen Eger

    Abstract: Many studies have shown that human languages tend to optimize for lower complexity and increased communication efficiency. Syntactic dependency distance, which measures the linear distance between dependent words, is often considered a key indicator of language processing difficulty and working memory load. The current paper looks at diachronic trends in syntactic language change in both English a… ▽ More

    Submitted 28 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Updated to the current version

  2. arXiv:2204.08848  [pdf, ps, other

    cs.CL

    I still have Time(s): Extending HeidelTime for German Texts

    Authors: Andy Lücking, Manuel Stoeckel, Giuseppe Abrami, Alexander Mehler

    Abstract: HeidelTime is one of the most widespread and successful tools for detecting temporal expressions in texts. Since HeidelTime's pattern matching system is based on regular expression, it can be extended in a convenient way. We present such an extension for the German resources of HeidelTime: HeidelTime-EXT . The extension has been brought about by means of observing false negatives within real world… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: LREC 2022

  3. arXiv:1910.03387  [pdf, other

    cs.CL cs.LG

    When Specialization Helps: Using Pooled Contextualized Embeddings to Detect Chemical and Biomedical Entities in Spanish

    Authors: Manuel Stoeckel, Wahed Hemati, Alexander Mehler

    Abstract: The recognition of pharmacological substances, compounds and proteins is an essential preliminary work for the recognition of relations between chemicals and other biomedically relevant units. In this paper, we describe an approach to Task 1 of the PharmaCoNER Challenge, which involves the recognition of mentions of chemicals and drugs in Spanish medical texts. We train a state-of-the-art BiLSTM-C… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: EMNLP-IJCNLP 2019: International Workshop on BioNLP Open Shared Tasks 2019, 5, pages, 1 figure

  4. arXiv:1907.13237  [pdf, other

    cs.CL cs.LG

    SenseFitting: Sense Level Semantic Specialization of Word Embeddings for Word Sense Disambiguation

    Authors: Manuel Stoeckel, Sajawel Ahmed, Alexander Mehler

    Abstract: We introduce a neural network-based system of Word Sense Disambiguation (WSD) for German that is based on SenseFitting, a novel method for optimizing WSD. We outperform knowledge-based WSD methods by up to 25% F1-score and produce a new state-of-the-art on the German sense-annotated dataset WebCAGe. Our method uses three feature vectors consisting of a) sense, b) gloss, and c) relational vectors t… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: Sketch for LREC 2020 submission

  5. arXiv:1709.01960  [pdf, other

    cs.DS

    Constructing Light Spanners Deterministically in Near-Linear Time

    Authors: Stephen Alstrup, Søren Dahlgaard, Arnold Filtser, Morten Stöckel, Christian Wulff-Nilsen

    Abstract: Graph spanners are well-studied and widely used both in theory and practice. In a recent breakthrough, Chechik and Wulff-Nilsen [CW18] improved the state-of-the-art for light spanners by constructing a $(2k-1)(1+ε)$-spanner with $O(n^{1+1/k})$ edges and $O_ε(n^{1/k})$ lightness. Soon after, Filtser and Solomon [FS19] showed that the classic greedy spanner construction achieves the same bounds The… ▽ More

    Submitted 19 January, 2022; v1 submitted 6 September, 2017; originally announced September 2017.

  6. arXiv:1704.02178  [pdf, other

    cs.DS

    New Subquadratic Approximation Algorithms for the Girth

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: We consider the problem of approximating the girth, $g$, of an unweighted and undirected graph $G=(V,E)$ with $n$ nodes and $m$ edges. A seminal result of Itai and Rodeh [SICOMP'78] gave an additive $1$-approximation in $O(n^2)$ time, and the main open question is thus how well we can do in subquadratic time. In this paper we present two main results. The first is a $(1+\varepsilon,O(1))$-approx… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

  7. arXiv:1703.10380  [pdf, other

    cs.DS

    Finding Even Cycles Faster via Capped k-Walks

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: In this paper, we consider the problem of finding a cycle of length $2k$ (a $C_{2k}$) in an undirected graph $G$ with $n$ nodes and $m$ edges for constant $k\ge2$. A classic result by Bondy and Simonovits [J.Comb.Th.'74] implies that if $m \ge100k n^{1+1/k}$, then $G$ contains a $C_{2k}$, further implying that one needs to consider only graphs with $m = O(n^{1+1/k})$. Previously the best known a… ▽ More

    Submitted 30 March, 2017; originally announced March 2017.

    Comments: To appear at STOC'17

  8. arXiv:1607.04911  [pdf, other

    cs.DS math.CO

    Near-Optimal Induced Universal Graphs for Bounded Degree Graphs

    Authors: Mikkel Abrahamsen, Stephen Alstrup, Jacob Holm, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: A graph $U$ is an induced universal graph for a family $F$ of graphs if every graph in $F$ is a vertex-induced subgraph of $U$. For the family of all undirected graphs on $n$ vertices Alstrup, Kaplan, Thorup, and Zwick [STOC 2015] give an induced universal graph with $O\!\left(2^{n/2}\right)$ vertices, matching a lower bound by Moon [Proc. Glasgow Math. Assoc. 1965]. Let $k= \lceil D/2 \rceil$.… ▽ More

    Submitted 21 July, 2016; v1 submitted 17 July, 2016; originally announced July 2016.

  9. arXiv:1507.00552  [pdf, ps, other

    cs.DS

    I/O-Efficient Similarity Join

    Authors: Rasmus Pagh, Ninh Pham, Francesco Silvestri, Morten Stöckel

    Abstract: We present an I/O-efficient algorithm for computing similarity joins based on locality-sensitive hashing (LSH). In contrast to the filtering methods commonly suggested our method has provable sub-quadratic dependency on the data size. Further, in contrast to straightforward implementations of known LSH-based algorithms on external memory, our approach is able to take significant advantage of the a… ▽ More

    Submitted 28 March, 2017; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: 20 pages in Proceedings of the 23rd Annual European Symposium on Algorithms 2015. The full version appeared in Algorithmica 2017

  10. arXiv:1502.05729  [pdf, other

    cs.DS

    Quicksort, Largest Bucket, and Min-Wise Hashing with Limited Independence

    Authors: Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: Randomized algorithms and data structures are often analyzed under the assumption of access to a perfect source of randomness. The most fundamental metric used to measure how "random" a hash function or a random number generator is, is its independence: a sequence of random variables is said to be $k$-independent if every variable is uniform and every size $k$ subset is independent. In this paper… ▽ More

    Submitted 19 February, 2015; originally announced February 2015.

    Comments: Submitted to ICALP 2015

  11. arXiv:1501.02143  [pdf, other

    cs.DB

    Association Rule Mining using Maximum Entropy

    Authors: Rasmus Pagh, Morten Stöckel

    Abstract: Recommendations based on behavioral data may be faced with ambiguous statistical evidence. We consider the case of association rules, relevant e.g.~for query and product recommendations. For example: Suppose that a customer belongs to categories A and B, each of which is known to have positive correlation with buying product C, how do we estimate the probability that she will buy product C? For… ▽ More

    Submitted 9 January, 2015; originally announced January 2015.

  12. arXiv:1403.3551  [pdf, ps, other

    cs.DS

    The Input/Output Complexity of Sparse Matrix Multiplication

    Authors: Rasmus Pagh, Morten Stöckel

    Abstract: We consider the problem of multiplying sparse matrices (over a semiring) where the number of non-zero entries is larger than main memory. In the classical paper of Hong and Kung (STOC '81) it was shown that to compute a product of dense $U \times U$ matrices, $Θ\left(U^3 / (B \sqrt{M}) \right)$ I/Os are necessary and sufficient in the I/O model with internal memory size $M$ and memory block size… ▽ More

    Submitted 14 March, 2014; originally announced March 2014.

    Comments: Submitted to ICALP 2014

  13. arXiv:1210.2544  [pdf, ps, other

    cs.CC cs.DM math.CO

    The Hardness of the Functional Orientation 2-Color Problem

    Authors: Søren Bøg, Morten Stöckel, Hjalte Wedel Vildhøj

    Abstract: We consider the Functional Orientation 2-Color problem, which was introduced by Valiant in his seminal paper on holographic algorithms [SIAM J. Comput., 37(5), 2008]. For this decision problem, Valiant gave a polynomial time holographic algorithm for planar graphs of maximum degree 3, and showed that the problem is NP-complete for planar graphs of maximum degree 10. A recent result on defective gr… ▽ More

    Submitted 23 April, 2013; v1 submitted 9 October, 2012; originally announced October 2012.

    Journal ref: Australas. J. Combin., vol. 56 (2013), pages 225-234