
Showing 1–12 of 12 results for author: Teschke, O

Searching in archive cs.
  1. arXiv:2406.03858  [pdf, ps, other]

    cs.IR cs.DL

    Reducing the climate impact of data portals: a case study

    Authors: Noah Gießing, Madhurima Deb, Ankit Satpute, Moritz Schubotz, Olaf Teschke

    Abstract: The carbon footprint share of the information and communication technology (ICT) sector has steadily increased in the past decade and is predicted to make up as much as 23% of global emissions in 2030. This shows a pressing need for developers, including the information retrieval community, to make their code more energy-efficient. In this project proposal, we discuss techniques to reduce the en…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 4 pages

  2. arXiv:2404.00344  [pdf, other]

    cs.CL cs.AI cs.IR

    Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange

    Authors: Ankit Satpute, Noah Giessing, Andre Greiner-Petter, Moritz Schubotz, Olaf Teschke, Akiko Aizawa, Bela Gipp

    Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities in various natural language tasks, often achieving performances that surpass those of humans. Despite these advancements, the domain of mathematics presents a distinctive challenge, primarily due to its specialized structure and the precision it demands. In this study, we adopted a two-step approach for investigating the profi…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), July 14-18, 2024, Washington D.C., USA

  3. Taxonomy of Mathematical Plagiarism

    Authors: Ankit Satpute, Andre Greiner-Petter, Noah Gießing, Isabel Beckenbach, Moritz Schubotz, Olaf Teschke, Akiko Aizawa, Bela Gipp

    Abstract: Plagiarism is a pressing concern, even more so with the availability of large language models. Existing plagiarism detection systems reliably find copied and moderately reworded text but fail for idea plagiarism, especially in mathematical science, which heavily uses formal mathematical notation. We make two contributions. First, we establish a taxonomy of mathematical content reuse by annotating…

    Submitted 31 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 46th European Conference on Information Retrieval (ECIR)

  4. arXiv:2401.08297  [pdf, other]

    cs.DL math.HO

    The extension of zbMATH Open by arXiv preprints

    Authors: Isabel Beckenbach, Klaus Hulek, Olaf Teschke

    Abstract: zbMATH Open has started a new feature -- relevant preprints posted at arXiv will also be displayed in the database. In this article we introduce this new feature and the underlying editorial policy. We also describe some of the technical issues involved and discuss the challenges this presents for future developments.

    Submitted 16 January, 2024; originally announced January 2024.

    MSC Class: 68V35

  5. arXiv:2309.11484  [pdf, other]

    cs.DL cs.IR

    Bravo MaRDI: A Wikibase Powered Knowledge Graph on Mathematics

    Authors: Moritz Schubotz, Eloi Ferrer, Johannes Stegmüller, Daniel Mietchen, Olaf Teschke, Larissa Pusch, Tim OF Conrad

    Abstract: Mathematical world knowledge is a fundamental component of Wikidata. However, to date, no expertly curated knowledge graph has focused specifically on contemporary mathematics. Addressing this gap, the Mathematical Research Data Initiative (MaRDI) has developed a comprehensive knowledge graph that links multimodal research data in mathematics. This encompasses traditional research data items like…

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted at Wikidata'23: Wikidata workshop at ISWC 2023

  6. arXiv:2305.13193  [pdf, other]

    cs.IR

    TEIMMA: The First Content Reuse Annotator for Text, Images, and Math

    Authors: Ankit Satpute, André Greiner-Petter, Moritz Schubotz, Norman Meuschke, Akiko Aizawa, Olaf Teschke, Bela Gipp

    Abstract: This demo paper presents the first tool to annotate the reuse of text, images, and mathematical formulae in a document pair -- TEIMMA. Annotating content reuse is particularly useful to develop plagiarism detection algorithms. Real-world content reuse is often obfuscated, which makes it challenging to identify such cases. TEIMMA allows entering the obfuscation type to enable novel classifications…

    Submitted 13 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  7. arXiv:2107.13877  [pdf, ps, other]

    cs.DL math.HO

    10 Years Later: The Mathematics Subject Classification and Linked Open Data

    Authors: Susanne Arndt, Patrick Ion, Mila Runnwerth, Moritz Schubotz, Olaf Teschke

    Abstract: Ten years ago, the Mathematics Subject Classification MSC 2010 was released, and a corresponding machine-readable Linked Open Data collection was published using the Simple Knowledge Organization System (SKOS). Now, the new MSC 2020 is out. This paper recaps the last ten years of working on machine-readable MSC data and presents the new machine-readable MSC 2020. We describe the processing requi…

    Submitted 2 August, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Extended version of the CICM article

    MSC Class: 00-01

    ACM Class: G.m; E.m

  8. arXiv:2106.04664  [pdf, other]

    cs.DL

    zbMATH Open: API Solutions and Research Challenges

    Authors: Matteo Petrera, Dennis Trautwein, Isabel Beckenbach, Dariush Ehsani, Fabian Mueller, Olaf Teschke, Bela Gipp, Moritz Schubotz

    Abstract: We present zbMATH Open, the most comprehensive collection of reviews and bibliographic metadata of scholarly literature in mathematics. Besides our website https://zbMATH.org which is openly accessible since the beginning of this year, we provide API endpoints to offer our data. The API improves interoperability with others, i.e., digital libraries, and allows using our data for research purposes.…

    Submitted 23 June, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  9. arXiv:2012.02413  [pdf]

    cs.DL

    ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?

    Authors: Philipp Scharpf, Moritz Schubotz, Andre Greiner-Petter, Malte Ostendorff, Olaf Teschke, Bela Gipp

    Abstract: The zbMATH database contains more than 4 million bibliographic entries. We aim to provide easy access to these entries. Therefore, we maintain different index structures, including a formula index. To optimize the findability of the entries in our database, we continuously investigate new approaches to satisfy the information needs of our users. We believe that the findings from the ARQMath evalua…

    Submitted 10 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: in Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 22-25, 2020 http://ceur-ws.org/Vol-2696/paper_200.pdf

  10. AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels

    Authors: Moritz Schubotz, Philipp Scharpf, Olaf Teschke, Andreas Kuehnemund, Corinna Breitinger, Bela Gipp

    Abstract: Authors of research papers in mathematics and other math-heavy disciplines commonly employ the Mathematics Subject Classification (MSC) scheme to search for relevant literature. The MSC is a hierarchical alphanumerical classification scheme that allows librarians to specify one or multiple codes for publications. Digital Libraries in Mathematics, as well as reviewing services, such…

    Submitted 9 November, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Journal ref: Intelligent Computer Mathematics - 13th International Conference, CICM 2020, Bertinoro, Italy, July 26-31, 2020, Proceedings

  11. Mathematical Formulae in Wikimedia Projects 2020

    Authors: Moritz Schubotz, André Greiner-Petter, Norman Meuschke, Olaf Teschke, Bela Gipp

    Abstract: This poster summarizes our contributions to Wikimedia's processing pipeline for mathematical formulae. We describe how we have supported the transition from rendering formulae as coarse-grained PNG images in 2001 to providing modern semantically enriched language-independent MathML formulae in 2020. Additionally, we describe our plans to improve the accessibility and discoverability of mathematica…

    Submitted 6 May, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: Submitted to JCDL 2020: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL '20), August 1-5, 2020, Virtual Event, China

  12. Forms of Plagiarism in Digital Mathematical Libraries

    Authors: Moritz Schubotz, Olaf Teschke, Vincent Stange, Norman Meuschke, Bela Gipp

    Abstract: We report on an exploratory analysis of the forms of plagiarism observable in mathematical publications, which we identified by investigating editorial notes from zbMATH. While most cases we encountered were simple copies of earlier work, we also identified several forms of disguised plagiarism. We investigated 11 cases in detail and evaluate how current plagiarism detection systems perform in ide…

    Submitted 9 September, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Journal ref: Intelligent Computer Mathematics - 12th International Conference, CICM 2019, Prague, Czech Republic, July 8-12, 2019, Proceedings