Skip to main content

Showing 1–10 of 10 results for author: Scharpf, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.01994  [pdf, other

    cs.IR cs.LG

    Discovery and Recognition of Formula Concepts using Machine Learning

    Authors: Philipp Scharpf, Moritz Schubotz, Howard S. Cohl, Corinna Breitinger, Bela Gipp

    Abstract: Citation-based Information Retrieval (IR) methods for scientific documents have proven effective for IR applications, such as Plagiarism Detection or Literature Recommender Systems in academic disciplines that use many references. In science, technology, engineering, and mathematics, researchers often employ mathematical concepts through formula notation to refer to prior knowledge. Our long-term… ▽ More

    Submitted 19 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted by Scientometrics (Springer) journal

    MSC Class: 68P20 (Primary); 68T50 (Secondary) ACM Class: H.3.3; I.2.7

  2. Collaborative and AI-aided Exam Question Generation using Wikidata in Education

    Authors: Philipp Scharpf, Moritz Schubotz, Andreas Spitz, Andre Greiner-Petter, Bela Gipp

    Abstract: Since the COVID-19 outbreak, the use of digital learning or education platforms has significantly increased. Teachers now digitally distribute homework and provide exercise questions. In both cases, teachers need to continuously develop novel and individual questions. This process can be very time-consuming and should be facilitated and accelerated both through exchange with other teachers and by… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    MSC Class: 68Uxx ACM Class: H.4

  3. arXiv:2211.06664  [pdf

    cs.IR

    Mining Mathematical Documents for Question Answering via Unsupervised Formula Labeling

    Authors: Philipp Scharpf, Moritz Schubotz, Bela Gipp

    Abstract: The increasing number of questions on Question Answering (QA) platforms like Math Stack Exchange (MSE) signifies a growing information need to answer math-related questions. However, there is currently very little research on approaches for an open data QA system that retrieves mathematical formulae using their concept names or querying formula identifier relationships from knowledge graphs. In th… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    MSC Class: 68Uxx ACM Class: H.4

  4. arXiv:2109.00954  [pdf, other

    cs.IR

    Towards Explaining STEM Document Classification using Mathematical Entity Linking

    Authors: Philipp Scharpf, Moritz Schubotz, Bela Gipp

    Abstract: Document subject classification is essential for structuring (digital) libraries and allowing readers to search within a specific field. Currently, the classification is typically made by human domain experts. Semi-supervised Machine Learning algorithms can support them by exploiting the labeled data to predict subject classes for unclassified new documents. However, while humans partly do, machin… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  5. arXiv:2104.05111  [pdf, other

    cs.DL

    Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

    Authors: Philipp Scharpf, Moritz Schubotz, Bela Gipp

    Abstract: Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge-bases that link mathematical expressions to their natural language names. For database population, mathematical formulae need to be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structure and s… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

  6. arXiv:2012.02413  [pdf

    cs.DL

    ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?

    Authors: Philipp Scharpf, Moritz Schubotz, Andre Greiner-Petter, Malte Ostendorff, Olaf Teschke, Bela Gipp

    Abstract: The zbMATH database contains more than 4 million bibliographic entries. We aim to provide easy access to these entries. Therefore, we maintain different index structures, including a formula index. To optimize the findability of the entries in our database, we continuously investigate new approaches to satisfy the information needs of our users. We believe that the findings from the ARQMath evalua… ▽ More

    Submitted 10 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: in Working Notes of {CLEF} 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 22-25, 2020 http://ceur-ws.org/Vol-2696/paper_200.pdf

  7. AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels

    Authors: Moritz Schubotz, Philipp Scharpf, Olaf Teschke, Andreas Kuehnemund, Corinna Breitinger, Bela Gipp

    Abstract: Authors of research papers in the fields of mathematics, and other math-heavy disciplines commonly employ the Mathematics Subject Classification (MSC) scheme to search for relevant literature. The MSC is a hierarchical alphanumerical classification scheme that allows librarians to specify one or multiple codes for publications. Digital Libraries in Mathematics, as well as reviewing services, such… ▽ More

    Submitted 9 November, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Journal ref: Intelligent Computer Mathematics - 13thInternational Conference, {CICM} 2020, Bertinoro, Italy, July 26-31, 2020, Proceedings

  8. arXiv:2005.11021  [pdf, other

    cs.DL cs.CL cs.IR cs.LG

    Classification and Clustering of arXiv Documents, Sections, and Abstracts, Comparing Encodings of Natural and Mathematical Language

    Authors: Philipp Scharpf, Moritz Schubotz, Abdou Youssef, Felix Hamborg, Norman Meuschke, Bela Gipp

    Abstract: In this paper, we show how selecting and combining encodings of natural and mathematical language affect classification and clustering of documents with mathematical content. We demonstrate this by using sets of documents, sections, and abstracts from the arXiv preprint server that are labeled by their subject class (mathematics, computer science, physics, etc.) to compare different encodings of t… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

    Journal ref: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries JCDL 2020

  9. arXiv:1907.01642  [pdf, other

    cs.IR cs.CL cs.DL

    Introducing MathQA -- A Math-Aware Question Answering System

    Authors: Moritz Schubotz, Philipp Scharpf, Kaushal Dudhat, Yash Nagar, Felix Hamborg, Bela Gipp

    Abstract: We present an open source math-aware Question Answering System based on Ask Platypus. Our system returns as a single mathematical formula for a natural language question in English or Hindi. This formulae originate from the knowledge-base Wikidata. We translate these formulae to computable data by integrating the calculation engine sympy into our system. This way, users can enter numeric values fo… ▽ More

    Submitted 28 June, 2019; originally announced July 2019.

    Comments: Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), Workshop on Knowledge Discovery (2018)

  10. Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

    Authors: Moritz Schubotz, Andre Greiner-Petter, Philipp Scharpf, Norman Meuschke, Howard Cohl, Bela Gipp

    Abstract: Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable f… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

    Comments: 10 pages, 4 figures

    Journal ref: Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), Jun. 2018, Fort Worth, USA