Showing 1–14 of 14 results for author: Schenkel, R

  1. arXiv:2407.05467  [pdf, other]

    cs.DC cs.AI

    The infrastructure powering IBM's Gen AI model development

    Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla, Lan Hoang, Danny Barnett, I-Hsin Chung, Apoorve Mohan, Ming-Hung Chen, Lixiang Luo, Robert Walkup, Constantinos Evangelinos, Shweta Salaria, Marc Dombrowa, Yoonho Park, Apo Kayi, Liran Schour, Alim Alim, Ali Sydney, Pavlos Maniotis, Laurent Schares, Bernard Metzler, Bengi Karacali-Akyamac, Sophia Wen, Tatsuhiro Chiba, et al. (121 additional authors not shown)

    Abstract: AI Infrastructure plays a key role in the speed and cost-competitiveness of developing and deploying advanced AI models. The current demand for powerful AI infrastructure for model training is driven by the emergence of generative AI and foundational models, where on occasion thousands of GPUs must cooperate on a single training job for the model to be trained in a reasonable time. Delivering effi…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Corresponding Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla

  2. arXiv:2304.11656  [pdf, ps, other]

    cs.DL

    Capturing Stability of Information Needs in Digital Libraries

    Authors: Christin Katharina Kreutz, Philipp Schaer, Ralf Schenkel

    Abstract: Scientific digital libraries provide users access to large amounts of data to satisfy their diverse information needs. Factors influencing users' decisions on the relevancy of a publication or a person are individual and usually only visible through posed queries or clicked information. However, the actual formulation or consideration of information requirements begins earlier in users' exploratio…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 2 pages + references, poster accepted at JCDL'23

  3. arXiv:2304.11651  [pdf, other]

    cs.DL

    Evaluating Digital Library Search Systems by using Formal Process Modelling

    Authors: Christin Katharina Kreutz, Martin Blum, Philipp Schaer, Ralf Schenkel, Benjamin Weyers

    Abstract: Evaluations of digital library information systems are typically centred on users correctly, efficiently, and quickly performing predefined tasks. Additionally, users generally enjoy working with the evaluated system, and completed questionnaires show an interface's excellent user experience. However, such evaluations do not explicitly consider comparing or connecting user-specific information-see…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 10 pages + references, publication accepted at JCDL'23

  4. SchenQL: A query language for bibliographic data with aggregations and domain-specific functions

    Authors: Christin Katharina Kreutz, Martin Blum, Ralf Schenkel

    Abstract: Current search interfaces of digital libraries are not suitable to satisfy complex or convoluted information needs directly, when it comes to cases such as "Find authors who only recently started working on a topic". They might offer possibilities to obtain this information only by requiring vast user interaction. We present SchenQL, a web interface of a domain-specific query language on bibliogra…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted at JCDL'22 as a demo, 5 pages, 4 figures

  5. arXiv:2201.11030  [pdf, other]

    cs.DL

    Diverse Reviewer Suggestion for Extending Conference Program Committees

    Authors: Christin Katharina Kreutz, Krisztian Balog, Ralf Schenkel

    Abstract: Automated reviewer recommendation for scientific conferences currently relies on the assumption that the program committee has the necessary expertise to handle all submissions. However, topical discrepancies between received submissions and reviewer candidates might lead to unreliable reviews or overburdening of reviewers, and may result in the rejection of high-quality papers. In this work, we p…

    Submitted 26 January, 2022; originally announced January 2022.

  6. arXiv:2201.00682  [pdf, other]

    cs.DL

    Scientific Paper Recommendation Systems: a Literature Review of recent Publications

    Authors: Christin Katharina Kreutz, Ralf Schenkel

    Abstract: Scientific writing builds upon already published papers. Manual identification of publications to read, cite or consider as related papers relies on a researcher's ability to identify fitting keywords or initial papers from which a literature search can be started. The rapidly increasing amount of papers has called for automatic measures to find the desired relevant publications, so-called paper r…

    Submitted 7 September, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 33 pages, accepted for publication at IJDL (https://www.springer.com/journal/799)

  7. arXiv:2110.02862  [pdf, other]

    cs.DL

    RevASIDE: Assignment of Suitable Reviewer Sets for Publications from Fixed Candidate Pools

    Authors: Christin Katharina Kreutz, Ralf Schenkel

    Abstract: Scientific publishing heavily relies on the assessment of quality of submitted manuscripts by peer reviewers. Assigning a set of matching reviewers to a submission is a highly complex task which can be performed only by domain experts. We introduce RevASIDE, a reviewer recommendation system that assigns suitable sets of complementing reviewers from a predefined candidate pool without requiring man…

    Submitted 7 October, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

  8. arXiv:2103.06253  [pdf, other]

    cs.DB

    FiLiPo: A Sample Driven Approach for Finding Linkage Points between RDF Data and APIs (Extended Version)

    Authors: Tobias Zeimetz, Ralf Schenkel

    Abstract: Data integration is an important task in order to create comprehensive RDF knowledge bases. Many data sources are used to extend a given dataset or to correct errors. Since several data providers make their data publicly available only via Web APIs they also must be included in the integration process. However, APIs often come with limitations in terms of access frequencies and speed due to latenc…

    Submitted 17 June, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

  9. arXiv:2006.04562  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards an Argument Mining Pipeline Transforming Texts to Argument Graphs

    Authors: Mirko Lenz, Premtim Sahitaj, Sean Kallenberg, Christopher Coors, Lorik Dumani, Ralf Schenkel, Ralph Bergmann

    Abstract: This paper targets the automated extraction of components of argumentative information and their relations from natural language text. Moreover, we address a current lack of systems to provide complete argumentative structure from arbitrary natural language text for general usage. We present an argument mining pipeline as a universally applicable approach for transforming German and English langua…

    Submitted 28 September, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  10. arXiv:2004.11163  [pdf, other]

    cs.CL cs.LG

    Same Side Stance Classification Task: Facilitating Argument Stance Classification by Fine-tuning a BERT Model

    Authors: Stefan Ollinger, Lorik Dumani, Premtim Sahitaj, Ralph Bergmann, Ralf Schenkel

    Abstract: Research on computational argumentation is currently being intensively investigated. The goal of this community is to find the best pro and con arguments for a user given topic either to form an opinion for oneself, or to persuade others to adopt a certain standpoint. While existing argument mining methods can find appropriate arguments for a topic, a correct classification into pro and con is not…

    Submitted 23 April, 2020; originally announced April 2020.

  11. arXiv:1906.06132  [pdf, other]

    cs.DL

    SchenQL -- A Domain-Specific Query Language on Bibliographic Metadata

    Authors: Christin Katharina Kreutz, Michael Wolz, Ralf Schenkel

    Abstract: Information access needs to be uncomplicated, users rather use incorrect data which is easily received than correct information which is harder to obtain. Querying bibliographic metadata from digital libraries mainly supports simple textual queries. A user's demand for answering more sophisticated queries could be fulfilled by the usage of SQL. As such means are highly complex and challenging even…

    Submitted 17 June, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  12. Prioritizing and Scheduling Conferences for Metadata Harvesting in dblp

    Authors: Mandy Neumann, Christopher Michels, Philipp Schaer, Ralf Schenkel

    Abstract: Maintaining literature databases and online bibliographies is a core responsibility of metadata aggregators such as digital libraries. In the process of monitoring all the available data sources the question arises which data source should be prioritized. Based on a broad definition of information quality we are looking for different ways to find the best fitting and most promising conference cand…

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: submitted to JCDL 2018

  13. arXiv:1212.5636  [pdf, other]

    cs.DB

    Partout: A Distributed Engine for Efficient RDF Processing

    Authors: Luis Galárraga, Katja Hose, Ralf Schenkel

    Abstract: The increasing interest in Semantic Web technologies has led not only to a rapid growth of semantic data on the Web but also to an increasing number of backend applications with already more than a trillion triples in some cases. Confronted with such huge amounts of data and the future growth, existing state-of-the-art systems for storing RDF and processing SPARQL queries are no longer sufficient.…

    Submitted 21 December, 2012; originally announced December 2012.

  14. arXiv:1210.5403  [pdf, other]

    cs.DB

    An Experience Report of Large Scale Federations

    Authors: Andreas Schwarte, Peter Haase, Michael Schmidt, Katja Hose, Ralf Schenkel

    Abstract: We present an experimental study of large-scale RDF federations on top of the Bio2RDF data sources, involving 29 data sets with more than four billion RDF triples deployed in a local federation. Our federation is driven by FedX, a highly optimized federation mediator for Linked Data. We discuss design decisions, technical aspects, and experiences made in setting up and optimizing the Bio2RDF feder…

    Submitted 19 October, 2012; originally announced October 2012.

    ACM Class: H.2.3; H.2.4; H.3.4