Skip to main content

Showing 1–5 of 5 results for author: Méndez, S J R

.
  1. arXiv:2405.15374  [pdf, other

    cs.IR cs.AI cs.CL

    Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

    Authors: Runsong Jia, Bowen Zhang, Sergio J. Rodríguez Méndez, Pouya G. Omran

    Abstract: The proposed research aims to develop an innovative semantic query processing system that enables users to obtain comprehensive information about research works produced by Computer Science (CS) researchers at the Australian National University (ANU). The system integrates Large Language Models (LLMs) with the ANU Scholarly Knowledge Graph (ASKG), a structured repository of all research-related ar… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: for the associated repository, see http://w3id.org/kgcp/KGQP

    ACM Class: H.3.3; I.2.4; I.7.5; I.2.7

  2. arXiv:2309.06126  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA astro-ph.HE cs.CL cs.LG

    AstroLLaMA: Towards Specialized Foundation Models in Astronomy

    Authors: Tuan Dung Nguyen, Yuan-Sen Ting, Ioana Ciucă, Charlie O'Neill, Ze-Chang Sun, Maja Jabłońska, Sandor Kruk, Ernest Perkowski, Jack Miller, Jason Li, Josh Peek, Kartheik Iyer, Tomasz Różański, Pranav Khetarpal, Sharaf Zaman, David Brodrick, Sergio J. Rodríguez Méndez, Thang Bui, Alyssa Goodman, Alberto Accomazzi, Jill Naiman, Jesse Cranney, Kevin Schawinski, UniverseTBD

    Abstract: Large language models excel in many human-language tasks but often falter in highly specialized domains like scholarly astronomy. To bridge this gap, we introduce AstroLLaMA, a 7-billion-parameter model fine-tuned from LLaMA-2 using over 300,000 astronomy abstracts from arXiv. Optimized for traditional causal language modeling, AstroLLaMA achieves a 30% lower perplexity than Llama-2, showing marke… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures, submitted to IJCNLP-AACL 2023. Comments are welcome. The model can be found on Hugging Face - https://huggingface.co/universeTBD/astrollama

  3. arXiv:2304.07774  [pdf, other

    cs.CL cs.IR

    Syntactic Complexity Identification, Measurement, and Reduction Through Controlled Syntactic Simplification

    Authors: Muhammad Salman, Armin Haller, Sergio J. Rodríguez Méndez

    Abstract: Text simplification is one of the domains in Natural Language Processing (NLP) that offers an opportunity to understand the text in a simplified manner for exploration. However, it is always hard to understand and retrieve knowledge from unstructured text, which is usually in the form of compound and complex sentences. There are state-of-the-art neural network-based methods to simplify the sentenc… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: This work is accepted and presented in International workshop on Learning with Knowledge Graphs (IWLKG) at WSDM'2023 Conference

  4. arXiv:2210.16843  [pdf, other

    cs.LG cs.IR

    A Pipeline for Analysing Grant Applications

    Authors: Shuaiqun Pan, Sergio J. Rodríguez Méndez, Kerry Taylor

    Abstract: Data mining techniques can transform massive amounts of unstructured data into quantitative data that quickly reveal insights, trends, and patterns behind the original data. In this paper, a data mining model is applied to analyse the 2019 grant applications submitted to an Australian Government research funding agency to investigate whether grant schemes successfully identifies innovative project… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  5. arXiv:2108.13700  [pdf, other

    cs.CL cs.AI cs.IR cs.SE

    TNNT: The Named Entity Recognition Toolkit

    Authors: Sandaru Seneviratne, Sergio J. Rodríguez Méndez, Xuecheng Zhang, Pouya G. Omran, Kerry Taylor, Armin Haller

    Abstract: Extraction of categorised named entities from text is a complex task given the availability of a variety of Named Entity Recognition (NER) models and the unstructured information encoded in different source document formats. Processing the documents to extract text, identifying suitable NER models for a task, and obtaining statistical information is important in data analysis to make informed deci… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: This demo paper will be submitted at K-Cap 2021