Showing 1–7 of 7 results for author: Sengupta, N

Searching in archive cs.
  1. arXiv:2402.12840  [pdf, other]

    cs.CL

    ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

    Authors: Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin

    Abstract: The focus of language model evaluation has transitioned towards reasoning and knowledge-intensive tasks, driven by advancements in pretraining large models. While state-of-the-art models are partially trained on large Arabic texts, evaluating their performance in Arabic remains challenging due to the limited availability of relevant datasets. To bridge this gap, we present ArabicMMLU, the first mu…

    Submitted 20 February, 2024; originally announced February 2024.

  2. arXiv:2308.16149  [pdf, other]

    cs.CL cs.AI cs.LG

    Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models

    Authors: Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Satheesh Katipomu, Haonan Li, Fajri Koto, William Marshall, Gurpreet Gosal, Cynthia Liu, Zhiming Chen, Osama Mohammed Afzal, Samta Kamboj, Onkar Pandit, Rahul Pal, Lalit Pradhan, Zain Muhammad Mujahid, Massa Baali, Xudong Han, Sondos Mahmoud Bsharat, Alham Fikri Aji, Zhiqiang Shen, Zhengzhong Liu, Natalia Vassilieva, Joel Hestness, Andy Hock, et al. (7 additional authors not shown)

    Abstract: We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic and English texts, including source code in various programming languages. With 13 billion parameters, they demonstrate better knowledge and reasoning…

    Submitted 29 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Arabic-centric, foundation model, large-language model, LLM, generative model, instruction-tuned, Jais, Jais-chat

    MSC Class: 68T50; ACM Class: F.2.2; I.2.7

  3. arXiv:2010.06336  [pdf, other]

    cs.DB

    Finding Minimum Connected Subgraphs with Ontology Exploration on Large RDF Data

    Authors: Xiangnan Ren, Neha Sengupta, Xuguang Ren, Junhu Wang, Olivier Curé

    Abstract: In this paper, we study the following problem: given a knowledge graph (KG) and a set of input vertices (representing concepts or entities) and edge labels, we aim to find the smallest connected subgraphs containing all of the inputs. This problem plays a key role in KG-based search engines and natural language question answering systems, and it is a natural extension of the Steiner tree problem,…

    Submitted 14 October, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: 13 pages, 11 figures

  4. arXiv:2008.00441  [pdf, other]

    cs.CL

    Relation Extraction with Self-determined Graph Convolutional Network

    Authors: Sunil Kumar Sahu, Derek Thomas, Billy Chiu, Neha Sengupta, Mohammady Mahdy

    Abstract: Relation Extraction is a way of obtaining the semantic relationship between entities in text. The state-of-the-art methods use linguistic tools to build a graph for the text in which the entities appear and then a Graph Convolutional Network (GCN) is employed to encode the pre-built graphs. Although their performance is promising, the reliance on linguistic tools results in a non end-to-end proces…

    Submitted 27 August, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: CIKM-2020

  5. arXiv:1901.09659  [pdf]

    cs.SI cs.CY stat.AP

    Simple Surveys: Response Retrieval Inspired by Recommendation Systems

    Authors: Nandana Sengupta, Nati Srebro, James Evans

    Abstract: In the last decade, the use of simple rating and comparison surveys has proliferated on social and digital media platforms to fuel recommendations. These simple surveys and their extrapolation with machine learning algorithms shed light on user preferences over large and growing pools of items, such as movies, songs and ads. Social scientists have a long history of measuring perceptions, preferenc…

    Submitted 11 December, 2018; originally announced January 2019.

  6. arXiv:1802.07176  [pdf, other]

    cs.LG stat.ML

    Adaptive Sampling for Coarse Ranking

    Authors: Sumeet Katariya, Lalit Jain, Nandana Sengupta, James Evans, Robert Nowak

    Abstract: We consider the problem of active coarse ranking, where the goal is to sort items according to their means into clusters of pre-specified sizes, by adaptively sampling from their reward distributions. This setting is useful in many social science applications involving human raters and the approximate rank of every item is desired. Approximate or coarse ranking can significantly reduce the number…

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: Accepted at AISTATS 2018

  7. Sampling and Reconstruction Using Bloom Filters

    Authors: Neha Sengupta, Amitabha Bagchi, Srikanta Bedathur, Maya Ramanath

    Abstract: In this paper, we address the problem of sampling from a set and reconstructing a set stored as a Bloom filter. To the best of our knowledge our work is the first to address this question. We introduce a novel hierarchical data structure called BloomSampleTree that helps us design efficient algorithms to extract an almost uniform sample from the set stored in a Bloom filter and also allows us to r…

    Submitted 6 September, 2017; v1 submitted 12 January, 2017; originally announced January 2017.

    Journal ref: IEEE T. Knowl. Data En. 30(7):1324-1337, July 2018