Showing 1–8 of 8 results for author: Akash, P S

  1. arXiv:2311.09383  [pdf, other]

    cs.CL cs.LG

    Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach

    Authors: Pritom Saha Akash, Kashob Kumar Roy, Lucian Popa, Kevin Chen-Chuan Chang

    Abstract: Long-form question answering (LFQA) poses a challenge as it involves generating detailed answers in the form of paragraphs, which go beyond simple yes/no responses or short factual answers. While existing QA models excel in questions with concise answers, LFQA requires handling multiple topics and their intricate relationships, demanding comprehensive explanations. Previous attempts at LFQA focuse…

    Submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2310.15420  [pdf, other]

    cs.CL

    Let the Pretrained Language Models "Imagine" for Short Texts Topic Modeling

    Authors: Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang

    Abstract: Topic models are one of the compelling methods for discovering latent semantics in a document collection. However, it assumes that a document has sufficient co-occurrence information to be effective. However, in short texts, co-occurrence information is minimal, which results in feature sparsity in document representation. Therefore, existing topic models (probabilistic or neural) mostly fail to m…

    Submitted 23 October, 2023; originally announced October 2023.

  3. arXiv:2310.04978  [pdf, other]

    cs.CL cs.LG

    TopicAdapt- An Inter-Corpora Topics Adaptation Approach

    Authors: Pritom Saha Akash, Trisha Das, Kevin Chen-Chuan Chang

    Abstract: Topic models are popular statistical tools for detecting latent semantic topics in a text corpus. They have been utilized in various applications across different fields. However, traditional topic models have some limitations, including insensitivity to user guidance, sensitivity to the amount and quality of data, and the inability to adapt learned topics from one corpus to another. To address th…

    Submitted 7 October, 2023; originally announced October 2023.

  4. arXiv:2306.10755  [pdf, other]

    cs.CL

    Unsupervised Open-domain Keyphrase Generation

    Authors: Lam Thanh Do, Pritom Saha Akash, Kevin Chen-Chuan Chang

    Abstract: In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without using human-labeled data and can perform consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, namely phraseness and informativeness module, both of which can be bu…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023. arXiv admin note: text overlap with arXiv:1207.4169 by other authors

  5. arXiv:2210.08559  [pdf, other]

    cs.CL cs.IR

    Coordinated Topic Modeling

    Authors: Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang

    Abstract: We propose a new problem called coordinated topic modeling that imitates human behavior while describing a text corpus. It considers a set of well-defined topics like the axes of a semantic space with a reference representation. It then uses the axes to model a corpus for easily understandable representation. This new task helps represent a corpus more interpretably by reusing existing knowledge a…

    Submitted 22 October, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

  6. arXiv:2204.07125  [pdf, other]

    cs.DB

    Online Aggregation based Approximate Query Processing: A Literature Survey

    Authors: Pritom Saha Akash, Wei-Cheng Lai, Po-Wen Lin

    Abstract: In the current world, OLAP (Online Analytical Processing) is used intensively by modern organizations to perform ad hoc analysis of data, providing insight for better decision making. Thus, the performance for OLAP is crucial; however, it is costly to support OLAP for a large data-set. An approximate query process (AQP) was proposed to efficiently compute approximate values as close as to the exac…

    Submitted 14 April, 2022; originally announced April 2022.

  7. Domain Representative Keywords Selection: A Probabilistic Approach

    Authors: Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang, Yunyao Li, Lucian Popa, ChengXiang Zhai

    Abstract: We propose a probabilistic approach to select a subset of a target domain representative keywords from a candidate set, contrasting with a context domain. Such a task is crucial for many downstream tasks in natural language processing. To contrast the target domain and the context domain, we adapt the two-component mixture model concept to generate a distribution of candidate key…

    Submitted 4 June, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

  8. arXiv:2203.08787  [pdf, other]

    cs.SE cs.AI

    Exploring Variational Graph Auto-Encoders for Extract Class Refactoring Recommendation

    Authors: Pritom Saha Akash, Kevin Chen-Chuan Chang

    Abstract: The code smell is a sign of design and development flaws in a software system that reduces the reusability and maintainability of the system. Refactoring is done as an ongoing practice to remove the code smell from the program code. Among different code smells, the God class or Blob is one of the most common code smells. A god class contains too many responsibilities, violating object-oriented pro…

    Submitted 19 March, 2023; v1 submitted 16 March, 2022; originally announced March 2022.