Skip to main content

Showing 1–6 of 6 results for author: Sheinin, V

.
  1. arXiv:2405.06807  [pdf, other

    cs.CL cs.SE

    Tackling Execution-Based Evaluation for NL2Bash

    Authors: Ngoc Phuoc An Vo, Brent Paulovicks, Vadim Sheinin

    Abstract: Given recent advancement of Large Language Models (LLMs), the task of translating from natural language prompts to different programming languages (code generation) attracts immense attention for wide application in different domains. Specially code generation for Bash (NL2Bash) is widely used to generate Bash scripts for automating different tasks, such as performance monitoring, compilation, sys… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  2. arXiv:2312.05448  [pdf, other

    cs.CL

    Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found

    Authors: Irene Manotas, Octavian Popescu, Ngoc Phuoc An Vo, Vadim Sheinin

    Abstract: There are many recent advanced developments for the Text-to-SQL task, where the Picard model is one of the the top performing models as measured by the Spider dataset competition. However, bringing Text-to-SQL systems to realistic use-cases through domain adaptation remains a tough challenge. We analyze how well the base T5 Language Model and Picard perform on query structures different from the S… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    ACM Class: I.2.7

  3. arXiv:2104.00660  [pdf, other

    cs.CL

    Recognizing and Splitting Conditional Sentences for Automation of Business Processes Management

    Authors: Ngoc Phuoc An Vo, Irene Manotas, Octavian Popescu, Algimantas Cerniauskas, Vadim Sheinin

    Abstract: Business Process Management (BPM) is the discipline which is responsible for management of discovering, analyzing, redesigning, monitoring, and controlling business processes. One of the most crucial tasks of BPM is discovering and modelling business processes from text documents. In this paper, we present our system that resolves an end-to-end problem consisting of 1) recognizing conditional sent… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Preprint

  4. arXiv:1809.05255  [pdf, other

    cs.CL cs.LG

    SQL-to-Text Generation with Graph-to-Sequence Model

    Authors: Kun Xu, Lingfei Wu, Zhiguo Wang, Yansong Feng, Vadim Sheinin

    Abstract: Previous work approaches the SQL-to-text generation task using vanilla Seq2Seq models, which may not fully capture the inherent graph-structured information in SQL query. In this paper, we first introduce a strategy to represent the SQL query as a directed graph and then employ a graph-to-sequence model to encode the global structure information into node embeddings. This model can effectively lea… ▽ More

    Submitted 12 February, 2019; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: EMNLP18, Accepted

  5. arXiv:1808.07624  [pdf, other

    cs.CL cs.AI

    Exploiting Rich Syntactic Information for Semantic Parsing with Graph-to-Sequence Model

    Authors: Kun Xu, Lingfei Wu, Zhiguo Wang, Mo Yu, Liwei Chen, Vadim Sheinin

    Abstract: Existing neural semantic parsers mainly utilize a sequence encoder, i.e., a sequential LSTM, to extract word order features while neglecting other valuable syntactic information such as dependency graph or constituent trees. In this paper, we first propose to use the \textit{syntactic graph} to represent three types of syntactic information, i.e., word order, dependency and constituency features.… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: EMNLP'18

  6. arXiv:1804.00823  [pdf, other

    cs.AI cs.CL cs.LG stat.ML

    Graph2Seq: Graph to Sequence Learning with Attention-based Neural Networks

    Authors: Kun Xu, Lingfei Wu, Zhiguo Wang, Yansong Feng, Michael Witbrock, Vadim Sheinin

    Abstract: The celebrated Sequence to Sequence learning (Seq2Seq) technique and its numerous variants achieve excellent performance on many tasks. However, many machine learning tasks have inputs naturally represented as graphs; existing Seq2Seq models face a significant challenge in achieving accurate conversion from graph form to the appropriate sequence. To address this challenge, we introduce a novel gen… ▽ More

    Submitted 3 December, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: 16 pages, 3 figures, 4 tables