Skip to main content

Showing 1–20 of 20 results for author: Rokhlenko, O

Searching in archive cs. Search in all archives.
.
  1. Question Suggestion for Conversational Shop** Assistants Using Product Metadata

    Authors: Nikhita Vedula, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Digital assistants have become ubiquitous in e-commerce applications, following the recent advancements in Information Retrieval (IR), Natural Language Processing (NLP) and Generative Artificial Intelligence (AI). However, customers are often unsure or unaware of how to effectively converse with these assistants to meet their shop** needs. In this work, we emphasize the importance of providing c… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 5 pages, 1 figure

  2. arXiv:2404.06659  [pdf, other

    cs.CL

    Leveraging Interesting Facts to Enhance User Engagement with Conversational Interfaces

    Authors: Nikhita Vedula, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Conversational Task Assistants (CTAs) guide users in performing a multitude of activities, such as making recipes. However, ensuring that interactions remain engaging, interesting, and enjoyable for CTA users is not trivial, especially for time-consuming or challenging tasks. Grounded in psychological theories of human interest, we propose to engage users with contextual and interesting statements… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure

  3. arXiv:2404.06017  [pdf, other

    cs.CL

    Identifying Shop** Intent in Product QA for Proactive Recommendations

    Authors: Besnik Fetahu, Nachshon Cohen, Elad Haramaty, Liane Lewin-Eytan, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Voice assistants have become ubiquitous in smart devices allowing users to instantly access information via voice questions. While extensive research has been conducted in question answering for voice search, little attention has been paid on how to enable proactive recommendations from a voice assistant to its users. This is a highly challenging problem that often leads to user friction, mainly d… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at IronGraphs@ECIR'2024

  4. arXiv:2404.02422  [pdf, other

    cs.CL cs.LG

    Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data

    Authors: Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Large Language Models (LLMs) operating in 0-shot or few-shot settings achieve competitive results in Text Classification tasks. In-Context Learning (ICL) typically achieves better accuracy than the 0-shot setting, but it pays in terms of efficiency, due to the longer input prompt. In this paper, we propose a strategy to make LLMs as efficient as 0-shot text classifiers, while getting comparable or… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  5. arXiv:2401.09785  [pdf, other

    cs.CL

    Instant Answering in E-Commerce Buyer-Seller Messaging using Message-to-Question Reformulation

    Authors: Besnik Fetahu, Tejas Mehta, Qun Song, Nikhita Vedula, Oleg Rokhlenko, Shervin Malmasi

    Abstract: E-commerce customers frequently seek detailed product information for purchase decisions, commonly contacting sellers directly with extended queries. This manual response requirement imposes additional costs and disrupts buyer's shop** experience with response time fluctuations ranging from hours to days. We seek to automate buyer inquiries to sellers in a leading e-commerce store using a domain… ▽ More

    Submitted 30 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at ECIR 2024

  6. arXiv:2401.09775  [pdf, other

    cs.CL

    Controllable Decontextualization of Yes/No Question and Answers into Factual Statements

    Authors: Lingbo Mo, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Yes/No or polar questions represent one of the main linguistic question categories. They consist of a main interrogative clause, for which the answer is binary (assertion or negation). Polar questions and answers (PQA) represent a valuable knowledge resource present in many community and other curated QA sources, such as forums or e-commerce applications. Using answers to polar questions alone in… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at ECIR 2024

  7. arXiv:2311.12534  [pdf, other

    cs.CL

    Evaluation Metrics of Language Generation Models for Synthetic Traffic Generation Tasks

    Authors: Simone Filice, Jason Ingyu Choi, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko

    Abstract: Many Natural Language Generation (NLG) tasks aim to generate a single output text given an input prompt. Other settings require the generation of multiple texts, e.g., for Synthetic Traffic Generation (STG). This generation task is crucial for training and evaluating QA systems as well as conversational agents, where the goal is to generate multiple questions or utterances resembling the linguisti… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  8. arXiv:2310.17034  [pdf, other

    cs.CL

    Follow-on Question Suggestion via Voice Hints for Voice Assistants

    Authors: Besnik Fetahu, Pedro Faustini, Giuseppe Castellucci, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi

    Abstract: The adoption of voice assistants like Alexa or Siri has grown rapidly, allowing users to instantly access information via voice search. Query suggestion is a standard feature of screen-based search experiences, allowing users to explore additional topics. However, this is not trivial to implement in voice-based settings. To enable this, we tackle the novel task of suggesting questions with compact… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted as Long Paper at EMNLP'23 Findings

  9. arXiv:2310.16361  [pdf, other

    cs.CL cs.AI

    InstructPTS: Instruction-Tuning LLMs for Product Title Summarization

    Authors: Besnik Fetahu, Zhiyu Chen, Oleg Rokhlenko, Shervin Malmasi

    Abstract: E-commerce product catalogs contain billions of items. Most products have lengthy titles, as sellers pack them with product attributes to improve retrieval, and highlight key product aspects. This results in a gap between such unnatural products titles, and how customers refer to them. It also limits how e-commerce stores can use these seller-provided titles for recommendation, QA, or review summa… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 (Industry Track)

  10. arXiv:2310.13213  [pdf, other

    cs.CL cs.AI

    MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition

    Authors: Besnik Fetahu, Zhiyu Chen, Sudipta Kar, Oleg Rokhlenko, Shervin Malmasi

    Abstract: We present MULTICONER V2, a dataset for fine-grained Named Entity Recognition covering 33 entity classes across 12 languages, in both monolingual and multilingual settings. This dataset aims to tackle the following practical challenges in NER: (i) effective handling of fine-grained classes that include complex entities like movie titles, and (ii) performance degradation due to noise generated from… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to the Findings of EMNLP 2023

  11. arXiv:2306.03411  [pdf, other

    cs.CL cs.AI cs.IR

    Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search

    Authors: Zhiyu Chen, Jason Choi, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Customers interacting with product search engines are increasingly formulating information-seeking queries. Frequently Asked Question (FAQ) retrieval aims to retrieve common question-answer pairs for a user query with question intent. Integrating FAQ retrieval in product search can not only empower users to make more informed purchase decisions, but also enhance user retention through efficient po… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Industry Track

  12. arXiv:2305.17393  [pdf, other

    cs.CL cs.AI

    Answering Unanswered Questions through Semantic Reformulations in Spoken QA

    Authors: Pedro Faustini, Zhiyu Chen, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Spoken Question Answering (QA) is a key feature of voice assistants, usually backed by multiple QA systems. Users ask questions via spontaneous speech which can contain disfluencies, errors, and informal syntax or phrasing. This is a major challenge in QA, causing unanswered questions or irrelevant answers, and leading to bad user experiences. We analyze failed QA requests to identify core challen… ▽ More

    Submitted 3 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Industry Track

  13. arXiv:2305.14793  [pdf, other

    cs.CL

    Faithful Low-Resource Data-to-Text Generation through Cycle Training

    Authors: Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained language models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not available for specific domains, leading us to seek an unsupervised approach to… ▽ More

    Submitted 11 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 19 pages, 4 figures, ACL 2023

  14. arXiv:2305.06586  [pdf, other

    cs.CL cs.AI

    SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2)

    Authors: Besnik Fetahu, Sudipta Kar, Zhiyu Chen, Oleg Rokhlenko, Shervin Malmasi

    Abstract: We present the findings of SemEval-2023 Task 2 on Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2). Divided into 13 tracks, the task focused on methods to identify complex fine-grained named entities (like WRITTENWORK, VEHICLE, MUSICALGRP) across 12 languages, in both monolingual and multilingual scenarios, as well as noisy settings. The task used the MultiCoNER V2 dataset, compos… ▽ More

    Submitted 25 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: SemEval-2023 (co-located with ACL-2023 in Toronto, Canada)

  15. arXiv:2302.11074  [pdf, other

    cs.CL cs.AI cs.LG

    Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks

    Authors: Sudipta Kar, Giuseppe Castellucci, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Multi-Task Learning (MTL) is widely-accepted in Natural Language Processing as a standard technique for learning multiple related tasks in one model. Training an MTL model requires having the training data for all tasks available at the same time. As systems usually evolve over time, (e.g., to support new functionalities), adding a new task to an existing MTL model usually requires retraining the… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: KDD 2022

  16. arXiv:2210.15777  [pdf, other

    cs.CL cs.IR

    Reinforced Question Rewriting for Conversational Question Answering

    Authors: Zhiyu Chen, Jie Zhao, Anjie Fang, Besnik Fetahu, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Conversational Question Answering (CQA) aims to answer questions contained within dialogues, which are not easily interpretable without context. Develo** a model to rewrite conversational questions into self-contained ones is an emerging solution in industry settings as it allows using existing single-turn QA systems to avoid training a CQA model from scratch. Previous work trains rewriting mode… ▽ More

    Submitted 31 October, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: A cleaned version of our paper Accepted by EMNLP 2022 (Industry Track)

  17. arXiv:2209.06321  [pdf, other

    cs.CL cs.AI cs.HC

    Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

    Authors: Anna Gottardi, Osman Ipek, Giuseppe Castellucci, Shui Hu, Lavina Vaz, Yao Lu, Anju Khatri, Anjali Chadha, Desheng Zhang, Sattvik Sahai, Prerna Dwivedi, Hangjie Shi, Lucy Hu, Andy Huang, Luke Dai, Bofei Yang, Varun Somani, Pankaj Rajan, Ron Rezac, Michael Johnston, Savanna Stiff, Leslie Ball, David Carmel, Yang Liu, Dilek Hakkani-Tur , et al. (5 additional authors not shown)

    Abstract: Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as co… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 14 pages, Proceedings of Alexa Prize Taskbot (Alexa Prize 2021)

    ACM Class: I.2.7; J.0; H.5.1; H.5.2

  18. arXiv:2208.14536  [pdf, other

    cs.CL

    MultiCoNER: A Large-scale Multilingual dataset for Complex Named Entity Recognition

    Authors: Shervin Malmasi, Anjie Fang, Besnik Fetahu, Sudipta Kar, Oleg Rokhlenko

    Abstract: We present MultiCoNER, a large multilingual dataset for Named Entity Recognition that covers 3 domains (Wiki sentences, questions, and search queries) across 11 languages, as well as multilingual and code-mixing subsets. This dataset is designed to represent contemporary challenges in NER, including low-context scenarios (short and uncased text), syntactically complex entities like movie titles, a… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at COLING 2022

  19. arXiv:1712.02838  [pdf, ps, other

    cs.AI cs.CL cs.LG

    End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient

    Authors: Li Zhou, Kevin Small, Oleg Rokhlenko, Charles Elkan

    Abstract: Learning a goal-oriented dialog policy is generally performed offline with supervised learning algorithms or online with reinforcement learning (RL). Additionally, as companies accumulate massive quantities of dialog transcripts between customers and trained human agents, encoder-decoder methods have gained popularity as agent utterances can be directly treated as supervision without the need for… ▽ More

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: Workshop on Conversational AI, NIPS 2017, Long Beach, CA, USA

  20. arXiv:1406.2431  [pdf, other

    cs.IR cs.LG

    Budget-Constrained Item Cold-Start Handling in Collaborative Filtering Recommenders via Optimal Design

    Authors: Oren Anava, Shahar Golan, Nadav Golbandi, Zohar Karnin, Ronny Lempel, Oleg Rokhlenko, Oren Somekh

    Abstract: It is well known that collaborative filtering (CF) based recommender systems provide better modeling of users and items associated with considerable rating history. The lack of historical ratings results in the user and the item cold-start problems. The latter is the main focus of this work. Most of the current literature addresses this problem by integrating content-based recommendation technique… ▽ More

    Submitted 20 September, 2016; v1 submitted 10 June, 2014; originally announced June 2014.

    Comments: 11 pages, 2 figures

    MSC Class: 62K05