Skip to main content

Showing 1–20 of 20 results for author: Schnabel, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01633  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots

    Authors: Christine Herlihy, Jennifer Neville, Tobias Schnabel, Adith Swaminathan

    Abstract: We explore the use of Large Language Model (LLM-based) chatbots to power recommender systems. We observe that the chatbots respond poorly when they encounter under-specified requests (e.g., they make incorrect assumptions, hedge with a long response, or refuse to answer). We conjecture that such miscalibrated response tendencies (i.e., conversational priors) can be attributed to LLM fine-tuning us… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Preprint of UAI'24 conference publication

  2. arXiv:2404.02319  [pdf, other

    cs.CL cs.AI cs.LG

    Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization

    Authors: Tobias Schnabel, Jennifer Neville

    Abstract: In many modern LLM applications, such as retrieval augmented generation, prompts have become programs themselves. In these settings, prompt programs are repeatedly called with different user queries or data instances. A big practical challenge is optimizing such prompt programs. Recent work has mostly focused on either simple prompt programs or assumed that the general structure of a prompt progra… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2305.01801  [pdf, other

    cs.IR cs.AI cs.LG

    When Newer is Not Better: Does Deep Learning Really Benefit Recommendation From Implicit Feedback?

    Authors: Yushun Dong, Jundong Li, Tobias Schnabel

    Abstract: In recent years, neural models have been repeatedly touted to exhibit state-of-the-art performance in recommendation. Nevertheless, multiple recent studies have revealed that the reported state-of-the-art results of many neural recommendation models cannot be reliably replicated. A primary reason is that existing evaluations are performed under various inconsistent protocols. Correspondingly, thes… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at SIGIR 2023

  4. arXiv:2211.06365  [pdf, other

    cs.IR cs.AI cs.LG

    Situating Recommender Systems in Practice: Towards Inductive Learning and Incremental Updates

    Authors: Tobias Schnabel, Mengting Wan, Longqi Yang

    Abstract: With information systems becoming larger scale, recommendation systems are a topic of growing interest in machine learning research and industry. Even though progress on improving model design has been rapid in research, we argue that many advances fail to translate into practice because of two limiting assumptions. First, most approaches focus on a transductive learning setting which cannot handl… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  5. arXiv:2211.01261  [pdf, other

    cs.IR cs.AI

    Where Do We Go From Here? Guidelines For Offline Recommender Evaluation

    Authors: Tobias Schnabel

    Abstract: Various studies in recent years have pointed out large issues in the offline evaluation of recommender systems, making it difficult to assess whether true progress has been made. However, there has been little research into what set of practices should serve as a starting point during experimentation. In this paper, we examine four larger issues in recommender system research regarding uncertainty… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 8 pages

  6. arXiv:2207.05772  [pdf, ps, other

    cs.IR

    EvalRS: a Rounded Evaluation of Recommender Systems

    Authors: Jacopo Tagliabue, Federico Bianchi, Tobias Schnabel, Giuseppe Attanasio, Ciro Greco, Gabriel de Souza P. Moreira, Patrick John Chia

    Abstract: Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces. However, research focused almost exclusively on the ability of RSs to produce accurate item rankings while giving little attention to the evaluation of RS behavior in real-world scenarios. Such narrow f… ▽ More

    Submitted 12 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: CIKM 2022 Data Challenge Paper

  7. arXiv:2202.02427  [pdf, other

    cs.LG

    Lightweight Compositional Embeddings for Incremental Streaming Recommendation

    Authors: Mengyue Hang, Tobias Schnabel, Longqi Yang, Jennifer Neville

    Abstract: Most work in graph-based recommender systems considers a {\em static} setting where all information about test nodes (i.e., users and items) is available upfront at training time. However, this static setting makes little sense for many real-world applications where data comes in continuously as a stream of new edges and nodes, and one has to update model predictions incrementally to reflect the l… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  8. arXiv:2111.09525  [pdf, other

    cs.CL

    SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

    Authors: Philippe Laban, Tobias Schnabel, Paul N. Bennett, Marti A. Hearst

    Abstract: In the summarization domain, a key requirement for summaries is to be factually consistent with the input document. Previous work has found that natural language inference (NLI) models do not perform competitively when applied to inconsistency detection. In this work, we revisit the use of NLI for inconsistency detection, finding that past work suffered from a mismatch in input granularity between… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: TACL pre-MIT Press publication version; 11 pages, 2 figures, 5 tables

  9. arXiv:2107.03444  [pdf, other

    cs.CL

    Keep it Simple: Unsupervised Simplification of Multi-Paragraph Text

    Authors: Philippe Laban, Tobias Schnabel, Paul Bennett, Marti A. Hearst

    Abstract: This work presents Keep it Simple (KiS), a new approach to unsupervised text simplification which learns to balance a reward across three properties: fluency, salience and simplicity. We train the model with a novel algorithm to optimize the reward (k-SCST), in which the model proposes several candidate simplifications, computes each candidate's reward, and encourages candidates that outperform th… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted at ACL-IJCNLP 2021, 14 pages, 7 figures

    Journal ref: Association for Computational Linguistics (2021)

  10. arXiv:1905.12495  [pdf, other

    stat.ML cs.LG econ.EM

    Deep Generalized Method of Moments for Instrumental Variable Analysis

    Authors: Andrew Bennett, Nathan Kallus, Tobias Schnabel

    Abstract: Instrumental variable analysis is a powerful tool for estimating causal effects when randomization or full control of confounders is not possible. The application of standard methods such as 2SLS, GMM, and more recent variants are significantly impeded when the causal effects are complex, the instruments are high-dimensional, and/or the treatment is high-dimensional. In this paper, we propose the… ▽ More

    Submitted 18 April, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Journal ref: Advances in Neural Information Processing Systems 32 (2019) 3564--3574

  11. arXiv:1802.07578  [pdf, other

    cs.HC cs.IR

    Improving Recommender Systems Beyond the Algorithm

    Authors: Tobias Schnabel, Paul N. Bennett, Thorsten Joachims

    Abstract: Recommender systems rely heavily on the predictive accuracy of the learning algorithm. Most work on improving accuracy has focused on the learning algorithm itself. We argue that this algorithmic focus is myopic. In particular, since learning algorithms generally improve with more and better data, we propose sha** the feedback generation process as an alternate and complementary route to improvi… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  12. arXiv:1706.08184  [pdf, other

    cs.SI cs.IR

    A preference elicitation interface for collecting dense recommender datasets with rich user information

    Authors: Pantelis P. Analytis, Tobias Schnabel, Stefan Herzog, Daniel Barkoczi, Thorsten Joachims

    Abstract: We present an interface that can be leveraged to quickly and effortlessly elicit people's preferences for visual stimuli, such as photographs, visual art and screensavers, along with rich side-information about its users. We plan to employ the new interface to collect dense recommender datasets that will complement existing sparse industry-scale datasets. The new interface and the collected datase… ▽ More

    Submitted 26 June, 2017; v1 submitted 25 June, 2017; originally announced June 2017.

    Comments: 2 pages

  13. Effective Evaluation using Logged Bandit Feedback from Multiple Loggers

    Authors: Aman Agarwal, Soumya Basu, Tobias Schnabel, Thorsten Joachims

    Abstract: Accurately evaluating new policies (e.g. ad-placement models, ranking functions, recommendation functions) is one of the key prerequisites for improving interactive systems. While the conventional approach to evaluation relies on online A/B tests, recent work has shown that counterfactual estimators can provide an inexpensive and fast alternative, since they can be applied offline using log data t… ▽ More

    Submitted 26 June, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: KDD 2018

  14. arXiv:1608.04468  [pdf, other

    cs.IR cs.LG

    Unbiased Learning-to-Rank with Biased Feedback

    Authors: Thorsten Joachims, Adith Swaminathan, Tobias Schnabel

    Abstract: Implicit feedback (e.g., clicks, dwell times, etc.) is an abundant source of data in human-interactive systems. While implicit feedback has many advantages (e.g., it is inexpensive to collect, user centric, and timely), its inherent biases are a key obstacle to its effective use. For example, position bias in search rankings strongly influences how many clicks a result receives, so that directly u… ▽ More

    Submitted 15 August, 2016; originally announced August 2016.

  15. arXiv:1604.07209  [pdf, ps, other

    cs.IR cs.LG

    Unbiased Comparative Evaluation of Ranking Functions

    Authors: Tobias Schnabel, Adith Swaminathan, Peter Frazier, Thorsten Joachims

    Abstract: Eliciting relevance judgments for ranking evaluation is labor-intensive and costly, motivating careful selection of which documents to judge. Unlike traditional approaches that make this selection deterministically, probabilistic sampling has shown intriguing promise since it enables the design of estimators that are provably unbiased even when reusing data with missing judgments. In this paper, w… ▽ More

    Submitted 25 April, 2016; originally announced April 2016.

    Comments: Under review; 10 pages

  16. arXiv:1604.00502  [pdf, other

    cs.CL

    Online Updating of Word Representations for Part-of-Speech Tagging

    Authors: Wenpeng Yin, Tobias Schnabel, Hinrich Schütze

    Abstract: We propose online unsupervised domain adaptation (DA), which is performed incrementally as data comes in and is applicable when batch DA is not possible. In a part-of-speech (POS) tagging evaluation, we find that online unsupervised DA performs as well as batch DA.

    Submitted 2 April, 2016; originally announced April 2016.

    Comments: EMNLP'2015. Released POS tagger "FLORS" for online domain adaptation

  17. arXiv:1602.05352  [pdf, other

    cs.LG cs.AI cs.IR

    Recommendations as Treatments: Debiasing Learning and Evaluation

    Authors: Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, Thorsten Joachims

    Abstract: Most data for evaluating and training recommender systems is subject to selection biases, either through self-selection by the users or through the actions of the recommendation system itself. In this paper, we provide a principled approach to handling selection biases, adapting models and estimation techniques from causal inference. The approach leads to unbiased performance estimators despite bi… ▽ More

    Submitted 26 May, 2016; v1 submitted 17 February, 2016; originally announced February 2016.

    Comments: 10 pages in ICML 2016

  18. arXiv:1511.02024  [pdf, other

    cs.LG cs.CL

    Towards a Better Understanding of Predict and Count Models

    Authors: S. Sathiya Keerthi, Tobias Schnabel, Rajiv Khanna

    Abstract: In a recent paper, Levy and Goldberg pointed out an interesting connection between prediction-based word embedding models and count models based on pointwise mutual information. Under certain conditions, they showed that both models end up optimizing equivalent objective functions. This paper explores this connection in more detail and lays out the factors leading to differences between these mode… ▽ More

    Submitted 6 November, 2015; originally announced November 2015.

    Comments: 17 pages

  19. arXiv:1510.07545  [pdf, other

    cs.HC cs.IR cs.LG

    Using Shortlists to Support Decision Making and Improve Recommender System Performance

    Authors: Tobias Schnabel, Paul N. Bennett, Susan T. Dumais, Thorsten Joachims

    Abstract: In this paper, we study shortlists as an interface component for recommender systems with the dual goal of supporting the user's decision process, as well as improving implicit feedback elicitation for increased recommendation quality. A shortlist is a temporary list of candidates that the user is currently considering, e.g., a list of a few movies the user is currently considering for viewing. Fr… ▽ More

    Submitted 8 February, 2016; v1 submitted 26 October, 2015; originally announced October 2015.

    Comments: 11 pages in WWW 2016

  20. arXiv:1304.6213  [pdf, other

    cs.CV

    Counting people from above: Airborne video based crowd analysis

    Authors: Roland Perko, Thomas Schnabel, Gerald Fritz, Alexander Almer, Lucas Paletta

    Abstract: Crowd monitoring and analysis in mass events are highly important technologies to support the security of attending persons. Proposed methods based on terrestrial or airborne image/video data often fail in achieving sufficiently accurate results to guarantee a robust service. We present a novel framework for estimating human count, density and motion from video data based on custom tailored object… ▽ More

    Submitted 23 April, 2013; originally announced April 2013.

    Comments: Part of the OAGM/AAPR 2013 proceedings (arXiv:1304.1876)

    Report number: OAGM-AAPR/2013/15