Skip to main content

Showing 1–12 of 12 results for author: Steck, H

.
  1. arXiv:2405.12119  [pdf, other

    cs.IR cs.AI cs.CL

    Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation

    Authors: Zhankui He, Zhouhang Xie, Harald Steck, Dawen Liang, Rahul Jha, Nathan Kallus, Julian McAuley

    Abstract: Large language models (LLMs) are revolutionizing conversational recommender systems by adeptly indexing item content, understanding complex conversational contexts, and generating relevant item titles. However, controlling the distribution of recommended items remains a challenge. This leads to suboptimal performance due to the failure to capture rapidly changing data distributions, such as item p… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  2. Is Cosine-Similarity of Embeddings Really About Similarity?

    Authors: Harald Steck, Chaitanya Ekanadham, Nathan Kallus

    Abstract: Cosine-similarity is the cosine of the angle between two vectors, or equivalently the dot product between their normalizations. A popular application is to quantify semantic similarity between high-dimensional objects by applying cosine-similarity to a learned low-dimensional feature embedding. This can work better but sometimes also worse than the unnormalized dot-product between embedded vectors… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 9 pages

    Journal ref: ACM Web Conference 2024 (WWW 2024 Companion)

  3. Large Language Models as Zero-Shot Conversational Recommenders

    Authors: Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian McAuley

    Abstract: In this paper, we present empirical studies on conversational recommendation tasks using representative large language models in a zero-shot setting with three primary contributions. (1) Data: To gain insights into model behavior in "in-the-wild" conversational recommendation scenarios, we construct a new dataset of recommendation-related conversations by scra** a popular discussion website. Thi… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted as CIKM 2023 long paper. Longer version is coming soon (e.g., more details about dataset)

  4. arXiv:2110.11402  [pdf, other

    cs.LG

    On the Regularization of Autoencoders

    Authors: Harald Steck, Dario Garcia Garcia

    Abstract: While much work has been devoted to understanding the implicit (and explicit) regularization of deep nonlinear networks in the supervised setting, this paper focuses on unsupervised learning, i.e., autoencoders are trained with the objective of reproducing the output from the input. We extend recent results [** et al. 2021] on unconstrained linear models and apply them to (1) nonlinear autoencode… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 10 pages

  5. arXiv:1910.09645  [pdf, ps, other

    cs.IR cs.LG stat.ML

    Markov Random Fields for Collaborative Filtering

    Authors: Harald Steck

    Abstract: In this paper, we model the dependencies among the items that are recommended to a user in a collaborative-filtering problem via a Gaussian Markov Random Field (MRF). We build upon Besag's auto-normal parameterization and pseudo-likelihood, which not only enables computationally efficient learning, but also connects the areas of MRFs and sparse inverse covariance estimation with autoencoders and n… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 9 pages

    Journal ref: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  6. arXiv:1905.03375  [pdf, other

    cs.IR cs.LG stat.ML

    Embarrassingly Shallow Autoencoders for Sparse Data

    Authors: Harald Steck

    Abstract: Combining simple elements from the literature, we define a linear model that is geared toward sparse data, in particular implicit feedback data for recommender systems. We show that its training objective has a closed-form solution, and discuss the resulting conceptual insights. Surprisingly, this simple model achieves better ranking accuracy than various state-of-the-art collaborative-filtering a… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.

    Comments: In the proceedings of the Web Conference (WWW) 2019 (7 pages)

  7. arXiv:1904.13033  [pdf, ps, other

    cs.IR cs.LG

    Collaborative Filtering via High-Dimensional Regression

    Authors: Harald Steck

    Abstract: While the SLIM approach obtained high ranking-accuracy in many experiments in the literature, it is also known for its high computational cost of learning its parameters from data. For this reason, we focus in this paper on variants of high-dimensional regression problems that have closed-form solutions. Moreover, we motivate a re-scaling rather than a re-weighting approach for dealing with biases… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: 10 pages

  8. arXiv:1301.3894  [pdf

    cs.AI

    On the Use of Skeletons when Learning in Bayesian Networks

    Authors: Harald Steck

    Abstract: In this paper, we present a heuristic operator which aims at simultaneously optimizing the orientations of all the edges in an intermediate Bayesian network structure during the search process. This is done by alternating between the space of directed acyclic graphs (DAGs) and the space of skeletons. The found orientations of the edges are based on a scoring function rather than on induced con… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-558-565

  9. arXiv:1301.0602  [pdf

    cs.LG stat.ML

    Unsupervised Active Learning in Large Domains

    Authors: Harald Steck, Tommi S. Jaakkola

    Abstract: Active learning is a powerful approach to analyzing data effectively. We show that the feasibility of active learning depends crucially on the choice of measure with respect to which the query is being optimized. The standard information gain, for example, does not permit an accurate evaluation with a small committee, a representative subset of the model space. We propose a surrogate measure requ… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-469-476

  10. arXiv:1206.6871  [pdf

    cs.LG stat.ML

    Ranking by Dependence - A Fair Criteria

    Authors: Harald Steck

    Abstract: Estimating the dependences between random variables, and ranking them accordingly, is a prevalent problem in machine learning. Pursuing frequentist and information-theoretic approaches, we first show that the p-value and the mutual information can fail even in simplistic situations. We then propose two conditions for regularizing an estimator of dependence, which leads to a simple yet effective ne… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-477-484

  11. arXiv:1206.3287  [pdf

    cs.LG stat.ME stat.ML

    Learning the Bayesian Network Structure: Dirichlet Prior versus Data

    Authors: Harald Steck

    Abstract: In the Bayesian approach to structure learning of graphical models, the equivalent sample size (ESS) in the Dirichlet prior over the model parameters was recently shown to have an important effect on the maximum-a-posteriori estimate of the Bayesian network structure. In our first contribution, we theoretically analyze the case of large ESS-values, which complements previous work: among other resu… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-511-518

  12. arXiv:quant-ph/9708014  [pdf, ps, other

    quant-ph cond-mat.stat-mech physics.atom-ph

    Output of a pulsed atom laser

    Authors: H. Steck, M. Naraschewski, H. Wallis

    Abstract: We study the output properties of a pulsed atom laser consisting of an interacting Bose-Einstein condensate (BEC) in a magnetic trap and an additional rf field transferring atoms to an untrapped Zeeman sublevel. For weak output coupling we calculate the dynamics of the decaying condensate population, of its chemical potential and the velocity of the output atoms analytically.

    Submitted 8 August, 1997; v1 submitted 7 August, 1997; originally announced August 1997.

    Comments: 4 pages, RevTeX. Full ps file available on http://mpqibmr1.mpq.mpg.de:5000/~man/

    Journal ref: Phys.Rev.Lett.80:1-5,1998