Showing 1–10 of 10 results for author: Chia, J

  1. arXiv:2402.05863  [pdf, other]

    cs.AI cs.CL cs.GT

    How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis

    Authors: Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, James Zou

    Abstract: Negotiation is the basis of social interactions; humans negotiate everything from the price of cars to how to share common resources. With rapidly growing interest in using large language models (LLMs) to act as agents on behalf of human users, such LLM agents would also need to be able to negotiate. In this paper, we study how well LLMs can negotiate with each other. We develop NegotiationArena:…

    Submitted 8 February, 2024; originally announced February 2024.

  2. arXiv:2304.10621  [pdf, other]

    cs.IR

    E Pluribus Unum: Guidelines on Multi-Objective Evaluation of Recommender Systems

    Authors: Patrick John Chia, Giuseppe Attanasio, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Gabriel de Souza P. Moreira, Davide Eynard, Fahd Husain

    Abstract: Recommender Systems today are still mostly evaluated in terms of accuracy, with other aspects beyond the immediate relevance of recommendations, such as diversity, long-term user retention and fairness, often taking a back seat. Moreover, reconciling multiple performance perspectives is by definition indeterminate, presenting a stumbling block to those in the pursuit of rounded evaluation of Recom…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 15 pages, under submission

  3. arXiv:2304.07145  [pdf, ps, other]

    cs.IR cs.CY

    EvalRS 2023. Well-Rounded Recommender Systems For Real-World Deployments

    Authors: Federico Bianchi, Patrick John Chia, Ciro Greco, Claudio Pomo, Gabriel Moreira, Davide Eynard, Fahd Husain, Jacopo Tagliabue

    Abstract: EvalRS aims to bring together practitioners from industry and academia to foster a debate on rounded evaluation of recommender systems, with a focus on real-world impact across a multitude of deployment scenarios. Recommender systems are often evaluated only through accuracy metrics, which fall short of fully characterizing their generalization capabilities and miss important aspects, such as fair…

    Submitted 22 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: EvalRS 2023 is a workshop at KDD23. Code and hackathon materials: https://github.com/RecList/evalRS-KDD-2023

  4. arXiv:2207.05772  [pdf, ps, other]

    cs.IR

    EvalRS: a Rounded Evaluation of Recommender Systems

    Authors: Jacopo Tagliabue, Federico Bianchi, Tobias Schnabel, Giuseppe Attanasio, Ciro Greco, Gabriel de Souza P. Moreira, Patrick John Chia

    Abstract: Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces. However, research focused almost exclusively on the ability of RSs to produce accurate item rankings while giving little attention to the evaluation of RS behavior in real-world scenarios. Such narrow f…

    Submitted 12 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: CIKM 2022 Data Challenge Paper

  5. arXiv:2204.03972  [pdf, other]

    cs.IR cs.CL

    Contrastive language and vision learning of general fashion concepts

    Authors: Patrick John Chia, Giuseppe Attanasio, Federico Bianchi, Silvia Terragni, Ana Rita Magalhães, Diogo Goncalves, Ciro Greco, Jacopo Tagliabue

    Abstract: The steady rise of online shopping goes hand in hand with the development of increasingly complex ML and NLP models. While most use cases are cast as specialized supervised learning problems, we argue that practitioners would greatly benefit from more transferable representations of products. In this work, we build on recent developments in contrastive learning to train FashionCLIP, a CLIP-like mo…

    Submitted 18 April, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Latest version available at https://www.nature.com/articles/s41598-022-23052-9; model available at https://huggingface.co/patrickjohncyh/fashion-clip

  6. arXiv:2204.02473  [pdf, other]

    cs.IR cs.AI

    "Does it come in black?" CLIP-like models are zero-shot recommenders

    Authors: Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Diogo Goncalves

    Abstract: Product discovery is a crucial component for online shopping. However, item-to-item recommendations today do not allow users to explore changes along selected dimensions: given a query item, can a model suggest something similar but in a different color? We consider item recommendations of the comparative nature (e.g. "something darker") and show how CLIP-based models can support this use case in…

    Submitted 11 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted at ACL 2022 (ECNLP)

  7. arXiv:2112.00219  [pdf, other]

    cs.CV cs.RO

    Scalable Primitives for Generalized Sensor Fusion in Autonomous Vehicles

    Authors: Sammy Sidhu, Linda Wang, Tayyab Naseer, Ashish Malhotra, Jay Chia, Aayush Ahuja, Ella Rasmussen, Qiangui Huang, Ray Gao

    Abstract: In autonomous driving, there has been an explosion in the use of deep neural networks for perception, prediction and planning tasks. As autonomous vehicles (AVs) move closer to production, multi-modal sensor inputs and heterogeneous vehicle fleets with different sets of sensor platforms are becoming increasingly common in the industry. However, neural network architectures typically target specifi…

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: Presented in Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia. 11 pages, 8 figures

  8. arXiv:2111.09963  [pdf, other]

    cs.IR cs.AI cs.LG

    Beyond NDCG: behavioral testing of recommender systems with RecList

    Authors: Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Chloe He, Brian Ko

    Abstract: As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points. However, real-world behavior is undoubtedly nuanced: ad hoc error analysis and deployment-specific tests must be employed to ensure the desired quality in actual deployments. In this paper, we propose RecList, a behavioral-based testing methodology. Rec…

    Submitted 27 March, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Paper accepted to the WebConf 2022

  9. arXiv:2107.03256  [pdf, other]

    cs.IR cs.LG

    "Are you sure?": Preliminary Insights from Scaling Product Comparisons to Multiple Shops

    Authors: Patrick John Chia, Bingqing Yu, Jacopo Tagliabue

    Abstract: Large eCommerce players introduced comparison tables as a new type of recommendations. However, building comparisons at scale without pre-existing training/taxonomy data remains an open challenge, especially within the operational constraints of shops in the long tail. We present preliminary results from building a comparison pipeline designed to scale in a multi-shop scenario: we describe our des…

    Submitted 8 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at SIGIR eCom 2021

  10. arXiv:2104.09423  [pdf, ps, other]

    cs.IR

    SIGIR 2021 E-Commerce Workshop Data Challenge

    Authors: Jacopo Tagliabue, Ciro Greco, Jean-Francis Roy, Bingqing Yu, Patrick John Chia, Federico Bianchi, Giovanni Cassani

    Abstract: The 2021 SIGIR workshop on eCommerce is hosting the Coveo Data Challenge for "In-session prediction for purchase intent and recommendations". The challenge addresses the growing need for reliable predictions within the boundaries of a shopping session, as customer intentions can be different depending on the occasion. The need for efficient procedures for personalization is even clearer if we cons…

    Submitted 16 July, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: SIGIR eCOM 2021 Data Challenge