Learning from Negative User Feedback and Measuring Responsiveness for Sequential Recommenders
Authors:
Yueqi Wang,
Yoni Halpern,
Shuo Chang,
**gchen Feng,
Elaine Ya Le,
Longfei Li,
Xujian Liang,
Min-Cheng Huang,
Shane Li,
Alex Beutel,
Ya** Zhang,
Shuchao Bi
Abstract:
Sequential recommenders have been widely used in industry due to their strength in modeling user preferences. While these models excel at learning a user's positive interests, less attention has been paid to learning from negative user feedback. Negative user feedback is an important lever of user control, and comes with an expectation that recommenders should respond quickly and reduce similar recommendations to the user. However, negative feedback signals are often ignored in the training objective of sequential retrieval models, which primarily aim at predicting positive user interactions. In this work, we incorporate explicit and implicit negative user feedback into the training objective of sequential recommenders in the retrieval stage using a "not-to-recommend" loss function that optimizes for the log-likelihood of not recommending items with negative feedback. We demonstrate the effectiveness of this approach using live experiments on a large-scale industrial recommender system. Furthermore, we address a challenge in measuring recommender responsiveness to negative feedback by developing a counterfactual simulation framework to compare recommender responses between different user actions, showing improved responsiveness from the modeling change.
Submitted 23 August, 2023;
originally announced August 2023.
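Illustrative sketch (not the authors' implementation): the abstract describes a "not-to-recommend" loss that optimizes the log-likelihood of not recommending items with negative feedback. Assuming a softmax over the item corpus, one plausible reading is to use -log p for positive labels and -log(1 - p) for negative-feedback labels. The function names, array shapes, and the `eps` clipping below are assumptions made for this example.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def not_to_recommend_loss(logits, labels, is_negative, eps=1e-12):
    """Mean loss over a batch of (user state, item) training examples.

    logits:      (batch, num_items) retrieval scores (hypothetical shape)
    labels:      (batch,) item id of each interaction
    is_negative: (batch,) True where the interaction is negative feedback
    """
    probs = softmax(logits)
    # Probability the model assigns to recommending the labeled item.
    p = np.clip(probs[np.arange(len(labels)), labels], eps, 1.0 - eps)
    positive_term = -np.log(p)        # push p up for positive labels
    negative_term = -np.log(1.0 - p)  # push p down for negative labels
    return np.where(is_negative, negative_term, positive_term).mean()


# Usage: lowering the score of a disliked item lowers the loss.
labels = np.array([2])
is_negative = np.array([True])
before = not_to_recommend_loss(np.zeros((1, 4)), labels, is_negative)
after = not_to_recommend_loss(np.array([[0.0, 0.0, -5.0, 0.0]]), labels, is_negative)
print(before > after)
```

Under this reading, negative-feedback examples contribute a gradient that reduces the score of the disliked item, rather than being dropped from the objective.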
Long-Term Value of Exploration: Measurements, Findings and Algorithms
Authors:
Yi Su,
Xiangyu Wang,
Elaine Ya Le,
Liang Liu,
Yuening Li,
Haokai Lu,
Benjamin Lipshitz,
Sriraj Badam,
Lukasz Heldt,
Shuchao Bi,
Ed Chi,
Cristos Goodrow,
Su-Lin Wu,
Lexi Baugher,
Minmin Chen
Abstract:
Effective exploration is believed to positively influence the long-term user experience on recommendation platforms. Determining its exact benefits, however, has been challenging. Regular A/B tests on exploration often measure neutral or even negative engagement metrics while failing to capture its long-term benefits. We here introduce new experiment designs to formally quantify the long-term value of exploration by examining its effects on the content corpus and connecting content corpus growth to the long-term user experience in real-world experiments. Having established the value of exploration, we investigate the Neural Linear Bandit algorithm as a general framework to introduce exploration into any deep-learning-based ranking system. We conduct live experiments on one of the largest short-form video recommendation platforms, which serves billions of users, to validate the new experiment designs, quantify the long-term value of exploration, and verify the effectiveness of the adopted Neural Linear Bandit algorithm for exploration.
Submitted 25 February, 2024; v1 submitted 12 May, 2023;
originally announced May 2023.
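Illustrative sketch (not the authors' implementation): a neural linear bandit is commonly described as Bayesian linear regression on the last-layer representation of a neural network, with exploration driven by posterior uncertainty. The abstract does not say how exploration is drawn from the posterior, so Thompson sampling here is an assumption, as are the class name, the Gaussian prior, and the fixed noise variance. The feature vectors stand in for embeddings from a trained ranking model.

```python
import numpy as np


class NeuralLinearBandit:
    """Bayesian linear regression head over fixed neural features (hypothetical)."""

    def __init__(self, dim, prior_precision=1.0, noise_var=1.0, seed=0):
        self.precision = prior_precision * np.eye(dim)  # posterior precision matrix
        self.b = np.zeros(dim)                          # precision-weighted reward sum
        self.noise_var = noise_var
        self.rng = np.random.default_rng(seed)

    def update(self, features, reward):
        """Fold one (last-layer features, observed reward) pair into the posterior."""
        self.precision += np.outer(features, features) / self.noise_var
        self.b += features * reward / self.noise_var

    def posterior(self):
        """Return the posterior mean and covariance of the linear weights."""
        cov = np.linalg.inv(self.precision)
        return cov @ self.b, cov

    def select(self, candidate_features):
        """Thompson sampling: sample weights, then pick the best-scoring candidate."""
        mean, cov = self.posterior()
        sampled_weights = self.rng.multivariate_normal(mean, cov)
        return int(np.argmax(candidate_features @ sampled_weights))


# Usage: two candidates with one-hot features; the first always earns reward 1.
bandit = NeuralLinearBandit(dim=2)
candidates = np.eye(2)
for _ in range(200):
    bandit.update(candidates[0], 1.0)
    bandit.update(candidates[1], 0.0)
choice = bandit.select(candidates)
```

Because only the linear head is Bayesian, the posterior update stays closed-form, which is one reason this kind of design can be layered onto an existing deep ranking model.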