Skip to main content

Showing 1–14 of 14 results for author: Milli, S

.
  1. arXiv:2402.06831  [pdf

    cs.SI

    What We Know About Using Non-Engagement Signals in Content Ranking

    Authors: Tom Cunningham, Sana Pandey, Leif Sigerson, Jonathan Stray, Jeff Allen, Bonnie Barrilleaux, Ravi Iyer, Smitha Milli, Mohit Kothari, Behnam Rezaei

    Abstract: Many online platforms predominantly rank items by predicted user engagement. We believe that there is much unrealized potential in including non-engagement signals, which can improve outcomes both for platforms and for society as a whole. Based on a daylong workshop with experts from industry and academia, we formulate a series of propositions and document each as best we can from public evidence,… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    ACM Class: H.3.3; H.4.3

  2. arXiv:2305.17428  [pdf, other

    cs.LG

    Choosing the Right Weights: Balancing Value, Strategy, and Noise in Recommender Systems

    Authors: Smitha Milli, Emma Pierson, Nikhil Garg

    Abstract: Many recommender systems are based on optimizing a linear weighting of different user behaviors, such as clicks, likes, shares, etc. Though the choice of weights can have a significant impact, there is little formal study or guidance on how to choose them. We analyze the optimal choice of weights from the perspectives of both users and content producers who strategically respond to the weights. We… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  3. arXiv:2305.16941  [pdf, other

    cs.SI cs.CY

    Engagement, User Satisfaction, and the Amplification of Divisive Content on Social Media

    Authors: Smitha Milli, Micah Carroll, Yike Wang, Sashrika Pandey, Sebastian Zhao, Anca D. Dragan

    Abstract: In a pre-registered randomized experiment, we found that, relative to a reverse-chronological baseline, Twitter's engagement-based ranking algorithm amplifies emotionally charged, out-group hostile content that users say makes them feel worse about their political out-group. Furthermore, we find that users do not prefer the political tweets selected by the algorithm, suggesting that the engagement… ▽ More

    Submitted 22 December, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  4. Causal Inference Struggles with Agency on Online Platforms

    Authors: Smitha Milli, Luca Belli, Moritz Hardt

    Abstract: Online platforms regularly conduct randomized experiments to understand how changes to the platform causally affect various outcomes of interest. However, experimentation on online platforms has been criticized for having, among other issues, a lack of meaningful oversight and user consent. As platforms give users greater agency, it becomes possible to conduct observational studies in which users… ▽ More

    Submitted 10 May, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Accepted to FaccT'22

  5. arXiv:2008.12623  [pdf, other

    cs.SI cs.LG stat.ML

    From Optimizing Engagement to Measuring Value

    Authors: Smitha Milli, Luca Belli, Moritz Hardt

    Abstract: Most recommendation engines today are based on predicting user engagement, e.g. predicting whether a user will click on an item or not. However, there is potentially a large gap between engagement signals and a desired notion of "value" that is worth optimizing for. We use the framework of measurement theory to (a) confront the designer with a normative question about what the designer values, (b)… ▽ More

    Submitted 19 July, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Published at FAccT'21

  6. arXiv:2002.04833  [pdf, other

    cs.LG cs.AI cs.HC cs.RO

    Reward-rational (implicit) choice: A unifying formalism for reward learning

    Authors: Hong Jun Jeon, Smitha Milli, Anca D. Dragan

    Abstract: It is often difficult to hand-specify what the correct reward function is for a task, so researchers have instead aimed to learn reward functions from human behavior or feedback. The types of behavior interpreted as evidence of the reward function have expanded greatly in recent years. We've gone from demonstrations, to comparisons, to reading into the information leaked when the human is pushing… ▽ More

    Submitted 11 December, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published at NeurIPS 2020

  7. arXiv:1912.01172  [pdf, other

    cs.LG cs.AI stat.ML

    Value-laden Disciplinary Shifts in Machine Learning

    Authors: Ravit Dotan, Smitha Milli

    Abstract: As machine learning models are increasingly used for high-stakes decision making, scholars have sought to intervene to ensure that such models do not encode undesirable social and political values. However, little attention thus far has been given to how values influence the machine learning discipline as a whole. How do values influence what the discipline focuses on and the way it develops? If u… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: Accepted to FAT* 2020

  8. arXiv:1910.10362  [pdf, other

    cs.LG stat.ML

    Strategic Classification is Causal Modeling in Disguise

    Authors: John Miller, Smitha Milli, Moritz Hardt

    Abstract: Consequential decision-making incentivizes individuals to strategically adapt their behavior to the specifics of the decision rule. While a long line of work has viewed strategic adaptation as gaming and attempted to mitigate its effects, recent work has instead sought to design classifiers that incentivize individuals to improve a desired quality. Key to both accounts is a cost function that dict… ▽ More

    Submitted 17 February, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: This paper was previously titled "Strategic Adaptation to Classifiers: A Causal Perspective." The current version subsumes all previous versions

  9. arXiv:1903.03877  [pdf, other

    cs.AI

    Literal or Pedagogic Human? Analyzing Human Model Misspecification in Objective Learning

    Authors: Smitha Milli, Anca D. Dragan

    Abstract: It is incredibly easy for a system designer to misspecify the objective for an autonomous system ("robot''), thus motivating the desire to have the robot learn the objective from human behavior instead. Recent work has suggested that people have an interest in the robot performing well, and will thus behave pedagogically, choosing actions that are informative to the robot. In turn, robots benefit… ▽ More

    Submitted 28 June, 2019; v1 submitted 9 March, 2019; originally announced March 2019.

    Comments: Published at UAI 2019

  10. arXiv:1808.08460  [pdf, other

    cs.LG stat.ML

    The Social Cost of Strategic Classification

    Authors: Smitha Milli, John Miller, Anca D. Dragan, Moritz Hardt

    Abstract: Consequential decision-making typically incentivizes individuals to behave strategically, tailoring their behavior to the specifics of the decision rule. A long line of work has therefore sought to counteract strategic behavior by designing more conservative decision boundaries in an effort to increase robustness to the effects of strategic covariate shift. We show that these efforts benefit the i… ▽ More

    Submitted 22 November, 2018; v1 submitted 25 August, 2018; originally announced August 2018.

  11. arXiv:1807.05185  [pdf, other

    stat.ML cs.LG

    Model Reconstruction from Model Explanations

    Authors: Smitha Milli, Ludwig Schmidt, Anca D. Dragan, Moritz Hardt

    Abstract: We show through theory and experiment that gradient-based explanations of a model quickly reveal the model itself. Our results speak to a tension between the desire to keep a proprietary model secret and the ability to offer model explanations. On the theoretical side, we give an algorithm that provably learns a two-layer ReLU network in a setting where the algorithm may query the gradient of the… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

  12. arXiv:1711.02827  [pdf, other

    cs.AI cs.LG

    Inverse Reward Design

    Authors: Dylan Hadfield-Menell, Smitha Milli, Pieter Abbeel, Stuart Russell, Anca Dragan

    Abstract: Autonomous agents optimize the reward function we give them. What they don't know is how hard it is for us to design a reward function that actually captures what we want. When designing the reward, we might think of some specific training scenarios, and make sure that the reward will lead to the right behavior in those scenarios. Inevitably, agents encounter new scenarios (e.g., new types of terr… ▽ More

    Submitted 7 October, 2020; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Advances in Neural Information Processing Systems 30 (NIPS 2017) Revised Oct 2020 to fix a typo in Eq. 3

  13. arXiv:1711.00694  [pdf, other

    cs.AI

    Interpretable and Pedagogical Examples

    Authors: Smitha Milli, Pieter Abbeel, Igor Mordatch

    Abstract: Teachers intentionally pick the most informative examples to show their students. However, if the teacher and student are neural networks, the examples that the teacher network learns to give, although effective at teaching the student, are typically uninterpretable. We show that training the student and teacher iteratively, rather than jointly, can produce interpretable teaching strategies. We ev… ▽ More

    Submitted 14 February, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

  14. arXiv:1705.09990  [pdf, other

    cs.AI

    Should Robots be Obedient?

    Authors: Smitha Milli, Dylan Hadfield-Menell, Anca Dragan, Stuart Russell

    Abstract: Intuitively, obedience -- following the order that a human gives -- seems like a good property for a robot to have. But, we humans are not perfect and we may give orders that are not best aligned to our preferences. We show that when a human is not perfectly rational then a robot that tries to infer and act according to the human's underlying preferences can always perform better than a robot that… ▽ More

    Submitted 28 May, 2017; originally announced May 2017.

    Comments: Accepted to IJCAI 2017