Showing 1–2 of 2 results for author: Kohlhoff, K J

Searching in archive cs.
  1. arXiv:2312.10240  [pdf, other]

    cs.CV

    Rich Human Feedback for Text-to-Image Generation

    Authors: Youwei Liang, Junfeng He, Gang Li, Peizhao Li, Arseniy Klimovskiy, Nicholas Carolan, Jiao Sun, Jordi Pont-Tuset, Sarah Young, Feng Yang, Junjie Ke, Krishnamurthy Dj Dvijotham, Katie Collins, Yiwen Luo, Yang Li, Kai J Kohlhoff, Deepak Ramachandran, Vidhya Navalpakkam

    Abstract: Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen have made significant progress in generating high-resolution images based on text descriptions. However, many generated images still suffer from issues such as artifacts/implausibility, misalignment with text descriptions, and low aesthetic quality. Inspired by the success of Reinforcement Learning with Human Feedback…

    Submitted 8 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR'24

  2. arXiv:2312.10175  [pdf, other]

    cs.CV

    ALOHA: from Attention to Likes -- a unified mOdel for understanding HumAn responses to diverse visual content

    Authors: Peizhao Li, Junfeng He, Gang Li, Rachit Bhargava, Shaolei Shen, Nachiappan Valliappan, Youwei Liang, Hongxiang Gu, Venky Ramachandran, Golnaz Farhadi, Yang Li, Kai J Kohlhoff, Vidhya Navalpakkam

    Abstract: Progress in human behavior modeling involves understanding both implicit, early-stage perceptual behavior such as human attention and explicit, later-stage behavior such as subjective preferences/likes. Yet, most prior research has focused on modeling implicit and explicit human behavior in isolation; and often limited to a specific type of visual content. Can we build a unified model of human att…

    Submitted 4 July, 2024; v1 submitted 15 December, 2023; originally announced December 2023.