Skip to main content

Showing 1–25 of 25 results for author: Howard, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19593  [pdf, other

    cs.CL cs.CV

    SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

    Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

    Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.20152  [pdf, other

    cs.CV

    Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals

    Authors: Phillip Howard, Kathleen C. Fraser, Anahita Bhiwandiwalla, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2404.00166  [pdf, other

    cs.CV cs.AI

    Uncovering Bias in Large Vision-Language Models with Counterfactuals

    Authors: Phillip Howard, Anahita Bhiwandiwalla, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined… ▽ More

    Submitted 7 June, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted to the CVPR 2024 Responsible Generative AI (ReGenAI) Workshop

  4. arXiv:2312.00825  [pdf, other

    cs.CV cs.AI

    SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples

    Authors: Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Anahita Bhiwandiwalla, Vasudev Lal

    Abstract: While vision-language models (VLMs) have achieved remarkable performance improvements recently, there is growing evidence that these models also posses harmful biases with respect to social attributes such as gender and race. Prior studies have primarily focused on probing such bias attributes individually while ignoring biases associated with intersections between social attributes. This could be… ▽ More

    Submitted 9 April, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted to CVPR 2024. arXiv admin note: text overlap with arXiv:2310.02988

  5. arXiv:2311.12229  [pdf, other

    cs.AI

    NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation

    Authors: Shachar Rosenman, Vasudev Lal, Phillip Howard

    Abstract: Despite impressive recent advances in text-to-image diffusion models, obtaining high-quality images often requires prompt engineering by humans who have developed expertise in using them. In this work, we present NeuroPrompts, an adaptive framework that automatically enhances a user's prompt to improve the quality of generations produced by text-to-image models. Our framework utilizes constrained… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to EACL 2024 System Demonstration Track

  6. arXiv:2311.08505  [pdf, other

    cs.CL

    Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning

    Authors: Xin Su, Tiep Le, Steven Bethard, Phillip Howard

    Abstract: An important open question in the use of large language models for knowledge-intensive tasks is how to effectively integrate knowledge from three sources: the model's parametric memory, external structured knowledge, and external unstructured knowledge. Most existing prompting methods either rely on one or two of these sources, or require repeatedly invoking large language models to generate simil… ▽ More

    Submitted 1 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 main conference

  7. arXiv:2310.19292  [pdf, other

    cs.CL

    Fusing Temporal Graphs into Transformers for Time-Sensitive Question Answering

    Authors: Xin Su, Phillip Howard, Nagib Hakim, Steven Bethard

    Abstract: Answering time-sensitive questions from long documents requires temporal reasoning over the times in questions and documents. An important open question is whether large language models can perform such reasoning solely using a provided text document, or whether they can benefit from additional temporal information extracted using other systems. We address this research question by applying existi… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  8. arXiv:2310.02988  [pdf, other

    cs.CV cs.AI

    Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples

    Authors: Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Vasudev Lal

    Abstract: While vision-language models (VLMs) have achieved remarkable performance improvements recently, there is growing evidence that these models also posses harmful biases with respect to social attributes such as gender and race. Prior studies have primarily focused on probing such bias attributes individually while ignoring biases associated with intersections between social attributes. This could be… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  9. arXiv:2309.14356  [pdf, other

    cs.LG cs.CL cs.CV

    COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs

    Authors: Tiep Le, Vasudev Lal, Phillip Howard

    Abstract: Counterfactual examples have proven to be valuable in the field of natural language processing (NLP) for both evaluating and improving the robustness of language models to spurious correlations in datasets. Despite their demonstrated utility for NLP, multimodal counterfactual examples have been relatively unexplored due to the difficulty of creating paired image-text data with minimal counterfactu… ▽ More

    Submitted 31 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Accepted to NeurIPS 2023 Datasets and Benchmarks Track

  10. arXiv:2305.04978  [pdf, other

    cs.CL

    NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge

    Authors: Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Ye** Choi, Swabha Swayamdipta

    Abstract: Comparative knowledge (e.g., steel is stronger and heavier than styrofoam) is an essential component of our world knowledge, yet understudied in prior literature. In this paper, we harvest the dramatic improvements in knowledge capabilities of language models into a large-scale comparative knowledge base. While the ease of acquisition of such comparative knowledge is much higher from extreme-scale… ▽ More

    Submitted 5 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to NAACL 2024 Findings

  11. Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding

    Authors: Gadi Singer, Joscha Bach, Tetiana Grinberg, Nagib Hakim, Phillip Howard, Vasudev Lal, Zev Rivlin

    Abstract: While end-to-end learning systems are rapidly gaining capabilities and popularity, the increasing computational demands for deploying such systems, along with a lack of flexibility, adaptability, explainability, reasoning and verification capabilities, require new types of architectures. Here we introduce a classification of hybrid systems which, based on an analysis of human knowledge and intelli… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: Artificial General Intelligence: 15th International Conference, AGI 2022, Seattle, WA, USA, August 2022, Proceedings

    Journal ref: Springer Lecture Notes in Computer Science, vol 13539, 2023

  12. arXiv:2210.12365  [pdf, other

    cs.CL

    NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation

    Authors: Phillip Howard, Gadi Singer, Vasudev Lal, Ye** Choi, Swabha Swayamdipta

    Abstract: While counterfactual data augmentation offers a promising step towards robust generalization in natural language processing, producing a set of counterfactuals that offer valuable inductive bias for models remains a challenge. Most existing approaches for producing counterfactuals, manual or automated, rely on small perturbations via minimal edits, resulting in simplistic changes. We introduce Neu… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  13. Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs

    Authors: Phillip Howard, Arden Ma, Vasudev Lal, Ana Paula Simoes, Daniel Korat, Oren Pereg, Moshe Wasserblat, Gadi Singer

    Abstract: The extraction of aspect terms is a critical step in fine-grained sentiment analysis of text. Existing approaches for this task have yielded impressive results when the training and testing data are from the same domain. However, these methods show a drastic decrease in performance when applied to cross-domain settings where the domain of the testing data differs from that of the training data. To… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    ACM Class: I.2.7

    Journal ref: Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM 2022). Association for Computing Machinery, New York, NY, USA, 780-790

  14. arXiv:2112.05785  [pdf, ps, other

    cs.CL cs.AI cs.LG

    TempoQR: Temporal Question Reasoning over Knowledge Graphs

    Authors: Costas Mavromatis, Prasanna Lakkur Subramanyam, Vassilis N. Ioannidis, Soji Adeshina, Phillip R. Howard, Tetiana Grinberg, Nagib Hakim, George Karypis

    Abstract: Knowledge Graph Question Answering (KGQA) involves retrieving facts from a Knowledge Graph (KG) using natural language queries. A KG is a curated set of facts consisting of entities linked by relations. Certain facts include also temporal information forming a Temporal KG (TKG). Although many natural questions involve explicit or implicit time constraints, question answering (QA) over TKGs has bee… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  15. arXiv:2010.14950  [pdf

    cs.SI

    Predicting Engagement with the Internet Research Agency's Facebook and Instagram Campaigns around the 2016 U.S. Presidential Election

    Authors: Dimitra Liotsiou, Bharath Ganesh, Philip N. Howard

    Abstract: The Russian Internet Research Agency's (IRA) online interference campaign in the 2016 U.S. presidential election represents a turning point in the trajectory of democratic elections in the digital age. What can we learn about how the IRA engages U.S. audiences, ahead of the 2020 U.S. presidential election? We provide the first in-depth analysis of the relationships between IRA content characterist… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  16. arXiv:2002.12069  [pdf

    cs.SI

    Junk News & Information Sharing During the 2019 UK General Election

    Authors: Nahema Marchal, Bence Kollanyi, Lisa-Maria Neudert, Hubert Au, Philip N. Howard

    Abstract: Today, an estimated 75% of the British public access information about politics and public life online, and 40% do so via social media. With this context in mind, we investigate information sharing patterns over social media in the lead-up to the 2019 UK General Elections, and ask: (1) What type of political news and information were social media users sharing on Twitter ahead of the vote? (2) How… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  17. arXiv:1901.07920  [pdf, other

    cs.SI

    The Junk News Aggregator: Examining junk news posted on Facebook, starting with the 2018 US Midterm Elections

    Authors: Dimitra Liotsiou, Bence Kollanyi, Philip N. Howard

    Abstract: In recent years, the phenomenon of online misinformation and junk news circulating on social media has come to constitute an important and widespread problem affecting public life online across the globe, particularly around important political events such as elections. At the same time, there have been calls for more transparency around misinformation on social media platforms, as many of the mos… ▽ More

    Submitted 17 April, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

  18. arXiv:1806.00830  [pdf, ps, other

    cs.CY cs.SI

    Studying Politically Vulnerable Communities Online: Ethical Dilemmas, Questions, and Solutions

    Authors: Robert Gorwa, Philip N. Howard

    Abstract: This short article introduces the concept of political vulnerability for social media researchers. How are traditional notions of harm challenged by research subjects in politically vulnerable communities? Through a selection of case studies, we explore some of the trade-offs, challenges, and questions raised by research that seeks be robust and transparent while also preserving anonymity and priv… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: 2018 ICWSM Workshop on Exploring Ethical Trade-offs in Social Media Research, June 25, Stanford, CA, USA

  19. arXiv:1803.01845  [pdf

    cs.SI

    Polarization, Partisanship and Junk News Consumption over Social Media in the US

    Authors: Vidya Narayanan, Vlad Barash, John Kelly, Bence Kollanyi, Lisa-Maria Neudert, Philip N. Howard

    Abstract: What kinds of social media users read junk news? We examine the distribution of the most significant sources of junk news in the three months before President Donald Trump first State of the Union Address. Drawing on a list of sources that consistently publish political news and information that is extremist, sensationalist, conspiratorial, masked commentary, fake news and other forms of junk news… ▽ More

    Submitted 4 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1802.03572

    Report number: Data Memo 2018.1

  20. arXiv:1802.03573  [pdf

    cs.SI

    Social Media, News and Political Information during the US Election: Was Polarizing Content Concentrated in Swing States?

    Authors: Philip N. Howard, Bence Kollanyi, Samantha Bradshaw, Lisa-Maria Neudert

    Abstract: US voters shared large volumes of polarizing political news and information in the form of links to content from Russian, WikiLeaks and junk news sources. Was this low quality political information distributed evenly around the country, or concentrated in swing states and particular parts of the country? In this data memo we apply a tested dictionary of sources about political news and information… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

    Comments: Data Memo

  21. arXiv:1802.03572  [pdf

    cs.SI

    Junk News on Military Affairs and National Security: Social Media Disinformation Campaigns Against US Military Personnel and Veterans

    Authors: John D. Gallacher, Vlad Barash, Philip N. Howard, John Kelly

    Abstract: Social media provides political news and information for both active duty military personnel and veterans. We analyze the subgroups of Twitter and Facebook users who spend time consuming junk news from websites that target US military personnel and veterans with conspiracy theories, misinformation, and other forms of junk news about military affairs and national security issues. (1) Over Twitter w… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

    Comments: Data Memo

  22. arXiv:1710.07087  [pdf

    cs.SI

    Does Campaigning on Social Media Make a Difference? Evidence from candidate use of Twitter during the 2015 and 2017 UK Elections

    Authors: Jonathan Bright, Scott A Hale, Bharath Ganesh, Andrew Bulovsky, Helen Margetts, Phil Howard

    Abstract: Social media are now a routine part of political campaigns all over the world. However, studies of the impact of campaigning on social platform have thus far been limited to cross-sectional datasets from one election period which are vulnerable to unobserved variable bias. Hence empirical evidence on the effectiveness of political social media activity is thin. We address this deficit by analysing… ▽ More

    Submitted 27 July, 2018; v1 submitted 19 October, 2017; originally announced October 2017.

  23. arXiv:1710.03330  [pdf, other

    cs.SI

    Redes sociales, participación ciudadana y la hipótesis del slacktivismo: lecciones del caso de "El Bronco" / Social Media, Civic Engagement, and the Slacktivism Hypothesis: Lessons from Mexico's "El Bronco"

    Authors: Philip N. Howard, Saiph Savage, Claudia Flores-Saviaga, Carlos Toxtli, Andres Monroy-Hernández

    Abstract: El uso de las redes sociales tiene consecuencias positivas o negativas en la participación ciudadana? La gran parte de los intentos por responder a esta pregunta incluyen datos de la opinión pública de los Estados Unidos, por lo que nosotros ofrecemos un estudio sobre un caso significativo de México, donde un candidato independiente utilizó las redes sociales para comunicarse con el público y rehu… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

  24. arXiv:1606.06356  [pdf

    cs.SI physics.soc-ph

    Bots, #StrongerIn, and #Brexit: Computational Propaganda during the UK-EU Referendum

    Authors: Philip N. Howard, Bence Kollanyi

    Abstract: Bots are social media accounts that automate interaction with other users, and they are active on the StrongerIn-Brexit conversation happening over Twitter. These automated scripts generate content through these platforms and then interact with people. Political bots are automated accounts that are particularly active on public policy issues, elections, and political crises. In this preliminary st… ▽ More

    Submitted 20 June, 2016; originally announced June 2016.

    Comments: 6 pages, 1 figure, 2 tables

    Report number: 2016-1

  25. arXiv:1507.07109  [pdf

    cs.SI cs.CY physics.soc-ph

    Political Bots and the Manipulation of Public Opinion in Venezuela

    Authors: Michelle Forelle, Phil Howard, Andrés Monroy-Hernández, Saiph Savage

    Abstract: Social and political bots have a small but strategic role in Venezuelan political conversations. These automated scripts generate content through social media platforms and then interact with people. In this preliminary study on the use of political bots in Venezuela, we analyze the tweeting, following and retweeting patterns for the accounts of prominent Venezuelan politicians and prominent Venez… ▽ More

    Submitted 25 July, 2015; originally announced July 2015.

    Comments: 8 pages, 3 figures

    ACM Class: H.5.3