Search | arXiv e-print repository

Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks

Authors: Danae Sánchez Villegas, Daniel Preoţiuc-Pietro, Nikolaos Aletras

Abstract: Effectively leveraging multimodal information from social media posts is essential to various downstream tasks such as sentiment analysis, sarcasm detection or hate speech classification. Jointly modeling text and images is challenging because cross-modal semantics might be hidden or the relation between image and text is weak. However, prior work on multimodal classification of social media posts has not yet addressed these challenges. In this work, we present an extensive study on the effectiveness of using two auxiliary losses jointly with the main task during fine-tuning multimodal models. First, Image-Text Contrastive (ITC) is designed to minimize the distance between image-text representations within a post, thereby effectively bridging the gap between posts where the image plays an important role in conveying the post's meaning. Second, Image-Text Matching (ITM) enhances the model's ability to understand the semantic relationship between images and text, thus improving its capacity to handle ambiguous or loosely related modalities. We combine these objectives with five multimodal models across five diverse social media datasets, demonstrating consistent improvements of up to 2.6 points F1. Our comprehensive analysis shows the specific scenarios where each auxiliary task is most effective.

Submitted 3 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: Accepted at EACL 2024 Findings

arXiv:2309.03064 [pdf, other]

A Multimodal Analysis of Influencer Content on Twitter

Authors: Danae Sánchez Villegas, Catalina Goanta, Nikolaos Aletras

Abstract: Influencer marketing involves a wide range of strategies in which brands collaborate with popular content creators (i.e., influencers) to leverage their reach, trust, and impact on their audience to promote and endorse products or services. Because followers of influencers are more likely to buy a product after receiving an authentic product endorsement rather than an explicit direct product promotion, the line between personal opinions and commercial content promotion is frequently blurred. This makes automatic detection of regulatory compliance breaches related to influencer advertising (e.g., misleading advertising or hidden sponsorships) particularly difficult. In this work, we (1) introduce a new Twitter (now X) dataset consisting of 15,998 influencer posts mapped into commercial and non-commercial categories for assisting in the automatic detection of commercial influencer content; (2) experiment with an extensive set of predictive models that combine text and visual information showing that our proposed cross-attention approach outperforms state-of-the-art multimodal models; and (3) conduct a thorough analysis of strengths and limitations of our models. We show that multimodal modeling is useful for identifying commercial posts, reducing the amount of false positives, and capturing relevant context that aids in the discovery of undisclosed commercial posts.

Submitted 6 September, 2023; originally announced September 2023.

Comments: Accepted at AACL 2023

arXiv:2306.09830 [pdf, other]

Sheffield's Submission to the AmericasNLP Shared Task on Machine Translation into Indigenous Languages

Authors: Edward Gow-Smith, Danae Sánchez Villegas

Abstract: In this paper we describe the University of Sheffield's submission to the AmericasNLP 2023 Shared Task on Machine Translation into Indigenous Languages which comprises the translation from Spanish to eleven indigenous languages. Our approach consists of extending, training, and ensembling different variations of NLLB-200. We use data provided by the organizers and data from various other sources such as constitutions, handbooks, news articles, and backtranslations generated from monolingual data. On the dev set, our best submission outperforms the baseline by 11% average chrF across all languages, with substantial improvements particularly for Aymara, Guarani and Quechua. On the test set, we achieve the highest average chrF of all the submissions, we rank first in four of the eleven languages, and at least one of our submissions ranks in the top 3 for all languages.

Submitted 16 June, 2023; originally announced June 2023.

Comments: Best-performing submission overall to the AmericasNLP 2023 Shared Task. Code and models available here: https://github.com/edwardgowsmith/americasnlp-2023-sheffield

arXiv:2205.03313 [pdf, other]

Combining Humor and Sarcasm for Improving Political Parody Detection

Authors: Xiao Ao, Danae Sánchez Villegas, Daniel Preoţiuc-Pietro, Nikolaos Aletras

Abstract: Parody is a figurative device used for mimicking entities for comedic or critical purposes. Parody is intentionally humorous and often involves sarcasm. This paper explores jointly modelling these figurative tropes with the goal of improving performance of political parody detection in tweets. To this end, we present a multi-encoder model that combines three parallel encoders to enrich parody-specific representations with humor and sarcasm information. Experiments on a publicly available data set of political parody tweets demonstrate that our approach outperforms previous state-of-the-art methods.

Submitted 6 May, 2022; originally announced May 2022.

Comments: Accepted at NAACL 2022

arXiv:2109.00602 [pdf, other]

Point-of-Interest Type Prediction using Text and Images

Authors: Danae Sánchez Villegas, Nikolaos Aletras

Abstract: Point-of-interest (POI) type prediction is the task of inferring the type of a place from where a social media post was shared. Inferring a POI's type is useful for studies in computational social science including sociolinguistics, geosemiotics, and cultural geography, and has applications in geosocial networking technologies such as recommendation and visualization systems. Prior efforts in POI type prediction focus solely on text, without taking visual information into account. However in reality, the variety of modalities, as well as their semiotic relationships with one another, shape communication and interactions in social media. This paper presents a study on POI type prediction using multimodal information from text and images available at posting time. For that purpose, we enrich a currently available data set for POI type prediction with the images that accompany the text messages. Our proposed method extracts relevant information from each modality to effectively capture interactions between text and image achieving a macro F1 of 47.21 across eight categories significantly outperforming the state-of-the-art method for POI type prediction based on text-only methods. Finally, we provide a detailed analysis to shed light on cross-modal interactions and the limitations of our best performing model.

Submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted at EMNLP 2021

arXiv:2105.04047 [pdf, other]

Analyzing Online Political Advertisements

Authors: Danae Sánchez Villegas, Saeid Mokaram, Nikolaos Aletras

Abstract: Online political advertising is a central aspect of modern election campaigning for influencing public opinion. Computational analysis of political ads is of utmost importance in political science to understand the characteristics of digital campaigning. It is also important in computational linguistics to study features of political discourse and communication on a large scale. In this work, we present the first computational study on online political ads with the aim to (1) infer the political ideology of an ad sponsor; and (2) identify whether the sponsor is an official political party or a third-party organization. We develop two new large datasets for the two tasks consisting of ads from the U.S. Evaluation results show that our approach that combines textual and visual information from pre-trained neural models outperforms a state-of-the-art method for generic commercial ad classification. Finally, we provide an in-depth analysis of the limitations of our best-performing models and linguistic analysis to study the characteristics of political ads discourse.

Submitted 26 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

Comments: Accepted at ACL Findings 2021

arXiv:2009.14734 [pdf, other]

Point-of-Interest Type Inference from Social Media Text

Authors: Danae Sánchez Villegas, Daniel Preoţiuc-Pietro, Nikolaos Aletras

Abstract: Physical places help shape how we perceive the experiences we have there. For the first time, we study the relationship between social media text and the type of the place from where it was posted, whether a park, restaurant, or someplace else. To facilitate this, we introduce a novel data set of $\sim$200,000 English tweets published from 2,761 different points-of-interest in the U.S., enriched with place type information. We train classifiers to predict the type of the location a tweet was sent from that reach a macro F1 of 43.67 across eight classes and uncover the linguistic markers associated with each type of place. The ability to predict semantic place information from a tweet has applications in recommendation systems, personalization services and cultural geography.

Submitted 2 October, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: Accepted at AACL-IJCNLP 2020

arXiv:2004.13878 [pdf, other]

Analyzing Political Parody in Social Media

Authors: Antonis Maronikolakis, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras

Abstract: Parody is a figurative device used to imitate an entity for comedic or critical purposes and represents a widespread phenomenon in social media through many popular parody accounts. In this paper, we present the first computational study of parody. We introduce a new publicly available data set of tweets from real politicians and their corresponding parody accounts. We run a battery of supervised machine learning models for automatically detecting parody tweets with an emphasis on robustness by testing on tweets from accounts unseen in training, across different genders and across countries. Our results show that political parody tweets can be predicted with an accuracy up to 90%. Finally, we identify the markers of parody through a linguistic analysis. Beyond research in linguistics and political communication, accurately and automatically detecting parody is important to improving fact checking for journalists and analytics such as sentiment analysis through filtering out parodical utterances.

Submitted 1 May, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

arXiv:1803.08383 [pdf, other]

Head-up Displays (HUD) in driving

Authors: Marcos Maroto, Enrique Caño, Pavel González, Diego Villegas

Abstract: Head-Up Displays (HUDs) were originally designed to present the main sensor data at the pilot's usual viewpoint during aircraft missions, because placing instrument information in the forward field of view enhances the pilot's ability to use instrument and environmental information simultaneously. The first HUD in a civilian motor vehicle was a monochrome unit released in 1988 by General Motors as a technological improvement on the Head-Down Display (HDD) interface, which is commonly used in the automobile industry. The HUD reduces the number and duration of the driver's sight deviations from the road by projecting the required information directly into the driver's line of vision. There are many studies about ways of presenting the information: standard one-earpiece presentation, three-dimensional audio presentation, visual-only presentation, or audiovisual presentation. Results have shown that with a 3D auditory display, targets are acquired approximately 2.2 seconds faster than with one-earpiece presentation. Nevertheless, a disadvantage arises when the driver's attention unconsciously shifts away from the road and focuses on processing the information presented by the HUD. For this reason, the timing, the manner, and the channel used to present information on a HUD are important. One solution is a context-aware multimodal proactive recommender system that features personalized content combined with the use of car sensors to determine when the information should be presented.

Submitted 22 March, 2018; originally announced March 2018.

Comments: 7 pages

arXiv:1004.2883 [pdf, ps, other]

doi 10.1088/0004-637X/717/2/603

The ACS Fornax Cluster Survey. VIII. The Luminosity Function of Globular Clusters in Virgo and Fornax Early-Type Galaxies and its Use as a Distance Indicator

Authors: Daniela Villegas, Andres Jordan, Eric W. Peng, John P. Blakeslee, Patrick Cote, Laura Ferrarese, Markus Kissler-Patig, Simona Mei, Leopoldo Infante, John L. Tonry, Michael J. West

Abstract: We use a highly homogeneous set of data from 132 early-type galaxies in the Virgo and Fornax clusters in order to study the properties of the globular cluster luminosity function (GCLF). The globular cluster system of each galaxy was studied using a maximum likelihood approach to model the intrinsic GCLF after accounting for contamination and completeness effects. The results presented here update our Virgo measurements and confirm our previous results showing a tight correlation between the dispersion of the GCLF and the absolute magnitude of the parent galaxy. Regarding the use of the GCLF as a standard candle, we have found that the relative distance modulus between the Virgo and Fornax clusters is systematically lower than the one derived by other distance estimators, and in particular it is 0.22mag lower than the value derived from surface brightness fluctuation measurements performed on the same data. From numerical simulations aimed at reproducing the observed dispersion of the value of the turnover magnitude in each galaxy cluster we estimate an intrinsic dispersion on this parameter of 0.21mag and 0.15mag for Virgo and Fornax respectively. All in all, our study shows that the GCLF properties vary systematically with galaxy mass showing no evidence for a dichotomy between giant and dwarf early-type galaxies. These properties may be influenced by the cluster environment as suggested by cosmological simulations.

Submitted 16 April, 2010; originally announced April 2010.

Comments: 16 pages, 10 figures. Accepted for publication in ApJ. Full version of figure 3 available at: http://www.eso.org/~dvillega/gclf/gclf_all.pdf

arXiv:0712.0515 [pdf, ps, other]

doi 10.1088/0004-6256/135/2/467

Normal Globular Cluster Systems in Massive Low Surface Brightness Galaxies

Authors: Daniela Villegas, Markus Kissler-Patig, Andrés Jordán, Paul Goudfrooij, Martin Zwaan

Abstract: We present the results of a study of the globular cluster systems of 6 massive spiral galaxies, originally cataloged as low surface brightness galaxies but here shown to span a wide range of central surface brightness values, including two intermediate to low surface brightness galaxies. We used the Advanced Camera for Surveys on board HST to obtain photometry in the F475W and F775W bands and select sources with photometric and morphological properties consistent with those of globular clusters. A total of 206 candidates were identified in our target galaxies. From a direct comparison with the Galactic globular cluster system we derive specific frequency values for each galaxy that are in the expected range for late-type galaxies. We show that the globular cluster candidates in all galaxies have properties consistent with globular cluster systems of previously studied galaxies in terms of luminosity, sizes and color. We establish the presence of globular clusters in the two intermediate to low surface brightness galaxies in our sample and show that their properties do not have any significant deviation from the behavior observed in the other sample galaxies. Our results are broadly consistent with a scenario in which low surface brightness galaxies follow roughly the same evolutionary history as normal (i.e. high surface brightness) galaxies except at a much lower rate, but require the presence of an initial period of star formation intense enough to allow the formation of massive star clusters.

Submitted 4 December, 2007; originally announced December 2007.

Comments: 14 pages, 6 figures. AJ accepted

arXiv:astro-ph/0702496 [pdf, ps, other]

doi 10.1086/516840

The ACS Virgo Cluster Survey. XII. The Luminosity Function of Globular Clusters in Early Type Galaxies

Authors: Andres Jordan, Dean E. McLaughlin, Patrick Cote, Laura Ferrarese, Eric W. Peng, Simona Mei, Daniela Villegas, David Merritt, John L. Tonry, Michael J. West

Abstract: We analyze the luminosity function of the globular clusters (GCs) belonging to the early-type galaxies observed in the ACS Virgo Cluster Survey. We have obtained estimates for a Gaussian representation of the GC luminosity function (GCLF) for 89 galaxies. We have also fit the GCLFs with an "evolved Schechter function", which is meant to reflect the preferential depletion of low-mass GCs, primarily by evaporation due to two-body relaxation, from an initial Schechter mass function similar to that of young massive clusters. We find a significant trend of the GCLF dispersion with galaxy luminosity, in the sense that smaller galaxies have narrower GCLFs. We show that this narrowing of the GCLF in a Gaussian description is driven by a steepening of the GC mass function above the turnover mass, as one moves to smaller host galaxies. We argue that this behavior at the high-mass end of the GC mass function is most likely a consequence of systematic variations of the initial cluster mass function. The GCLF turnover mass M_TO is roughly constant, at ~ 2.2 x 10^5 M_sun in bright galaxies, but it decreases slightly in dwarfs with M_B >~ -18. We show that part of the variation could arise from the shorter dynamical friction timescales in smaller galaxies. We probe the variation of the GCLF to projected galactocentric radii of 20-35 kpc in the Virgo giants M49 and M87, finding that M_TO is essentially constant over these spatial scales.
Our fits of evolved Schechter functions imply average dynamical mass losses (Delta) over a Hubble time that fall in the range 2 x 10^5 <~ (Delta/M_sun) < 10^6 per GC. We agree with previous suggestions that if the full GCLF is to be understood in more detail GCLF models will have to include self-consistent treatments of dynamical evolution inside time-dependent galaxy potentials. (Abridged)

Submitted 19 February, 2007; originally announced February 2007.

Comments: 46 pages, 20 figures, 6 tables. Accepted for publication in ApJS. Also available at http://www1.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/community/ACSVCS/publications.html

arXiv:astro-ph/0609371 [pdf, ps, other]

doi 10.1086/509119

Trends in the Globular Cluster Luminosity Function of Early-Type Galaxies

Authors: Andres Jordan, Dean E. McLaughlin, Patrick Cote, Laura Ferrarese, Eric W. Peng, John P. Blakeslee, Simona Mei, Daniela Villegas, David Merritt, John L. Tonry, Michael J. West

Abstract: We present results from a study of the globular cluster luminosity function (GCLF) in a sample of 89 early-type galaxies observed as part of the ACS Virgo Cluster Survey. Using a Gaussian parametrization of the GCLF, we find a highly significant correlation between the GCLF dispersion, sigma, and the galaxy luminosity, M_B, in the sense that the GC systems in fainter galaxies have narrower luminosity functions. The GCLF dispersions in the Milky Way and M31 are fully consistent with this trend, implying that the correlation between sigma and galaxy luminosity is more fundamental than older suggestions that GCLF shape is a function of galaxy Hubble type. We show that the sigma - M_B relation results from a bona fide narrowing of the distribution of (logarithmic) cluster masses in fainter galaxies. We further show that this behavior is mirrored by a steepening of the GC mass function for relatively high masses, M >~ 3 x 10^5 M_sun, a mass regime in which the shape of the GCLF is not strongly affected by dynamical evolution over a Hubble time. We argue that this trend arises from variations in initial conditions and requires explanation by theories of cluster formation. Finally, we confirm that in bright galaxies, the GCLF "turns over" at the canonical mass scale of M_TO ~ 2 x 10^5 M_sun. However, we find that M_TO scatters to lower values (~1-2 x 10^5 M_sun) in galaxies fainter than M_B >~ -18.5, an important consideration if the GCLF is to be used as a distance indicator for dwarf ellipticals.

Submitted 13 September, 2006; originally announced September 2006.

Comments: 4 pages, 3 figures. Accepted for publication in ApJ Letters. Also available at http://www.cadc.hia.nrc.gc.ca/community/ACSVCS/publications.html

Journal ref: Astrophys.J.651:L25-L28,2006

arXiv:math/0206280 [pdf, ps, other]

A methodological exhibition of the theory of the identification of Lineal Dynamic systems

Authors: Rosina Hing, Gloria Nunez, Diosdado Villegas

Abstract: The theory of identification and realization of dynamic systems is a core aspect of modern control theory. It consists fundamentally of building a state model that reproduces a given input-output behavior, which is obtained experimentally in the case of identification or given in advance in the case of realization. For systems with multiple inputs and multiple outputs, this content is not generally treated in undergraduate courses. In this work it is demonstrated that the theory of identification and realization of linear dynamic systems can be taught in technical degree programs starting from the results of Mathematical Analysis and Linear Algebra that students receive in undergraduate studies, without the need to add new content to the programs of these subjects. We propose a new way of teaching the theory of identification and realization of linear dynamic systems based on intuition and the physical interpretation of the concepts.

Submitted 26 June, 2002; originally announced June 2002.

Comments: 13 pages, no figures

MSC Class: 97D40

Showing 1–14 of 14 results for author: Villegas, D