Skip to main content

Showing 1–18 of 18 results for author: Mozetic, I

.
  1. arXiv:2303.03953  [pdf, other

    cs.CL cs.LG

    ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification

    Authors: Taja Kuzman, Igor Mozetič, Nikola Ljubešić

    Abstract: ChatGPT has shown strong capabilities in natural language generation tasks, which naturally leads researchers to explore where its abilities end. In this paper, we examine whether ChatGPT can be used for zero-shot text classification, more specifically, automatic genre identification. We compare ChatGPT with a multilingual XLM-RoBERTa language model that was fine-tuned on datasets, manually annota… ▽ More

    Submitted 8 March, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  2. Retweet communities reveal the main sources of hate speech

    Authors: Bojan Evkoski, Andraz Pelicon, Igor Mozetic, Nikola Ljubesic, Petra Kralj Novak

    Abstract: We address a challenging problem of identifying main sources of hate speech on Twitter. On one hand, we carefully annotate a large set of tweets for hate speech, and deploy advanced deep learning to produce high quality hate speech classification models. On the other hand, we create retweet networks, detect communities and monitor their evolution through time. This combined approach is applied to… ▽ More

    Submitted 17 March, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Journal ref: B. Evkoski, A. Pelicon, I. Mozetič, N. Ljubešić, P. Kralj Novak. Retweet communities reveal the main sources of hate speech, PLoS ONE 17(3): e0265602, 2022

  3. arXiv:2105.14005  [pdf, other

    cs.SI cs.CY cs.LG

    Online Hate: Behavioural Dynamics and Relationship with Misinformation

    Authors: Matteo Cinelli, Andraž Pelicon, Igor Mozetič, Walter Quattrociocchi, Petra Kralj Novak, Fabiana Zollo

    Abstract: Online debates are often characterised by extreme polarisation and heated discussions among users. The presence of hate speech online is becoming increasingly problematic, making necessary the development of appropriate countermeasures. In this work, we perform hate speech detection on a corpus of more than one million comments on YouTube videos through a machine learning model fine-tuned on a lar… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  4. Community evolution in retweet networks

    Authors: Bojan Evkoski, Igor Mozetic, Nikola Ljubesic, Petra Kralj Novak

    Abstract: Communities in social networks often reflect close social ties between their members and their evolution through time. We propose an approach that tracks two aspects of community evolution in retweet networks: flow of the members in, out and between the communities, and their influence. We start with high resolution time windows, and then select several timepoints which exhibit large differences b… ▽ More

    Submitted 2 September, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Journal ref: PLoS ONE 16(9): e0256175, 2021

  5. arXiv:2005.07456  [pdf

    cs.CL cs.LG

    Cross-lingual Transfer of Sentiment Classifiers

    Authors: Marko Robnik-Sikonja, Kristjan Reba, Igor Mozetic

    Abstract: Word embeddings represent words in a numeric space so that semantic relations between words are represented as distances and directions in the vector space. Cross-lingual word embeddings transform vector spaces of different languages so that similar words are aligned. This is done by constructing a map** between vector spaces of two languages or learning a joint vector space for multiple languag… ▽ More

    Submitted 24 March, 2021; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: 18 pages, 8 tables

    MSC Class: 68T50 (Primary) ACM Class: I.2.7; J.4; K.4.2

  6. Evaluating time series forecasting models: An empirical study on performance estimation methods

    Authors: Vitor Cerqueira, Luis Torgo, Igor Mozetic

    Abstract: Performance estimation aims at estimating the loss that a predictive model will incur on unseen data. These procedures are part of the pipeline in every machine learning project and are used for assessing the overall generalisation ability of predictive models. In this paper we address the application of these methods to time series forecasting tasks. For independent and identically distributed da… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Journal ref: Machine Learning 109:1997-2028, 2020

  7. arXiv:1804.02233  [pdf, other

    cs.SI cs.CL cs.CY econ.TH

    Forex trading and Twitter: Spam, bots, and reputation manipulation

    Authors: Igor Mozetič, Peter Gabrovšek, Petra Kralj Novak

    Abstract: Currency trading (Forex) is the largest world market in terms of volume. We analyze trading and tweeting about the EUR-USD currency pair over a period of three years. First, a large number of tweets were manually labeled, and a Twitter stance classification model is constructed. The model then classifies all the tweets by the trading stance signal: buy, hold, or sell (EUR vs. USD). The Twitter sta… ▽ More

    Submitted 16 April, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: MIS2: Misinformation and Misbehavior Mining on the Web, Workshop at WSDM-18, Marina Del Rey, CA, USA, Feb. 9, 2018

  8. How to evaluate sentiment classifiers for Twitter time-ordered data?

    Authors: Igor Mozetič, Luis Torgo, Vitor Cerqueira, Jasmina Smailović

    Abstract: Social media are becoming an increasingly important source of information about the public mood regarding issues such as elections, Brexit, stock market, etc. In this paper we focus on sentiment classification of Twitter data. Construction of sentiment classifiers is a standard text mining task, but here we address the question of how to properly evaluate them as there is no settled way to do so.… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.

    Journal ref: PLoS ONE 13(3): e0194317, 2018

  9. Twitter Sentiment around the Earnings Announcement Events

    Authors: Peter Gabrovsek, Darko Aleksovski, Igor Mozetic, Miha Grcar

    Abstract: We investigate the relationship between social media, Twitter in particular, and stock market. We provide an in-depth analysis of the Twitter volume and sentiment about the 30 companies in the Dow Jones Industrial Average index, over a period of three years. We focus on Earnings Announcements and show that there is a considerable difference with respect to when the announcements are made: before t… ▽ More

    Submitted 9 January, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Journal ref: PLoS ONE 12(2): e0173151, 2017

  10. Cohesion and Coalition Formation in the European Parliament: Roll-Call Votes and Twitter Activities

    Authors: Darko Cherepnalkoski, Andreas Karpf, Igor Mozetic, Miha Grcar

    Abstract: We study the cohesion within and the coalitions between political groups in the Eighth European Parliament (2014--2019) by analyzing two entirely different aspects of the behavior of the Members of the European Parliament (MEPs) in the policy-making processes. On one hand, we analyze their co-voting patterns and, on the other, their retweeting behavior. We make use of two diverse datasets in the a… ▽ More

    Submitted 14 October, 2016; v1 submitted 17 August, 2016; originally announced August 2016.

    Journal ref: PLoS ONE 11(11): e0166586, 2016

  11. Multilingual Twitter Sentiment Classification: The Role of Human Annotators

    Authors: Igor Mozetic, Miha Grcar, Jasmina Smailovic

    Abstract: What are the limits of automated Twitter sentiment classification? We analyze a large set of manually labeled tweets in different languages, use them as training data, and construct automated classification models. It turns out that the quality of classification models depends much more on the quality and size of training data than on the type of the model trained. Experimental results indicate th… ▽ More

    Submitted 5 May, 2016; v1 submitted 24 February, 2016; originally announced February 2016.

    Journal ref: PLoS ONE 11(5): e0155036, 2016

  12. Sentiment of Emojis

    Authors: Petra Kralj Novak, Jasmina Smailović, Borut Sluban, Igor Mozetič

    Abstract: There is a new generation of emoticons, called emojis, that is increasingly being used in mobile communications and social media. In the past two years, over ten billion emojis were used on Twitter. Emojis are Unicode graphic symbols, used as a shorthand to express concepts and ideas. In contrast to the small number of well-known emoticons that carry clear emotional contents, there are hundreds of… ▽ More

    Submitted 8 December, 2015; v1 submitted 25 September, 2015; originally announced September 2015.

    Journal ref: PLoS ONE 10(12): e0144296, 2015

  13. arXiv:1508.00027  [pdf, other

    cs.IR

    Analysis of Financial News with NewsStream

    Authors: Petra Kralj Novak, Miha Grcar, Borut Sluban, Igor Mozetic

    Abstract: Unstructured data, such as news and blogs, can provide valuable insights into the financial world. We present the NewsStream portal, an intuitive and easy-to-use tool for news analytics, which supports interactive querying and visualizations of the documents at different levels of detail. It relies on a scalable architecture for real-time processing of a continuous stream of textual data, which in… ▽ More

    Submitted 7 November, 2015; v1 submitted 31 July, 2015; originally announced August 2015.

    Report number: IJS-DP-11965

  14. The Effects of Twitter Sentiment on Stock Price Returns

    Authors: Gabriele Ranco, Darko Aleksovski, Guido Caldarelli, Miha Grčar, Igor Mozetič

    Abstract: Social media are increasingly reflecting and influencing behavior of other complex systems. In this paper we investigate the relations between a well-know micro-blogging platform Twitter and financial markets. In particular, we consider, in a period of 15 months, the Twitter volume and sentiment about the 30 stock companies that form the Dow Jones Industrial Average (DJIA) index. We find a relativ… ▽ More

    Submitted 11 August, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

    Journal ref: PLoS ONE 10(9): e0138441 (2015)

  15. arXiv:1505.08001  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Emotional Dynamics in the Age of Misinformation

    Authors: Fabiana Zollo, Petra Kralj Novak, Michela Del Vicario, Alessandro Bessi, Igor Mozetic, Antonio Scala, Guido Caldarelli, Walter Quattrociocchi

    Abstract: According to the World Economic Forum, the diffusion of unsubstantiated rumors on online social media is one of the main threats for our society. The disintermediated paradigm of content production and consumption on online social media might foster the formation of homophile communities (echo-chambers) around specific worldviews. Such a scenario has been shown to be a vivid environment for the… ▽ More

    Submitted 29 May, 2015; originally announced May 2015.

    Journal ref: PLoS ONE, 10(9): e0138740 (2015)

  16. arXiv:1504.06861  [pdf, ps, other

    physics.soc-ph cs.SI physics.data-an

    Twitter-based analysis of the dynamics of collective attention to political parties

    Authors: Young-Ho Eom, Michelangelo Puliga, Jasmina Smailović, Igor Mozetič, Guido Caldarelli

    Abstract: Large-scale data from social media have a significant potential to describe complex phenomena in real world and to anticipate collective behaviors such as information spreading and social trends. One specific case of study is represented by the collective attention to the action of political parties. Not surprisingly, researchers and stakeholders tried to correlate parties' presence on social medi… ▽ More

    Submitted 14 July, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: 16 pages, 7 figures, 3 tables. Published in PLoS ONE

    Journal ref: PLoS ONE 10(7): e0131184 (2015)

  17. arXiv:1406.5323  [pdf, other

    physics.soc-ph cs.CY cs.SI

    Extraction of Temporal Networks from Term Co-occurrences in Online Textual Sources

    Authors: Marko Popović, Hrvoje Štefančić, Borut Sluban, Petra Kralj Novak, Miha Grčar, Igor Mozetič, Michelangelo Puliga, Vinko Zlatić

    Abstract: A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to esti… ▽ More

    Submitted 20 June, 2014; originally announced June 2014.

    Comments: 27 pages, 12 figures

    Journal ref: PLoS ONE 9(12): e99515 (2014)

  18. arXiv:1402.3483  [pdf, other

    cs.SI physics.soc-ph q-fin.ST

    News Cohesiveness: an Indicator of Systemic Risk in Financial Markets

    Authors: Matija Piškorec, Nino Antulov-Fantulin, Petra Kralj Novak, Igor Mozetič, Miha Grčar, Irena Vodenska, Tomislav Šmuc

    Abstract: Motivated by recent financial crises significant research efforts have been put into studying contagion effects and herding behaviour in financial markets. Much less has been said about influence of financial news on financial markets. We propose a novel measure of collective behaviour in financial news on the Web, News Cohesiveness Index (NCI), and show that it can be used as a systemic risk indi… ▽ More

    Submitted 14 February, 2014; originally announced February 2014.

    Journal ref: Scientific Reports 4: 5038 (2014)