Skip to main content

Showing 1–11 of 11 results for author: Alonso, O

Searching in archive cs. Search in all archives.
.
  1. A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

    Authors: Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease

    Abstract: Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A strategy to improve label quality is to ask multiple annotators to label the same item and aggregate their labels. Many aggregation models have been proposed for categorical or numerical annotation tasks, but far less work has considered m… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: Journal of Artificial Intelligence Research 2023, 78, 901-973

  2. Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks

    Authors: Alexander Braylan, Omar Alonso, Matthew Lease

    Abstract: When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels. Though many IAA measures exist for simple categorical and ordinal labeling tasks, relatively little work has considered more complex labeling tasks, such as structured, multi-object, and free-text annotations. Krippendorff's alpha, best known for… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  3. arXiv:1908.01868  [pdf, other

    cs.IR

    Local versus Global Strategies in Social Query Expansion

    Authors: Omar Alonso, Vasileios Kandylas, Serge-Eric Tremblay

    Abstract: Link sharing in social media can be seen as a collaboratively retrieved set of documents for a query or topic expressed by a hashtag. Temporal information plays an important role for identifying the correct context for which such annotations are valid for retrieval purposes. We investigate how social data as temporal context can be used for query expansion and compare global versus local strategie… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

  4. arXiv:1906.05986  [pdf, other

    cs.IR cs.SI

    Scalable Knowledge Graph Construction from Twitter

    Authors: Omar Alonso, Vasileios Kandylas, Serge-Eric Tremblay

    Abstract: We describe a knowledge graph derived from Twitter data with the goal of discovering relationships between people, links, and topics. The goal is to filter out noise from Twitter and surface an inside-out view that relies on high quality content. The generated graph contains many relationships where the user can query and traverse the structure from different angles allowing the development of new… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  5. arXiv:1612.03316  [pdf, other

    cs.IR

    Label Visualization and Exploration in IR

    Authors: Omar Alonso

    Abstract: There is a renaissance in visual analytics systems for data analysis and sharing, in particular, in the current wave of big data applications. We introduce RAVE, a prototype that automates the generation of an interface that uses facets and visualization techniques for exploring and analyzing relevance assessments data sets collected via crowdsourcing. We present a technical description of the mai… ▽ More

    Submitted 10 December, 2016; originally announced December 2016.

  6. arXiv:1411.0149  [pdf, other

    cs.AI cs.DS

    How Many Workers to Ask? Adaptive Exploration for Collecting High Quality Labels

    Authors: Ittai Abraham, Omar Alonso, Vasilis Kandylas, Rajesh Patel, Steven Shelford, Aleksandrs Slivkins

    Abstract: Crowdsourcing has been part of the IR toolbox as a cheap and fast mechanism to obtain labels for system development and evaluation. Successful deployment of crowdsourcing at scale involves adjusting many variables, a very important one being the number of workers needed per human intelligence task (HIT). We consider the crowdsourcing task of learning the answer to simple multiple-choice HITs, whic… ▽ More

    Submitted 19 May, 2016; v1 submitted 1 November, 2014; originally announced November 2014.

    Comments: SIGIR 2016

  7. arXiv:1410.2828  [pdf, other

    cs.HC

    A Study on Placement of Social Buttons in Web Pages

    Authors: Omar Alonso, Vasilis Kandylas

    Abstract: With the explosion of social media in the last few years, web pages nowadays include different social network buttons where users can express if they support or recommend content. Those social buttons are very visual and their presentations, along with the counters, mark the importance of the social network and the interest on the content. In this paper, we analyze the presence of four types of so… ▽ More

    Submitted 10 October, 2014; originally announced October 2014.

  8. arXiv:1407.6714  [pdf, other

    cs.SI

    CrowdSTAR: A Social Task Routing Framework for Online Communities

    Authors: Besmira Nushi, Omar Alonso, Martin Hentschel, Vasileios Kandylas

    Abstract: The online communities available on the Web have shown to be significantly interactive and capable of collectively solving difficult tasks. Nevertheless, it is still a challenge to decide how a task should be dispatched through the network due to the high diversity of the communities and the dynamically changing expertise and social availability of their members. We introduce CrowdSTAR, a framewor… ▽ More

    Submitted 24 July, 2014; originally announced July 2014.

    ACM Class: H.4.m; H.5.3

  9. arXiv:1307.3673  [pdf, other

    cs.LG cs.IR

    A Data Management Approach for Dataset Selection Using Human Computation

    Authors: Alexandros Ntoulas, Omar Alonso, Vasilis Kandylas

    Abstract: As the number of applications that use machine learning algorithms increases, the need for labeled data useful for training such algorithms intensifies. Getting labels typically involves employing humans to do the annotation, which directly translates to training and working costs. Crowdsourcing platforms have made labeling cheaper and faster, but they still involve significant costs, especially… ▽ More

    Submitted 13 July, 2013; originally announced July 2013.

  10. arXiv:1302.3268  [pdf, ps, other

    cs.LG

    Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem

    Authors: Ittai Abraham, Omar Alonso, Vasilis Kandylas, Aleksandrs Slivkins

    Abstract: Very recently crowdsourcing has become the de facto platform for distributing and collecting human computation for a wide range of tasks and applications such as information retrieval, natural language processing and machine learning. Current crowdsourcing platforms have some limitations in the area of quality control. Most of the effort to ensure good quality has to be done by the experimenter wh… ▽ More

    Submitted 20 May, 2013; v1 submitted 13 February, 2013; originally announced February 2013.

    Comments: Full version of a paper in COLT 2013

  11. arXiv:1212.1927  [pdf, other

    cs.SI

    User Taglines: Alternative Presentations of Expertise and Interest in Social Media

    Authors: Hemant Purohit, Alex Dow, Omar Alonso, Lei Duan, Kevin Haas

    Abstract: Web applications are increasingly showing recommended users from social media along with some descriptions, an attempt to show relevancy - why they are being shown. For example, Twitter search for a topical keyword shows expert twitterers on the side for 'whom to follow'. Google+ and Facebook also recommend users to follow or add to friend circle. Popular Internet newspaper- The Huffington Post sh… ▽ More

    Submitted 9 December, 2012; originally announced December 2012.

    Comments: First ASE International Conference on Social Informatics, Social-Informatics-2012

    ACM Class: H.5.3