Skip to main content

Showing 1–4 of 4 results for author: de León, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.02931  [pdf

    cs.CY

    Improving the quality of individual-level online information tracking: challenges of existing approaches and introduction of a new content- and long-tail sensitive academic solution

    Authors: Silke Adam, Mykola Makhortykh, Michaela Maier, Viktor Aigenseer, Aleksandra Urman, Teresa Gil Lopez, Clara Christner, Ernesto de León, Roberto Ulloa

    Abstract: This article evaluates the quality of data collection in individual-level desktop information tracking used in the social sciences and shows that the existing approaches face sampling issues, validity issues due to the lack of content-level data and their disregard of the variety of devices and long-tail consumption patterns as well as transparency and privacy issues. To overcome some of these pro… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 73 pages

  2. arXiv:2207.00489  [pdf

    cs.CL cs.CY

    Panning for gold: Lessons learned from the platform-agnostic automated detection of political content in textual data

    Authors: Mykola Makhortykh, Ernesto de León, Aleksandra Urman, Clara Christner, Maryna Sydorova, Silke Adam, Michaela Maier, Teresa Gil-Lopez

    Abstract: The growing availability of data about online information behaviour enables new possibilities for political communication research. However, the volume and variety of these data makes them difficult to analyse and prompts the need for develo** automated content approaches relying on a broad range of natural language processing techniques (e.g. machine learning- or neural network-based ones). In… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  3. arXiv:2011.05537  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    Differentially Private Synthetic Data: Applied Evaluations and Enhancements

    Authors: Lucas Rosenblatt, Xiaoyan Liu, Samira Pouyanfar, Eduardo de Leon, Anuj Desai, Joshua Allen

    Abstract: Machine learning practitioners frequently seek to leverage the most informative available data, without violating the data owner's privacy, when building predictive models. Differentially private data synthesis protects personal details from exposure, and allows for the training of differentially private machine learning models on privately generated datasets. But how can we effectively assess the… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Under Review

  4. arXiv:1804.04031  [pdf, other

    cs.DC cs.LG

    Flexible and Scalable Deep Learning with MMLSpark

    Authors: Mark Hamilton, Sudarshan Raghunathan, Akshaya Annavajhala, Danil Kirsanov, Eduardo de Leon, Eli Barzilay, Ilya Matiach, Joe Davison, Maureen Busch, Miruna Oprescu, Ratan Sur, Roope Astala, Tong Wen, ChangYoung Park

    Abstract: In this work we detail a novel open source library, called MMLSpark, that combines the flexible deep learning library Cognitive Toolkit, with the distributed computing framework Apache Spark. To achieve this, we have contributed Java Language bindings to the Cognitive Toolkit, and added several new components to the Spark ecosystem. In addition, we also integrate the popular image processing libra… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Journal ref: Proceedings of Machine Learning Research 82 (2017) 11-22, 4th International Conference on Predictive Applications and APIs