Showing 1–17 of 17 results for author: Assent, I

  1. arXiv:2406.17753  [pdf, other]

    cs.CL cs.AI

    Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

    Authors: Amalie Brogaard Pauli, Isabelle Augenstein, Ira Assent

    Abstract: We are exposed to much information trying to influence us, such as teaser messages, debates, politically framed news, and propaganda - all of which use persuasive language. With the recent interest in Large Language Models (LLMs), we study the ability of LLMs to produce persuasive text. As opposed to prior work which focuses on particular domains or types of persuasion, we conduct a general study…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.08435  [pdf, other]

    cs.IR

    Wiki Entity Summarization Benchmark

    Authors: Saeedeh Javadi, Atefeh Moradan, Mohammad Sorkhpar, Klim Zaporojets, Davide Mottin, Ira Assent

    Abstract: Entity summarization aims to compute concise summaries for entities in knowledge graphs. Existing datasets and benchmarks are often limited to a few hundred entities and discard graph structure in source knowledge graphs. This limitation is particularly pronounced when it comes to ground-truth summaries, where there exist only a few labeled summaries for evaluation and training. We propose WikES,…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2311.18398  [pdf, other]

    cs.CV cs.LG physics.ao-ph

    RainAI -- Precipitation Nowcasting from Satellite Data

    Authors: Rafael Pablos Sarabia, Joachim Nyborg, Morten Birk, Ira Assent

    Abstract: This paper presents a solution to the Weather4Cast 2023 competition, where the goal is to forecast high-resolution precipitation with an 8-hour lead time using lower-resolution satellite radiance images. We propose a simple, yet effective method for spatiotemporal feature learning using a 2D U-Net model, that outperforms the official 3D U-Net baseline in both performance and efficiency. We place e…

    Submitted 30 November, 2023; originally announced November 2023.

  4. arXiv:2305.07320  [pdf, other]

    cs.LG

    ActUp: Analyzing and Consolidating tSNE and UMAP

    Authors: Andrew Draganov, Jakob Rødsgaard Jørgensen, Katrine Scheel Nellemann, Davide Mottin, Ira Assent, Tyrus Berry, Cigdem Aslay

    Abstract: tSNE and UMAP are popular dimensionality reduction algorithms due to their speed and interpretable low-dimensional embeddings. Despite their popularity, however, little work has been done to study their full span of differences. We theoretically and experimentally evaluate the space of parameters in both tSNE and UMAP and observe that a single one -- the normalization -- is responsible for switchi…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2206.09689

  5. arXiv:2206.09689  [pdf, other]

    cs.LG

    GiDR-DUN; Gradient Dimensionality Reduction -- Differences and Unification

    Authors: Andrew Draganov, Tyrus Berry, Jakob Rødsgaard Jørgensen, Katrine Scheel Nellemann, Ira Assent, Davide Mottin

    Abstract: TSNE and UMAP are two of the most popular dimensionality reduction algorithms due to their speed and interpretable low-dimensional embeddings. However, while attempts have been made to improve on TSNE's computational complexity, no existing method can obtain TSNE embeddings at the speed of UMAP. In this work, we show that this is indeed possible by combining the two approaches into a single method…

    Submitted 20 June, 2022; originally announced June 2022.

  6. arXiv:2203.09175  [pdf, other]

    cs.CV cs.LG

    Generalized Classification of Satellite Image Time Series with Thermal Positional Encoding

    Authors: Joachim Nyborg, Charlotte Pelletier, Ira Assent

    Abstract: Large-scale crop type classification is a task at the core of remote sensing efforts with applications of both economic and ecological importance. Current state-of-the-art deep learning methods are based on self-attention and use satellite image time series (SITS) to discriminate crop types based on their unique growth patterns. However, existing methods generalize poorly to regions not seen durin…

    Submitted 14 June, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: In proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops

  7. arXiv:2112.14822  [pdf, ps, other]

    cs.SI

    UCoDe: Unified Community Detection with Graph Convolutional Networks

    Authors: Atefeh Moradan, Andrew Draganov, Davide Mottin, Ira Assent

    Abstract: Community detection finds homogeneous groups of nodes in a graph. Existing approaches either partition the graph into disjoint, non-overlapping communities, or determine only overlapping communities. To date, no method supports both detections of overlapping and non-overlapping communities. We propose UCoDe, a unified method for community detection in attributed graphs that detects both overlappi…

    Submitted 2 November, 2023; v1 submitted 29 December, 2021; originally announced December 2021.

  8. arXiv:2111.11879  [pdf, other]

    cs.CV cs.LG

    Weakly-Supervised Cloud Detection with Fixed-Point GANs

    Authors: Joachim Nyborg, Ira Assent

    Abstract: The detection of clouds in satellite images is an essential preprocessing task for big data in remote sensing. Convolutional neural networks (CNNs) have greatly advanced the state-of-the-art in the detection of clouds in satellite images, but existing CNN-based methods are costly as they require large amounts of training images with expensive pixel-level cloud labels. To alleviate this cost, we pr…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: Accepted to the 3rd IEEE Workshop on Machine Learning for Big Data Analytics in Remote Sensing

  9. TimeMatch: Unsupervised Cross-Region Adaptation by Temporal Shift Estimation

    Authors: Joachim Nyborg, Charlotte Pelletier, Sébastien Lefèvre, Ira Assent

    Abstract: The recent developments of deep learning models that capture complex temporal patterns of crop phenology have greatly advanced crop classification from Satellite Image Time Series (SITS). However, when applied to target regions spatially different from the training region, these models perform poorly without any target labels due to the temporal shift of crop phenology between regions. Although va…

    Submitted 9 May, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 188, June 2022, Pages 301-313

  10. arXiv:2111.00177  [pdf, other]

    cs.LG

    On Quantitative Evaluations of Counterfactuals

    Authors: Frederik Hvilshøj, Alexandros Iosifidis, Ira Assent

    Abstract: As counterfactual examples become increasingly popular for explaining decisions of deep learning models, it is essential to understand what properties quantitative evaluation metrics do capture and equally important what they do not capture. Currently, such understanding is lacking, potentially slowing down scientific progress. In this paper, we consolidate the work on evaluating visual counterfac…

    Submitted 30 October, 2021; originally announced November 2021.

  11. TextBenDS: a generic Textual data Benchmark for Distributed Systems

    Authors: Ciprian-Octavian Truica, Elena Apostol, Jérôme Darmont, Ira Assent

    Abstract: Extracting top-k keywords and documents using weighting schemes are popular techniques employed in text mining and machine learning for different analysis and retrieval tasks. The weights are usually computed in the data preprocessing step, as they are costly to update and keep track of all the modifications performed on the dataset. Furthermore, computation errors are introduced when analyzing on…

    Submitted 12 August, 2021; originally announced August 2021.

    Journal ref: Information Systems Frontiers, Springer Verlag, 2021, Breakthroughs on Cross-Cutting Data Management, Data Analytics and Applied Data Science, 23, pp.81-100

  12. arXiv:2105.00687  [pdf, other]

    cs.LG

    Learning by Design: Structuring and Documenting the Human Choices in Machine Learning Development

    Authors: Simon Enni, Ira Assent

    Abstract: The influence of machine learning (ML) is quickly spreading, and a number of recent technological innovations have applied ML as a central technology. However, ML development still requires a substantial amount of human expertise to be successful. The deliberation and expert judgment applied during ML development cannot be revisited or scrutinized if not properly documented, and this hinders the f…

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: 13 pages, 1 figure

  13. arXiv:2103.13701  [pdf, other]

    cs.LG cs.CV

    ECINN: Efficient Counterfactuals from Invertible Neural Networks

    Authors: Frederik Hvilshøj, Alexandros Iosifidis, Ira Assent

    Abstract: Counterfactual examples identify how inputs can be altered to change the predicted class of a classifier, thus opening up the black-box nature of, e.g., deep neural networks. We propose a method, ECINN, that utilizes the generative capacities of invertible neural networks for image classification to generate counterfactual examples efficiently. In contrast to competing methods that sometimes need…

    Submitted 5 April, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  14. arXiv:2102.06282  [pdf, other]

    cs.CL cs.LG

    A reproduction of Apple's bi-directional LSTM models for language identification in short strings

    Authors: Mads Toftrup, Søren Asger Sørensen, Manuel R. Ciosici, Ira Assent

    Abstract: Language Identification is the task of identifying a document's language. For applications like automatic spell checker selection, language identification must use very short strings such as text message fragments. In this work, we reproduce a language identification architecture that Apple briefly sketched in a blog post. We confirm the bi-LSTM model's performance and find that it outperforms cur…

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Will be presented at EACL 2021 SRW

  15. arXiv:1912.01927  [pdf, other]

    cs.LG stat.ML

    Active Learning of SVDD Hyperparameter Values

    Authors: Holger Trittenbach, Klemens Böhm, Ira Assent

    Abstract: Support Vector Data Description is a popular method for outlier detection. However, its usefulness largely depends on selecting good hyperparameter values -- a difficult problem that has received significant attention in literature. Existing methods to estimate hyperparameter values are purely heuristic, and the conditions under which they work well are unclear. In this article, we propose LAMA (L…

    Submitted 4 December, 2019; originally announced December 2019.

  16. arXiv:1904.00929  [pdf, other]

    cs.CL

    Unsupervised Abbreviation Disambiguation: Contextual disambiguation using word embeddings

    Authors: Manuel Ciosici, Tobias Sommer, Ira Assent

    Abstract: Abbreviations often have several distinct meanings, often making their use in text ambiguous. Expanding them to their intended meaning in context is important for Machine Reading tasks such as document search, recommendation and question answering. Existing approaches mostly rely on manually labeled examples of abbreviations and their correct long-forms. Such data sets are costly to create and res…

    Submitted 22 May, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: Fixed author names; Revised text and experimental section

  17. arXiv:1507.08104  [pdf, other]

    cs.LG

    Learning Representations for Outlier Detection on a Budget

    Authors: Barbora Micenková, Brian McWilliams, Ira Assent

    Abstract: The problem of detecting a small number of outliers in a large dataset is an important task in many fields from fraud detection to high-energy physics. Two approaches have emerged to tackle this problem: unsupervised and supervised. Supervised approaches require a sufficient amount of labeled data and are challenged by novel types of outliers and inherent class imbalance, whereas unsupervised meth…

    Submitted 29 July, 2015; originally announced July 2015.