Skip to main content

Showing 1–16 of 16 results for author: Spitz, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.11252  [pdf, other

    cs.CL cs.AI cs.HC

    Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges

    Authors: Thilo Spinner, Rebecca Kehlbeck, Rita Sevastjanova, Tobias Stähle, Daniel A. Keim, Oliver Deussen, Andreas Spitz, Mennatallah El-Assady

    Abstract: The growing popularity of generative language models has amplified interest in interactive methods to guide model outputs. Prompt refinement is considered one of the most effective means to influence output among these methods. We identify several challenges associated with prompting large language models, categorized into data- and model-specific, linguistic, and socio-linguistic challenges. A co… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 9 pages paper, 2 pages references, 7 figures

    ACM Class: H.5.2; I.2.7

  2. arXiv:2309.02142  [pdf

    cs.CY

    Who are the users of ChatGPT? Implications for the digital divide from web tracking data

    Authors: Celina Kacperski, Roberto Ulloa, Denis Bonnay, Juhi Kulshrestha, Peter Selb, Andreas Spitz

    Abstract: A major challenge of our time is reducing disparities in access to and effective use of digital technologies, with recent discussions highlighting the role of AI in exacerbating the digital divide. We examine user characteristics that predict usage of the AI-powered conversational agent ChatGPT. We combine behavioral (web tracking) and survey data of N=1068 German citizens to investigate differenc… ▽ More

    Submitted 22 February, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

  3. arXiv:2211.08461  [pdf, other

    cs.CL cs.CY

    Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models

    Authors: Silke Husse, Andreas Spitz

    Abstract: The awareness and mitigation of biases are of fundamental importance for the fair and transparent use of contextual language models, yet they crucially depend on the accurate detection of biases as a precursor. Consequently, numerous bias detection methods have been proposed, which vary in their approach, the considered type of bias, and the data used for evaluation. However, while most detection… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  4. Collaborative and AI-aided Exam Question Generation using Wikidata in Education

    Authors: Philipp Scharpf, Moritz Schubotz, Andreas Spitz, Andre Greiner-Petter, Bela Gipp

    Abstract: Since the COVID-19 outbreak, the use of digital learning or education platforms has significantly increased. Teachers now digitally distribute homework and provide exercise questions. In both cases, teachers need to continuously develop novel and individual questions. This process can be very time-consuming and should be facilitated and accelerated both through exchange with other teachers and by… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    MSC Class: 68Uxx ACM Class: H.4

  5. arXiv:2210.15476  [pdf, other

    cs.CY

    Quotatives Indicate Decline in Objectivity in U.S. Political News

    Authors: Tiancheng Hu, Manoel Horta Ribeiro, Robert West, Andreas Spitz

    Abstract: According to journalistic standards, direct quotes should be attributed to sources with objective quotatives such as "said" and "told", as nonobjective quotatives, like "argued" and "insisted" would influence the readers' perception of the quote and the quoted person. In this paper, we analyze the adherence to this journalistic norm to study trends in objectivity in political news across U.S. outl… ▽ More

    Submitted 16 May, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: ICWSM 2023 Repo: https://github.com/epfl-dlab/quotative_bias

  6. arXiv:2207.08112  [pdf, other

    cs.CL

    United States Politicians' Tone Became More Negative with 2016 Primary Campaigns

    Authors: Jonathan Külz, Andreas Spitz, Ahmad Abu-Akel, Stephan Günnemann, Robert West

    Abstract: There is a widespread belief that the tone of US political language has become more negative recently, in particular when Donald Trump entered politics. At the same time, there is disagreement as to whether Trump changed or merely continued previous trends. To date, data-driven evidence regarding these questions is scarce, partly due to the difficulty of obtaining a comprehensive, longitudinal rec… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  7. arXiv:2207.03592  [pdf, other

    cs.IR cs.CL cs.DB

    Quote Erat Demonstrandum: A Web Interface for Exploring the Quotebank Corpus

    Authors: Vuk Vuković, Akhil Arora, Huan-Cheng Chang, Andreas Spitz, Robert West

    Abstract: The use of attributed quotes is the most direct and least filtered pathway of information propagation in news. Consequently, quotes play a central role in the conception, reception, and analysis of news stories. Since quotes provide a more direct window into a speaker's mind than regular reporting, they are a valuable resource for journalists and researchers alike. While substantial research effor… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: SIGIR 2022 (Demo), 5 pages, 2 figures

  8. arXiv:2207.02824  [pdf, other

    cs.CL cs.IR cs.LG

    Strong Heuristics for Named Entity Linking

    Authors: Marko Čuljak, Andreas Spitz, Robert West, Akhil Arora

    Abstract: Named entity linking (NEL) in news is a challenging endeavour due to the frequency of unseen and emerging entities, which necessitates the use of unsupervised or zero-shot methods. However, such methods tend to come with caveats, such as no integration of suitable knowledge bases (like Wikidata) for emerging entities, a lack of scalability, and poor interpretability. Here, we consider person disam… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: NAACL-SRW 2022

  9. arXiv:2106.02926  [pdf, ps, other

    cs.SI cs.AI cs.IT cs.LG cs.NE

    IM-META: Influence Maximization Using Node Metadata in Networks With Unknown Topology

    Authors: Cong Tran, Won-Yong Shin, Andreas Spitz

    Abstract: Since the structure of complex networks is often unknown, we may identify the most influential seed nodes by exploring only a part of the underlying network, given a small budget for node queries. We propose IM-META, a solution to influence maximization (IM) in networks with unknown topology by retrieving information from queries and node metadata. Since using such metadata is not without risk due… ▽ More

    Submitted 6 February, 2024; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: 14 pages, 11 figures, 4 tables, to appear in the IEEE Transactions on Network Science and Engineering (Please cite our journal version that will appear in an upcoming issue.)

  10. arXiv:1907.07381  [pdf, other

    cs.SI cs.LG cs.NE

    DeepNC: Deep Generative Network Completion

    Authors: Cong Tran, Won-Yong Shin, Andreas Spitz, Michael Gertz

    Abstract: Most network data are collected from partially observable networks with both missing nodes and missing edges, for example, due to limited resources and privacy settings specified by users on social media. Thus, it stands to reason that inferring the missing parts of the networks by performing network completion should precede downstream applications. However, despite this need, the recovery of mis… ▽ More

    Submitted 20 October, 2020; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: 16 pages, 10 figures, 5 tables; to appear in the IEEE Transactions on Pattern Analysis and Machine Intelligence (Please cite our journal version that will appear in an upcoming issue.)

  11. TopExNet: Entity-Centric Network Topic Exploration in News Streams

    Authors: Andreas Spitz, Satya Almasian, Michael Gertz

    Abstract: The recent introduction of entity-centric implicit network representations of unstructured text offers novel ways for exploring entity relations in document collections and streams efficiently and interactively. Here, we present TopExNet as a tool for exploring entity-centric network topics in streams of news articles. The application is available as a web service at https://topexnet.ifi.uni-heide… ▽ More

    Submitted 31 May, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: Published in Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, WSDM 2019, Melbourne, VIC, Australia, February 11-15, 2019

  12. Retrieving Multi-Entity Associations: An Evaluation of Combination Modes for Word Embeddings

    Authors: Gloria Feher, Andreas Spitz, Michael Gertz

    Abstract: Word embeddings have gained significant attention as learnable representations of semantic relations between words, and have been shown to improve upon the results of traditional word representations. However, little effort has been devoted to using embeddings for the retrieval of entity associations beyond pairwise relations. In this paper, we use popular embedding methods to train vector represe… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

    Comments: 4 pages; Accepted at SIGIR'19

    ACM Class: H.3.3

  13. Word Embeddings for Entity-annotated Texts

    Authors: Satya Almasian, Andreas Spitz, Michael Gertz

    Abstract: Learned vector representations of words are useful tools for many information retrieval and natural language processing tasks due to their ability to capture lexical semantics. However, while many such tasks involve or even rely on named entities as central components, popular word embedding models have so far failed to include entities as first-class citizens. While it seems intuitive that annota… ▽ More

    Submitted 12 February, 2020; v1 submitted 6 February, 2019; originally announced February 2019.

    Comments: This paper is accepted in 41st European Conference on Information Retrieval

  14. arXiv:1811.12114  [pdf, ps, other

    math.OC cs.CE

    A Mixed Integer Linear Programming Model for Multi-Satellite Scheduling

    Authors: Xiaoyu Chen, Gerhard Reinelt, Guangming Dai, Andreas Spitz

    Abstract: We address the multi-satellite scheduling problem with limited observation capacities that arises from the need to observe a set of targets on the Earth's surface using imaging resources installed on a set of satellites. We define and analyze the conflict indicators of all available visible time windows of missions, as well as the feasible time intervals of resources. The problem is then formulate… ▽ More

    Submitted 6 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

  15. arXiv:1801.00132  [pdf, other

    cs.SI cs.LG physics.soc-ph

    Community Detection in Partially Observable Social Networks

    Authors: Cong Tran, Won-Yong Shin, Andreas Spitz

    Abstract: The discovery of community structures in social networks has gained significant attention since it is a fundamental problem in understanding the networks' topology and functions. However, most social network data are collected from partially observable networks with both missing nodes and edges. In this paper, we address a new problem of detecting overlap** community structures in the context of… ▽ More

    Submitted 16 April, 2021; v1 submitted 30 December, 2017; originally announced January 2018.

    Comments: 24 pages, 8 figures, 5 tables; to appear in the ACM Transactions on Knowledge Discovery from Data (Please cite our journal version that will appear in an upcoming issue.)

  16. arXiv:1708.03569  [pdf, other

    cs.IR cs.CL

    Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding

    Authors: Erich Schubert, Andreas Spitz, Michael Weiler, Johanna Geiß, Michael Gertz

    Abstract: Many word clouds provide no semantics to the word placement, but use a random layout optimized solely for aesthetic purposes. We propose a novel approach to model word significance and word affinity within a document, and in comparison to a large background corpus. We demonstrate its usefulness for generating more meaningful word clouds as a visual summary of a given document. We then select keywo… ▽ More

    Submitted 11 August, 2017; originally announced August 2017.