Showing 1–26 of 26 results for author: Durumeric, Z

  1. arXiv:2406.19569

    cs.NI

    On the Centralization and Regionalization of the Web

    Authors: Gautam Akiwate, Kimberly Ruth, Rumaisa Habib, Zakir Durumeric

    Abstract: Over the past decade, Internet centralization and its implications for both people and the resilience of the Internet has become a topic of active debate. While the networking community informally agrees on the definition of centralization, we lack a formal metric for quantifying centralization, which limits research beyond descriptive analysis. In this work, we introduce a statistical measure for…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.15585

    cs.CR cs.NI

    Ten Years of ZMap

    Authors: Zakir Durumeric, David Adrian, Phillip Stephens, Eric Wustrow, J. Alex Halderman

    Abstract: Since ZMap's debut in 2013, networking and security researchers have used the open-source scanner to write hundreds of research papers that study Internet behavior. In addition, ZMap powers much of the attack-surface management and security ratings industries, and more than a dozen security companies have built products on top of ZMap. Behind the scenes, much of ZMap's behavior - ranging from its…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 12 pages, 7 figures, in submission at Internet Measurement Conference 2024

  3. arXiv:2404.11763

    cs.SE cs.CR

    The Code the World Depends On: A First Look at Technology Makers' Open Source Software Dependencies

    Authors: Cadence Patrick, Kimberly Ruth, Zakir Durumeric

    Abstract: Open-source software (OSS) supply chain security has become a topic of concern for organizations. Patching an OSS vulnerability can require updating other dependent software products in addition to the original package. However, the landscape of OSS dependencies is not well explored: we do not know what packages are most critical to patch, hindering efforts to improve OSS security where it is most…

    Submitted 17 April, 2024; originally announced April 2024.

  4. arXiv:2402.06099

    cs.NI

    CATO: End-to-End Optimization of ML-Based Traffic Analysis Pipelines

    Authors: Gerry Wan, Shinan Liu, Francesco Bronzino, Nick Feamster, Zakir Durumeric

    Abstract: Machine learning has shown tremendous potential for improving the capabilities of network traffic analysis applications, often outperforming simpler rule-based heuristics. However, ML-based solutions remain difficult to deploy in practice. Many existing approaches only optimize the predictive performance of their models, overlooking the practical challenges of running them against network traffic…

    Submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2401.11032

    cs.CY cs.HC

    PressProtect: Helping Journalists Navigate Social Media in the Face of Online Harassment

    Authors: Catherine Han, Anne Li, Deepak Kumar, Zakir Durumeric

    Abstract: Social media has become a critical tool for journalists to disseminate their work, engage with their audience, and connect with sources. Unfortunately, journalists also regularly endure significant online harassment on social media platforms, ranging from personal attacks to doxxing to threats of physical harm. In this paper, we seek to understand how we can make social media usable for journalist…

    Submitted 19 January, 2024; originally announced January 2024.

  6. arXiv:2310.14450

    cs.CL cs.CY cs.LG

    TATA: Stance Detection via Topic-Agnostic and Topic-Aware Embeddings

    Authors: Hans W. A. Hanley, Zakir Durumeric

    Abstract: Stance detection is important for understanding different attitudes and beliefs on the Internet. However, given that a passage's stance toward a given topic is often highly dependent on that topic, building a stance detection model that generalizes to unseen topics is difficult. In this work, we propose using contrastive learning as well as an unlabeled dataset of news articles that cover a variet…

    Submitted 8 February, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023; Updated citations

  7. arXiv:2309.14517

    cs.HC cs.AI cs.CL cs.CR cs.SI

    Watch Your Language: Investigating Content Moderation with Large Language Models

    Authors: Deepak Kumar, Yousef AbuHashem, Zakir Durumeric

    Abstract: Large language models (LLMs) have exploded in popularity due to their ability to perform a wide array of natural language tasks. Text-based content moderation is one LLM use case that has received recent enthusiasm; however, there is little research investigating how LLMs perform in content moderation settings. In this work, we evaluate a suite of commodity LLMs on two common content moderation ta…

    Submitted 17 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  8. Stratosphere: Finding Vulnerable Cloud Storage Buckets

    Authors: Jack Cable, Drew Gregory, Liz Izhikevich, Zakir Durumeric

    Abstract: Misconfigured cloud storage buckets have leaked hundreds of millions of medical, voter, and customer records. These breaches are due to a combination of easily-guessable bucket names and error-prone security configurations, which, together, allow attackers to easily guess and access sensitive data. In this work, we investigate the security of buckets, finding that prior studies have largely undere…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Proceedings of the 24th International Symposium on Research in Attacks, Intrusions and Defenses. 2021

  9. ZDNS: A Fast DNS Toolkit for Internet Measurement

    Authors: Liz Izhikevich, Gautam Akiwate, Briana Berger, Spencer Drakontaidis, Anna Ascheman, Paul Pearce, David Adrian, Zakir Durumeric

    Abstract: Active DNS measurement is fundamental to understanding and improving the DNS ecosystem. However, the absence of an extensible, high-performance, and easy-to-use DNS toolkit has limited both the reproducibility and coverage of DNS research. In this paper, we introduce ZDNS, a modular and open-source active DNS measurement framework optimized for large-scale research studies of DNS on the public Int…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Proceedings of the 22nd ACM Internet Measurement Conference. 2022

  10. Cloud Watching: Understanding Attacks Against Cloud-Hosted Services

    Authors: Liz Izhikevich, Manda Tran, Michalis Kallitsis, Aurore Fass, Zakir Durumeric

    Abstract: Cloud computing has dramatically changed service deployment patterns. In this work, we analyze how attackers identify and target cloud services in contrast to traditional enterprise networks and network telescopes. Using a diverse set of cloud honeypots in 5 providers and 23 countries as well as 2 educational networks and 1 network telescope, we analyze how IP address assignment, geography, networ…

    Submitted 28 September, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Proceedings of the 2023 ACM Internet Measurement Conference (IMC '23), October 24–26, 2023, Montreal, QC, Canada

  11. arXiv:2308.02068

    cs.SI cs.CY cs.LG

    Specious Sites: Tracking the Spread and Sway of Spurious News Stories at Scale

    Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric

    Abstract: Misinformation, propaganda, and outright lies proliferate on the web, with some narratives having dangerous real-world consequences on public health, elections, and individual safety. However, despite the impact of misinformation, the research community largely lacks automated and programmatic approaches for tracking news narratives across online platforms. In this work, utilizing daily scrapes of…

    Submitted 2 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to IEEE S&P 2024. Updated Emails

  12. arXiv:2307.10349

    cs.SI cs.CY

    Twits, Toxic Tweets, and Tribal Tendencies: Trends in Politically Polarized Posts on Twitter

    Authors: Hans W. A. Hanley, Zakir Durumeric

    Abstract: Social media platforms are often blamed for exacerbating political polarization and worsening public dialogue. Many claim hyperpartisan users post pernicious content, slanted to their political views, inciting contentious and toxic conversations. However, what factors actually contribute to increased online toxicity and negative interactions? In this work, we explore the role that political ideol…

    Submitted 19 July, 2023; originally announced July 2023.

  13. arXiv:2306.07469

    cs.NI

    Democratizing LEO Satellite Network Measurement

    Authors: Liz Izhikevich, Manda Tran, Katherine Izhikevich, Gautam Akiwate, Zakir Durumeric

    Abstract: Low Earth Orbit (LEO) satellite networks are quickly gaining traction with promises of impressively low latency, high bandwidth, and global reach. However, the research community knows relatively little about their operation and performance in practice. The obscurity is largely due to the high barrier of entry for measuring LEO networks, which requires deploying specialized hardware or recruiting…

    Submitted 12 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Pre-Print

    Journal ref: ACM SIGMETRICS/IFIP Performance 2024

  14. arXiv:2305.09820

    cs.CY cs.LG cs.SI

    Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites

    Authors: Hans W. A. Hanley, Zakir Durumeric

    Abstract: As large language models (LLMs) like ChatGPT have gained traction, an increasing number of news websites have begun utilizing them to generate articles. However, not only can these language models produce factually inaccurate articles on reputable websites but disreputable news sites can utilize LLMs to mass produce misinformation. To begin to understand this phenomenon, we present one of the firs…

    Submitted 19 March, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to ICWSM 2024

  15. arXiv:2303.00895

    cs.NI cs.DC cs.LG

    Predicting IPv4 Services Across All Ports

    Authors: Liz Izhikevich, Renata Teixeira, Zakir Durumeric

    Abstract: Internet-wide scanning is commonly used to understand the topology and security of the Internet. However, IPv4 Internet scans have been limited to scanning only a subset of services -- exhaustively scanning all IPv4 services is too costly and no existing bandwidth-saving frameworks are designed to scan IPv4 addresses across all ports. In this work we introduce GPS, a system that efficiently discov…

    Submitted 1 March, 2023; originally announced March 2023.

    Journal ref: ACM SIGCOMM 2022 Conference (SIGCOMM '22), August 22–26, 2022, Amsterdam, Netherlands

  16. arXiv:2301.11486

    cs.SI cs.CY

    Sub-Standards and Mal-Practices: Misinformation's Role in Insular, Polarized, and Toxic Interactions

    Authors: Hans W. A. Hanley, Zakir Durumeric

    Abstract: How do users and communities respond to news from unreliable sources? How does news from these sources change online conversations? In this work, we examine the role of misinformation in sparking political incivility and toxicity on the social media platform Reddit. Utilizing the Google Jigsaw Perspective API to identify toxicity, hate speech, and other forms of incivility, we find that Reddit com…

    Submitted 19 July, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  17. arXiv:2301.10880

    cs.CY cs.SI

    A Golden Age: Conspiracy Theories' Relationship with Misinformation Outlets, News Media, and the Wider Internet

    Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric

    Abstract: Do we live in a "Golden Age of Conspiracy Theories?" In the last few decades, conspiracy theories have proliferated on the Internet with some having dangerous real-world consequences. A large contingent of those who participated in the January 6th attack on the US Capitol fervently believed in the QAnon conspiracy theory. In this work, we study the relationships amongst five prominent conspiracy t…

    Submitted 13 November, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted to CSCW 2023; CSCW version

  18. arXiv:2301.10856

    cs.CY cs.CL cs.LG cs.SI

    Partial Mobilization: Tracking Multilingual Information Flows Amongst Russian Media Outlets and Telegram

    Authors: Hans W. A. Hanley, Zakir Durumeric

    Abstract: In response to disinformation and propaganda from Russian online media following the invasion of Ukraine, Russian media outlets such as Russia Today and Sputnik News were banned throughout Europe. To maintain viewership, many of these Russian outlets began to heavily promote their content on messaging services like Telegram. In this work, we study how 16 Russian media outlets interacted with and u…

    Submitted 27 May, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted to ICWSM 2024 (ICWSM version)

  19. arXiv:2301.04841

    cs.CR cs.NI

    LZR: Identifying Unexpected Internet Services

    Authors: Liz Izhikevich, Renata Teixeira, Zakir Durumeric

    Abstract: Internet-wide scanning is a commonly used research technique that has helped uncover real-world attacks, find cryptographic weaknesses, and understand both operator and miscreant behavior. Studies that employ scanning have largely assumed that services are hosted on their IANA-assigned ports, overlooking the study of services on unusual ports. In this work, we investigate where Internet services a…

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: In 30th USENIX Security Symposium, 2021

  20. arXiv:2301.03946

    cs.CY cs.CR cs.HC

    Hate Raids on Twitch: Echoes of the Past, New Modalities, and Implications for Platform Governance

    Authors: Catherine Han, Joseph Seering, Deepak Kumar, Jeffrey T. Hancock, Zakir Durumeric

    Abstract: In the summer of 2021, users on the livestreaming platform Twitch were targeted by a wave of "hate raids," a form of attack that overwhelms a streamer's chatroom with hateful messages, often through the use of bots and automation. Using a mixed-methods approach, we combine a quantitative measurement of attacks across the platform with interviews of streamers and third-party bot developers. We pres…

    Submitted 12 January, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

  21. arXiv:2210.03016

    cs.CY cs.SI

    "A Special Operation": A Quantitative Approach to Dissecting and Comparing Different Media Ecosystems' Coverage of the Russo-Ukrainian War

    Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric

    Abstract: The coverage of the Russian invasion of Ukraine has varied widely between Western, Russian, and Chinese media ecosystems with propaganda, disinformation, and narrative spins present in all three. By utilizing the normalized pointwise mutual information metric, differential sentiment analysis, word2vec models, and partially labeled Dirichlet allocation, we present a quantitative analysis of the dif…

    Submitted 31 May, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted to ICWSM 2023

  22. arXiv:2209.02533

    cs.SI cs.CR cs.CY

    Understanding Longitudinal Behaviors of Toxic Accounts on Reddit

    Authors: Deepak Kumar, Jeff Hancock, Kurt Thomas, Zakir Durumeric

    Abstract: Toxic comments are the top form of hate and harassment experienced online. While many studies have investigated the types of toxic comments posted online, the effects that such content has on people, and the impact of potential defenses, no study has captured the long-term behaviors of the accounts that post toxic comments or how toxic comments are operationalized. In this paper, we present a long…

    Submitted 6 September, 2022; originally announced September 2022.

  23. arXiv:2205.14484

    cs.SI cs.CY cs.LG

    Happenstance: Utilizing Semantic Search to Track Russian State Media Narratives about the Russo-Ukrainian War On Reddit

    Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric

    Abstract: In the buildup to and in the weeks following the Russian Federation's invasion of Ukraine, Russian state media outlets output torrents of misleading and outright false information. In this work, we study this coordinated information campaign in order to understand the most prominent state media narratives touted by the Russian government to English-speaking audiences. To do this, we first perform…

    Submitted 30 May, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: Accepted to ICWSM 2023

  24. arXiv:2111.00703

    cs.CR

    An Empirical Analysis of HTTPS Configuration Security

    Authors: Camelia Simoiu, Wilson Nguyen, Zakir Durumeric

    Abstract: It is notoriously difficult to securely configure HTTPS, and poor server configurations have contributed to several attacks including the FREAK, Logjam, and POODLE attacks. In this work, we empirically evaluate the TLS security posture of popular websites and endeavor to understand the configuration decisions that operators make. We correlate several sources of influence on sites' security posture…

    Submitted 1 November, 2021; originally announced November 2021.

  25. arXiv:2106.15715

    cs.CY cs.SI

    No Calm in The Storm: Investigating QAnon Website Relationships

    Authors: Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric

    Abstract: QAnon is a far-right conspiracy theory whose followers largely organize online. In this work, we use web crawls seeded from two of the largest QAnon hotbeds on the Internet, Voat and 8kun, to build a QAnon-centered domain-based hyperlink graph. We use this graph to identify, understand, and learn about the set of websites that spread QAnon content online. Specifically, we curate the largest list o…

    Submitted 31 March, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

  26. arXiv:2106.04511

    cs.SI cs.CR cs.CY cs.HC

    Designing Toxic Content Classification for a Diversity of Perspectives

    Authors: Deepak Kumar, Patrick Gage Kelley, Sunny Consolvo, Joshua Mason, Elie Bursztein, Zakir Durumeric, Kurt Thomas, Michael Bailey

    Abstract: In this work, we demonstrate how existing classifiers for identifying toxic comments online fail to generalize to the diverse concerns of Internet users. We survey 17,280 participants to understand how user expectations for what constitutes toxic content differ across demographics, beliefs, and personal experiences. We find that groups historically at-risk of harassment - such as people who identi…

    Submitted 4 June, 2021; originally announced June 2021.