Skip to main content

Showing 1–2 of 2 results for author: Levchenko, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.02285  [pdf, other

    cs.CR

    Story Beyond the Eye: Glyph Positions Break PDF Text Redaction

    Authors: Maxwell Bland, Anushya Iyer, Kirill Levchenko

    Abstract: In this work we find that many current redactions of PDF text are insecure due to non-redacted character positioning information. In particular, subpixel-sized horizontal shifts in redacted and non-redacted characters can be recovered and used to effectively deredact first and last names. Unfortunately these findings affect redactions where the text underneath the black box is removed from the PDF… ▽ More

    Submitted 13 November, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  2. Identifying Products in Online Cybercrime Marketplaces: A Dataset for Fine-grained Domain Adaptation

    Authors: Greg Durrett, Jonathan K. Kummerfeld, Taylor Berg-Kirkpatrick, Rebecca S. Portnoff, Sadia Afroz, Damon McCoy, Kirill Levchenko, Vern Paxson

    Abstract: One weakness of machine-learned NLP models is that they typically perform poorly on out-of-domain data. In this work, we study the task of identifying products being bought and sold in online cybercrime forums, which exhibits particularly challenging cross-domain effects. We formulate a task that represents a hybrid of slot-filling information extraction and named entity recognition and annotate d… ▽ More

    Submitted 31 August, 2017; originally announced August 2017.

    Comments: To appear at EMNLP 2017

    ACM Class: I.2.7

    Journal ref: EMNLP (2017) 2598-2607