Skip to main content

Showing 1–17 of 17 results for author: Etzioni, O

.
  1. arXiv:2407.06174  [pdf, other

    cs.CV

    The Tug-of-War Between Deepfake Generation and Detection

    Authors: Hannah Lee, Changyeon Lee, Kevin Farhat, Lin Qiu, Steve Geluso, Aerin Kim, Oren Etzioni

    Abstract: Multimodal generative models are rapidly evolving, leading to a surge in the generation of realistic video and audio that offers exciting possibilities but also serious risks. Deepfake videos, which can convincingly impersonate individuals, have particularly garnered attention due to their potential misuse in spreading misinformation and creating fraudulent content. This survey paper examines the… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2406.00856  [pdf, other

    cs.CV cs.CR cs.LG

    DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

    Authors: Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni

    Abstract: A dramatic influx of diffusion-generated images has marked recent years, posing unique challenges to current detection technologies. While the task of identifying these images falls under binary classification, a seemingly straightforward category, the computational load is significant when employing the "reconstruction then compare" technique. This approach, known as DIRE (Diffusion Reconstructio… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 6 pages, 1 figure

  3. arXiv:2301.10140  [pdf, other

    cs.DL cs.CL

    The Semantic Scholar Open Data Platform

    Authors: Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin , et al. (23 additional authors not shown)

    Abstract: The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by hel** scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF conte… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures

  4. arXiv:2211.06318  [pdf

    cs.CY cs.AI cs.LG

    Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence

    Authors: Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram Kalyanakrishnan, Ece Kamar, Sarit Kraus, Kevin Leyton-Brown, David Parkes, William Press, AnnaLee Saxenian, Julie Shah, Milind Tambe, Astro Teller

    Abstract: In September 2016, Stanford's "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the first report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society. It was written by a panel of 17 study authors, each of whom is deeply rooted in AI research, chaired by Peter Stone of the University of Texas at Austin. The report, entitled… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 52 pages, https://ai100.stanford.edu/2016-report

  5. arXiv:2205.02007  [pdf, other

    cs.CL cs.CY cs.HC cs.IR

    A Computational Inflection for Scientific Discovery

    Authors: Tom Hope, Doug Downey, Oren Etzioni, Daniel S. Weld, Eric Horvitz

    Abstract: We stand at the foot of a significant inflection in the trajectory of scientific discovery. As society continues on its fast-paced digital transformation, so does humankind's collective scientific knowledge and discourse. We now read and write papers in digitized form, and a great deal of the formal and informal processes of science are captured digitally -- including papers, preprints and books,… ▽ More

    Submitted 24 May, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to CACM

  6. arXiv:2112.00800  [pdf, other

    cs.CL cs.AI

    Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

    Authors: Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni, Ali Farhadi

    Abstract: Communicating with humans is challenging for AIs because it requires a shared understanding of the world, complex semantics (e.g., metaphors or analogies), and at times multi-modal gestures (e.g., pointing with a finger, or an arrow in a diagram). We investigate these challenges in the context of Iconary, a collaborative game of drawing and guessing based on Pictionary, that poses a novel challeng… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: In EMNLP 2021

  7. arXiv:2110.07574  [pdf, other

    cs.CL

    Can Machines Learn Morality? The Delphi Experiment

    Authors: Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jenny Liang, Jesse Dodge, Keisuke Sakaguchi, Maxwell Forbes, Jon Borchardt, Saadia Gabriel, Yulia Tsvetkov, Oren Etzioni, Maarten Sap, Regina Rini, Ye** Choi

    Abstract: As AI systems become increasingly powerful and pervasive, there are growing concerns about machines' morality or a lack thereof. Yet, teaching morality to machines is a formidable task, as morality remains among the most intensely debated questions in humanity, let alone for AI. Existing AI systems deployed to millions of users, however, are already making decisions loaded with moral implications,… ▽ More

    Submitted 12 July, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  8. arXiv:2004.10706  [pdf, other

    cs.DL cs.CL

    CORD-19: The COVID-19 Open Research Dataset

    Authors: Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Doug Burdick, Darrin Eide, Kathryn Funk, Yannis Katsis, Rodney Kinney, Yunyao Li, Ziyang Liu, William Merrill, Paul Mooney, Dewey Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, Brandon Stilson, Alex Wade, Kuansan Wang, Nancy Xin Ru Wang, Chris Wilhelm, Boya Xie, Douglas Raymond , et al. (3 additional authors not shown)

    Abstract: The COVID-19 Open Research Dataset (CORD-19) is a growing resource of scientific papers on COVID-19 and related historical coronavirus research. CORD-19 is designed to facilitate the development of text mining and information retrieval systems over its rich collection of metadata and structured full text papers. Since its release, CORD-19 has been downloaded over 200K times and has served as the b… ▽ More

    Submitted 10 July, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: ACL NLP-COVID Workshop 2020

  9. arXiv:1909.01958  [pdf, other

    cs.CL cs.AI

    From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

    Authors: Peter Clark, Oren Etzioni, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Niket Tandon, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz

    Abstract: AI has achieved remarkable mastery over games such as Chess, Go, and Poker, and even Jeopardy, but the rich variety of standardized exams has remained a landmark challenge. Even in 2016, the best AI system achieved merely 59.3% on an 8th Grade science exam challenge. This paper reports unprecedented success on the Grade 8 New York Regents Science Exam, where for the first time a system scores more… ▽ More

    Submitted 1 February, 2021; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: AI Magazine 41 (4) Winter 2020. New analysis sections added

  10. arXiv:1907.10597  [pdf, other

    cs.CY cs.CL cs.CV cs.LG stat.ME

    Green AI

    Authors: Roy Schwartz, Jesse Dodge, Noah A. Smith, Oren Etzioni

    Abstract: The computations required for deep learning research have been doubling every few months, resulting in an estimated 300,000x increase from 2012 to 2018 [2]. These computations have a surprisingly large carbon footprint [38]. Ironically, deep learning was inspired by the human brain, which is remarkably energy efficient. Moreover, the financial cost of the computations can make it difficult for aca… ▽ More

    Submitted 13 August, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: 12 pages

  11. arXiv:1906.07883  [pdf, other

    cs.DL cs.CY cs.SI

    Gender trends in computer science authorship

    Authors: Lucy Lu Wang, Gabriel Stanovsky, Luca Weihs, Oren Etzioni

    Abstract: A large-scale, up-to-date analysis of Computer Science literature (11.8M papers through 2019) reveals that, if trends from the last 50 years continue, parity between the number of male and female authors will not be reached in this century. In contrast, parity is projected to be reached within two to three decades or may have already been reached in other fields of study like Medicine or Sociology… ▽ More

    Submitted 28 January, 2021; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: 13 pages, 8 figures, 2 tables, 4 appendices; Communications of the ACM

  12. arXiv:1805.02262  [pdf, other

    cs.CL

    Construction of the Literature Graph in Semantic Scholar

    Authors: Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, Iz Beltagy, Miles Crawford, Doug Downey, Jason Dunkelberger, Ahmed Elgohary, Sergey Feldman, Vu Ha, Rodney Kinney, Sebastian Kohlmeier, Kyle Lo, Tyler Murray, Hsu-Han Ooi, Matthew Peters, Joanna Power, Sam Skjonsberg, Lucy Lu Wang, Chris Wilhelm, Zheng Yuan, Madeleine van Zuylen, Oren Etzioni

    Abstract: We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. The resulting literature graph consists of more than 280M nodes, representing papers, authors, entities and various interactions between them (e.g., authorships, citations, entity mentions). We reduce literature graph construction in… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: To appear in NAACL 2018 industry track

  13. arXiv:1803.05457  [pdf, other

    cs.AI cs.CL cs.IR

    Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

    Authors: Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord

    Abstract: We present a new question set, text corpus, and baselines assembled to encourage AI research in advanced question answering. Together, these constitute the AI2 Reasoning Challenge (ARC), which requires far more powerful knowledge and reasoning than previous challenges such as SQuAD or SNLI. The ARC question set is partitioned into a Challenge Set and an Easy Set, where the Challenge Set contains o… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.

    Comments: 10 pages, 7 tables, 2 figures

  14. arXiv:1604.06076  [pdf, other

    cs.AI cs.CL

    Question Answering via Integer Programming over Semi-Structured Knowledge

    Authors: Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Peter Clark, Oren Etzioni, Dan Roth

    Abstract: Answering science questions posed in natural language is an important AI challenge. Answering such questions often requires non-trivial inference and knowledge that goes beyond factoid retrieval. Yet, most systems for this task are based on relatively shallow Information Retrieval (IR) and statistical correlation techniques operating on large unstructured corpora. We propose a structured inference… ▽ More

    Submitted 20 April, 2016; originally announced April 2016.

    Comments: Extended version of the paper accepted to IJCAI'16

  15. arXiv:1604.04315  [pdf

    cs.AI

    Moving Beyond the Turing Test with the Allen AI Science Challenge

    Authors: Carissa Schoenick, Peter Clark, Oyvind Tafjord, Peter Turney, Oren Etzioni

    Abstract: Given recent successes in AI (e.g., AlphaGo's victory against Lee Sedol in the game of GO), it's become increasingly important to assess: how close are AI systems to human-level intelligence? This paper describes the Allen AI Science Challenge---an approach towards that goal which led to a unique Kaggle Competition, its results, the lessons learned, and our next steps.

    Submitted 22 February, 2017; v1 submitted 14 April, 2016; originally announced April 2016.

    Comments: 7 pages

  16. arXiv:1507.03045  [pdf, other

    cs.AI cs.CL

    Markov Logic Networks for Natural Language Question Answering

    Authors: Tushar Khot, Niranjan Balasubramanian, Eric Gribkoff, Ashish Sabharwal, Peter Clark, Oren Etzioni

    Abstract: Our goal is to answer elementary-level science questions using knowledge extracted automatically from science textbooks, expressed in a subset of first-order logic. Given the incomplete and noisy nature of these automatically extracted rules, Markov Logic Networks (MLNs) seem a natural model to use, but the exact way of leveraging MLNs is by no means obvious. We investigate three ways of applying… ▽ More

    Submitted 10 July, 2015; originally announced July 2015.

    Comments: 7 pages, 1 figure, StarAI workshop at UAI'15

  17. Unsupervised Methods for Determining Object and Relation Synonyms on the Web

    Authors: Alexander Pieter Yates, Oren Etzioni

    Abstract: The task of identifying synonymous relations and objects, or synonym resolution, is critical for high-quality information extraction. This paper investigates synonym resolution in the context of unsupervised information extraction, where neither hand-tagged training examples nor domain knowledge is available. The paper presents a scalable, fully-implemented system that runs in O(KN log N) time in… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 34, pages 255-296, 2009