Skip to main content

Showing 1–22 of 22 results for author: Wood, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.08810  [pdf, other

    quant-ph cs.ET

    Quantum computing with Qiskit

    Authors: Ali Javadi-Abhari, Matthew Treinish, Kevin Krsulich, Christopher J. Wood, Jake Lishman, Julien Gacon, Simon Martiel, Paul D. Nation, Lev S. Bishop, Andrew W. Cross, Blake R. Johnson, Jay M. Gambetta

    Abstract: We describe Qiskit, a software development kit for quantum information science. We discuss the key design decisions that have shaped its development, and examine the software architecture and its core components. We demonstrate an end-to-end workflow for solving a problem in condensed matter physics on a quantum computer that serves to highlight some of Qiskit's capabilities, for example the repre… ▽ More

    Submitted 18 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2401.05060  [pdf, other

    cs.SD cs.CL eess.AS

    MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

    Authors: Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alex Mourachko, Christophe Ropers, Carleigh Wood

    Abstract: Research in toxicity detection in natural language processing for the speech modality (audio-based) is quite limited, particularly for languages other than English. To address these limitations and lay the groundwork for truly multilingual audio-based toxicity detection, we introduce MuTox, the first highly multilingual audio-based dataset with toxicity labels. The dataset comprises 20,000 audio u… ▽ More

    Submitted 27 June, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    ACM Class: I.2.7

  3. arXiv:2312.15821  [pdf, other

    cs.SD cs.LG eess.AS

    Audiobox: Unified Audio Generation with Natural Language Prompts

    Authors: Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu

    Abstract: Audio is an essential part of our life, but creating it often requires expertise and is time-consuming. Research communities have made great progress over the past year advancing the performance of large scale audio generative models for a single modality (speech, sound, or music) through adopting more powerful generative models and scaling data. However, these models lack controllability in sever… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  4. arXiv:2312.05187  [pdf, other

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  5. arXiv:2308.11596  [pdf, other

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Cora Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim , et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s… ▽ More

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  6. arXiv:2308.03399  [pdf, ps, other

    quant-ph cs.DC

    Efficient techniques to GPU Accelerations of Multi-Shot Quantum Computing Simulations

    Authors: Jun Doi, Hiroshi Horii, Christopher Wood

    Abstract: Quantum computers are becoming practical for computing numerous applications. However, simulating quantum computing on classical computers is still demanding yet useful because current quantum computers are limited because of computer resources, hardware limits, instability, and noises. Improving quantum computing simulation performance in classical computers will contribute to the development of… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  7. arXiv:2305.13198  [pdf, other

    cs.CL

    Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale

    Authors: Marta R. Costa-jussà, Pierre Andrews, Eric Smith, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Daniel Licht, Carleigh Wood

    Abstract: We introduce a multilingual extension of the HOLISTICBIAS dataset, the largest English template-based taxonomy of textual people references: MULTILINGUALHOLISTICBIAS. This extension consists of 20,459 sentences in 50 languages distributed across all 13 demographic axes. Source sentences are built from combinations of 118 demographic descriptors and three patterns, excluding nonsensical combination… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

  8. arXiv:2203.07806  [pdf, other

    cs.CR

    You get PADDING, everybody gets PADDING! You get privacy? Evaluating practical QUIC website fingerprinting protections for the masses

    Authors: Sandra Siby, Ludovic Barman, Christopher Wood, Marwan Fayed, Nick Sullivan, Carmela Troncoso

    Abstract: Website fingerprinting (WF) is a well-know threat to users' web privacy. New internet standards, such as QUIC, include padding to support defenses against WF. Previous work only analyzes the effectiveness of defenses when users are behind a VPN. Yet, this is not how most users browse the Internet. In this paper, we provide a comprehensive evaluation of QUIC-padding-based defenses against WF when u… ▽ More

    Submitted 15 December, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  9. arXiv:2109.14490  [pdf

    cs.CR

    Might I Get Pwned: A Second Generation Compromised Credential Checking Service

    Authors: Bijeeta Pal, Mazharul Islam, Marina Sanusi, Nick Sullivan, Luke Valenta, Tara Whalen, Christopher Wood, Thomas Ristenpart, Rahul Chattejee

    Abstract: Credential stuffing attacks use stolen passwords to log into victim accounts. To defend against these attacks, recently deployed compromised credential checking (C3) services provide APIs that help users and companies check whether a username, password pair is exposed. These services however only check if the exact password is leaked, and therefore do not mitigate credential tweaking attacks - att… ▽ More

    Submitted 18 March, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

  10. arXiv:2109.11576  [pdf, other

    cs.LG physics.comp-ph

    Efficient, Interpretable Graph Neural Network Representation for Angle-dependent Properties and its Application to Optical Spectroscopy

    Authors: Tim Hsu, Tuan Anh Pham, Nathan Keilbart, Stephen Weitzner, James Chapman, Penghao Xiao, S. Roger Qiu, Xiao Chen, Brandon C. Wood

    Abstract: Graph neural networks are attractive for learning properties of atomic structures thanks to their intuitive graph encoding of atoms and bonds. However, conventional encoding does not include angular information, which is critical for describing atomic arrangements in disordered systems. In this work, we extend the recently proposed ALIGNN encoding, which incorporates bond angles, to also include d… ▽ More

    Submitted 15 February, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

  11. arXiv:2109.07925  [pdf, other

    q-bio.BM cs.LG

    PDBench: Evaluating Computational Methods for Protein Sequence Design

    Authors: Leonardo V. Castorina, Rokas Petrenas, Kartic Subr, Christopher W. Wood

    Abstract: Proteins perform critical processes in all living systems: converting solar energy into chemical energy, replicating DNA, as the basis of highly performant materials, sensing and much more. While an incredible range of functionality has been sampled in nature, it accounts for a tiny fraction of the possible protein universe. If we could tap into this pool of unexplored protein structures, we could… ▽ More

    Submitted 28 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: 9 pages, 5 figures

  12. arXiv:2108.10550  [pdf

    eess.IV cs.CV cs.LG q-bio.QM

    A generative adversarial approach to facilitate archival-quality histopathologic diagnoses from frozen tissue sections

    Authors: Kianoush Falahkheirkhah, Tao Guo, Michael Hwang, Pheroze Tamboli, Christopher G Wood, Jose A Karam, Kanishka Sircar, Rohit Bhargava

    Abstract: In clinical diagnostics and research involving histopathology, formalin fixed paraffin embedded (FFPE) tissue is almost universally favored for its superb image quality. However, tissue processing time (more than 24 hours) can slow decision-making. In contrast, fresh frozen (FF) processing (less than 1 hour) can yield rapid information but diagnostic accuracy is suboptimal due to lack of clearing,… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: 24 pages, 6 figures, and 3 tables

  13. arXiv:2011.10121  [pdf, other

    cs.CR cs.NI

    Oblivious DNS over HTTPS (ODoH): A Practical Privacy Enhancement to DNS

    Authors: Sudheesh Singanamalla, Suphanat Chunhapanya, Marek Vavruša, Tanya Verma, Peter Wu, Marwan Fayed, Kurtis Heimerl, Nick Sullivan, Christopher Wood

    Abstract: The Domain Name System (DNS) is the foundation of a human-usable Internet, responding to client queries for host-names with corresponding IP addresses and records. Traditional DNS is also unencrypted, and leaks user information to network operators. Recent efforts to secure DNS using DNS over TLS (DoT) and DNS over HTTPS (DoH) have been gaining traction, ostensibly protecting traffic and hiding co… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 16 pages, 7 figures, Under submission and Presented at IETF 109 MAPRG

  14. Turning Machines: a simple algorithmic model for molecular robotics

    Authors: Irina Kostitsyna, Cai Wood, Damien Woods

    Abstract: Molecular robotics is challenging, so it seems best to keep it simple. We consider an abstract molecular robotics model based on simple folding instructions that execute asynchronously. Turning Machines are a simple 1D to 2D folding model, also easily generalisable to 2D to 3D folding. A Turning Machine starts out as a line of connected monomers in the discrete plane, each with an associated turni… ▽ More

    Submitted 24 January, 2022; v1 submitted 1 September, 2020; originally announced September 2020.

    ACM Class: F.1.1; F.2.2; I.3.5

    Journal ref: Earlier version published in the Proceedings of The 26th International Conference on DNA Computing and Molecular Programming. 2020. LIPIcs vol 174, pages 11:1--21

  15. arXiv:1810.05726  [pdf, other

    cs.CV cs.LG stat.ML

    DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning

    Authors: Alex Olsen, Dmitry A. Konovalov, Bronson Philippa, Peter Ridd, Jake C. Wood, Jamie Johns, Wesley Banks, Benjamin Girgenti, Owen Kenny, James Whinney, Brendan Calvert, Mostafa Rahimi Azghadi, Ronald D. White

    Abstract: Robotic weed control has seen increased research of late with its potential for boosting productivity in agriculture. Majority of works focus on develo** robotics for croplands, ignoring the weed management problems facing rangeland stock farmers. Perhaps the greatest obstacle to widespread uptake of robotic weed control is the robust classification of weed species in their natural environment.… ▽ More

    Submitted 14 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: 14 pages, 8 figures, 4 tables

    Journal ref: Sci.Rep. 9, 2058 (2019)

  16. arXiv:1809.03452  [pdf, other

    quant-ph cs.ET

    Qiskit Backend Specifications for OpenQASM and OpenPulse Experiments

    Authors: David C. McKay, Thomas Alexander, Luciano Bello, Michael J. Biercuk, Lev Bishop, Jiayin Chen, Jerry M. Chow, Antonio D. Córcoles, Daniel Egger, Stefan Filipp, Juan Gomez, Michael Hush, Ali Javadi-Abhari, Diego Moreda, Paul Nation, Brent Paulovicks, Erick Winston, Christopher J. Wood, James Wootton, Jay M. Gambetta

    Abstract: As interest in quantum computing grows, there is a pressing need for standardized API's so that algorithm designers, circuit designers, and physicists can be provided a common reference frame for designing, executing, and optimizing experiments. There is also a need for a language specification that goes beyond gates and allows users to specify the time dynamics of a quantum experiment and recover… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: 68 pages. More information and schemas can be found in the Qiskit repository https://github.com/Qiskit/

  17. arXiv:1706.07165  [pdf, other

    cs.NI

    Content-Centric Networking - Architectural Overview and Protocol Description

    Authors: Marc Mosko, Ignacio Solis, Christopher A. Wood

    Abstract: This document describes the core concepts of the CCNx architecture and presents a minimum network protocol based on two messages: Interests and Content Objects. It specifies the set of mandatory and optional fields within those messages and describes their behavior and interpretation. This architecture and protocol specification is independent of a specific wire encoding.

    Submitted 22 June, 2017; originally announced June 2017.

  18. arXiv:1512.07755  [pdf, other

    cs.NI

    Living in a PIT-less World: A Case Against Stateful Forwarding in Content-Centric Networking

    Authors: Cesar Ghali, Gene Tsudik, Ersin Uzun, Christopher A. Wood

    Abstract: Information-Centric Networking (ICN) is a recent paradigm that claims to mitigate some limitations of the current IP-based Internet architecture. The centerpiece of ICN is named and addressable content, rather than hosts or interfaces. Content-Centric Networking (CCN) is a prominent ICN instance that shares the fundamental architectural design with its equally popular academic sibling Named-Data N… ▽ More

    Submitted 24 December, 2015; originally announced December 2015.

    Comments: 10 pages, 6 figures

  19. arXiv:1512.07311  [pdf, other

    cs.NI

    BEAD: Best Effort Autonomous Deletion in Content-Centric Networking

    Authors: Cesar Ghali, Gene Tsudik, Christopher A. Wood

    Abstract: A core feature of Content-Centric Networking (CCN) is opportunistic content caching in routers. It enables routers to satisfy content requests with in-network cached copies, thereby reducing bandwidth utilization, decreasing congestion, and improving overall content retrieval latency. One major drawback of in-network caching is that content producers have no knowledge about where their content i… ▽ More

    Submitted 22 December, 2015; originally announced December 2015.

    Comments: 9 pages, 4 figures

  20. arXiv:1510.01852  [pdf, other

    cs.NI

    Practical Accounting in Content-Centric Networking (extended version)

    Authors: Cesar Ghali, Gene Tsudik, Christopher A. Wood, Edmund Yeh

    Abstract: Content-Centric Networking (CCN) is a new class of network architectures designed to address some key limitations of the current IP-based Internet. One of its main features is in-network content caching, which allows requests for content to be served by routers. Despite improved bandwidth utilization and lower latency for popular content retrieval, in-network content caching offers producers no me… ▽ More

    Submitted 7 October, 2015; originally announced October 2015.

    Comments: 13 pages, 6 figures

  21. Interest-Based Access Control for Content Centric Networks (extended version)

    Authors: Cesar Ghali, Marc A. Schlosberg, Gene Tsudik, Christopher A. Wood

    Abstract: Content-Centric Networking (CCN) is an emerging network architecture designed to overcome limitations of the current IP-based Internet. One of the fundamental tenets of CCN is that data, or content, is a named and addressable entity in the network. Consumers request content by issuing interest messages with the desired content name. These interests are forwarded by routers to producers, and the re… ▽ More

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: 11 pages, 2 figures

  22. arXiv:1405.2861  [pdf, other

    cs.NI cs.CR

    Secure Fragmentation for Content-Centric Networks (extended version)

    Authors: Cesar Ghali, Ashok Narayanan, David Oran, Gene Tsudik, Christopher A. Wood

    Abstract: Content-Centric Networking (CCN) is a communication paradigm that emphasizes content distribution. Named-Data Networking (NDN) is an instantiation of CCN, a candidate Future Internet Architecture. NDN supports human-readable content naming and router-based content caching which lends itself to efficient, secure, and scalable content distribution. Because of NDN's fundamental requirement that each… ▽ More

    Submitted 19 August, 2015; v1 submitted 12 May, 2014; originally announced May 2014.

    Comments: 13 pages, 6 figures