Showing 1–3 of 3 results for author: Kirchoff, K

Searching in archive cs.
  1. arXiv:2402.07970  [pdf, other]

    cs.IR cs.LG

    Utilizing Low-Dimensional Molecular Embeddings for Rapid Chemical Similarity Search

    Authors: Kathryn E. Kirchoff, James Wellnitz, Joshua E. Hochuli, Travis Maxfield, Konstantin I. Popov, Shawn Gomez, Alexander Tropsha

    Abstract: Nearest neighbor-based similarity searching is a common task in chemistry, with notable use cases in drug discovery. Yet, some of the most commonly used approaches for this task still leverage a brute-force approach. In practice this can be computationally costly and overly time-consuming, due in part to the sheer size of modern chemical databases. Previous computational advancements for this task…

    Submitted 12 February, 2024; originally announced February 2024.

  2. arXiv:2310.02744  [pdf, other]

    cs.LG

    SALSA: Semantically-Aware Latent Space Autoencoder

    Authors: Kathryn E. Kirchoff, Travis Maxfield, Alexander Tropsha, Shawn M. Gomez

    Abstract: In deep learning for drug discovery, chemical data are often represented as simplified molecular-input line-entry system (SMILES) sequences which allow for straightforward implementation of natural language processing methodologies, one being the sequence-to-sequence autoencoder. However, we observe that training an autoencoder solely on SMILES is insufficient to learn molecular representations th…

    Submitted 4 October, 2023; originally announced October 2023.

  3. arXiv:1210.4904  [pdf]

    cs.CE q-bio.QM

    Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra

    Authors: Ajit P. Singh, John Halloran, Jeff A. Bilmes, Katrin Kirchoff, William S. Noble

    Abstract: Shotgun proteomics is a high-throughput technology used to identify unknown proteins in a complex mixture. At the heart of this process is a prediction task, the spectrum identification problem, in which each fragmentation spectrum produced by a shotgun proteomics experiment must be mapped to the peptide (protein subsequence) which generated the spectrum. We propose a new algorithm for spectrum id…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-775-785