
Showing 1–4 of 4 results for author: Feher, T

Searching in archive cs.
  1. arXiv:2308.15136  [pdf, other]

    cs.DS cs.CV cs.DB cs.DC cs.IR

    CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs

    Authors: Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang

    Abstract: Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems. Data volumes have soared in recent years and the computational cost of an exhaustive exact nearest neighbor search is often prohibitive, necessitating the adop…

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2104.02443  [pdf]

    cs.SE cs.AI cs.CL cs.LG cs.PL

    CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

    Authors: Ahmed Elnaggar, Wei Ding, Llion Jones, Tom Gibbs, Tamas Feher, Christoph Angerer, Silvia Severini, Florian Matthes, Burkhard Rost

    Abstract: Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for understanding source code language to ease the software engineering process are under-researched. Simultaneously, the transformer model, especially its combination with tra…

    Submitted 12 May, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 28 pages, 6 tables and 1 figure

  3. arXiv:2007.06225  [pdf]

    cs.LG cs.CL cs.DC stat.ML

    ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing

    Authors: Ahmed Elnaggar, Michael Heinzinger, Christian Dallago, Ghalia Rihawi, Yu Wang, Llion Jones, Tom Gibbs, Tamas Feher, Christoph Angerer, Martin Steinegger, Debsindhu Bhowmik, Burkhard Rost

    Abstract: Computational biology and bioinformatics provide vast data gold-mines from protein sequences, ideal for Language Models taken from NLP. These LMs reach for new prediction frontiers at low inference costs. Here, we trained two auto-regressive models (Transformer-XL, XLNet) and four auto-encoder models (BERT, Albert, Electra, T5) on data from UniRef and BFD containing up to 393 billion amino acids.…

    Submitted 4 May, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 17 pages, 9 figures, 4 tables

  4. arXiv:1810.04413  [pdf]

    cs.PF

    Performance analysis and optimization of the JOREK code for many-core CPUs

    Authors: T. B. Fehér, M. Hölzl, G. Latu, G. T. A. Huijsmans

    Abstract: This report investigates the performance of the JOREK code on the Intel Knights Landing and Skylake processor architectures. The OpenMP scaling of the matrix construction part of the code was analyzed and improved synchronization methods were implemented. A new switch was implemented to control the number of threads used for the linear equation solver independently from other parts of the code. Th…

    Submitted 10 October, 2018; originally announced October 2018.