-
Virtual Screening on FPGA: Performance and Energy versus Effort
Authors:
Tom Vander Aa,
Tom Haber,
Thomas J. Ashby,
Roel Wuyts,
Wilfried Verachtert
Abstract:
With their widespread availability, FPGA-based accelerators cards have become an alternative to GPUs and CPUs to accelerate computing in applications with certain requirements (like energy efficiency) or properties (like fixed-point computations). In this paper we show results and experiences from map** an industrial application used for drug discovery on several types of accelerators. We especi…
▽ More
With their widespread availability, FPGA-based accelerators cards have become an alternative to GPUs and CPUs to accelerate computing in applications with certain requirements (like energy efficiency) or properties (like fixed-point computations). In this paper we show results and experiences from map** an industrial application used for drug discovery on several types of accelerators. We especially highlight the effort versus benefit of FPGAs compared to CPUs and GPUs in terms of performance and energy efficiency. For this application, even with extensive use of FPGA-specific features, and performing different optimizations, results on GPUs are still better, both in terms of energy and performance.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
A High-Performance Implementation of Bayesian Matrix Factorization with Limited Communication
Authors:
Tom Vander Aa,
Xiangju Qin,
Paul Blomstedt,
Roel Wuyts,
Wilfried Verachtert,
Samuel Kaski
Abstract:
Matrix factorization is a very common machine learning technique in recommender systems. Bayesian Matrix Factorization (BMF) algorithms would be attractive because of their ability to quantify uncertainty in their predictions and avoid over-fitting, combined with high prediction accuracy. However, they have not been widely used on large-scale data because of their prohibitive computational cost. I…
▽ More
Matrix factorization is a very common machine learning technique in recommender systems. Bayesian Matrix Factorization (BMF) algorithms would be attractive because of their ability to quantify uncertainty in their predictions and avoid over-fitting, combined with high prediction accuracy. However, they have not been widely used on large-scale data because of their prohibitive computational cost. In recent work, efforts have been made to reduce the cost, both by improving the scalability of the BMF algorithm as well as its implementation, but so far mainly separately. In this paper we show that the state-of-the-art of both approaches to scalability can be combined. We combine the recent highly-scalable Posterior Propagation algorithm for BMF, which parallelizes computation of blocks of the matrix, with a distributed BMF implementation that users asynchronous communication within each block. We show that the combination of the two methods gives substantial improvements in the scalability of BMF on web-scale datasets, when the goal is to reduce the wall-clock time.
△ Less
Submitted 14 April, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
SMURFF: a High-Performance Framework for Matrix Factorization
Authors:
Tom Vander Aa,
Imen Chakroun,
Thomas J. Ashby,
Jaak Simm,
Adam Arany,
Yves Moreau,
Thanh Le Van,
José Felipe Golib Dzib,
Jörg Wegner,
Vladimir Chupakhin,
Hugo Ceulemans,
Roel Wuyts,
Wilfried Verachtert
Abstract:
Bayesian Matrix Factorization (BMF) is a powerful technique for recommender systems because it produces good results and is relatively robust against overfitting. Yet BMF is more computationally intensive and thus more challenging to implement for large datasets. In this work we present SMURFF a high-performance feature-rich framework to compose and construct different Bayesian matrix-factorizatio…
▽ More
Bayesian Matrix Factorization (BMF) is a powerful technique for recommender systems because it produces good results and is relatively robust against overfitting. Yet BMF is more computationally intensive and thus more challenging to implement for large datasets. In this work we present SMURFF a high-performance feature-rich framework to compose and construct different Bayesian matrix-factorization methods. The framework has been successfully used in to do large scale runs of compound-activity prediction. SMURFF is available as open-source and can be used both on a supercomputer and on a desktop or laptop machine. Documentation and several examples are provided as Jupyter notebooks using SMURFF's high-level Python API.
△ Less
Submitted 29 July, 2019; v1 submitted 4 April, 2019;
originally announced April 2019.
-
Resilient Work Stealing
Authors:
Pascal Costanza,
Charlotte Herzeel,
Wolfgang De Meuter,
Roel Wuyts
Abstract:
Future generations of processors will exhibit an increase of faults over their lifetime, and it becomes increasingly expensive to solve the resulting reliability issues purely at the hardware level. We propose to model computations in terms of restartable task graphs in order to improve reliability at the software level. As a proof of concept, we present Cobra, a novel design for a shared-memory w…
▽ More
Future generations of processors will exhibit an increase of faults over their lifetime, and it becomes increasingly expensive to solve the resulting reliability issues purely at the hardware level. We propose to model computations in terms of restartable task graphs in order to improve reliability at the software level. As a proof of concept, we present Cobra, a novel design for a shared-memory work-stealing scheduler that realizes this notion of restartable task graphs, and enables computations to survive hardware failures due to soft errors. A comparison with the work-stealing scheduler of Threading Building Blocks on the PARSEC benchmark suite shows that Cobra incurs no performance overhead in the absence of failures, and low performance overheads in the presence of single and multiple failures.
△ Less
Submitted 12 June, 2017;
originally announced June 2017.