1,000x Faster than PLINK: Genome-Wide Epistasis Detection with Logistic Regression Using Combined FPGA and GPU Accelerators

EasyChair Preprint no. 60

14 pages. Date: April 12, 2018

Abstract

Logistic regression as implemented in PLINK is a powerful and commonly used framework for assessing gene-gene (GxG) interactions. However, fitting a regression model for each pair of markers in a genome-wide dataset is computationally intensive. Performing billions of tests with PLINK takes days, if not weeks, which is why pre-filtering techniques and fast epistasis screenings are applied to reduce the computational burden. Here, we demonstrate that combining a Xilinx UltraScale KU115 FPGA with an Nvidia Tesla P100 GPU reduces the runtime of logistic regression GxG tests on a genome-wide level to only minutes. In particular, a dataset with 53,000 samples genotyped at 130,000 SNPs was analyzed in 8 minutes, a speedup of more than 1,000 compared to PLINK v1.9 using 32 threads on a server-grade computing platform. Furthermore, on-the-fly calculation of test statistics, p-values and LD-scores in double precision makes commonly used pre-filtering strategies obsolete.

Keyphrases: Boost, Computer Science, contingency table, FPGA computing, gene-gene (GxG) interaction, gene-gene interaction, genome wide interaction study, Genome-Wide Association Studies (GWAS), Genome-wide association study, genome-wide interaction studies (GWIS), GPU computing, gpu computing architecture, Hardware Accelerator, heterogeneous architectures, hybrid computing, intel xeon e5, kintex ultrascale ku115, linkage disequilibrium, linkage disequilibrium (LD), logistic regression, logistic regression test, nvidia tesla p100, PLINK, tesla p100 gpu, ultrascale ku115 fpga, xilinx kintex ultrascale

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@Booklet{EasyChair:60,
  author = {Lars Wienbrandt and Jan Christian Kässens and Matthias Hübenthal and David Ellinghaus},
  title = {1,000x Faster than PLINK: Genome-Wide Epistasis Detection with Logistic Regression Using Combined FPGA and GPU Accelerators},
  howpublished = {EasyChair Preprint no. 60},
  doi = {10.29007/w46m},
  year = {EasyChair, 2018}}