Institutional Repository
Technical University of Crete
EN  |  EL

Search

Browse

My Space

Design and implementation of a multi-FPGA acceleration system for large-scale population genomics analyses based on linkage disequilibrium

Bozikas Dimitrios

Simple record


URIhttp://purl.tuc.gr/dl/dias/8199C02C-9F29-4793-A1F1-C6D1658A8FD4-
Identifierhttps://doi.org/10.26233/heallink.tuc.67515-
Languageen-
Extent2.11 megabytesen
TitleDesign and implementation of a multi-FPGA acceleration system for large-scale population genomics analyses based on linkage disequilibriumen
TitleΣχεδίαση και υλοποίηση συστήματος πολλαπλών FPGA για την επιτάχυνση μεγάλης κλίμακας αναλύσεων πληθυσμιακής γενομικής βασιζόμενες σε ανισορροπία σύνδεσηςel
CreatorBozikas Dimitriosen
CreatorΜποζικας Δημητριοςel
Contributor [Thesis Supervisor]Dollas Apostolosen
Contributor [Thesis Supervisor]Δολλας Αποστολοςel
Contributor [Committee Member]Pnevmatikatos Dionysiosen
Contributor [Committee Member]Πνευματικατος Διονυσιοςel
Contributor [Committee Member]Papaefstathiou Ioannisen
Contributor [Committee Member]Παπαευσταθιου Ιωαννηςel
PublisherΠολυτεχνείο Κρήτηςel
PublisherTechnical University of Creteen
Academic UnitΠολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστώνel
DescriptionΔιπλωματική εργασία που υποβλήθηκε στη σχολή ΗΜΜΥ του Πολ. Κρήτης για την πλήρωση προϋποθέσεων λήψης Πτυχίου.el
Content SummaryModern sequencing technologies have contributed to the creation and rapid expansion of DNA databases, already numbering thousands of whole genomes. The astonishing rate at which genomic data are being collected, combined with the fact that it has outpaced Moore’s law, establishes the necessity of the development of novel tools, capable of conducting large-scale genomics analyses efficiently. This work addresses the computational challenges inherent to the analysis of linkage disequilibrium (LD) levels in large-scale datasets. LD is a statistic that quantifies the non-random association between alleles at different genomic locations. While it contributes to a wide variety of genomics and genetics analyses, the compute- and memory-intensive operation of counting set bits (population count) in large vectors, required for the estimation of LD, hinders the efficient use of modern CPUs for such analyses, mainly due to the lack of a vectorized population counter. To overcome this obstacle, we present a novel hardware architecture for the calculation of pairwise LD scores based on reconfigurable logic. The proposed accelerator exploits the ability of reconfigurable machines to be programmed at the hardware level. The effective use of multiple levels of parallelism, combined with the efficient manipulation of the data structures on memory through the transformation of the memory layout, result in high throughput capabilities for the estimation of LD in arbitrarily large datasets. The architecture is, subsequently, mapped onto a high-performance heterogeneous computing platform that enables the parallel cooperation of 4 reconfigurable devices, while, simultaneously, providing a high-speed memory interface. The implemented accelerator is evaluated for analyses of simulated genomic data of varying sizes, through its comparison with corresponding state-of-the-art parallel software implementations, achieving speedups between 6.35X and 134.93X, depending on the dataset size. Concerning real-world analyses, such as scanning the 22nd chromosome of the human genome, the accelerator is capable of potentially achieving quintupled throughput when compared to highly optimized reference software running on multiple cores.en
Type of ItemΔιπλωματική Εργασίαel
Type of ItemDiploma Worken
Licensehttp://creativecommons.org/licenses/by/4.0/en
Date of Item2017-03-08-
Date of Publication2017-
SubjectΑναδιατασσόμενη λογικήel
SubjectReconfigurable logicen
SubjectFPGAen
SubjectConveyen
SubjectLinkage disequilibriumen
SubjectΑνισορροπία σύνδεσηςel
Bibliographic CitationDimitrios Bozikas, "Design and implementation of a multi-FPGA acceleration system for large-scale population genomics analyses based on linkage disequilibrium", Diploma Work, Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Chania, Greece, 2017en
Bibliographic CitationΔημήτριος Μπόζικας, "Σχεδίαση και υλοποίηση συστήματος πολλαπλών FPGA για την επιτάχυνση μεγάλης κλίμακας αναλύσεων πληθυσμιακής γενομικής βασιζόμενες σε ανισορροπία σύνδεσης", Διπλωματική Εργασία, Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2017el

Available Files

Services

Statistics