Ιδρυματικό Αποθετήριο
Πολυτεχνείο Κρήτης
EN  |  EL

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

A decoupled access-execute architecture for reconfigurable accelerators

Charitopoulos Georgios, Vatsolakis Charalabos, Chrysos Grigorios, Pnevmatikatos Dionysios

Απλή Εγγραφή


URIhttp://purl.tuc.gr/dl/dias/555CB276-79F1-4351-85E0-1D11756971E5-
Αναγνωριστικόhttps://doi.org/10.1145/3203217.3203267-
Αναγνωριστικόhttps://dl.acm.org/citation.cfm?doid=3203217.3203267-
Γλώσσαen-
Μέγεθος4 pagesen
ΤίτλοςA decoupled access-execute architecture for reconfigurable acceleratorsen
ΔημιουργόςCharitopoulos Georgiosen
ΔημιουργόςΧαριτοπουλος Γεωργιοςel
ΔημιουργόςVatsolakis Charalabosen
ΔημιουργόςΒατσολακης Χαραλαμποςel
ΔημιουργόςChrysos Grigoriosen
ΔημιουργόςΧρυσος Γρηγοριοςel
ΔημιουργόςPnevmatikatos Dionysiosen
ΔημιουργόςΠνευματικατος Διονυσιοςel
ΕκδότηςAssociation for Computing Machineryen
ΠερίληψηMapping computational intensive applications on reconfigurable technology for acceleration requires two main implementation parts: (a) the data plane, i.e., efficient interconnected units that accelerate processing, and (b) the access-plane, i.e., efficient ways to access data and transfer them to/from the accelerator. Data plane construction is well understood and mature tools -such as High Level Synthesis (HLS)- that produce efficient reconfigurable architectures exist. The access plane, however, is more challenging: data fetching for big-data and high-performance computing applications is even more complex and time consuming than processing. Towards this end,we presentDAER, a Decoupled Access-Execute architecture and framework for Reconfigurable accelerators. Our approach maps the code to be accelerated in two separate parts: (a) the fetch unit, responsible for fetching data to the accelerator and storing results back in memory, and (b) the processing unit, which processes the fetched data in a streaming way. This approach offers the user a structured and well-defined way of mapping applications on an FPGA. Additionally, it bodes well with other hardware-based optimization techniques, e.g. pipelining, custom processing and data prefetching, which hide the memory data access latency. We use the DAER framework and HLS mapping tools on five applications and show the proposed DAER framework achieves an order of magnitude performance speed-up compared to unmodified applications, and as much as 2x performance improvement compared to their optimized HLS versions. We, also, map the DAER-based architectures on HPC platforms showing the performance advantages of our approach on real world platforms.en
ΤύποςΠλήρης Δημοσίευση σε Συνέδριοel
ΤύποςConference Full Paperen
Άδεια Χρήσηςhttp://creativecommons.org/licenses/by/4.0/en
Ημερομηνία2019-09-12-
Ημερομηνία Δημοσίευσης2018-
Θεματική ΚατηγορίαAccelerationen
Θεματική ΚατηγορίαData handlingen
Θεματική ΚατηγορίαMemory architectureen
Βιβλιογραφική ΑναφοράG. Charitopoulos, C. Vatsolakis, G. Chrysos and D.N. Pnevmatikatos, "A decoupled access-execute architecture for reconfigurable accelerators," in 15th ACM International Conference on Computing Frontiers, 2018, pp. 244-247. doi: 10.1145/3203217.3203267en

Υπηρεσίες

Στατιστικά