Accelerating intercommunication in highly parallel systems

Tampouratzis Nikolaos, Matthaiakis Pavlos, Papaefstathiou Ioannis

URI	http://purl.tuc.gr/dl/dias/AC1A7632-C151-4E39-9961-0D5D354B80E2	-
Αναγνωριστικό	https://dl.acm.org/citation.cfm?doid=3012405.3005717	-
Αναγνωριστικό	https://doi.org/10.1145/3005717	-
Γλώσσα	en	-
Μέγεθος	25 pages	en
Τίτλος	Accelerating intercommunication in highly parallel systems	en
Δημιουργός	Tampouratzis Nikolaos	en
Δημιουργός	Ταμπουρατζης Νικολαος	el
Δημιουργός	Matthaiakis Pavlos	en
Δημιουργός	Ματθαιακης Παυλος	el
Δημιουργός	Papaefstathiou Ioannis	en
Δημιουργός	Παπαευσταθιου Ιωαννης	el
Εκδότης	Association for Computing Machinery	en
Περίληψη	Every HPC system consists of numerous processing nodes interconnect using a number of different interprocess communication protocols such as Messaging Passing Interface (MPI) and Global Arrays (GA). Traditionally, research has focused on optimizing these protocols and identifying the most suitable ones for each system and/or application. Recently, there has been a proposal to unify the primitive operations of the different inter-processor communication protocols through the Portals library. Portals offer a set of low-level communication routines which can be composed in order to implement the functionality of different intercommunication protocols. However, Portals modularity comes at a performance cost, since it adds one more layer in the actual protocol implementation. This work aims at closing the performance gap between a generic and reusable intercommunication layer, such as Portals, and the several monolithic and highly optimized intercommunication protocols. This is achieved through the development of a novel hardware offload engine efficiently implementing the basic Portals' modules. Our innovative system is up to two2 orders of magnitude faster than the conventional software implementation of Portals' while the speedup achieved over the conventional monolithic software implementations of MPI and GAs is more than an order of magnitude. The power consumption of our hardware system is less than 1/100th of what a low-power CPU consumes when executing the Portal's software while its silicon cost is less than 1/10th of that of a very simple RISC CPU. Moreover, our design process is also innovative since we have first modeled the hardware within an untimed virtual prototype which allowed for rapid design space exploration; then we applied a novel methodology to transform the untimed description into an efficient timed hardware description, which was then transformed into a hardware netlist through a High-Level Synthesis (HLS) tool.	en
Τύπος	Peer-Reviewed Journal Publication	en
Τύπος	Δημοσίευση σε Περιοδικό με Κριτές	el
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2018-06-28	-
Ημερομηνία Δημοσίευσης	2016	-
Θεματική Κατηγορία	Computer systems organization	en
Θεματική Κατηγορία	Architectures	en
Θεματική Κατηγορία	Parallel architectures	en
Βιβλιογραφική Αναφορά	N. Tampouratzis, P. M. Mattheakis and I. Papaefstathiou, "Accelerating intercommunication in highly parallel systems," ACM T. Archit. Code Op., vol. 13, no. 4, Dec. 2016. doi: 10.1145/3005717	en

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Accelerating intercommunication in highly parallel systems

Tampouratzis Nikolaos, Matthaiakis Pavlos, Papaefstathiou Ioannis

Υπηρεσίες

Εξαγωγή

Κοινοποίηση

Στατιστικά

Μεταδεδομένων & Περιεχομένου σε METS:

Μεταδεδομένων σε Μορφότυπο: