Publications of Prof. Dr. Rudolf Berrendorf:

Awards

IARIA Fellow 2015/2016
Best paper award for:
Javed Razzaq, Rudolf Berrendorf, Soenke Hack, Max Weierstall, Florian Mannuss:
Fixed and Variable Sized Block Techniques for Sparse Matrix Vector Multiplication with General Matrix Structures
Proc. Tenth International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2016), Venice, Italy, 2016, pp. 84-90)
Best paper award for:
Rudolf Berrendorf, Max Weierstall, Florian Mannuss:
Program Optimization Strategies to Improve the Performance of SpMV-Operations
Proc. 8th International Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2016), Rome, Italy, 2016, pp. 34-40
Best paper award for:
Rudolf Berrendorf, Matthias Makulla:
Level-Synchronous Parallel Breadth-First Search Algorithms For Multicore and Multiprocessor Systems
Proc. Sixth Intl. Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2014), Venice, Italy, 2014, pp. 26-31
Best paper award for:
Rudolf Berrendorf:
Trading Redundant Work Against Atomic Operations On Large Shared Memory Parallel Systems
Proc. Seventh Intl. Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2013), Porto, Portugal, 2013, pp. 61-66

Publications

Christian Neugebauer, Rudolf Berrendorf, Florian Mannuss:
Improving the Performance of Parallel SpMV Operations on NUMA Systems with Adaptive Load Balancing
Proc. ParCO 2017, Advances in Parallel Computing, Vol. 32, IOS Press, 2018, pp. 445-454
Jan Philipp Ecker, Rudolf Berrendorf, Florian Mannuss:
New Efficient General Sparse Matrix Formats for Parallel SpMV Operations
Francisco F. Rivera, Tomas F. Pena, Jose C. Cabaleiro (Eds.): Proc. 23nd International Conference on Parallel and Distributed Computing (Euro-Par 2017), Santiago de Compostela, Spain, Lecture Notes in Computer Science, Springer, Vol. 10417, 2017, pp. 523-537
Javed Razzaq, Rudolf Berrendorf, Jan P. Ecker, Soenke Hack, Max Weierstall, Florian Mannuss:
The DynB Sparse Matrix Format Using Variable Sized 2D Blocks for Efficient Sparse Matrix Vector Multiplications with General Matrix Structures
International Journal On Advances in Intelligent Systems, Vol. 10, No. 1&2, 2017, pp. 48-58
Rudolf Berrendorf, Max Weierstall, Florian Mannuss:
SpMV Runtime Improvements with Program Optimization Techniques on Different Abstraction Levels
International Journal On Advances in Intelligent Systems, Vol.9, No.3&4, 2016, pp. 417-429
Javed Razzaq, Rudolf Berrendorf, Soenke Hack, Max Weierstall, Florian Mannuss:
Fixed and Variable Sized Block Techniques for Sparse Matrix Vector Multiplication with General Matrix Structures
Proc. Tenth International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2016), Venice, Italy, 2016, pp. 84-90 (best paper award)
Javed Razzaq, Rudolf Berrendorf, Jan Philipp Ecker, Simon Eric Scholl, Florian Mannuss:
Performance Characterization of Current CPUs and Accelerators Using Micro-Benchmarks
International Journal on Advances in Systems and Measurements, Vol.9, No.1&2, 2016, pp. 77-90.
Christoph Lehnert, Rudolf Berrendorf, Jan P. Ecker, Florian Mannuss:
Performance Prediction and Ranking of SpMV Kernels on GPU Architectures
Dutot, Trystram (Eds.): Proc. 22nd International Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, France, Lecture Notes in Computer Science, Springer, Vol. 9833, 2016, pp. 90-102
Rudolf Berrendorf, Max Weierstall, Florian Mannuss:
Program Optimization Strategies to Improve the Performance of SpMV-Operations
Proc. 8th International Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2016), Rome, Italy, 2016, pp. 34-40 (best paper award)
Jan Ecker, Rudolf Berrendorf, Javed Razzaq, Simon E. Scholl, Florian Mannuss:
Comparing Different Programming Approaches for SpMV-Operations on GPUs
R.Wyrzykowski et.al. (Eds.): Proc. 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2015), Krakow, Poland, Springer, LNCS 9573, Part I, pp. 537-547, 2016.
Rudolf Berrendorf, Jan P. Ecker, Javed Razzaq, Simon E. Scholl, Florian Mannuss:
Using Application Oriented Micro-Benchmarks to Characterize the Performance of Single-node Hardware Architectures
Proc. Ninth International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2015), Nice, France, 2015, pp. 28-35
Matthias Makulla, Rudolf Berrendorf:
Evaluating Parallel Breadth-First Search Algorithms for Multiprocessor Systems
Intl. Journal on Advances in Software, Vol. 7, No. 3&4, 2014, pp. 740-751
Rudolf Berrendorf:
A Technique to Avoid Atomic Operations on Large Shared Memory Parallel Systems
Intl. Journal on Advances in Software, Vol. 7, No. 1&2, 2014, pp. 197 - 210
Rudolf Berrendorf, Matthias Makulla:
Level-Synchronous Parallel Breadth-First Search Algorithms For Multicore and Multiprocessor Systems
Proc. Sixth Intl. Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2014), Venice, Italy, 2014, pp. 26-31 (best paper award)
Rudolf Berrendorf:
Trading Redundant Work Against Atomic Operations On Large Shared Memory Parallel Systems
Proc. Seventh Intl. Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP 2013), Porto, Portugal, 2013, pp. 61-66 (best paper award)
Die Programmiersprache C - Ein Nachschlagewerk
RRZN Hannover, 19. Auflage, September 2011
Jan Seidel, Rudolf Berrendorf, Ace Crngarov, Marc-Andre Hermanns:
Optimization Strategies for Data Distribution Schemes in a Parallel File System
Proc. Intl. Conference Parallel Computing: Architectures, Algorithms and Applications, Parco 2007, Aachen, Germany, IOS Press, 2007, pp. 425-432
Jan Seidel, Rudolf Berrendorf, Marcel Birkner, Marc-Andre Hermanns:
High-Bandwidth Remote Parallel I/O with the Distributed Memory Filesystem MEMFS
Recent Advances in Parallel Virtual Machine and Message Passing Interface, Proc. 13th EuroPVM/MPI 2006, Bonn, Germany, LNCS 4192, Springer Verlag, 2006, pp. 222-228
Marc-Andre Hermanns, Rudolf Berrendorf, Marcel Birkner, Jan Seidel:
Flexible I/O Support for Reconfigurable Grid Environments
Proc. Euro-Par 2006, Dresden, Germany, LNCS 4128, Springer Verlag, 2006, pp. 415-424
Rudolf Berrendorf, Marc-Andre Hermanns, Jan Seidel:
Remote Parallel I/O in Grid Environments
Proc. 6th Intl. Conference on Parallel Processing and Applied Mathematics, PPAM2005, Poznan, Poland , LNCS 3911, Springer Verlag, 2006, pp. 212-217
Rudolf Berrendorf:
Verteilte objektorientierte Systeme
Kapitel 7 in Lehr- und Übungsbuch Softwaretechnik (Hrsg: P.Forbrig, I.O.Kerner), Fachbuchverlag Leipzig, 2004
R.Berrendorf, C.Bischof, H.Brunst, M.Bücker, U.Detert, R.Esser, M.Gerndt, J.Grotendorst, I.Gutheil, H.-C.Hoppe, F.Hoßfeld, B.Körfgen, B.Lang, D.an Mey, B.Mohr, W.E.Nagel, K.Solchenbach, G.Sutmann, V.Tikko, L.Wollschläger:
Gekoppelte SMP-Systeme im wissenschaftlich-technischen Hochleistungsrechnen - Status und Entwicklungsbedarf
BMBF Report, 01 IR 903, 2000
Rudolf Berrendorf, Guido Nieken:
Performance Characteristics for OpenMP Constructs on Different Parallel Computer Architectures
Concurrency: Practice and Experience, Vol. 12, 2000, pp. 1261-1273
Rudolf Berrendorf, Guido Nieken:
Performance Characteristics for OpenMP Constructs on Different Parallel Computer Architectures
Proc. 1st European Workshop on OpenMP, Lund, Schweden, 1999, pp. 92-97
Rudolf Berrendorf:
Optimizing Load Balance and Communication on Parallel Computers with Distributed Shared Memory
Proc. Euro-Par'98, Southampton, UK, Springer Verlag, Lecture Notes in Computer Science 1470, 1998, pp. 299-306
Rudolf Berrendorf:
Benutzer- und datengesteuertes Schleifenscheduling auf Parallelrechnern mit Distributed Shared Memory
Dissertation Elektrotechnische Fakultät der RWTH Aachen, 1998, als Jül-3552, Forschungszentrum Jülich GmbH erschienen
Rudolf Berrendorf, Michael Gerndt, Andreas Krumme, Selcuk Özmen:
SVM-Fortran: Eine Programmierumgebung für massiv-parallele Rechner
PIK, 19. Jahrgang, Heft 3, 1996, S. 142-147
Rudolf Berrendorf, Michael Gerndt, Andreas Krumme:
A Programming Environment for Parallel Computers with a Global Address Space
Proc. First Intl. Workshop on High-Level Programming Models and Supportive Environments, Honolulu, Hawaii, IEEE Computer Society Press, Los Alamitos, 1996, pp. 10-16
Rudolf Berrendorf, Michael Gerndt:
Compiling SVM-Fortran for the Intel Paragon XP/S
Proc. Working Conference on Massively Parallel Programming Models MPPM'95, Berlin, Germany, IEEE Computer Society Press, Los Alamitos, 1995, pp. 52-59
Rudolf Berrendorf, Michael Gerndt, Martin Mairandres, Stephan Zeisset:
A Programming Environment for Shared Virtual Memory on the Intel Paragon Supercomputer
Proc.Intel User Group Meeting, Albuquerque, New Mexiko, 1995
Rudolf Berrendorf, Michael Gerndt, Martin Mairandres:
Programming Shared Virtual Memory on the Intel Paragon(TM) Supercomputer
Proc. Fifth Workshop on Compilers for Parallel Computers CPC'95, Malaga, Spanien, 1995, pp. 257-270
Rudolf Berrendorf, Heribert Burg, Ulrich Detert:
Leistungscharakteristika von Parallelrechnern: Fallstudie Intel Paragon
it+ti: Informationstechnik und Technische Informatik, R. Oldenbourg Verlag, 37. Jahrgang, Heft 2, 1995, S. 37-45
Michael Gerndt, Rudolf Berrendorf:
Parallelizing Applications with SVM-Fortran
Proc. High Performance Computing and Networking HPCN'95, Mailand, Italien, Lecture Notes in Computer Science 919, 1995, pp. 793-798
Rudolf Berrendorf, Michael Gerndt, Zakaria Lahjomri, Thierry Priol:
A Comparism of Shared Virtual Memory and Message Passing Programming Techniques Based on a Finite Element Application
Proc. CONPAR 94/VAPP VI, Universität Linz, Österreich, Lecture Notes in Computer Science 854, 1994, pp. 461-472
Rudolf Berrendorf, Ulrich Detert, Jutta Docter, Ursula Ehrhart, Michael Gerndt, Inge Gutheil, Renate Knecht:
Massively Parallel Computing in a Production Environment - iPSC/860 Installation at KFA Juelich
Tagungsband der 23. GI-Jahrestagung in Dresden, Reihe "Informatik aktuell", Springer Verlag, 1993
Rudolf Berrendorf:
Memory Access in Shared Virtual Memory
Informatik-Bericht 93/5, Informatik-Kolloquium über Parallelverarbeitung, K. Ecker, R. Hirschberg, Hsrg., TU Clausthal, 1993, S. 51-66
Rudolf Berrendorf:
Memory Access in Shared Virtual Memory
Proc. Third Workshop on Compilers for Parallel Computers, Vol. 2, Austrian Center for Parallel Computation, Nr. ACPC/TR 93-2, Wien, Österreich, 1993, pp. 13-27
Rudolf Berrendorf:
Memory Access in Shared Virtual Memory
Proc. Conpar 92/VAPP V, Lyon, Frankreich, Springer Verlag, Lecture Notes in Computer Science 634, 1992, pp. 785-786
Rudolf Berrendorf, Jukka Helin:
Evaluating the Basic Performance of the Intel iPSC/860 Parallel Computer
Concurrency: Practice and Experience, Vol. 4, No. 3, 1992, pp. 223-240
Jukka Helin, Rudolf Berrendorf:
Analyzing the Performance of Message Passing MIMD Hypercubes: A Study with the Intel iPSC/860
Proc. ACM Intl. Conference on Supercomputing, Köln, 1991, pp. 376-385
Rudolf Berrendorf, Ulrich Detert:
Tools für Supercomputer
Proc. GI-PARS Workshop "Sprachen und Systeme zur Parallelverarbeitung", Arnoldshain/Taunus, 1990, S. 15-21
Rudolf Berrendorf:
Der FORTRAN-Parser PAFF als wiederverwendbares Werkzeug für Programmier-Tools
Kernforschungsanlage Jülich GmbH, Jül-Spez-537, 1989

Keynotes

Chances and Challenges in Developing Future Parallel Applications
27.05.2014, ComputationWorld 2014, Venice, Italy

Invited Talks

Program and Data Access Optimizations for Parallel Applications on NUMA Systems
25.05.2013, Saudi Aramco, Saudi Arabia
Schleifenscheduling und Graphpartitionierung
25.10.1999, Kolloquium über Parallelverarbeitung in technisch-naturwissenschaftlichen Anwendungen, RWTH Aachen
PCL - A Uniform Interface to Access Hardware Performance Counters on Microprocessors
22.4.1999, Mathematics and Computer Science Division, Argonne National Laboratory, USA
Using Graph-Partitioning Techniques to Schedule Loops on Parallel Computers with Distributed Shared Memory
8.2.1999, NEC Europe Ltd., C&C Research Laboratories, Sankt Augustin
PCL - Eine einheitliche Schnittstelle zum Zugriff auf Performance Counter in Mikroprozessoren
11.12.1998, ZHR-Seminar, TU Dresden
Schleifen-Scheduling auf Parallelrechnern mit Distributed Shared Memory
10.12.1998, ZHR-Kolloquium, TU Dresden
Experiences with Parallel Systems at KFA Jülich
11.4.1996, Maui High Performance Computing Center, Maui, Hawaii
Simulation eines einfachen Parallelrechners mit Shared Virtual Memory
26.2.1993, GMD-First, Berlin
Rosi - A Simulator for Shared Virtual Memory
16.2.1993, INRIA-IRISA, Rennes, Frankreich
Untersuchungen zu Shared Virtual Memory
21.9.1992, Kolloquium über Parallelverarbeitung, Lessach, Österreich
Simulation of Shared Virtual Memory
5.2.1992, Mathematics and Computer Science Division, Argonne National Laboratory, USA

Talks

Program Optimization Strategies to Improve the Performance of SpMV-Operations
22.3.2016, Eighth International Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2016), 2016, Rome, Italy
Leistungsparameter aktueller Parallelrechner und Rechenbeschleuniger: Was Programmierer wissen sollten
22.04.2015, parallel 2015 - Softwarekonferenz für Parallel Programming, Concurrency und Multicore-Systeme, Karlsruhe
Level-Synchronous Parallel Breadth-First Search Algorithms For Multicore and Multiprocessor Systems
27.05.2014, 6th International Conference on Future Computational Technologies and Applications (FUTURE COMPUTING 2014), Venice, Italy
Trading Redundant Work Against Atomic Operations On Large Shared Memory Parallel Systems
01.10.2013, 7th Intl. Conference on Advanced Engineering Computing and Applications in Sciences, ADVCOMP 2013, Porto, Portugal
J.Mockenhaupt, R.Berrendorf, F.Rieg, A.Hinkenjann:
Detektion von Mammatumoren mittels Verformungsanalyse im FE-Modell
15.05.2013, 8. Jahrestagung der Deutschen Gesellschaft für Biomechanik (DGfB), Neu-Ulm
Remote Parallel I/O in Grid Environments
13.09.2005, 6th Intl. Conference on Parallel Processing and Applied Mathematics, PPAM'05, Poznan, Polen
Parallel I/O in VIOLA
07.07.2005, VIOLA-Clusterix-Workshop, Berlin
ZAMpano: Das SMP-Cluster im ZAM
29.5.2000, ZAMpano-Workshop, Forschungszentrum Jülich GmbH
Performance Characteristics for OpenMP Constructs on Different Parallel Computer Architectures
1.10.1999, 1st European Workshop on OpenMP, Lund, Schweden
Optimizing Load Balance and Communication on Parallel Computers with Distributed Shared Memory
3.9.1998, Euro-Par'98, Southampton, UK
Benutzer- und datengesteuertes Schleifen-Scheduling auf Parallelrechnern mit Distributed Shared Memory
28.5.1998, Forschungszentrum Jülich GmbH
T3E-Performance: Prozessor, Kommunikation, I/O
9.6.1997, Cray-Workshop, Forschungszentrum Jülich GmbH
A Programming Environment for Parallel Computers with a Global Address Space
16.4.1996, First Intl. Workshop on High-Level Programming Models and Supportive Environments, Honolulu, Hawaii
Compiling SVM-Fortran for the Intel Paragon XP/S
9.10.1995, Working Conference on Massively Parallel Programming Models MPPM'95, Berlin
Early Experiences with SVM-Fortran
7.10.1994, PEP-Symposium, Parallab, Bergen, Norwegen
Current Performance Comparisms Between Paragon and iPSC/860
18.6.1993, Intel European User Group Meeting, München
Memory Access in Shared Virtual Memory
1.4.1992, CONPAR92, Lyon, Frankreich
Optimizing Parallel Programs for Shared Virtual Memory
9. Juli 1992, International Workshop on Compilers for Parallel Computers, Wien, Österreich
Tools für Supercomputer
25.1.1990, GI-PARS Workshop "Sprachen und Systeme zur Parallelverarbeitung", Arnoldshain/Taunus