Last update:
Thu Aug 1 11:56:36 MDT 2019
Franck Cappello and
Al Geist and
William Gropp and
Sanjay Kale and
Bill Kramer and
Marc Snir Toward Exascale Resilience: 2014 update 5--28
Mateo Valero and
Miquel Moreto and
Marc Casas and
Eduard Ayguade and
Jesus Labarta Runtime-Aware Architectures: A First
Approach . . . . . . . . . . . . . . . . 29--44
Oliver Fuhrer and
Carlos Osuna and
Xavier Lapillonne and
Tobias Gysi and
Ben Cumming and
Mauro Bianco and
Andrea Arteaga and
Thomas Christoph Schulthess Towards a performance portable,
architecture agnostic implementation
strategy for weather and climate models 45--62
Rio Yokota and
George Turkiyyah and
David Keyes Communication Complexity of the Fast
Multipole Method and its Algebraic
Variants . . . . . . . . . . . . . . . . 63--84
Jack Dongarra and
Azzam Haidar and
Jakub Kurzak and
Piotr Luszczek and
Stanimire Tomov and
Asim YarKhan Model-Driven One-Sided Factorizations on
Multicore Accelerated Systems . . . . . 85--115
Julian Martin Kunkel and
Michael Kuhn and
Thomas Ludwig Exascale Storage Systems --- An
Analytical Study of Expenses . . . . . . 116--134
Claudia Rosas and
Judit Giménez and
Jesús Labarta Scalability prediction for fundamental
performance factors . . . . . . . . . . 4--19
Hayk Shoukourian and
Torsten Wilde and
Axel Auweter and
Arndt Bode Predicting the Energy and Power
Consumption of Strong and Weak Scaling
HPC Applications . . . . . . . . . . . . 20--41
Thomas Sterling and
Daniel Kogler and
Matthew Anderson and
Maciej Brodowicz SLOWER: A performance model for Exascale
computing . . . . . . . . . . . . . . . 42--57
Torsten Hoefler and
Dmitry Moor Energy, Memory, and Runtime Tradeoffs
for Implementing Collective
Communication Operations . . . . . . . . 58--75
Seung Woo Son and
Zhengzhang Chen and
William Hendrix and
Ankit Agrawal and
Wei-keng Liao and
Alok Choudhary Data Compression for the Exascale
Computing Era --- Survey . . . . . . . . 76--88
Satoshi Matsuoka and
Hitoshi Sato and
Osamu Tatebe and
Michihiro Koibuchi and
Ikki Fujiwara and
Shuji Suzuki and
Masanori Kakuta and
Takashi Ishida and
Yutaka Akiyama and
Toyotaro Suzumura and
Koji Ueno and
Hiroki Kanezashi and
Takemasa Miyoshi Extreme Big Data (EBD): Next Generation
Big Data Infrastructure Technologies
Towards Yottabyte/Year . . . . . . . . . 89--107
Bernd Mohr Scalable parallel performance
measurement and analysis tools ---
state-of-the-art and future challenges 108--123
Antoni Artigues and
Fernando Martin Cucchietti and
Carlos Tripiana Montes and
David Vicente and
Hadrien Calmet and
Guillermo Marin and
Guillaume Houzeaux and
Mariano Vazquez Scientific Big Data Visualization: a
coupled tools approach . . . . . . . . . 4--18
Onur Mutlu and
Lavanya Subramanian Research Problems and Opportunities in
Memory Systems . . . . . . . . . . . . . 19--55
Alexander N. Daryin and
Anton A. Korzh Early evaluation of direct large-scale
InfiniBand networks with adaptive
routing . . . . . . . . . . . . . . . . 56--69
Alexey Lastovetsky Heterogeneous parallel computing: from
clusters of workstations to hierarchical
hybrid platforms . . . . . . . . . . . . 70--87
Boris M. Glinskiy and
Igor M. Kulikov and
Alexey V. Snytnikov and
Alexey A. Romanenko and
Igor G. Chernykh and
Vitaly A. Vshivkov Co-design of Parallel Numerical Methods
for Plasma Physics and Astrophysics . . 88--98
Vladimir V. Voevodin and
Alexander S. Antonov and
Jack Dongarra AlgoWiki: an Open Encyclopedia of
Parallel Algorithmic Features . . . . . 4--18
Milan Mihajlovic and
Lars Ailo Bongo and
Raimondas Ciegis and
Neki Frasheri and
Dragi Kimovski and
Peter Kropf and
Svetozar Margenov and
Maya Neytcheva and
Thomas Rauber and
Gudula Runger and
Roman Trobec and
Roel Wuyts and
Roman Wyrzykowski and
Jing Gong Applications for ultrascale computing 19--48
Rabab Al-Omairy and
Guillermo Miranda and
Hatem Ltaief and
Rosa M. Badia and
Xavier Martorell and
Jesus Labarta and
David Keyes Dense Matrix Computations on NUMA
Architectures with Distance-Aware Work
Stealing . . . . . . . . . . . . . . . . 49--72
Xiangke Liao and
Shaoliang Peng and
Yutong Lu and
Yingbo Cui and
Chengkun Wu and
Heng Wang and
Jiajun Wen Neo-heterogeneous Programming and
Parallelized Optimization of a Human
Genome Re-sequencing Analysis Software
Pipeline on TH-2 Supercomputer . . . . . 73--83
Georges Da Costa and
Thomas Fahringer and
Juan Antonio Rico Gallego and
Ivan Grasso and
Atanas Hristov and
Helen D. Karatza and
Alexey Lastovetsky and
Fabrizio Marozzo and
Dana Petcu and
Georgios L. Stavrinides and
Domenico Talia and
Paolo Trunfio and
Hrachya Astsatryan Exascale Machines Require New
Programming Paradigms and Runtimes . . . 6--27
Jesus Carretero and
Javier Garcia-Blas and
David E. Singh and
Florin Isaila and
Alexey Lastovetsky and
Thomas Fahringer and
Radu Prodan and
Peter Zangerl and
Christi Symeonidou and
Afshin Fassihi and
Horacio Pérez-Sánchez Acceleration of MPI mechanisms for
sustainable HPC applications . . . . . . 28--45
Pascal Bouvry and
Rudolf Mayer and
Jakub Muszy\'nski and
Dana Petcu and
Andreas Rauber and
Gianluca Tempesti and
Tuan Trinh and
Sébastien Varrette Resilience within Ultrascale Computing
System: Challenges and Opportunities
from Nesus Project . . . . . . . . . . . 46--63
Francisco Almeida and
Javier Arteaga and
Vicente Blanco and
Alberto Cabrera Energy Measurement Tools for Ultrascale
Computing: A Survey . . . . . . . . . . 64--76
Jesus Carretero and
Salvatore Distefano and
Dana Petcu and
Daniel Pop and
Thomas Rauber and
Gudula Rünger and
David E. Singh Energy-efficient Algorithms for
Ultrascale Systems . . . . . . . . . . . 77--104
Michel Bagein and
Jorge Barbosa and
Vicente Blanco and
Ivona Brandic and
Samuel Cremer and
Sebastien Fremal and
Helen Karatza and
Laurent Lefevre and
Toni Mastelic and
Ariel Oleksiak and
Anne-Cecile Orgerie and
Georgios L. Stavrinides and
Sebastien Varrette Energy Efficiency for Ultrascale
Systems: Challenges and Trends from
Nesus Project . . . . . . . . . . . . . 105--131
Marek Michalewicz and
Yuefan Deng Foreword . . . . . . . . . . . . . . . . 4
Hank Childs Data Exploration at the Exascale . . . . 5--13
Kenneth Hon Kim Ban and
Jakub Chrzeszczyk and
Andrew Howard and
Dongyang Li and
Tin Wee Tan InfiniCloud: Leveraging the Global
InfiniCortex Fabric and OpenStack Cloud
for Borderless High Performance
Computing of Genomic Data . . . . . . . 14--27
Jonathan Low and
Jakub Chrzeszczyk and
Andrew Howard and
Andrzej Chrzeszczyk Performance Assessment of InfiniBand HPC
Cloud Instances on Intel Haswell and
Intel Sandy Bridge Architectures . . . . 28--40
David Rohr and
Gvozden Neskovic and
Volker Lindenstruth The L-CSC cluster: Optimizing power
efficiency to become the greenest
supercomputer in the world in the
Green500 list of November 2014 . . . . . 41--48
Kevin A. Huck and
Allan Porterfield and
Nick Chaimov and
Hartmut Kaiser and
Allen D. Malony and
Thomas Sterling and
Rob Fowler An Autonomic Performance Environment for
Exascale . . . . . . . . . . . . . . . . 49--66
Kenneth Moreland and
Matthew Larsen and
Hank Childs Visualization for Exascale: Portable
Performance is Critical . . . . . . . . 67--75
Nachiket Kapre and
Pradeep Moorthy A Case for Embedded FPGA-based SoCs in
Energy-Efficient Acceleration of Graph
Problems . . . . . . . . . . . . . . . . 76--86
Ben Swift and
Andrew Sorensen and
Henry Gardner and
Peter Davis and
Viktor K. Decyk Live Programming in Scientific
Simulation . . . . . . . . . . . . . . . 4--15
Marek T. Michalewicz and
\Lukasz P. Or\lowski and
Yuefan Deng Creating interconnect topologies by
algorithmic edge removal: MOD and SMOD
graphs . . . . . . . . . . . . . . . . . 16--47
Alexander V. Nemukhin and
Igor V. Polyakov and
Alexander I. Moskovsky Multi-Scale Supercomputing of Large
Molecular Aggregates: A Case Study of
the Light-Harvesting Photosynthetic
Center . . . . . . . . . . . . . . . . . 48--54
Alexander A. Danilov and
Kirill M. Terekhov and
Igor N. Konshin and
Yuri V. Vassilevski Parallel software platform INMOST: a
framework for numerical modeling . . . . 55--66
Jack Dongarra and
M. Abalenkovs and
A. Abdelfattah and
M. Gates and
A. Haidar and
J. Kurzak and
P. Luszczek and
S. Tomov and
I. Yamazaki and
A. YarKhan Parallel Programming Models for Dense
Linear Algebra on Heterogeneous Systems 67--86
Suo Guang NR-MPI: A Non-stop and Fault Resilient
MPI Supporting Programmer Defined Data
Backup and Restore for E-scale Super
Computing Systems . . . . . . . . . . . 4--21
Ilya I. Levin and
Alexey I. Dordopulo and
Alexander M. Fedorov and
Igor A. Kalyaev Reconfigurable computer systems: from
the first FPGAs towards liquid cooling
systems . . . . . . . . . . . . . . . . 22--40
Alexander V. Goncharsky and
Sergey Y. Romanov and
Sergey Y. Seryozhnikov Supercomputer technologies in
tomographic imaging applications . . . . 41--66
Alexander A. Moskovsky and
Egor A. Druzhinin and
Alexey B. Shmelev and
Vladimir V. Mironov and
Andrey Semin Server Level Liquid Cooling: Do Higher
System Temperatures Improve Energy
Efficiency? . . . . . . . . . . . . . . 67--74
Michael Kuhn and
Julian Kunkel and
Thomas Ludwig Data Compression for Climate Data . . . 75--94
Fabrice Mizero and
Malathi Veeraraghavan and
Qian Liu and
Robert D. Russell and
John M. Dennis A Dynamic Congestion Management System
for InfiniBand Networks . . . . . . . . 5--20
Michaël Krajecki and
Julien Loiseau and
François Alin and
Christophe Jaillet Many-Core Approaches to Combinatorial
Problems: case of the Langford Problem 21--37
John L. Gustafson A Radical Approach to Computation with
Real Numbers . . . . . . . . . . . . . . 38--53
Jakub Chrzeszczyk and
Andrew Howard and
Andrzej Chrzeszczyk and
Ben Swift and
Peter Davis and
Jonathan Low and
Tin Wee Tan and
Kenneth Ban InfiniCloud 2.0: distributing High
Performance Computing across continents 54--71
Dmitry A. Nikitenko and
Sergey A. Zhumatiy and
Pavel A. Shvets Making Large-Scale Systems Observable
--- Another Inescapable Step Towards
Exascale . . . . . . . . . . . . . . . . 72--79
Mikhail A. Naumenko and
Vyacheslav V. Samarin Application of CUDA technology to
calculation of ground states of few-body
nuclei by Feynman's continual integrals
method . . . . . . . . . . . . . . . . . 80--95
Iosif B. Meyerov and
Sergey I. Bastrakov and
Igor A. Surmin and
Alexey V. Bashinov and
Evgeny S. Efimenko and
Artem V. Korzhimanov and
Alexander A. Muraviev and
Arkady A. Gonoskov Hybrid CPU + Xeon Phi implementation of
the Particle-in-Cell method for plasma
simulation . . . . . . . . . . . . . . . 5--10
Matthijs van Waveren and
Ahmed Seif El Nawasany and
Nasr Hassanein and
David Moon and
Niall O'Byrnes and
Alain Clo and
Karthikeyan Murugan and
Antonio Arena Easy Access to HPC Resources through the
Application GUI . . . . . . . . . . . . 11--18
Jan Fabian Schmid and
Julian M. Kunkel Predicting I/O Performance in HPC Using
Artificial Neural Networks . . . . . . . 19--33
Julian Martin Kunkel Analyzing Data Properties using
Statistical Sampling --- Illustrated on
Scientific File Formats . . . . . . . . 34--39
Jiri Jaros and
Filip Vaverka and
Bradley E. Treeby Spectral Domain Decomposition Using
Local Fourier Basis: Application to
Ultrasound Simulation on a Cluster of
GPUs . . . . . . . . . . . . . . . . . . 40--55
Kedar Kulkarni and
Shreeya Badhe and
Geetanjali Gadre HCA aware Parallel Communication
Library: A feasibility study for
offloading MPI requirements . . . . . . 56--60
Alexander S. Antonov and
Alexey V. Frolov and
Hiroaki Kobayashi and
Igor N. Konshin and
Alexey M. Teplov and
Vadim V. Voevodin and
Vladimir V. Voevodin Parallel Processing Model for Cholesky
Decomposition Algorithm in AlgoWiki
Project . . . . . . . . . . . . . . . . 61--70
Mariem El Afrit and
Yann Le Du and
Rafaël Del Pino and
Guolin Zhang Data merging for the cultural heritage
imaging based on Chebfun approach . . . 71--83
Will Usher and
Ingo Wald and
Aaron Knoll and
Michael Papka and
Valerio Pascucci In Situ Exploration of Particle
Simulations with CPU Ray Tracing . . . . 4--18
Brad Joseph Whitlock and
Earl P. N. Duque In Situ Visualization and Production of
Extract Databases . . . . . . . . . . . 19--29
Alexander Matthes and
Axel Huebl and
René Widera and
Sebastian Grottel and
Stefan Gumhold and
Michael Bussmann In situ, steerable, hardware-independent
and data-structure agnostic
visualization with ISAAC . . . . . . . . 30--48
James Kress and
Randy Michael Churchill and
Scott Klasky and
Mark Kim and
Hank Childs and
David Pugmire Preparing for In Situ Processing on
Upcoming Leading-edge Supercomputers . . 49--65
Konstantin S. Stefanov and
Alexey A. Gradskov Analysis of CPU Usage Data Properties
and their possible impact on Performance
Monitoring . . . . . . . . . . . . . . . 66--73
Mikhail S. Malovichko and
Nikolay E. Khokhlov and
Nikolay B. Yavich and
Michael S. Zhdanov Parallel algorithm for $3$D modeling of
monochromatic acoustic field by the
method of integral equations . . . . . . 74--78
Jakub Kurzak and
Piotr Luszczek and
Ichitaro Yamazaki and
Yves Robert and
Jack Dongarra Design and Implementation of the PULSAR
Programming System for Large Scale
Computing . . . . . . . . . . . . . . . 4--26
Rosa M. Badia and
Eduard Ayguade and
Jesus Labarta Workflows for Science: a Challenge when
Facing the Convergence of HPC and Big
Data . . . . . . . . . . . . . . . . . . 27--47
Thomas Sterling and
Matthew Anderson and
Maciej Brodowicz A Survey: Runtime Software Systems for
High Performance Computing . . . . . . . 48--68
Roscoe Bartlett and
Irina Demeshko and
Todd Gamblin and
Glenn Hammond and
Michael Allen Heroux and
Jeffrey Johnson and
Alicia Klinvex and
Xiaoye Li and
Lois Curfman McInnes and
J. David Moulton and
Daniel Osei-Kuffuor and
Jason Sarich and
Barry Smith and
James Willenbring and
Ulrike Meier Yang xSDK Foundations: Toward an
Extreme-scale Scientific Software
Development Kit . . . . . . . . . . . . 69--82
William Tang and
Bei Wang and
Stephane Ethier and
Zhihong Lin Performance Portability of HPC Discovery
Science Software: Fusion Energy
Turbulence Simulations at Extreme Scale 83--97
Asmi H. Shah and
Jonathan D. Picker and
Saumya S. Jamuar Using High Performance Computing to
Create and Freely Distribute the South
Asian Genomic Database, Necessary for
Precision Medicine in this Population 4--12
Yang Yao and
Khoon-Seng Yeo An Application of GPU Acceleration in
CFD Simulation for Insect Flight . . . . 13--26
Maciej Brodowicz and
Thomas Sterling Simultac Fonton: A Fine-Grain
Architecture for Extreme Performance
beyond Moore's Law . . . . . . . . . . . 27--37
Earle Jennings The Simultaneous Transmit And Receive
(STAR) Message Protocol . . . . . . . . 38--53
Earle Jennings Core Module Optimizing PDE Sparse Matrix
Models With HPCG Example . . . . . . . . 54--70
John L. Gustafson and
Isaac T. Yonemoto Beating Floating Point at its Own Game:
Posit Arithmetic . . . . . . . . . . . . 71--86
Gabriel Noaje and
Alan Davis and
Jonathan Low and
Seng Lim and
Geok Lian Tan and
\Lukasz Or\lowski and
Dominic Chien and
Sing-Wu Liou and
Tin Wee Tan and
Yves Poppe and
Kenneth Ban Hon Kim and
Andrew Howard and
David Southwell and
Jason Gunthorpe and
Marek Michalewicz InfiniCortex --- From Proof-of-concept
to Production . . . . . . . . . . . . . 87--102
Saurabh Hukerikar and
Christian Engelmann Resilience Design Patterns: A Structured
Approach to Resilience at Extreme Scale 4--42
Takuma Kawamura and
Tomoyuki Noda and
Yasuhiro Idomura Performance Evaluation of Runtime Data
Exploration Framework based on In-Situ
Particle Based Volume Rendering . . . . 43--54
Michael Vetter and
Stephan Olbrich Development and Integration of an
In-Situ Framework for Flow Visualization
of Large-Scale, Unsteady Phenomena in
ICON . . . . . . . . . . . . . . . . . . 55--67
Nuttiiya Seekhao and
Joseph JaJa and
Luc Mongeau and
Nicole Y. K. Li-Jessen In Situ Visualization for $3$D
Agent-Based Vocal Fold Inflammation and
Repair Simulation . . . . . . . . . . . 68--79
Ekaterina Olegovna Tyutlyaeva and
Sergey Konyukhov and
Igor Odintsov and
Alexander Moskovsky Seismic Processing Performance Analysis
on Different Hardware Environment . . . 80--90
Germán Ceballos and
Andra Hugo and
Erik Hagersten and
David Black-Schaffer Exploring Scheduling Effects on Task
Performance with TaskInsight . . . . . . 91--98
Roman Kaplan and
Leonid Yavits and
Ran Ginosar From Processing-in-Memory to
Processing-in-Storage . . . . . . . . . 99--116