A Compiler and Runtime System for Enabling Data Mining Applications on GPUs Wenjing Ma, Gagan Agrawal | |
A Tunable Holistic Resiliency Approach for High-Performance Computing Systems Stephen L. Scott, Christian Engelmann, Chokchai (Box) Leangsuksun, Frank Mueller | |
Architectural Support for Cilk Computations on Many-core Architectures Guoping Long, Dongrui Fan | |
Exploiting Global Optimizations for OpenMP Programs in the OpenUH Compiler Lei Huang, Deepak Eachempati, Marcus Hervey, Barbara Chapman | |
NePaLTM: Design and Implementation of Nested Parallelism for Transactional Memory System Haris Volos, Adam Welc, Ali-Reza Adl-Tabatabai, Tatiana Shpeisman, Xinmin Tian, Ravi Narayanaswamy | |
Parallelization Spectroscopy: Analysis of Thread-level Parallelism in HPC Programs Arun Kejariwal, Calin Cascaval | |
Preliminary results on NB-FEB, a Synchronization Primitive for Parallel Programming Phuong Ha, Philippas Tsigas, Otto Anshus | |
Software Transactional Distributed Shared Memory Alokika Dash, Brian Demsky | |
Stack-Based Parallel Recursion on Graphics Processors Ke Yang, Bingsheng He, Qiong Luo, Pedro Sander, Jiaoying Shi | |
Topology Aware Task Mapping Techniques: An API and Case Study Abhinav Bhatele, Eric Bohm, Laxmikant V. Kale | |
Towards Concurrency Refactoring for X10 Shane Markstrum, Robert Fuhrer, Todd Millstein | |
Turbocharging boosted transactions or: How I Learnt to Stop Worrying and Love Longer Transactions Chinmay Kulkarni, Osman Unsal, Adrian Cristal, Eduard Ayguade, Mateo Valero |