A Case Study of Application Analytical Modeling in Heterogeneous Computing Environments
An Asynchronous Protocol for Release Consistent Distributed Shared Memory Systems
Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation
Parallel Performance of Block ILU Preconditioners for a Block-tridiagonal Matrix
Performance Analysis of Linear Algebraic Functions Using Reconfigurable Computing