5 votes

On a simplified approach to achieve parallel performance and portability across CPU and GPU architectures