This tutorial is strongly inspired from the OpenACC Bootcamp session presented at GPU Technology Conference 2016.
OpenACC is an application programming interface (API) for porting code onto accelerators such as GPU and coprocessors. It has been developed by Cray, CAPS, NVidia and PGI. Like in OpenMP, the programmer annotates C, C++ or Fortran code to identify portions that should be parallelized by the compiler.
This tutorial uses OpenACC to accelerate C, C++ or Fortran code. A working knowledge of one of these languages is therefore required to gain the most benefit out of it.
This tutorial is based on examples. You can download all of the examples in this Github repository.
- Gathering a profile and getting compiler information
- Expressing parallelism with OpenACC directives
- Expressing data movement
- Optimizing loops
Here are some useful external references: