PTX: Difference between revisions
From dankwiki
Revision as of 07:52, 14 December 2010
PTX (Parallel Thread Execution) is the ISA to which CUDA's nvcc compiles source code. When a PTX module is loaded through the CUDA runtime, the hardware driver JIT-compiles it into architecture-specific machine language, which can then be scheduled for execution on CUDA devices. From version 2.1 of the PTX ISA Reference:
- PTX defines a virtual machine and ISA for general purpose parallel thread execution. PTX programs are translated at install time to the target hardware instruction set. The PTX-to-GPU translator and driver enable NVIDIA GPUs to be used as programmable parallel computers.
Versions
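PTX Version | CUDA Toolkit Version | Changes |
---|---|---|
2.2 | 3.2 | New kernel parameter directives for pointer arguments; flat address space for constants (backwards compatibility for constant banks); texture changes for OpenCL, bilerp (bilinear interpolation) and high-bw loads |
2.1 | 3.1 | |
2.0 | 3.0 | |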
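A PTX module declares its ISA version in a leading `.version` directive, so the table above can be applied mechanically. The following is a minimal sketch rather than a real tool: the helper names (`ptx_version`, `toolkit_for`) and the sample text are made up for illustration, and the mapping covers only the three rows listed in the table.

```python
import re

# PTX ISA version -> CUDA Toolkit version, copied from the Versions table.
PTX_TO_TOOLKIT = {"2.0": "3.0", "2.1": "3.1", "2.2": "3.2"}

# Matches a `.version major.minor` directive at the start of a line.
_VERSION_RE = re.compile(r"^\s*\.version\s+(\d+)\.(\d+)", re.MULTILINE)


def ptx_version(ptx_source):
    """Return the 'major.minor' string from the .version directive, or None."""
    m = _VERSION_RE.search(ptx_source)
    return f"{m.group(1)}.{m.group(2)}" if m else None


def toolkit_for(ptx_source):
    """Return the toolkit version paired with this PTX version, or None if unlisted."""
    return PTX_TO_TOOLKIT.get(ptx_version(ptx_source))


# Hand-written header for this sketch (hypothetical, not compiler output).
SAMPLE_PTX = """\
.version 2.2
.target sm_20
"""

if __name__ == "__main__":
    print(ptx_version(SAMPLE_PTX), toolkit_for(SAMPLE_PTX))
```

Versions outside the table return `None` instead of a guess, since the table does not say how later releases pair up.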
Tools
- Marcin Wilhelm Kościelnicki's nv50dis, a disassembler