TI OpenCL v01.01.xx
TI OpenCL™ Runtime Documentation Contents:
- Introduction
- OpenCL 1.1 Reference Material
- Compilation
- Memory Usage
- Device Memory
- How DDR3 is Partitioned for Linux System and OpenCL
- Alternate Host malloc/free Extension for Zero Copy OpenCL Kernels
- The OpenCL Memory Model
- OpenCL Buffers
- Buffer Read/Write vs. Map/Unmap
- Discovering OpenCL Memory Sizes and Limits
- Cache Operations
- Large OpenCL buffers and Memory Beyond the 32-bit DSP Address Space
- User Defined DSP Heap Extension
- Execution Model
- Extensions
- Calling Standard C Code From OpenCL C Code
- Calling Standard C code with OpenMP from OpenCL C code
- C66x standard C compiler intrinsic functions
- OpenCL C code using printf
- DMA Control Using EdmaMgr Functions
- Using Extended Memory on the 66AK2x device
- Fast Global buffers in on-chip MSMC memory
- OpenCL C Builtin Function Extensions
- Cache Operations
- Environment Variables
- Optimization Tips
- Optimization Techniques for Host Code
- Optimization
Techniques for Device (DSP) Code
- Prefer Kernels with 1 work-item per work-group
- Use Local Buffers
- Use async_work_group_copy and async_work_group_strided_copy
- Avoid DSP writes directly to DDR
- Use the reqd_work_group_size attribute on kernels
- Use the TI OpenCL extension than allows Standard C code to be called from OpenCL C code
- Avoid OpenCL C Barriers
- Use the most efficient data type on the DSP
- Do Not Use Large Vector Types
- Consecutive memory accesses
- Prefer the CPU style of writing OpenCL code over the GPU style
- Typical Steps to Optimize Device Code
- Optimizing 3x3 Gaussian smoothing filter
- Performance Data
- Examples
- Building and Running
- Example Descriptions
- platforms example
- simple example
- mandelbrot, mandelbrot_native examples
- ccode example
- matmpy example
- offline example
- vecadd_openmp example
- vecadd_openmp_t example
- vecadd example
- vecadd_mpax example
- vecadd_mpax_openmp example
- dsplib_fft example
- ooo, ooo_map examples
- null example
- sgemm example
- dgemm example
- edmamgr example
- dspheap example
- Float compute example
- Monte Carlo example
- Debug
- Profiling
- OpenCL on TI-RTOS
- Frequently Asked Questions
- How do I get support for TI OpenCL products?
- Which TI OpenCL Version is Installed?
- Using Python OpenCL with the TI OpenCL implementation
- Guidelines for porting Stand-alone DSP applications to OpenCL
- OpenCL Interoperability with Host OpenMP
- MCSDK-HPC to OpenCL Component Version Map
- Does TI’s OpenCL support images and samplers?
- Why does the OpenCL ICD installed on my platform not find the TI OpenCL implementation?
- Why do I get messages about /var/lock/opencl when running OpenCL applications?
- Why do I get DLOAD error messages when running OpenCL applications?
- How do I limit log file sizes on EVM’s temporary file storage (tmpfs)?
- Readme
- Disclaimer
- Important Notice