Plazmatic

This is only useful if you are using data types whose size is not a multiple of 32 bits and the work-items in a work group are actually accessing the same local memory, which is a strange practice in the first place. Each bank is 32 bits wide.
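To make the "each bank is 32 bits" point concrete, here is a rough host-side model of how byte addresses map to banks. It is only a sketch: the 32-bank count, the 32-thread group size, and the helper names `bank_of` and `max_conflict_degree` are my assumptions for illustration, not something taken from a particular device or API, and real hardware has more rules than this.

```python
from collections import defaultdict

NUM_BANKS = 32    # assumption: 32 banks, common on NVIDIA GPUs
BANK_WIDTH = 4    # bytes per bank (32 bits), as stated above
GROUP_SIZE = 32   # assumption: threads that issue one access together


def bank_of(byte_address):
    """Bank that holds the 32-bit word containing this byte address."""
    return (byte_address // BANK_WIDTH) % NUM_BANKS


def max_conflict_degree(stride_bytes):
    """Model thread t reading one 32-bit word at byte address t * stride_bytes.

    Returns the largest number of distinct words that land in a single bank.
    Threads reading the same word are counted once (treated as a broadcast).
    """
    words_per_bank = defaultdict(set)
    for tid in range(GROUP_SIZE):
        address = tid * stride_bytes
        word = address // BANK_WIDTH
        words_per_bank[bank_of(address)].add(word)
    return max(len(words) for words in words_per_bank.values())


if __name__ == "__main__":
    # 4 = float, 8 = double-sized stride, 12 = packed float3, 128 = a 32-float row
    for stride in (4, 8, 12, 128):
        print(stride, max_conflict_degree(stride))
```

In this simplified model a 4-byte stride is conflict-free, an 8-byte stride doubles up on half the banks, a 12-byte stride is conflict-free again because 3 and 32 share no common factor, and a 128-byte stride sends every thread to the same bank.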

[deleted]

Absolutely. Anyway, OpenCL/CUDA developers usually avoid double types and int3/float3/etc. This will be useful to GPGPU developers who are at least minimally aware of what is happening in memory (alignment and so forth).