if (threadIdx.x < 128) {
float reg[32];
// do something with reg...
} else {
return;
}
let's say each block has 256 threads, but only half of the threads are using registers, and the other half is doing something else (in this case nothing). my question is, how many registers this thread block will use (only concidering reg)? 32 * 256 or 32 * 128 ?