I have a for-loop to do indexing:
for (int i = 0; i < N; i++) {
    a[i] = b[c[i]];
}
c holds the indices of interest and is an int *, while a and b are float * and hold the values being manipulated.
However, this loop takes a long time, and it can't be allowed to take that long. I'd like a vectorized version, most likely found in BLAS/LAPACK/etc.
I'm looking for something like nested_indexing(float * output_vector, float * input_vector, int * input_indices).
I've tried looking through the docs, but have not found anything.
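To make the request concrete, here is a minimal sketch of the behaviour I'm after, written as a plain scalar loop. The name nested_indexing is only my placeholder and not an existing BLAS/LAPACK routine. The extra length parameter n and the restrict qualifiers are assumptions added for illustration: restrict assumes the three arrays never overlap, which may or may not help the compiler vectorize.

```c
#include <stddef.h>

/* Hypothetical helper (not a library function): a "gather" that copies
 * input_vector[input_indices[i]] into output_vector[i] for i in [0, n).
 * Assumes the arrays do not overlap (restrict) and that every index is
 * a valid position in input_vector; no bounds checking is done. */
void nested_indexing(float *restrict output_vector,
                     const float *restrict input_vector,
                     const int *restrict input_indices,
                     size_t n)
{
    for (size_t i = 0; i < n; i++) {
        output_vector[i] = input_vector[input_indices[i]];
    }
}
```

With this signature the loop above would be called as nested_indexing(a, b, c, N).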