Hi!
I'm writing some ARM inline assembly that loops over a huge array of C structs and stores some data into another array.
The processor supports the PLD prefetch instruction.
If I access the data sequentially, is there a performance gain from using PLD to prefetch the start address of the next struct in the array before I start processing the current one? Or should I prefetch the one after next in each iteration? Or prefetch a certain number of bytes ahead?
Does it also make sense to prefetch an address in the destination array?
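
To make the question concrete, here is roughly the loop shape I mean, sketched in plain C rather than inline assembly. It uses the GCC/Clang builtin `__builtin_prefetch`, which as far as I understand becomes a PLD on ARM targets that have it. The struct layout, the `process` function and the `PREFETCH_AHEAD` distance are made up for illustration; the right distance is exactly what I'm unsure about.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical struct layout, only for illustration (32 bytes). */
typedef struct {
    int32_t a;
    int32_t b;
    int32_t pad[6];
} item_t;

/* Assumed prefetch distance in elements: 1 = next struct,
 * 2 = the one after next. This is the value in question. */
#define PREFETCH_AHEAD 2

void process(const item_t *src, int32_t *dst, size_t n)
{
    assert(src != NULL && dst != NULL);

    for (size_t i = 0; i < n; i++) {
        if (i + PREFETCH_AHEAD < n) {
            /* Hint: src element will be read soon (rw = 0). */
            __builtin_prefetch(&src[i + PREFETCH_AHEAD], 0, 3);
            /* Hint: dst element will be written soon (rw = 1). */
            __builtin_prefetch(&dst[i + PREFETCH_AHEAD], 1, 3);
        }
        /* Stand-in for the real per-struct work. */
        dst[i] = src[i].a + src[i].b;
    }
}
```

The bounds check only keeps the hint addresses inside the arrays; the prefetches are hints and don't change the result, so the output is the same with or without them.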
thanks!