Hi Guys,
I can't make out the difference between vld4_f32 and vld4q_f32 in ARM NEON.
The confusion started when I dug deeper and began looking at the generated assembly instructions rather than the less informative intrinsics.
The reason I need to use a vld4 variant here is that I would...
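For reference, a minimal sketch of how the two intrinsics differ, assuming interleaved 4-channel float data. vld4_f32 fills four 64-bit D registers (2 floats per channel, 8 floats read), while vld4q_f32 fills four 128-bit Q registers (4 floats per channel, 16 floats read). The function names deinterleave4_d and deinterleave4_q are made up for illustration, and the scalar branch is only there so the snippet also builds on machines without NEON.

```c
#include <stddef.h>
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper: D-register form, reads 8 floats (2 elements x 4 channels). */
void deinterleave4_d(const float *src, float ch[4][2])
{
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    float32x2x4_t v = vld4_f32(src);   /* val[c] holds channel c, 2 lanes each */
    vst1_f32(ch[0], v.val[0]);
    vst1_f32(ch[1], v.val[1]);
    vst1_f32(ch[2], v.val[2]);
    vst1_f32(ch[3], v.val[3]);
#else
    /* Scalar equivalent of the de-interleaving load. */
    for (size_t i = 0; i < 2; ++i)
        for (size_t c = 0; c < 4; ++c)
            ch[c][i] = src[4 * i + c];
#endif
}

/* Hypothetical helper: Q-register form, reads 16 floats (4 elements x 4 channels). */
void deinterleave4_q(const float *src, float ch[4][4])
{
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    float32x4x4_t v = vld4q_f32(src);  /* val[c] holds channel c, 4 lanes each */
    vst1q_f32(ch[0], v.val[0]);
    vst1q_f32(ch[1], v.val[1]);
    vst1q_f32(ch[2], v.val[2]);
    vst1q_f32(ch[3], v.val[3]);
#else
    for (size_t i = 0; i < 4; ++i)
        for (size_t c = 0; c < 4; ++c)
            ch[c][i] = src[4 * i + c];
#endif
}
```

Both perform the same stride-4 de-interleave; the q variant simply handles twice as many elements per load.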
Hello,
I feel the need for speed. Double for loops are killing my iPad app's performance. I need SIMD. How do I perform integer SIMD operations on the iPad's A4 processor?
Thanks,
Doug
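One possible starting point, as a sketch only: the A4 is built around an ARM Cortex-A8 core, which includes the NEON SIMD unit, so integer SIMD can be written with the arm_neon.h intrinsics. The example below adds two int32 arrays four lanes at a time. The function name add_i32 is invented for illustration, and the scalar loop doubles as the tail handler and as a fallback when NEON is not available.

```c
#include <stddef.h>
#include <stdint.h>
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper: out[i] = a[i] + b[i] for i in [0, n). */
void add_i32(const int32_t *a, const int32_t *b, int32_t *out, size_t n)
{
    size_t i = 0;
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    /* Process four 32-bit integers per iteration in one Q register. */
    for (; i + 4 <= n; i += 4) {
        int32x4_t va = vld1q_s32(a + i);
        int32x4_t vb = vld1q_s32(b + i);
        vst1q_s32(out + i, vaddq_s32(va, vb));
    }
#endif
    /* Remaining elements (or everything, without NEON). */
    for (; i < n; ++i)
        out[i] = a[i] + b[i];
}
```

Whether this actually beats the compiler's own output depends on the loop; it is worth measuring before and after.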
...
I have recently started using NEON intrinsics in my iOS image convolution code and have a shaky grasp of them at best. Right now, I get the pixel data from CGBitmapContextGetData(cgctx), but I would like to take advantage of de-interleaving using vld4 (ARGB data). What is the best way to do this? I'm sure it's one of those simple things I ...
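A rough sketch of what de-interleaving with vld4 could look like, assuming the buffer is 8 bits per channel, 4 bytes per pixel, in A, R, G, B byte order with no row padding (the real order and stride depend on the bitmap info the context was created with). The pointer returned by CGBitmapContextGetData would be cast to const uint8_t * and passed in as px. The name split_argb and the planar output buffers are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper: splits `count` interleaved ARGB pixels into four planes. */
void split_argb(const uint8_t *px, size_t count,
                uint8_t *a, uint8_t *r, uint8_t *g, uint8_t *b)
{
    size_t i = 0;
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    /* vld4_u8 reads 32 bytes (8 pixels) and de-interleaves them:
       val[0] = 8 x A, val[1] = 8 x R, val[2] = 8 x G, val[3] = 8 x B
       under the assumed byte order. */
    for (; i + 8 <= count; i += 8) {
        uint8x8x4_t v = vld4_u8(px + 4 * i);
        vst1_u8(a + i, v.val[0]);
        vst1_u8(r + i, v.val[1]);
        vst1_u8(g + i, v.val[2]);
        vst1_u8(b + i, v.val[3]);
    }
#endif
    /* Leftover pixels, or all of them when NEON is unavailable. */
    for (; i < count; ++i) {
        a[i] = px[4 * i + 0];
        r[i] = px[4 * i + 1];
        g[i] = px[4 * i + 2];
        b[i] = px[4 * i + 3];
    }
}
```

In a convolution you would normally keep the channels in registers and operate on v.val[...] directly rather than storing planes; the stores here just make the layout visible.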