simd

How do I Perform Integer SIMD operations on the iPad A4 Processor?

Hello, I feel the need for speed. Double for loops are killing my iPad apps performance. I need SIMD. How do I perform integer SIMD operations on the iPad A4 processor? Thanks, Doug ...

SIMD code for exponentiation

Hi all, I am using SIMD to compute fast exponentiation result. I compare the timing with non-simd code. The exponentiation is implemented using square and multiply algorithm. Ordinary(non-simd) version of code: b = 1; for (i=WPE-1; i>=0; --i){ ew = e[i]; for(j=0; j<BPW; ++j){ b = (b * b) % p; if (ew...

SIMD version check

Hi, I am using Intel Core2Duo E4500 processor. It is supposed to have SSE3, SSSE3 facilities. But if I try to use them in programs it shows the following error "SSE3 instruction set not enabled" Any ideas? ...

c++ how to write code the compiler can easily optimize for SIMD?

hello there! i'm working in Visual Studio 2008 and in the project settings I see the option for "activate Extended Instruction set" which I can set to None, SSE or SSE2 So the compiler will try to batch instructions together in order to make use of SIMD instructions? Are there any rules one can follow in how to optimize code such that...

SIMD optimization puzzle

I Want to optimize the following function using SIMD (SSE2 & such): int64_t fun(int64_t N, int size, int* p) { int64_t sum = 0; for(int i=1; i<size; i++) sum += (N/i)*p[i]; return sum; } This seems like an eminently vectorizable task, except that the needed instructions just aren't there ... We can assume that N i...