In RAM I have 8 x (int16). I read it with:
__m128i RawInt16 = _mm_load_si128 (pSrc);
I have to convert RawInt16 into 2 registers of 4 x (int32) My code is:
__m128i Zero = { 0,0,0,0,0,0,0,0 };
_mm128i RealInt32_0 = _mm_cvtepi16_epi32(RawInt16); //Low 4xint32
_mm128i RealInt32_1 = _mm_unpackhi_epi16(RawInt16, Zero ); //High 4xint32
Is this the fastest way ?
Thank you, Zvika