|
From: | Ethan Eade |
Subject: | Re: [libcvd-members] FAST crashes (again!) |
Date: | Tue, 21 Aug 2007 19:10:15 +0100 |
User-agent: | Thunderbird 2.0.0.6 (X11/20070728) |
Yes, it seems Ed changed the code to work with SubImage (hence the change to a double loop), but it should be 'x+=16'.for(int x=16; x < xend; x++) { const byte* p = &I[y][x]; __m128i lo, hi; { const __m128i here = load_si128<Aligned>((const __m128i*)(p));Notice how p is is being incremented in 1-byte steps, and is then loaded as if it were 16-byte aligned. Hmm! I think one of the 9 or 10 had this incremented in 16-byte steps earlier.
[Prev in Thread] | Current Thread | [Next in Thread] |