|
From: | Dominik Auras |
Subject: | Re: [Discuss-gnuradio] Writing SIMD code with sse |
Date: | Thu, 13 Dec 2007 21:03:35 +0100 |
User-agent: | Thunderbird 2.0.0.9 (X11/20071115) |
Hi!I am currently struggling with the memory/cache performance of the most used inner loops in my code. For my code, I am pretty sure that most clock cycles it spends are related to cache miss. I used VTune and Cachegrind to analyze the code. But all I got was the information THAT I frequently miss the cache. They don't give a reason.
So, maybe you can point me to a good website, or give me a hint? Is there a program that can tell me why this happens? E.g. for the Cell processor, there is a static analysis tool that tells you everything about your code. When did it stall, why did it stall, how many stall cycles etc.
Thanks Dominik
[Prev in Thread] | Current Thread | [Next in Thread] |