[avr-gcc-list] Re: C vs. assembly performance

From:	David Brown
Subject:	[avr-gcc-list] Re: C vs. assembly performance
Date:	Sun, 01 Mar 2009 01:00:25 +0100
User-agent:	Thunderbird 2.0.0.19 (Windows/20081209)

Georg-Johann Lay wrote:

David Brown wrote:
Nicholas Vinen wrote:
OK, I only spent a few minutes looking at old code and I found some obviously sub-optimal results. It distills down to this:
#include <avr/io.h>

int main(void) {
  unsigned long packet = 0;

  while(1) {
    if( !(PINC & _BV(PC2)) ) {
      packet = (packet<<1)|(((unsigned char)PINC>>1)&1);
    }
    PORTB = packet;
  }
}
Did you write the code like this just to test the optimiser? It
As far as I understand, it's a stripped-down example to demonstrate the code bloat in a reproducible way (compilable source).

Yes, I understand - it's just bad luck that it happens to be particularly tough code for the optimiser.

However, avr-gcc constantly surprises me in the quality of its code generation - it really is very good, and it has got steadily better through the years. Sometimes it pays to think a bit about the way your source code is structured, and maybe test out different arrangements.
Source code structure is a concern of the project, not of the compiler.
Even for braindead code that comes from a code generator, a compiler is supposed to yield good results.

That's true in theory - but embedded programmers are used to the difference between theory and practice (there's an interesting discussion about the theory and practice of "volatile" on comp.arch.embedded at the moment). In theory, the compiler should generate good code no matter how the source code is structured. In practice, the experienced programmer can do a lot to help the tools. avr-gcc *does* do a good job with most code - I do much less re-structuring of my source code for avr-gcc than I do for most other compilers (I use a lot of compilers for a lot of different targets).

I am inspecting the produced asm in some of my AVR projects with hard real-time requirements, too. But I would not encourage anyone to dig in the generated asm and try to get the best code by re-arranging it or trying to find other algebraic representations. That takes a lot of time, and a compiler should care for the sources it gets, not the other way round. And if your code is intended to be cross-platform, you are stuck. If your code changes some 100 source lines away from the critical code, the inefficient code can return and you have to rewrite your code again to find another representation that avoids the bad code.

It is certainly true that you want to keep such compiler-helpful structuring to a minimum. But if you are trying to write efficient code (rather than emphasising portability or development speed or other priorities), you *must* be familiar with your compiler and the types of code it generates for particular sequences of input. You can very quickly learn some basic tricks that can make a great difference to the generated code with very little re-structuring of the source code. A prime example is to use 8-bit data rather than traditional C "int" where possible. Another case in point is to prefer explicit "if" conditionals rather than trying to calculate a conditional expression, such as was done here (if you are using a heavily pipelined processor, the opposite is true).
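To make the second point concrete, here is a rough sketch of the two styles side by side. It is only an illustration and not necessarily the re-write discussed earlier in the thread: the names STROBE_BIT, DATA_BIT, shift_in_computed and shift_in_explicit are made up for this example, and the pin register is passed in as a plain parameter so the code can be compiled and checked on a PC. On a real AVR the value would come from a volatile read of PINC, and whether the second form actually produces tighter code depends on the avr-gcc version and options, so it is worth checking the generated assembly.

```c
#include <stdint.h>

/* Hypothetical symbolic names for the pins used in the example
 * (PC2 and PC1 in the original code). */
#define STROBE_BIT 2u   /* a bit is sampled while this pin is low */
#define DATA_BIT   1u   /* the pin carrying the data bit */

/* Computed form, as in the original example: the data bit is
 * extracted by shifting and masking, then OR-ed into the packet. */
uint32_t shift_in_computed(uint32_t packet, uint8_t pins)
{
    if (!(pins & (1u << STROBE_BIT))) {
        packet = (packet << 1) | ((uint32_t)(pins >> DATA_BIT) & 1u);
    }
    return packet;
}

/* Explicit-"if" form: shift the packet, then test the data pin
 * and set the low bit only when the pin is high. */
uint32_t shift_in_explicit(uint32_t packet, uint8_t pins)
{
    if (!(pins & (1u << STROBE_BIT))) {
        packet <<= 1;
        if (pins & (1u << DATA_BIT)) {
            packet |= 1u;
        }
    }
    return packet;
}
```

Both functions compute the same result for every input; the difference is only in how the intent is expressed to the compiler, and the symbolic pin names make it clearer which pins are involved.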

But I fully agree that you should not be hand-optimising all your source code and studying the generated assembly - the readability of the source code is more important than the tightness of the generated code in all but the most time-critical sections (there's no point in writing fast code if you can't be sure it's correct!).

However, in this case, I believe that my re-write is better source code, although I'm aware that's a personal preference. I think it is much clearer what the code is doing, and it is far more obvious which pins are being used - it would also be much easier for proper code (rather than this example code) in which the pins would normally have defined symbolic names rather than "magic numbers" in the code.
