Re: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os

avr-gcc-list

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os

From:	David Brown
Subject:	Re: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os
Date:	Tue, 27 May 2008 11:32:46 +0200
User-agent:	Thunderbird 2.0.0.14 (Windows/20080421)

I think this is getting *way* off-topic for this list. However, itseems there are a number of people on the list who are interested in thethread. Perhaps it should move to avr-chat?


Scott and Roxanne Munns wrote:

David,

I agree that this is a "point of view" thing.  And I will admit that most of
my embedded programming (Coldfire, not AVR) is spent doing C++ programming,
so that may make my view more data-centric than code-centric.

I know that you understand the "critical section" issue, and you are able to
state its requirements much more eloquently than I can.  I would trust your
ability to recognize critical section issues and solve them properly,
whether you think about them in a data-centric way or in a code-centric way.
However, I am amazed how many embedded programmers *don't* understand how to
work with critical sections.  Some developers I have talked to seem to
interpret critical sections as "if some *code* can be executed in two
different contexts, it needs a critical section to handle it".

I too am sometimes amazed at the lack of thought and understanding somedevelopers have - although I have more often seen too little thoughtgiven to critical sections and other protection, than too much. Acommon example is code writing messages to a screen called from bothmain loops and interrupts in the same system.

I can't say I've seen code using critical sections just because the codesection is called from two different places in the program, but thatcould just be my luck. Perhaps you've come across people who are usedto programming for the PIC and other small micros where compilers oftengenerate non-reentrant code (local variables are statically allocatedinstead of on the stack), and the programmers are thus extra paranoid.

Maybe if you're talking about two threads accessing a shared code path, and
that code path is also changing the state of shared data objects, that is
correct.  If the developer always protects any shared code paths, the code
will probably be safe in its code sequences, just very slow.

On AVR, at least without an OS installed, there are no threads.  There are
only contexts: mainline code context and interrupt contexts.  Depending on
how the developer writes the code, it is possible that no code may get
shared between contexts, only data.  Ideally, the developer should use
shared functions to modify shared data objects and keep them in a coherent
state, but there is no guarantee they will do that.  They may just clone
code between the interrupt function and the mainline code.  Now there is no
"shared code path".

In my opinion, that is the possible flaw of thinking in a code-centric way.
The developer may think there is no risk (no overlapping code paths), when
the data is actually at risk.

If the developer is thinking in those lines, then he does not understandthe issues at hand. As you say, there is no need to protect codesequences themselves (unless you are dealing with non-reentrant codethat can be called from different contexts) - the purpose is ultimatelyto protect the data.

Exclusively using shared functions to modify shared data is *not* ideal- it is very far from ideal on small systems looking for optimal code,and it is seldom ideal on larger systems. It certainly has advantagesin some situations - in this case, you only have a small number of codesections that need locks or other types of protection, so it is hard toget wrong. But it can often lead to larger or slower code, and it canlead to at least two fundamental misunderstandings.

First, if you think you can make your code safe for multi-threading (orother multiple context access) by protecting individual data objects,you will have problems when you need to protect sequences of access. Itcan be useful having a "queue" class with atomic "push" and "pop"classes, but it's easy then to forget that "q1.push(q2.pop())" is notatomic - you still need locks around the code sequence.

Secondly, if you believe too strongly in your "protected data items"approach, you'll forget that all sorts of other resources are shared andneed protecting. It's common to see code that is very careful aboutprotecting access to a shared queue - and then happily calls printf() orsimilar functions from multiple contexts.

There is probably a corollary to this for a data-centric way of thinking.
Perhaps thinking that the get/set operations need critical sections, and
forgetting that the in-between code sequences that modify a data object are
also a vital part of maintaining overall data object coherency.  In other
words, don't "drop the lock" until you're done modifying the data object!

Correct - that's why it is the *code sequence* that has the lock, notthe data object.

However, if you are using an OS with mutexes, semaphores, and otherlocks (rather than the global "disable interrupts" lock), it's importantthat you use the correct lock in the correct place. I certainly don'tmean to imply that you use a particular lock for a sequence of codeaccessing some data, and a different lock for a different sequence ofcode accessing the same data!

Either way, it comes down to training a developer about critical sections
and making sure they really understand the core issues.  I guess I've been
disappointed on that front up to this point.

I guess as I ponder this issue, I am struggling to see which critical
section primitives should be provided as "base GCC+avr-libc" independent of
any OS, and which should be provided by an OS on top of the basic toolset.
Do you have any ideas for guidelines we can use to choose which ones belong
in each category?

Personally, I think all that is really needed is a clear and fast way toensure that a section of code is run with interrupts disabled, and withthe interrupt state restored at the end. It's up to the OS to providehigher level locks, probably implemented using this disable interruptprimitive. For those that are not using an OS, or only using a verysimple one, something like a "critical" or "monitor" function attributeis sufficient.


mvh.,

David

[Prev in Thread]

Current Thread

[Next in Thread]

RE: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os, (continued)

Prev by Date: RE: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os
Next by Date: [avr-gcc-list] __checksum
Previous by thread: RE: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os
Next by thread: Re: [avr-gcc-list] Avr-gcc Produces Incorrect Code with -Os
Index(es):
- Date
- Thread