A note about the volatile keyword in C++

The volatile keyword in C++ is poorly misunderstood. Lots of developers believe that it makes the declared variable an atomic variable, but that’s wrong. The truth is that you should use std::atomic or related code (e.g., mutex) on a variable if you want to guarantee atomic operations in a multithreaded context.

What I want to discuss in this article is something else. Before I start you should remember that the volatile keyword should be used when you want to tell the compiler to not optimize a given variable. It basically says “don’t perform any optimizations on operations on this memory because something else outside the program (which the compiler is not aware of) may change it”. The most common use of volatile is in memory-mapped I/O. Gadgets and electronic peripherals (e.g., sensors, displays, etc.) may communicate with the software and optimizations could break the code.

The use of the volatile keyword has caught my attention recently and I want to show you a curious thing about it. I will go through some code examples and I will use an amazing online tool called gcc.godbolt.org to look at the assembly code generated by the compiler.

With the GCC compiler

Let’s start with a very simple example without the volatile keyword. As you can see in the code below, we have an int variable and a while loop. The GCC compiler realizes that the variable will never change its value and the loop will never finish. So it optimizes the assembly code by ignoring the variable check and keeps jumping forever back to the L2 label.

no_volatile

Now consider the same code, but let’s add volatile to the declaration of the int variable on line 4. As we can see, the GCC compiler now generates an assembly code that is totally different than the previous example and it respects the value == 5 check inside the loop.

volatile1.png

Now let’s change this example a little bit. What would happen if we move the int variable into a struct? Consider the code below without the volatile keyword. The assembly code is the same as in the first example. The GCC compiler ignores the loop check and keeps jumping forever.

no_volatile2

If we want to tell the compiler to not optimize the int variable, should we add the volatile keyword to the int variable declaration inside the struct? Let’s see how this would work:

volatile2

As we can see, the GCC compiler simply ignored the volatile definition inside the struct and optimized away the loop again. A careless developer could easily assume that code was correct and move on to other tasks. Bugs later would require hours of debugging in order to find the source of the problem. The real solution here is to add the volatile keyword to the struct object on line 6:

volatile3.png

I’ve tried different versions of GCC they all have the same result regarding this issue. Now let’s try CLANG and see what happens.

With the CLANG compiler

Using clang 3.9 we can start without the volatile keyword. As you can see below, the loop is optimized just like it happened with the GCC compiler.

clang0.png

Now let’s try the volatile keyword in the struct member variable. We have a surprise here: CLANG doesn’t optimize the loop like GCC did. I tried older versions of CLANG and the output is the same.

clang1.png

We get the same result when we move the volatile to the object on line 6.

clang2.png

So which compiler is the correct one?

It seems to me that CLANG is doing a better job than GCC because the loop deals with the struct object and its member variable. So if the volatile modifier is present either in the object or in its member, then the compiler has to respect that in the loop.

If you have more thoughts about this, please leave comments below.

EDIT 1: I posted this discussion on reddit and we have some great comments there.

EDIT 2: This issue was also discussed on the CppCast Episode 76 with Dan Saks, check it out: http://cppcast.com/2016/10/dan-saks/

6 thoughts on “A note about the volatile keyword in C++

  1. I believe that Clang is correct. Unfortunately for all of us, I’m too lazy to confirm this by referring to the standard. But I do memory-mapped I/O all the time (usually ARM Cortex M target, usually under IAR, but also Keil, GCC and Diab). I frequently declare struct fields to be volatile, or const, or both (e.g. a status register), and the the compilers have always respected my tags.

    I think GCC is odd man out here. But I don’t know for sure that it’s wrong; could be that it’s a dusty corner of the standard that is under-specified and different compilers have made different choices.

    1. As a follow-up to what might otherwise seem like a contradictory statement — I use GCC the least of any tools mentioned for embedded, and I’m not sure I’ve closely examined the output for it. I just did a test with IAR and Keil and they “passed”, however…

  2. The semantics of volatile is such that the compiler is in some sense allowed to do whatever it wants, as its behavior is mostly implementation defined.

    But GCC aims to “do the right thing”, so I’d say this is a bug. It took a quick look at this, and GCC keeps the volatile during the high-level optimizations. But the volatile is lost when going down into the backend, and all code is eliminated while doing the preparation for instruction selection/register allocation, in the same way as I described in my blog post about why volatile is hard to specify and implement: https://kristerw.blogspot.se/2016/04/why-volatile-is-hard-to-specify-and.html

    1. This helps explain my own personal experience (explained above). For my targets, I almost always compile with little-to-no optimization, for debugging purposes. I didn’t ever remember seeing this w/ GCC, though it’s not my primary compiler.

  3. Somewhat unrelated; I’m thinking of adding an “embedded” view to Compiler Explorer, so the kind of things you had to do above to get the inline images will be unnecessary. If you’re interested in this, do contact me (firstname at lastname dot org) and I’ll let you know once the feature is ready!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s