Coroutines In C

July 14, 2025

It is virtually a rite of passage for C programmers to realize that they can write their own cooperative multitasking system. C is low-level enough, and there are several ways to approach the problem, so, like Jedi lightsabers, each one is a little bit different. [Christoph Wolcher] took his turn, and not only is his system an elegant hack, if that’s not an oxymoron, it is also extremely well documented.

Before you dig in, be warned. [Christoph] fully admits that you should use an RTOS. Or Rust. Besides, after he finished, he discovered the protothreads library, which does a similar task in a different way that is both more cool and more terrible all at the same time.

Once you dig in, though, you’ll see the system relies on state machines. Just to prove the point, he writes a basic implementation, which is fine, but hard to parse and modify. Then he shows a simple implementation using FreeRTOS, which is fine except for, you know, needing FreeRTOS.
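
To make the state-machine idea concrete, here is a minimal sketch of a hand-written blinker in that style. It is not [Christoph]’s code: the names (blinker_t, blinker_step, BLINK_HALF_PERIOD), the 500-tick half period, and the led field standing in for a real GPIO pin are all assumptions made for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical blinker; names and timings are illustrative only. */
typedef enum { BLINK_OFF, BLINK_ON } blink_state_t;

typedef struct {
    blink_state_t state;
    uint32_t      deadline;  /* tick at which the next transition is due */
    bool          led;       /* stands in for a real GPIO pin */
} blinker_t;

#define BLINK_HALF_PERIOD 500u

void blinker_init(blinker_t *b, uint32_t now)
{
    assert(b != NULL);
    b->state    = BLINK_OFF;
    b->led      = false;
    b->deadline = now + BLINK_HALF_PERIOD;
}

/* Call repeatedly with the current tick count; it never blocks. */
void blinker_step(blinker_t *b, uint32_t now)
{
    assert(b != NULL);

    /* Signed difference keeps the test valid across tick wraparound
       (relies on the usual two's-complement conversion). */
    if ((int32_t)(now - b->deadline) < 0)
        return;  /* not due yet, give control back */

    switch (b->state) {
    case BLINK_OFF:
        b->led   = true;
        b->state = BLINK_ON;
        break;
    case BLINK_ON:
        b->led   = false;
        b->state = BLINK_OFF;
        break;
    }
    b->deadline += BLINK_HALF_PERIOD;
}
```

A main loop would call blinker_step() alongside the step functions of any other tasks. The cost is visible even at this size: the logical flow "on, wait, off, wait" is spread across case labels and a deadline field, which is the readability problem the macro version tries to hide.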

Using a simple set of macros, it is possible to get something very similar to the RTOS version that runs independently, like the original version. Most of the long code snippets show you what code the macros generate. The real code is short and to the point.

Multitasking is a big topic. You can have processes, threads, fibers, and coroutines. Each has its pros and cons, and each has its place in your toolbox.

20 thoughts on “Coroutines In C”

Cad the Mad says:

July 14, 2025 at 9:21 am

You’re not kidding about it being a rite of passage. This brought back memories.

Back in grad school I ran into the problem for the first time because of the limitations of Arduino. On bare-metal microcontrollers I was already in the habit of breaking the application into timer ISRs, but I didn’t have a clean way of doing that on Arduino. What I ended up doing was restructuring all the logic to finish fast and return, and then in the main loop I had some logic to control how often the different functions would be called to iterate. Dirty, hacky, inefficient, but I had a working robot in time for an open house event the next day.

paulvdh says:

July 14, 2025 at 10:37 am

First, I don’t agree he should have used an RTOS. An RTOS can be used for simply making code more readable, but on a small uC, some simple loops or state machines, combined with some ISRs for the time-critical stuff, is usually the best method. Also note that an RTOS does a lot of ISR enabling / disabling during task switching, and this makes the other ISRs less predictable. In general, if you are using an 8-bitter with a few kB of memory, then an RTOS is probably overkill. If you’re using a 32-bitter (ARM-Cortex XMega’s and such) which has several kB of RAM in addition to >= 32kB of Flash, then you’re starting to get into the area where an RTOS can be beneficial.

In his comments he also writes:
“This whole setup is an unholy alliance of C macros, state machines, and sheer willpower. It’s clever, it’s educational”

I have a big dislike for macros and for “clever tricks”, but I do like C++. For his blinking LED example, I would write a little class that has:
1). Function pointer.
2). Two functions: led_on() and led_off().

And then put a simple while() loop in main() that repeatedly calls the function to which the function pointer points. You do have to think a bit about how you structure the functions of your state machine, but that is a good thing, because it forces you to think about program structure. State machines based on a function pointer also have very little overhead, and it’s easy to see the overall structure of the program in the while() loop in main(). It’s also completely free of macro trickery.
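
The function-pointer approach described above might look roughly like the sketch below. It is written in plain C with a struct rather than as the C++ class the comment proposes. Apart from led_on() and led_off(), every name (struct led_task, led_task_init, led_task_run_once) is hypothetical, and the timing between toggles is left out to keep the structure visible.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct led_task;
typedef void (*state_fn)(struct led_task *);

struct led_task {
    state_fn next;     /* the function pointer: which state runs next */
    bool     led;      /* stands in for a real output pin */
    unsigned toggles;  /* counts transitions, for illustration only */
};

void led_on(struct led_task *t);
void led_off(struct led_task *t);

/* Each state does its work, then names its successor. */
void led_on(struct led_task *t)
{
    t->led = true;
    t->toggles++;
    t->next = led_off;
}

void led_off(struct led_task *t)
{
    t->led = false;
    t->toggles++;
    t->next = led_on;
}

void led_task_init(struct led_task *t)
{
    t->led     = false;
    t->toggles = 0;
    t->next    = led_on;
}

/* The body of the while() loop in main(): call whatever state is current. */
void led_task_run_once(struct led_task *t)
{
    assert(t != NULL && t->next != NULL);
    t->next(t);
}
```

In main() this reduces to calling led_task_init() once and then led_task_run_once() inside the while() loop. Adding a state means adding one function and one assignment to next, with no central switch to extend.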

1. dave proctor says:
  
  July 14, 2025 at 11:13 am
  
  C++ on uC’s with virtual function indirection and vf tables can be a huge waste of memory and CPU cycles for some applications. Instead it’s possible to implement a simple C, static compile time OO framework rather than a dynamic runtime OO framework like C++ and remove the indirection and virtual function tables. This can be done with templates, and I’ve been on several projects that use this approach, including Intel SSDs.
  
  1. notmyfault2000 says:
    
    July 14, 2025 at 9:32 pm
    
    Aren’t most compilers smart enough to get rid of the indirection and vtables if they’re never used?
    
  2. Daan Timmer says:
    
    July 16, 2025 at 12:11 am
    
    You are partly right. Yes it increases code size. But as long as you are not on a tight CPU cycle/RAM/ROM budget it usually doesn’t matter.
    
    Maintenance and (unit) testability score higher on the “important” list than code size and code speed.
    
    Our CPU doesn’t do anything for 50% of the time. Better to waste some cycles to increase testability instead.
    
Danjovic says:

July 14, 2025 at 10:49 am

State machines are one step behind, and are a rite of passage as well.

1. James says:
  
  July 14, 2025 at 3:39 pm
  
  What’s the next step? (student currently passing through rites)
  
  1. amwales says:
    
    July 15, 2025 at 12:53 am
    
    The next step is using setjmp/longjmp to implement your own green threads. Wrapping all the C library blocking routines to yield. No assembler.
    Then later you will want a malloc arena allocator for your threads….
    
2. Toon says:
  
  July 15, 2025 at 1:06 am
  
  As a bare-metal embedded developer in C, I came to regard the main process loop (an infinite for loop) as being analogous to an operating system of sorts. It contains only calls to the various process handler functions that the system needs to carry out all of its tasks. There is no preemption (apart from a few ISRs). The trick is to write all of the process handlers, as well as their helper functions, to only do work on what’s immediately available, and return as soon as there’s nothing to do. They should never wait for anything.
  
  Writing code this way depends on writing efficient state machines and never using wait functions, relying instead on timers that can be started and then tested for completion elsewhere within the state machine, typically several process loop iterations later. However, coding this way can be quite challenging for some programmers…
  
  The thing is, this complexity has to exist somewhere within the software stack. It’s either within the operating system itself (e.g. through use of a preemptive task scheduler), in a high-level (e.g. interpreted) language with features that make the programmer’s life easier, OR in efficient process handling written as above. The main benefit of the latter is that by not depending on so much generalised code, the system is more efficient. Useful in an embedded solution, but less so for a general purpose operating system.
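
The pattern in this comment (handlers that never wait, plus timers that are started in one pass and tested in a later one) can be sketched as follows. This is an illustration under stated assumptions, not the commenter’s code: g_ticks stands in for a counter that a timer ISR would increment, and soft_timer_t, swtimer_start(), swtimer_expired(), sensor_process() and the 10-tick conversion time are all made-up names and values.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical tick source; a real system would increment it in a timer ISR. */
static volatile uint32_t g_ticks;

typedef struct {
    uint32_t start;
    uint32_t duration;
    bool     running;
} soft_timer_t;

void swtimer_start(soft_timer_t *t, uint32_t duration)
{
    t->start    = g_ticks;
    t->duration = duration;
    t->running  = true;
}

bool swtimer_expired(const soft_timer_t *t)
{
    /* Unsigned subtraction stays correct when the tick counter wraps. */
    return t->running && (uint32_t)(g_ticks - t->start) >= t->duration;
}

typedef enum { SENSOR_IDLE, SENSOR_WAITING } sensor_state_t;

typedef struct {
    sensor_state_t state;
    soft_timer_t   timer;
    unsigned       samples;  /* number of completed (pretend) readings */
} sensor_t;

/* Process handler: does what is possible right now, then returns. */
void sensor_process(sensor_t *s)
{
    assert(s != NULL);

    switch (s->state) {
    case SENSOR_IDLE:
        swtimer_start(&s->timer, 10);  /* pretend a conversion takes 10 ticks */
        s->state = SENSOR_WAITING;
        break;
    case SENSOR_WAITING:
        if (!swtimer_expired(&s->timer))
            return;                    /* nothing to do yet, do not wait */
        s->samples++;                  /* pretend the result was read */
        s->state = SENSOR_IDLE;
        break;
    }
}

/* One pass of the main process loop; further handlers would be called here. */
void process_loop_once(sensor_t *s)
{
    sensor_process(s);
}
```

The infinite for loop in main() would simply call process_loop_once() forever. Each handler returns within a few instructions, so the worst-case latency of the loop is the sum of the handlers’ longest single steps.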
  
Pat says:

July 14, 2025 at 12:00 pm

As is mentioned in the article, this has been done before, but actually more than once. The protothreads header is mentioned there, but there’s also this:

https://www.chiark.greenend.org.uk/~sgtatham/coroutines.html

by the author of PuTTY.

1. Al Williams says:
  
  July 14, 2025 at 2:16 pm
  
  Yeah it has been done hundreds upon thousands of times. I’ve done three or four versions all by myself ;-)
  
Rastersoft says:

July 14, 2025 at 1:43 pm

I wrote a coroutine system combined with an event queue that greatly simplified working with interrupts. And the most interesting thing is that I was able to avoid using a switch() statement in the macros, so it’s possible to use them inside a coroutine. The code is in the firmware folder of my keyboard project: https://gitlab.com/rastersoft/full-ten-keyless

Scott Hess says:

July 14, 2025 at 2:57 pm

Something I miss about the 80’s is that you would come up with an idea like this and implement it in a couple projects and discuss it with your friends, and you didn’t find out until three months or three years later that someone else had done it (and possibly/probably better). I think spending your time in that sort of low-level playground is essential for building good engineers, in a way that wiring together libraries doesn’t accomplish.

1. James says:
  
  July 14, 2025 at 3:49 pm
  
  Some might say the same about fussing with crystal point contacts over wiring together IC’s for building good electrical engineers…
  
  On a serious note, it would be great if beginner IDEs like Arduino put on display the inner workings of libraries in some way, so that learners could drill down into what makes the microprocessor tick, while still getting difficult things done quickly.
  
  maybe even down to assembly… (https://godbolt.org/)
  
rclark says:

July 14, 2025 at 3:34 pm

“It is virtually a rite of passage for C programmers”. Must be… I remember writing one as well. But I seem to recall at the core was a small assembly code module that did the actual switch to the next task to run. This was back in the x86 days. Never did need it for 68xxx as we used Ready Systems VRTX as the preemptive multitasking core.

James says:

July 14, 2025 at 3:41 pm

I’ve seen this library tossed around as a highly efficient, yet declarative way of writing state machines:

https://github.com/boost-ext/sml

I’ve yet to try it in depth but I’m looking to implement it into my next embedded project…

Christian says:

July 14, 2025 at 8:41 pm

Rite of passage… I expected to see something via setjmp/longjmp.

Jerry says:

July 15, 2025 at 6:16 pm

When I hear “coroutines in C” it reminds me of “coroutine.h” which I wrote ages ago (based on something I found on whatever passed for the Internet at the time), which wraps the function body in a switch statement and defines a “yield” operation which IIRC does something along the lines of “state = __LINE__; return _VAL; case __LINE__:”. You can embed this INSIDE of a block and the (hidden) outer switch will take you back inside of it. It’s kinda weird and surprisingly is actually part of the conformance tests so compilers are required to support it.
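
For readers who have not seen the trick, here is a minimal sketch of a switch-based yield of the kind described above. It is not the commenter’s coroutine.h: the macro names (CO_BEGIN, CO_YIELD, CO_END) and the counter example are made up for illustration, and the sketch assumes the line-number macro __LINE__ is what supplies the case labels.

```c
#include <assert.h>
#include <stddef.h>

/* Coroutine state: the resume point plus any "locals" that must survive
   a yield, since real locals are lost when the function returns. */
typedef struct {
    int line;  /* 0 = not started, -1 = finished, else the line to resume at */
    int i;
} counter_co_t;

/* Hypothetical macro names. The switch is opened in CO_BEGIN and closed
   in CO_END, so every CO_YIELD in between sits inside it. */
#define CO_BEGIN(co)      switch ((co)->line) { case 0:
#define CO_YIELD(co, val) do { (co)->line = __LINE__; return (val); \
                               case __LINE__:; } while (0)
#define CO_END(co)        } (co)->line = -1

/* Yields 1, 2, 3 on successive calls, then returns -1 from then on. */
int counter_next(counter_co_t *co)
{
    assert(co != NULL);

    CO_BEGIN(co);
    for (co->i = 1; co->i <= 3; co->i++) {
        /* The case label lands inside the for loop; the outer switch
           jumps straight back here on the next call. */
        CO_YIELD(co, co->i);
    }
    CO_END(co);
    return -1;
}
```

Starting from a zero-initialised counter_co_t, successive calls to counter_next() return 1, 2, 3 and then -1. Two limitations follow directly from the mechanism: two yields cannot share a source line, and a yield cannot sit inside a switch statement of the coroutine’s own, because its case label would bind to that inner switch instead of the hidden outer one.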

1. Duality says:
  
  July 16, 2025 at 7:44 am
  
  something like on this page?
  https://www.iwriteiam.nl/D241005_c.txt (this happens to be a page of a colleague of mine)
  
Oppy says:

July 16, 2025 at 3:28 am

I’d like to give a shout out for RIOS which I found a useful introduction to this topic. The authors wrote a paper explaining how it works here:
https://dl.acm.org/doi/pdf/10.1145/2530544.2530553
GitHub here:
https://github.com/BerranRemzi/RIOS/tree/main
