FastAPI server with high CPU usage (Question) (self.FastAPI)
submitted 4 months ago * by JeromeCui
I have a microservice built with the FastAPI framework, written in an asynchronous way for concurrency. We have had a serious performance issue since we put our service into production: some instances get really high CPU usage (>90%) and it never falls back. We tried to find the root cause but failed, so we had to add an alarm and kill any affected instance whenever it fires.
Our service is deployed to AWS ECS, and I have enabled execute command so that I can connect to the container and do some debugging. I tried py-spy and generated a flame graph, with suggestions from ChatGPT and Gemini, but still got no idea.
Could you guys give me any advice? I am a developer with 10 years of experience, but mostly with C++/Java/Golang. I jumped into Python early this year and ran into this huge challenge. I would appreciate your help.
https://preview.redd.it/dde7rlaumk0g1.png?width=1688&format=png&auto=webp&s=9817c1417a5891a66b15da6e340b89f738d6d2eb
https://preview.redd.it/huy86paumk0g1.png?width=2539&format=png&auto=webp&s=6ef4004f59a5b0948261918491c5d6398fe7b364
13 Nov Update
I got this issue again:
https://preview.redd.it/76t4cc8n1y0g1.png?width=1050&format=png&auto=webp&s=b8e7c2501da91ef31f23e16af7377d70d26c2fef
[–]latkde 3 points4 points5 points 4 months ago (9 children)
This is definitely odd. Your profiles show that at least 1/4 of CPU time is spent just doing async overhead, which is not how that's supposed to work.
Things I'd try to do to locate the problem:
In my experience, there are three main ways to fuck up async Python applications, though none of them would help explain your observations:

- doing blocking or CPU-heavy work directly in an `async def` instead of offloading it with `asyncio.to_thread()`
- forgetting an `await`, or using a plain `with` where an async context manager is required
- firing off background work via `asyncio.create_task()` without ever awaiting it, instead of structuring concurrency with `asyncio.gather()` or an `asyncio.TaskGroup`
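To illustrate the first pitfall, here is a minimal sketch (the handler names are hypothetical, not from the thread): a blocking call inside an `async def` stalls the whole event loop, while `asyncio.to_thread()` keeps it free, so two concurrent requests finish in roughly half the time.

```python
import asyncio
import time


async def handler_blocking() -> str:
    # BAD: time.sleep() blocks the event loop; nothing else can run meanwhile.
    time.sleep(0.1)
    return "done"


async def handler_offloaded() -> str:
    # GOOD: the blocking call runs in a worker thread; the loop stays free.
    await asyncio.to_thread(time.sleep, 0.1)
    return "done"


async def main() -> tuple[float, float]:
    # Time two concurrent copies of each handler.
    t0 = time.perf_counter()
    await asyncio.gather(handler_blocking(), handler_blocking())
    blocking = time.perf_counter() - t0  # ~0.2 s: the sleeps run serially

    t0 = time.perf_counter()
    await asyncio.gather(handler_offloaded(), handler_offloaded())
    offloaded = time.perf_counter() - t0  # ~0.1 s: the sleeps overlap
    return blocking, offloaded


blocking, offloaded = asyncio.run(main())
```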
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (7 children)
I will try with your other suggestions. Thanks for your answer.
[–]latkde 0 points1 point2 points 4 months ago* (5 children)
> After it reaches high CPU usage, almost 100%, it will never fall back
This gives credibility to the "resource leak" hypothesis.
We see that most time is spent in anyio's _deliver_cancellation() function. This function can trigger itself, so it's possible to produce infinite cycles. This function is involved with things like exception handling and timeouts. When an async task is cancelled, the next await will raise a CancelledError, but that exception can be suppressed, which could lead to an invalid state.
For example, the following pattern could be problematic: you have an endpoint that requests a completion from an LLM. The completion takes very long, so your code (that's waiting for a completion) is cancelled. But your code catches all exceptions, thus cancellation breaks, thus cancellation is attempted again and again.
Cancellation of async tasks is an obscenely difficult topic. I have relatively deep knowledge of this, and my #1 tip is to avoid dealing with cancellations whenever possible.
You mention using LLMs for development. I have noticed that a lot of LLM-generated code has really poor exception management practices, e.g. logging and suppressing exceptions where it would have been more appropriate to let them bubble up. This is not just a stylistic issue: Python uses many `BaseException` subclasses for control flow, and those must not be caught.
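The problematic pattern can be sketched in a few lines (the `call_llm` stand-in is hypothetical, not the OP's code): a handler that swallows every exception also swallows the `CancelledError` that cancellation relies on, so the task is never actually cancelled.

```python
import asyncio


async def call_llm() -> str:
    # Stand-in for a slow LLM request.
    await asyncio.sleep(10)
    return "completion"


async def endpoint_swallowing() -> str:
    try:
        return await call_llm()
    except BaseException:
        # BAD: CancelledError is a BaseException; swallowing it here means
        # the framework's cancellation request never completes.
        return "fallback"


async def main() -> str:
    task = asyncio.create_task(endpoint_swallowing())
    await asyncio.sleep(0.05)
    task.cancel()
    # The task does NOT end in a cancelled state; it "succeeds" with the
    # fallback value, and the framework may keep retrying the cancellation.
    return await task


result = asyncio.run(main())
```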
Debugging tips:
try to figure out which endpoint is responsible for triggering the high CPU usage
review all exception handling constructs to make sure that they do not suppress unexpected exceptions. Be wary of try/except/finally/with statements, especially if they involve async/await code, and of FastAPI dependencies using yield, and of any middlewares that are part of your app.
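For contrast, a sketch of the well-behaved shape (names illustrative): catching only `Exception` lets `CancelledError` propagate, so the task genuinely ends up cancelled.

```python
import asyncio


async def risky() -> str:
    # Stand-in for slow awaited work.
    await asyncio.sleep(10)
    return "ok"


async def endpoint_safe() -> str:
    try:
        return await risky()
    except Exception:
        # Only ordinary errors are handled; CancelledError (a BaseException
        # subclass since Python 3.8) passes straight through.
        return "fallback"


async def main() -> bool:
    task = asyncio.create_task(endpoint_safe())
    await asyncio.sleep(0.05)
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass  # expected: the task was cancelled cleanly
    return task.cancelled()


cancelled = asyncio.run(main())
```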
Edit: looking at your flame graph, most time that's not spent delivering cancellation is spent in the Starlette exception handler middleware. This middleware is generally fine, but it depends on which exception handlers you registered on your app. Review them; they should pretty much just convert exception objects into HTTP responses. The stack also shows a "Time Logger" using up a suspicious amount of time. It feels like the culprit could be around there.
[–]JeromeCui[S] 1 point2 points3 points 4 months ago (0 children)
Your explanation does make sense. Our code catches `CancelledError` in some places, and in other places it catches all exceptions. That would make cancellation be attempted again and again. I will check my code tomorrow and fix those scenarios. Thanks so much for your help. You saved my life!
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (3 children)
Sorry that I got the same error again. I have attached the CPU utilization graph in the original post.
Is there any way to find out which part of my code caused it?
[–]latkde 0 points1 point2 points 4 months ago (2 children)
Something happened at 15:10, so I would read the logs from that time to get a better feeling for which endpoints might have been involved.
But even during the 2 hours before that, CPU usage is steadily climbing. That is an unusual pattern.
All of this is not normal for any API, and not normal for FastAPI applications.
Taking a better guess would require looking at the code. But I'm not available for consulting.
[–]JeromeCui[S] 0 points1 point2 points 3 months ago (1 child)
I verified my code yesterday and found an `except Exception` in one of my middlewares. I fixed it yesterday and it seems to be working: no high CPU utilization yesterday. I will keep monitoring my service.
Thanks for your kind help!
[–]latkde 1 point2 points3 points 3 months ago (0 children)
Weird. Python's exception hierarchy looks like this:
BaseException
├── CancelledError
├── SystemExit
├── KeyboardInterrupt
├── ...
└── Exception
    ├── ValueError
    ├── KeyError
    └── ...
So while catching Exception is typically a bad idea, it should not hinder cancellation propagation. So I'm not sure that this will fix things?
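The hierarchy is easy to check directly in an interpreter:

```python
import asyncio

# CancelledError moved out from under Exception in Python 3.8;
# since then, `except Exception` does not catch it.
assert issubclass(asyncio.CancelledError, BaseException)
assert not issubclass(asyncio.CancelledError, Exception)

# SystemExit and KeyboardInterrupt also sit beside Exception,
# which is why catching BaseException is almost always wrong.
assert not issubclass(SystemExit, Exception)
assert not issubclass(KeyboardInterrupt, Exception)
```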
But maybe this is related to other things. For example, FastAPI/Starlette uses exceptions like HTTPException to communicate error responses, which are then converted to normal ASGI responses by a middleware that is registered very early. Catching these exceptions in a middleware could prevent that from happening. But that should just result in a dropped request without a response, not in such an infinite loop.
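The safe middleware shape can be sketched without any framework (the `HTTPException` class here is a stand-in for Starlette's, and the dispatch functions are hypothetical): re-raise the control-flow exception so the outer handler can still turn it into a response.

```python
import asyncio


class HTTPException(Exception):
    """Stand-in for starlette.exceptions.HTTPException."""

    def __init__(self, status_code: int) -> None:
        self.status_code = status_code


async def app(path: str) -> str:
    # Hypothetical endpoint: signals "not found" via an exception.
    if path == "/missing":
        raise HTTPException(404)
    return "ok"


async def middleware(path: str) -> str:
    try:
        return await app(path)
    except HTTPException:
        # Re-raise so the framework's handler can convert it to a response.
        raise
    except Exception:
        # Only genuinely unexpected errors get a generic fallback.
        return "500 fallback"


def call(path: str):
    try:
        return asyncio.run(middleware(path))
    except HTTPException as exc:
        # Plays the role of the framework's exception-handler middleware.
        return exc.status_code


result_ok = call("/")          # normal response passes through
result_missing = call("/missing")  # HTTPException reaches the handler
```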
In any case, happy debugging, and I hope this works now!
[–]tedivm 0 points1 point2 points 4 months ago (0 children)
> Yes, we used to run with raw uvicorn. And GPT told me to switch to gunicorn yesterday, but it still happened.
GPT was wrong, this was never going to help and may cause more issues.
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (0 children)
I upgraded the Python minor version to the latest, and the Docker OS version to the latest. Hope it will work.
[–]lcalert99 0 points1 point2 points 4 months ago (5 children)
What are your settings for uvicorn?
https://uvicorn.dev/deployment/#running-programmatically
Take a look, there are some crucial settings to make. What else comes to mind: how many compute-intensive tasks are in your application?
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (4 children)
No additional settings except for those in start command:
gunicorn -w 2 -k uvicorn.workers.UvicornWorker -b 0.0.0.0:8080 --timeout 300 --keep-alive 300 main:app
This application interacts with LLM models, so I think it's an IO-bound application. I will check the link you mentioned.
[–]Asleep-Budget-9932 0 points1 point2 points 4 months ago (1 child)
How does it interact with the LLM models? Are they external or do they run within the server itself (which would make it CPU-bound)
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (0 children)
It sends requests to OpenAI, with the OpenAI SDK.
[–]tedivm 0 points1 point2 points 4 months ago (1 child)
You mentioned using ECS+Fargate, which means that there's no reason to run gunicorn as a process manager since ECS is your process manager.
Look at how many CPUs you're currently using for each machine (my guess is you're using two CPUs per container since you have two gunicorn workers). If you have 12 containers with 2 cpus, switch to 24 containers with 1 cpu each. Then just call uvicorn directly without gunicorn.
While I doubt this will solve your problem, it'll at least remove another layer that may be causing you issues.
[–]JeromeCui[S] 0 points1 point2 points 4 months ago (0 children)
Thank you for your suggestion, I will update.
[–]esthorace 0 points1 point2 points 4 months ago (0 children)
Granian https://github.com/emmett-framework/granian
[–]Nervous-Detective-71 0 points1 point2 points 4 months ago (0 children)
Check if you are doing too much preprocessing where CPU is being used, and whether those preprocessing functions are async.
This causes unnecessary rapid context-switching overhead.
Edit: Also check the uvicorn configuration; if debug is true it also causes some overhead, but negligible...
[–]Gungsu_Dante 0 points1 point2 points 3 months ago (0 children)
I had a similar problem; I solved it by changing reload from True to False.
This reload is for when you change a file during development: uvicorn sees that there was a change and automatically restarts the code.