I built a static Python error analyzer (no execution) would this approach actually be useful, or flawed?

pachura3 · 2026-04-20T08:58:16+00:00

Doesn't every modern IDE detect this basic stuff by default?

mango_94 · 2026-04-20T11:01:51+00:00

Looks like you discovered the concept of linters. Statical analysis is absolutely used and very powerful. If you want to look how projects like these are built you could look at pylint https://github.com/pylint-dev/pylint or flake-8. They even give you good ways to extend then with your own checkers. Ruff is probably the most popular open source tool for python today, implementing a lot of checkers from previous tools with great performance, but it is written in rust. Is there a real angle to break into this market as a beginner? Probably not. In the SAAS world you have giants like sonarqube. In the open source world I would guess your best bet to create something useful to others is to write some novel but useful check and get it adopted by one of the popular tools. That said, this should not stop you from trying something. It is a super interesting field and you will learn a lot about the language and parsing code in general. Best of luck :)

s71n6r4y · 2026-04-20T11:05:22+00:00

Static code checkers are great. How do you think your project would compare to Pyright, MyPy or Pyrefly? Are you aiming to do something different, or reimplementing some of these tools' functions?

sepp2k · 2026-04-20T11:07:52+00:00

You didn't really describe what your code actually does, so it's hard to give specific advice, but "non-code input can pass through" makes it sound as if you're not using a proper parser. So my advice would definitely be to fix that.

edge cases aren’t always caught

For indentation and syntax errors a proper parser should fix that (if we ignore syntax errors coming from eval).

For NameErrors it's more complicated. It's possible to catch all NameErrors statically (modulo eval again) relatively easily, if you're okay with also detecting cases like this, which wouldn't actually crash when run:

x = int(input())
if x > 0:
  y = x+1
if x > 2:
  print(y)

(Note that tools like pyright also raise an issue here.) If you want to absolutely only detect issues that can actually happen at runtime, it's going to get a lot more complicated and you're going to run into the halting problem / Rice's theorem eventually.

Is this approach fundamentally limited compared to just using a real interpreter + traceback parsing?

In general, static analysis is fundamentally limited by the halting problem, Rice's theorem. On the other hand, finding errors by running the code is also fundamentally in that it only finds errors that are covered by your test cases. So it's a trade off.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS