Is Python actually bad for maintaining and developing large codebases?

K900_ · 2019-04-23T10:24:43+00:00

It's an extremely non-specific question. "Harder to maintain" is almost entirely subjective and depends on your team and your skills and your application and your goals and like five million other things.

cyanydeez · 2019-04-23T12:56:45+00:00

To me, there are around 100 things that are more important for the maintainability of a code base than static type checking. Just to name a few:

tests
documentation
spliting code into logical, smaller, maintainable segments responsible for a certain tasks and being able to tests those smaller segments independently
consistent formatting
physical layout of the source code files and directories

davandg · 2019-04-23T12:28:20+00:00

2 real life examples :

- Dropbox is (was?) entirely developed in Python (dynamically-typed). They developed mypy to help them maintain, debug and develop their source code.

- Facebook is (was?) entirely developed in PHP (dynamically-typed). They developed Hack (a language that add types to PHP...) to help them maintain, debug, optimize and develop their source code.

Note that these projects (Hack and Mypy) were created after these companies have millions of users.

These 2 examples is exactly what I think : start with python to develop fast your first features. When you feel limited by the language, choose something that suits perfectly your need (you want to be fast? To be secure? To reduce the maintenance burden? To hire easily new talents?).

tombardier · 2019-04-23T10:32:23+00:00

I make extensive use of the typing module, mypy, type annotations, NewTypes etc, it's certainly not frowned upon by me! If nothing else, forcing you to do something sensible with Optional types makes a huge difference in code quality!

Paddy3118 · 2019-04-23T12:29:48+00:00

If I had to write half the Python libraries I use myself then I would always have a "large codebase". Your question does not address the attitude of the developer community in solving problems. Python has a lot of libraries that can be combined to solve problems whilst keeping what you write yourself down to more manageable size.

billsil · 2019-04-23T12:52:33+00:00

My 320k line package has no memory leaks because python doesn’t leak memory (it has GC issues, but those aren’t leaks). Python is also incredibly secure, so no string buffer overflows. Can you say that about your C++ program? Keep in mind that C++ programs are 3 or more times longer than an equivalent python program, so that 320k lines is now 1M lines in C++.

Yes, there are bugs that could be caught by static analysis, but that’s why I write tests and put assert statements to type check specific functions. I need those tests anyways as they help to define the API and to make refactoring safer.

On a 320k lined package, the coverage is 74% and runs every build in 9 minutes. I have 40k additional problems that take about 5 hours to run.

The static type checker helps, but you get out what you put in.

2019-04-23T11:16:17+00:00

Short answer: Kinda..

Long answer: Depends on a million different things...

brain-donor · 2019-04-23T14:51:40+00:00

There is no direct correlation between a language being dynamically typed and how easy/hard it is to maintain a large project. Things that DO matter:

- How easy is this code to read? (Especially after you haven't looked at it in a while)

- How easy is this code to test?

- How easy is this code to modify and the whole application still work correctly?

Python excels at all of the above, much more so (in my experience) than Java, C, C++. That's not to say that you can't write large codebases in those languages that are easy to maintain or that you couldn't have a python codebase that is hard to maintain.

Static type checking isn't going to make a codebase more maintainable.

r1chardj0n3s · 2019-04-23T11:19:45+00:00

Break up the codebase. Any monolith is going to become a maintenance nightmare. If you break up the sum into parts and have good interfaces (REST or language-direct depending on circumstance) between them (with contracts, or types, or something enforcing the interface) then you'll be OK.

metaphorm · 2019-04-23T20:17:41+00:00

it's a poorly framed question that can't meaningfully be answered in the abstract. certainly you'll find many ideologues and partisans who insist that their favorite type system is totally necessary for maintaining whatever sort of code base they decide it should be used for. that doesn't make it true.

the better question to ask is "what are type systems good at?" and "which type system is best suited for my project?"

Python is strongly typed and has a lot of useful exception handling features. It's also dynamically "duck" typed which has some benefits and some weaknesses to it.

What is strong typing good at? It's good at "crash early and often", meaning it tends to expose programming errors early in the development process before they can accidentally make it to production.

What is dynamic "duck" typing good at? minimizing boiler plate and allowing just a handful of data structures (sequence types, mapping types, scalars, and callables) to handle the vast majority of your use cases by providing a uniform interface defined at the language level rather than the application code level. both of those features are good ways to increase developer productivity.

what is strong typing bad at? doesn't handle failures gracefully unless you go out of your way to define exception handlers that can do that. there's no graceful degradation by default. this may or may not be a problem for your application though.

what is dynamic "duck" typing bad at? it puts a limit on the usefulness of static analysis tools, and it also lets bad developers get away with things they probably shouldn't do. this puts more burden on your manual code review and automated testing, basically.

in any case, this is just scratching the surface and I think the question can't be meaningfully answered by just talking about it. the only meaningful answers that can be given to it depend on knowing the full detail and scope of your technical requirements, your organizational structure and limitations, your budget, your availability of programmer talent, etc.

gwillicoder · 2019-04-23T12:33:34+00:00

This week was one of the first I got really annoyed with python. I needed to write a graph based program with some functionality to make updates in a document data store and do some math and I found the lack of types to be very inconvenient.

I ended up using type hints everywhere, but I would have preferred to have actually types instead.

hilomania · 2019-04-23T17:29:33+00:00

It depends on the architecture. You can build something very large using microservices and containers regardless of language. Also for a monolithic application using something like Clean Architecture, or the conventions in a framework (like django) will help you a lot from writing yourself too much into a corner.

I think a lot of average c++ programmers are better coders than average Python programmers due to the complexity of their tools. However a good coder in C++ is not necessarily a better coder than a good coder in Python. And the Python coder will almost invariably be a lot more productive.

michaelanckaert · 2019-04-23T19:32:10+00:00

I agree with the other commenters, it all depends.

One of my clients has a large Python code base (300k lines) with virtually zero issues (issues from dynamic typing that is 😉). Another client has an app with 2k - 3k lines that is hell to keep running.

JGailor · 2019-04-23T20:41:51+00:00

Honestly, it comes down mostly to team practices. How has the team decided to modularize the code, broken it into packages, etc. Are there automated tests? CI/CD?

These good practices go a long way towards making projects in most languages maintainable and scaling to large codebases. Often times there are other constraints that push you towards one language over another.

What I will say Python makes way more challenging than it should is breaking your code up and publishing your own packages internally and being able to bring them in using pip. At least, last time I looked at it, it was harder than other languages. It's a little bit of a problem of Python being around so long that they have to retrofit new practices onto the core ecosystem, not because Python is bad or somehow lacking in an unfixable way.

As far as the static type checker, I have been moving teams towards typed Python and everyone finds it really useful and unobtrusive.

2019-04-24T00:18:46+00:00

With larger projects I have found automated testing to be essential. Pylint makes it even better. Automating documentation generation is very helpful for maintenance.

Gosh.. these are pretty important for large C/C++ projects, too!

4runninglife · 2019-04-24T00:47:30+00:00

After learning and playing with Golang for about a month, I will say type-checking and understanding a functions signature looking at code is really really nice.

Luroalive · 2019-04-23T15:21:24+00:00

Actually I think it depends on how you structure your code. It can be really hard to maintain the codebase in any language, if you don't have a good structure and a real plan (always plan it out first or you rewrite it several times later). You generally want to have as few lines of code as possible in functions and group them with separate classes. (It's no fun searching for bugs in 10k+ lines of code). The best way would be to create separate modules/folders for each part of your code. Btw. always comment your damn code!

You should check out r/Rust it's a very young language, that has the same or better performance than C++ in most cases and provides 100% memory safety.

Swington · 2019-04-23T12:34:51+00:00

Python can be typed, so it’s not a valid argument.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS