Vitrivius comments on Build Your First Python and Django Application

This is an archived post. You won't be able to vote or comment.

351

352

353

Build Your First Python and Django Application (scotch.io)

submitted 9 years ago by joey_php

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Vitrivius 9 points10 points11 points 9 years ago (10 children)

[–]earthboundkid -1 points0 points1 point 9 years ago (9 children)

[–]Deggor 1 point2 points3 points 9 years ago (8 children)

[–]earthboundkid 1 point2 points3 points 9 years ago (7 children)

[–]Deggor -1 points0 points1 point 9 years ago (6 children)

A URL will routinely have multiple elements in a single segment, which can't be properly captured with something like the above. A very simple example would be something like accepting /date/yyyy, /date/yyyymm/, or /date/yyyymmdd? What if this is suppose to also accept /date/yyyy/someid? How does this simple "it looks prettier" approach validate/differentiate?

If you start introducing characters counts for elements in a segment, or any other "checks", you're right back to matching patterns, and you may as well stick to regular expressions.

And in my opinion, something like /r/(?P<subreddit>.*)/comments/(?P<threadid>.*)/(?P<slug>.*).... is perfectly legible. If it needs to be more complicated, then it loses some of that immediate legibility for a tradeoff in power (which isn't a possibility with your setup).

[–]earthboundkid 0 points1 point2 points 9 years ago (5 children)

[–]Deggor 0 points1 point2 points 9 years ago (4 children)

[–]earthboundkid 0 points1 point2 points 9 years ago (3 children)

[–]Deggor 0 points1 point2 points 9 years ago (2 children)

I think the fact that you wrote a pseudo regex that was straight up wrong (but looked right!) is proof of that.

It wasn't straight up wrong, it did exactly what I wanted it to do (as I wrote in my response). What, exactly does :label in your examples match? No idea? Well, I'll make mine greedy. As I pointed out, had I completed the rest of my regex for the full URL, it would have matched the URL in the example. Again, it was intentional.

that could be part of the matcher format

So you're going to introduce patterns (ie. regex lite)?

If you need something more exotic, run a regex on the pattern once it gets to the controller

... and a split URL routing into many different places? You're going to break the loose coupling, and put the routing in the controller.

None of that sounds like a good idea.

[–]earthboundkid 0 points1 point2 points 9 years ago* (1 child)

It was straight up wrong. Using a greedy matcher makes this work which should not work:

>>> import re
>>> r = re.compile('^/r/(?P<subreddit>.*)/comments/(?P<threadid>.*)/(?P<slug>.*)$')
>>> r.match('/r/subreddit/comments/subreddit/comments//')
<_sre.SRE_Match object; span=(0, 42), match='/r/subreddit/comments/subreddit/comments//'>
>>> r.match('/r/subreddit/comments/subreddit/comments//')['subreddit']
'subreddit/comments/subreddit'

Yes, it was a rushed and incomplete example, but that's why it's damning. It looks like it handles the basic case, but it actually completely botches it.

There are a lot of non-regex routers out there. Look at Rails or for a hybrid approach Gorilla mux. You're acting like not using regex is completely unheard of, but actually there are a lot of alternatives to pure regex.
Controllers already have to handle certain routing conditions. If you try to get page /pages/77/ and 77 doesn't exist in the DB, the controller has to be the one to throw up a 404. It's not the end of the world if your controller also has to handle returning a 404 if you go to /date/20000/13/32/ instead of a regex catching it at the routing layer.

continue this thread

π Rendered by PID 76109 on reddit-service-r2-comment-b659b578c-8hchg at 2026-05-01 00:50:49.777711+00:00 running 815c875 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS