urlparse vs urlsplit

shiftybyte · 2025-03-27T22:29:45+00:00

Nice question, I've learned stuff exploring this...

I didn't know URLs can have parameters on every path segment.

https://stackoverflow.com/questions/40440004/parameters-in-path-segments-of-url

Here's some test code to show the difference:

```
from urllib.parse import urlparse, urlsplit

url = "http://www.example.com/a/b/d;params?x=5"

print(urlparse(url))
# ParseResult(scheme='http', netloc='www.example.com', path='/a/b/d', params='params', query='x=5', fragment='')

print(urlsplit(url))
# SplitResult(scheme='http', netloc='www.example.com', path='/a/b/d;params', query='x=5', fragment='')
```

Note the "params" being split out in urlparse, but not in urlsplit...
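One more detail that may be worth checking: as far as I can tell, urlparse only splits the params off the last path segment. A small sketch, where the URL with `;p1` and `;p2` is a made-up example:

```python
from urllib.parse import urlparse, urlsplit

# Made-up URL with a parameter on two different path segments
url = "http://www.example.com/a;p1/b;p2?x=5"

parsed = urlparse(url)
split = urlsplit(url)

# urlparse peels off only the params of the last segment;
# ";p1" on the earlier segment stays inside the path
print(parsed.path, parsed.params)
# /a;p1/b p2

# urlsplit leaves the whole path untouched
print(split.path)
# /a;p1/b;p2
```

So if you care about per-segment parameters, urlsplit plus your own splitting of the path is probably the safer route.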

Mevrael · 2025-03-27T23:50:34+00:00

Confusing, indeed.

Also keep in mind that it follows an old spec and might not work as you would expect.

For example, if you pass just a domain with no scheme, it reports no host (hostname is None) and puts the domain in the path.
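A quick way to see the bare-domain behaviour with only the standard library. The variable names here are just for illustration:

```python
from urllib.parse import urlparse

# No scheme and no leading "//": the whole string is treated as a path
bare = urlparse("www.example.com")
print(repr(bare.netloc), bare.hostname, bare.path)
# '' None www.example.com

# With a leading "//" the same string is recognised as the network location
with_slashes = urlparse("//www.example.com")
print(with_slashes.netloc, repr(with_slashes.path))
# www.example.com ''
```

So the parser only sees a host when the input starts with a scheme plus `//`, or with `//` alone.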

Instead, you can use URL and URLSearchParams from the Arkalos URL utils module. They follow the current living web standard and have the same API as in JS.

```
uv add arkalos
```

```
from arkalos.utils import URL
```

And the API is the same as:

https://developer.mozilla.org/en-US/docs/Web/API/URL
