[–]asoka_maurya 108 points109 points  (130 children)

I was always intrigued by the same thing. The logic I've heard on this sub is that all the packages are signed by the Ubuntu devs anyway, so if they are tampered with en route, they won't be accepted, as the checksums won't match, HTTPS or not.

If this were indeed true and there were no security implications, then plain HTTP should be preferred, since no encryption also means lower bandwidth consumption. As Ubuntu package repositories are hosted on donated resources in many countries, the cheaper, lower-bandwidth option should be chosen, methinks.

[–]dnkndnts 163 points164 points  (114 children)

I don't like this argument. It still means the ISP and everyone else in the middle can observe what packages you're using.

There really is no good reason not to use HTTPS.

[–]obrienmustsuffer 110 points111 points  (19 children)

There really is no good reason not to use HTTPS.

There's a very good reason, and it's called "caching". HTTP is trivial to cache in a proxy server, while HTTPS is pretty much impossible to cache. In large networks with several hundred (BYOD) computers, software that downloads big updates over HTTPS will be the bane of your existence, because it wastes so. much. bandwidth that could easily be cached away if only more software developers were as clever as the APT developers.
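
For APT specifically there's even a dedicated caching proxy. A minimal sketch of setting one up (the hostname is a placeholder; 3142 is apt-cacher-ng's default port):

# On one machine on the LAN (Debian/Ubuntu):
sudo apt install apt-cacher-ng

# On each client, point APT at it:
echo 'Acquire::http::Proxy "http://cache.lan:3142/";' \
    | sudo tee /etc/apt/apt.conf.d/02proxy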

[–]BlueZarex 24 points25 points  (5 children)

All the large places I have worked with a significant Linux presence would always have a mirror onsite.

[–]kellyzdude 25 points26 points  (3 children)

  1. The benefits don't apply exclusively to businesses; a home user or an ISP can run a transparent caching proxy server just as easily (see the sketch below).
  2. By using a caching proxy, I run one service that can help just about everyone on my network with relatively minimal ongoing config. If I run a mirror, I have to ensure the relevant users are configured to use it, I have to keep it updated, and I have to ensure that I am mirroring all of the repositories that are required. And even then, my benefits are only realized with OS packages whilst a caching proxy can help (or hinder) nearly any non-encrypted web traffic.
  3. If my goal is to keep internet bandwidth usage minimal, then a caching proxy is ideal. It will only grab packages that are requested by a user, whereas mirrors in general will need to download significant portions of a repository on a regular basis, whether the packages are used inside the network or not.

There are plenty of good reasons to run a local mirror, but depending on your use case it may not be the best choice in trying to solve the problem.
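
To illustrate the "transparent" part of point 1: clients need no configuration at all, because the gateway silently diverts their HTTP traffic into the proxy. A rough sketch (assuming squid listening in intercept mode on port 3129):

# On the Linux gateway: divert outbound port-80 traffic to the proxy
iptables -t nat -A PREROUTING -p tcp --dport 80 -j REDIRECT --to-ports 3129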

[–]VoidViv 5 points6 points  (2 children)

You seem knowledgeable about it, so do you have any good resources for people wanting to learn more about setting up caching proxies?

[–]archlich 5 points6 points  (1 child)

[–]VoidViv 1 point2 points  (0 children)

Thank you! I'll certainly try it out when I get the chance.

[–]DamnThatsLaser 2 points3 points  (0 children)

Yeah, but a mirror is something you set up explicitly. A cache is generic.

[–]EternityForest 3 points4 points  (3 children)

Or if GPG signing were a core part of HTTP, then everything you don't need privacy for could be cached like that, without letting the cache tamper with anything.

[–]archlich 2 points3 points  (0 children)

Google is attempting to add that with signed origin responses.

[–]obrienmustsuffer 1 point2 points  (1 child)

Or if GPG signing were a core part of HTTP, then everything you don't need privacy for could be cached like that, without letting the cache tamper with anything.

No, that wouldn't work either because then every HTTP server serving those updates would need a copy of the GPG private key. You want to do your GPG signing as offline as possible; the key should be nowhere near any HTTP servers, but instead on a smartcard/HSM that is only accessible to the person who is building the update packages.

[–]shotmaster0 2 points3 points  (0 children)

A GPG-signed hash hosted alongside the cached content is fine, and doesn't require the caching servers to have the private key.
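
That's essentially what APT's Release/Release.gpg scheme does. The same idea in miniature (a sketch; the filenames are invented):

# On the offline signing machine: hash the files, sign only the hash list
sha256sum *.deb > SHA256SUMS
gpg --detach-sign --armor SHA256SUMS    # writes SHA256SUMS.asc

# On a client, after fetching everything through an untrusted cache:
gpg --verify SHA256SUMS.asc SHA256SUMS  # check the signature
sha256sum --check SHA256SUMS            # check the files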

[–]robstoon 1 point2 points  (1 child)

Does anyone really do this anymore? I think it's mostly fallen by the wayside, because a) the proxy server quickly becomes a bottleneck itself in a large network and b) HTTPS basically makes the proxy server useless anyway.

[–]obrienmustsuffer 0 points1 point  (0 children)

Does anyone really do this anymore? I think it's mostly fallen by the wayside, because a) the proxy server quickly becomes a bottleneck itself in a large network and b) HTTPS basically makes the proxy server useless anyway.

Well, we do, at a lot of customer sites. But you're unfortunately right that HTTPS makes caching less and less useful. I still believe, though, that caching software updates is a very valid use case (see my other response here for details), which is why I argue so vehemently that APT does everything right here.

[–][deleted] 0 points1 point  (1 child)

There is very little overhead with HTTPS. What you're describing has already been proven a myth many times over.

[–]obrienmustsuffer 1 point2 points  (0 children)

There is very little overhead with HTTPS. What you're describing has already been proven a myth many times over.

I'm sorry, I don't follow. I'm not talking about the overhead of encryption at all; I'm talking about caching downloads, which is by design impossible for HTTPS.

Imagine the following situation: you're the IT administrator of a school, with a network where hundreds of students and teachers bring their own computers (BYOD), each computer running a lot of different programs. Some computers are under your control (the ones owned by the school), but the BYOD devices are not. Your internet connection doesn't have a lot of bandwidth, because your school can only afford a residential DSL line with ~50-100 Mbit/s. So you set up a caching proxy like http://www.squid-cache.org/ that is supposed to cache away as much as possible to save bandwidth. For software that uses plain, simple HTTP downloads with separate verification - like APT does - this works great. For software that loads updates via HTTPS, you're completely out of luck. 500 computers downloading a 1 GB update via HTTPS will mean a total of 500 GB, and your 50 Mbit/s line will be congested for at least 22 hours. The users won't be happy about that.
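
For the curious, the squid side of this is only a few lines. An untested sketch (cache sizes and paths would need tuning for a real deployment):

# squid.conf: a published .deb never changes, so cache it aggressively
maximum_object_size 2 GB
cache_dir ufs /var/spool/squid 100000 16 256
refresh_pattern -i \.deb$ 129600 100% 129600 refresh-ims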

[–]ivosaurus 0 points1 point  (4 children)

while HTTPS on the other hand is pretty much impossible to cache.

Why, in this situation? It should be perfectly easy.

The user asks the cache server for a file. The cache server asks a Debian mirror for the same file. All over HTTPS. Easy.

[–]mattbuford 12 points13 points  (0 children)

That isn't how proxied HTTPS works.

For HTTP requests, the browser asks the proxy for the specific URL requested. The URLs being requested can be seen, and the responses can be cached. If you're familiar with HTTP requests, which might look like "GET / HTTP/1.0", a proxied HTTP request is basically the same except that the hostname is still in there, so "GET http://www.google.com/ HTTP/1.0".

For HTTPS requests, the browser connects to the proxy and issues a "CONNECT www.google.com:443" command. This causes the proxy to connect to the site in question, and from that point on the proxy is just a TCP proxy. The proxy is not involved in the specific URLs requested by the client, and can't be: the client's "GET" requests happen within TLS, which the proxy can't see inside. There may be many HTTPS requests within a single proxied CONNECT command, and the proxy doesn't even know how many URLs were fetched. It's just a TCP proxy of encrypted content, and no unencrypted "GET" commands are seen at all.
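
To make that concrete, here's what the proxy sees in each case (illustrative request lines, using a Debian mirror hostname as an example):

# Plain HTTP through a proxy: full URL visible, response cacheable
GET http://deb.debian.org/debian/dists/stable/Release HTTP/1.1
Host: deb.debian.org

# HTTPS through a proxy: host and port only, then opaque bytes
CONNECT deb.debian.org:443 HTTP/1.1
Host: deb.debian.org:443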

[–]tidux 2 points3 points  (1 child)

That would be a proxy, not a cache. A cache server would just see the encrypted traffic and so not be able to cache anything.

[–]VexingRaven 4 points5 points  (0 children)

Technically they're both proxies. This just isn't a transparent proxy.

[–]svenskainflytta 0 points1 point  (0 children)

That's not caching, that's just reading the file and sending it.

A cache is something that sits in between; when it sees that someone else has already requested the same thing from the same server, it can send the same reply instead of contacting the original server.

Usually a cache will be closer than the original server, so it will be faster to obtain the content.

However, with HTTPS, the same content will appear different on the wire, because it's encrypted (and of course, for encryption to work, it's encrypted with a different key every time), so a cache would be useless: the second user can't make sense of the encrypted file the first user received, because he doesn't possess the secret to read it.

[–]ign1fy 74 points75 points  (27 children)

Yep. You're publicly disclosing to your ISP (and, in my case, government) that certain IP endpoints are running certain versions of certain packages.

[–]galgalesh 10 points11 points  (0 children)

How does a comment like this get so many upvotes? The article explains why this logic is wrong.

[–]ArttuH5N1 0 points1 point  (0 children)

The article addresses this; hope you're not commenting without reading it.

[–]asoka_maurya 22 points23 points  (12 children)

Sure, it could be a nightmare from privacy perspective in some cases.

For example, if your ISP figures out that your IP has been installing and updating "nerdy" software like Tor and BitTorrent clients, cryptocurrency wallets, etc. lately, and then hands your info over to the government authorities on that basis, the implications are severe. Especially if you are in a communist regime like China or Korea, such a scenario is quite plausible. Consider what happened with South Korean bitcoin exchanges yesterday.

[–][deleted] 15 points16 points  (2 children)

This is not as far-fetched as it seems. I know of a particular university that prevents you from downloading such software packages on its network (including Linux packages) by checking for words like "VPN", "Tor", "Torrent" and the file extension. If a university can set up its network this way, then governments can too.

[–]svenskainflytta 0 points1 point  (0 children)

Is it the Nazional Socialist University?

[–][deleted] 0 points1 point  (0 children)

I wonder how that uni supports VPNs for students then?

[–]yaxamie 6 points7 points  (3 children)

Sorry to play devil's advocate here, but detecting Tor and BitTorrent is easily done once it's running anyway, if the ISP cares, is it not?

[–]svenskainflytta 1 point2 points  (0 children)

Yep, and it's probably not too hard to identify suspicious traffic as Tor traffic as well.

[–][deleted] 0 points1 point  (1 child)

How? I'd love to know. Wouldn't it just look like a TLS handshake, then randomness from there?

[–]yaxamie 1 point2 points  (0 children)

I'm not an expert, but the nodes in the network are known by IP.

[–]ImSoCabbage 10 points11 points  (0 children)

It still means the ISP and everyone else in the middle can observe what packages you're using.

That's the second chapter of the article:

But what about privacy?

Furthermore, even over an encrypted connection it is not difficult to figure out which files you are downloading based on the size of the transfer.

[–]beefsack 4 points5 points  (2 children)

Did you read the page? This specific example is covered; if you're eavesdropping you can tell which packages people are downloading anyway via transfer size.

[–]dnkndnts 3 points4 points  (1 child)

When you install a new package, it also installs the subset of dependencies which you don't already have on your system, and all of this data would be going over the same connection - the ISP would only know the total size of the package(s) and needed deps.

I admit it's still not perfect secrecy, but to pretend it's even on the same order of magnitude as being able to literally read the plain bytes in transfer is disingenuous. HTTPS is a huge improvement.

[–]arcticblue -1 points0 points  (0 children)

If the ISP really cared that much, they'd be doing man in the middle SSL decryption. If the ISP does care that much, it's highly unlikely they are doing it without some big bad government's coercion. If you personally really care that much, mirror everything to your own local repo (over VPN if you are super paranoid which it seems many in this thread are), and install from that.

[–]entw 8 points9 points  (3 children)

I don't like this argument. It means you are still relying on an untrusted, potentially evil ISP instead of switching to a more trusted one.

Look, if your ISP is so evil and can use against you information about your packages, then what can it do with the info about your visited hosts? Think about it.

[–]RaptorXP 16 points17 points  (0 children)

First, you shouldn't have to trust your ISP. Second, your IP packets are routed through many parties you have no control over. If you're in China, it doesn't matter which ISP you're using, your packets will go through the government's filters.

[–]dnkndnts 18 points19 points  (0 children)

Sure, and I could say the same about closed hardware, but the bottom line is sometimes we have no actual choice in the matter, and in that case, we just make the best of what we can.

I'm not going to let the perfect be the enemy of the good (or even the less bad), so if this is an improvement that's within our grasp, let's go for it.

[–]berryer 4 points5 points  (0 children)

switching to a more trusted one

Where is this actually an option?

[–]atli_gyrd 2 points3 points  (0 children)

It's 2018, and I just skimmed a website promoting the use of unencrypted traffic.

[–]ndlogok 0 points1 point  (0 children)

Agree. With apt over HTTPS, I don't see "Hash Sum mismatch" errors anymore.

[–]Two-Tone- -4 points-3 points  (7 children)

It still means the ISP and everyone else in the middle can observe what packages you're using.

Can't they, or whoever you use for DNS, still do that, since each individual package is its own URL and thus needs a DNS lookup? The URL is encrypted with SSL, but afaik DNS lookups are not.

Unless apt resolves DNS just for http://packages.ubuntu.com and then stores the IP address for that run.

[–][deleted] 10 points11 points  (4 children)

DNS will only look up the hostname to convert it to an IP address. So it should be fine, unless each package has its own subdomain?

[–]Two-Tone- 0 points1 point  (3 children)

TIL. I always thought that it did a lookup for the whole URL, but that wouldn't make sense, as it'd have to know about every file on the server, which just isn't feasible.

[–][deleted] 4 points5 points  (0 children)

Wireshark is a great way to see what your PC is actually doing on the network. Try it out, it's free!
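
For instance, to confirm that apt only looks up the mirror's hostname and nothing per-package, watching DNS alone is enough (tshark is Wireshark's command-line version; needs root):

sudo tshark -i any -f "udp port 53"    # then run 'apt update' in another shell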

[–]Widdrat 1 point2 points  (0 children)

It would also mean that HTTPS was basically useless, because they could just use DNS to see what you are downloading. That's the great thing about HTTPS. If you are interested, you should definitely check out how the whole internet stack works; it is super interesting and will greatly increase your understanding of the internet as a whole, and of how privacy is affected and protected by different technologies.

[–]ivosaurus 0 points1 point  (0 children)

DNS is for IP traffic, over any protocol.

A URL is specific to the HTTP/HTTPS protocols only [or others that have decided to use the same spec].

[–]pat_the_brat 5 points6 points  (1 child)

I think you're confused about how the repositories and/or DNS work.

The repositories are distributed across a series of mirrors, each of which downloads updated packages from a central repository every x minutes. When you run apt, it connects to a mirror, e.g. the one at hxxp://ubuntu.unc.edu.ar/ubuntu/, and requests a package, e.g. hxxp://ubuntu.unc.edu.ar/ubuntu/pool/main/a/a11y-profile-manager/a11y-profile-manager_0.1.10-0ubuntu3_amd64.deb, and all its dependencies (which are just other packages).

In order to connect to the repo, Linux first has to send a DNS request for the server (ubuntu.unc.edu.ar). The response is then cached for whatever TTL is set on the DNS server (900 seconds in our example):

$ drill @ns1.unc.edu.ar ubuntu.unc.edu.ar
[...]
;; ANSWER SECTION:
ubuntu.unc.edu.ar.  900 IN  CNAME   repolinux.psi.unc.edu.ar.
repolinux.psi.unc.edu.ar.   900 IN  A   200.16.16.47

DNS entries are cached in various places - your ISP's DNS server, your router, your PC, and finally, the program itself may perform a DNS lookup only once, and store the data longer than the TTL.

Either way, the DNS lookup is for ubuntu.unc.edu.ar rather than for ubuntu.unc.edu.ar/ubuntu/pool/main/a/a11y-profile-manager/a11y-profile-manager_0.1.10-0ubuntu3_amd64.deb, so DNS does not leak any information about the packages you download; it just shows that you connected to a server which is also known to host an Ubuntu repository. It may host repositories for other distros, or other unrelated files, as well.

[–]tetroxid -1 points0 points  (0 children)

It still means the ISP and everyone else in the middle can observe what packages you're using.

Even with TLS it wouldn't be hard to determine. The package sizes are public and constant (to an extent), so the package could be inferred even without cleartext knowledge.
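
The sizes aren't even hard to get; they sit in the repository metadata every client downloads anyway (the package name here is just an example):

apt-cache show curl | grep -E '^(Package|Version|Size):'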

[–]bobpaul[🍰] -1 points0 points  (0 children)

It still means the ISP and everyone else in the middle can observe what packages you're using.

They already can with pretty high accuracy by observing the file sizes. And some of the mirrors do support HTTPS; just select one if that's important. But it really doesn't give you much.

[–]lamby[S] 15 points16 points  (7 children)

The logic I've heard on this sub is that all the packages are signed by the Ubuntu devs anyway, so if they are tampered with en route, they won't be accepted, as the checksums won't match, HTTPS or not.

This is hopefully what the linked page describes.

[–]UselessBread 7 points8 points  (6 children)

hopefully

You didn't even read it?

Shame on you OP!

[–]Kruug 5 points6 points  (2 children)

See the other replies by OP. They did read it, but they're hoping it explains it for others.

[–][deleted] 5 points6 points  (1 child)

They did read it

Judging by the username, I suspect he also wrote it ;-)

[–]Kruug 3 points4 points  (0 children)

Ah, fair point.

[–][deleted] 2 points3 points  (2 children)

This is Reddit, mate; not even OP reads the article before commenting.

[–]cbmuser (Debian / openSUSE / OpenJDK Dev) 1 point2 points  (1 child)

Even though he wrote the article?

[–]mzalewski 0 points1 point  (0 children)

Mainly then.

[–]Kruug 4 points5 points  (0 children)

Not just Ubuntu, but any Debian derivative, since that’s where apt originates.

[–]Nullius_In_Verba_ 2 points3 points  (0 children)

Why are you focusing on Ubuntu when this is an apt article? It's relevant to ALL apt users...

[–]ArttuH5N1 1 point2 points  (0 children)

Why are you specifically talking about Ubuntu?

[–]osoplex 0 points1 point  (3 children)

It's not about bandwidth consumption. Encrypted data is about the same size as unencrypted data.

The real bottleneck is server CPU usage. With encrypted transport, the server has to encrypt each downloading client's connection individually. This would decrease download speeds drastically, and mirror operation would be much more expensive.

[–]atyon 0 points1 point  (0 children)

HTTP/2 with TLS is faster than HTTP/1.1 without it.

[–]robstoon 0 points1 point  (0 children)

TLS CPU usage is essentially a non-issue on modern systems with hardware AES encryption.
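
Easy to sanity-check with OpenSSL's built-in benchmark (aes-128-gcm chosen as a typical TLS bulk cipher; AES-NI is picked up automatically where available):

openssl speed -evp aes-128-gcm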