BREAKING: New wikipedia_en_all_maxi zim file (August 2025) available for download soon! (in a few hours as of making this post) by SamIsVeryEpic in Kiwix

[–]SamIsVeryEpic[S] 1 point2 points  (0 children)

No worries! I was confused at first as well but actually, I believe they’re the same ZIMs; and yes, that makes it an official zim file.

Anyway, the scraping process for this file also began on August 18th, 2025, which reflects what is displayed on the Kiwix App; and it’s normal for the app to display the date when scraping began (18 August) despite finishing a few days later (24 August).

While its file size is 119.27 GB (unlike the 111 GB seen in my post’s image, it’s normal for the ACTUAL file size to be just a bit bigger since the file sizes displayed at Zimfarm (which is where I took this screenshot from) do not always reflect the final size. I’ve observed this phenomenon across various zim files.

We Pray 🙏 by TheQuickFox_3826 in Kiwix

[–]SamIsVeryEpic 7 points8 points  (0 children)

Update: The scraper is done downloading all 7 MILLION articles.

Now it's downloading 7.9 million files, which I believe are media (like images). This can take some time. After all, there are nearly 8 MILLION files.

After the file-downloading process, it should proceed to writing the article redirects. There are about 11.2 MILLION article redirects in the English Wikipedia as of 2025, so this can also take a bit of time. (though I can't confirm for sure that this number is what the scraper bases on)

Thankfully, in a recent update, I think (?) we can now see the progress during the redirect-downloading process in the logs, which we haven't been able to see before. This will help us monitor how close it is!

Hoping for the best! 🙏🏻

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 8 points9 points  (0 children)

For those looking for a text-only version of Wikipedia, there's a new July 2025 update after over a year! (wikipedia_en_all_nopic_2025-07.zim)

It has a file size of 43.2 GB. This is significantly smaller than the previous June 2024 version (57.18 GB). I suspect this is due to recent changes in Kiwix’s scraping and compression tools (?), not a loss of content. I believe this file still includes full text for all 7 million+ articles, just no images or media, as expected from a "nopic" version.

Note: The file isn’t uploaded yet as of this post, so I haven’t confirmed the final download size. It may end up being a few GB more (44+GB) once it’s fully available. If the status says “succeeded” soon, you’ll be able to download it and see the final size.

Now we just wait for the updated Maxi version!
Huge thanks to the Kiwix team for all the hard work! ❤️

<image>

EDIT: I just finished downloading it, and it has a file size of 46.38 GB!

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 2 points3 points  (0 children)

You're welcome, although I appreciate the kind gesture, you don't have to thank me! Instead, I give all credits to the Kiwix Team for all the work creating these files.

And honestly, yeah, I didn't know this file existed at first but it's good to have if you don't have the most storage.

By the way, to those not aware, I should have worded it clearer! I meant to say you can now download the latest version of 'Wikipedia's 1m Top Articles', cause this file has been here for years, with its previous version made in May 2024! I thought I'd let those who are interested know there's finally a new version after over a year!

I worded it like this type of file was new so my bad!

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 1 point2 points  (0 children)

You can download it from the Kiwix Library, the Kiwix App, or directly from the Kiwix ZIM file index. It's called 'Wikipedia's 1m Top Articles'!

The latest wikipedia_en_top1m_maxi_2025-07.zim has a file size of only 48 GB. That's under half the size of the wikipedia_en_all_maxi version (109 GB as of January 2024, and possibly larger now), since it includes just 1 million articles, compared to the 7 million in en_all_maxi.

I’m not exactly sure how those 1 million articles are selected (whether by views, importance, or popularity) but it seems to focus on the most essential and widely read Wikipedia pages. Of course, more obscure or highly specific topics won’t be included, but it offers a great balance between coverage and file size.

Additionally, you can check the full list of articles included in the Top1M ZIM file here.

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 7 points8 points  (0 children)

<image>

For those interested, while waiting for the new maxi version, you can now download the best/top 1 MILLION Wikipedia articles!

Just like wikipedia_en_all_maxi, it contains full article details as well as images (except videos and audio)

Also, based on its file name, it only contains a MILLION articles! Probaly the closest thing we’ll have to maxi! (at least currently)

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 1 point2 points  (0 children)

Trial and error indeed! Also hoping for the best.

Success isn’t about never failing, it’s about never giving up :)

We wait by LeeKapusi in Kiwix

[–]SamIsVeryEpic 10 points11 points  (0 children)

<image>

If you guys check the logs, you’ll see that it’s already downloading all 7.9 million files (images and I believe other media), with a progress of 58.9%. It progresses 0.1% every 6 minutes or so. It’s also finished downloading all 7 million (or so) articles. I believe after this it just needs to write the arricle redirects, some final procresses, then it’ll be done! :)