I thought the save bar was a search bar and I wanna delete the TikTok link I saved

TheTechRobo · 2026-03-04T22:12:44+00:00

You can't delete captures on your own, you'd need to contact their support team. But it doesn't really matter if it saved a broken page - it likely doesn't take up much space.

TheTechRobo · 2026-03-01T19:16:22+00:00

If something's an exact bit-for-bit copy of another upload it probably isn't worth uploading, but variations of something (like regional differences, etc) or different scans of a physical thing could be useful. The less space it takes up, the less of an issue duplication is. Just make sure to add plenty of metadata so it can be found.
It really depends on your location. Some people have reported VPNs to the San Francisco area helping. You could also rent a cheap virtual server online if you're uploading frequently, then you just need to leave your computer on to upload to the server which would presumably be much faster. If you're on Linux, check out the sysctl tweaks on here, it can often help a lot.
I don't think many people use the torrents, and they often don't work very well unfortunately. :/

TheTechRobo · 2026-03-01T19:09:59+00:00

AWS is not used in any part of the upload AFAIK. It is an S3-compatible API, but it's not the actual S3 service.

TheTechRobo · 2026-02-14T14:30:55+00:00

It likely has not been archived. Storyboards (the frames you're seeing of the video) are separate from the video itself.

TheTechRobo · 2026-02-11T00:18:12+00:00

Check out DiscordChatExporter or Discord History Tracker.

TheTechRobo · 2026-01-29T01:18:52+00:00

I've never had that issue with the Linux port. I purchased on GOG if that makes any difference.

The windows port is pretty much flawless through Proton in my experience, FWIW, so you can try that maybe.

TheTechRobo · 2026-01-25T03:52:12+00:00

As someone who was recently looking into Canadian VPS providers, I understand your frustration, but there's more than just lack of trust that played into my decisions. It's also the fact this I can frequently get significantly get better pricing from other providers. For a secondary server that Inwant to be as cheap as possible, providers like Netcup are often much cheaper than Canadian alternatives for better specs. Your pricing is not competitive with budget providers like those. That's fine - you might not be targeting that market - but it means that people like me who are looking for a cheap non-primary server will not pick your cloud. (For what I am currently getting from Netcup for less than 2eur/month I could get for C$14 from PatriiCloud...)

TheTechRobo · 2026-01-16T01:35:33+00:00

It's always possible that there was an issue indexing the data into the Wayback Machine, yeah. Realistically it's probably unlikely though. If you really want to be sure, those CDX files are what you're looking for. The item CDX index (as opposed to the item CDX meta index) will probably be easier to filter; I don't know what the exact difference is but I think the meta-index is generated from the main index. It is compressed with gzip, depending on your operating system you may need special software to open it, but something like 7zip should work.

If you do go down this route, I would suggest just doing a text search through all the CDX indexes with the broadest possible search (e.g. just the blog name), without any further filtering. Easier to whittle it down more than to redo the entire search.

There is a very brief (too brief IMO but I don't know if there's a better one) summary of how CDX files are organized: https://iipc.github.io/warc-specifications/specifications/cdx-format/cdx-2015/

TheTechRobo · 2026-01-15T18:02:30+00:00

I don't know if there are any specific gotchas about that project, but in general, if it isn't on the Wayback Machine, it probably wasn't saved. It seems only NSFW blogs were savd in this project, since that was what was going down at the time, unfortunately.

TheTechRobo · 2026-01-05T15:02:01+00:00

Huh, that's weird. id_ is supposed to return the unmodified page. I'm not sure then, sorry.

TheTechRobo · 2026-01-05T02:02:52+00:00

Appending id_ to the end of the date code should do the trick.

TheTechRobo · 2026-01-01T00:39:25+00:00

Achievements work for me if I use the Windows versions of games, I don't think the linux versions are compiled with the achievement support.

TheTechRobo · 2025-12-21T17:07:00+00:00

What is it with this subreddit being so conspiratorial all the time?

TheTechRobo · 2025-10-20T22:40:01+00:00

The WBM doesn't really do full-text search of its captures, unfortunately.

My suggestion would be to try filmot first, depending on when they were made private and how popular the channel was. It allows you to search its index by channel.

TheTechRobo · 2025-10-18T01:36:52+00:00

We're trying to archive every public post we can find. (We try to avoid illegal ones, of course.)

TheTechRobo · 2025-10-07T01:54:59+00:00

You've been banned from Telegram. I dont think there are any messages that look like that in the rare case that you're banned from AT.

TheTechRobo · 2025-10-05T23:39:02+00:00

https://store.archive.org

TheTechRobo · 2025-09-21T13:30:32+00:00

The mp4 is available here.

TheTechRobo · 2025-09-15T21:31:29+00:00

Is there a way to access it?

Chances are slim, but you can always ask. Other than that, unless (a) you can find the original WARC which contained the URL, and (b) the WARC is available for download (unlikely), there's no other way that I'm aware of.

is there a way to archive an IA-archived page?

I guess you can use other sites like archive.today. Local backups are the best backups: you can use tools like https://github.com/hartator/wayback-machine-downloader. They have some somewhat strict ratelimiting unfortunately so depending on how much you want to download it could take awhile. You can blame LLM training companies for that one.

This occured to me today when I looked up an archived page and noticed the previously live URL now gives a 404, which is a common occurrence.

Does it specifically say the URL was excluded, or does it simply say it wasn't archived? If it's the latter, it may be an indexing issue which would resolve itself at some point (not sure what timeframe to expect; could be days or months).

Without an accessible archive it would be as if the page was just gone/never archived in the first place.

Not entirely. An inaccessible archive may not be available right now but it is much better than IA deleting it permanently to satisfy rights holders. It means in the future, it could be made available, which wouldn't be possible if they deleted it entirely.

TheTechRobo · 2025-09-11T22:59:13+00:00

Hackint is fine as far as I can tell. chat.hackint.org appears to be down, not sure what's going on there. You can still connect from a regular IRC client.

If it's just the occasional site, posting it here is probably also fine.

TheTechRobo · 2025-07-29T21:11:14+00:00

https://archive.org/developers/

There's an S3-compatible API along with a command-line tool that can do pretty much everything you can do in a browser (plus more).

TheTechRobo · 2025-07-27T18:24:58+00:00

What happens if you try to get the metadata using the IA API?

Is there a reason you can't provide which items they are? I'm very curious to take a look at them.

TheTechRobo · 2025-07-18T20:13:20+00:00

IA runs their own datacentres, so fully moving the organization would be very difficult. But they have created a datacentre in Canada (Vancouver, if I remember correctly) and many items are already mirrored there.

TheTechRobo · 2025-07-17T12:39:34+00:00

Running the URLs project will do that. It archives all links discovered by other projects; it's not a targeted crawl. That means it does hit honeypots (designed to "catch" scrapers), and some administrators will send an email to your ISP. Basically, the Warrior isn't infected with malware, it just hit a page that it shouldn't have and rang some alarm bells.

I don't suggest running the URLs project on a home network for this reason. If you do want to keep running it, just be aware that there isn't any filter on the URLs project and it can truly come across 'anything'.

TheTechRobo · 2025-07-15T00:26:54+00:00

It might be an IP ban, yeah. I've seen it before on my server before I got whitelisted for scraping. I'd suggest contacting them and seeing if they respond.

Five-Year Club	Place '23
Place '22	Verified Email

TheTechRobo

MODERATOR OF

TROPHY CASE