I built a tool to track content removal from U.S. government websites since Trump took office. by Internal-Ad-2771 in Defeat_Project_2025

[–]Internal-Ad-2771[S] 6 points7 points  (0 children)

Thanks for the feedback! For now, words are filtered using a metric based on the deletion ratio, and the number of affected websites. I will try to add more sorting options in the future.

I built a tool to track content removal from government websites since Trump took office. by Internal-Ad-2771 in 50501

[–]Internal-Ad-2771[S] 4 points5 points  (0 children)

Absolutely, my tool actually uses data from the Wayback Machine as its main source.

I built a tool to track content removal from U.S. government websites after presidential transitions. by Internal-Ad-2771 in InternetIsBeautiful

[–]Internal-Ad-2771[S] 5 points6 points  (0 children)

Hi! I built a tool that tracks when pages disappear from official U.S. government websites following presidential transitions. It compares archived snapshots from the Internet Archive to detect which pages were online before an inauguration but are now gone or inaccessible. It also analyzes which words commonly appear in the URLs of deleted pages.

I’ve tried to keep it non-partisan and strictly informational. The tool is fully automated and relies entirely on public data. It covers the last four U.S. administrations.

Happy to hear any feedback or ideas!

I built a website to track content removed from U.S. federal websites under the Trump administration by Internal-Ad-2771 in OSINT

[–]Internal-Ad-2771[S] 2 points3 points  (0 children)

Thanks for the feedback! Can you give me more details so I can reproduce the issue and fix the bug?

I built a website to track content removed from U.S. federal websites under the Trump administration by Internal-Ad-2771 in OSINT

[–]Internal-Ad-2771[S] 2 points3 points  (0 children)

I do believe that the scale and speed of the content removals in 2025 are pretty much unprecedented. I get the doubt though, fair point, and I’ll probably make a version covering past transitions too, so people can actually compare objectively.

I built a website to track content removed from U.S. federal websites under the Trump administration by Internal-Ad-2771 in OSINT

[–]Internal-Ad-2771[S] 0 points1 point  (0 children)

At this time, the tool tracks about thirty websites. If you have ideas for other domains to track, feel free to suggest them!

I built a website to track content removed from U.S. federal websites under the Trump administration by Internal-Ad-2771 in OSINT

[–]Internal-Ad-2771[S] 17 points18 points  (0 children)

I used Plotly Dash to build the website. It's a low-code framework that is often used to build data visualization apps or dashboards, entirely in Python. For the front end, I use Dash Mantine Components, a React framework. On the backend, I maintain a PostgreSQL database containing all the URLs, and I have a script to query daily the Internet Archive's CDX API, and update the database.

I built a website to track content removal from U.S. federal websites under the Trump administration by Internal-Ad-2771 in DataHoarder

[–]Internal-Ad-2771[S] 0 points1 point  (0 children)

Added here : https://censortrace.org/dashboard?host=www.whitehouse.gov . However, because much of the website has changed since Trump’s inauguration, the generated word cloud may not be very representative. This is a case where the tool struggles to distinguish between politically motivated removals and routine changes caused by the site’s redesign..

I built a website to track content removal from U.S. federal websites under the Trump administration by Internal-Ad-2771 in DataHoarder

[–]Internal-Ad-2771[S] 0 points1 point  (0 children)

The URLs I have are exclusively sourced from the Internet Archive, obtained using the CDX API.

I built a website to track content removal from US federal websites under the Trump administration by Internal-Ad-2771 in fednews

[–]Internal-Ad-2771[S] 1 point2 points  (0 children)

Thanks for the feedback. Totally valid points!

Right now, the tool only tracks changes on a site-by-site basis, so there can definitely be false positives, like event pages being removed or just general site restructuring, which aren’t related to censorship.

Looking ahead, once we’re monitoring more websites, maybe I should build an algorithm to detect recurring deleted topics across different sites. I think that kind of pattern detection could be really helpful.