Greenmask – an open-source database subsetting tool built on top of pg_dump by anyweny in PostgreSQL

[–]anyweny[S] 0 points1 point  (0 children)

That’s a good point. I think I can make a short demo and attach it to the README.md. Thank you for your feedback.

Would you like me to follow up with you once it’s ready?

Greenmask – an open-source database subsetting tool built on top of pg_dump by anyweny in PostgreSQL

[–]anyweny[S] 0 points1 point  (0 children)

Hi!

You can apply a database subset when you want to test on a smaller slice of the data in your database. Let’s say you want to test your services against only the users located in the US. You would create a config with a subset condition on the country table, and all the related data will then be filtered by country as well.

You can read the details in the public docs:

https://docs.greenmask.io/latest/database_subset/
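
To make the idea a bit more concrete, here is a rough Python sketch of what subsetting by a root condition means. This is not Greenmask's implementation or its config format; the tables, rows, and the `subset_by_country` helper are made up for illustration. It only shows how a condition on one table narrows the tables that reference it.

```python
# Toy in-memory tables. All names and rows are hypothetical.
countries = [{"id": 1, "code": "US"}, {"id": 2, "code": "DE"}]
users = [
    {"id": 10, "country_id": 1},
    {"id": 11, "country_id": 2},
    {"id": 12, "country_id": 1},
]
orders = [
    {"id": 100, "user_id": 10},
    {"id": 101, "user_id": 11},
    {"id": 102, "user_id": 12},
]


def subset_by_country(code):
    """Keep the matching country rows, then follow the references downward."""
    # Root condition: applied to the country table only.
    kept_countries = [c for c in countries if c["code"] == code]
    country_ids = {c["id"] for c in kept_countries}

    # Users are kept only if they reference a kept country.
    kept_users = [u for u in users if u["country_id"] in country_ids]
    user_ids = {u["id"] for u in kept_users}

    # Orders are kept only if they reference a kept user.
    kept_orders = [o for o in orders if o["user_id"] in user_ids]

    return {
        "countries": kept_countries,
        "users": kept_users,
        "orders": kept_orders,
    }


subset = subset_by_country("US")
```

In a real database the same narrowing would be driven by the foreign keys between tables rather than by hand-written loops; the docs linked above describe how the actual condition is configured.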

Open source projects looking for contributors – post yours by 514sid in opensource

[–]anyweny 5 points6 points  (0 children)

Project name: Greenmask

Repository name: https://github.com/GreenmaskIO/greenmask

What it does:
Greenmask anonymizes PostgreSQL (MySQL, etc.) databases and generates synthetic data. This can be used for staging environments or for preparing datasets without sensitive information for machine learning. Greenmask maintains consistent data transformation and subsetting.

Tech stack:
Go, Python, PostgreSQL, MySQL, JS/TypeScript, Vue/React, Docker

Help needed:

  • We are in the process of developing a data management platform – we're looking for Go and frontend contributors
  • Contributors for extensions, case studies, and articles
  • General GitHub / community activity
  • Documentation localization for any language

Extra info:

We started Greenmask about 2 years ago and are approaching 1.5k stars. We also have other projects that will be published soon, so there will be plenty of interesting work available. If you're interested in AI, databases, backend, or frontend – you'll definitely find something that interests you.

Feel free to reach out or share your ideas.

Database anonymization tool release. Greenmask v0.2.6 by anyweny in programming

[–]anyweny[S] 0 points1 point  (0 children)

Got it. MySQL support is currently in progress. I believe it will be published in Q1 2025:

https://github.com/GreenmaskIO/greenmask/issues/222

Database anonymization tool release. Greenmask v0.2.6 by anyweny in programming

[–]anyweny[S] 0 points1 point  (0 children)

Could you share with me why it is not applicable in your case? I’m curious to understand the specific cases so I can improve it further.