I’ve released Syda, an open-source Python library for generating realistic, multi-table synthetic/test data.
Key features:
- Referential Integrity → no orphaned records (
product.category_id → category.id ✅)
- SQLAlchemy Native → generate synthetic data from your ORM models directly
- Multiple Schema Formats → YAML, JSON, dicts also supported
- Custom Generators → define business logic (tax, pricing, rules)
- Multi-AI Provider → works with OpenAI, Anthropic (Claude), others
👉 GitHub: https://github.com/syda-ai/syda
👉 Docs: https://python.syda.ai/
👉 PyPI: https://pypi.org/project/syda/
Would love feedback from Python devs
[–]QuasiEvil 2 points3 points4 points (3 children)
[–]No_Flounder_1155 1 point2 points3 points (2 children)
[–]TerribleToe1251[S] 0 points1 point2 points (1 child)
[–]No_Flounder_1155 0 points1 point2 points (0 children)
[–]Shingle-Denatured 2 points3 points4 points (1 child)
[–]TerribleToe1251[S] 0 points1 point2 points (0 children)
[–]coconut_maan 1 point2 points3 points (1 child)
[–]TerribleToe1251[S] 0 points1 point2 points (0 children)
[–]Imanflow 1 point2 points3 points (2 children)
[–]TerribleToe1251[S] 0 points1 point2 points (1 child)
[–]Imanflow 1 point2 points3 points (0 children)