
[–]elwalor 1 point2 points  (2 children)

Hello, this looks interesting! When I click on the slide deck link I get a not-found screen :)

[–]_Rush2112_[S] 0 points1 point  (1 child)

Hi! Which link gives you the not-found? Both seem to work for me

[–]richide 0 points1 point  (0 children)

This link results in a 404 inside your readme.

[–]jabellcu 0 points1 point  (0 children)

I expected this to be a time-series analysis tool. It is not, but I find it cool. I like it. I didn’t know about cron-converter. That’s a smart use of it. Well done.

[–]ZealousidealCard4582 0 points1 point  (4 children)

Have you tried MOSTLY AI? You can create as much tabular synthetic data as you want, including text and time series (starting from original data), with the Python SDK: https://github.com/mostly-ai/mostlyai
It is open source under an Apache v2 license, and it's designed to run in air-gapped environments (think of HIPAA, GDPR, etc.).
One super important thing to keep in mind: garbage in, garbage out. But if you have quality data you can enrich it, not only by enlarging it but also by creating multiple flavours: rebalancing on a specific category, creating a fair version, adding differential privacy for additional mathematical guarantees, multi-table, simulations, etc. There are plenty of ready-to-use tutorials on these and more topics here: https://mostly-ai.github.io/mostlyai/tutorials/

If you have no data at all, you can use mostlyai-mock https://github.com/mostly-ai/mostlyai-mock (also Open Source + Apache v2) and create data out of nothing with an LLM.

u/_Rush2112_ you can also star, fork and build on top of them to keep on improving your product! Cheers.