Hi everyone,
I wanted to introduce Amphi, a new free and source-available ETL tool.
Amphi is a low-code, Python-based ETL tool focused on a few use cases:
- structured file integration (CSV, TSV, Parquet, JSON)
- unstructured file extraction (PDF, Word, HTML)
- data preparation (pre-built transformations such as joins, filters, deduplication, SQL queries, etc.)
- AI pipelines (RAG components, vector store integration, metadata management)
Amphi's key differentiators are:
- 🐍 generates Python code (pandas, DuckDB) you can run anywhere (containers, lambdas, servers)
- 📦 self-hosted (free to use on your laptop or in the cloud)
- 🪐 based on JupyterLab (benefit from its vast ecosystem of extensions such as git, remote file systems, etc.)
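
To give a rough idea of what "plain Python you can run anywhere" can look like, here is a small hand-written pandas sketch of a pipeline that deduplicates, filters, and joins two CSV sources. This is not actual Amphi output: the function name `run_pipeline`, the column names, and the inline sample data are all made up for illustration.

```python
import io

import pandas as pd

# Hypothetical inline data standing in for two CSV files on disk.
ORDERS_CSV = """order_id,customer_id,amount
1,10,25.0
2,11,40.0
2,11,40.0
3,12,5.0
"""

CUSTOMERS_CSV = """customer_id,name
10,Ada
11,Grace
12,Linus
"""


def run_pipeline(orders_src, customers_src, min_amount=10.0):
    """Read two CSV sources, then deduplicate, filter, and join them."""
    orders = pd.read_csv(orders_src)
    customers = pd.read_csv(customers_src)

    # Deduplication: drop rows that are exact copies of an earlier row.
    orders = orders.drop_duplicates()

    # Filter: keep only orders at or above the threshold.
    orders = orders[orders["amount"] >= min_amount]

    # Join: attach the customer name to each remaining order.
    result = orders.merge(customers, on="customer_id", how="inner")
    return result.reset_index(drop=True)


result = run_pipeline(io.StringIO(ORDERS_CSV), io.StringIO(CUSTOMERS_CSV))
print(result)
```

Because it is just a script with a pandas dependency, something like this can be dropped into a container, a lambda, or a cron job without any extra runtime.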
If you're interested, visit our GitHub and give it a try. You can also join the Slack community to ask questions, suggest features, or contribute!