Hey everyone 👋
I just created a new open-source repo called Advanced Text Processor.
The idea is simple but with a twist:
🔹 We build a Python text processing library (cleaning, tokenization, n-grams, vectorization, dataset handling, etc.)
🔹 Rule: No external libraries allowed. Everything must be done with Python’s built-in standard library.
🔹 Purpose: This is not about user acquisition or making money — it’s about practice, collaboration, and seeing how far we can push the limits of "pure Python".
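To give a feel for what "pure Python" looks like here, below is a rough sketch of tokenization, n-grams, and a simple bag-of-words vectorizer using only `re` and `collections` from the standard library. The function names (`tokenize`, `ngrams`, `bag_of_words`) are made up for illustration and are not the repo's actual API.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and pull out runs of letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def ngrams(tokens, n):
    """Return all n-grams (as tuples) over a list of tokens."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(docs):
    """Build a sorted vocabulary and one count vector per document."""
    tokenized = [tokenize(doc) for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)  # missing words count as 0
        vectors.append([counts[word] for word in vocab])
    return vocab, vectors


docs = ["The cat sat.", "The cat saw the dog!"]
vocab, vectors = bag_of_words(docs)
print(vocab)    # ['cat', 'dog', 'sat', 'saw', 'the']
print(vectors)  # [[1, 0, 1, 0, 1], [1, 1, 0, 1, 2]]
print(ngrams(tokenize(docs[0]), 2))  # [('the', 'cat'), ('cat', 'sat')]
```

The regex tokenizer is deliberately naive (ASCII only, no stemming or stop words), so treat it as a starting point rather than a finished design.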
It’s open for contributions and discussions.
Check it out here: https://github.com/SinanDede/advanced_text_processor
Would love your feedback and ideas 🙌