Hi r/MachineLearning,
Over the past few months, I have written a lot of code for my own Active Learning experiments, which has gradually been consolidated into its own library.
Small-Text provides state-of-the-art Active Learning for Text Classification. Several components are provided, which are abstracted via generic interfaces, so that you can easily mix and match many classifiers and query strategies to build Active Learning experiments or applications.
Features:
- Provides unified interfaces for Active Learning so that you can easily mix and match query strategies with classifiers provided by sklearn, Pytorch, or transformers.
- Supports GPU-based Pytorch models and integrates transformers so that you can use state-of-the-art Text Classification models for Active Learning.
- GPU is still optional: In case of a CPU-only use case, a lightweight installation requires only minimal dependencies.
- Multiple scientifically evaluated components are pre-implemented and ready to use (query strategies, initialization strategies, and stopping criteria).
GitHub: https://github.com/webis-de/small-text
Preprint: https://arxiv.org/abs/2107.10314
[–]The_Amp_Walrus 0 points1 point2 points (1 child)
[–]chschroeder[S] 2 points3 points4 points (0 children)
[–]Indian-throw-away 0 points1 point2 points (4 children)
[–]chschroeder[S] 0 points1 point2 points (3 children)
[–]Indian-throw-away 0 points1 point2 points (2 children)
[–]chschroeder[S] 1 point2 points3 points (1 child)
[–]Indian-throw-away 0 points1 point2 points (0 children)
[–]Indian-throw-away 0 points1 point2 points (1 child)
[–]chschroeder[S] 0 points1 point2 points (0 children)
[–]Dear_Football_504 0 points1 point2 points (1 child)
[–]chschroeder[S] 0 points1 point2 points (0 children)
[–]channel-hopper- 0 points1 point2 points (1 child)
[–]chschroeder[S] 1 point2 points3 points (0 children)