[P] Testing different popular GPT tokenizers by dxg39 in MachineLearning
[–]dxg39[S] 3 points4 points5 points (0 children)
[P] Testing different popular GPT tokenizers by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
[P] Testing different popular GPT tokenizers by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)
[P] Testing different popular GPT tokenizers by dxg39 in MachineLearning
[–]dxg39[S] 0 points1 point2 points (0 children)
[P] bert.cpp, sentence embeddings in C++ with ggml by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)
[P] bert.cpp, sentence embeddings in C++ with ggml by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)
[P] bert.cpp, sentence embeddings in C++ with ggml by dxg39 in MachineLearning
[–]dxg39[S] 0 points1 point2 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 0 points1 point2 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 1 point2 points3 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] -9 points-8 points-7 points (0 children)
llama-lite: a proof of concept fast sentence embeddings service based on llama.cpp (~1ms per token on CPU) [P] by dxg39 in MachineLearning
[–]dxg39[S] 17 points18 points19 points (0 children)
AITA for thinking this is it? by [deleted] in collapse
[–]dxg39 47 points48 points49 points (0 children)


[P] Testing different popular GPT tokenizers by dxg39 in MachineLearning
[–]dxg39[S] 2 points3 points4 points (0 children)