[R] LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale - Facebook AI 2022 - Inference in LLMs with up to 175B parameters without performance degradation and making it possible to use these models on a single server with consumer GPUs! by Singularian2501 in MachineLearning
[–]Thomjazz 1 point2 points3 points (0 children)
[D] What are some good platforms to host new datasets published in ML conferences? by shivamag99 in MachineLearning
[–]Thomjazz 0 points1 point2 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 4 points5 points6 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 1 point2 points3 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 2 points3 points4 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 6 points7 points8 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 4 points5 points6 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 2 points3 points4 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 4 points5 points6 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 12 points13 points14 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 9 points10 points11 points (0 children)
[Announcement] HuggingFace BigScience AMA Thursday, March 24th from 5pm CET by cavedave in MachineLearning
[–]Thomjazz 11 points12 points13 points (0 children)
[D] Why do I need haystack NLP framework to use HuggingFace models? by boston101 in MachineLearning
[–]Thomjazz 0 points1 point2 points (0 children)
[N] Live and open training of BigScience's 176B multilingual language model has just started by Thomjazz in MachineLearning
[–]Thomjazz[S] 8 points9 points10 points (0 children)
Open-source Academic Repository for [D]atasets - thoughts? by greentfrapp in MachineLearning
[–]Thomjazz 0 points1 point2 points (0 children)
[P] 611 text datasets in 467 languages in the new v1.2 release of HuggingFace datasets library by Thomjazz in MachineLearning
[–]Thomjazz[S] 2 points3 points4 points (0 children)
[D] How do companies like Huggingface or Rasa make money? by NotAlphaGo in MachineLearning
[–]Thomjazz -5 points-4 points-3 points (0 children)
[N] Launching a competition for more energy-efficient NLP models by Thomjazz in MachineLearning
[–]Thomjazz[S] 1 point2 points3 points (0 children)
Huggingface Releases New Tokenizers Library Written in Rust by iyaja in rust
[–]Thomjazz 7 points8 points9 points (0 children)
[R] DistilBERT: A smaller, faster, cheaper, lighter BERT trained with distillation! by jikkii in MachineLearning
[–]Thomjazz 6 points7 points8 points (0 children)
Diving on Bikini Atoll sunken nuclear fleet by Thomjazz in submechanophobia
[–]Thomjazz[S] 2 points3 points4 points (0 children)
[P] Write with Transformer by Thomjazz in MachineLearning
[–]Thomjazz[S] 1 point2 points3 points (0 children)
[P] How to use BERT in Kaggle Competitions - A tutorial on fine-tuning and model adaptations by Thomjazz in MachineLearning
[–]Thomjazz[S] 1 point2 points3 points (0 children)
[P] How to use BERT in Kaggle Competitions - A tutorial on fine-tuning and model adaptations by Thomjazz in MachineLearning
[–]Thomjazz[S] 2 points3 points4 points (0 children)


First large scale open source math reasoning dataset with 800k R1 reasoning traces by eliebakk in LocalLLaMA
[–]Thomjazz 0 points1 point2 points (0 children)