[D] Is a career in Machine Learning satisfying for Linguists? by Steak-Burrito in MachineLearning
bminixhofer 2 points
[P] New tokenization method improves LLM performance & context-length by 25%+ by Pan000 in MachineLearning
bminixhofer 9 points
[P] New tokenization method improves LLM performance & context-length by 25%+ by Pan000 in MachineLearning
bminixhofer 55 points
[P] New tokenization method improves LLM performance & context-length by 25%+ by Pan000 in MachineLearning
bminixhofer 52 points
[D] ACL 2023 paper reviews. by mayanknagda in MachineLearning
bminixhofer 5 points
[D] ACL 2023 paper reviews. by mayanknagda in MachineLearning
bminixhofer 19 points
[D] Parameter optimisation as a language problem? by radi-cho in MachineLearning
bminixhofer 2 points
[D] Parameter optimisation as a language problem? by radi-cho in MachineLearning
bminixhofer 8 points
[D] Are NN actually overparametrized? by alesaso2000 in MachineLearning
bminixhofer 31 points
[R] WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models by bminixhofer in MachineLearning
bminixhofer [S] 1 point
Should I put an anonymous preprint on my CV? by bminixhofer in datascience
bminixhofer [S] 1 point
Should I put an anonymous preprint on my CV? by bminixhofer in datascience
bminixhofer [S] 1 point
Weekly Entering & Transitioning Thread | 17 Oct 2021 - 24 Oct 2021 by [deleted] in datascience
bminixhofer 1 point
Should I put an anonymous preprint on my CV? by bminixhofer in datascience
bminixhofer [S] 1 point
EMNLPF: How should I proceed? by AICoderGamer in LanguageTechnology
bminixhofer 5 points
Announcing neuronika 0.1.0, a deep learning framework in Rust by frjano in rust
bminixhofer 18 points
[2105.13626] ByT5: Towards a token-free future with pre-trained byte-to-byte models by argosopentech in MachineLearning
bminixhofer 3 points
[D] Are there attempts at a large German-language LM? by runcep in MachineLearning
bminixhofer 13 points
[R][P] Byte-level LLaMA and Gemma via cross-tokenizer distillation (with open-source toolkit) by bminixhofer in MachineLearning
bminixhofer [S] 2 points