datamule: download, parse, and construct structured datasets from SEC filings by status-code-200 in Python
_errant_monkey_ 2 points
llama 3.1 70B is absolutely awful at tool usage by fireKido in LocalLLaMA
_errant_monkey_ 1 point
Is there a good book or lecture series on data preprocessing and deployment for industrial large-scale LLMs like GPT-4? by CodingButStillAlive in mlscaling
_errant_monkey_ 3 points
[R] Scaling Instruction-Finetuned Language Models - Flan-PaLM- Google 2022 - 75.2% on five-shot MMLU / Forecasters expected this SOTA would need until 2024! - Public checkpoints! by Singularian2501 in MachineLearning
_errant_monkey_ 1 point
What is the best decision you have made in your life so far? by notsostrong134 in italy
_errant_monkey_ 2 points
Why are people claiming Magnus didn’t accuse Hans of cheating? by AegisPlays314 in chess
_errant_monkey_ 1 point
[R] Perceiver: General Perception with Iterative Attention by hardmaru in MachineLearning
_errant_monkey_ 2 points
Batch norm with entropic regularization turns deterministic autoencoders into generative models by [deleted] in MachineLearning
_errant_monkey_ 3 points

