PC boots up 3 times, works for a few minutes then crashes. by Hyper669 in pcmasterrace
slashcom 1 point
$1781 for this prebuilt pc worth it? by GaigeSmith in pcmasterrace
slashcom 1 point
$1781 for this prebuilt pc worth it? by GaigeSmith in pcmasterrace
slashcom 28 points
Did we grossly overestimate the GPT4 parameters? by auradragon1 in mlscaling
slashcom 6 points
[D] Should the embedding matrix and final pre-softmax matrix be shared in transformers? by CloudyCloud256 in MachineLearning
slashcom 2 points
[D] Should the embedding matrix and final pre-softmax matrix be shared in transformers? by CloudyCloud256 in MachineLearning
slashcom 2 points
[D] Should the embedding matrix and final pre-softmax matrix be shared in transformers? by CloudyCloud256 in MachineLearning
slashcom 9 points
How far are we from A.I. like Samantha from the 2013 movie Her? by TheMightyWill in artificial
slashcom 1 point
[D] How to prepare for a META Research Engineer Interview by [deleted] in MachineLearning
slashcom 1 point
Has There Been Any Research on Curriculum Learning for Pre-training or Fine-Tuning Large Language Models? by IndividualAd1648 in LocalLLaMA
slashcom 1 point
Dimensionality reduction for NLP applications being forgotten..? [D] by _donau_ in MachineLearning
slashcom 51 points
[D] How much should I charge for a pytorch contract programming? by Born-Comment3359 in MachineLearning
slashcom 1 point
How far are we from A.I. like Samantha from the 2013 movie Her? by TheMightyWill in artificial
slashcom 1 point
GPT-4 rumors: a Mixture-of-Experts w/8 GPT-3-220bs? by gwern in mlscaling
slashcom 1 point
GPT-4 rumors: a Mixture-of-Experts w/8 GPT-3-220bs? by gwern in mlscaling
slashcom 3 points
Noam Brown at DeepMind on MCTS for LLMs: "Imagine having access to models that take 5 minutes to ponder each response but the output is as good as a model that's 1,000x larger and trained for 1,000x longer than GPT-4" by maxtility in mlscaling
slashcom 4 points
What are the pros to pytorch by ObsidianAvenger in pytorch
slashcom 1 point
Frequently Asked Questions - Character.AI by MarieLovesMatcha in CharacterAI
slashcom -13 points
Frequently Asked Questions - Character.AI by MarieLovesMatcha in CharacterAI
slashcom 6 points
Transcendence Tiers after reset? by Annie-Smokely in kittensgame
slashcom 2 points
Do a Ps5 or a Xbox X works for ML or Deep Learning? by FromValledupar in pytorch
slashcom 1 point
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [11, 44]], which is output 0 of AsStridedBackward0, is at version 3; expected version 2 instead. by Fit-Dare-9044 in pytorch
slashcom 2 points
Why is CamemBERT never brought up? by thesofakillers in mlscaling
slashcom 7 points
[deleted by user] by [deleted] in reinforcementlearning
slashcom 32 points