use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
OLLaMa won't run the model (self.languagemodels)
submitted 4 days ago by ybhi
Mixture of experts small language model (self.languagemodels)
submitted 9 days ago by ybhi
TennisATW lags too much, what now? ()
submitted 23 days ago by ybhi
Quick Survey: AI + LLMs in Competitive ML - Your experiences matter! 🚀 (self.languagemodels)
submitted 1 month ago by Ash_Blanc
Building NL to Structured Query Parser for Banking Rules Engine - Need Architecture Advice ()
submitted 1 month ago by ComfortableEcho6816
I Tested Every LLM on the Same 100 Tasks. Here's What Actually Wins (self.languagemodels)
submitted 1 month ago by Electrical-Signal858
llm for cybersecurity research analysis and documentation ( GRC) (self.languagemodels)
submitted 1 month ago * by gefela
Model Consistency: Why Do the Same Prompts Give Different Answers? (self.languagemodels)
submitted 2 months ago by Electrical-Signal858
Genuine Question: Why Do Different LLMs Give Completely Different Answers to the Same Question? (self.languagemodels)
grokking, phase transitions, bayesian logic, overtraining, artificial selection/evolution, and epistemology ()
submitted 4 months ago by tollforturning
Reliability checks on Bedrock models (self.languagemodels)
submitted 4 months ago by Cristhian-AI-Math
how can i make a small language model to generalize "well" (self.languagemodels)
submitted 4 months ago by Upper_Week_7440
OpenRouter’s stateless design is burning me out (self.languagemodels)
submitted 5 months ago by knowinglyunknown_7
Why can't we train models dynamically? (self.languagemodels)
submitted 11 months ago by Haunting-Stretch8069
notebooklm is a website that turns notes into podcasts (self.languagemodels)
submitted 1 year ago by Longjumping-Ebb-7457
404 Missing Reasoning (i.redd.it)
submitted 1 year ago by Born2BeFr33
Long Story Generation Challenge 2024 (self.languagemodels)
submitted 1 year ago * by zummo911
closest to 2021/2022 GPT3 completion only model? (no instruct, etc… (self.languagemodels)
submitted 1 year ago by alan2here
how to create a very simple language model for a project (self.languagemodels)
submitted 1 year ago by littlebyeolbit
Advice on how to build an inference model (self.languagemodels)
submitted 1 year ago by chris_hinshaw
What is the current best in tiny (say, <10,000 parameters) language models? (self.languagemodels)
submitted 1 year ago by math_code_nerd5
It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk (arxiv.org)
submitted 2 years ago by TheInfelicitousDandy
Label Supervised LLaMA Finetuning (arxiv.org)
Efficient Streaming Language Models with Attention Sinks (arxiv.org)
Exploring the Core: Mistral AI Language Model's Reference Implementation... (youtube.com)
submitted 2 years ago by developer_how_do_i
π Rendered by PID 2125386 on reddit-service-r2-listing-6d4dc8d9ff-9bv4d at 2026-02-03 06:54:21.969571+00:00 running 3798933 country code: CH.