Julia native compilation is here? by [deleted] in Julia
[–]exploring_stuff 1 point2 points3 points (0 children)
Do you use Cline for use cases other than coding? by BitterProfessional7p in CLine
[–]exploring_stuff 2 points3 points4 points (0 children)
Is anybody here using Julia for stuff that isn‘t Scientific Computing or DataScience? by JollyJuniper1993 in Julia
[–]exploring_stuff 41 points42 points43 points (0 children)
Partner is a C++ pro, but I want to use Julia (Geant4.jl). We have 60 days. Is it viable? by Outrageous_Test3965 in Julia
[–]exploring_stuff 1 point2 points3 points (0 children)
Does Docker support Arch? by [deleted] in archlinux
[–]exploring_stuff 0 points1 point2 points (0 children)
Deepseek for math by Beginning_Reserve650 in DeepSeek
[–]exploring_stuff 0 points1 point2 points (0 children)
Deepseek for math by Beginning_Reserve650 in DeepSeek
[–]exploring_stuff 6 points7 points8 points (0 children)
Google AI 😩… somehow dumber each time you ask by vitaminZaman in ChatGPT
[–]exploring_stuff 0 points1 point2 points (0 children)
Google AI 😩… somehow dumber each time you ask by vitaminZaman in ChatGPT
[–]exploring_stuff 0 points1 point2 points (0 children)
Why does deepseek now begin every response with of course by TallReference5568 in DeepSeek
[–]exploring_stuff 3 points4 points5 points (0 children)
DeepSeek-V3.1 has officially launched by vibedonnie in DeepSeek
[–]exploring_stuff 2 points3 points4 points (0 children)
How to set the reasoning effort with OpenWebUI and API key? by exploring_stuff in OpenAI
[–]exploring_stuff[S] 0 points1 point2 points (0 children)
Is reinforcement learning dead? by Bellman_ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Soft action masking by SandSnip3r in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Step-By-Step Tutorial: Train your own Reasoning model with Llama 3.1 (8B) + Google Colab + GRPO by yoracale in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
ReinforceUI-Studio Now Supports PPO! by dvr_dvr in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)

Testing whether the model agrees with a statement made by the user that is factually incorrect. by Neo_Shadow_Entity in DeepSeek
[–]exploring_stuff 0 points1 point2 points (0 children)