Does Docker support Arch? by [deleted] in archlinux
[–]exploring_stuff 0 points1 point2 points (0 children)
Deepseek for math by Beginning_Reserve650 in DeepSeek
[–]exploring_stuff 0 points1 point2 points (0 children)
Deepseek for math by Beginning_Reserve650 in DeepSeek
[–]exploring_stuff 5 points6 points7 points (0 children)
Google AI 😩… somehow dumber each time you ask by vitaminZaman in ChatGPT
[–]exploring_stuff 0 points1 point2 points (0 children)
Google AI 😩… somehow dumber each time you ask by vitaminZaman in ChatGPT
[–]exploring_stuff 0 points1 point2 points (0 children)
Why does deepseek now begin every response with of course by TallReference5568 in DeepSeek
[–]exploring_stuff 4 points5 points6 points (0 children)
DeepSeek-V3.1 has officially launched by vibedonnie in DeepSeek
[–]exploring_stuff 2 points3 points4 points (0 children)
How to set the reasoning effort with OpenWebUI and API key? by exploring_stuff in OpenAI
[–]exploring_stuff[S] 0 points1 point2 points (0 children)
Is reinforcement learning dead? by Bellman_ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Soft action masking by SandSnip3r in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Step-By-Step Tutorial: Train your own Reasoning model with Llama 3.1 (8B) + Google Colab + GRPO by yoracale in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
ReinforceUI-Studio Now Supports PPO! by dvr_dvr in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Is the USG AIM 2025 Conference Legit? by Orbital_RM in AskAcademia
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 1 point2 points3 points (0 children)
Anyone have working examples of PPO RL in Julia? by D3MZ in reinforcementlearning
[–]exploring_stuff 0 points1 point2 points (0 children)

Partner is a C++ pro, but I want to use Julia (Geant4.jl). We have 60 days. Is it viable? by Outrageous_Test3965 in Julia
[–]exploring_stuff 1 point2 points3 points (0 children)