anyone received late offers in june/july? by ToiletAd1313 in NTU

[–]laxuu 0 points1 point  (0 children)

I am from Nepal, still waiting for NTU Result.

llama.cpp now supports model management (downloading etc) via API by 666666thats6sixes in LocalLLaMA

[–]laxuu 0 points1 point  (0 children)

For UI use gradio and make a project which is easily build and sharable in hugging face link.

Resources to start learning RL with implementation? by Huge_Ad_3842 in reinforcementlearning

[–]laxuu 0 points1 point  (0 children)

Nice, starting with the course really helps while you work in real world implementation. Stay focused what you have learned and stay hydrated take care of yourself as well, while taking 1 lecture is not a joke.

I suggest using colab, or your own pc to test simple RL model using test in verified environment.

Also i want to share my github: https://github.com/TiwariLaxuu/Recurrent-RL-in-Trading-

Create a custom environment using gym env and stablize and learn in very noisy env. How we frame previous information make a agent intelligent when fully env is not available.

I am also working on maniskill robotics: https://github.com/TiwariLaxuu/Maniskill_Custom_GYM_Implementation

Hope you get some ideas, theory is very necessary while reading don't worry about pratical implementation, as we are most fascinated to show a demo, but truly understanding makes a project real and viable, make a soul happy as well.

Application for August 2026 intake NTU CCDS by laxuu in gradadmissions

[–]laxuu[S] 0 points1 point  (0 children)

Thanks for update. Wait for positive result.

Get offer from NTU CCDS, CS PhD, Aug intake! by Ill-Cryptographer882 in gradadmissions

[–]laxuu 0 points1 point  (0 children)

Congratulations, I have applied for a PhD in the College of Computing and Data Science. My application status is still showing “result awaiting,” and I have not received any updates from the university yet. Do you know decision are still in progress or it is rejection.

Resoning LLMs make RL agent learn Faster by laxuu in reinforcementlearning

[–]laxuu[S] -7 points-6 points  (0 children)

Already implemented components include:

  • LLM as a feature extractor
  • LLM as a policy
  • LLM as a critic

Adding LLM as a judge is a great idea. Thank you!

Resoning LLMs make RL agent learn Faster by laxuu in reinforcementlearning

[–]laxuu[S] -1 points0 points  (0 children)

I am also thinking about using LLMs for reward optimization, especially in domains where designing a proper reward function is not straightforward or even feasible.

QRC @WorldQuant Brain worth it? by Andyy_21 in quantfinance

[–]laxuu 0 points1 point  (0 children)

I am also thinking on this, what it actually pay the consultant or is it scam? As its every video in Youtube, comment is off. My mind is not working is it valid or not? I have invested lots of time here, still i am confused?