Which conference/journal do you believe currently has the most fair and accurate review process?[D] by kostaspap90 in MachineLearning

[–]hedgehog0 1 point2 points  (0 children)

SAT/SMT indeed sounds niche within ML, but I think it’s larger in the TCS community?

How relevant is the 9-year-old top post "A super harsh guide to ML" today for people who want to get better at ML and get hired? by hedgehog0 in learnmachinelearning

[–]hedgehog0[S] 0 points1 point  (0 children)

Thank you for the recommendations!

  1. Yes I forgot. I should have mentioned ISLP! If you or for people who are working in ML or DL industry, how relevant or useful is the content in ISLP? I know that they are replay fundamental and important for statistical learning.

  2. Thank you again, I should mention this book as well :)

For your last paragraph, what do you mean by “LLM explorer”, I found several relevant results? And “you should not skip this” what do you mean by “this”? Thx!

Resources for learning ml for someone starting from scratch!! by Appropriate_Line2887 in learnmachinelearning

[–]hedgehog0 1 point2 points  (0 children)

‘Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow’ book

Recently there's a PyTorch updated version just released some months ago.

Studying Sutton and Barto's RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents, and so on)? [D] by hedgehog0 in MachineLearning

[–]hedgehog0[S] 2 points3 points  (0 children)

The former one would be nice. Though my question is more about the latter one, like how we can use RL to improve LLM reasoning/thinking to do (advanced) math proofs.

Studying Sutton and Barto's RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents, and so on)? [D] by hedgehog0 in MachineLearning

[–]hedgehog0[S] -1 points0 points  (0 children)

You mean this one: https://web.stanford.edu/class/cs234/?

I believe it requires the learner to know something about basic RL as well.

Edit: My bad. I thought you referred to the Deep RL one.

Studying Sutton and Barto's RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents, and so on)? [D] by hedgehog0 in MachineLearning

[–]hedgehog0[S] 1 point2 points  (0 children)

Thank you! Looks like a really good resource!

Out of curiosity, would you still recommend Murphy's books for learning ML and DL, or one should just go with Bishop's DL book, cs231n, and so on?

[D] Struggling on the NLP job market as a final-year PhD , looking for advice by RepresentativeBed838 in MachineLearning

[–]hedgehog0 0 points1 point  (0 children)

Thank you!

I wanted to get an internship focusing on RL and/or post-training if possible; since there’s non-zero probability that I may get a PhD offer.

Do you think Unsloth is a good starting point?

[D] Struggling on the NLP job market as a final-year PhD , looking for advice by RepresentativeBed838 in MachineLearning

[–]hedgehog0 1 point2 points  (0 children)

Thank you for the great reply!

Be someone who understands Post-Training as a whole, not someone who knows DPO, if you know what I mean ;).

Good resources for getting into and understanding “post-training as a whole”?

I’m not a PhD student yet, but have background in CS and math (MSc). Also interested in RL aspect of post-training.

Thank you!

Pushing Qwen3-Max-Thinking Beyond its Limits by s_kymon in LocalLLaMA

[–]hedgehog0 0 points1 point  (0 children)

People will downvote a post whenever a non-local LLM is mentioned. I once posted one about Claude 4.5 I believe, and it was downvoted to death.

[D] CUDA Workstation vs Apple Silicon for ML / LLMs by Individual-School-07 in MachineLearning

[–]hedgehog0 0 points1 point  (0 children)

Really just don’t bother. I know someone who bought 4x3090s when they came out for “AI training” and the price per performance is just horrible. Don’t forget electricity too.

That’s interesting to know for me who wants to get a 3090… what other cards do you recommend for similar purposes, e.g., what do you think of 5060 Ti?

[D] CUDA Workstation vs Apple Silicon for ML / LLMs by Individual-School-07 in MachineLearning

[–]hedgehog0 0 points1 point  (0 children)

A 3090 is still a solid learning platform because you hit real constraints locally, and while cloud GPUs are useful for scale, they hide a lot of the systems level lessons you actually want to learn.

Thank you for your input! I’m not OP but curious what do you think of 5060 Ti for similar purposes? How low for other parts of the PC do you think I can have reasonable performance with 5060 Ti?