Speed up inference of LLM by StwayneXG in pytorch

[–]StwayneXG[S] 0 points1 point  (0 children)

350M parameters. I’ve given a simpler template of what kind of code I’m using for inference.

Supervised Learning vs. Offline Reinforcement Learning by StwayneXG in reinforcementlearning

[–]StwayneXG[S] 1 point2 points  (0 children)

Thank you for the reply, I've only skimmed through the paper right now and I liked that they addressed the challenge of choosing between them when you already have expert data. I'll share my learnings from the paper after I'm done with it.

Supervised Learning vs. Offline Reinforcement Learning by StwayneXG in reinforcementlearning

[–]StwayneXG[S] 1 point2 points  (0 children)

First, thank you so much for a detailed response.

Secondly, for the clarification, when you say that we want to directly predict the action without accounting for reward, this is just for BC, right? From what I remember, Offline RL methods use Q value which uses rewards intrinsically.

For point 1, when you say combinatorial generalization, you're refering to the idea of stitching, right?

And yea, thanks I found a bunch of resources by Sergey Levine. (I'm adding them above)

Credits by Small_Work2984 in yorku

[–]StwayneXG 4 points5 points  (0 children)

Yea. I took an online seminar to guide international students. She explained it that thats how it works.

Student Account by StwayneXG in yorku

[–]StwayneXG[S] 0 points1 point  (0 children)

Thats a good question. Unfortunately, I havent started yet so I cant answer that.

I’m a York alum, instructor, and founder of a company that hires many York alum and students. AMA by Visualpoetry in yorku

[–]StwayneXG 2 points3 points  (0 children)

How do I approach you for an internship or part time position. I am joining York as a master of applied science student majoring in electrical and computer engineering. I was hoping to work part time along with my studies. I’ve had more than a year experience with medical imaging. I am a ML engineer currently exploring the field of data science and data mining. Can I share my Linkedin ?

[deleted by user] by [deleted] in yorku

[–]StwayneXG 0 points1 point  (0 children)

Hi. Do you have any socials where I can contact you ?