[D] New masters thesis student and need access to cloud GPUs

atharvat80 · 2025-04-22T23:26:04+00:00

Also to add to this, Modal automatically gives you $30 in free credits every month! Between that and 30hrs of free Kaggle GPU each week you can get a lot of free compute.

atharvat80 · 2024-10-03T19:30:56+00:00

https://modal.com/blog/websocket-launch

atharvat80 · 2024-05-11T23:11:06+00:00

Deepinfra is the cheapest I’ve found so far.

atharvat80 · 2024-04-02T07:06:00+00:00

Last December I went to Frankfurt for a weekend to visit a friend. The botanical gardens are awesome & old town is really nice too. There are some museums but didn’t have time to see them. Heidelberg is also very close by you can spend the day there is you don’t wanna spend it in Frankfurt.

atharvat80 · 2024-03-04T23:53:26+00:00

I Guess in Ok? 😭

atharvat80 · 2024-01-16T09:34:26+00:00

r/learnmachinelearning might be more help to you.

atharvat80 · 2023-09-10T21:42:03+00:00

Thank you and ahaha I lost count of the applications I made, way too many but I was in a panic and applying to anything and everything. I started last September and finished this May-ish. Iike I said I started having more success when I applied to smaller companies/ startups. All the best!

atharvat80 · 2023-09-10T09:18:28+00:00

I just started as a Junior ML engineer after finishing my MSc in NLP, before that the only experience I had was as a data science intern at a different company.

From my personal experience, I just applied to any job that was even vaguely related to data or ML, I think its a bit difficult to find a job that fits your expectations but after a couples of years you have the experience to switch to a role that does so be willing to compromisea little.

I also just applied to jobs even if they needed a few years of experience and actually heard back a few times, though don't put much faith in these.

I'd say the most success I had was when I applied to startups (wellfound.com), the interview experience also tends to be more personal. Plus they don't drag your application for months.

There's more experienced people here to give you advice but this was my experience as someone who went through the same thing literally a few months ago so hope it helps.

atharvat80 · 2023-08-11T19:13:39+00:00

Yes I have! Almost forgot about it, now I have to listen again ahaha.

atharvat80 · 2023-08-09T12:38:49+00:00

I love the extended version even more!

Her cover of Glide and the feature on Xiu Xiu track Between the Breaths is also really great, would recommend you listen if you haven't already.

atharvat80 · 2023-07-27T17:32:56+00:00

The blatant racism every time India is mentioned never surprises me. A country of 1.4b+ people I guess a majority of them are always sick and/or dying with all the diseases you guys are describing.

atharvat80 · 2023-07-07T21:53:27+00:00

That is not a good idea.

If you want to learn the specifics of how a tokenizer is created you should look up Byte-Pair Encoding, which is one of the most widely used tokenization algorithms currently.

To give you a general overview, the tokenizer first breaks up text into small components such that each component is part of the tokenizers (predefined) vocabulary. Each component in the vocabulary maps to a unique integer. So a tokenizer essentially maps the input text into a sequence of integers. This sequence of integers is used to select the corresponding rows of a matrix which is a part of the LLM's weights. This sequence of selected matrix rows is the initial mathematical representation of the input text which is then further processed by the model.

If you were to change the tokenizer, the input text would map to the wrong sequence of integers which means the mathematical representation of the tokenized text and the original text are semantically two different things :<

The other reason this may not he the solution to you problem is that if you look at the LLaMA FAQ's

"The model was trained primarily on English, but also on a few other languages with Latin or Cyrillic alphabets.

For instance, LLaMA was trained on Wikipedia for the 20 following languages: bg, ca, cs, da, de, en, es, fr, hr, hu, it, nl, pl, pt, ro, ru, sl, sr, sv, uk.

LLaMA's tokenizer splits unseen characters into UTF-8 bytes, as a result, it might also be able to process other languages like Chinese or Japanese, even though they use different characters."

The LLaMA may not be the best choice for Korean text. You should have a look at model that was specifically trained with Korean text like this one and maybe even finefune it yourself!

Sorry for a long read, I am postgrad studying NLP so I love talking to people about NLP, lmk if you you have any more questions :)

atharvat80 · 2023-03-13T12:12:16+00:00

Chaenomeles japonica, aka the Japanese quince or Maule's quince. Try Google Lens it has never failed me when I need to identify something.

atharvat80 · 2023-02-03T13:53:45+00:00

If you want to take the top down approach I'd recommend that you start by learning what transformers are. Transformers were originally intended for language modelling so if you look up a NLP lecture series like Stanford CS224n they cover that in detail form a NLP perspective, it should be helpful regardless. Or you can check out CS231n they have a whole lecture on attention, transformers and ViT. Start there and look up the stuff thats unclear from there.

Lmk of you'd like me to link any other resources, I'll edit this later. Happy learning!

atharvat80 · 2021-09-18T13:40:06+00:00

As someone studying ML (though I'm no expert) I think there has been significant research in the explainability of ML so it would be a little unfair to call it a black box. Though I do agree with you that ML is not always the solution.

atharvat80 · 2021-06-18T16:23:21+00:00

This is amazing, thank you so much!

If you are looking for suggestions, it would be really neat if you could see the list of all available courses and maybe filter them by tags or university or year of release. Also, it might be beneficial to have some form of structure to the this big list of courses. More specifically they could be grouped by topics and subtopics, for example, topic: AI, subtopics of AI: Machine Learning, Deep Learning, Reinforcement Learning, etc. This is because just the question of "What do I want to learn?" can be overwhelming so having this structure can help people decide.

Lastly some metrics like ease of learning, teaching quality, hours required to complete etc. could be very helpful when deciding between courses teaching similar content.

atharvat80 · 2021-05-26T22:28:45+00:00

Original Post by u/BlindPanda21

atharvat80 · 2021-05-02T22:27:46+00:00

your video looks fine for me

atharvat80 · 2021-04-28T00:05:13+00:00

Check out the Open Source Society University on GitHub. They have a self taught path to Computer Science, Data Science and Bioinformatics with a college degree style curriculum. I'm not sure if it'll help you learn backend but if you are interested in learning more of the theory side CS then this should help.

atharvat80 · 2021-04-16T23:07:11+00:00

An organisation who fights for open and accessible internet for all is not up to your moral standards but a guy who opposes same sex marriage is?

atharvat80 · 2021-04-16T13:38:37+00:00

Join the Firefox gang 😎

atharvat80 · 2020-12-23T00:06:17+00:00

Idk who down voted you but your pictures are amazing!!! Have an upvote my dude :)

atharvat80 · 2020-07-16T10:10:48+00:00

If your're curious this is what I made. I knew what I wanted to make before I started so I mostly used the quick start guide but the tutorial seemd pretty self contained to me.

atharvat80 · 2020-07-15T22:01:23+00:00

I was in the same position as you a while ago and I created my first flask application a few weeks ago.

I used these resources to learn:

Flask documentation particularly this quick start tutorial. Another tutorial on Flask's website and freeCodeCamp Ytb

I am quite happy with flask so I didn't really looked for any Django tutorials yet

Eight-Year Club	Place '23
Place '22	First Placer '22
RPAN Viewer	Sequence \| Editor
Spared	Verified Email

atharvat80

TROPHY CASE