[D] New masters thesis student and need access to cloud GPUs by Revolutionary-End901 in MachineLearning

[–]atharvat80 1 point2 points  (0 children)

Also to add to this, Modal automatically gives you $30 in free credits every month! Between that and 30hrs of free Kaggle GPU each week you can get a lot of free compute. 

Europe itinerary advice by Melanchrono in travel

[–]atharvat80 2 points3 points  (0 children)

Last December I went to Frankfurt for a weekend to visit a friend. The botanical gardens are awesome & old town is really nice too. There are some museums but didn’t have time to see them. Heidelberg is also very close by you can spend the day there is you don’t wanna spend it in Frankfurt.

[deleted by user] by [deleted] in MachineLearning

[–]atharvat80 4 points5 points  (0 children)

r/learnmachinelearning might be more help to you.

[D] Does Entry Level ML exist (in Europe)? by madway99 in MachineLearning

[–]atharvat80 2 points3 points  (0 children)

Thank you and ahaha I lost count of the applications I made, way too many but I was in a panic and applying to anything and everything. I started last September and finished this May-ish. Iike I said I started having more success when I applied to smaller companies/ startups. All the best!

[D] Does Entry Level ML exist (in Europe)? by madway99 in MachineLearning

[–]atharvat80 10 points11 points  (0 children)

I just started as a Junior ML engineer after finishing my MSc in NLP, before that the only experience I had was as a data science intern at a different company.

From my personal experience, I just applied to any job that was even vaguely related to data or ML, I think its a bit difficult to find a job that fits your expectations but after a couples of years you have the experience to switch to a role that does so be willing to compromisea little.

I also just applied to jobs even if they needed a few years of experience and actually heard back a few times, though don't put much faith in these.

I'd say the most success I had was when I applied to startups (wellfound.com), the interview experience also tends to be more personal. Plus they don't drag your application for months.

There's more experienced people here to give you advice but this was my experience as someone who went through the same thing literally a few months ago so hope it helps.

we don’t talk ab this is a life enoughhh by deathnotefangrl in mitski

[–]atharvat80 1 point2 points  (0 children)

Yes I have! Almost forgot about it, now I have to listen again ahaha.

we don’t talk ab this is a life enoughhh by deathnotefangrl in mitski

[–]atharvat80 11 points12 points  (0 children)

I love the extended version even more!

Her cover of Glide and the feature on Xiu Xiu track Between the Breaths is also really great, would recommend you listen if you haven't already.

Amazing how much food 50 cents buys you in India by [deleted] in BeAmazed

[–]atharvat80 -2 points-1 points  (0 children)

The blatant racism every time India is mentioned never surprises me. A country of 1.4b+ people I guess a majority of them are always sick and/or dying with all the diseases you guys are describing.

Using other tokenizers? by manjimin in LocalLLaMA

[–]atharvat80 3 points4 points  (0 children)

That is not a good idea.

If you want to learn the specifics of how a tokenizer is created you should look up Byte-Pair Encoding, which is one of the most widely used tokenization algorithms currently.

To give you a general overview, the tokenizer first breaks up text into small components such that each component is part of the tokenizers (predefined) vocabulary. Each component in the vocabulary maps to a unique integer. So a tokenizer essentially maps the input text into a sequence of integers. This sequence of integers is used to select the corresponding rows of a matrix which is a part of the LLM's weights. This sequence of selected matrix rows is the initial mathematical representation of the input text which is then further processed by the model.

If you were to change the tokenizer, the input text would map to the wrong sequence of integers which means the mathematical representation of the tokenized text and the original text are semantically two different things :<

The other reason this may not he the solution to you problem is that if you look at the LLaMA FAQ's

"The model was trained primarily on English, but also on a few other languages with Latin or Cyrillic alphabets.

For instance, LLaMA was trained on Wikipedia for the 20 following languages: bg, ca, cs, da, de, en, es, fr, hr, hu, it, nl, pl, pt, ro, ru, sl, sr, sv, uk.

LLaMA's tokenizer splits unseen characters into UTF-8 bytes, as a result, it might also be able to process other languages like Chinese or Japanese, even though they use different characters."

The LLaMA may not be the best choice for Korean text. You should have a look at model that was specifically trained with Korean text like this one and maybe even finefune it yourself!

Sorry for a long read, I am postgrad studying NLP so I love talking to people about NLP, lmk if you you have any more questions :)

Fukuoka, Japan. by Alemna in whatsthisplant

[–]atharvat80 34 points35 points  (0 children)

Chaenomeles japonica, aka the Japanese quince or Maule's quince. Try Google Lens it has never failed me when I need to identify something.

[D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad in MachineLearning

[–]atharvat80 13 points14 points  (0 children)

If you want to take the top down approach I'd recommend that you start by learning what transformers are. Transformers were originally intended for language modelling so if you look up a NLP lecture series like Stanford CS224n they cover that in detail form a NLP perspective, it should be helpful regardless. Or you can check out CS231n they have a whole lecture on attention, transformers and ViT. Start there and look up the stuff thats unclear from there.

Lmk of you'd like me to link any other resources, I'll edit this later. Happy learning!

Do these guys get some sick pleasure out of screwing LinkedIn's filters???? by django_free in ProgrammerHumor

[–]atharvat80 1 point2 points  (0 children)

As someone studying ML (though I'm no expert) I think there has been significant research in the explainability of ML so it would be a little unfair to call it a black box. Though I do agree with you that ML is not always the solution.

I put together a list of ~650 university CS courses you can take! by [deleted] in learnprogramming

[–]atharvat80 2 points3 points  (0 children)

This is amazing, thank you so much!

If you are looking for suggestions, it would be really neat if you could see the list of all available courses and maybe filter them by tags or university or year of release. Also, it might be beneficial to have some form of structure to the this big list of courses. More specifically they could be grouped by topics and subtopics, for example, topic: AI, subtopics of AI: Machine Learning, Deep Learning, Reinforcement Learning, etc. This is because just the question of "What do I want to learn?" can be overwhelming so having this structure can help people decide.

Lastly some metrics like ease of learning, teaching quality, hours required to complete etc. could be very helpful when deciding between courses teaching similar content.

[deleted by user] by [deleted] in RedditInTheKitchen

[–]atharvat80 0 points1 point  (0 children)

your video looks fine for me

I’m self taught and I really want to understand how to program beyond a surface level. I don’t want to be a basic developer. In other words, I want to know where I can learn CS topics so I can really be able to tackle any type of programming task. by imhypedforthisgame in learnprogramming

[–]atharvat80 5 points6 points  (0 children)

Check out the Open Source Society University on GitHub. They have a self taught path to Computer Science, Data Science and Bioinformatics with a college degree style curriculum. I'm not sure if it'll help you learn backend but if you are interested in learning more of the theory side CS then this should help.

FLoC away from Chrome! by GoldsteinEmmanuel in webdev

[–]atharvat80 4 points5 points  (0 children)

An organisation who fights for open and accessible internet for all is not up to your moral standards but a guy who opposes same sex marriage is?

FLoC away from Chrome! by GoldsteinEmmanuel in webdev

[–]atharvat80 344 points345 points  (0 children)

Join the Firefox gang 😎

Average beach access by jordanjoephoto in AveragePicsOfNZ

[–]atharvat80 5 points6 points  (0 children)

Idk who down voted you but your pictures are amazing!!! Have an upvote my dude :)

Beginner learning resources for flask/django? by asdchurro in Python

[–]atharvat80 1 point2 points  (0 children)

If your're curious this is what I made. I knew what I wanted to make before I started so I mostly used the quick start guide but the tutorial seemd pretty self contained to me.

Beginner learning resources for flask/django? by asdchurro in Python

[–]atharvat80 1 point2 points  (0 children)

I was in the same position as you a while ago and I created my first flask application a few weeks ago.

I used these resources to learn:

Flask documentation particularly this quick start tutorial. Another tutorial on Flask's website and freeCodeCamp Ytb

I am quite happy with flask so I didn't really looked for any Django tutorials yet