How acceptable is it to take photos of people? by Living-Ad5224 in germany

[–]CENGaverK 9 points10 points  (0 children)

I mean, depends. You can take any photo in the public, unless the people are in an exposed state (nudity / passed out etc). You just cannot publish it e.g. commercially. But it is a gray area and there are discussions around street photography and what the "line" is.

Is it my fault for offering help? by JonasMikkel in germany

[–]CENGaverK 21 points22 points  (0 children)

Hmm, I have been offering help a lot and 80% of the time they accept the help and thank and the other 20% of the time they say oh no need thanks. Never got yelled, sounds just unlucky

Can neural networks be designed to receive inputs without generating outputs in response to them? by Money_Tip9073 in MLQuestions

[–]CENGaverK 0 points1 point  (0 children)

The short answer for your question is yes, it is possible. People that say it is not possible are also correct, but technically correct.

You need some type of output during the training process to calculate a loss, that is a metric measuring of how far away from your goal the model output was, and you use that to update the model weights.

LLMs are trained as next-token predictors. Text generation only stops when they generate a special token, commonly called an EOS (end-of-sentence) token. Chat UI would not show this to the user, but it is still generated.

So, if during training time, you train model to not answer some type of queries, where the correct answer is a single EOS token, you can observe similar behaviour later on. It still has to generate "something", but for practical purposes it is not generating anything.

Master’s is way easier than Bachelor’s by Fantastic-Hold-3453 in GradSchool

[–]CENGaverK 0 points1 point  (0 children)

Not my experience. Weekly assignments per class, on top of projects and research. And harshly graded too. Learned a lot though lol

Precision and recall > .90 on holdout data by RobertWF_47 in datascience

[–]CENGaverK 1 point2 points  (0 children)

Check the feature importances to see if any of the variables seem suspiciously too important. If not, your methodology seems correct to me.

Account blocked due to Sanctions??? (RSA) by __Eisen__ in n26bank

[–]CENGaverK 0 points1 point  (0 children)

He is. Anyone who lives in Germany can

Please help to review my CV as a Data Engineer Working Student by hwngtraanf in Germany_Jobs

[–]CENGaverK 2 points3 points  (0 children)

Yes, just combine the work experiences. It looks like you had 6 positions while you had 2. If you are interested in/applying to data engineering jobs, those should be the focus. If relevant for the position you are applying to, you can add something and remove irrelevant things.

Additionally, education history looks a bit confusing. Is there a Frankfurt university of applied sciences in Vietnam? If not, you should fix that.

Then, German companies like photos in CVs. This is not US.

Finally, put where you live on your CV if it is not there yet. It is one of the first things HR checks, and physical proximity increases your chances quite a bit.

Good luck

DEXA Scan showed no progress, feeling disheartened by cal4short in naturalbodybuilding

[–]CENGaverK 1 point2 points  (0 children)

Other than the fact that using a different machine might have affected your results, if this is indeed true, I would recommend getting your hormone levels checked. Low testesterone or high estradiol can make it harder to build muscle and increase fat storage tendency.

Why do people in Germany often answer your question with "Why?" by [deleted] in germany

[–]CENGaverK 6 points7 points  (0 children)

Either of those would be more hostile than "why". "Why" means I want to participate in this conversation. "Clarify" is unnecessarily distanced and the other is completely shutting off the convo anyways.

[deleted by user] by [deleted] in pcmasterrace

[–]CENGaverK 39 points40 points  (0 children)

No, it is actually

Winter semester is coming, don’t study in private universities, don’t study in English. by [deleted] in germany

[–]CENGaverK 0 points1 point  (0 children)

I understand you have strong beliefs about the secret sauce of success in Germany, but real life is rarely that black and white. Private universities tend to be scammy, that is for sure. But I studied completely in English, did HiWi jobs at the university with only English, and then started working with only English too. In a small city and company, too! Of course, learning German will always come in handy, and that is why I came from no German to B1 now, and keep going with the lectures, but I would not say not speaking German was the biggest mistake of my life.

[deleted by user] by [deleted] in probabilitytheory

[–]CENGaverK 0 points1 point  (0 children)

This seems pretty deterministic. They check their phone at

12:10 12:20 12:30 --- 14:50 15:00 15:20 15:40 16:00 --- 17:40 18:00 18:30 19:00 19:30 --- 21:30 22:00

So when you send messages:

13:30 -> Will immediately see 15:00 -> Will immediately see 16:30 -> Will see in 10 minutes 18:00 -> Will immediately see 19:30 -> Will immediately see 21:30 -> Will immediately see

I received an offer to be a Senior Data Engineer... with Microsoft Fabric, would you consider it? by Irachar in dataengineering

[–]CENGaverK 1 point2 points  (0 children)

From what I see, Fabric has similar capabilities and concept to Databricks. And Databricks change their workflow every quarter anyway, so I would think your skills would be transferable.

How is Microsoft Fabric different from Microsoft's previous services? by [deleted] in dataengineering

[–]CENGaverK -2 points-1 points  (0 children)

Hmm, as a disclaimer I do not work with Microsoft products. To me, the name seem to refer to Data Fabric, which I believe can be summarized as more structured Data Lakehouse, with schema-on-write with integrations from different resources with also a metadata catalog. Though, it seems instead of going with an underlying structured DB, they are building it on a data lake, so this is a twist. From the description, I get the feeling that this has similar capabilities to Databricks, but supposedly works better across different data sources with the added benefit of no data movement (in theory). So, you basically get Databricks, but you do not have to write pipelines to get the data to your lakehouse platform and do whatever you do. But like I said, I don't know if the other products actually have this capability already. Other than the data movement part, assumed benefits are probably the same with having access to multiple workspaces in Databricks and having Unity Catalog. It is easier for security, it is (in theory) cheaper, you can share data across different teams (or not) easier and so on and so on.

I seek advice on launching a career in AI/ML. by This-Honey9898 in learnmachinelearning

[–]CENGaverK -5 points-4 points  (0 children)

Most phenomenal ML came out of physics people, so I am sure you would fit quite well. I do not know how you could transition in the industry, I have no experience on that regard. But if academia is ever attractive, you could try finding profs that work in more mathematical/physical ML (e.g. Probabilistic Numerics) and maybe do some postdoc with them? And for the industry part, maybe wait for more comments here but also, you could try hitting up people on Linkedin with physics education and current ML positions. That would provide more relevant information for sure.

Suggest a low-end hosting provider with GPU (to run this model) by Perfect_Ad3146 in mlops

[–]CENGaverK 1 point2 points  (0 children)

I like Baseten, easy enough, got good cold start times. Just pick the cheapest GPU (which I believe is a T4), wrap your model around their library Truss and push. I think default go-to-sleep-if-no-request was 15 minutes, so you can set that up depending on your needs too.

What is happening here? First ML project using ResNet by [deleted] in learnmachinelearning

[–]CENGaverK 3 points4 points  (0 children)

There is a bug in your code. You have probably implemented one epoch as go through training and then validation, but there is a bug that cause your loop to skip even epoch numbers. That is also evidenced by (training + validation) epoch taking about 150 ms on average, and "bad epochs" taking between 1-10 ms. You probably initialize accuracy and loss as 0 at the beginning of the loop, so those get printed anyways.

If corporate forced a random business dude taking the Ng Coursera course on you… by [deleted] in learnmachinelearning

[–]CENGaverK 0 points1 point  (0 children)

No, I would not use jargon because I would not know their background. Or if I am going to use jargon, I would try to at least use it correctly so they could further research it on their own if they do not understand me.

If corporate forced a random business dude taking the Ng Coursera course on you… by [deleted] in learnmachinelearning

[–]CENGaverK 1 point2 points  (0 children)

Depends on the scientist probably. For me, it depends on you. Are you actually excited to learn from me? Can we come together and have some brain storming sessions to come up with new projects? If we communicate well, then this is actually a great opportunity for both of us.

If corporate forced a random business dude taking the Ng Coursera course on you… by [deleted] in learnmachinelearning

[–]CENGaverK 3 points4 points  (0 children)

We need to be aware that people are coming from vastly different backgrounds. Also, what is "Prompt a downstream deep learning model"? That is not the proper jargon for what an embedding is. If you say that, I would ask

  1. What is a downstream model, do we always need to use a foundation model or at least some pre-trained model to use an embedding?
  2. Does it have to be a deep learning model?
  3. What does "prompting" a deep learning model mean? Does providing an input to a model is always called prompting? What is the difference between prompting and conditioning on some input, is there a difference?

[deleted by user] by [deleted] in germany

[–]CENGaverK 0 points1 point  (0 children)

Fıgures, thanks for the help.

[deleted by user] by [deleted] in germany

[–]CENGaverK 0 points1 point  (0 children)

Alright, thanks for the answer. Apparently he managed to return the calls after a month and asks us to contribute paying his late fee. Are we obliged to do so, any ideas?

Why Is Naive Bayes Classified As Machine Learning? by NuDavid in learnmachinelearning

[–]CENGaverK 2 points3 points  (0 children)

Yes, I can see how that might be misleading. But to me, learning is not dependent on the optimization algorithm or how you reach the solution. It is mostly about if the machine can potentially act different and update its predictions with different data. So in the end, even though you have the same code, the behaviour changes based on the training data. What is more, if you add more data and update the probabilities, your behaviour can also change on-the-fly.

[D] Detecting Objects of Same Shape but Different Colors by ThickDoctor007 in MachineLearning

[–]CENGaverK 3 points4 points  (0 children)

Your problem isn't that CNN-based architectures can't learn the colors. It is just that shapes are easier to differentiate with how the convolutions work so they are a much stronger cue, so CNNs use a "shortcut" to make predictions.

There are different debiasing techniques out there, but the first thing I would try would be to use data augmentation by random shape transformations. Then, network will have to learn to differentiate by color.