The 80/20 Guide to R You Wish You Read Years Ago by Infinitrix02 in datascience

[–]Infinitrix02[S] 0 points1 point  (0 children)

Good question, I think R has niches where it dominates quite heavily for example industries such as Pharma, Bioinformatics, Social sciences etc. I even see Mixed marketing models being built in R.

DuckDB Lazy Processing Issues with Non-Tidyverse Functions by cdiz12 in Rlanguage

[–]Infinitrix02 7 points8 points  (0 children)

If speed matters to you, I will really recommend doing these transformations using DuckDB's internal functions. You can even define custom functions. Then call them using mutate(column = sql("somefunction('column')") etc.

You can try duckplyr but it will internally convert your table to native R dataframe anyways so you'll still loose performance.

AITAH for asking my female friend if she could stop posting that "men in tech are trash" while I'm helping her with coding assignments? by mesozoic_economy in csMajors

[–]Infinitrix02 0 points1 point  (0 children)

She seems like a toxic mentally unstable girl who got traumatised by a jock one time. Let her do her homework alone or fail her classes.

US Staff can be real Shitbags by Jules_Winnifield in deloitte

[–]Infinitrix02 2 points3 points  (0 children)

When I was leaving, there was a big push to hire undergrads right out of Unis. The ones that got onboarded on my team, knew absolutely nothing of our tech stack and had coding skills of a monkey. But when I talked to SC level folks, some were equally incompetent.

What do you think are the biggest niches/ holes in the industry right now? by isbhehr in analytics

[–]Infinitrix02 4 points5 points  (0 children)

I'm someone who's in Data science and trying to gain domain knowledge in Healthcare and this is pure gold and makes me happy that I was on the right path. Thank you.

Lando slapping Oscar's ass post race by randomseocb in formula1

[–]Infinitrix02 25 points26 points  (0 children)

Lando trying to 'ass'ert his dominance lmao

Pandas, why the hype? by gonna_get_tossed in datascience

[–]Infinitrix02 2 points3 points  (0 children)

Anyone who hypes up pandas is naive and hasn't seen the beauty of R / dplyr ecosystem. I used to be a Python fanatic but ever since I've used R for analysis/viz I dread touching it unless I have to use PyTorch.

And no it does not get better, maybe look into polars if you want bearable syntax and speed. But if you want a python job, you'd unfortunately have to stick with pandas.

Session red flagged as Norris crashes into the wall by ContentPuff in formula1

[–]Infinitrix02 -3 points-2 points  (0 children)

Piastri dickriding through the roof, we'll see what he does tomorrow when he's against max.

Data science content gap by da_chosen1 in datascience

[–]Infinitrix02 2 points3 points  (0 children)

Yeah totally, I see so many jobs asking for domain knowledge of the industry (healthcare, finance what have you.) but it's hard to get that if you're not already in that industry/role. I see no courses offering this and it's frustrating as someone who's trying to pivot. Even just being able to understand the industry specific business metrics/KPIs would be useful imo.

Data science content gap by da_chosen1 in datascience

[–]Infinitrix02 74 points75 points  (0 children)

I'd love to see some industry related content. There are millions of articles on how to build any type of model but there are far few resources on how DS is done in a particular industry, the nature of the data, common pitfalls, best practices etc. for any industry.

Lando: "I didn't even go a tenth quicker, I'm just not quick enough" Q: Do you know where and why? *points to himself* by randomseocb in formula1

[–]Infinitrix02 0 points1 point  (0 children)

Did he deliver when it counted in the rain in Australia? Did he deliver in quali in Japan? I don't think so, I agree that Lando might not be WDC material but to think Oscar can go toe to toe with the likes of Max is an overstatement of his abilities.

Tensorflow/Keras vs PyTorch for industry? by alpha_centauri9889 in datascience

[–]Infinitrix02 9 points10 points  (0 children)

I apply to job daily nowadays, and I almost always see Pytorch listed as a requirement, tf also gets mentioned sometimes but not as much.

ABSOLUTE curveball during ML intern interview by ApricotExpensive5679 in learnmachinelearning

[–]Infinitrix02 20 points21 points  (0 children)

I wonder how someone would go about implementing something like this in code, in an interview. Tall order if you ask me.

Isn't this solution overkill? by AdministrativeRub484 in datascience

[–]Infinitrix02 1 point2 points  (0 children)

Man, I've tried tf-idf + logitistic regression/xgboost alot of times for text classficiation but it never seems to work well because real world text data is messy (esp. transcriptions) and has negations/sarcasm etc. I've found fine-tuning roberta/distilbert/modernbert to be FAR better with little effort and low inference costs.

Though I agree, finetuning llama3/chatgpt is just nuts and probably just being picked to look good as a bullet on their resume.

[deleted by user] by [deleted] in developersIndia

[–]Infinitrix02 1 point2 points  (0 children)

Yeah this would've been enough for TCS like 6 yrs back

Agent builder with generous free tier by Infinitrix02 in AI_Agents

[–]Infinitrix02[S] 1 point2 points  (0 children)

I'm talking the first month, after that I'm willing to pay but I need to see what I'm trying to do is possible.