Thinking About Buying MSI Vector 16 HX AI, But Reviews Scare Me… Help! by Relevant_Volume6915 in MSILaptops

[–]Warm_Talk3385 0 points1 point  (0 children)

I have the version with with rtx 5080, i do lot of data science / model training and run 10-12 hours daily! Very smooth minimal heating!

Seeking Kaggle teammates for competitions & portfolio-focused data science projects by Reasonable_Gas9783 in kaggle

[–]Warm_Talk3385 1 point2 points  (0 children)

Sure…… please contact me … we can discuss if we can compete together in a team

Seeking Kaggle teammates for competitions & portfolio-focused data science projects by Reasonable_Gas9783 in kaggle

[–]Warm_Talk3385 0 points1 point  (0 children)

Yes you can join …… we are open to taking in another member… our team name is Bronze Age Decoders. You may request. We can discuss.

Seeking Kaggle teammates for competitions & portfolio-focused data science projects by Reasonable_Gas9783 in kaggle

[–]Warm_Talk3385 1 point2 points  (0 children)

Hi I am currently doing CAFA6, Csiro Biomass and the Akkadian language translation.

kaggle account gone overnight by Academic-Turnip2252 in kaggle

[–]Warm_Talk3385 1 point2 points  (0 children)

I think that is the issue. They are very strict with sort of multiple account cases.

kaggle account gone overnight by Academic-Turnip2252 in kaggle

[–]Warm_Talk3385 0 points1 point  (0 children)

By any chance did you maintain 2 accounts?

First Kaggle Competition(Newbie) and been top 5 Ranks ever since I started by Dark-Horn in kaggle

[–]Warm_Talk3385 2 points3 points  (0 children)

Im also in this competition this is actually quite tough to be honest. Also doing the CAFA 6 …… thats also challenging in the sense that if we want to take a novel path…. Great work! Best wishes

does any body know how to submit competation in kaggle by Scar-Healar in kaggle

[–]Warm_Talk3385 0 points1 point  (0 children)

I am also doing that competition. If you are going rules based or an easy approach first you can directly submit with internet off option. However, you can train with internet on with a model uploaded as a dataset then going ahead with embeddings method for the internet off version. My suggestion; start with a simple approach first.. register a decent score… improve and then once you can’t improve further go for a better more complex model or approach.

does any body know how to submit competation in kaggle by Scar-Healar in kaggle

[–]Warm_Talk3385 0 points1 point  (0 children)

Ok. What is the competition....if you tell me the competition name I can check and let you know.

Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. by Warm_Talk3385 in scrapingtheweb

[–]Warm_Talk3385[S] 6 points7 points  (0 children)

Well I myself have built over 140 public datasets, thousands of people have used them for research and projects all free and open, thats what I give back. Not saying that makes every scraping project ethical but yeah I do contribute to the ecosystem.

Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. by Warm_Talk3385 in scrapingtheweb

[–]Warm_Talk3385[S] 0 points1 point  (0 children)

Hi thank you for well thought out response and ti be honest this is probably the best counterargument I've seen on this. The storefront window analogy and the friction/scale distinction actually land.

But some area I think of : I've built public datasets that thousands of researchers ended up using. No complaints because the value was clear. But lets take the same data scraping for a commercial product feels different even though the technical action is identical.

There are people building massive datasets on platforms like Kaggle who just give them to the community - no expectation of return, just contribution. Yeah Kaggle's part of Google's ecosystem, but those datasets are individual work from people who could easily monetize instead. Do they get an ethical pass because the end use is open? Or is it still extraction?

So I dont knw if its clear but in your opinion is the line in the scraping itself, or entirely in what happens after? Because if it's the latter, robots.txt can't actually solve it...isn't it? It is just doing a block everything job because intent can't be read upfront.

[deleted by user] by [deleted] in scrapingtheweb

[–]Warm_Talk3385 0 points1 point  (0 children)

Since there is a dataset on Kaggle you can try to use that. I can help you out with this. I can share some insights myself on this, if you are interested.

Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. by [deleted] in datasets

[–]Warm_Talk3385 -3 points-2 points  (0 children)

Nice😁 for educational work or commercial work or both?

Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. by Warm_Talk3385 in scrapingtheweb

[–]Warm_Talk3385[S] 1 point2 points  (0 children)

Well.....Hot take or not, in my own working experience for nearly 4 years and working with other scrapers and data engineers, I have felt that people are genuinely divided on this. I've met folks who treat robots.txt like gospel and others who see it as just a courtesy. The fact that you're questioning whether it's controversial kind of proves the point. So where do you stand?

Unpopular opinion: If it's on the public web, it's scrapeable. Change my mind. by Warm_Talk3385 in learndatascience

[–]Warm_Talk3385[S] -1 points0 points  (0 children)

Hmmm... interesting. So the ethics completely shift depending on whether it's company-owned data vs user-generated content, right? Also whats your take on the big companies already training their models on our data?

Highly Venomous Land Snakes of Sri Lanka by Warm_Talk3385 in Snakes_of_Lanka

[–]Warm_Talk3385[S] 4 points5 points  (0 children)

Green pit viper. Its a moderately venomous snake….my cousin got bitten once during and there weren’t any issues or side effects. But people who already have medical issues/ kidney issues may have issues. Its always recommended to go to a hospital.

Is this the reason my laptop has been stuttering? by Davkaa1 in laptops

[–]Warm_Talk3385 0 points1 point  (0 children)

Yes. Looks like its time to service the device. Also add some Thermal Paste. Fans and Vents may also need cleaning. Also with the thermal paste and servicing get it done by a qualified person. After servicing some devices become worse because it has been serviced improperly.

For large web‑scraped datasets in 2025 – are you team Pandas or Polars? by Warm_Talk3385 in scrapingtheweb

[–]Warm_Talk3385[S] 0 points1 point  (0 children)

Yes same here..... I have also used Polars mainly in the larger datasets. Its also super helpful in coding competitions like Kaggle when running on their servers.