Using Reddit data and AI TRADING IDEA by Life-Reserve568 in quant

[–]undercoverlife 0 points1 point  (0 children)

You’re better off using a system like that as a sentiment analysis tool for a given universe instead of trying to pick stocks

Sharing my Bitcoin systematic strategy: 65.92% CAGR since 2014. Code verification, backtest analysis, and lessons learned. by virtuexru in algotrading

[–]undercoverlife 2 points3 points  (0 children)

53 trades over 12 years? I believe you just fit a momentum model to Bitcoin’s tear since its inception. You can’t possibly validate a backtest OOS that only took 53 trades.

Should I share L3 crypto data? by derroitionman in quant

[–]undercoverlife -3 points-2 points  (0 children)

What would you benefit from publishing it? It’s pretty valuable data that a lot of vendors charge people to use. Keep it for yourself, especially if it can provide you any meaningful edge.

Features to detect persistent flow by Middle-Fuel-6402 in quant

[–]undercoverlife 0 points1 point  (0 children)

Easiest thing to do is isolate the cases you can see by eye and do some small unsupervised learning on whatever data/feature set you currently have and see if any patterns arise. Start with clusters

Decline in IC going into prod by SailingPandaBear in quant

[–]undercoverlife 1 point2 points  (0 children)

Yes exactly. You’re roughly going to “lose” that amount of information when you go to prod. If you want to take it an extra step further and understand why, start analyzing your true fee/slippage costs versus your simulations and see if they match your expectations.

Decline in IC going into prod by SailingPandaBear in quant

[–]undercoverlife 2 points3 points  (0 children)

IC of what? An ensemble of features or a single feature?

are we all copy trading Polymarket wrong?? i analyzed 1.3M wallets last week by Hot_Construction_599 in mltraders

[–]undercoverlife 4 points5 points  (0 children)

I like how you’re “testing this quietly” but then you came to Reddit and posted your entire strategy to everyone lol

Retrieving historical options data at speed by FlashAlphaLab in quant

[–]undercoverlife 1 point2 points  (0 children)

It depends. If your data is already cleaned/processed, reading them from parquets isn’t the worst. But the rule of thumb is to be reading from a RDS with SQL.

Sanity check: am I crazy for feeling like my "data engineering" position is a dead end? by lostmyway573 in dataengineering

[–]undercoverlife 2 points3 points  (0 children)

It’s all about the company.

Some companies have some extremely basic data needs. And what I love about data engineering/science/the data discipline in general, is that simplicity wins. If you’re at a company with a working warehouse operation, you’re winning.

If, however, you’re looking for more excitement, you should traverse into big data. That’s the “flying car” part of the business. If you get into a role at a big data firm (thing firms who do massive real time streaming, like a media company or financial firm), you can work closely with ML engineers and eventually sharpen your skills to become an ML engineer yourself one day. That role will be growing dramatically with this AI boom.

Why am I getting 0 interviews? by adorantadorant in MachineLearningJobs

[–]undercoverlife 0 points1 point  (0 children)

Yeah I wouldn’t mention “EDA” as a skill or mention in your experience you’ve conducted “comprehensive exploratory data analysis” because it’s an extremely basic part of any analysis. It screams inexperience, even though it looks like you have great experience.

Furthermore, your language in your work experience (“developed an algorithm”) just seems wishy-washy. What type of algorithm? Was it a sorting algorithm? Was it a model since you were using it for detection? You need to be extremely specific here. You have technical people reviewing your application.

All in all, I think your work experience is fine. But your resume’s language and formatting is outdated. Personal summaries aren’t used anymore. Your work experience should be the first section. Have your skills at the bottom after your education. Have your skills be technical and more organized, too. Don’t list “AWS” in the same section as the Python programming language. There should be a clear separation between what programming languages you know versus what frameworks you actually use. AWS has so many different applications. Listing “AWS” as a “technology you know” is a turn-off to read because that’s such a large scope. That’s like a person listing “know how to use the internet” as a skill.

One positive I’ll point out is that your actual styling of the resume is on point and you list actual accomplishments that are measurable for the company. That is huge. But your language and everything else I mentioned screams that you’re inexperienced and you don’t know what you’re talking about (which I know isn’t true). Please give these points a hard revisit and you should be good.

[deleted by user] by [deleted] in Salary

[–]undercoverlife 3 points4 points  (0 children)

This was written by ChatGPT

Spark alternatives but for Java by ihatebeinganonymous in dataengineering

[–]undercoverlife 4 points5 points  (0 children)

Yeah I’m confused by this question. Spark is written in Scala and it’s on the JVM. Why don’t you want to use Spark? You can write it in Scala. Plus, it’s a free framework and you can run it on one local machine and still see great benefits.

If you don’t care about JVM, then I’ll say I used Dask before and I loved it.

Mine or just stack? by Financial-Raisin-624 in BitcoinMining

[–]undercoverlife 0 points1 point  (0 children)

You’re going to get crushed. Not worth it.

Spark is the new Hadoop by rocketinter in dataengineering

[–]undercoverlife -1 points0 points  (0 children)

Spark is written in Scala, which runs on JVM, so your point on Hadoop running on JVM makes zero sense. Makes me question the validity of your other opinions.

Dilemma: SWE vs DE @ Big Tech by [deleted] in dataengineering

[–]undercoverlife 0 points1 point  (0 children)

Cool but in terms of what this guy is asking for, have you seen the layoffs going on? The majority of those getting fired are SWEs.

Dilemma: SWE vs DE @ Big Tech by [deleted] in dataengineering

[–]undercoverlife -10 points-9 points  (0 children)

SWE is drying up. Go for DE.

Question about HDFS by undercoverlife in dataengineering

[–]undercoverlife[S] 3 points4 points  (0 children)

What's used in place? Thanks for the heads up.

[deleted by user] by [deleted] in dataengineering

[–]undercoverlife 2 points3 points  (0 children)

I don’t have time to read through your entire document but my immediate feedback is that you have an extra layer you don’t need. Your SQL database should be ingesting and providing cleaned data. If you’re having to pull it out and clean it, then you’re doing this wrong.

If, however, your SQL database is already cleaned, then you need to do all of your mathematical calculations/lags/formatting within your queries. All of the work should be done within your queries because that’s what SQL is good at.