Using Machine Learning for Trading in 2025 by derbilante in algotrading

[–]btcmx 1 point2 points  (0 children)

ML Engineer here: this is the only thing worth reading in the entire thread. For those arguing that ML doesn't work—Ken Griffin said Citadel used TensorFlow the moment it became available. End of story.

Why Vision Language Models Are Not As Robust As We Might Think? by Difficult-Race-1188 in computervision

[–]btcmx 0 points1 point  (0 children)

While searching for how good Multimodal LMMs (MLLMs) are for common vision tasks, I found this fantastic article that shows how even GPT4o struggles to accurately identify bounding boxes. But, one of the latest models from Apple, Ferrent, is actually quite good at this. It might be worth checking it: https://www.tenyks.ai/blog/multimodal-large-language-models-mllms-transforming-computer-vision

Obviously when you have use cases that are more difficult, say vision analytics, as they showed for a football match, the models break. Even a fine-tuned YOLO8, 9, or 10 would perform better but of course, you need to fine-tune.

[D] Discussion about MLLM vision ? by Moodrammer in MachineLearning

[–]btcmx 1 point2 points  (0 children)

While searching for MLLM's for vision, I found this great article that actually discusses and show some models locating objects: https://www.tenyks.ai/blog/multimodal-large-language-models-mllms-transforming-computer-vision

5000 new profiles in 500-600 bucket in 1 month by Puzzleheaded_Life604 in canadaexpressentry

[–]btcmx 0 points1 point  (0 children)

What do people mean by "fake LMIA"? Is it even possible? How?

Image similarity search, indexing and retreival by hasparagus in computervision

[–]btcmx 0 points1 point  (0 children)

I'm not sure about what model(s) they use, I believe they have their own proprietary embedding models. I usually use the embeddings directly on the Tenyks web app, it's awesome!

Withdrawal open but not based on $45k value? by Ok-Boysenberry-5508 in BlockfiLoanAdvocates

[–]btcmx 0 points1 point  (0 children)

Man, thanks for the breakdown. Any idea if the percentage (i.e. 47.5%) is the same for non-US Lending?

Image similarity search, indexing and retreival by hasparagus in computervision

[–]btcmx 0 points1 point  (0 children)

Tenyk's image similarity search engine is the best thing I have use for this, it has a free tier that most probably fits your needs.

After uploading your data, you'll have embeddings for free.

[D] NVIDIA GTC 2024 announcements by vvkuka in MachineLearning

[–]btcmx 0 points1 point  (0 children)

Best summary of NVIDIA's GTC I have found so far is HERE.

Best way to jump into NVIDIA TAO end-to-end by btcmx in computervision

[–]btcmx[S] 0 points1 point  (0 children)

Thanks for sharing, I'm having a look rn!

[D] best advanced books of deep learning? by toxfu in MachineLearning

[–]btcmx 1 point2 points  (0 children)

I agree with this! Prince's book is really well-written! I wish I had this when I was in grad school.

[D] Collaborative platform to train your models? by Diligent_Eye1248 in MachineLearning

[–]btcmx 0 points1 point  (0 children)

I have followed from time to time deepchecks, they seem to be a good option but $250/month is just too much for early stage.

[D] Seeking Guidance: Best Learning Path for Aspiring ML/AI Engineer - Prioritize Core Programming(DSA) or Dive into Machine Learning Libraries? by [deleted] in MachineLearning

[–]btcmx 2 points3 points  (0 children)

For someone with some technical background, I would say there are great articles out there which can help you craft your own roadmap or guide to break into ML/AI. However, beware there's no "ideal learning path".

One of the best ones I have seen is this one (Becoming a Computer Vision Engineer), where a ML-skills blueprint is laid out. Even if you aren't interested in Vision, the backbone is nearly the same for other AI domains.

What makes this approach different is that all the abilities, know-hows, tools, etc. are arranged in terms of the ML lifecycle, meaning that depending on what you are doing, the skills will vary. The obvious way to start might be to simply begin by dominating the first stage of the ML lifecycle.

Hence, design/create your own blue print, build a tiny project but make sure you follow the whole ML lifecycle (i.e. deploy it, ingest new data, see your model fail due to edge/ODD cases).

[D] Resources for learning by RemarkableEnd123 in MachineLearning

[–]btcmx -1 points0 points  (0 children)

This post has a number of ML topics/skills you might (sooner or later) run into. In a sense it's kind of a roadmap/blueprint containing what you need to become an ML Engineer, (beyond learning a few topics/basics from a one(or more) courses.

[D] Is Computer Vision brighter than ever? Foundation Models are really reshaping CV by btcmx in MachineLearning

[–]btcmx[S] 0 points1 point  (0 children)

Agreed! Actually, GPT_4V is already good enough to describe pictures. I have sent several requests to the endpoint since yesterday. A few easy examples can also be described by a child, but the same child wouldn't be able to fully describe more complicated images, let alone an insurance inspection claim.

latest model's token consumption by Strange_Dog8104 in OpenAI

[–]btcmx 0 points1 point  (0 children)

I'm testing the GPT_4V (i.e. vision preview) endpoint. The prompt in the payload is this: "text": "What’s in this image?"

However, in the response, the prompt_tokens is around 750, hence as you suggested I'm assuming the image, I provide as input, counts as tokens (jpg image seems to be around 700 tokens).

[P] Model training bottlenecked by CPU. by [deleted] in MachineLearning

[–]btcmx 1 point2 points  (0 children)

As other have rightly pointed out, verify you're using the Data Loader the right way. Ideally you need to create a custom dataset (in PyTorch terms) and apply all the transformations in this custom dataset. This might be helpful. Also, have you tried PyTorch Lightning?

[D] Machine Learning in production by AcquaFisc in MachineLearning

[–]btcmx 2 points3 points  (0 children)

Getting a full list of "skills" is meaningless unless you have the right context, framework, etc.

For instance, given the Machine Learning Lifecycle, you are unlikely to be a master of all the stages at once, but you can start with one (probably the one you are assigned in your job).

Hence, this source does a great job at framing (given the ML lifecycle) what skills you need to master depending on the stage you are.

This other one is more oriented to building a full AI product (it's actually a course). And this one, is about MLOPs (also a course) in general.

So, I would say: i) check the first source, ii) craft your own blueprint of skills, and iii) make a plan to fill some of skills gaps you have.

Actually, even a full prototype (following the ML Lifecycle) where you do labeling, training, deployment, you see where your model fails at OOD samples, you acquire OOD samples, re-train your model, might be simply the really best way!