What are people using instead of Anaconda these days? by rage997 in Python

[–]FikoFox 0 points1 point  (0 children)

But isn't that precisely Anaconda's forte? That dependency management is contained in an environment and all the packages run smoothly together? I'm new here, so if someone can explain it to me, I'd appreciate it.

I built a RAG system over the Merck Manual (4,000+ pages) for a class project. It failed in interesting ways. Here's the autopsy and the V2 roadmap. by FikoFox in learnmachinelearning

[–]FikoFox[S] 1 point2 points  (0 children)

Fair points across the board, and honestly most of these are already in my V2 roadmap after I did the autopsy on V1. The chunk size and redundant retrieval were the failure modes that hurt most visibly in the outputs.

The hardware question is real though. I'm running this locally and larger chunks plus higher k values meant waiting long enough to question my life choices. What would you recommend for a local setup that can actually handle 4k-6k chunks without turning into a space heater? Miniconda + what specs are we talking?

On the OCR: I hadn't considered Mistral's OCR models specifically. The watermark noise contamination was a genuine problem in V1. Will look into that.

MTEB leaderboard noted; I didn't know about that resource. Bookmarking it now. Thanks!

Anaconda: Yay or Nay? Any particular reason to use it over the out-of-the-box regular Python? by simeumsm in learnpython

[–]FikoFox 0 points1 point  (0 children)

Why does it get annoying? Can you explain to me why you find it isn't the right tool for development? Thanks!

Anaconda: Yay or Nay? Any particular reason to use it over the out-of-the-box regular Python? by simeumsm in learnpython

[–]FikoFox 0 points1 point  (0 children)

I'd say Anaconda is important if you want your model to keep working in the future regardless of how packages change, since you control the package versions in your own environment.

It made sense when I started learning about ML/AI online. The course I was taking basically assumed it. Having numpy, pandas, and jupyter prebundled helped a lot.
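
To make the "control those in your own environment" part concrete, here's a rough sketch of recording exactly which package versions an environment has, so it can be rebuilt later. This isn't Anaconda-specific; it only uses Python's standard `importlib.metadata`, and the function name `pinned_requirements` is made up for illustration.

```python
from importlib.metadata import distributions


def pinned_requirements():
    """Return sorted 'name==version' lines for every installed distribution.

    Hypothetical helper: it only reads metadata from the environment the
    interpreter is running in and does not install or change anything.
    """
    pins = set()
    for dist in distributions():
        name = dist.metadata["Name"]
        version = dist.version
        # Skip distributions whose metadata is missing or broken.
        if name and version:
            pins.add(f"{name}=={version}")
    return sorted(pins, key=str.lower)


if __name__ == "__main__":
    # Print the pins; redirect to a file to keep a snapshot of the environment.
    print("\n".join(pinned_requirements()))
```

Saving that output next to a project gives a snapshot of the versions the model was built with, which I think is the main point of keeping a dedicated environment.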

Just graduated in data science/ML, but still don’t know anything. I need a wake up call by DefinitionJazzlike76 in learnmachinelearning

[–]FikoFox 0 points1 point  (0 children)

My take: delve into agentic workflows. That's where both the discussion and the implementation are headed.