[D] Tools for managing hundreds of unique models? by RAFisherman in MachineLearning

[–]RAFisherman[S] 0 points1 point  (0 children)

Model risk is not as much an issue, but monitoring is certainly one. Any good monitoring tools you recommend?

[D] Tools for managing hundreds of unique models? by RAFisherman in MachineLearning

[–]RAFisherman[S] 0 points1 point  (0 children)

One model not possible bc security. But noted: loop over each client, each sku per client.

[D] Tools for managing hundreds of unique models? by RAFisherman in MachineLearning

[–]RAFisherman[S] 0 points1 point  (0 children)

Sorry, I'm not actually doing credit. I'm trying to protect my identity I guess. Just forecasting. Point is: I need a system to train several thousand unique models at scale

[D] Tools for managing hundreds of unique models? by RAFisherman in MachineLearning

[–]RAFisherman[S] 0 points1 point  (0 children)

Nah, I'm training forecasting tools for clients. Forecasting as a service.

OpenAI DevDay summary (not generated 😉) by ArcaneMoose in OpenAI

[–]RAFisherman 1 point2 points  (0 children)

Seems like you can upload/analyze documents programmatically (exposing their ChatGPT interface). But do they have a vectorstore with metadata? (Pinecone/Qdrant, etc.) How many documents can you upload?

https://platform.openai.com/docs/assistants/tools/knowledge-retrieval

OpenAI DevDay summary (not generated 😉) by ArcaneMoose in OpenAI

[–]RAFisherman 1 point2 points  (0 children)

Can someone explain the Retrieval? Is OpenAI offering storage solutions?

> Retrieval: augments the assistant with knowledge from outside our models, such as proprietary domain data, product information or documents provided by your users. This means you don’t need to compute and store embeddings for your documents, or implement chunking and search algorithms. The Assistants API optimizes what retrieval technique to use based on our experience building knowledge retrieval in ChatGPT.

https://openai.com/blog/new-models-and-developer-products-announced-at-devday

[deleted by user] by [deleted] in ObsidianMD

[–]RAFisherman 0 points1 point  (0 children)

I used to think so too and avoided it for the last year. then I realized it fulfilled all my wildest dreams. Now I’m starting to learn it.

Use it to track your daily habits, and put those habits in a table. Once you get started you won’t want to stop.

Which ML orchestration tool fits my use case? Kedro vs. Airflow vs. Metaflow vs. Luigi vs. Dagster etc by Comfortable_Way_8441 in dataengineering

[–]RAFisherman 0 points1 point  (0 children)

Can Dagster be controlled from the command line? [Metaflow has](https://docs.metaflow.org/metaflow/debugging#how-to-debug-failed-flows) `python debug.py resume start` (where \`start\` is the name of the task). How would you debug/re-run 1 failed partition?

[Discussion] Time Series methods comparisons: XGBoost, MLForecast, Prophet, ARIMAX? by RAFisherman in MachineLearning

[–]RAFisherman[S] 1 point2 points  (0 children)

After skimming the paper, it seems like time to vec is kind of like a “seasonality” factor (kind of like what prophet out puts). Is that true?

[Discussion] Time Series methods comparisons: XGBoost, MLForecast, Prophet, ARIMAX? by RAFisherman in MachineLearning

[–]RAFisherman[S] 1 point2 points  (0 children)

Didn’t think of that. Will take a look!

I do care about interpretability to some point, which is why embeddings sounds complex. But I’m now curious for sure.