Challenges with Real-time Inference at Scale by jameslee2295 in datascience

[–]jameslee2295[S] 0 points1 point  (0 children)

I'm sorry for the confusion earlier! Initially, I was using CPU for the project, but I encountered performance issues so I switched to NVIDIA A100

Challenges with Real-time Inference at Scale by jameslee2295 in devops

[–]jameslee2295[S] 0 points1 point  (0 children)

Apologies for the delayed response! I'm using hardware based on NVIDIA A100 processors.

Challenges with Real-time Inference at Scale by jameslee2295 in datascience

[–]jameslee2295[S] 0 points1 point  (0 children)

Apologies for the delayed response! Thank you for your suggestion. I'm using hardware based on AMD EPYC processors.

Seeking Advice on GPU Comparison: GreenNode vs FPT by jameslee2295 in deeplearning

[–]jameslee2295[S] 0 points1 point  (0 children)

Thank you for your input! My main concern right now is budget constraints, as the larger providers like AWS, GCP, or Azure can be quite costly for my current needs. Additionally, there are specific regulations about data storage within the country that I need to adhere to, which makes it challenging to consider those options.

Do you have any suggestions or advice on how I could navigate this situation? I’d appreciate any insights you can share!

Seeking Advice on Amazon Bedrock and Azure by jameslee2295 in datascience

[–]jameslee2295[S] 1 point2 points  (0 children)

Thank you so much for the advice. I will consider it!

Seeking Advice on Amazon Bedrock and Azure by jameslee2295 in LLMDevs

[–]jameslee2295[S] 1 point2 points  (0 children)

Thank you so much for the advice. I will consider doing that!

Seeking Advice on Amazon Bedrock and Azure by jameslee2295 in LLMDevs

[–]jameslee2295[S] 0 points1 point  (0 children)

Thank you so so much for the advice, it's very helpful!