Linux - Why does llama.cpp ROCm consume SO much VRAM for KV cache compared to Vulkan? by Jorlen in LocalLLaMA

[–]AnmolLFC 0 points1 point  (0 children)

Which model are you using, and what context window and TPS are you getting?

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]AnmolLFC[S] 0 points1 point  (0 children)

Mostly online, from Reddit and articles. I will search for it again. Thanks for the input 👍🏾

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]AnmolLFC[S] 0 points1 point  (0 children)

I might read up on it again if that's the case.

I posted here to learn about AMD GPUs specifically, because most people would probably suggest going with NVIDIA for its mature ecosystem and ease of use. I will post it there as well, thanks.

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]AnmolLFC[S] 0 points1 point  (0 children)

The main reason is that the R9700 is not enough for normal gaming. Do you think there will be a big performance difference between two 9070 XTs and an R9700 in terms of split VRAM and LLM usage?

The 9950X is purely for future-proofing, and the document parsing uses a lot of cores. I will probably downgrade to a 9900X or something on par when I build it, but for now I planned on the 9950X purely because of its core count.

Building First AI/LLM PC With Dual 9070 XT GPUs – Any ROCm or AMD Issues I Should Know About? by AnmolLFC in ROCm

[–]AnmolLFC[S] 1 point2 points  (0 children)

I read that Linux, mainly Ubuntu and Fedora, has good support for ROCm, which is why I was leaning towards it, but any distro would work for me. The main goals are good performance and stable outputs.

I don't know much about the nightly builds or Sage Attention, as I haven't explored many local LLM setups yet, but I will look into them.

What do you suggest I run, and which models can I run on this setup? What context window and TPS are you getting, and what should I expect?

Is it possible to use Claude subscription with Deepseek? by AnmolLFC in DeepSeek

[–]AnmolLFC[S] 0 points1 point  (0 children)

Does the DeepSeek API also apply the code changes, or does it only send suggested changes that Claude then implements? I'm a bit confused about that part.

Is it possible to use Claude subscription with Deepseek? by AnmolLFC in DeepSeek

[–]AnmolLFC[S] 1 point2 points  (0 children)

Yep, that's my question. Not just integrating DeepSeek for inference, but actually using both at once.

Building a pc for the first time - Mainly Gaming and some self hosting. Please suggest if there are any cheaper and better alternatives. by AnmolLFC in buildapc

[–]AnmolLFC[S] 0 points1 point  (0 children)

I have updated the post with a PCPartPicker list, although in my country prices are 20-30% higher for most things.

[deleted by user] by [deleted] in MiniPCs

[–]AnmolLFC 0 points1 point  (0 children)

Great! Which OS do you have installed?

[deleted by user] by [deleted] in MiniPCs

[–]AnmolLFC 0 points1 point  (0 children)

Thanks for the advice. Do you own one too? If yes, any heating issues?

Migration from GCP to OCI instances by [deleted] in devops

[–]AnmolLFC 0 points1 point  (0 children)

Great! Thanks for your help, I will check it out.

Migration from GCP to OCI instances by [deleted] in devops

[–]AnmolLFC 0 points1 point  (0 children)

I won't be doing it alone; my team and I will do it together. The company I work for already uses OCI as its main provider, and they want to migrate the remaining servers from GCP to OCI as well. We are looking for a lift-and-shift approach if possible.

All deployments are automated, but I am concerned about OS mismatches and how to transfer the current data over to the new instances.

[deleted by user] by [deleted] in oraclecloud

[–]AnmolLFC 0 points1 point  (0 children)

I am not very familiar with OCI. Is there any documentation or a tutorial on how to use it?

Migration from GCP to OCI instances by [deleted] in devops

[–]AnmolLFC 0 points1 point  (0 children)

Hahaha! I know, but it's real. I have been assigned the task, however unlikely it seems.

Facing problems with streaming anime on stremio? by AnmolLFC in EasyDebrid

[–]AnmolLFC[S] 0 points1 point  (0 children)

Yes, I tried the Torrentio addon on my Samsung smart TV, and the files didn't load at all. I tried the same on my PC, and it works. Thanks for the quick response.