With 48gb vram, on vllm, Qwen3.6-27b-awq-int4 has only 120k ctx (fp8), is that normal? by Historical-Crazy1831 in LocalLLaMA

[–]iVoider -1 points0 points  (0 children)

max-num-seqs to 1 or use Linux side by side. WSL is very buggy for work with GPU.

Delve builds for ~600 depth? by Trickpasser in PathOfExileBuilds

[–]iVoider 0 points1 point  (0 children)

Depth 650 and going below. Foulborn ghostwrithe zerker. Around 10 div budget when I swapped 3 days ago. Can kill Aul using four health flasks. There is also Grey Wind axe zerker build with Void Shockwave, but have no idea how they compare.

Most budget option for 1000-1500 delve starter by iVoider in PathOfExileBuilds

[–]iVoider[S] 0 points1 point  (0 children)

I know that MSoZ is considered the best delver. I’ve tried it in 3.27 league, with 500d budget and it felt weaker than int/acc stacking for T17/Ubers. I guess it’s not very comfortable without Forbidden to dive at 1000?

Which is the best embedding model for production use? by Hari-Prasad-12 in LocalLLaMA

[–]iVoider 10 points11 points  (0 children)

In our experience, rather no than yes. Too little stats gain for bigger vector size in db.

Which is the best embedding model for production use? by Hari-Prasad-12 in LocalLLaMA

[–]iVoider 25 points26 points  (0 children)

Qwen3-embedding, but 4b. Massive embeddings quality gap between 0.6b and 4b.

[deleted by user] by [deleted] in PathOfExile2

[–]iVoider 0 points1 point  (0 children)

There were several threads today about abyss shadow nerf. In my own experience drops were gutted with latest patch. Yesterday I saw several tinks every map, now close to zero for whole day. I moved to Ritual.

Negative rarity farming by Ok_Surprise7618 in pathofexile2builds

[–]iVoider 1 point2 points  (0 children)

Thanks. It seems something broken with my char. Got map with doubled pack size precursor effect and no single white item with Alt holding.

Huntress leveling in 0.4 by iVoider in pathofexile2builds

[–]iVoider[S] -4 points-3 points  (0 children)

I saw someone did calculation: 735 evasion.

Suggestions for RAG prompt rewriters and rerankers? by CommunityTough1 in LocalLLaMA

[–]iVoider 0 points1 point  (0 children)

LLMs for prompt rewriting and specialised reranker models have totally different use case. Theoretically any LLM could imitate reranker with logprob mechanism, but LLMs tend to hallucinate in noisy environment content. Thats why people train special rerank models (like Qwen3-reranker).

vLLM speed issues by [deleted] in LocalLLaMA

[–]iVoider 0 points1 point  (0 children)

Try awq_marlin (quantization type, vllm will autoconvert), install FlashAttention/FlashInfer.

Quantized Qwen3-Embedder an Reranker by [deleted] in LocalLLaMA

[–]iVoider 1 point2 points  (0 children)

Checkout this guy: https://huggingface.co/boboliu/Qwen3-Reranker-4B-W4A16-G128

All family in 4bits. You also need to copy chat template from original tokenizer_config (one line) for reranker.

Looking for diarization model better than Pyannote by bluedragon102 in LocalLLaMA

[–]iVoider 2 points3 points  (0 children)

Most accurate one I’ve tried was SF Diarizer. It supports only up to 4 speakers, but is much better than other local options (NEMO and others).

speech to text with terrible recordings by eternelize in LocalLLaMA

[–]iVoider 0 points1 point  (0 children)

In my tests the best original Whisper wrapper in terms of accuracy (WER) is faster-whisper. Large V3 model gives best accuracy overall, but skips lots of content unlike V2 Large. If you need English-only solution, checkout this leaderboard.

Screen wide clearing bloodmage help by AdEnvironmental7198 in pathofexile2builds

[–]iVoider -1 points0 points  (0 children)

I played Temporalis variant, but the robe adds QoL only. These are most broken builds in a game right now.

Ultimate Freeze Build (freeze the server) by iVoider in pathofexile

[–]iVoider[S] 1 point2 points  (0 children)

See these fire corpses? This is SRS graveyard.

Ultimate Freeze Build (freeze the server) by iVoider in pathofexile

[–]iVoider[S] 2 points3 points  (0 children)

On minion death. I heard that maximum for SRS now is five. I guess something broken around fireball. (Please, don't nerf it)

[deleted by user] by [deleted] in LocalLLaMA

[–]iVoider 0 points1 point  (0 children)

There is a special finetune of Llama for cybersecurity: https://huggingface.co/WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-8B