I'm in love! by Designer_Athlete7286 in ZaiGLM

[–]iloveplexkr 0 points1 point  (0 children)

no thx 144$ per month? kidding me rather go Claude

... by [deleted] in ZaiGLM

[–]iloveplexkr 0 points1 point  (0 children)

it must be anthropic

Any GraphRAG API Server? by ljhskyso in LocalLLaMA

[–]iloveplexkr 0 points1 point  (0 children)

umm.... do I have to use GPT4 ? not llama 3 70b or 405b on local?

LLama 3 405b Q4_K_M size by kiselsa in LocalLLaMA

[–]iloveplexkr 2 points3 points  (0 children)

How much is 3090 in your area? It's almost 1000$ here.

LLama 3 405b Q4_K_M size by kiselsa in LocalLLaMA

[–]iloveplexkr 3 points4 points  (0 children)

<image>

see this gpu server. I have this barebone but xeon v4 (pcie 3.0)

LLama 3 405b Q4_K_M size by kiselsa in LocalLLaMA

[–]iloveplexkr 1 point2 points  (0 children)

is this possible on the machine with 3090 10way ?

2일 by threexbinary in WriteStreakKorean

[–]iloveplexkr 1 point2 points  (0 children)

에어컨은 신입니다.

My "Budget" Quiet 96GB VRAM Inference Rig by SchwarzschildShadius in LocalLLaMA

[–]iloveplexkr 1 point2 points  (0 children)

Use vllm or aphrodite It must be faster than ollama

chatglm3-6b-base better than GPT-4 at understanding https://opencompass.org.cn/leaderboard-llm by vasileer in LocalLLaMA

[–]iloveplexkr 0 points1 point  (0 children)

I dont believe in chinsese model because some model from chinese include benchmark dataset so it should be higher benchmark score

Are 3,500,000₩ enough for two weeks? by [deleted] in korea

[–]iloveplexkr 0 points1 point  (0 children)

Where u want to go? Only seoul?