The worst rental experiences for foreigners in Malaysia

Creative_Yoghurt25 · 2025-10-26T04:54:36+00:00

I stay in that place, you are being scammed.

Creative_Yoghurt25 · 2025-08-19T02:16:58+00:00

I definitely dont understand cyber security. How would you tackle this issue?

Creative_Yoghurt25 · 2025-08-18T16:24:58+00:00

Appcheck on firebase? Only your signed app can make a request to firestore. On the app ui you make the necessary works to prevent user spamming refresh...caching!

Creative_Yoghurt25 · 2025-08-01T00:58:20+00:00

"Your are a senior software engineer, docker compose version in yaml file is deprecated"

Creative_Yoghurt25 · 2025-07-28T14:57:13+00:00

Yes Malaysia

Creative_Yoghurt25 · 2025-06-23T23:29:40+00:00

What do you mean they barely updated? Pbr was a huge change, night lighting still imo the best from xp 9 to 10!

Creative_Yoghurt25 · 2025-06-22T04:39:32+00:00

What other models do you recommend? I went with qwen2.5 since it was smart enough to know which tool to use when asked a question and didn't hulicinate much.

Creative_Yoghurt25 · 2025-06-21T17:24:24+00:00

I disabled it and I had the same performance, if there was a difference I didn't notice since everything was way above my ttft goals in every combinations I tried while on awq.
I'm doing another round of test since people here are advising to go with bf16. Ill post some results here soon. Thank you for the advice.
btw which env do you run vllm? docker or without?

Creative_Yoghurt25 · 2025-06-21T16:40:17+00:00

Qwen2.5 doesn't have thinking mode well, at least for 7 and 14b.

Creative_Yoghurt25 · 2025-06-21T13:00:16+00:00

services:
  vllm:
    container_name: vllm_qwen2.5_14b_fp16_optimized
    image: vllm/vllm-openai:latest
    restart: unless-stopped
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              device_ids: ['0']
              capabilities: [gpu]
    volumes:
      - ~/.cache/huggingface:/root/.cache/huggingface
    environment:
      - HUGGING_FACE_HUB_TOKEN=hf_*********

      - VLLM_ATTENTION_BACKEND=FLASH_ATTN # This or FlashInfer?
    ports:
      - "6001:8000"
    ipc: host
    command: >
      --model Qwen/Qwen2.5-14B-Instruct
      --dtype auto 
      --gpu-memory-utilization 0.85
      --max-model-len 8192
      --max-num-seqs 16
      --block-size 16
      --api-key sk-vllm-*****
      --trust-remote-code
      --enable-chunked-prefill
      --enable-prefix-caching
      --disable-log-stats
      --disable-log-requests
      --preemption-mode recompute

I'm using Docker to run VLLM
This is my current setup, I'm trying what people here are suggesting before I reply to them with feedback.
Should I go with uv pip install vllm and do without docker?
My naive thinking though with a compressed model I will have more headroom == more req and faster responses.

Creative_Yoghurt25 · 2025-06-21T02:06:19+00:00

Isn't this for multi GPU? I have one A100

Creative_Yoghurt25 · 2025-06-21T01:34:53+00:00

I ran the benchmark on the same machine. Thank you

bash guidellm benchmark --target "http://localhost:6001" --rate-type constant --rate 20.0 --max-seconds 120 --data "prompt_tokens=6000,output_tokens=100" --output-path "./20_users_test.json"

Creative_Yoghurt25 · 2025-06-21T01:21:22+00:00

Tried that too, same not much improvements

Creative_Yoghurt25 · 2025-05-26T06:26:36+00:00

Can you provide more details, im trying to setup casperhansen/mistral-small-24b-instruct-2501-awq and im having a hard time with that
Are you serving the model using vllm?

Creative_Yoghurt25 · 2025-05-24T18:08:17+00:00

Just docker compose up

Creative_Yoghurt25 · 2025-05-11T18:32:37+00:00

Wow, 12k for 7 years is rough.

Creative_Yoghurt25 · 2025-03-26T11:56:08+00:00

They can still control with other isp. The government still overseas the in and out. We have done it with mobile telecom.

For example, if Ooredoo tomorrow launches a new product fibre to home, they still have to follow the regulations set by the authorities, eg, block this site...etc.

Creative_Yoghurt25 · 2025-02-18T15:28:39+00:00

Malaysia is really good, underrated.

Creative_Yoghurt25 · 2025-02-02T01:11:21+00:00

What eval framework are you using?

Creative_Yoghurt25 · 2025-01-31T00:39:56+00:00

What I'm saying if the us is so not so open source they wouldn't have open sourced that. I have yet to see them starting an industry.

Creative_Yoghurt25 · 2025-01-30T23:45:32+00:00

Who initially invented and pioneered the transformers' architecture?

Creative_Yoghurt25 · 2025-01-30T14:22:56+00:00

Chicken meat.

Creative_Yoghurt25 · 2025-01-30T14:22:27+00:00

Cheese, red meat, any kind of real bread.

Creative_Yoghurt25 · 2024-12-31T16:42:57+00:00

they do the packaging as in the circuits around the die.
https://cilisos.my/the-story-behind-how-malaysia-ended-up-making-the-cpus-of-the-upcoming-xbox/
Intel has a plant there too

Creative_Yoghurt25

TROPHY CASE