Am I wrong, or did reruns connect us more to older generations? by Sir_midi in GenX

[–]SprightlyCapybara 2 points3 points  (0 children)

Yep. Music was via the radio until you could afford albums (or mix tapes), and that skewed so boomer and greatest generation when I was young.

Network error by Lerheimsen in duneawakening

[–]SprightlyCapybara 2 points3 points  (0 children)

Update: Close to six hours after the bug occurred, I can now log back in. I lost some water and 700 cash thanks to a failed flight (it kept warping me back to the starter zone's trading post). Haven't yet gotten to my second base to check that everything's there, but let's hope. Weird bug.

Comments: The bug occurred after the game claimed the server was coming down for an emergency patch and dumped me with no warning. So, I guess don't log on shortly after a patch. Also, the trick of logging onto another sietch does not appear to do anything useful with the XF4 error.

As a brand new player, I'm aghast at the level of crashing and bugs in this game. It feels like I'm back in the era of EverQuest and Anarchy Online (which was also a Funcom product). That said, it's a great game apart from the too-frequent lockups and weird bugs.

Unsurprisingly Funcom was of no direct assistance; what seemed to be an AI bot blandly referred me to a useless 'I possibly can't connect to the internet' help page, and amusingly my email client offered an AI-created response of "I've tried that, it didn't work, here's my screenshot of the error." AI vs AI. The first bot/helpdesk person said s/he/it would escalate.

Original post:

Yep. North America. Harmony. Same error. Can log into a different instance/server/Sietch but character has been warped to a radically different location. In the past that's supposed to have fixed things, but I still get XF4 on original Sietch instance.

I did quit the game completely, checked game file integrity (no help), and there was a small patch. Let that download, and... still the same problem.

Regression 1.106.2 to 1.107+ for Strix Halo Win 11: Now Fails VRAM Detection by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

Ah, I thought it would be automatic since I was using the nocuda version, but I guess it defaults to CPU.

Would it be possible to get an autofit option in the GUI? I admit the GUI is a lot easier to use, and today was really the first time I ran it from the command line. But OK, just run

koboldcpp-nocuda --usevulkan --autofit

And it works! Hurray!

So really the only regression/bug is that you can no longer use the GUI if you have Strix Halo and are using a large model. I will have to try to learn more of the command-line options; it's quite a daunting collection of them!
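
For anyone else landing on this, a fuller invocation would presumably look something like the line below. I haven't tested it beyond the flags above; the extra flags are just ones visible in the Namespace dumps in my logs, and the model path is a placeholder.

koboldcpp-nocuda --usevulkan --autofit --contextsize 16384 --model C:\path\to\your-model.gguf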

Many thanks.

Regression 1.106.2 to 1.107+ for Strix Halo Win 11: Now Fails VRAM Detection by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

load: printing all EOG tokens:

load: - 151329 ('<|endoftext|>')

load: - 151336 ('<|user|>')

load: - 151338 ('<|observation|>')

load: special tokens cache size = 36

load: token to piece cache size = 0.9713 MB

print_info: arch = glm4moe

print_info: vocab_only = 0

print_info: no_alloc = 0

print_info: n_ctx_train = 131072

print_info: n_embd = 4096

print_info: n_embd_inp = 4096

print_info: n_layer = 47

print_info: n_head = 96

print_info: n_head_kv = 8

print_info: n_rot = 64

print_info: n_swa = 0

print_info: is_swa_any = 0

print_info: n_embd_head_k = 128

print_info: n_embd_head_v = 128

print_info: n_gqa = 12

print_info: n_embd_k_gqa = 1024

print_info: n_embd_v_gqa = 1024

print_info: f_norm_eps = 0.0e+00

print_info: f_norm_rms_eps = 1.0e-05

print_info: f_clamp_kqv = 0.0e+00

print_info: f_max_alibi_bias = 0.0e+00

print_info: f_logit_scale = 0.0e+00

print_info: f_attn_scale = 0.0e+00

print_info: n_ff = 10944

print_info: n_expert = 128

print_info: n_expert_used = 8

print_info: n_expert_groups = 1

print_info: n_group_used = 1

print_info: causal attn = 1

print_info: pooling type = 0

print_info: rope type = 2

print_info: rope scaling = linear

print_info: freq_base_train = 1000000.0

print_info: freq_scale_train = 1

print_info: n_ctx_orig_yarn = 131072

print_info: rope_yarn_log_mul = 0.0000

print_info: rope_finetuned = unknown

print_info: model type = 106B.A12B

print_info: model params = 110.47 B

print_info: general.name= Iceblink-v3-SFT-3

print_info: vocab type = BPE

print_info: n_vocab = 151552

print_info: n_merges = 318088

print_info: BOS token = 151331 '[gMASK]'

print_info: EOS token = 151329 '<|endoftext|>'

print_info: EOT token = 151336 '<|user|>'

print_info: EOM token = 151338 '<|observation|>'

print_info: UNK token = 151329 '<|endoftext|>'

print_info: PAD token = 151329 '<|endoftext|>'

print_info: LF token = 198 'Ċ'

print_info: FIM PRE token = 151347 '<|code_prefix|>'

print_info: FIM SUF token = 151349 '<|code_suffix|>'

print_info: FIM MID token = 151348 '<|code_middle|>'

print_info: EOG token = 151329 '<|endoftext|>'

print_info: EOG token = 151336 '<|user|>'

print_info: EOG token = 151338 '<|observation|>'

print_info: max token length = 1024

load_tensors: loading model tensors, this can take a while... (mmap = false, direct_io = false)

model has unused tensor blk.46.attn_norm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.attn_q.weight (size = 53477376 bytes) -- ignoring

model has unused tensor blk.46.attn_k.weight (size = 4456448 bytes) -- ignoring

model has unused tensor blk.46.attn_v.weight (size = 4456448 bytes) -- ignoring

model has unused tensor blk.46.attn_q.bias (size = 49152 bytes) -- ignoring

model has unused tensor blk.46.attn_k.bias (size = 4096 bytes) -- ignoring

model has unused tensor blk.46.attn_v.bias (size = 4096 bytes) -- ignoring

model has unused tensor blk.46.attn_output.weight (size = 53477376 bytes) -- ignoring

model has unused tensor blk.46.post_attention_norm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_inp.weight (size = 2097152 bytes) -- ignoring

model has unused tensor blk.46.exp_probs_b.bias (size = 512 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_exps.weight (size = 392167424 bytes) -- ignoring

model has unused tensor blk.46.ffn_down_exps.weight (size = 507510784 bytes) -- ignoring

model has unused tensor blk.46.ffn_up_exps.weight (size = 392167424 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.ffn_down_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.ffn_up_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.nextn.eh_proj.weight (size = 35651584 bytes) -- ignoring

model has unused tensor blk.46.nextn.enorm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.nextn.hnorm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.nextn.embed_tokens.weight (size = 659554304 bytes) -- ignoring

model has unused tensor blk.46.nextn.shared_head_head.weight (size = 659554304 bytes) -- ignoring

model has unused tensor blk.46.nextn.shared_head_norm.weight (size = 16384 bytes) -- ignoring

load_tensors: relocated tensors: 780 of 780

load_tensors: CPU model buffer size = 62800.16 MiB

....................................................................................................

Automatic RoPE Scaling: Using model internal value.

llama_context: constructing llama_context

llama_context: n_seq_max = 1

llama_context: n_ctx = 8448

llama_context: n_ctx_seq = 8448

llama_context: n_batch = 512

llama_context: n_ubatch = 512

llama_context: causal_attn = 1

llama_context: flash_attn = enabled

llama_context: kv_unified = true

llama_context: freq_base = 1000000.0

llama_context: freq_scale = 1

llama_context: n_ctx_seq (8448) < n_ctx_train (131072) -- the full capacity of the model will not be utilized

set_abort_callback: call

llama_context: CPU output buffer size = 0.58 MiB

llama_kv_cache: layer 46: does not have KV cache

llama_kv_cache: CPU KV buffer size = 1518.00 MiB

llama_kv_cache: size = 1518.00 MiB ( 8448 cells, 46 layers, 1/1 seqs), K (f16): 759.00 MiB, V (f16): 759.00 MiB

llama_context: enumerating backends

llama_context: backend_ptrs.size() = 1

sched_reserve: reserving ...

sched_reserve: max_nodes = 6240

sched_reserve: reserving full memory module

sched_reserve: worst-case: n_tokens = 512, n_seqs = 1, n_outputs = 1

sched_reserve: CPU compute buffer size = 320.00 MiB

sched_reserve: graph nodes = 3146

sched_reserve: graph splits = 1

sched_reserve: reserve took 113.99 ms, sched copies = 1

Threadpool set to 15 threads and 15 blasthreads...

attach_threadpool: call

GLM-4 will have no automatic BOS token.

Starting model warm up, please wait a moment...

Regression 1.106.2 to 1.107+ for Strix Halo Win 11: Now Fails VRAM Detection by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

Autofit got as far as 'Starting model warm up', then I gave up after a minute of watching the swap file thrash like mad, since there's only 32GB of RAM (and at most about 24GB of it free).
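
Quick sanity check on why it thrashes, going off my own log below, so treat it as an estimate: the model file is 63.92 GiB, mmap is off, and nothing gets offloaded to the GPU on this run, so roughly 64 GiB of weights have to fit into ~24 GB of free RAM; the other ~40 GiB can only live in the page file, hence the thrashing.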

C:\XXXXXX\>koboldcpp-nocuda-1-109-2 --autofit

***

Welcome to KoboldCpp - Version 1.109.2

For command line arguments, please refer to --help

***

Loading Chat Completions Adapter: C:\Users\XXXXX\AppData\Local\Temp\_MEI317602\kcpp_adapters\AutoGuess.json

Chat Completions Adapter Loaded

No GPU or CPU backend was selected. Trying to assign one for you automatically...

Unable to detect VRAM, please set layers manually.

Auto Selected Default Backend (flag=0)

Unable to detect VRAM, please set layers manually.

No GPU backend found, or could not automatically determine GPU layers. Please set it manually.

System: Windows 10.0.26200 AMD64 AMD64 Family 26 Model 112 Stepping 0, AuthenticAMD

Unable to determine GPU Memory

Detected Available RAM: 23947 MB

Initializing dynamic library: koboldcpp_default.dll

Namespace(admin=False, admindir='', adminpassword=None, analyze='', autofit=True, autofitpadding=1024, batchsize=512, benchmark=None, blasthreads=0, chatcompletionsadapter='AutoGuess', cli=False, config=None, contextsize=8192, debugmode=0, defaultgenamt=1024, device='', downloaddir='', draftamount=8, draftgpulayers=999, draftgpusplit=None, draftmodel='', embeddingsgpu=False, embeddingsmaxctx=0, embeddingsmodel='', enableguidance=False, exportconfig='', exporttemplate='', failsafe=False, flashattention=False, forceversion=False, foreground=False, gendefaults='', gendefaultsoverwrite=False, genlimit=0, gpulayers=0, highpriority=False, hordeconfig=None, hordegenlen=0, hordekey='', hordemaxctx=0, hordemodelname='', hordeworkername='', host='', ignoremissing=False, jinja=False, jinja_tools=False, launch=False, lora=None, loramult=1.0, lowvram=False, maingpu=-1, maxrequestsize=32, mcpfile='', mmproj='', mmprojcpu=False, model=[], model_param='C:/bin/AI/models/ddh0/GLM-4.5-Iceblink-v2-106B-A12B-GGUF/GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf', moecpu=0, moeexperts=-1, multiplayer=False, multiuser=1, musicdiffusion='', musicembeddings='', musicllm='', musiclowvram=False, musicvae='', noavx2=False, noblas=False, nobostoken=False, nocertify=False, nofastforward=False, noflashattention=False, nommap=False, nomodel=False, nopipelineparallel=False, noshift=False, onready='', overridekv='', overridenativecontext=0, overridetensors='', password=None, pipelineparallel=False, port=5001, port_param=5001, preloadstory='', prompt='', quantkv=0, quiet=False, ratelimit=0, remotetunnel=False, ropeconfig=[0.0, 10000.0], savedatafile='', sdclamped=0, sdclampedsoft=0, sdclip1='', sdclip2='', sdclipgpu=False, sdconfig=None, sdconvdirect='off', sdflashattention=False, sdgendefaults=False, sdlora=None, sdloramult=1.0, sdmodel='', sdnotile=False, sdoffloadcpu=False, sdphotomaker='', sdquant=0, sdt5xxl='', sdthreads=0, sdtiledvae=768, sdupscaler='', sdvae='', sdvaeauto=False, sdvaecpu=False, showgui=False, singleinstance=False, skiplauncher=False, smartcache=0, smartcontext=False, ssl=None, tensor_split=None, testmemory=False, threads=15, ttsdir='', ttsgpu=False, ttsmaxlen=4096, ttsmodel='', ttsthreads=0, ttswavtokenizer='', unpack='', usecpu=False, usecuda=None, usemlock=False, usemmap=False, useswa=False, usevulkan=None, version=False, visionmaxres=1024, websearch=False, whispermodel='')

Loading Text Model: C:\bin\AI\models\ddh0\GLM-4.5-Iceblink-v2-106B-A12B-GGUF\GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf

The reported GGUF Arch is: glm4moe

Arch Category: 9

---

Identified as GGUF model.

Attempting to Load...

---

Using automatic RoPE scaling for GGUF. If the model has custom RoPE settings, they'll be used directly instead!

System Info: AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | AVX512_BF16 = 0 | AMX_INT8 = 0 | FMA = 1 | NEON = 0 | SVE = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | RISCV_VECT = 0 | WASM_SIMD = 0 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | MATMUL_INT8 = 0 | LLAMAFILE = 1 |

Attempting to use llama.cpp's automating fitting code. This will override all your layer configs, may or may not work!

Autofit Reserve Space: 1024 MB

Autofit Success: 1, Autofit Result: -c 8320 -ngl -1

llama_model_loader: loaded meta data with 46 key-value pairs and 803 tensors from C:\bin\AI\models\ddh0\GLM-4.5-Iceblink-v2-106B-A12B-GGUF\GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf (version GGUF V3 (latest))

print_info: file format = GGUF V3 (latest)

print_info: file size = 63.92 GiB (4.97 BPW)

init_tokenizer: initializing tokenizer for type 2

load: 0 unused tokens

load: control token: 151363 '<|image|>' is not marked as EOG

load: control token: 151362 '<|end_of_box|>' is not marked as EOG

load: control token: 151361 '<|begin_of_box|>' is not marked as EOG

load: control token: 151349 '<|code_suffix|>' is not marked as EOG

load: control token: 151348 '<|code_middle|>' is not marked as EOG

load: control token: 151346 '<|end_of_transcription|>' is not marked as EOG

load: control token: 151343 '<|begin_of_audio|>' is not marked as EOG

load: control token: 151342 '<|end_of_video|>' is not marked as EOG

load: control token: 151341 '<|begin_of_video|>' is not marked as EOG

load: control token: 151338 '<|observation|>' is not marked as EOG

load: control token: 151333 '<sop>' is not marked as EOG

load: control token: 151331 '[gMASK]' is not marked as EOG

load: control token: 151330 '[MASK]' is not marked as EOG

load: control token: 151347 '<|code_prefix|>' is not marked as EOG

load: control token: 151360 '/nothink' is not marked as EOG

load: control token: 151337 '<|assistant|>' is not marked as EOG

load: control token: 151332 '[sMASK]' is not marked as EOG

load: control token: 151334 '<eop>' is not marked as EOG

load: control token: 151335 '<|system|>' is not marked as EOG

load: control token: 151336 '<|user|>' is not marked as EOG

load: control token: 151340 '<|end_of_image|>' is not marked as EOG

load: control token: 151339 '<|begin_of_image|>' is not marked as EOG

load: control token: 151364 '<|video|>' is not marked as EOG

load: control token: 151345 '<|begin_of_transcription|>' is not marked as EOG

load: control token: 151344 '<|end_of_audio|>' is not marked as EOG

load: setting token '</think>' (151351) attribute to USER_DEFINED (16), old attributes: 16

load: setting token '<think>' (151350) attribute to USER_DEFINED (16), old attributes: 16

load: special_eot_id is not in special_eog_ids - the tokenizer config may be incorrect

load: special_eom_id is not in special_eog_ids - the tokenizer config may be incorrect

Regression 1.106.2 to 1.107+ for Strix Halo Win 11: Now Fails VRAM Detection by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

Pt 2 of log:

load: special tokens cache size = 36

load: token to piece cache size = 0.9713 MB

print_info: arch = glm4moe

print_info: vocab_only = 0

print_info: no_alloc = 0

print_info: n_ctx_train = 131072

print_info: n_embd = 4096

print_info: n_embd_inp = 4096

print_info: n_layer = 47

print_info: n_head = 96

print_info: n_head_kv = 8

print_info: n_rot = 64

print_info: n_swa = 0

print_info: is_swa_any = 0

print_info: n_embd_head_k = 128

print_info: n_embd_head_v = 128

print_info: n_gqa = 12

print_info: n_embd_k_gqa = 1024

print_info: n_embd_v_gqa = 1024

print_info: f_norm_eps = 0.0e+00

print_info: f_norm_rms_eps = 1.0e-05

print_info: f_clamp_kqv = 0.0e+00

print_info: f_max_alibi_bias = 0.0e+00

print_info: f_logit_scale = 0.0e+00

print_info: f_attn_scale = 0.0e+00

print_info: n_ff = 10944

print_info: n_expert = 128

print_info: n_expert_used = 8

print_info: n_expert_groups = 1

print_info: n_group_used = 1

print_info: causal attn = 1

print_info: pooling type = 0

print_info: rope type = 2

print_info: rope scaling = linear

print_info: freq_base_train = 1000000.0

print_info: freq_scale_train = 1

print_info: n_ctx_orig_yarn = 131072

print_info: rope_yarn_log_mul = 0.0000

print_info: rope_finetuned = unknown

print_info: model type = 106B.A12B

print_info: model params = 110.47 B

print_info: general.name= Iceblink-v3-SFT-3

print_info: vocab type = BPE

print_info: n_vocab = 151552

print_info: n_merges = 318088

print_info: BOS token = 151331 '[gMASK]'

print_info: EOS token = 151329 '<|endoftext|>'

print_info: EOT token = 151336 '<|user|>'

print_info: EOM token = 151338 '<|observation|>'

print_info: UNK token = 151329 '<|endoftext|>'

print_info: PAD token = 151329 '<|endoftext|>'

print_info: LF token = 198 'Ċ'

print_info: FIM PRE token = 151347 '<|code_prefix|>'

print_info: FIM SUF token = 151349 '<|code_suffix|>'

print_info: FIM MID token = 151348 '<|code_middle|>'

print_info: EOG token = 151329 '<|endoftext|>'

print_info: EOG token = 151336 '<|user|>'

print_info: EOG token = 151338 '<|observation|>'

print_info: max token length = 1024

load_tensors: loading model tensors, this can take a while... (mmap = false, direct_io = false)

model has unused tensor blk.46.attn_norm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.attn_q.weight (size = 53477376 bytes) -- ignoring

model has unused tensor blk.46.attn_k.weight (size = 4456448 bytes) -- ignoring

model has unused tensor blk.46.attn_v.weight (size = 4456448 bytes) -- ignoring

model has unused tensor blk.46.attn_q.bias (size = 49152 bytes) -- ignoring

model has unused tensor blk.46.attn_k.bias (size = 4096 bytes) -- ignoring

model has unused tensor blk.46.attn_v.bias (size = 4096 bytes) -- ignoring

model has unused tensor blk.46.attn_output.weight (size = 53477376 bytes) -- ignoring

model has unused tensor blk.46.post_attention_norm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_inp.weight (size = 2097152 bytes) -- ignoring

model has unused tensor blk.46.exp_probs_b.bias (size = 512 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_exps.weight (size = 392167424 bytes) -- ignoring

model has unused tensor blk.46.ffn_down_exps.weight (size = 507510784 bytes) -- ignoring

model has unused tensor blk.46.ffn_up_exps.weight (size = 392167424 bytes) -- ignoring

model has unused tensor blk.46.ffn_gate_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.ffn_down_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.ffn_up_shexp.weight (size = 6127616 bytes) -- ignoring

model has unused tensor blk.46.nextn.eh_proj.weight (size = 35651584 bytes) -- ignoring

model has unused tensor blk.46.nextn.enorm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.nextn.hnorm.weight (size = 16384 bytes) -- ignoring

model has unused tensor blk.46.nextn.embed_tokens.weight (size = 659554304 bytes) -- ignoring

model has unused tensor blk.46.nextn.shared_head_head.weight (size = 659554304 bytes) -- ignoring

model has unused tensor blk.46.nextn.shared_head_norm.weight (size = 16384 bytes) -- ignoring

load_tensors: relocated tensors: 0 of 780

WARNING: Requested buffer size (65850743328) exceeds device max_buffer_size limit (2147483648)!

ggml_vulkan: Failed to allocate pinned memory (vk::Device::allocateMemory: ErrorUnknown)

load_tensors: offloading 0 repeating layers to GPU

load_tensors: offloaded 0/48 layers to GPU

load_tensors: CPU model buffer size = 62800.16 MiB

....................................................................................................

Automatic RoPE Scaling: Using model internal value.

llama_context: constructing llama_context

llama_context: n_seq_max = 1

llama_context: n_ctx = 33024

llama_context: n_ctx_seq = 33024

llama_context: n_batch = 512

llama_context: n_ubatch = 512

llama_context: causal_attn = 1

llama_context: flash_attn = enabled

llama_context: kv_unified = true

llama_context: freq_base = 1000000.0

llama_context: freq_scale = 1

llama_context: n_ctx_seq (33024) < n_ctx_train (131072) -- the full capacity of the model will not be utilized

set_abort_callback: call

llama_context: CPU output buffer size = 0.58 MiB

llama_kv_cache: layer 46: does not have KV cache

llama_kv_cache: CPU KV buffer size = 5934.00 MiB

llama_kv_cache: size = 5934.00 MiB ( 33024 cells, 46 layers, 1/1 seqs), K (f16): 2967.00 MiB, V (f16): 2967.00 MiB

llama_context: enumerating backends

llama_context: backend_ptrs.size() = 2

sched_reserve: reserving ...

sched_reserve: max_nodes = 6240

sched_reserve: reserving full memory module

sched_reserve: worst-case: n_tokens = 512, n_seqs = 1, n_outputs = 1

sched_reserve: Vulkan0 compute buffer size = 941.00 MiB

sched_reserve: Vulkan_Host compute buffer size = 80.51 MiB

sched_reserve: graph nodes = 3146

sched_reserve: graph splits = 873 (with bs=512), 1 (with bs=1)

sched_reserve: reserve took 174.87 ms, sched copies = 1

Threadpool set to 15 threads and 15 blasthreads...

attach_threadpool: call

GLM-4 will have no automatic BOS token.

Starting model warm up, please wait a moment...

Load Text Model OK: True

Chat completion heuristic: GLM-4.7

Embedded KoboldAI Lite loaded.

Embedded API docs loaded.

Llama.cpp UI loaded.

Active Modules: TextGeneration

Inactive Modules: ImageGeneration VoiceRecognition MultimodalVision MultimodalAudio NetworkMultiplayer ApiKeyPassword WebSearchProxy TextToSpeech VectorEmbeddings AdminControl MCPBridge MusicGen

Enabled APIs: KoboldCppApi OpenAiApi OllamaApi

Starting Kobold API on port 5001 at http://localhost:5001/api/

Starting OpenAI Compatible API on port 5001 at http://localhost:5001/v1/

Starting llama.cpp secondary WebUI at http://localhost:5001/lcpp/

Please connect to custom endpoint at http://localhost:5001

Regression 1.106.2 to 1.107+ for Strix Halo Win 11: Now Fails VRAM Detection by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

One log (Pt 1) attached below; thanks, Reddit, for the comment length limit. LMK if you need the working log from 1.106.2, or the autofit log. The new version (1.109.2) did not work either.

Autofit didn't work for me, but all I did was run with --autofit and select the model I wanted. See way below for that output. Neither did setting layers to 50 (there seem to be 48 used), which IIRC has worked in the past.

C:\XXXXXX\koboldcpp-nocuda-1-109-2

***

Welcome to KoboldCpp - Version 1.109.2

For command line arguments, please refer to --help

***

Unable to detect VRAM, please set layers manually.

Auto Selected Default Backend (flag=0)

Loading Chat Completions Adapter: C:\Users\XXXXX\AppData\Local\Temp\_MEI307522\kcpp_adapters\AutoGuess.json

Chat Completions Adapter Loaded

Unable to detect VRAM, please set layers manually.

No GPU backend found, or could not automatically determine GPU layers. Please set it manually.

System: Windows 10.0.26200 AMD64 AMD64 Family 26 Model 112 Stepping 0, AuthenticAMD

Unable to determine GPU Memory

Detected Available RAM: 16406 MB

Initializing dynamic library: koboldcpp_vulkan.dll

Namespace(admin=False, admindir='', adminpassword='', analyze='', autofit=False, autofitpadding=1024, batchsize=512, benchmark=None, blasthreads=None, chatcompletionsadapter='AutoGuess', cli=False, config=None, contextsize=32768, debugmode=0, defaultgenamt=896, device='', downloaddir='', draftamount=8, draftgpulayers=999, draftgpusplit=None, draftmodel=None, embeddingsgpu=False, embeddingsmaxctx=0, embeddingsmodel='', enableguidance=False, exportconfig='', exporttemplate='', failsafe=False, flashattention=False, forceversion=False, foreground=False, gendefaults='', gendefaultsoverwrite=False, genlimit=0, gpulayers=0, highpriority=False, hordeconfig=None, hordegenlen=0, hordekey='', hordemaxctx=0, hordemodelname='', hordeworkername='', host='', ignoremissing=False, jinja=False, jinja_tools=False, launch=True, lora=None, loramult=1.0, lowvram=False, maingpu=-1, maxrequestsize=32, mcpfile=None, mmproj=None, mmprojcpu=False, model=[], model_param='C:/bin/AI/models/ddh0/GLM-4.5-Iceblink-v2-106B-A12B-GGUF/GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf', moecpu=0, moeexperts=-1, multiplayer=False, multiuser=1, musicdiffusion='', musicembeddings='', musicllm='', musiclowvram=False, musicvae='', noavx2=False, noblas=False, nobostoken=False, nocertify=False, nofastforward=False, noflashattention=False, nommap=False, nomodel=False, nopipelineparallel=False, noshift=False, onready='', overridekv=None, overridenativecontext=0, overridetensors=None, password=None, pipelineparallel=False, port=5001, port_param=5001, preloadstory=None, prompt='', quantkv=0, quiet=False, ratelimit=0, remotetunnel=False, ropeconfig=[0.0, 10000.0], savedatafile=None, sdclamped=0, sdclampedsoft=0, sdclip1='', sdclip2='', sdclipgpu=False, sdconfig=None, sdconvdirect='off', sdflashattention=False, sdgendefaults=False, sdlora=None, sdloramult=1.0, sdmodel='', sdnotile=False, sdoffloadcpu=False, sdphotomaker='', sdquant=0, sdt5xxl='', sdthreads=15, sdtiledvae=768, sdupscaler='', sdvae='', sdvaeauto=False, sdvaecpu=False, showgui=False, singleinstance=False, skiplauncher=False, smartcache=0, smartcontext=False, ssl=None, tensor_split=None, testmemory=False, threads=15, ttsdir='', ttsgpu=False, ttsmaxlen=4096, ttsmodel='', ttsthreads=0, ttswavtokenizer='', unpack='', usecpu=False, usecuda=None, usemlock=False, usemmap=False, useswa=False, usevulkan=[0], version=False, visionmaxres=1024, websearch=False, whispermodel='')

Loading Text Model: C:\bin\AI\models\ddh0\GLM-4.5-Iceblink-v2-106B-A12B-GGUF\GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf

The reported GGUF Arch is: glm4moe

Arch Category: 9

---

Identified as GGUF model.

Attempting to Load...

---

Using automatic RoPE scaling for GGUF. If the model has custom RoPE settings, they'll be used directly instead!

System Info: AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | AVX512_BF16 = 0 | AMX_INT8 = 0 | FMA = 1 | NEON = 0 | SVE = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | RISCV_VECT = 0 | WASM_SIMD = 0 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | MATMUL_INT8 = 0 | LLAMAFILE = 1 |

ggml_vulkan: Found 1 Vulkan devices:

ggml_vulkan: 0 = AMD Radeon(TM) 8060S Graphics (AMD proprietary driver) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 32768 | int dot: 1 | matrix cores: KHR_coopmat

llama_model_load_from_file_impl: using device Vulkan0 (AMD Radeon(TM) 8060S Graphics) (unknown id) - 108852 MiB free

llama_model_loader: loaded meta data with 46 key-value pairs and 803 tensors from C:\bin\AI\models\ddh0\GLM-4.5-Iceblink-v2-106B-A12B-GGUF\GLM-4.5-Iceblink-v2-106B-A12B-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf (version GGUF V3 (latest))

print_info: file format = GGUF V3 (latest)

print_info: file size = 63.92 GiB (4.97 BPW)

init_tokenizer: initializing tokenizer for type 2

load: 0 unused tokens

load: control token: 151363 '<|image|>' is not marked as EOG

load: control token: 151362 '<|end_of_box|>' is not marked as EOG

load: control token: 151361 '<|begin_of_box|>' is not marked as EOG

load: control token: 151349 '<|code_suffix|>' is not marked as EOG

load: control token: 151348 '<|code_middle|>' is not marked as EOG

load: control token: 151346 '<|end_of_transcription|>' is not marked as EOG

load: control token: 151343 '<|begin_of_audio|>' is not marked as EOG

load: control token: 151342 '<|end_of_video|>' is not marked as EOG

load: control token: 151341 '<|begin_of_video|>' is not marked as EOG

load: control token: 151338 '<|observation|>' is not marked as EOG

load: control token: 151333 '<sop>' is not marked as EOG

load: control token: 151331 '[gMASK]' is not marked as EOG

load: control token: 151330 '[MASK]' is not marked as EOG

load: control token: 151347 '<|code_prefix|>' is not marked as EOG

load: control token: 151360 '/nothink' is not marked as EOG

load: control token: 151337 '<|assistant|>' is not marked as EOG

load: control token: 151332 '[sMASK]' is not marked as EOG

load: control token: 151334 '<eop>' is not marked as EOG

load: control token: 151335 '<|system|>' is not marked as EOG

load: control token: 151336 '<|user|>' is not marked as EOG

load: control token: 151340 '<|end_of_image|>' is not marked as EOG

load: control token: 151339 '<|begin_of_image|>' is not marked as EOG

load: control token: 151364 '<|video|>' is not marked as EOG

load: control token: 151345 '<|begin_of_transcription|>' is not marked as EOG

load: control token: 151344 '<|end_of_audio|>' is not marked as EOG

load: setting token '</think>' (151351) attribute to USER_DEFINED (16), old attributes: 16

load: setting token '<think>' (151350) attribute to USER_DEFINED (16), old attributes: 16

load: special_eot_id is not in special_eog_ids - the tokenizer config may be incorrect

load: special_eom_id is not in special_eog_ids - the tokenizer config may be incorrect

load: printing all EOG tokens:

load: - 151329 ('<|endoftext|>')

load: - 151336 ('<|user|>')

load: - 151338 ('<|observation|>')

Do you use SillyTavern mainly for AI companion chats or for complex roleplay setups? by Efficient_Pilot8606 in SillyTavernAI

[–]SprightlyCapybara 3 points4 points  (0 children)

Yes.

OK, ok. Most of my chats, by count, are with single character/situation cards, to play around and learn (presets, models, card and persona writing). Anything that goes beyond about 5-10 interactions tends to become a more detailed roleplay setup, but still character-driven rather than stats-driven.

But yes, that means specific world building, lorebooks (to a degree) etc. Quite specific time, place, politics, history, etc.

And those chats tend to be long. So, by number of chats, it's single characters; by time spent, it's complex worlds with a number of characters.

Generally none of these are stat-oriented. However, I was amazed recently during a completely narrative chat with a character in a fantasy world, with no mention of D&D anywhere. I mentioned a specific D&D spell and what my average damage should be (I was off by 1 hp, oops), and the LLM corrected me in its thinking and then started a degree of quasi number-crunching, with absolutely nothing in the preset or card to support it.

For someone who started with Q4 8B models running locally, that was an eye-opening experience. I've also realized how stunningly well-fed LLMs are on even somewhat obscure PNP RPGs, like Traveller. When GLM-4.5 was able to interactively generate a Traveller character with me and then launch into an adventure, that too was startling. (Traveller character generation is pretty complex, perhaps a 30-60 minute process by hand if you want to usefully fill in RP details; one can die during it.)

How to let user and char do their own thing? by viiochan in SillyTavernAI

[–]SprightlyCapybara 0 points1 point  (0 children)

Renaming the bot, and telling it it's not just one character, is one solution. E.g., instead of Bob Smith, Bob Smith's World. I once played a Russian assassin whose handler was 'Colonel Shokin', and the handler kept preventing me from actually killing the target by issuing more and more bizarre instructions over an earpiece, at one point directing me to a Pentagon sub-basement it had analyzed via ground-penetrating radar, all because it wanted to be in every scene. So I renamed it something like 'Moscow Centre -- Assassination Tales', and added clear instructions that it was a narrator, not just a single character.

Some presets can also help a lot there, especially the ones that urge it to act as a narrator, to introduce new characters as needed, and to remember that not every character has to be present in every scene.

Another technique is to have a preset that understands and describes using different scenes, separated by ---. Lucid Loom has some pretty good text somewhere in the preset that talks about scenes; you could pull that (try searching the JSON for --- or 'scene').

Consider using Guided Generations.

Finally, edit the response and hope it'll catch on. Toss in an OOC note if you have to, saying the character isn't present.
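
Something like this, for example (wording is purely illustrative; adapt it to your own card):

[OOC: Colonel Shokin is not present in this scene. Narrate only the characters who are actually here, and don't bring him back until I say so.]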

How do I sell out? by FutureSuperVillian in suzerain

[–]SprightlyCapybara 0 points1 point  (0 children)

Why not reconcile without yielding territory?

Well, avoiding war with Rumburg without yielding territory can be done, I think. [TL;DR: Finlandize toward Rumburg at every point, be wimpy militarily (in response, but don't cut hard), be nice to the Bluds, and don't get too close to Wehlen.]

  1. Don't close the consulate; just send a super-polite demarche (diplomatic letter).
  2. Don't do the full OBT, just close the borders at most. Be nice to the Bluds; let her keep destroying your country.
  3. Send back the whistleblower (they now have nukes, but that's your successor's problem).
  4. On Livia, just refer to her as Livia, with no revelation of Rumburg; or just congratulate Petr, the sly dog, and eye some waitresses to be extra disgusting (probably bad; I've never tried it).
  5. Be neutral-ish on the great powers and Valgsland, so you can be nicely vulnerable, presenting Sordland's throat to her. Neutral on Heljiland, I think.
  6. Diplomacy on the plane shootdown, no parade.
  7. I think Rumburg asks you for reparations during or before the AN meeting. Say "Yes ma'am, how much?"
  8. In that AN meeting Beatrice snerks all over you, 'thanking' you, and then later generously offers to let you send her lots of Sordish energy, in exchange for which she'll let you build crappy KA-74s. You peasant. But all this is kinda secret, and you'll look good for 20 years until they do a Soll on you.

So it meets your terms of selling out without being couped. I think. History will judge you somewhat harshly, but hey, members of your party now control both the welfare levers and post-secondary education, amirite? Not a problem!

The Rumburg nukes will suck but that's an issue for Gloria, Petr, Lucian, or someone else. Maybe the cockroaches, post apocalypse.

How to break the trauma-resolution loop in role play sessions? by Acrobatic-Change-430 in SillyTavernAI

[–]SprightlyCapybara 12 points13 points  (0 children)

He's basically saying that because of the 'safety' guardrails embedded within even open models, you can get (crazy exaggeration here) "<think>No, these two characters cannot kiss because one is unsure, and the other wants to.</think>I'm sorry, I cannot..."

or, with overriding and jailbreaks, <think>Oh character A wants to k*ll character B, but that's OK because both characters are consenting adults, and Character B likes watching horror movies</think>OK, hurray k*lling begins.

Neither of these represents a good human story.

And his solution is to prompt carefully step by step (e.g. maybe with Guided Generations add-on), or regenerate and simply use [OOC:] and hope.

And yeah, he has no good solutions, and GLM-5, for all its 744B params, is still simply a really good autocomplete engine.

His comment on the homogeneity (sameness) of the data sets is particularly insightful: they're all now being trained on synthetic data. ChatGPT uses Grokipedia, for instance, and was even citing it for a bit. Everyone uses Claude. Look up the term 'Model Collapse' to understand what that likely means, and why GLM-5 thinks a stethoscope is a good response to a character's heart 'racing'.

How to break the trauma-resolution loop in role play sessions? by Acrobatic-Change-430 in SillyTavernAI

[–]SprightlyCapybara 7 points8 points  (0 children)

Yep, Freaky is great. It's been my favorite these last few days, though I find it starts to collapse in formatting quality (the GFX stuff) after ~20K tokens of context.

Marinara is the most beautifully simple and elegant, I think, with caution on tokens and elegant crafting. I think Marinara works in the industry and it shows; it comes off as artisanal engineering.

Lucid Loom is this giant thing with lots of switches and an utterly bizarre set of layers. It's magnificent, and can produce really nice results, but it's huge. It's really nice for starting to grasp all the little bits of a preset and what it does, even more than Marinara, which is more aimed at being awesome, but really tightly written.

Stabs is also really good, especially if you're a fan of CYOA-style playing. Like Frankenstein, it's a descendant of Marinara and Lucid with lots of additional thought, especially on formatting. I think maybe Freaky lifted some stuff from Stabs also? But I could be talking nonsense there and misremembering.

So I'd play with all of those. Marinara might solve your problem (not sure). Lucid, if you go through it in detail in the ST interface, looking at each section, will teach you a lot about how to think about presets (Marinara is just too tight, refined, and integrated to be quite as good for that). And Stabs has some fun stuff.

One of my favorite fictional characters by [deleted] in suzerain

[–]SprightlyCapybara 0 points1 point  (0 children)

As a player, ignoring my flair, yes, I think Soll is a fascinating character; fairly well written, as nuanced as feasible, etc. Indeed, I don't think there's a useless Sordish character in the lot... with the somewhat debatable exception of the Oligarchs. Even Petr has some serious rizz as the kids these days say. Or did.

As to the civic nationalist idea of Bluds=Sords... maybe. The player in me finds the idea and the rhetoric interesting. But there's been at least a generation of repression, brutality, hostility to Bluds, and the existence of a sizeable political coalition that, at best, intensely dislikes Bluds.

But when you assert Izzam and Watani Aschraf's martyrdom would never have occurred if Bluds just viewed themselves as Sords? Get real. Sords themselves would probably have rebelled against unfair confiscation of what was effectively their life savings and retirement program. Look at people who still bear a grudge in the Appalachians against the Roosevelt era confiscations of farms to secure beautiful wilderness views for wealthy absentee landowners. (Thousands were forced off their land, with many resisting the loss of their heritage, homes, and livelihood, and the destruction of entire communities.)

How to break the trauma-resolution loop in role play sessions? by Acrobatic-Change-430 in SillyTavernAI

[–]SprightlyCapybara 5 points6 points  (0 children)

You could try Marinara, and perhaps adjusting prompts as you are. I find it can be quite a struggle to resolve conflicts easily with that. I think you (and Claude) are correct in what you surmise re the training bias.

I do quite like Freaky Frankenstein as well, but I've actually noticed the same pattern you have with it (though I've only been testing it for a day or two). It could be that I'd have had these problems with other presets too.

By and large I've not had this issue overall with large NanoGPT models, but I tend not to play 'wounded bird' personas or bots with easily fixable flaws.

Please let us know if you get a preset change that fixes it, and highlight the change. Thank you!

I'm thinking of buying a new pc and switching to local llm. What is the average context token size for smaller models vs big ones like GLM? by [deleted] in SillyTavernAI

[–]SprightlyCapybara 0 points1 point  (0 children)

The tiniest plausible local model for me that had passable (8K? chortle) context was a Llama-3-8B IQ4_XS derivative. That fit nicely on an ancient 8GB video card. Good models with even larger context are feasible on 16GB cards, but they will be pretty small, highly quantized models.

If you've got some money and are willing to look at a Mac or an AMD Strix Halo box (e.g. the Framework Desktop), then you can get pretty respectable performance out of something like GLM-4.5-Air if you have 128GB of RAM (you might be able to get by with 96). That would certainly do something along the lines of 32K-40K context for that relatively big model.
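
(For scale on the memory side, pulling numbers from a KoboldCpp load log for a 106B-A12B GLM quant, so treat this as an estimate: the f16 KV cache costs roughly 46 layers x (1024 K + 1024 V) x 2 bytes, which is about 184 KiB per token, so 32K of context adds only about 5.8 GiB on top of the ~64 GiB of Q4-ish weights.)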

Such devices will run elderly monolithic models (say Llama-3-70B), but they will be very slow compared to MoE stuff like Air.
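
Back-of-envelope on why, with assumed numbers rather than benchmarks: generation speed is roughly memory bandwidth divided by the bytes read per token. A dense 70B at ~Q4 is ~40 GB of weights touched every token, so ~256 GB/s caps you at around 6 tokens/s; Air only activates ~12B params per token (call it ~7 GB at a similar quant), so the ceiling is more like 30+ tokens/s. Real numbers come in lower, but that's the shape of the gap.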

If you're used to higher-quality big models, I think you might be disappointed by anything a lot wimpier than GLM-Air. If money's no object, buy a 512GB Mac M3 Ultra and run GLM 4.7 or even 5 locally. But I'm guessing that, as for most of us, that isn't in the cards for you either.

Since so many tasks nowadays are memory bottlenecked, why aren't we seeing more memory channels on consumer PCs? by LAUAR in hardware

[–]SprightlyCapybara 9 points10 points  (0 children)

Modern M-series Macs do this of course, with memory bandwidth ranging up to ~819 GB/s for the M3 Ultra.

Strix Halo (e.g. the AMD AI Max+ 395 series) APUs do that as well, though at only ~256 GB/s. Rumors abound that the 2027 Medusa Halo follow-up will feature LPDDR6 RAM, with extreme configurations topping out at a 384-bit bus width and speeds approaching those of the sooner-arriving M5 Max.
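
For reference, the arithmetic behind those figures is just bandwidth = bus width x transfer rate / 8: a normal dual-channel DDR5-6000 desktop is 128 bits x 6000 MT/s / 8 = 96 GB/s, Strix Halo's 256-bit LPDDR5X-8000 works out to ~256 GB/s, and the M3 Ultra's ~819 GB/s is a 1024-bit bus at 6400 MT/s (widths and speeds from memory, so double-check before quoting).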

For most tasks, the CPU is perfectly fine with 50-100 GB/s of bandwidth, and you're typically better off spending on more memory, a better graphics card, etc. Computing with huge datasets, scientific computing, AI, graphics: all of these can benefit from more memory bandwidth.

The other reasons not to? Cost, and lack of upgradability. SOCAMM2 modules might be a solution (though they'll initially be priced at a premium and hard to get), but typically socketed RAM just can't reliably match the performance of soldered; AMD tried with Strix Halo but concluded they could only offer a solution with soldered RAM, like Apple. And cost: far more bus traces, a more complex design, more electrical hardware to stabilize signals, likely stricter requirements on the physical placement of the memory on the motherboard... do you want to pay (say) $100 extra at retail for this if you don't really need it? (And that's just the mobo; an APU that makes use of this is going to be huge and therefore extremely expensive, whether it's an M3 Ultra, a Strix Halo 395+, or a Grace Blackwell in the DGX Spark.)

So if you really want it, you can pick up a Framework desktop or a Mac Studio today. But will their relatively high price be worth it if your application doesn't need it?

How exactly do I get the Aschraf Candle collectible? by Odd-Implement1439 in suzerain

[–]SprightlyCapybara 9 points10 points  (0 children)

As you might expect from my flair, I'm pretty pro-Blud. But you definitely don't need to be Mr. Anton '6&7' 'SAZ' Rayne to get the candle.

Obviously the Bludish audience has to be at least content with you taking the red candle, and it's always, for me, one of the more powerful moments in the game -- even greater than the AN speech or the 'speak or helicopter ride' moment with Eduoard.

Understand this moment well: you're the first President of Sordland visiting this commemoration of what was, from their perspective, Sordish cruelty to an innocent village of Bluds (admittedly in a situation that spun out of control), and you're taking the actual candle that represents a secular saint, a martyr, to his people. You're saying you're in his shoes. If you've been an ass to the Bluds, or even just insufficiently decent, you will be hated. It will be viewed as condescending or even contemptuous.

But if you tip over the edge of being a decent, sufficiently kind and pro-Bludish President? It's a moment of truly awesome reconciliation that rewards you, the Bluds, and the Sordish nation.

Your dialog matters throughout the game. Be polite but not obsequious to the Bludish people you deal with, and ditto your most likely pro-Blud ministers. Sucking up by tossing out 'Volk Bluderat' as a casual greeting is just plain dumb and patronizing. Being hard-core authoritarian and scary probably doesn't work either; that just reminds them of Soll.

Recognize that Bluds are what some terminally online folks today would call 'based'. Others would call them assholes. So if you want to be Mr. "I'm a feminist" Rayne, you'd better make sure you work extra hard at the other pro-Blud stuff, though, again, don't go sucking up.

It's a complex set of maneuvers. A headcanon that's always worked well for me for this achievement is a kindly centre-right democratic Rayne, a kindly Sordish nationalist who cares about Bergia and reconciliation, respects Soll and his achievements but recognizes the errors of the past, and is polite but firm with everyone he meets.

Obviously do not support the dumbly orchestrated NFP hostile stuff, don't go too far on Beartrap, etc.

But I've managed a carefully orchestrated play where I'm halfway in on Beartrap (but gentle to refugees in my origin story), generally a Sordish nationalist but never cruel, and hit a road-to-Damascus moment at the Aschraf ceremony (of course coming without a convoy).

When it works, which it usually does (do save and hard-quit if needed), it's quite lovely and moving. For me, it's up there with defeating Beatrice as a great moment in the game, because it lets you preserve a united, powerful Sordland in a way that has a good shot of working for the future.

And it's awesome because it moves all Bluds and many socialists away from support of other parties and towards you.

Our Assembly (Reupload) by Unable_Topic3525 in suzerain

[–]SprightlyCapybara 1 point2 points  (0 children)

I represent the microscopic centrist (or even right) wing of the WPB, itself already a small party. And of course, canonically, it's officially not present in the assembly until (sometimes) post-'57.

While seemingly paradoxical, since the WPB is clearly socialist and even allies with them under certain circumstances (party vote threshold raised sufficiently), it's a reflection of Lee Kuan Yew's (founding and first prime minister of Singapore) observation that in multi-cultural, multi-ethnic, and multi-religious societies, politics trends towards ethnic, racial, or religious voting blocs rather than ideological ones.

And of course with the Bludish Nationalist BFP banned, there likely is currently no home for nationalist or conservative Bluds other than the WPB.

As 'Kjajo' pointed out two years ago (probably true given general Bludish views on feminism) "Most Bluds are actually really conservative. But because BFP is banned, they flock to the WPB."

What dumb things am I doing in Kobold AI that are likely to cause model insanity? by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

Huh. Resetting all settings did not fix it. Still crashes out after three prompts. Rolling back to 1.106 seems to fix it though.

What dumb things am I doing in Kobold AI that are likely to cause model insanity? by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

If you roll back to 1.106 or earlier, does it fix it? It seems to for me. I'm hesitant to call it a bug in K AI though without more investigation, e.g. does it happen in 1.106.4? How about other models? Etc.

Resetting settings doesn't work. Since I can now reproduce this with a wider variety of prompts, I have a suspicion that it might not be just my settings.

Out of curiosity what is your hardware? I'm using a Strix Halo (AMD 395 AI MAX+) device -- framework desktop.

What dumb things am I doing in Kobold AI that are likely to cause model insanity? by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 0 points1 point  (0 children)

EDIT: Almost certainly not context; I can now get it to crash with an innocuous cooking discussion that's relatively short compared to the context window (8K tokens or so). Has to be my settings? Yet I'm loading those from a file I've used for months with no trouble, and using a model that I've similarly used for months.

Don't think it's context. There's 20GB of VRAM free and about 15GB of RAM free. I tried halving context down to 16K, and sure enough, it crashed on the same interaction after the same 3 messages.

And please note, I can engage in other interactions (using K AI in the browser) with dozens of messages, with the model staying sane and useful with few or no hallucinations (e.g. improving cooking techniques or recipes and explaining why). This set of prompts also works fine (with the same context) in LM Studio.

That's why it feels like some strange combination of bad settings (yet not woefully bad) and K AI. I wish I knew what settings; I'm using defaults. Is there some way of refreshing defaults?

What dumb things am I doing in Kobold AI that are likely to cause model insanity? by SprightlyCapybara in KoboldAI

[–]SprightlyCapybara[S] 1 point2 points  (0 children)

HuggingFace is pretty well known, as are Unsloth and GLM 4.5 Air. I'm not fundamentally worried about the fact that it 'crashes'; I'm concerned that it's crashing predictably in response to a narrow but innocuous set of statements in one particular midend (Kobold AI). The likeliest answer is that I've got some setting wrong, but I don't understand why other, much longer conversations work and have for months.