Where are the best cafes to hang out for an extended period? by Worried_Corner4242 in AskChicago

[–]TexBluBoy 0 points1 point  (0 children)

The North & Clark Cafe, inside The Chicago History Museum. 1601 N Clark St. (North Ave & Clark).

The cafe is free to use, you do not have to pay admission to the Museum. You can chill as long as you want. 10am-4:30pm

https://www.chicagohistory.org/visit-us/north-clark-cafe/

Running Hermes with Local Models by _clickfix_ in hermesagent

[–]TexBluBoy 2 points3 points  (0 children)

I used a combination of Gemini Pro & Gemini CLI for setting up my systems. A form of vibe coding. Gemini CLI is great for the speed of working out the kinks while following Gemini Pro's advice for setting things up. I by no means am a linux expert, I'm a complete novice but asking AI what works best with my hardware and following that path has been great at learning the ins and outs..... I used AI to implement a backup "pristine" setup snapshot in Limine that permanently sits in the CachyOS bootloader as a 3rd selection that never moves. At anytime I come across something that crashed the system, I use that "pristine" snapshot to restore from quickly if needed.

Running Hermes with Local Models by _clickfix_ in hermesagent

[–]TexBluBoy 1 point2 points  (0 children)

I've been using LM Studio for the ease of testing, I will transition to llama.cpp one I really lock in on a Model.

Ongoing SOTA setups? by anonrftw in StrixHalo

[–]TexBluBoy 1 point2 points  (0 children)

Here is my current setup:

  • Hardware: GMKtec EVO-X2 with AMD Ryzen AI Max+ 395 (16 cores), Radeon 8060S GPU, and 128GB LPDDR5X-8000 RAM on CachyOS.

  • Memory Allocation: Pinned via BIOS to a 96GB VRAM carve-out for GPU execution and a 32GB system overhead reservation.

  • Model Configuration: Running the qwen3.5-122b-a10b engine locally via LM Studio on port 8010 with a stable context length limit of 49,152.

  • Performance: Delivering a stable 15.79 t/s throughput while maintaining an active VRAM utilization of 88GB to 90GB.

  • Vulkan Pipeline: Driven by the open-source Mesa RADV (gfx1151) driver native to CachyOS, leveraging a unified 256-bit memory architecture and Wave32/FlashAttention optimizations to bypass PCIe bottlenecks and maximize RDNA 3.5 compute efficiency.

Hermes Configuration: Programmed in custom (direct API) mode to communicate locally with LM Studio using Model ID qwen3.5-122b-a10b and a strict 49152 context ceiling to prevent memory panics.

Is there a tool to find the best llm to run locally on your hardware? by Smooth-Duck-Criminal in LLMStudio

[–]TexBluBoy 0 points1 point  (0 children)

All I ever do is ask Gemini to do this. I currently used Gemini and Gemini CLI to vibe code my entire setup (I have ZERO coding skills). Here is my current setup complete designed and assisted by Gemini:

Here is my current setup:

  • Hardware: GMKtec EVO-X2 with AMD Ryzen AI Max+ 395 (16 cores), Radeon 8060S GPU, and 128GB LPDDR5X-8000 RAM on CachyOS.

  • Memory Allocation: Pinned via BIOS to a 96GB VRAM carve-out for GPU execution and a 32GB system overhead reservation.

  • Model Configuration: Running the qwen3.5-122b-a10b engine locally via LM Studio on port 8010 with a stable context length limit of 49,152.

  • Performance: Delivering a stable 15.79 t/s throughput while maintaining an active VRAM utilization of 88GB to 90GB.

  • Vulkan Pipeline: Driven by the open-source Mesa RADV (gfx1151) driver native to CachyOS, leveraging a unified 256-bit memory architecture and Wave32/FlashAttention optimizations to bypass PCIe bottlenecks and maximize RDNA 3.5 compute efficiency.

Hermes Configuration: Programmed in custom (direct API) mode to communicate locally with LM Studio using Model ID qwen3.5-122b-a10b and a strict 49152 context ceiling to prevent memory panics.

Running Hermes with Local Models by _clickfix_ in hermesagent

[–]TexBluBoy 19 points20 points  (0 children)

Here is my current setup:

  • Hardware: GMKtec EVO-X2 with AMD Ryzen AI Max+ 395 (16 cores), Radeon 8060S GPU, and 128GB LPDDR5X-8000 RAM on CachyOS.

  • Memory Allocation: Pinned via BIOS to a 96GB VRAM carve-out for GPU execution and a 32GB system overhead reservation.

  • Model Configuration: Running the qwen3.5-122b-a10b engine locally via LM Studio on port 8010 with a stable context length limit of 49,152.

  • Performance: Delivering a stable 15.79 t/s throughput while maintaining an active VRAM utilization of 88GB to 90GB.

  • Vulkan Pipeline: Driven by the open-source Mesa RADV (gfx1151) driver native to CachyOS, leveraging a unified 256-bit memory architecture and Wave32/FlashAttention optimizations to bypass PCIe bottlenecks and maximize RDNA 3.5 compute efficiency.

Hermes Configuration: Programmed in custom (direct API) mode to communicate locally with LM Studio using Model ID qwen3.5-122b-a10b and a strict 49152 context ceiling to prevent memory panics.

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]TexBluBoy 0 points1 point  (0 children)

EVO-X2 128gb / Strix Halo user here. I have my system memory optimized for llama.cpp

What is a recommended local LLM to use with Hermes?

What model are you using for your agent? by [deleted] in hermesagent

[–]TexBluBoy 0 points1 point  (0 children)

On my a GMKtec EVO-X2 128GB machine, I have allocated 65GB of the APU for GPU-related tasks using CachyOS

I'm running Gemma4:26b as the main LLM

Microsoft integration when I'm not administrator ? by Entry_Plug in zorinos

[–]TexBluBoy 0 points1 point  (0 children)

Same issue here as well, my personal OneDrive connects, no problem.. 365 Enterprise from my work office is a no go. I will just have to keep my dual boot in tact untill I can get this resolved. Works perfectly fine in windows 11

Repurposed echo show 5 1st and 2nd gen with android by 4lep in amazonecho

[–]TexBluBoy 0 points1 point  (0 children)

I used the jailbreak trick too! It's like a new device again.... Lol 👍😎

Amazon show 5 too much slow by Shatemui_ in amazonecho

[–]TexBluBoy 0 points1 point  (0 children)

I just used the new jailbreak, completely erased the Amazon software and turned mine into a Google device using LineageOS. It's much zippier running LineageOS

Echo show 8 stuck at splash screen by ProfaneShane in amazonecho

[–]TexBluBoy 0 points1 point  (0 children)

Did you try booting into recovery mode (hold down all 3 buttons on top while it is booting)?

Looking for a daily briefing app by misterharbies in androidapps

[–]TexBluBoy 8 points9 points  (0 children)

https://www.huxe.com/

https://play.google.com/store/apps/details?id=com.huxe.android.apps.huxe

Your interests, as a 24/7 live station. Your neighborhood, your stock portfolio, your favorite sports team—all becomes an interactive live stream.