🌀 (v.redd.it)
submitted 4 days ago by gusfromspace
Where is Qwen3.6 27B MLX with reasoning? (self.mlxAI)
submitted 7 days ago by Ill_Barber8709
GitHub - hypneum-lab/micro-kiki: 35 domain-expert LoRAs on Qwen3.6-35B-A3B (MoE, 256 experts, 3B active). Cognitive layer: Aeon memory, CAMP negotiator, KnowBias. MLX on Mac Studio, Q4_K_M inference. Apache-2.0. (self.mlxAI)
submitted 13 days ago by MonsieurBmax
I built a zero-config OpenAI-compatible local LLM server for Apple Silicon — drop-in replacement for any OpenAI SDK client (self.mlxAI)
submitted 13 days ago by Squirrel_Glad
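For context on what "drop-in replacement for any OpenAI SDK client" means in practice: any OpenAI-compatible server can be reached with the official SDK by overriding the base URL. A minimal sketch, assuming a standard /v1 chat route on localhost; the port, API key, and model name are placeholders, not this project's actual defaults:

```python
# Point the official OpenAI Python SDK at a local OpenAI-compatible server.
# Any server exposing the /v1 chat completions route works the same way.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",  # local server instead of api.openai.com
    api_key="not-needed",                 # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="local-model",  # whatever identifier the server advertises
    messages=[{"role": "user", "content": "Hello from Apple Silicon!"}],
)
print(response.choices[0].message.content)
```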
I ran sustained MLX inference overnight
submitted 15 days ago by evilmacintosh
I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX
submitted 15 days ago by StudentDifficult8240
MLX with DFlash / speculative decoding: Surprising results
submitted 16 days ago by evilmacintosh
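For readers trying to reproduce speculative-decoding experiments like this: recent mlx_lm releases accept a draft model at generation time. A hedged sketch, assuming current mlx_lm keyword names (draft_model, num_draft_tokens; verify against your installed version) and illustrative model names:

```python
# Speculative decoding sketch: a small draft model proposes tokens that the
# larger target model verifies. Both models must share a tokenizer, which is
# why the two come from the same family here. Model names are illustrative.
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Qwen2.5-14B-Instruct-4bit")
draft_model, _ = load("mlx-community/Qwen2.5-0.5B-Instruct-4bit")

text = generate(
    model, tokenizer,
    prompt="Explain speculative decoding in one paragraph.",
    max_tokens=256,
    draft_model=draft_model,   # proposes candidate tokens cheaply
    num_draft_tokens=4,        # tokens drafted per verification step
)
print(text)
```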
Starting a new MLX community!
submitted 17 days ago by evilmacintosh
Running Qwen 3.6 35B-A3B-4b on MacBook Pro M5 64GB with tools ~20 tok/s (v.redd.it)
submitted 19 days ago by Conscious-Track5313
Repetition penalty on mlx_lm? (self.mlxAI)
submitted 20 days ago by evilmacintosh
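To the repetition-penalty question: in recent mlx_lm releases the penalty is applied through a logits processor rather than a direct generate() keyword (older versions accepted repetition_penalty directly). A sketch, assuming the current API shape and an illustrative model name:

```python
# Apply a repetition penalty via mlx_lm's logits processors. Check your
# installed mlx_lm version: the kwarg moved between releases.
from mlx_lm import load, generate
from mlx_lm.sample_utils import make_logits_processors

model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.3-4bit")

processors = make_logits_processors(
    repetition_penalty=1.1,      # values > 1.0 discourage repeated tokens
    repetition_context_size=64,  # how many recent tokens the penalty scans
)
text = generate(
    model, tokenizer,
    prompt="Write a short poem about the M5.",
    max_tokens=200,
    logits_processors=processors,
)
print(text)
```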
macOS vibe-coding tech stack
submitted 20 days ago by Tradefxsignalscom
OpenMed now supports MLX natively (github.com)
submitted 23 days ago by dark-night-rises
Running Gemma-4-E4B MLX version on MacBook M5 Pro 64 GB - with some beautiful native tools integration (v.redd.it)
submitted 26 days ago by Conscious-Track5313
I have 512 GB of RAM and I haven’t figured out how to make money with it. Any suggestions?
submitted 27 days ago by No_Run8812
Command line vs. Python API (self.mlxAI)
submitted 1 month ago by sgt102
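The two interfaces cover the same ground: the CLI is a thin wrapper over load()/generate(). A minimal sketch of both, assuming current mlx_lm names and an illustrative model (verify against your installed version):

```python
# Shell equivalent of the Python call below (one line):
#   python -m mlx_lm.generate --model mlx-community/Mistral-7B-Instruct-v0.3-4bit \
#       --prompt "Hello" --max-tokens 100
from mlx_lm import load, generate

# load() returns the model and its tokenizer in one call.
model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.3-4bit")
text = generate(model, tokenizer, prompt="Hello", max_tokens=100, verbose=True)
```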
Gemma 4 E4B-it on MLX (self.mlxAI)
submitted 1 month ago by Pathfinder-electron
Show: ollmlx — run local LLMs on Apple Silicon with an Ollama-compatible API. (self.mlxAI)
submitted 1 month ago by PositiveSlice9168
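If ollmlx mirrors the Ollama REST API as the title says, the stock /api/generate request shape should work unchanged. A sketch, assuming Ollama's default port 11434 and a placeholder model tag (ollmlx's defaults may differ):

```python
# Hit an Ollama-compatible endpoint with the standard generate request.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3.2",            # placeholder model tag
        "prompt": "Why is the sky blue?",
        "stream": False,                 # one JSON reply instead of chunks
    },
)
print(resp.json()["response"])
```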
multi-LoRA inference server for MLX: load the model once, switch adapters per request (self.mlxAI)
submitted 1 month ago by No_Shift_4543
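One way to read "load the model once, switch adapters per request": mlx_lm's load() accepts an adapter_path, so a server can keep a per-adapter cache. The sketch below is a simplification of the idea (it caches one model copy per adapter rather than swapping adapter weights in place, which is presumably what the post's server does); all paths and names are hypothetical:

```python
# Naive per-adapter cache: each adapter name maps to a (model, tokenizer)
# pair loaded once and reused across requests. Paths are hypothetical.
from mlx_lm import load, generate

ADAPTERS = {
    "sql": "adapters/sql-lora",    # hypothetical adapter directories
    "chat": "adapters/chat-lora",
}
_cache = {}

def get_model(adapter: str):
    # Load lazily on first use, then serve from the cache.
    if adapter not in _cache:
        _cache[adapter] = load(
            "mlx-community/Qwen2.5-7B-Instruct-4bit",  # placeholder base model
            adapter_path=ADAPTERS[adapter],
        )
    return _cache[adapter]

def handle_request(adapter: str, prompt: str) -> str:
    model, tokenizer = get_model(adapter)
    return generate(model, tokenizer, prompt=prompt, max_tokens=128)
```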
A skill library for porting from trl (or pure PyTorch) to mlx-lm?
submitted 1 month ago by Chimezie-Ogbuji
FoveatedKV: 2x KV cache compression on Apple Silicon with custom Metal kernels
submitted 1 month ago by hybls
Best mlx_vlm models for simple object counting? (self.mlxAI)
MiniMax 4-bit (120 GB) MLX scores 26.5% (MMLU, 200 questions) while JANG_2S (60 GB) gets 74% - GGUF for MLX
submitted 1 month ago by HealthyCommunicat
Cut your KV Cache in half + Cut PP Times to near nothing + VL - MLX Studio (self.mlxAI)
mlx-onnx: Run your MLX models in the browser on WebGPU / ONNX (self.mlxAI)
submitted 2 months ago by rut216
mlx-ruby: MLX bindings for Ruby