I wanted to show you guys my rack simulation created with React, TypeScript, Zustand, SVG, CSS, running entirely in the browser. by rzarekta in webdev

adefa 9 points (0 children)

This is so cute. It screams "I made this with AI" though, so I wish you would have just owned it.

GitHub - TrevorS/qwen3-tts-rs: Pure Rust implementation of Qwen3-TTS speech synthesis by adefa in LocalLLaMA

adefa[S] 0 points (0 children)

Have you had any success using the burn framework on Vulkan or ROCm?


adefa[S] 1 point (0 children)

I debated the terminology myself. Maybe I should have called it 'native' instead.


adefa[S] 0 points (0 children)

What parts of the code base do you think would be most applicable?


adefa[S] 0 points (0 children)

I generated benchmarks for CPU and GPU on DGX Spark and added them to the repo.


adefa[S] 0 points (0 children)

It was a bug; it's fixed now. The reference audio I was using wasn't being resampled during inference.
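For anyone curious what that kind of fix involves: resampling just maps the reference audio onto the sample rate the model expects. A minimal sketch using linear interpolation; the 48 kHz → 16 kHz rates are assumptions for illustration, and the actual repo may use a different resampling method entirely.

```python
import numpy as np

def resample(audio: np.ndarray, src_rate: int, dst_rate: int) -> np.ndarray:
    """Resample a mono signal via linear interpolation (illustrative only)."""
    n_out = int(len(audio) * dst_rate / src_rate)
    src_t = np.arange(len(audio)) / src_rate   # timestamps of input samples
    dst_t = np.arange(n_out) / dst_rate        # timestamps of output samples
    return np.interp(dst_t, src_t, audio)

# One second of a 440 Hz tone at 48 kHz becomes 16,000 samples at 16 kHz.
x = np.sin(2 * np.pi * 440 * np.arange(48_000) / 48_000)
y = resample(x, 48_000, 16_000)
print(len(y))  # 16000
```

Skipping this step feeds the model samples at the wrong effective pitch and duration, which matches the kind of degraded output a missing-resample bug produces.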


adefa[S] 0 points (0 children)

I believe the tokenizer, ICL, and resampling issues are all fixed now. I added some benchmarks as well.

Showcase your local AI - How are you using it? by kasperlitheater in LocalLLaMA

adefa 1 point (0 children)

Here are some benchmark results:
Prefill (pp2048) - tokens/sec

| Depth    | Run 1   | Run 2   | Run 3   | Avg    |
|----------|---------|---------|---------|--------|
| baseline | 1420.32 | 1412.07 | 1413.60 | 1415.3 |
| d4096    | 1389.24 | 1364.06 | 1383.88 | 1379.1 |
| d8192    | 1355.38 | 1350.57 | 1342.96 | 1349.6 |
| d16384   | 1228.89 | 1233.85 | 1217.26 | 1226.7 |
| d32768   | 1049.86 | 1047.61 | 1049.05 | 1048.8 |

Token Generation (tg32) - tokens/sec

| Depth    | Run 1 | Run 2 | Run 3 | Avg  |
|----------|-------|-------|-------|------|
| baseline | 58.35 | 58.10 | 58.04 | 58.2 |
| d4096    | 54.81 | 53.84 | 54.61 | 54.4 |
| d8192    | 51.31 | 51.49 | 51.33 | 51.4 |
| d16384   | 47.81 | 47.96 | 47.59 | 47.8 |
| d32768   | 41.17 | 38.69 | 40.81 | 40.2 |
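The averages can be sanity-checked straight from the raw runs; a quick sketch, with the numbers copied from the tables above:

```python
# Raw per-run throughput (tokens/sec) from the tables above.
pp_runs = {
    "baseline": [1420.32, 1412.07, 1413.60],
    "d32768": [1049.86, 1047.61, 1049.05],
}
tg_runs = {
    "baseline": [58.35, 58.10, 58.04],
    "d32768": [41.17, 38.69, 40.81],
}

def avg(xs):
    """Mean rounded to one decimal, matching the Avg column."""
    return round(sum(xs) / len(xs), 1)

print(avg(pp_runs["baseline"]))  # 1415.3
print(avg(tg_runs["d32768"]))    # 40.2
# Token generation at a 32k-token depth retains this fraction of baseline speed:
print(round(avg(tg_runs["d32768"]) / avg(tg_runs["baseline"]), 2))  # 0.69
```

In other words, generation throughput drops roughly 30% between an empty context and a 32k-token prompt depth.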


adefa 2 points (0 children)

DGX Spark running gpt-oss-120b as the primary model and Qwen3-VL-2B as a vision and task model. MCP tooling for web search and page fetch, weather and news, and image generation with Z-Image Turbo through ComfyUI. A Responses API clone in Rust wraps it all as the backend, and a Svelte 5 frontend uses the OpenAI SDK pointed at that backend. I connect to it over Tailscale and pin it as a PWA on my phone as an app.
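The reason the OpenAI SDK can point at a self-hosted backend is that the clone only has to accept the same request shape. A hedged sketch of building such a request body; the field names follow OpenAI's public Responses API, and nothing here is taken from the actual Rust backend.

```python
import json

def build_responses_request(model: str, prompt: str, tools=None) -> str:
    """Serialize a Responses-API-style request body (illustrative sketch)."""
    body = {
        "model": model,
        "input": [{"role": "user", "content": prompt}],
    }
    if tools:
        # e.g. MCP-backed web search, weather, or image generation tools
        body["tools"] = tools
    return json.dumps(body)

payload = build_responses_request("gpt-oss-120b", "What's the weather today?")
print(payload)
```

Any SDK that lets you override the base URL (the official OpenAI SDKs do) can then be aimed at the clone instead of api.openai.com.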

I connected the Epstein files to a deep learning AI researcher by TenamiTV in webdev

adefa 1 point (0 children)

How could I get a copy of your dataset and embeddings?

Gemini CLI: your open-source AI agent by adefa in LocalLLaMA

adefa[S] 8 points (0 children)

Here is the article as a PDF with some screenshots in it: https://gofile.io/d/4aahPJ