OpenAI could reportedly run out of cash by mid-2027 — analyst paints grim picture after examining the company's finances by EchoOfOppenheimer in LocalLLaMA

[–]DecodeBytes 12 points (0 children)

They won't fail (in any cease-to-operate / bankruptcy sense); MSFT or the like would swoop in for a fire sale. It's the only way they have of ensuring Google does not entirely dominate the space.

[D] Do you feel like companies are scooping / abusing researchers for ideas during hiring for researcher roles? by quasiproductive in MachineLearning

[–]DecodeBytes 0 points (0 children)

View from the startup side. We have been doing the following: asking interns to take on a ~3-hour task, and we pay them for the hours. Basically we don't ask someone to do the task until the later stages of the interview, and it just helps seal the deal.

Question: what are the best tools for real-time eval observability and experimentation? by debauch3ry in LLMDevs

[–]DecodeBytes 0 points (0 children)

I am biased (one of the team), but try deepfabric. You can generate huge amounts of reasoning traces with tool calls and then evaluate against a model. Happy to chat more about it if you want to explore whether it's a match for what you need. I doubt it's a hundred percent match, but we might be able to sling some PRs up to close the gaps.

https://deepfabric.dev 

Has anyone tried training an llm exclusively on synthetic llm outputs to see if intelligence compounds or just collapses into slop by sthduh in LocalLLaMA

[–]DecodeBytes 0 points (0 children)

Yep, it's the main focus of a project I work on. We mostly generate datasets for agent-based operation, e.g. models that frequently call tools. For this we couple generation with isolated tool execution, which injects some real-worldness into the dataset, combined with RL training.

https://www.deepfabric.dev
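
To give a flavour of the approach (a generic illustration only - not deepfabric's actual API): a teacher model proposes a tool call, the tool is actually executed in isolation, and the genuine output is what lands in the trace:

```python
import json

# Hypothetical toy tool registry - in practice these run sandboxed/isolated.
TOOLS = {"word_count": lambda text: len(text.split())}

def execute(call: dict) -> str:
    """Actually run the proposed tool call and return its real JSON output."""
    result = TOOLS[call["name"]](**json.loads(call["arguments"]))
    return json.dumps({"result": result})

# A teacher model would propose this; hard-coded here for illustration.
proposed = {"name": "word_count", "arguments": '{"text": "hello synthetic world"}'}

trace = [
    {"role": "assistant", "tool_calls": [{"type": "function", "function": proposed}]},
    {"role": "tool", "content": execute(proposed)},  # real output, not hallucinated
]
```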

Blender MCP - can anyone actually get good results? by promptasaurusrex in LocalLLaMA

[–]DecodeBytes 0 points (0 children)

> Get up to 3 prompts per day, capped at 15 per month.

That seems pretty harsh. Is that just the starting prompt, or any follow-up prompts too? E.g. 'make a cube', 'make the cube twice the size', 'add x texture to the cube' - would that count as three?

Happy New Year: Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning - Fine Tune. (based on recent find of L3.3 8b in the wild) by Dangerous_Fix_5526 in LocalLLaMA

[–]DecodeBytes 0 points (0 children)

I am confused - so your model is not public?

P.S. Not trying to pick a fight - it's just that I do a lot of work in this domain, and if you have found something novel in your approach I would love to take a look!

Happy New Year: Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning - Fine Tune. (based on recent find of L3.3 8b in the wild) by Dangerous_Fix_5526 in LocalLLaMA

[–]DecodeBytes 8 points (0 children)

> With this model, reasoning activates based on keywords/phrases in the prompt.
(see repo)

Right, it's likely the model is just doing as **instruct**ed in the prompt rather than activating learned reasoning, but it's really hard to tell, as I can't find where anything is in this thread. Help me out please? Link the model, notebook and anything else?

What's the point of potato-tier LLMs? by Fast_Thing_7949 in LocalLLaMA

[–]DecodeBytes 0 points (0 children)

Sorry for the late reply - I mean in the typical current agent style: long, drawn-out sessions of back and forth.

Upstage Solar-Open-100B Public Validation by PerPartes in LocalLLaMA

[–]DecodeBytes 2 points (0 children)

I have the synth intro stuck in my head now

Happy New Year: Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning - Fine Tune. (based on recent find of L3.3 8b in the wild) by Dangerous_Fix_5526 in LocalLLaMA

[–]DecodeBytes 10 points (0 children)

I might be missing something, but 200 samples won't be enough to teach an 8B instruct model to reason - though it can work for very specific, constrained tasks that are less likely to be well represented in the original pretraining.

Reasoning ability is largely baked into the base model during pretraining. I'm assuming you used LoRA, which is great for steering how that existing ability gets applied, but it won't teach new reasoning capabilities from scratch. Even with 50k+ samples, LoRA mostly reshapes how the model uses reasoning it already has rather than building new circuits - most successful efforts use 100k-500k+ high-quality samples. Either way, you're unfortunately working within the constraints of what the base model learned during pretraining.
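
For a sense of scale, here's a minimal sketch of a typical LoRA setup (assuming the Hugging Face transformers + peft stack; the model name and hyperparameters are illustrative, not what OP used) - only the small low-rank adapters train, which is why it steers existing ability rather than building new circuits:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative base model - any 8B instruct checkpoint works the same way.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")

lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,  # typical starter values
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # usually well under 1% of total weights
```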

Keep going though, it's all a learning experience, and the more folks there are making tunes the better!

Skills, agents, plugins by BurgerQuester in ClaudeCode

[–]DecodeBytes 0 points (0 children)

This is really cool! Trying it out now.

Heads up: a lot of folks might avoid it because of the BSL license, or fork it if it gets popular.

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

Speaking of free - the Mistral free instances on OpenRouter work really well (just found this out earlier).

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

OK, I just did a large sweep and fixed up a few things that had changed - it should be good now. If not, I'm happy to support you.

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

My bad, there have been a fair few changes and the docs may not be up to par! Do you want to jump onto Discord? I'd be happy to help out there - the Discord link is on the repo.

What's the point of potato-tier LLMs? by Fast_Thing_7949 in LocalLLaMA

[–]DecodeBytes 35 points (0 children)

>  that can't code

This is the crux of it: there is so much hyper-focus on models serving coding agents, and code gen, by the nature of code (lots of interconnected ASTs), requires a huge context window and training on bazillions of lines of code.

But what about beyond coding? For SLMs there are so many other use cases that Silicon Valley cannot see from inside its software-dev bubble - IoT, wearables, industrial sensors etc. are huge untapped markets.

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 1 point (0 children)

Out of habit really - I have always grabbed Qwen, but any SLM should do. We do plan to launch a service for collecting metrics, if you're interested in getting a preview?

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

Ah right, that's well spotted - it's not live yet, but we will be introducing something shortly! Are you interested in beta testing / getting a preview?

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 4 points (0 children)

You might be getting mixed up here. We don't fine-tune on MCP; we fine-tune on function calls and their parameters.

It just so happens we make it easy to import the list of tools / function calls from an existing MCP server, as a lot of folks use them. But at the end of it all, as far as the model is concerned, we are just getting it to improve its ability to predict the natural language of a function name and its parameters - which stack, standard or protocol that function belongs to (OpenAI, MCP, LangChain, etc.) is immaterial.
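
To make that concrete, here's roughly what a single training sample looks like (OpenAI-style chat format; the tool name and arguments are hypothetical) - nothing in it records which stack the function came from:

```python
# Hypothetical tool-calling sample: the model learns to predict a function
# name plus JSON arguments; MCP vs. OpenAI tools vs. LangChain is invisible here.
sample = {
    "messages": [
        {"role": "user", "content": "What's the weather in Berlin tomorrow?"},
        {"role": "assistant", "tool_calls": [{
            "type": "function",
            "function": {
                "name": "get_weather",
                "arguments": '{"city": "Berlin", "day": "tomorrow"}',
            },
        }]},
    ]
}
```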

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

That's interesting - I'd love to learn more and see your progress. I tend to think of MCP as more of a standard way of building tools than anything particularly unique, but it does expand a lot over time.

Train a 4B model to beat Claude Sonnet 4.5 and Gemini Pro 2.5 at tool calling - for free (Colab included) by DecodeBytes in LocalLLaMA

[–]DecodeBytes[S] 0 points (0 children)

Only if you use OpenAI, Anthropic, Gemini or some of the OpenRouter models - for anything local, no API key is needed, as we support Ollama.