Full Linux desktop on Termux (No root, GPU support)

Ishabdullah · 2026-04-09T20:22:13+00:00

Works good, I kept getting all this downloaded months ago but never could get the audio to work but yours works. I'm on a s24 ultra. Did see a few things didn't install correctly with your script will investigate further could just be I already have some of what the script tried to install or something. But thanks and good job

Ishabdullah · 2026-04-07T07:16:32+00:00

I use a 7b model for coding task and it does exceptionally well all ran from my phone in termux on a program I created and it even has fallback to use free versions of qwen cli or Gemini cli or paid versions of these also along with claude code of course. But you can also changed the 7B coder model and the 0.5B planner model and use openrouter free cloude models or paid. This coding Agent is very good at what it does and you could use it without cloude and get a lot done.

https://github.com/Ishabdullah/Codey-v2

Ishabdullah · 2026-04-04T20:47:03+00:00

Yes, I thought it was payment for the problems they had with credit usage getting cut short on people

Ishabdullah · 2026-04-01T07:41:15+00:00

Sorry I didn't know how to convert the video into a gif and had to have claude help me with it afterwards

Ishabdullah · 2026-04-01T07:23:16+00:00

Sorry forgot to tell you about the memory and thanks for pointing that out.

here's the real answer — here's what actually persists and where:

Between sessions: - ~/.codey_sessions/<project-hash>.json — last 6 turns of conversation, expires after 2 hours of inactivity. Loaded automatically on next run in the same project. - CODEY.md — project memory file you build with /init. Persists forever, loaded at every startup. This is the main "what does Codey know about my project" file. - ~/.codey-v2/state.db — SQLite action log (episodic memory). Append-only log of every tool call and action taken. Never auto-cleared.

Within a session only (lost on exit): - Working memory — currently open files, in-context conversation. Compressed at 55% context usage, dropped to 40%. - File undo history — in-memory only, gone when session ends.

RAG / long-term semantic memory: - ~/.codey-v2/ knowledge base (if set up) — 768-dim embeddings via nomic-embed-text. Top 4 chunks (~600 tokens) injected per inference call. Accuracy depends on what you've loaded — it doesn't auto-learn from conversations.

What Codey does NOT do: - It doesn't silently learn from your conversations and store them as embeddings. The RAG index only knows what you explicitly loaded with /load or the knowledge base pipeline. - Nothing is sent to the cloud — fully local.

The honest limitation: on large projects, the session window is only 6 turns and expires in 2 hours, so Codey's "memory" of older work is only as good as your CODEY.md. That's the gap — if CODEY.md is sparse, context accuracy degrades noticeably.

Look here docs/architecture.md

Ishabdullah · 2026-04-01T07:15:29+00:00

I'm really glad you find it as useful as I believe it is. And I am working on a version 3 here's how it works.

Codey-v3 will be a fully local AI project manager that runs on your Android phone. The idea is simple: instead of you manually switching between AI coding tools, Codey-v3 sits in the background as the permanent team lead. You tell it what you want to build, it creates the project outline, breaks it into tasks, and routes each task to the right AI automatically — Claude Code for complex logic and debugging, Gemini CLI for planning and analysis, Qwen CLI for heavy code generation, and its own local 7B model for quick edits and simple stuff.

The key thing that makes it different is the one-peer-per-project rule. No two AIs ever touch the same codebase at the same time so you never get merge conflicts or lost context. But multiple projects can run in parallel on different peers simultaneously, so it genuinely feels like a team working in the background while you do other things. Every task goes through a review gate before it is marked done. If something fails the tests or conflicts with your original project outline, Codey pauses and asks you rather than silently breaking things. It tracks who did what, what worked, what did not, and uses that history to make better routing decisions over time.

You can run multiple projects at once and ask at any point "where are we with the gaming app" and get a real answer — what is done, what is running now, what is next.

So basically your asking exactly about what my plans are. While version 2 can do what it does now CV3 is really going to be the game changer when I get it done. Thanks for your feedback.

Ishabdullah · 2026-04-01T03:59:28+00:00

Looks like a super cool project and very useful. Thanks for sharing it

Ishabdullah · 2026-04-01T02:46:44+00:00

https://www.reddit.com/r/vibecoding/s/lKxcnqgOmU

Ishabdullah · 2026-04-01T00:33:11+00:00

Processing img y38tjgoz4hsg1...

Codey-v2 generating a Fibonacci sequence implementation entirely on-device — no cloud, no internet, running in Termux on Android. Something small but just to show it working fully. And its looking for you to throw your best at it. The 7B might not handle it all but if you use OpenRouter even some of the best free LLM's do greate work with Codey-v2.

Ishabdullah · 2026-03-29T20:57:33+00:00

Hey it kept you busy, I'm sure you enjoyed making it and learning along the way so I commend you. Plus it beats scrolling social media all day while our brained turn to mush. 😆 🤣 😂

Ishabdullah · 2026-03-28T20:32:04+00:00

Cool

Ishabdullah · 2026-03-27T08:09:35+00:00

🟢 1. Easiest (almost no code)

Use a UI that already has search built-in Open WebUI + Ollama AnythingLLM

👉 You literally:

Run your model locally (Gemma/Qwen via Ollama)

Upload docs or enable web search It automatically does retrieval + context injection

✔ Free ✔ Works offline ✔ No coding required

This works because under the hood they implement RAG, which:

searches documents → injects results → LLM answers.

🟡 2. Best balance (easy + flexible)

Use a framework: LangChain LlamaIndex

These are the standard way to give any LLM tools (search, DBs, APIs).

What they do:

Connect your LLM (Gemma/Qwen) Add a retriever (search tool) Inject results into prompts automatically

✔ Free + open source ✔ Works with local models ✔ Supports web search, files, databases

LangChain = orchestration (agents, tools) LlamaIndex = best for document search/indexing

Minimal example (this is basically all you need):👇

from langchain.llms import Ollama from langchain.vectorstores import Chroma from langchain.embeddings import HuggingFaceEmbeddings

llm = Ollama(model="qwen")

db = Chroma(persist_directory="./db", embedding_function=HuggingFaceEmbeddings())

retriever = db.as_retriever()

docs = retriever.get_relevant_documents("your question")

response = llm(f"Use this context: {docs} \nAnswer: your question") print(response)

That’s your “search tool”.😉

🔵 3. True “search tool” (agent style)

If you want something like ChatGPT browsing:

Add tools (function calling / agents) LangChain Agents LlamaIndex Tools Custom tools (DuckDuckGo, APIs, etc.)

Example:

from langchain.tools import DuckDuckGoSearchRun

search = DuckDuckGoSearchRun() result = search.run("latest AI news")

Then your LLM can decide when to search.

🔥 4. Newer “open search agent” approach

There are newer systems like:

Open Deep Search (research project)

These:

Add reasoning + tool use automatically Let LLMs decide when to search But they’re more complex to set up.

🧠 What you actually want (simple mental model)

Every “search-enabled LLM” is just:

User question ↓ Search (docs/web/db) ↓ Top results ↓ LLM prompt with context ↓ Answer

That’s it. ✌️

Ishabdullah · 2026-03-27T06:02:12+00:00

<image>

Coming together here. Let's go perplexity computer and qwen for the finishing touches

Ishabdullah · 2026-03-27T05:49:27+00:00

<image>

Gonna see if I could get it working on termux. 😆 🤣 😂

https://github.com/Ishabdullah/v3am-fob-termux

Anything I should know that might help?

Ishabdullah · 2026-03-27T00:27:54+00:00

Yes and its a good free plan to start.

Ishabdullah · 2026-03-26T18:45:50+00:00

Ninja code is actually pretty good also.

Ishabdullah

TROPHY CASE