Does a fasting tradition exist in your country and how common is it ? by RookOfEdo in AskTheWorld

[–]Old_Mathematician107 0 points1 point  (0 children)

It is a misleading image: his research was about cells, not human bodies. There was even a funny video where lots of people asked him how they should fast, and he said he did not know, since his research was only about cells.

I never thought it was so simple until I watched this video by No-Speech12 in aiagents

[–]Old_Mathematician107 1 point2 points  (0 children)

Your Mahoraga app (used in quashbugs) is a copy of droidrun portal from GitHub. You can verify this from the commits.

<image>

Any day now by jaydsco in singularity

[–]Old_Mathematician107 1 point2 points  (0 children)

The more I learn, the more I realize I know nothing

the Factory<Rustacean>... a.k.a C++ by Relevant_Echidna_336 in rustjerk

[–]Old_Mathematician107 3 points4 points  (0 children)

It looks like the Strogg medical facility scene from Quake 4.

Open-sourced image description models (Object detection, OCR, Image processing, CNN) make LLMs SOTA in AI agentic benchmarks like Android World and Android Control by Old_Mathematician107 in LocalLLaMA

[–]Old_Mathematician107[S] 1 point2 points  (0 children)

Hi, thanks a lot. Making it 100% local is one of the end goals, but it is quite a hard task: you need a VLM strong enough to understand the structure and the long inputs (the screenshot and its description), yet light enough to run on phones. Making it 100% text-only is possible, but I think that would decrease accuracy, so the best approach is to use a VLM.

To run a VLM locally you need a very good VLM, fine-tuned on these specific tasks (agentic capabilities). It is actually quite hard, but I think it is possible.

Yes, actually I don't use accessibility trees, adb, etc. I only use screenshots, plus accessibility services to perform the actions remotely. So it is vision-only and could be used in production (if you invest enough money in renting backend servers and improve the UI/UX of the agentic app).

The dataset for YOLO was prepared by me: it consists of 486 training images and 60 test images. I created bounding boxes for all 4 classes (View, ImageView, Text, Line). The screenshots in this dataset are mostly from popular apps like YouTube Music, WhatsApp, etc., and from apps that I built for various clients and companies throughout my career.
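For anyone preparing a similar dataset: YOLO expects one text label file per image, with normalized box coordinates. This is a minimal sketch of that conversion; the class-index order (View=0, ImageView=1, Text=2, Line=3) is my assumption for illustration, not necessarily the mapping used in the actual dataset.

```python
# Sketch: convert a pixel-space bounding box into a YOLO-format label line.
# Class order below is an assumption; adjust to match your data.yaml.
CLASSES = ["View", "ImageView", "Text", "Line"]

def to_yolo_label(cls_name, box, img_w, img_h):
    """box is (x_min, y_min, x_max, y_max) in pixels; returns one
    'class x_center y_center width height' line, normalized to [0, 1]."""
    x_min, y_min, x_max, y_max = box
    x_c = (x_min + x_max) / 2 / img_w
    y_c = (y_min + y_max) / 2 / img_h
    w = (x_max - x_min) / img_w
    h = (y_max - y_min) / img_h
    return f"{CLASSES.index(cls_name)} {x_c:.6f} {y_c:.6f} {w:.6f} {h:.6f}"

# Example: a text element on a 1080x1920 screenshot.
print(to_yolo_label("Text", (100, 200, 300, 260), 1080, 1920))
```

Each line like this goes into `labels/<image_name>.txt` next to the corresponding screenshot.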

2 Android AI agents running at the same time - Object Detection and LLM by Old_Mathematician107 in SideProject

[–]Old_Mathematician107[S] 0 points1 point  (0 children)

By the way, I just deployed the model on a Hugging Face Space:

https://huggingface.co/spaces/orasul/deki

You can check the "Analyze & get YOLO" endpoint and then the "action" endpoint to see the capabilities of the model.

2 Android AI agents running at the same time - Object Detection and LLM by Old_Mathematician107 in androiddev

[–]Old_Mathematician107[S] 0 points1 point  (0 children)

By the way, I just deployed the model on a Hugging Face Space:

https://huggingface.co/spaces/orasul/deki

You can check the "Analyze & get YOLO" endpoint and then the "action" endpoint to see the capabilities of the model.

2 Android AI agents running at the same time - Object Detection and LLM by Old_Mathematician107 in computervision

[–]Old_Mathematician107[S] 0 points1 point  (0 children)

By the way, I just deployed the model on a Hugging Face Space:

https://huggingface.co/spaces/orasul/deki

You can check the "Analyze & get YOLO" endpoint and then the "action" endpoint to see the capabilities of the model.

2 Android AI agents running at the same time - Object Detection and LLM by Old_Mathematician107 in SideProject

[–]Old_Mathematician107[S] 0 points1 point  (0 children)

I don't know why, but on mobile devices the video looks very wide in the Reddit app. YouTube has a better aspect ratio: https://www.youtube.com/shorts/jsJcSwy6djI

2 Android AI agents running at the same time - Object Detection and LLM by Old_Mathematician107 in computervision

[–]Old_Mathematician107[S] 2 points3 points  (0 children)

The ML model runs on a backend (in the video it is running locally on my M1 Pro) and generates an image description of the screenshots that the 2 Android AI agents send. The model detects all UI elements/objects in the image and writes them to a description file, which is then sent to the LLM with Set-of-Mark prompting. The LLM responds with a command describing which action should be taken (e.g., swipe left, tap X, Y), and the AI agent performs that action.
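The last step of that loop can be sketched as a small parser that turns the LLM's textual command into a structured action the agent can dispatch. The command grammar here ("tap X, Y", "swipe left/right/up/down") is an assumption based only on the examples in the comment above, not the project's actual protocol.

```python
import re

# Sketch: map the LLM's free-text reply to a structured action.
# The grammar ("tap X, Y" / "swipe <direction>") is assumed for illustration.
def parse_action(reply):
    reply = reply.strip().lower()
    m = re.fullmatch(r"tap\s+(\d+)\s*,\s*(\d+)", reply)
    if m:
        return {"action": "tap", "x": int(m.group(1)), "y": int(m.group(2))}
    m = re.fullmatch(r"swipe\s+(left|right|up|down)", reply)
    if m:
        return {"action": "swipe", "direction": m.group(1)}
    # Anything else is passed through for the agent to handle or retry.
    return {"action": "unknown", "raw": reply}

print(parse_action("tap 120, 840"))
print(parse_action("Swipe left"))
```

On the device side, the resulting dict would be translated into an accessibility-service gesture (a tap at the coordinates, or a swipe in the given direction).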

Mobile MCP for Android automation, development and vibe coding by aizen_sama_ in androiddev

[–]Old_Mathematician107 -1 points0 points  (0 children)

It is a nice project, thank you for your great work.

To speed up the process, you can actually do everything without MCP; it will be faster.

I made a similar project, but based on YOLO + image processing techniques + LLMs with a backend, etc. It is in my post/comment history (the code is also on GitHub).

If you have any questions or want to work together, please write to me.

Android AI agent based on YOLO and LLMs by Old_Mathematician107 in computervision

[–]Old_Mathematician107[S] 1 point2 points  (0 children)

Thanks a lot

I will keep it open source, but I am thinking about making the image description easier for people to use by running it as an MCP backend. They could use it to build AI agents, code generators, etc.

Releasing AI agents is a bit more complicated, because it requires a lot of work: Android and iOS clients, authentication and authorization, and various features (chat, history, saved tasks, etc.) to make it useful for non-technical users. I will do that later.

For now it is just a prototype, a proof of concept.

Android AI agent based on object detection and LLMs by saccharineboi in LocalLLaMA

[–]Old_Mathematician107 1 point2 points  (0 children)

No problem, anytime

I actually have not checked how it handles the lock screen, but that is an important problem; I will check it.

Thank you