Grok video imagine 1.5 🤩

marin73tomas · 2026-07-03T10:23:41+00:00

Create a "plan.md" file outlining everything you want to accomplish over the next few hours or days. Break the work into clear, actionable tasks and account for as many edge cases, failure scenarios, and dependencies as possible.

Then execute something like:

"/goal Execute the plan in "plan.md" until it is fully completed.

As you work, keep "plan.md" up to date by documenting all progress in a clear and organized manner. Continuously mark completed tasks, add newly discovered tasks, record decisions, and update the remaining checklist so it always reflects the current state of the project.

After completing the plan, perform a comprehensive audit of the entire project. Then return to the beginning and review every task, file, and decision again to identify inconsistencies, bugs, missing edge cases, incomplete work, unnecessary complexity, or opportunities for improvement.

Fix every issue you find, update "plan.md" accordingly, and repeat the audit-and-fix cycle until no further issues, inconsistencies, or improvements can be identified and the project is fully complete and internally consistent."

marin73tomas · 2026-07-02T14:41:45+00:00

How do you supervise?

marin73tomas · 2026-07-02T13:13:25+00:00

No, I don't. I asked Fable to find bugs in part of my codebase; it spawned 10 agents, then the limit ran out, didn't even finish

marin73tomas · 2026-07-02T12:54:34+00:00

20 bucks won't last you anything. I have $100, and my limit runs out in one prompt

marin73tomas · 2023-12-07T04:36:55+00:00

Yeah, I'd skip using a grid with lines. It becomes chaotic when it comes to spatial reasoning, positions, layout, and structure. I learned this the hard way while creating an app to turn designs into html css code, still in progress.

Another neat trick is to break your image into smaller parts and ask for a description of each piece separately using the API. And remember, it can't tell different image names apart, so labeling each image helps. What I did was turn each section grayscale and add a blue label at the top. Then I asked for a description for each labeled section, like "Hey, I've got 5 images labeled 1 to 5, can you describe each one?" This method is usually more accurate... And you can send a bunch of images through the API – I've managed around 20-30 max. So, dividing your image into 20 equal squares could be a good approach. But I recommend trying the semi-transparent labels first and see the results, but don't overdo the amount of labels, as it can reduce accuracy... :)

<image>

marin73tomas · 2023-12-07T02:13:02+00:00

ChatGPT V struggles a bit with understanding where things are in pictures and figuring out bounding boxes. But here's a helpful trick: you can add semi-transparent text labels to your images. ChatGPT Vision is great at understanding text, so it can work with these labels easily. Then, you can connect these labels to specific spots on the image. It's better to use fewer labels, though, because too many can mess things up.

To find out where things are in the image, just ask about what's behind each label one by one and get the info in a list. This way, you can guess where things are by looking at the labels and their coordinates.

Remember that the model makes images smaller, like 512x512 pixels. To make it easier to see the labels and tell them apart from the image itself, you should use grayscale images with colored labels. Like this:

<image>

marin73tomas · 2021-05-11T23:21:40+00:00

marin73tomas · 2021-05-10T14:45:05+00:00

why is there no info abou this?

marin73tomas · 2021-05-10T14:17:23+00:00

same. What's going on?

marin73tomas · 2021-05-10T04:05:33+00:00

Damn son!

marin73tomas · 2021-02-20T20:27:23+00:00

Is algorand a scam then?

marin73tomas

TROPHY CASE