Cool. Putting vision in the loop is a great idea. Ambitious idea, but I like it.

ericghildyal · 2025-05-16T21:34:19 1747431259

I used Cline to build a tiny testing helper app and this is exactly what it did!

It made changes in TS/Next.js given just the boilerplate from create-next-app, ran `yarn dev`, then opened its mini LLM browser and navigated to localhost to verify everything looked correct.

It found one mistake and fixed it, then ran `yarn dev` again, opened a new browser, navigated to localhost (still pointing at the original server it had brought up, not the new one on another port), and confirmed the change was correct.

I was very impressed but still laughed at how it somehow backed its way into a flow that worked, but only because Next has hot-reloading.

tough · 2025-05-16T18:44:50 1747421090

SmolVLM, Gemma, LLaVA, in case you wanna play with some of the ones I've tried.

https://huggingface.co/blog/smolvlm

Recently both llama.cpp and Ollama got better support for them too, which makes this kind of integration with local/self-hosted models more attainable and less expensive.

tough · 2025-05-16T18:51:50 1747421510

Also this for the visual regression testing parts, but you can add some AI into the mix ;) https://github.com/lost-pixel/lost-pixel

nadis · 2025-05-19T23:45:50 1747698350

Yes, the above reply is more what I meant! Vision / visualization, not just more automated testing.

Definitely ambitious!