Claude / vision LLM power-user
“Click exactly here” beats “somewhere top-left”.
When you describe a UI screenshot to Claude, the model has to guess pixels. Coordinate hallucinations kill agent tasks and burn tokens. Writing structured JSON with numbered elements is still manual.
Click on the screenshot, describe in one sentence, hit “Copy for Claude”. Claude gets exact pixel-coords + labels. Zero hallucinations, zero re-prompting.
- Point + polygon
- Semantic tags
- Copy for Claude
- Multi-image bundle
“From 15 minutes of describing UI to 30 seconds of clicks.”Try with the Claude template