In my own tests I have found opus to be very good at writing plans, terrible at ...

Sammi · 2026-02-22T10:51:33 1771757493

1. Don't implement too much at at time

2. Have the agent review if it followed the plan and relevant skills accurately.

irthomasthomas · 2026-02-22T11:03:00 1771758180

the first link was from a simple request with fewer than 1000 tokens total in the context window, just a short shell script.

here is another one which had about 200 tokens and opus decided to change the model name i requested.

opus is bad at instruction following now.