
pawelduda on Feb 27, 2025 | on: GPT-4.5

Does the benchmark reflect your opinion on 3.7? I've been using 3.7 via Cursor, and it's noticeably worse than 3.5. I've heard the standalone model works fine, but I haven't had a chance to try it yet.

Personal anecdote: Claude Code is the best LLM devx I've had.