But it did poorly only on the problems it hadn't seen before. Was it prompted differently on one kind of problem, compared to the other?
But it did poorly only on the problems it hadn't seen before. Was it prompted differently on one kind of problem, compared to the other?