The real problem is that schematics are at the very heart of electronics design (and of teaching it), so most of the good training data is locked up in images, and you need a very powerful vision model to unlock it.
Models can also output code that an interpreter turns into a schematic, but there is virtually zero training data for this, because humans draw and read schematics directly rather than writing them as code.
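To make the "code that an interpreter turns into a schematic" idea concrete, here is a rough sketch. The text format, the `CIRCUIT` example, and the `parse_netlist` helper are all made up for illustration and are not any real tool's syntax. It only covers the first half of the job, turning text into a connectivity graph; an actual interpreter would still have to place symbols and route wires to produce a drawing.

```python
from collections import defaultdict

# Hypothetical mini-format (an assumption for illustration only):
# one part per line, followed by pin=net pairs.
# The circuit is a simple two-resistor voltage divider.
CIRCUIT = """
R1 1=VIN 2=VOUT
R2 1=VOUT 2=GND
"""

def parse_netlist(text):
    """Return {net_name: [(part, pin), ...]} for the toy format above."""
    nets = defaultdict(list)
    for line in text.strip().splitlines():
        part, *pins = line.split()
        for pin in pins:
            pin_name, net = pin.split("=")
            nets[net].append((part, pin_name))
    return dict(nets)

nets = parse_netlist(CIRCUIT)
for net, connections in nets.items():
    print(net, connections)
```

A model emitting text like `CIRCUIT` only has to get the connectivity right; everything visual is left to the interpreter. That is what makes the approach attractive, and also why the lack of human-written examples in this form hurts.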