Yes, it's certainly more complicated than that, but lithography is a huge part of it, since a smaller process lets them cram more transistors into a smaller area, which is critical for power savings.
I highly doubt instruction decoding is a significant factor, but I'd love to be proven wrong. If you know of a good writeup about it, please share it.