With so many cores, it would be interesting if future compilers use the cores themselves to determine things like pipelining, out-of-order execution, and such that are now handled by specialized hardware in each core, resulting in more complex systems. On a four-core system, for example, one core could be dedicated to ensuring that the other three cores are properly "fed" to maximize their throughput, resulting in less complex cores, overall. This would lower transistor count, which would reduce production cost and waste heat, allowing for cheaper cooling systems and more compact systems.