I also never said anything about making code smaller; I said making it more efficient. It’s not about compressing it, it’s about finding better, less CPU-expensive ways to do things, which we absolutely do.
Another AI-based example: video chats currently work by streaming video, but there’s a technology in development that takes one screenshot, sends that, then sends only expression data so the face can be reconstructed on the other side.
Far more efficient network-wise.
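Rough back-of-envelope sketch of that comparison. Every number here is an assumption I picked for illustration (a 1.5 Mbps video stream, a 100 KB still image, 136 float values of expression data per frame at 30 fps), not a measurement of any real system, and the function names are made up:

```python
# Back-of-envelope bandwidth comparison: continuous video stream vs.
# "one still image + per-frame expression data". All constants below
# are illustrative assumptions, not measurements.

def stream_bytes(bitrate_bps: float, seconds: float) -> float:
    """Bytes sent by a constant-bitrate video stream."""
    return bitrate_bps / 8 * seconds


def keyframe_plus_expression_bytes(
    keyframe_bytes: int,
    values_per_frame: int,
    bytes_per_value: int,
    fps: int,
    seconds: int,
) -> int:
    """Bytes sent when one image goes first, then only expression data."""
    per_frame = values_per_frame * bytes_per_value
    return keyframe_bytes + per_frame * fps * seconds


CALL_SECONDS = 60  # one minute of call time

video = stream_bytes(bitrate_bps=1_500_000, seconds=CALL_SECONDS)
expr = keyframe_plus_expression_bytes(
    keyframe_bytes=100_000,   # assumed size of the single screenshot
    values_per_frame=136,     # assumed: 68 2D face landmarks
    bytes_per_value=4,        # 32-bit floats, uncompressed
    fps=30,
    seconds=CALL_SECONDS,
)
ratio = video / expr

print(f"video stream:          {video / 1e6:.2f} MB")
print(f"keyframe + expression: {expr / 1e6:.2f} MB")
print(f"ratio:                 {ratio:.1f}x")
```

With those assumed numbers the expression-data approach sends roughly a tenth of the bytes, and that’s before compressing the expression data at all. The trade is that the receiving side has to spend compute on reconstruction.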
Hardware speed has increased, sure, but that applies to both consumer hardware and servers. All a theoretical AGI would have to do is improve on its own training/code enough that it will run at all on consumer-level hardware (which language models currently will do).
(For reference, llama 40B runs just fine on my ThinkPad from 2016. Pre-trained models are not that difficult to run; training is the expensive part.)
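If you want to sanity-check whether a model’s weights will even fit in memory, the arithmetic is just parameter count times bytes per weight. Minimal sketch below; the bit widths are assumptions about how the weights are stored, the helper name is made up, and it only counts the weights themselves (no activations, no context cache, no runtime overhead):

```python
# Weights-only memory estimate for running a pre-trained model.
# Ignores activations, context cache and runtime overhead, so real
# usage will be higher than these figures.

def weight_memory_gb(n_params: int, bits_per_weight: int) -> float:
    """Gigabytes (10^9 bytes) needed to hold the weights alone."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 40_000_000_000  # the 40B figure from the message above

for bits in (16, 8, 4):  # assumed storage precisions
    print(f"{bits:>2}-bit weights: {weight_memory_gb(N_PARAMS, bits):.0f} GB")
```

Storing weights at lower precision is what brings big models within reach of consumer machines. Training needs far more than this on top, which is why it’s the expensive part.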