These transformer models are so huge, they require extremely expensive and specialist hardware beyond what enthusiasts and even many academica access to.
There is no chance in the near future consumers or Edge devices will be able to run these models locally, data is going to have to be fed back into the cloud.
Smaller models with better performance are beginning to arrive. Things like RETRO, better training data, longer training time, and scale optimization will have these models on phones and desktops doing crazy things in the near future.
There is no chance in the near future consumers or Edge devices will be able to run these models locally, data is going to have to be fed back into the cloud.