Why not? I think there is some interesting research here at the computationally level / distributed that could lead to some interesting architecture and discoveries.
If LLM's are token predictors for language, what happens when you do token prediction for computation across a distributed network? Then run a NN on the cache and clustering itself? Lots of potential use cases.
Fully distributed OS's/Virtual Machines/LLM's/Neural Networks
If LLM's are token predictors for language, what happens when you do token prediction for computation across a distributed network? Then run a NN on the cache and clustering itself? Lots of potential use cases.