Comment by quantumgarbage
6 months ago
> A modern sparse Transformer, for instance, is not "conscious," but it is an excellent engineering approximation of two core brain functions: the Global Workspace (via self-attention) and Dynamic Sparsity (via MoE).
Could you suggest some literature supporting this claim? I went through your blog post but couldn't find any.
Sorry, I didn't get a chance to track down the relevant references earlier, so I'm attaching some now:
https://www.frontiersin.org/journals/computational-neuroscie...
https://arxiv.org/abs/2305.15775