Discussion about this post

Alexander Kurz

"In the modern artificial intelligence ecosystem, this is what the transformer architecture is. It’s a complex system which performs a large amount of information processing and it is best described in terms of its architecture rather than in terms of its end-directed “computation”."

I agree, but there are impressive efforts in distilling a "computational level" from the architecture. Do you know about https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html ?

Alexander Kurz

Maybe it is convenient to add a link to Marr's paper "Artificial intelligence—a personal view" ... thanks for pointing it out ... https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Artificial+intelligence+%E2%80%94+a+personal+view&btnG=
