Microsoft interview question

How do transformers and attention work