Zinga! interview question

What is gradient decent?? Explain Transformers in NLP?