Appier interview question

Why vanishing and exploding gradient happens in RNN? What is CRF? How do you learn embeddings for users based on their visits on websites?