Phone screen - 1 LeetCode medium for grouping points based on distance (Did a more naive approach but ideally graph based or Union-Find / DST ) and ViT layers, trade-offs questions.
Interview questions [1]
Question 1
Vision transformer internals and memory compute vs latency