Amazon interview question about large data sets