Infosys interview question

How would you design a data pipeline to handle large-scale data processing?