Data engineering is the process of building a data pipeline that determines the flow of data within the enterprise. Where will data be collected from? Where to store such huge volumes of data? How will this data be cleaned and structured? In which format will this data reach data scientists? How will the analytics be shared with employees?