A data lake is a centralised repository that allows you to store all your structured and unstructured data at any scale. Think of a data lake as a real lake. Just like a lake that has multiple tributaries coming in, a data lake has structured data, semi-structured data, unstructured data, machine to machine, logs flowing through in real-time. To simplify it a bit, a data lake can be defined as a repository that holds all the business data at any scale. Data lakes provide a high level of flexibility when it comes to the nature of data coming into it. No matter the type of data. The data lake keeps the data in its native format that you can later transform into something useful.
Here are some amazing companies in the Data Lakes.