Currently, the quantity of data obtained from various sources is growing intensively. Many companies are not able to follow solutions required for their processing. Companies do not know how to use the information obtained as effectively as possible, and thus – they are not able to present them in a manner understandable to the managers.
A similar problem occurred in the analysed pharmaceutical company. Difficulties with big data whose processing took more and more time as well as their volume convinced the managers to take efforts to transfer information from many dispersed databases to one Data Lake. It was to work by means of the Apache Hadoop environment. This solution was to boost considerably data processing and such data were used for establishing a global sales strategy.