Big data has a lot of potential to benefit organizations in any industry, everywhere across the globe. Big data is much more than just a lot of data and especially combining different data sets will provide organizations with real insights that can be used in the decision-making and to improve the financial position of an organization. Before we can understand how big data can help your organization, let’s see what big data actually is:
It is generally accepted that big data can be explained according to three V’s: Velocity, Variety and Volume. However, I would like to add a few more V’s to better explain the impact and implications of a well thought through big data strategy.
The Velocity is the speed at which data is created, stored, analyzed and visualized. In the past, when batch processing was common practice, it was normal to receive an update to the database every night or even every week. Computers and servers required substantial time to process the data and update the databases. In the big data era, data is created in real-time or near real-time. With the availability of Internet connected devices, wireless or wired, machines and devices can pass-on their data the moment it is created.
The speed at which data is created currently is almost unimaginable: Every minute we upload 100 hours of video on YouTube. In addition, over 200 million emails are sent every minute, around 20 million photos are viewed and 30.000 uploaded on Flickr, almost 300.000 tweets are sent and almost 2,5 million queries on Google are performed.
The challenge organizations have is to cope with the enormous data hk speed the data is created and use it in real-time.
In the past, all data that was created was structured data, it neatly fitted in columns and rows but those days are over. Nowadays, 90% of the data that is generated by organization is unstructured data. Data today comes in many different formats: structured data, semi-structured data, unstructured data and even complex structured data. The wide variety of data requires a different approach as well as different techniques to store all raw data.