I set out to investigate. When I goggled on “Cloud Computing” around 253,000,000 results (0.32 seconds) were thrown up. However when I goggled on “Big Data” around 1,960,000,000 results (0.64 seconds) searches were reflected. 253 Million cloud computing references versus 1,960 Million reference of Big Data reference on the net does speaks volumes on the topic Big Data.
So what is Big Data ?
Research says that other than the business batch data (Structured data) ; every 60 seconds, over 168,000,000 emails are generated across the globe, 695,000 status updates, 79,364 wall posts and 510,040 comments are published on Facebook, and 600 videos (unstructured data) are uploaded on YouTube. And its drowning in voluminous data; structured and unstructured and business realize that data is key to gaining insight into vendors, partners, customers and their buying habits and demographics. The accumulating data reflects 3 V’s challenge i.e.
VOLUME: Data estimate varies from business to business and has the capacity to reach into petabytes and zeta bytes. And tomorrow it will shoot up higher.
VARIETY: Data is being generated in countless forms: Text, web data, tweets, sensor data, audio, video click streams, log files and more.
VELOCITY: The speed at which data is being created is on the upward trajectory and will continue to accelerates.
Having said that vendors define the 4th V i.e VERACITY or VALUE of data being generated.
The 4x V’s challenges has brought in focus from business to find viable process to make smarter use of the data that’s available to them. They must be able to prioritise, organise and analyse both structured and unstructured data. And every major systems integrators is playing a role in this area including IBM, TCS, Infosys, Huwaei.
I am sharing two YouTube cuts from TCS and Infosys on Big Data to share more clarity; after all “A picture is worth a thousand scripted words”