3Vs OF BIG DATA

Big Data: The 3Vs


Three dimensions are commonly used to define Big Data more precisely and to describe the challenges that companies face. They are usually called the 3 Vs of Big Data:

The 3 dimensions of Big Data

Volume
Variety
Velocity

Volume

Data volumes are huge and constantly growing. They are measured in terabytes (abbreviated TB) and even petabytes (abbreviated PB).
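To give a sense of scale, here is a minimal sketch that formats a byte count in the largest fitting unit. It assumes decimal (SI) units, where 1 terabyte is 10^12 bytes and 1 petabyte is 10^15 bytes; the function name human_size is hypothetical and chosen only for this illustration.

```python
# Decimal (SI) units: each step up is a factor of 1000.
UNITS = ["B", "kB", "MB", "GB", "TB", "PB"]

def human_size(num_bytes: int) -> str:
    """Format a byte count using the largest unit that keeps the value below 1000."""
    value = float(num_bytes)
    for unit in UNITS:
        # Stop at the first unit that fits, or at the last unit available.
        if value < 1000 or unit == UNITS[-1]:
            return f"{value:.1f} {unit}"
        value /= 1000

print(human_size(5 * 10**12))  # 5.0 TB
print(human_size(2 * 10**15))  # 2.0 PB
```

One petabyte is therefore a thousand terabytes, or a million gigabytes.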

Variety

These data come in widely varied formats. What do a video, a song, a message on Twitter, a photo posted online, a post on a Facebook page and a reading from an electricity meter, for example, have in common?

Velocity

Velocity is the speed at which data must be collected, analyzed and used. More often than not, these data must be processed in real time.


Two other dimensions were later added to these 3 initial dimensions (together, the five dimensions are also referred to as the “5 Vs”):

Veracity
Value

Veracity

Veracity refers to the reliability and accuracy of data. With so many different forms of data, it becomes more and more difficult to control data quality and accuracy (just imagine Facebook or Twitter messages with abbreviations, slang words, hashtags and typos). Today, however, data analysis technology makes it possible to work with this type of data.
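As a rough illustration of how messy messages can be cleaned before analysis, the sketch below lowercases a message, removes the "#" from hashtags and expands a few abbreviations. The abbreviation table and the function name normalize are assumptions made up for this example; a real system would need far richer rules.

```python
import re

# Hypothetical abbreviation table; a real system would need a much larger one.
ABBREVIATIONS = {"u": "you", "gr8": "great", "thx": "thanks"}

def normalize(message: str) -> str:
    """Lowercase, drop the '#' from hashtags, and expand known abbreviations."""
    text = message.lower().replace("#", "")
    # Keep only word-like tokens; punctuation is discarded.
    words = re.findall(r"[a-z0-9']+", text)
    return " ".join(ABBREVIATIONS.get(w, w) for w in words)

print(normalize("Thx, u r GR8! #BigData"))  # thanks you r great bigdata
```

Note that the unknown abbreviation "r" passes through unchanged, which shows why data quality remains hard to guarantee.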

Value

This dimension focuses on a surprising aspect. In many situations, a small amount of data may be of little interest, whereas a huge volume of data can be meaningful. Take a tweet about a current event, for example. The content of the tweet itself may not be very important, but if the tweet is commented on thousands of times, the number of messages associated with it becomes valuable information (an opinion shared by many, a subject of consensus or controversy …).
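The idea that value comes from the aggregate rather than from a single message can be sketched as a simple count of replies per tweet. The tweet identifiers and reply texts below are made up for illustration, and reply_counts is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical stream of (tweet_id, reply_text) pairs; ids and texts are made up.
replies = [
    ("tweet_a", "agree"),
    ("tweet_b", "lol"),
    ("tweet_a", "not sure about that"),
    ("tweet_a", "so true"),
]

def reply_counts(pairs):
    """Count how many replies each tweet received."""
    return Counter(tweet_id for tweet_id, _ in pairs)

counts = reply_counts(replies)
print(counts.most_common(1))  # [('tweet_a', 3)]
```

No single reply says much, but the count per tweet shows which subject attracts attention.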
 
See also: Definition