What is Big Data?

  • Big Data is one of the most talked-about topics in the tech world.
  • It is not limited to computing; it falls under the blanket term IT, which is now part of almost every other technology, field of study, and business.
  • Big Data is not a big deal, though the term is certainly big enough to confuse people.
  • Here I will try to explain what Big Data is.
What is Big Data?
  • All the data present in user nodes, whether categorized or not, is collectively called Big Data.
  • This data can be analyzed in different ways to get different results.
  • Not every analysis needs to use all of the data.
  • Different analyses use different parts of the data to produce the required results and predictions.
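As a small illustration of the last two points, here is a minimal Python sketch in which two analyses read different fields of the same records. The EVENTS data, its field names, and both function names are hypothetical and chosen only for this example.

```python
from statistics import mean

# Hypothetical records; field names are made up for illustration.
# Some rows have no "amount" field, i.e. the data is only partly categorized.
EVENTS = [
    {"user": "a", "page": "/home", "seconds": 12},
    {"user": "b", "page": "/cart", "seconds": 40, "amount": 25.0},
    {"user": "a", "page": "/cart", "seconds": 8, "amount": 10.0},
    {"user": "c", "page": "/home", "seconds": 20},
]


def average_time_per_page(events):
    """Analysis 1: uses only the 'page' and 'seconds' fields."""
    by_page = {}
    for event in events:
        by_page.setdefault(event["page"], []).append(event["seconds"])
    return {page: mean(values) for page, values in by_page.items()}


def revenue_per_user(events):
    """Analysis 2: uses only 'user' and 'amount'; rows without an amount are skipped."""
    totals = {}
    for event in events:
        if "amount" in event:
            totals[event["user"]] = totals.get(event["user"], 0.0) + event["amount"]
    return totals


if __name__ == "__main__":
    print(average_time_per_page(EVENTS))
    print(revenue_per_user(EVENTS))
```

Neither function touches every field, and the second one ignores two of the four rows entirely.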
How big is Big Data?
  • A huge collection of data is Big Data, but many researchers agree that what sets it apart is that it cannot be manipulated using ordinary spreadsheets and regular database management tools.
  • It requires tools like Hadoop, so that all the data can be analyzed in one go.
  • To make analysis easier, people used to split the data into different sets based on one or more common fields.
  • Now we have tools that can analyze huge volumes of data and categorize the data even as they analyze it.
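The idea of grouping data by a common field and then analyzing each group can be sketched in plain Python. This is only a toy, single-machine imitation of the map, shuffle, and reduce steps that tools like Hadoop distribute across many machines; it does not use the Hadoop API. The RECORDS data, its field names, and the function names are assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical sample records; field names are made up for illustration.
RECORDS = [
    {"region": "north", "sales": 120},
    {"region": "south", "sales": 80},
    {"region": "north", "sales": 30},
    {"region": "east", "sales": 50},
    {"region": "south", "sales": 20},
]


def map_phase(records, key_field, value_field):
    """Emit one (key, value) pair per record."""
    for record in records:
        yield record[key_field], record[value_field]


def shuffle_phase(pairs):
    """Group all values that share the same key (the common field)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Collapse each group into a single total."""
    return {key: sum(values) for key, values in groups.items()}


def total_by_field(records, key_field, value_field):
    """Run the three phases in order on an in-memory list of records."""
    pairs = map_phase(records, key_field, value_field)
    return reduce_phase(shuffle_phase(pairs))


if __name__ == "__main__":
    print(total_by_field(RECORDS, "region", "sales"))
```

Here the grouping happens as part of the analysis itself, rather than as a separate manual step done beforehand.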
Big Data Concepts
Experts describe the core Big Data concepts as the three Vs:
  1. Volume
  2. Velocity
  3. Variety
Some add a few more to the list:
  1. Visualization
  2. Veracity (Reliability)
  3. Variability
  4. Value

Comments

  1. A single place with a good collection of topics. Nice for a person who is starting to read about what big data is.

    But why is there nothing related to gpdb? Expecting that too, with a similar write-up.

    Altogether a nice approach.

    Regards,

    1. Thanks. As expected, a gpdb-related topic will be published soon.

