Intro to Greenplum

Greenplum Database stores and processes large amounts of data by distributing the load across several servers, or hosts.
A logical database in Greenplum is an array of individual PostgreSQL databases working together to present a single database image. 
The master is the entry point to the Greenplum Database system. 
It is the database instance to which users connect and submit SQL statements. 
The master coordinates the workload across the other database instances in the system, called segments, which handle data processing and storage. 
The segments communicate with each other and with the master over the interconnect, the networking layer of Greenplum Database.
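
To make the master/segment split concrete, here is a toy Python sketch of the idea: rows are spread across a few "segments", each segment works only on its own rows, and the "master" combines the partial results. This is only an illustration, not Greenplum code. The segment count, the modulo placement rule, and the function names (`segment_for`, `distribute`, `master_sum`) are all assumptions made up for this example; a real system uses its own distribution logic.

```python
# Toy model of a master coordinating work across segments.
# Everything here is hypothetical and only illustrates the concept.

NUM_SEGMENTS = 3  # assumed number of segments for the example


def segment_for(key, num_segments=NUM_SEGMENTS):
    """Pick a segment for a row key (simple modulo rule, an assumption)."""
    return key % num_segments


def distribute(rows, num_segments=NUM_SEGMENTS):
    """Spread rows across segments so each one stores part of the data."""
    segments = {i: [] for i in range(num_segments)}
    for row in rows:
        segments[segment_for(row["id"], num_segments)].append(row)
    return segments


def master_sum(segments, column):
    """Each segment sums its own rows; the master adds up the partial sums."""
    partials = [sum(row[column] for row in rows) for rows in segments.values()]
    return sum(partials)


rows = [{"id": i, "amount": i * 10} for i in range(1, 7)]
segments = distribute(rows)
total = master_sum(segments, "amount")
print(total)
```

The point of the sketch is that no single segment sees all the data, yet the user who talks to the master still gets one answer, as if it were a single database.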

Greenplum Database is a software-only solution; the hardware and database software are not coupled. 
Greenplum Database runs on a variety of commodity server platforms from Greenplum-certified hardware vendors. 
Performance depends on the hardware on which it is installed. 
Because the database is distributed across multiple machines in a Greenplum Database system, proper selection and configuration of hardware is vital to achieving the best possible performance.

With that said, before diving into each major component of a Greenplum Database system and the hardware considerations and concepts associated with it, we need a clear picture of a few basics:
  1. Disks
  2. RAID
  3. XOR Operation
  4. Basic Networking
  5. LAN
  6. Linux - Basic

After going through the topics above, you should be equipped with enough knowledge to understand the technical terms in the upcoming posts on the major components of a Greenplum Database system and the hardware considerations and concepts associated with each component:
  1. The Greenplum Master 
  2. The Segments
  3. The Interconnect

Additionally, a system may have optional ETL Hosts for Data Loading and the Greenplum Performance Monitor for monitoring query workload and performance.
