Greenplum vs Redshift
1st kindly do have a look onto the link Click Here... Redshift is lack of procedural languages and Greenplum has mapreduce applications. Obviously, Greenplum runs anywhere while Redshift runs only in AWS. Redshift only uses instance types with ephemeral storage so if you stop the cluster, you lose the data. For instance types with ephemeral storage, you can't replace a disk. Maybe Amazon does for Redshift instances but everyone else, they recommend to replace the node if you have a disk failure. Redshift ds2.8xlarge instance type has 16TB of usable storage but the d2.8xlarge instance type has 12 2TB disks. I'm guessing they are using RAID5 but why? If they lose a disk, they will replace the entire node unless they do something special for Redshift. Greenplum running on d2.8xlarge uses RAID0 so you get denser storage and all 48TB. If you lose a disk, the software HA kicks in and the mirror segments take over. Y...