Category Archives: HDFS

Pivotal HAWQ – MPP database on HDFS

In this post I will go through the architecture of Pivotal HAWQ and how it works. I strongly suggest to go through Introduction to Massively Parallel Processing (MPP) database before reading this as you will need some concepts of MPP … Continue reading

Posted in Big Data, Hadoop, HDFS, MPP, Pivotal HAWQ, Postgress | 7 Comments

Greenplum and Hadoop HDFS integration

One of the features of Greenplum 4.2 version is the use of Hadoop HDFS file system to create external tables. This is extremely useful when you want to avoid file movement from HDFS to local folder for data loading. In … Continue reading

Posted in gphdfs, Greenplum Database, Hadoop, HDFS, Postgress | 39 Comments