Parquet Is Best Suited for Nested Data, but Why?

Anyone who works with big data has probably heard this statement, but few know the real reason Parquet has the upper hand when it comes to storing nested data.

It is Parquet's underlying architecture and hierarchical data model that make nested data easy to handle.

This architecture follows the Dremel paper published by Google.
Here is the blog post that justifies the famous statement:

https://blog.twitter.com/engineering/en_us/a/2013/dremel-made-simple-with-parquet
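
To give a feel for what the Dremel model does, here is a minimal sketch of how a nested value can be flattened into a single column. Each value is stored with a repetition level and a definition level, so the nesting can be rebuilt later. The sketch assumes a schema with a repeated group level1 containing a repeated string level2, in the spirit of the nested-lists example in the post linked above. The function name shred is hypothetical and is not part of any Parquet library; real writers do this internally.

```python
def shred(record):
    """Flatten one record (a list of lists of strings) into
    (value, repetition_level, definition_level) triples.

    Assumed schema (illustrative only):
        repeated group level1 { repeated string level2; }
    Max repetition level = 2, max definition level = 2.
    """
    if not record:
        # level1 itself is empty: nothing along the path is defined.
        return [(None, 0, 0)]

    triples = []
    for i, inner in enumerate(record):
        # 0 = this entry starts a new record,
        # 1 = this entry starts a new level1 element in the same record.
        start_level = 0 if i == 0 else 1
        if not inner:
            # level1 is defined but level2 is empty: definition level 1.
            triples.append((None, start_level, 1))
            continue
        for j, value in enumerate(inner):
            # 2 = the value repeats inside the current level2 list.
            repetition = start_level if j == 0 else 2
            triples.append((value, repetition, 2))
    return triples


if __name__ == "__main__":
    for rec in ([["a", "b", "c"], ["d", "e", "f", "g"]], [["h"], ["i", "j"]]):
        for triple in shred(rec):
            print(triple)
```

For the first record the repetition levels come out as 0, 2, 2, 1, 2, 2, 2: a 0 marks the start of a new record, a 1 marks a new inner list, and a 2 marks another value in the same inner list. Because the levels carry the structure, the values themselves can sit together in one flat column, which is what makes nested data fit a columnar layout.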
