BlinkDB, a project being developed at UC Berkeley, where Spark also originated, is a massively parallel interactive query engine that processes tens of terabytes of data with response times of just the blink of an eye. BlinkDB allows users to trade off query accuracy for response time, enabling interactive queries over massive data by […]
It’s very clear that stakeholders from business to IT teams are gravitating towards Big Data, but the first and foremost challenge is finding the right tool fit. And the power of open source regularly brings us additional promising tools. Hopefully, this study of Hadoop distributions will help us with the first […]
Thank you for your valuable time; it’s much appreciated. This time I’d like to share the blog post called “Quick Card On – Apache Hive Joins !”, a handy Apache Hive Joins reference card or cheat sheet. An SQL JOIN clause is used to combine rows from two or more tables, based on a common […]
This time I’m going with a blog post called “The 10 Distributed SQL Query Engine for Big Data!” Many thanks for your time; it’s truly appreciated! Data…Data…Data… Yep, it’s everywhere, from software to salt stores, and all of it is tagged as Big Data. But who is the friend who can help us get insights and value from the data […]
Computes an approximate histogram of a numerical column using a user-specified number of bins. The output is an array of (x, y) pairs, returned as Hive struct objects, that represent the histogram’s bin centers (x value) and bin heights (y value). Even though this function creates a histogram with non-uniform bin widths, to some extent its […]
Step-by-Step Guide for Deploying a Hello World App on a Google Cloud Platform Container using Docker & Kubernetes
The Bot 101 [ Part 2 ] Thanks for reading and sharing the...
“The Top 10 Container Orchestration tools” This...
The 11 DevOps Misconceptions ! In this blog we’ll have...