The top 79 beautiful lines for taking big data architecture from drawing board to production! Dear Data Community, Instead of titling this blog is “The top 79 beautiful lines for taking big data architecture from drawing board to production”, It would be very suitable if we call it as book talk, which is inspired by […]
High Level Framework of Big Data Graph Databases! In Big Data world, it was very much clear that the connected data to store and processing the data was first challenge. And the first ideation is to replace and leverage the tabular SQL Semantic with the graph-centric model. And then the graph is new to big […]
Comparing Architecture Characteristics in Big Data Context! In this blog we’ll explore the differences between microservices and SOA in terms of the defining characteristics of the architecture pattern. In Big Data world, Apache Hadoop has come a long way in its relatively short lifespan. From its beginnings as a reliable storage pool with integrated batch […]
The 7 Habits Of Successful Big Data and NoSQL Projects by Ben Lorica ! Let’s have firstname.lastname@example.org
Big Data Splunk’s Best & Better Practices ! Introduction to Splunk We see servers, devices, apps, logs, traffic, and clouds. We see data, big data, and fat data everywhere. Splunk offers the leading platform for Operational Intelligence. It enables the curious to look closely at what others ignore which is called machine data and find […]
Scalable Apache Spark Solution to Big Data Secondary Sort Problem! – Part 2 In Part -1, we have discussed about the Spark solution to Secondary for larger data sets. Now let’s deep dive in Choice #2 Choice #2: If we have smaller data set then choice will fit, like read and buffer all of the […]
Relationship between MapReduce, Spark, YARN, and HDFS ! In Big Data era Hadoop is the de facto standard for developing of big data applications by using MapReduce framework. And Hadoop is composed of one or more master nodes and any number of slave nodes depends up on the data needed. Hadoop simplifies distributed applications by […]
Today emerging big data technology firm focused on helping enterprises build breakthrough software solutions powered by disruptive enterprise software trends like Machine learning and data science, Cyber-security, Enterprise IOT, and Cloud. So Hadoop is one of the proven software in big data space, but is it only Hadoop. Nope we have many more technologies which […]
Tons of thanks for all 800+ views, 100+ likes, and 15+ comments for (Big) Data in Data Lake vs. Data Warehouse. And all the comments and suggestions are deep motivation behind 2nd Version of Data Lake. To name few Ricky Barron, Winston Sucher, Vinay, Ben Sharma, and Sanjay Pande. The Data Lake Architecture, Four functions […]
As of this writing, Drill is a very active Apache incubating project led by MapR with six to seven companies actively participating, and more than 250+ people currently on the Drill mailing list. The goal of Drill is to create an interactive analysis platform for Big Data using a standard SQL-supporting relational database management system […]
Big Data Meets Microsoft Azure ! For Big Data & Cloud...
How to Ingest HDFS in JSON format using Apache Sqoop ?...
The 4 Key Concepts in the Anatomy of an Apache Spark Job!...
The 1-2-3-4-5-6-7-8-9 of Cognitive Computing ! Dear Data...