Big Data Meets Microsoft Azure ! For Big Data & Cloud Community members this post on “Big Data, Meet Azure” is all about doing big on public cloud Azure. And sure, we no need definition for Big Data and Cloud Computing, but in a line; I would like to called both as Super Nova for […]
The 4 Key Concepts in the Anatomy of an Apache Spark Job! For Big Data & Cloud Community members Apache Spark is Awesome to handle any workloads such as Batch, Streaming, Real-Time, and Ad-hoc. However, to fine tune and optimize of our Apache Spark Applications we need to have a grip on the Apache Spark […]
The top 79 beautiful lines for taking big data architecture from drawing board to production! Dear Data Community, Instead of titling this blog is “The top 79 beautiful lines for taking big data architecture from drawing board to production”, It would be very suitable if we call it as book talk, which is inspired by […]
Top 12 excuses for why our big data isn’t paying off Dear Data Community, recently I read a book from Arcadia Data, which they talked about they reasons & excuses for why the big data fails. Yes, we the big data is momentum, which brings many insights and intelligence for the enterprise. But however few […]
Top 150 Big Data & Cloud Computing Terminologies for Data Professional Big Data + Cloud Computing Glossary for Community Say Hi to Henry the owl! Henry, the smartest and wisest of all, is the Big Data and Cloud expert who has gained his knowledge by surfing the clouds his entire life . Being a curious Owl, […]
Get started with R Lang for Beginner’s In this modern world, huge amount of data originating from various sources like financial transactions, or geographical data or ecommerce websites there are plenty of data sources in the form of raw data’s. Next question will running in our mind how we are going to make those raw […]
“The Top 10 Container Orchestration tools” This time in the “The x Series of blog”, we love to bring “The Top 10 Container Orchestration tools”. And now what is container orchestration ? In Development and Quality Assurance (QA) environments, we can get away with running containers on a single host to develop and test applications. […]
Big Data Stack 2.0 and Beyond! The Google File System (GFS), MapReduce, and Bigtable are Googles & data industries Big Data revolution, which constructs Big Data Stack 1.0. Dough Cutting actually integrated the above released concepts into a tool called Hadoop. GFS + MapReduce + Bigtable > HDFS + MapReduce + HBase; which is together […]
The 7 Habits Of Successful Big Data and NoSQL Projects by Ben Lorica ! Let’s have firstname.lastname@example.org
Top 16 Hadoop Built-in Ingress and Egress Tools ! Hadoop has revolutionized data ingestion, data processing and enterprise data warehousing, but its explosive growth has come with a large amount of uncertainty, hype, and confusion. With this blog, enterprise decision makers will receive short quick insights on what all the 16 Hadoop build-in Ingress and […]
The 9 Key steps to implement Big Data DevOps ! Per WiKi Definition: DevOps (a clipped compound of development and operations) is a culture, movement or practice that emphasizes the collaboration and communication of both software developers and other information-technology (IT) professionals while automating the process of software delivery and infrastructure changes. Per Gene Kim(author of The […]
Tuning Handbook of Apache Kafka! We all know the power and advantages of Apache Kafka. It is publish-subscribe messaging system which basically has three major components Apache Kafka Consumer Apache Producer Apache Kafka Broker This doc is all about how we can achieve maximum throughput while planning to have Kafka in production or in POCs. […]
Big Data Meets Microsoft Azure ! For Big Data & Cloud...
How to Ingest HDFS in JSON format using Apache Sqoop ?...
The 4 Key Concepts in the Anatomy of an Apache Spark Job!...
The 1-2-3-4-5-6-7-8-9 of Cognitive Computing ! Dear Data...