Self-Learn Yourself Apache Spark in 21 Blogs – #8 In this blog let us discuss on How to loading data, what is Lambdas, How to do Transforming Data and more on Transformations. And want to have quick read on the other blogs in this learning series. Apache Spark can load from any input sources like […]
Self-Learn Yourself Scala in 21 Blogs – #5 Blog 5 – Does functional programming matters and what are monads? Missed the previous blogs have a quick look with Self-Learn Yourself Scala in 21 Blogs (#1, #2, #3, #4). In this blog let’s understand for Scala developers does the functional programming matters and also what is […]
11 Key Tuning Checklists for Apache Hadoop! Apache Hadoop is a well know and de-facto framework for processing large big data sets through distributed & parallel computing. YARN(Yet Another Resources Negotiator) allowed Hadoop to evolve from a simple MapReduce engine to a big data ecosystem that can run heterogeneous (MapReduce and non-MapReduce) apps concurrently. This results […]
10 Key Features in Apache Storm 1.0.0 The Apache Storm community recently announced the release of Apache Storm 1.0.0 stable. This is a noteworthy release that delivers several powerful features that relate to enterprise readiness, operational simplicity and ease of use by dramatically enhancing areas around performance, scalability, debugging ability, maintainability, and manageability. Apache Storm […]
This blog introduces the convergence of complementary technologies – Spark, Mesos, Akka, Cassandra and Kafka (SMACK) stack. And we will see how Apache Kafka can help us to get data under control and what is it role in our data pipeline, how Spark & Akka help us to process the data, and how Cassandra to […]
Today emerging big data technology firm focused on helping enterprises build breakthrough software solutions powered by disruptive enterprise software trends like Machine learning and data science, Cyber-security, Enterprise IOT, and Cloud. So Hadoop is one of the proven software in big data space, but is it only Hadoop. Nope we have many more technologies which […]
Blog 4 – OOP to Functional Programming We are already using functional programming using Scala with the previous blog series of Self-Learn Yourself Scala in 21 Blogs (#1, #2, #3). Let’s start with defining functional programming which is a programming paradigm that models computation as the evaluation of expressions and expressions are built using functions […]
Tons of thanks for all 800+ views, 100+ likes, and 15+ comments for (Big) Data in Data Lake vs. Data Warehouse. And all the comments and suggestions are deep motivation behind 2nd Version of Data Lake. To name few Ricky Barron, Winston Sucher, Vinay, Ben Sharma, and Sanjay Pande. The Data Lake Architecture, Four functions […]
Blog 3 – Functional Programming & Data Structures In Scala Functional programming and functional data structures are very interesting and powerful. Actually it supports both types data structures called immutable & mutable. And let us discuss on two more vital concepts called type parameterization and higher-order functions. The type is similar to Java generics; it […]
8 Breaking Changes in Apache Flink 1.0.0 ! Apache Flink is an open source platform for distributed stream and batch data processing. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. Flink also builds batch processing on top of the streaming engine, overlaying […]
Looking For College Projects ?
We should be excited that Apache Hive community have released the largest release and announced the availability of Apache Hive 2.0.0. It brings great and exciting improvements in the category of new functionality, Performance, Optimizations, Security, and Usability. Let us explore the features in detail below; HBase to store Hive Metadata – The current metastore […]
Top 12 excuses for why our big data isn’t paying off...
The Bot 101 [ Part 4 ] Dear Bot community members, thanks...
The List of 10+ Bot Platform for Developer and Architects!...
Top 150 Big Data & Cloud Computing Terminologies for...