Self-Learn Yourself IoT in 21 Blogs – #1 – In this we will be seeing What is IoT ? Why do we need it? Significance & Impact on Modern life? Time to Greet the New Clone that it set to rule the world ! Hello All ! Well, this is my first blog for dataottam and […]
Big Data is problem statement and it can be solved with one of the tools like Apache Hadoop. But having Apache Hadoop as infra to do our proof of concepts, proof of values is little challenging. Hence we brought 3 click ideas to have your Apache Hadoop installed. What is Perquisite? Ubuntu 14.04 Internet Connection […]
In Blog 4, we will see what are Apache Spark Core and its ecosystem and Apache Spark on AWS Cloud. Click to have quick read on blog 1, blog 2, and blog 3 in this learning series. Apache Spark has many components including Spark Core which is responsible for Task Scheduling, Memory Management, Fault Recovery, […]
In this Blog 3 – We will see what is Apache Spark’s History and Unified Platform for Big Data, and like to have quick read on blog 1 and blog 2. Spark was initially started by Matei at UC Berkeley AMPLab in 2009, and open sourced in 2010 under a BSD license. In 2013, the […]
By this blog we will share the titles for learning Apache Spark, Basics on Hadoop which is one of the big data tool, and motivations for Apache Spark which is not replacement of Apache Hadoop, but its friend of big data. Blog 1 – Introduction to Big Data Blog 2 – Hadoop, Spark’s Motivations Blog […]
In this new year 2016, we should be excited that Apache Spark community have released and announced the availability of Apache Spark 1.6, which is the 7th release on the 1.x line. Committers – Contributors to Spark had crossed 1000, which is doubled. Patches – Apache Spark 1.6 version includes & covers 1000 patches. Run […]
The term Data Lake has been gaining popularity recently as most of the enterprises have incorporated it into their analytics software’s. Every word and phrase that is used to describe Data Lake have provided us much useful information about how we interpret it. So we at dataottam decided to understand the various ways Data Lake […]
We have received many requests from friends who are constantly reading our blogs to provide them a complete guide to sparkle in Apache Spark. So here we have come up with learning initiative called “Self-Learn Yourself Apache Spark in 21 Blogs”. We have drilled down various sources and archives to provide a perfect learning path […]
Best wishes to you this holiday, and Happy New Year, from all of us at dataottam. This blog introduces Spark’s core abstraction for working with data, the RDD (Resilient Distributed Dataset). An RDD is simply a distributed collection of elements or objects (Java, Scala, Python, and user defined functions) across the Spark cluster. In Spark […]
As of this writing, Drill is a very active Apache incubating project led by MapR with six to seven companies actively participating, and more than 250+ people currently on the Drill mailing list. The goal of Drill is to create an interactive analysis platform for Big Data using a standard SQL-supporting relational database management system […]
Is Apache Hadoop the only option to implement Big Data? Yes, Hadoop is not only the options to big data problem. Hadoop is one of the solutions. The HPCC (High Performance Computing Cluster) Systems technology is an open source data driven and intensive processing and delivery platform developed by LexisNexis Risk Solutions. HPCC Systems incorporates […]
FINALISTS 2016 42Gears Ezetap MedGenome Sankalp Semiconductor Ad2pro Media Foradian MoEngage Sapience Adadyn Gramener Nanobi Seclore Amagi Media GS Lab Nihilent SilverPush Attune Technologies Happiest Minds OSSCube Squareyards Aujas i-exceed Perpetuuiti Stelae Technologies BRIDGEi2i Incture Pervazive Take Solutions Cactus Communications Indix PurpleTalk Teabox Cross-Tab Marketing IntelliGrape Software Quatrro TechFront Curadev Kyazoonga RapidValue Telerad Tech Dexler […]
The Bot 101 [ Part 1 ] For me bot is new word, on first time...
Getting Started with Google Cloud Platform ! Last month got...
PocketGear on Getting Started with Google Cloud Platform !...
Top 10 Reasons to Run Hadoop in the Public Cloud ! Hadoop...