Data Lake Architecture Considerations & Composition In our last blog we saw the key benefits of a Data Lake; now let’s dive deep into the internals of a Data Lake by discussing the key considerations and composition. Architecture Considerations: As with any solution, it is practically difficult to arrive at a one-size-fits-all architecture; hence […]
TCP/IP Layer-wise IoT Protocols Hello everyone! Thanks a lot for your valuable responses to the previous blog. In this post, I will explain the basics of the TCP (Transmission Control Protocol)/IP (Internet Protocol) stack and the IoT protocols associated with each layer. Anyone who has prior knowledge of the TCP/IP stack can […]
Celebrate the Big Data Problems – #2 How do you identify the number of buckets for a Hive table while executing HiveQL DDL? The dataottam team has come up with a blog-sharing initiative called “Celebrate the Big Data Problems”. In this series of blogs we will share our big data problems using CPS (Context, […]
Celebrate the Big Data Problems – #1 Every day we face many big data problems in production, PoCs, and other settings. Do we have a common repo to collect and share them? No, as far as we know, we don’t. As always, dataottam is looking forward to sharing its learnings with the community to celebrate their similar, […]
Big Data is a problem statement, and it can be solved with tools like Apache Hadoop. But having Apache Hadoop as infrastructure for our proofs of concept and proofs of value is a little challenging. Hence we bring 3-click ideas to get your Apache Hadoop installed. What are the prerequisites? Ubuntu 14.04, Internet connection […]
Is Apache Hadoop the only option to implement Big Data? No, Hadoop is not the only option for big data problems; it is one of the solutions. HPCC (High Performance Computing Cluster) Systems technology is an open source, data-driven and data-intensive processing and delivery platform developed by LexisNexis Risk Solutions. HPCC Systems incorporates […]
Thanks to Zaloni and to Creating a Data-Driven Organization by Carl Anderson. It is a fantastic, very well narrated book, and I would like to share our learnings with our big data & IoT community. Many organizations think that simply because they generate a lot of reports or have many dashboards, they are data-driven. Although those activities […]
It’s very clear that all stakeholders, from business to IT teams, are gravitating towards Big Data, but the first and foremost challenge is finding the right tool fit. And the power of open source often brings us additional, promising tools. We hope the study on Hadoop distributions will help us with the first […]
I heard this in an AtScale webinar and found it great and useful, so I am happy to share it with our big data & analytics community. It’s very clear that Hadoop is growing from its batch-processing origins into a flexible, economical hub where enterprises store raw data, keep archival data active, and grow their options for data investigation, […]
The Data Lake vs. Data Warehouse in Big Data! Big Data use cases are evolving across verticals such as Insurance, Healthcare, Manufacturing, Financial, Retail and more. Customers are using Big Data to improve their top and bottom lines and deliver business value. In this data-driven era, enterprise readiness and data management […]
Team, tons and thousands of thanks for reading and engaging! This time it is my pleasure to share with you all my learnings on data import and export from Hadoop’s file system, which is the core component for pumping data to databases, warehouses, analytics and the business. We titled it “Heart of the Hadoop is HDFS”. It’s no […]
Team, this time I go with the title “Top 3 methods of skipping big data’s bad data using Hadoop!”, which describes how to get corrupt records out of large data sets that contain data in different formats. While doing our analysis, if the corrupt records are a small percentage we can ignore or […]
Step-by-Step Guide to Deploying a Hello World App on a Google Cloud Platform Container using Docker & Kubernetes
The Bot 101 [Part 2] Thanks for reading and sharing the...
“The Top 10 Container Orchestration tools” This...
The 11 DevOps Misconceptions! In this blog we’ll have...