Big Data is problem statement and it can be solved with one of the tools like Apache Hadoop. But having Apache Hadoop as infra to do our proof of concepts, proof of values is little challenging. Hence we brought 3 click ideas to have your Apache Hadoop installed. What is Perquisite? Ubuntu 14.04 Internet Connection […]
Is Apache Hadoop the only option to implement Big Data? Yes, Hadoop is not only the options to big data problem. Hadoop is one of the solutions. The HPCC (High Performance Computing Cluster) Systems technology is an open source data driven and intensive processing and delivery platform developed by LexisNexis Risk Solutions. HPCC Systems incorporates […]
Thanks to Zaloni and Creating a Data-Driven Organization, Carl Anderson. The fantastic book, very well narrated in this book and I like to share our learning with our big data & IoT community. Many organizations think that simply because they generate a lot of reports or have many dashboards, they are data-driven. Although those activities […]
It’s very clear that every stake holders from Business to IT teams are traction towards Big Data, but the first and foremost challenge is getting the right tool fitment. And the power of open source brings us more additional and potential tools often. Wish, the study on Hadoop Distribution will help us on the first […]
Listened and got it from atscale webinar, felt great and use full hence happy to share with our big data & analytics community. It’s very clear Hadoop is budding from its batch processing origins into a flexible, economical hub where enterprise store raw data, keep archival data active, and grow their options for data investigation, […]
The Data Lake vs. Data Warehouse in Big Data ! Big Data use cases are in evolution from all over the verticals like Insurance, Healthcare, Manufacturing, Financial, Retail and more. Customers are using Big Data to improve top & bottom line revenue with business values. With this data driven era, enterprise readiness and data management […]
Team, tons & thousands of thanks for reading and engaging ! This time am pleasure to share with you all my learning’s in Data Import, Export from Hadoop’s file system; which is core component to pump the data to Database, Warehouse, Analytics and Business. We titled as “Heart of the Hadoop is HDFS“. It’s no […]
Team, this time i go with the title called “Top 3 methods of skipping big data’s bad data using Hadoop !“ which describes about how to get corrupt records out from the large data sets which has different format of data. While doing our analysis if the corrupt records are in small percentage we can ignore or […]
Team thanks for reading & engaging ! This time am planned to share with you the my learning on Hadoop Schedulers; titled “Simplified Hadoop Schedulers Overview !” With the help of choosing suitable scheduler, we can make the response times faster for all smaller jobs and also for all the production jobs it’s guaranteed with SLA’s (Service […]
Big Data gives new insights into what people do on their own, and on a massive scale. Thick Data reveals motivations, intent, emotions that might not be obvious from Big Data. It’s not mine, learnt from Data-informed Product Design book by Pamela Pavliscak. Tons of Thanks to O’Reilly team. Much Thanks for your valuable time.
Pleased to share the Single Slider On Data Lake vs. Data Warehouse includes defintion, key properties, use cases & user groups. To conclude, for enterprise data driven organization both data ware house & data lake plays vital role. In nutshell Data Lake + Data Warehouse = Business Value ! Thanks for your time (TREASURED) and engaging. As […]
We don’t have any fixed definition for big data. Basically it refers to the technologies which we are used to extract, store, transform and access in organizations. It can extract structured, unstructured and semi structured data. Currently the data is multiplying at a rapid speed and it becomes difficult for organizations to have innovative & customer centric […]
Big Data Meets Microsoft Azure ! For Big Data & Cloud...
How to Ingest HDFS in JSON format using Apache Sqoop ?...
The 4 Key Concepts in the Anatomy of an Apache Spark Job!...
The 1-2-3-4-5-6-7-8-9 of Cognitive Computing ! Dear Data...