Data Analytics using Hadoop and Apache Spark Report
C-CODES in association with the Department of Computer Engineering of Don Bosco College of Engineering, organised an online webinar on “Data Analytics using Hadoop and Apache Spark”. The speaker for the session was Mr. Virendra Dakhode a Senior Analyst Programmer at TCS. The webinar was held for the S.E and T.E computer students through Google Meet on October 9, 2021 under the guidance Prof. Merwyn D’Souza and Prof. Siya Naik.
The session began with a concise introduction of the speaker by Ms. Ritika Ghanti. Following that Mr. Virendra Dakhode started his presentation off by explaining about data analytics and its importance. He also connected it to everyday life giving relevant examples of how it was used by google and in fraud detection. Going deeper into the subject he explained the types of data and its aspects. He also explained how legacy framework works. He then went on to discussing a scaling problem about servers crashing when data in data bases increasing exponentially and how Hadoop and Apache spark are data processing engines used to solve this problem where big data is generated by telecom, bank, e-commerce, search engine, healthcare, android and IOS apps, advertisement and other such industries which store data that shouldn’t be deleted, data generated through social media apps through millions of posts on Instagram and Facebook, tweets on Tweeter, videos on YouTube and votes on Reddit, etc. Further he also interacted with the students ensuring they understood what is being explained. He then carried on proposing questions on Big Data and giving elaborate explanations. He also explained the characteristics of Big data, the 3V’s: volume, velocity and variety. Finally, he gave a detailed step by step breakdown of the Hadoop ecosystem and simplified the complex jargon so that it was easily understood. He then concluded his presentation with use cases of Hadoop and Apache spark.
Mr. Virendra said that “Data analytics technologies and techniques give organizations a way to analyze data sets and gather new information.”
His presentation was followed up by a Q&A session where everyone’s doubts were cleared and Mr. Virendra shared his own experience in his industry. To conclude the meet Ms. Ritika Ghanti thanked everyone.