About Author:

Mohmmad Shahnawaz Ahangar

Shahnawaz is a Research Associate at CloudThat and a certified Microsoft Azure Administrator. He has worked on Data Analytics, Machine Learning, and AI project migrations to the cloud for clients across various industry domains. He is interested in learning new technologies and writing blogs on advanced tech topics.

Spark DataFrames for Machine Learning Applications – Part 2

Posted on November 9, 2022 by Mohmmad Shahnawaz Ahangar | Comments(0)

A Spark DataFrame is a collection of data organized into rows and named columns, where each column can hold feature data. DataFrames are akin to database tables or Pandas DataFrames but come with richer optimization, which lets them work with large-scale datasets in machine learning applications and algorithms. Fundamentally, a DataFrame is a Dataset organized into named columns.
