With the explosive growth of data, the use of big data technology machine learning classification algorithms to predict the results can improve the intelligent classification of data. It can provide data support for predicting classification in advance. Filter out the classification results to improve the efficiency of data processing and data realization. This article first introduces the development process of machine learning under big data, introduces the mainstream distributed processing framework spark, and then compares the advantages and disadvantages of classification algorithms under big data.