We imported tens of thousands of organic molecules from Reaxys database as the sample data set for predicting the molecular melting points. We used the PaDEL package to generate the molecular descriptors, followed by a machine learning approach to find the mathematic relationship of describing the melting point in terms of structural characteristics. In this study, we showed a systematic process of using clustering method to reduce the descriptor dimensions and to categorize a highly diverse data set. We finally applied the XGBoost nonlinear regression method to the subgroup data set and obtained a statistically significant model. The model was found to fulfill the chemical consensus of molecular melting point, having contribution from the polarization force, dispersion force, and symmetry of the molecular configuration in the molecular structure.