In image processing, the convolutional neural net- work extracts better features than the previous manual features because of the special structure of the CNN. The combination of convolution and pooling layers enables the CNN to extract better features in the image.There are many network models of convolution neural networks, but a convolution neural network model generally consists of several convolution layers, pooling layers and full connection layers. In this paper, we will discuss some newest models, MobileNet and EfficientNet, using Google Landmark Recognition's dataset to visualize these two CNN models' performance on image classification by comparing their accuracy and public score.