We design a big data platform based on Hadoop (HDFS, Spark, MapReduce) and Hive. The platform is built on the virtualization cloud platform. Firstly, multiple virtual machines are created on the cloud platform, and then Hadoop distributed storage system and distributed computing system are deployed on the virtual machines cluster. The Hive data query and analysis platform is deployed based on Hadoop. At the same time, we have done Internet search and social data analysis based on the big data platform. The experimental results show that in the loading, mapping, query and statistical analysis of Internet big data, the big data platform cluster constructed by multiple machines has higher efficiency and throughput rather than that of a single machine.