The objective of the thesis is to combine artificial intelligence with the robot. The presented method applies deep reinforcement learning to 6-DoF manipulator to perform the specific task. By designing the neural network and reward function, the joint torque produced by an agent can move the end-effector to the desired position and orientation during the process. Its training algorithm implements deep deterministic policy gradient (DDPG) and twin delayed DDPG (TD3) respectively. Furthermore, the training error of manipulator is analyzed in various situations, such as joint angle constraints, trajectories, algorithms and neural network architectures. Ultimately, the optimal controller is obtained to achieve precise control over robot arm.