Computer vision is now widely applied in daily life, for example in autonomous vehicles, portable applications, augmented reality systems, and medical image analysis, so architectures that combine lower complexity with higher accuracy have become a priority. Various object detection methods, such as the R-CNN and YOLO families, have been developed to improve accuracy and reduce complexity. In this study, we propose ResEff-YOLO, an enhanced version of the YOLOv8 model that further improves detection accuracy and efficiency. Specifically, we adopt EfficientHead as the detection head, which optimizes computational resource utilization and improves inference speed while maintaining detection accuracy. For the backbone network, we incorporate the ResNet18d module together with the SPPF_LSKA module, which strengthens the network's ability to learn multi-scale features beyond what traditional convolutional layers provide. The deep stem structure of ResNet18d helps retain more spatial information, while SPPF_LSKA introduces Large Separable Kernel Attention (LSKA) into the SPPF feature extractor, improving multi-scale feature extraction and the handling of complex scenes. Experiments on the VOC dataset demonstrate that ResEff-YOLO outperforms the YOLOv8 series, with a mean average precision (mAP) improvement of approximately 4% and an mAP50-95 improvement of 4.2%.
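The abstract reports mAP50-95 without defining it. As a minimal sketch, assuming the common COCO-style convention (mean average precision averaged over ten IoU thresholds from 0.50 to 0.95 in steps of 0.05), the metric can be illustrated as follows. The helper names `iou`, `IOU_THRESHOLDS`, and `map50_95` are hypothetical and are not taken from the paper or from any YOLOv8 codebase, and the abstract does not state which IoU threshold its plain mAP figure uses.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp at zero so non-overlapping boxes give an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Assumed convention: IoU thresholds 0.50, 0.55, ..., 0.95.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]


def map50_95(map_per_threshold):
    """Average ten per-threshold mAP values into a single mAP50-95 score."""
    if len(map_per_threshold) != len(IOU_THRESHOLDS):
        raise ValueError("expected one mAP value per IoU threshold")
    return sum(map_per_threshold) / len(map_per_threshold)


# A prediction counts as a match at threshold t only if its IoU with the
# ground-truth box is at least t, so stricter thresholds reward tighter boxes.
prediction = (2, 0, 12, 10)
ground_truth = (0, 0, 10, 10)
overlap = iou(prediction, ground_truth)
matched_at = [t for t in IOU_THRESHOLDS if overlap >= t]
```

Because the average includes strict thresholds such as 0.90 and 0.95, a gain in mAP50-95 indicates better box localization as well as better classification, which is why it is usually reported alongside a single-threshold mAP.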