Implementation of ResNet-50 on End-to-End Object Detection (DETR) on Objects

Authors

  • Endang Suherman Universitas Pradita, Serpong, Banten, Indonesia
  • Ben Rahman Fakultas Teknologi Komunikasi dan Informatika, Universitas Nasional, Indonesia
  • Djarot Hindarto Fakultas Teknologi Komunikasi dan Informatika, Universitas Nasional, Indonesia
  • Handri Santoso Universitas Pradita, Serpong, Banten, Indonesia

DOI:

10.33395/sinkron.v8i2.12378

Keywords:

CNN, Computer Vision COCO Dataset, End-to-End Object Detection, Object Detection, ResNet-50

Abstract

Object recognition in images is one of the problems that continues to be faced in the world of computer vision. Various approaches have been developed to address this problem, and end-to-end object detection is one relatively new approach. End-to-end object detection involves using the CNN and Transformer architectures to learn object information directly from the image and can produce very good results in object detection. In this research, we implemented ResNet-50 in an End-to-End Object Detection system to improve object detection performance in images. ResNet-50 is a CNN architecture that is well-known for its effectiveness in image recognition tasks, while DETR utilizes Transformers to study object representations directly from images. We tested our system performance on the COCO dataset and demonstrated that ResNet-50 + DETR achieves a better level of accuracy than DETR models that do not use ResNet-50. In addition, we also show that ResNet-50 + DETR can detect objects more quickly than similar traditional CNN models. The results of our research show that the use of ResNet-50 in the DETR system can improve object detection performance in images by about 90%. We also show that using ResNet-50 in DETR systems can improve object detection speed, which is a huge advantage in real-time applications. We hope that the results of this research can contribute to the development of object detection technology in images in the world of computer vision.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Al Jaberi, S. M., Patel, A., & AL-Masri, A. N. (2023). Object tracking and detection techniques under GANN threats: A systemic review. Applied Soft Computing, 139, 110224. https://doi.org/10.1016/j.asoc.2023.110224

Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-End Object Detection with Transformers. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12346 LNCS, 213–229. https://doi.org/10.1007/978-3-030-58452-8_13

Chen, Y., Lin, Y., Xu, X., Ding, J., Li, C., Zeng, Y., Liu, W., Xie, W., & Huang, J. (2022). Classification of lungs infected COVID-19 images based on inception-ResNet. Computer Methods and Programs in Biomedicine, 225, 107053. https://doi.org/10.1016/j.cmpb.2022.107053

Chirgaiya, S., & Rajavat, A. (2023). Tiny object detection model based on competitive multi-layer neural network (TOD-CMLNN). Intelligent Systems with Applications, 18(September 2022), 200217. https://doi.org/10.1016/j.iswa.2023.200217

de Zarzà, I., de Curtò, J., & Calafate, C. T. (2022). Detection of glaucoma using three-stage training with EfficientNet. Intelligent Systems with Applications, 16(September), 1–10. https://doi.org/10.1016/j.iswa.2022.200140

García-Aguilar, I., García-González, J., Luque-Baena, R. M., & López-Rubio, E. (2023). Automated labeling of training data for improved object detection in traffic videos by fine-tuned deep convolutional neural networks. Pattern Recognition Letters, 167, 45–52. https://doi.org/10.1016/j.patrec.2023.01.015

Kong, L., Wang, J., & Zhao, P. (2022). YOLO-G: A Lightweight Network Model for Improving the Performance of Military Targets Detection. IEEE Access, 10, 55546–55564. https://doi.org/10.1109/ACCESS.2022.3177628

Li, B., & Lima, D. (2021). Facial expression recognition via ResNet-50. International Journal of Cognitive Computing in Engineering, 2(January), 57–64. https://doi.org/10.1016/j.ijcce.2021.02.002

Ouf, N. S. (2023). Leguminous seeds detection based on convolutional neural networks: Comparison of faster R-CNN and YOLOv4 on a small custom dataset. Artificial Intelligence in Agriculture. https://doi.org/10.1016/j.aiia.2023.03.002

Paymode, A. S., & Malode, V. B. (2022). Transfer Learning for Multi-Crop Leaf Disease Image Classification using Convolutional Neural Network VGG. Artificial Intelligence in Agriculture, 6, 23–33. https://doi.org/10.1016/j.aiia.2021.12.002

Prabhakaran, K., & Debebe, T. (2023). Skin Cancer Cancer diagnosis diagnosis with with Yolo Yolo Deep Deep Neural Neural Network Network. Procedia Computer Science, 220, 651–658. https://doi.org/10.1016/j.procs.2023.03.083

Rajeshkumar, G., Braveen, M., Venkatesh, R., Josephin Shermila, P., Ganesh Prabu, B., Veerasamy, B., Bharathi, B., & Jeyam, A. (2023). Smart office automation via faster R-CNN based face recognition and internet of things. Measurement: Sensors, 27(February), 100719. https://doi.org/10.1016/j.measen.2023.100719

Santos-Bustos, D. F., Nguyen, B. M., & Espitia, H. E. (2022). Towards automated eye cancer classification via VGG and ResNet networks using transfer learning. Engineering Science and Technology, an International Journal, 35, 101214. https://doi.org/10.1016/j.jestch.2022.101214

Sarwinda, D., Paradisa, R. H., Bustamam, A., & Anggia, P. (2021). Deep Learning in Image Classification using Residual Network (ResNet) Variants for Detection of Colorectal Cancer. Procedia Computer Science, 179(2019), 423–431. https://doi.org/10.1016/j.procs.2021.01.025

Sze, E., Santoso, H., & Hindarto, D. (2022). Review Star Hotels Using Convolutional Neural Network. 7(1), 2469–2477.

Wahjuni, S., & Nurarifah, H. (2023). Faster RCNN based leaf segmentation using stereo images. Journal of Agriculture and Food Research, 11(November 2022), 100514. https://doi.org/10.1016/j.jafr.2023.100514

Xue, G., Li, S., Hou, P., Gao, S., & Tan, R. (2023). Research on lightweight Yolo coal gangue detection algorithm based on resnet18 backbone feature network. Internet of Things (Netherlands), 22(March), 100762. https://doi.org/10.1016/j.iot.2023.100762

Downloads


Crossmark Updates

How to Cite

Suherman, E. ., Rahman, B. ., Hindarto, D. ., & Santoso, H. . (2023). Implementation of ResNet-50 on End-to-End Object Detection (DETR) on Objects. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 8(2), 1085-1096. https://doi.org/10.33395/sinkron.v8i2.12378

Most read articles by the same author(s)

1 2 3 4 > >>