Django Girls blog

Feb. 6, 2026, 11:05 a.m.

From CNNs to YOLO: Understanding Object Detection

Fundamentals Before Training a Steel Defect Detection Model

This article is a learning reflection. I share how I came to understand the workflow of CNNs for image analysis, supported by visualization tools and a few sketches I drew during the learning process.

With this foundation, I then look at how YOLO builds on CNNs to reframe object detection.

Object Detection and Image Analysis

When I first started learning about neural networks, machine learning, PyTorch, etc., my understanding of image analysis was rather vague. I knew that images could be “analyzed” by models, but I did not clearly distinguish between different types of tasks.

Over time, I realized that image analysis involves various tasks, while object detection is one of them, different from classification, image segmentation, etc. Instead of labelling the whole image with a single category, such as “shirt”, “dress”, or “coat”, in image classification, object detection refers to analysing:

Please read the full article in my Medium stories. Here is the link: 👉 From CNNs to YOLO: Understanding Object Detection