skip to content
gigarouter gigarouter
models / object detection · coming soon

YOLOv8 Anime Face Detector

Fuyucchi/yolov8_animeface

published Oct 2024 · updated Oct 2024

YOLOv8 Anime Face Detector is a detection model that identifies and localizes anime character faces in images, based on the YOLOv8x architecture and trained on 10,000 manually annotated anime illustrations from safebooru.

status
coming soon
API providers
0
downloads / mo
7.8K
license
agpl-3.0

specs

TaskObject Detection (anime face detection)
ArchitectureYOLOv8x (variant trained at 1280x1280 input)
Parameters68.2 million
LicenseAGPL-3.0 (Ultralytics YOLOv8 base)

about this model

Yolov8_animeface is a detection model that identifies anime faces in images, based on the YOLOv8x architecture and fine-tuned on a custom dataset of 10,000 manually annotated images from Safebooru.

Architecture and Training

The model uses the YOLOv8x6 variant (a larger, non-standard input size variant of YOLOv8x) and was trained for 300 epochs at 1280x1280 pixel resolution. Training took approximately 110 hours on an RTX A4000 GPU. The dataset was split 70% training, 20% validation, and 10% testing.

Performance

On the test set (1,002 images with 1,562 instances), the model achieves:

MetricValue
Box Precision (P)0.957
Recall (R)0.924
mAP500.955
mAP50-950.534

Inference speed is 81.9ms per image at 1280x1280 resolution, with 1.3ms preprocessing and 0.8ms postprocessing.

Comparison with YOLOv5-based Model

Compared to the earlier YOLOv5-based anime face detector (zymk9/yolov5_anime, now unavailable), this model shows significant improvements on the same test set:

MetricYOLOv8-animefaceYOLOv5-anime
Precision0.9560.778
Recall0.9190.685
mAP500.9530.633
mAP50-950.5320.232

The YOLOv8-based model demonstrates higher precision and recall with fewer false positives, though the YOLOv5 model showed higher confidence in its predictions.

Confusion matrix on test set Precision-recall curve on test set

Visual Results

Comparison of manual annotations (left) and model predictions (right):

Manual annotation example 1 YOLOv8-animeface prediction example 1 Manual annotation example 2 YOLOv8-animeface prediction example 2

The base YOLOv8x model (from which this is fine-tuned) has 68.2M parameters and 257.8B FLOPs, achieving 53.9 mAP50-95 on COCO at 640px input. This fine-tuned model trades some general object detection capability for specialized anime face detection performance at higher resolution.

best for

FAQ

What is the model's detection accuracy?

On the test set it achieves 0.955 mAP50 and 0.534 mAP50-95 at 1280x1280 input.

How does this model compare to the older YOLOv5 anime face detector?

It significantly outperforms YOLOv5-anime on the same dataset: mAP50 0.953 vs 0.633, mAP50-95 0.532 vs 0.232.

What input size and format does the model expect?

It expects images resized to 1280x1280 pixels, in standard RGB format.

What is the output format?

It outputs bounding boxes with confidence scores for each detected anime face.

How can I call this model via the gigarouter API?

Use the gigarouter OpenAI-compatible endpoint with your API key; refer to the gigarouter documentation for the specific endpoint and request schema.

not yet live

We're benchmarking and onboarding YOLOv8 Anime Face Detector as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related object detection models

compare all →