Sara Inácio1, Hugo Proença1, 2, João C. Neves1, 3
1University of Beira Interior
2IT: Instituto de Telecomunicações
3NOVA Lincs
SortWaste is a densely annotated dataset for waste detection created to support research in computer vision for waste management. Collected at an industrial sorting facility, it captures real-world conditions such as cluttered scenes, overlapping objects, and deformed materials, reflecting the visual complexity encountered in operational recycling lines.
The dataset contains high-quality labeled images across 8 waste categories and a range of environmental conditions. By providing accurate and consistent annotations, SortWaste aims to accelerate the development of automated sorting systems and contribute to more efficient recycling and resource recovery.
Annotated examples from SortWaste
Top-down capture setup at the sorting line
Collected at a Mechanical-Biological Treatment (MBT) facility, the dataset captures waste as it appears on a real conveyor belt:
Examples of all annotated classes
| Split | HDPE | ECAL | PET | Mixed Soft Plastic | Mixed Rigid Plastic | Cardboard | Metal | PET Oil | # Images | # All Objects |
|---|---|---|---|---|---|---|---|---|---|---|
| Train | 16803 | 13649 | 11976 | 9077 | 7066 | 1524 | 945 | 802 | 3705 | 61842 |
| Validation | 4972 | 2552 | 2108 | 1443 | 1120 | 425 | 277 | 168 | 780 | 13065 |
| Test | 3269 | 3026 | 2722 | 1817 | 1230 | 207 | 215 | 132 | 776 | 12618 |
| Total | 25044 | 19227 | 16806 | 12337 | 9416 | 2156 | 1437 | 1102 | 5261 | 87252 |
We define ClutterScore to gauge the scene’s hardness level using proxies that affect visual complexity:
ClutterScore = α·Hc + β·N + γ·Hs + δ·O
For implementation details and parameter settings, please refer to the paper or code.
Very Low Clutter
Low Clutter
Medium Clutter
High Clutter
The dataset is available in multiple formats:
@misc{inácio2026sortwastedenselyannotateddataset,
title={SortWaste: A Densely Annotated Dataset for Object Detection in Industrial Waste Sorting},
author={Sara Inácio and Hugo Proença and João C. Neves},
year={2026},
eprint={2601.02299},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2601.02299},
}