Peerannot: classification for crowdsourced image datasets with Python

Published: 14 Feb 2024, Last Modified: 14 Feb 2024Accepted by ComputoEveryoneRevisionsBibTeX
Abstract: Crowdsourcing is a quick and easy way to collect labels for large datasets, involving many workers. However, workers often disagree with each other. Sources of error can arise from the workers’ skills, but also from the intrinsic difficulty of the task. We present peerannot: a Python library for managing and learning from crowdsourced labels for classification. Our library allows users to aggregate labels from common noise models or train a deep learning-based classifier directly from crowdsourced labels. In addition, we provide an identification module to easily explore the task difficulty of datasets and worker capabilities.
Code: https://github.com/tanglef/computo_2023
Assigned Action Editor: ~Marie-Pierre_Etienne1
URL: https://github.com/tanglef/computo_2023
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Submission Number: 2
Loading