We are proud to offer the Sama-Coco dataset, a relabelling of the Coco-2017 dataset by our own in-house Sama associates (here’s more information about our people!). We invite the Machine Learning (ML) community to use it for anything you would like to do – all free of charge and ungated.
This is part of our ongoing effort to redefine data quality for the modern age, and to contribute to the wider research and development efforts of the ML community. Here are the ungated links to the two datasets (both covered by the Creative Commons license) so that you can get started right away.
Here’s a quick overview of the two datasets’ most important characteristics:
{{tables}}
(10 most frequent classes)
Some key features should be highlighted:
Here, we cover two images that are illustrative of some of the differences between Sama-Coco and Coco-2017.
In this first example, Coco labellers largely treated this as one singular crowd, whereas in Sama-Coco, each person was individually labeled.
This second example shows how most annotations were carried out with an acute level of precision. Coco’s motorcycle annotation is rather coarse, whereas Sama-Coco’s is more fine-grained.
We revisited all 123 287 images pre-loaded with annotations from the Coco-2017 dataset with up to 500 associates performing three key tasks. They had to:
Load Sama-Coco directly from the FiftyOne app. Explore all 123,287 images directly within FiftyOne and compare them side by side with the original MS Coco dataset.
import
fiftyone
as
fo
import
fiftyone.zoo
as
foz
dataset = foz.load_zoo_dataset("sama-coco"
, splits=
"validation"
, label_types=
"segmentations"
, include_id=
True
)
coco_val_dataset = foz.load_zoo_dataset("coco-2017"
, split=
"validation"
, label_types=
"segmentations"
, include_id=
True
)
dataset.rename_sample_field("segmentations"
,
"sama_segmentations"
)
coco_val_dataset.rename_sample_field("segmentations"
,
"coco_segmentations"
)
dataset.merge_samples(coco_val_dataset, key_field="coco_id"
)
session = fo.launch_app(dataset)
To set up simply:
We’d love to hear from you about your experience with Sama-Coco! Please contact sama-coco@sama.com with your feedback. Thanks!