Personally identifiable information has been found in DataComp CommonPool, one of the largest open-source data sets used to train image generation models. Millions of images of passports, credit cards ...
This page includes examples of completed secondary data reviews that can serve as helpful illustrations for UNHCR staff and partners about to engage in similar exercises. Please note that these ...