case.

Analog-to-digital conversion: Data is often still collected on paper forms before being moved to a digital format, and the transcription introduces errors.

Unstructured data: Data is often stored in dense PDFs or text files rather than in structured formats such as CSV.

Unreliable data: Data across different systems and data sets is often contradictory, inaccurate, or outdated.

Local language data: Data is often recorded in different languages, making it harder to match and analyze.

Scattered data: Rather than living in a central repository, data is scattered across disconnected, siloed systems.

Dirty data: Standard geographic conventions and metadata standards are often not followed.