Heterogeneous data sources present a significant barrier to rapid, accurate analysis during disease outbreaks. another data transformation language (adtl) streamlines the process of converting raw epidemiological and clinical data into clean, schema-compliant formats ready for downstream processing. Using simple JSON or TOML specification files, it provides a robust and reproducible transformation engine used both within the Global.health ecosystem and by our partners. Its companion module, AutoParser, accelerates onboarding of new data sources by using AI to semi-automate the creation of new specification files from data descriptions and target schemas. Together, they offer a complete, open-source solution for scalable data harmonisation.
Currently in development, launching early 2021.