v2.4.0 | Report Errata
docs development docs development

Dataset Documentation Cards are the consolidated artefacts that present the information in a structured, reviewable format for each dataset in the system’s lifecycle. They follow the Datasheets for Datasets framework, extended with the EU AI Act-specific requirements.

Each card covers the seven standard sections (motivation, composition, collection process, preprocessing, uses, distribution, maintenance) along with the additional sections required for AISDP compliance: GDPR lawful basis, protected characteristic distributions, representativeness assessment, known limitations contextualised to the intended purpose, and versioning metadata.

The cards are treated as living artefacts. Version bumps to the underlying dataset trigger corresponding updates to the card. Tools such as OpenMetadata, DataHub, or a Markdown file co-located with the dataset in the versioning system provide version-controlled documentation that evolves alongside the data. The AI System Assessor verifies that the cards are current at each phase gate and during conformity assessment.

Documentation depth is proportionate to the dataset’s role: training datasets warrant comprehensive cards; static reference datasets warrant lighter treatment. The proportionality rationale is itself documented.

Key outputs

  • Dataset Documentation Card per dataset
  • Proportionality rationale document
  • Version tracking for card updates
On This Page