Error Checking Process

NACC Data Validation Pipeline

Submitting a form data CSV file to center's Flywheel ingest project will trigger the NACC data validation pipeline in Flywheel.

For the centers who are using REDCap direct entry, the records that are marked as "ready for Flywheel upload" will be transferred to the center's Flywheel ingest project each night by NACC, and the same validation pipeline will be triggered.

The stages of the validation pipeline are

CSV format check:
Check whether the submitted CSV has correct headers and datatypes. The CSV should include only NACC accepted variables for the module, entire file will be rejected if there are extra fields.
NACCID lookup:
For each record in CSV file, look up the NACCID using the adcid and ptid for that record. Record will not be processed further if there's no matching NACCID.
Generate JSON file:
Create a JSON file per CSV record and store in Flywheel ingest project as aquisition data. A Flywheel hierarchy (Subject/Session/Aquisition) will be created for the file if it doesn't already exist.
Extract metadata:
Extract information from the JSON files and store as Flywheel metadata, these will be used for search, dataviews, reports, etc.
NACC QC checks:
Run NACC data quality checks according to NACC published error checks.

Each stage of the validation pipeline will trigger a job in Flywheel. Check the job completion status in the Jobs Log. Flywheel jobs log

Error Report

Once all the jobs in the pipeline are complete, you will be able to view the error report in the Flywheel Issue Manager interface. Flywheel Issue Manager

The Issue Manager UI will display all the errors and alerts that are currently unresolved in the project. It has a tab for each stage of the data validation pipeline. If there are no errors occurred in a certain stage, that tab will be empty. Issue Manager - CSV validation