Managed access project - Data and metadata review and export SOP
Once the wrangler has verified that the data and metadata files were transferred correctly to the secure storage area using hca-util are they can proceed with project submission creation and validation
Submission review and validation
Data and metadata in a managed access submission are considered sensitive information and must be kept encrypted. Unlike in the case of open access projects, review for a managed access project must be done without ever downloading the metadata spreadsheet - or the data - to a non-encrypted location, like a laptop.
- Once the metadata spreadsheet is uploaded to the secure AWS storage area a lambda function will automatically notify the wrangler.
- The wrangler can review and edit the spreadsheet in a secure space through aws WorkSpace - see here for set-up instructions
- The wrangler triggers the upload of the spreadsheet to its respective project in the HCA Data Repository Ingest Service. If the spreadsheet upload fails because HCA Data Repository Ingest Service cannot process the spreadsheet, the wrangler will get notified of the problem.
- If the spreadsheet imports successfully in HCA Data Repository Ingest Service, review the submission for metadata errors and communicate with the contributor on how to fix them. Ask the contributor to upload the corrected spreadsheet to the secure storage area and repeat the activities from step 1 onwards.
- Sync the data files directly from the secure storage area provided to the contributor to the HCA Data Repository Ingest Service AWS staging area
$ hca-util sync s3://org-hca-data-archive-upload-prod/<submission-uuid>
- Remove the metadata spreadsheet from the data section so the project can validate, and from the
s3://org-hca-data-archive-upload-prod/<uuid>
so it’s not exported with the project - Within HCA Data Repository Ingest Service, add ontology terms where necessary, for example for methods, species and developmental stage.
- Validate the experimental design with the graph validation step. If there are any errors, communicate with the contributor so that they can amend the metadata spreadsheet and upload the corrected spreadsheet to the secure storage area and repeat the activities from step 1 onwards.
Confirm Project data release
Once the Validation for the Project data submission is complete:
- Notify the data contributor that the Project data submission is complete
- Confirm that they’re happy to proceed to publishing the Project under Managed access
- If the contributor wishes to hold the Project private for a period of time, confirm the desired release date and put a reminder in the calendar for that date
- If the Project is to be published with the first available release, give an estimate of when they can expect to see the project live on the HCA Data Portal.
Export
Within the timeframe agreed for the data release:
- Export the project by clicking Export on Project in HCA Data Repository Ingest service backoffice and select ‘Export files and metadata’
- Once the export is complete, fill out the usual Data release import form.
Verify
Once the project is live on the HCA Data Portal:
- Verify that the project is displayed correctly
- Send a link to the data contributor so that they may reference it in their publications.