Managed access project - Data and metadata review and export SOP

Once the wrangler has verified that the data and metadata files were transferred correctly to the secure storage area using hca-util are they can proceed with project submission creation and validation

Submission review and validation

Data and metadata in a managed access submission are considered sensitive information and must be kept encrypted. Unlike in the case of open access projects, review for a managed access project must be done without ever downloading the metadata spreadsheet - or the data - to a non-encrypted location, like a laptop.

  1. Once the metadata spreadsheet is uploaded to the secure AWS storage area a lambda function will automatically trigger the upload of the spreadsheet to the respective project in the HCA Data Repository Ingest Service.
  2. If the spreadsheet upload fails because HCA Data Repository Ingest Service cannot process the spreadsheet, the wrangler will get notified of the problem. Get in touch with the contributor and suggest to review their spreadsheet over a video call. The contributor should share their screen and walk through the spreadsheet.
  3. If the spreadsheet imports successfully in HCA Data Repository Ingest Service, review the submission for metadata errors and communicate with the contributor on how to fix them. Ask the contributor to upload the corrected spreadsheet to the secure storage area and repeat the activities from step 1 onwards.
  4. Sync the data files directly from the secure storage area provided to the contributor to the HCA Data Repository Ingest Service AWS staging area
    $ hca-util sync s3://org-hca-data-archive-upload-prod/<submission-uuid>
  5. Within HCA Data Repository Ingest Service, add ontology terms where necessary, for example for methods, species and developmental stage.
  6. Validate the experimental design with the graph validation step. If there are any errors, communicate with the contributor so that they can amend the metadata spreadsheet and upload the corrected spreadsheet to the secure storage area and repeat the activities from step 1 onwards.

Confirm Project data release

Once the Validation for the Project data submission is complete:

  1. Notify the data contributor that the Project data submission is complete
  2. Confirm that they’re happy to proceed to publishing the Project under Managed access
  3. If the contributor wishes to hold the Project private for a period of time, confirm the desired release date and put a reminder in the calendar for that date
  4. If the Project is to be published with the first available release, give an estimate of when they can expect to see the project live on the HCA Data Portal.

Export

Within the timeframe agreed for the data release:

  1. Export the project by clicking Export on Project in HCA Data Repository Ingest service backoffice and select ‘Export files and metadata’
  2. Once the export is complete, fill out the usual Data release import form.

Verify

Once the project is live on the HCA Data Portal:

  1. Verify that the project is displayed correctly
  2. Send a link to the data contributor so that they may reference it in their publications.