Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate CleanVcf into GATK #733

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from
Draft

Migrate CleanVcf into GATK #733

wants to merge 5 commits into from

Conversation

kjaisingh
Copy link
Collaborator

@kjaisingh kjaisingh commented Oct 17, 2024

Description

This PR is intended to migrate the pipeline to leverage several new tools built in GATK to complete the CleanVcf workflow, as detailed in broadinstitute/gatk#8996. This is intended to introduce several enhancements over the existing implementation, including but not limited to:

  • Introduce various unit and integration tests into the workflow.
  • Create more robust and generalizable tools that can be used independent of CleanVcf.
  • Improve runtime and execution speed by leveraging Java.

Testing

  • This Terra job shows an example run of the pipeline prior to this change.
  • This Terra job shows an example run of the pipeline with this change. You can compare the output VCF files to see the lack of difference between the two.
  • Validated all WDLs with womtool.

Pre-Merge Changes Required

Remove automated Dockstore image sync for development branch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant