Details
-
Type: Improvement
-
Status: Closed (View Workflow)
-
Priority: Minor
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:0.5
-
Sprint:Fall 2018 Sprint 3
Description
Find real soft clip data for testing purposes. The ideal dataset would have the following:
-paired-end data
-data with known structural variants
-preferably aligned in BAM format
One user who uses soft clipping in their research suggested the following links:
http://jimb.stanford.edu/giab-resources/
https://toolbox.google.com/datasetsearch
Attachments
Issue Links
- relates to
-
IGBF-1291 Show soft-clipped bases in BAM files
- Closed
Activity
Anh Moss (Inactive)
created issue -
Anh Moss (Inactive)
made changes -
Anh Moss (Inactive)
made changes -
Status | Open [ 1 ] | In Progress [ 3 ] |
Anh Moss (Inactive)
made changes -
Summary | Real soft clip data | Find real soft clip data |
Description |
Look for real world data for soft clipping.
-pair-end would be great -with known structural variants -aligned as a BAM file, preferably |
Find real soft clip data for testing purposes. The ideal dataset would have the following:
-paired-end data -data with known structural variants -preferably aligned in BAM format One user who uses soft clipping in their research suggested the following links: ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/NA12878/NIST_NA12878_HG001_HiSeq_300x/NHGRI_Illumina300X_novoalign_bams/ ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/ |
Anh Moss (Inactive)
made changes -
Description |
Find real soft clip data for testing purposes. The ideal dataset would have the following:
-paired-end data -data with known structural variants -preferably aligned in BAM format One user who uses soft clipping in their research suggested the following links: ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/NA12878/NIST_NA12878_HG001_HiSeq_300x/NHGRI_Illumina300X_novoalign_bams/ ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/ |
Find real soft clip data for testing purposes. The ideal dataset would have the following:
-paired-end data -data with known structural variants -preferably aligned in BAM format One user who uses soft clipping in their research suggested the following links: http://jimb.stanford.edu/giab-resources/ https://toolbox.google.com/datasetsearch |
Ann Loraine
made changes -
Assignee | Anh Moss [ aduong ] | Ann Loraine [ aloraine ] |
Ann Loraine
made changes -
Assignee | Ann Loraine [ aloraine ] |
Ann Loraine
made changes -
Status | In Progress [ 3 ] | Open [ 1 ] |
Ann Loraine
made changes -
Status | Open [ 1 ] | In Progress [ 3 ] |
Ann Loraine
made changes -
Assignee | Ann Loraine [ aloraine ] |
Ann Loraine
made changes -
Attachment | SoftClipRegion.png [ 14162 ] |
Ann Loraine
made changes -
Resolution | Done [ 10000 ] | |
Status | In Progress [ 3 ] | Closed [ 6 ] |
Ann Loraine
made changes -
Summary | Find real soft clip data | Find example data sets where soft clipping reveals polymorphisms |
Ann Loraine
made changes -
Assignee | Ann Loraine [ aloraine ] | Anh Moss [ aduong ] |
Ann Loraine
made changes -
Workflow | Loraine Lab Workflow [ 18128 ] | Fall 2019 Workflow Update [ 19954 ] |
Ann Loraine
made changes -
Workflow | Fall 2019 Workflow Update [ 19954 ] | Revised Fall 2019 Workflow Update [ 22075 ] |
Here is a link to the following dataset:
https://figshare.com/articles/BAM_and_BAI_files/4530707/1
Data posted by: Reetta Holmila: Massively parallel deep sequencing of plasma cfDNA methylation in HCC and controls (categories: biomarkers)
This page lists both the BAM and BAI files.