Shine-Dalgarno from single protein ID

Annotation: Input a protein accession and receive the upstream genomic sequence

StepAnnotation
Step 1: NCBI ELink
Protein
Neighbor: Fetch a set of UIDs in DB linked to input UIDs in DBFROM
Nucleotide
Direct Entry
Empty.
Input NCBI Protein Accession
Step 2: XPath
Output dataset 'default' from step 1
//Id/text()
Step 3: Select last
1
Output dataset 'output' from step 2
Step 4: NCBI EFetch
Nuccore
GenBank with Contig Sequences (text)
File containing IDs (one per line)
Output dataset 'out_file1' from step 3
4d4c37fdab732a93bbab2f748d4ba63d9309
Step 5: Collapse Collection
Output dataset 'output1' from step 4
False
False
Step 6: Split Genbank On Qualifier
Output dataset 'output' from step 5
protein_id
Not available.
0
0
Enter the same protein accession here as used in step 1
Step 7: (CPT) Genbank to GFF3:
Output dataset 'output' from step 5
True
True
protein_id
Step 8: (CPT) Genbank to GFF3:
Output dataset 'output' from step 6
True
True
protein_id
Step 9: Rebase GFF feature tree
Output dataset 'default' from step 8
CDS
gene mRNA
Step 10: Shine Find
From History
Output dataset 'fastaOut' from step 7
Output dataset 'output' from step 9
3
17
True
True