Coding regions (CDS) can be on the plus strand or the minus (reverse complement) strand of a genomic sequence. Nucleotide BLAST (blastn) can help you to determine the correct coding strand by using the CDS feature display on the BLAST search results page. See the article on blastn and CDS feature set up.
To determine the correct coding strand:
These four combinations can help you determine the coding strand for your sequence (Query):
Plus/Plus BLAST alignment + CDS plus strand in Subject record --> Query is the coding (plus) strand
Plus/Minus BLAST alignment + CDS minus strand in Subject record --> Query is the coding (plus) strand
Plus/Plus BLAST alignment + CDS minus strand in Subject record --> CDS is on the minus strand
Plus/Minus BLAST alignment + CDS plus strand in Subject record --> CDS is on the minus strand
See Figures 1 and 2 for an illustrated example. The sequence in the example represents the reverse complement of the coding strand.
Figure 1: A pairwise BLAST alignment of a 250 bp Query sequence to the MF398235.1 (Subject) sequence. Query and Subject represent the same strand. BLAST reports it as Plus/Plus Strand (purple rectangle). The GenBank link in the Range row (yellow rectangle) displays the aligned region of the Subject record. The record (Figure 2) shows CDS on the minus (reverse complement) strand. Since the Query and Subject have the same strands, the Query also represents the reverse complement of the coding strand. Check the alignment to see these clues for the minus strand:
-Positions for the amino-acid residues on the Query (blue boxes) drop while those for nucleotides rise.
-The codons read in the opposite direction. For example, the “CAT” triplet (red oval) represents the reverse complement of the methionine codon (M), “ATG”.
-The tilde symbols (~~~~) mark an intron. Its “AC” ending bases (orange oval) represent the reverse complement of “GT”. “GT” indicates the 5’ splice site (the beginning rather than the end) of an intron.
Figure 2: The part of the MF398235.1 (Subject) sequence that aligns with the 250-bp Query sequence from Figure 1. The record shows CDS on the minus (reverse complement) strand (yellow box).