ANSWERS TO HOMEWORK 3

  1. The result for searching the pattern "p1=TATA p2=15...100 p3=ATG" (which is the same pattern as "TATA 15...100 ATG") from the human unmasked genomic sequence is here.

  2. The result for searching the same pattern from the masked human genomic sequence is here. In the masked genomic sequence, all the repeated nucleotides were replaced with "N". If the sequences with the pattern were inside these repeated regions, they would not be recognized by the PatScan program. So, there were fewer numbers of sequences with the same pattern in the genomic sequence masked by RepeatMasker.