To do:
BL MOTIF 1 width=11 seqs=13
iYIL003W ( 153) TTTACCCGGCC 1
iYDL086W ( 599) TTTACCCGGCC 1
iYEL055C ( 125) TTTACCCGGCC 1
iYDR498C ( 122) TTTACCCGGAC 1
iYBR179C ( 182) TTTACCCGGAC 1
iYBR229C ( 104) TTTACCCGGAC 1
iYNR011C ( 98) GTTACCCGGAC 1
itF(GAA)N ( 145) TTTACCCGGAA 1
iYLR458W ( 379) TTTACCCGGAA 1
iYBR035C ( 127) TTTACCCGGCG 1
iYGL152C ( 9) GTTACCCGGAA 1
iYFL006W ( 117) ATTACCCGGCA 1
iYGR093W ( 66) TTTACCCGGTT 1

and paste it into a text file. Opening the text file in Excel and splitting on spaces should produce a column like this:
TTTACCCGGCC
TTTACCCGGCC
TTTACCCGGCC
TTTACCCGGAC
TTTACCCGGAC
TTTACCCGGAC
GTTACCCGGAC
TTTACCCGGAA
TTTACCCGGAA
TTTACCCGGCG
GTTACCCGGAA
ATTACCCGGCA
TTTACCCGGTT
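If you would rather not go through Excel, the same column can be pulled out with a few lines of Python. This is only a sketch: the function name extract_sites is made up for illustration, and it assumes every site line ends with "(start) SEQUENCE weight" as in the block above. A regular expression is used instead of a plain split on spaces because one sequence name, itF(GAA)N, itself contains parentheses.

```python
import re

# The motif block from above, pasted as plain text.
BLOCK = """BL MOTIF 1 width=11 seqs=13
iYIL003W ( 153) TTTACCCGGCC 1
iYDL086W ( 599) TTTACCCGGCC 1
iYEL055C ( 125) TTTACCCGGCC 1
iYDR498C ( 122) TTTACCCGGAC 1
iYBR179C ( 182) TTTACCCGGAC 1
iYBR229C ( 104) TTTACCCGGAC 1
iYNR011C ( 98) GTTACCCGGAC 1
itF(GAA)N ( 145) TTTACCCGGAA 1
iYLR458W ( 379) TTTACCCGGAA 1
iYBR035C ( 127) TTTACCCGGCG 1
iYGL152C ( 9) GTTACCCGGAA 1
iYFL006W ( 117) ATTACCCGGCA 1
iYGR093W ( 66) TTTACCCGGTT 1
"""

# Assumed layout of a site line: name, "( start)", site sequence, weight.
SITE_LINE = re.compile(r"\(\s*(\d+)\)\s+([ACGT]+)\s+\d+\s*$")


def extract_sites(block_text):
    """Return the site sequences from a motif block, in file order."""
    sites = []
    for line in block_text.splitlines():
        match = SITE_LINE.search(line)
        if match:  # the "BL MOTIF ..." header line does not match
            sites.append(match.group(2))
    return sites


if __name__ == "__main__":
    # Print one site per line, like the Excel column.
    print("\n".join(extract_sites(BLOCK)))
```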
>PHO4_TRANSFAC
1 2 1 4
3 2 2 1
2 3 3 0
0 8 0 0
8 0 0 0
0 8 0 0
0 0 8 0
0 0 0 8
0 0 5 3
0 2 4 2
1 0 5 2
2 2 2 2
NT   A   C   G   T   consensus
01   1   2   1   4   N
02   3   2   2   1   N
03   2   3   3   0   V
04   0   8   0   0   C
05   8   0   0   0   A
06   0   8   0   0   C
07   0   0   8   0   G
08   0   0   0   8   T
09   0   0   5   3   K
10   0   2   4   2   B
11   1   0   5   2   G
12   2   2   2   2   N
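To see where the consensus column comes from, it can be recomputed from the counts. The rules in the sketch below are an assumption, since this exercise does not state them; they were chosen because they reproduce every consensus letter in the table. A single base is used if it makes up more than half of the counts and is at least twice as frequent as the runner-up. Otherwise a two-base code is used if the two most frequent bases together exceed 75%. Otherwise a three-base code is used if exactly one base never occurs. Anything else is N. The names IUPAC, consensus_letter and PHO4_COUNTS are illustrative only.

```python
# IUPAC ambiguity codes, keyed by the set of bases each code stands for.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AC"): "M", frozenset("AG"): "R", frozenset("AT"): "W",
    frozenset("CG"): "S", frozenset("CT"): "Y", frozenset("GT"): "K",
    frozenset("ACG"): "V", frozenset("ACT"): "H",
    frozenset("AGT"): "D", frozenset("CGT"): "B",
}

# Count matrix from the table above; each row is (A, C, G, T).
PHO4_COUNTS = [
    (1, 2, 1, 4), (3, 2, 2, 1), (2, 3, 3, 0), (0, 8, 0, 0),
    (8, 0, 0, 0), (0, 8, 0, 0), (0, 0, 8, 0), (0, 0, 0, 8),
    (0, 0, 5, 3), (0, 2, 4, 2), (1, 0, 5, 2), (2, 2, 2, 2),
]


def consensus_letter(counts):
    """Consensus code for one matrix row, using the assumed rules."""
    total = sum(counts)
    ranked = sorted(zip(counts, "ACGT"), reverse=True)  # highest count first
    (n1, b1), (n2, b2) = ranked[0], ranked[1]
    # Rule 1: one clearly dominant base.
    if n1 > 0.5 * total and n1 >= 2 * n2:
        return b1
    # Rule 2: the two most frequent bases cover more than 75%.
    if n1 + n2 > 0.75 * total:
        return IUPAC[frozenset(b1 + b2)]
    # Rule 3: exactly one base never occurs.
    present = frozenset(base for n, base in ranked if n > 0)
    if len(present) == 3:
        return IUPAC[present]
    return "N"


def consensus(matrix):
    """Consensus string for a whole count matrix."""
    return "".join(consensus_letter(row) for row in matrix)


if __name__ == "__main__":
    print(consensus(PHO4_COUNTS))
```

Compare positions 09 and 11: both have G at 5 out of 8, but the runner-up T is 3 at position 09 and 2 at position 11, so under these rules only position 11 is reduced to a plain G.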
NN[ACG]CACGT[GT][CGT]GN for PHO4, reading down the matrix), you can also use the tools from exercise 1.
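A pattern written this way is nearly a regular expression already. The bracketed groups work as they are, and only N has to be turned into "any base". As a rough sketch of what a pattern search does (not a replacement for the tools from exercise 1), the Python below scans both strands of a sequence. The function names and the short test sequence are invented for illustration and are not real yeast data.

```python
import re

PATTERN = "NN[ACG]CACGT[GT][CGT]GN"  # PHO4 pattern from the text


def to_regex(pattern):
    """Turn the consensus pattern into a regex: N becomes any base."""
    return pattern.replace("N", "[ACGT]")


def revcomp(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(seq, pattern=PATTERN):
    """Return (strand, start, match) for every hit on either strand.

    Start positions are 0-based. For '-' hits they refer to the
    reverse-complemented sequence, not to the original one.
    """
    # The lookahead lets overlapping matches be reported as well.
    rx = re.compile("(?=(" + to_regex(pattern) + "))")
    hits = []
    for strand, s in (("+", seq), ("-", revcomp(seq))):
        for m in rx.finditer(s):
            hits.append((strand, m.start(), m.group(1)))
    return hits


if __name__ == "__main__":
    demo = "TTGCACGTGGGTAAAA"  # made-up sequence, for illustration only
    for hit in find_sites(demo):
        print(hit)
```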
To search the PHO4-bound sequences (from step 5) with this matrix