Académique Documents
Professionnel Documents
Culture Documents
(PSSM)
April, 2017
What is PSSM?
Column1 : fA , 1 = 0, fC , 1 = 1, fG , 1 = 1, fT , 1 = 1
Column2 : fA , 1 = 2, fC , 1 = 1, fG , 1 = 0, fT , 1 = 0
Pseudo-counts
0 fi,j + pseudocount
fi,j =
N + sum of pseudocounts
ACTCAGCCCCAGCGGAGGTGAAGGACGTCCTTCCCCAGGAGCC
The sequence score for the current window is calculated by adding
the values at each position in PSSM
Score = 1.454
The score is less than 0, so it is more likely to be a random site
than functional.
Conclusion
Advantages
I Good for short, conserved regions
I Relatively fast and simple to implement
I Produce match score that can be interpreted based on
statistical theory.
Limitations
I insertion and deletion forbidden
I Relatively long sequence methods can therefore not be
described with this method
When to use?
I To model small regions with high-variability but constant
length.