Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences to the protein model implicit in the BLOCKS database). This treatment sharpens the protein sequence comparison, providing a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate if a compositionally adjusted matrix cou

Splitting the blosum score into numbers of biological significance.

FABRIS, FRANCESCO;SGARRO, ANDREA;TOSSI, ALESSANDRO
2007-01-01

Abstract

Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences to the protein model implicit in the BLOCKS database). This treatment sharpens the protein sequence comparison, providing a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate if a compositionally adjusted matrix cou
2007
http://www.google.it/#hl=it&sclient=psy-ab&q=Splitting+the+BLOSUM+score+into+numbers+of+biological+significance+hindawi&oq=Splitting+the+BLOSUM+score+into+numbers+of+biological+significance+hindawi&aq=f&aqi=&aql=&gs_sm=3&gs_upl=18605l20926l3l21340l8l8l0l0l0l0l94l658l8l8l0&gs_l=hp.3...18605l20926l3l21340l8l8l0l0l0l0l94l658l8l8l0&pbx=1&bav=on.2,or.r_gc.r_pw.r_qf.,cf.osb&fp=9a73a6f1ff0d1de9&biw=1280&bih=664
http://www.hindawi.com/journals/bsb/2007/031450/cta/
http://www.ncbi.nlm.nih.gov/pubmed/18369412
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/1702272
 Avviso

Registrazione in corso di verifica.
La registrazione di questo prodotto non è ancora stata validata in ArTS.

Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact