FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0961, 193 aa
1>>>pF1KB0961 193 - 193 aa - 193 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8639+/-0.000379; mu= 12.8178+/- 0.024
mean_var=175.0290+/-38.495, 0's: 0 Z-trim(118.4): 243 B-trim: 3163 in 2/52
Lambda= 0.096944
statistics sampled from 31038 (31413) to 31038 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.368), width: 16
Scan time: 4.050
The best scores are: opt bits E(85289)
NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193) 1395 206.5 2.3e-53
NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193) 1395 206.5 2.3e-53
NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193) 1395 206.5 2.3e-53
NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187) 1336 198.2 6.7e-51
NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193) 1159 173.4 1.9e-43
NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193) 1159 173.4 1.9e-43
NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194) 968 146.7 2.1e-35
NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208) 293 52.4 5.9e-07
NP_001257766 (OMIM: 601183) cysteine-rich protein ( 282) 293 52.6 7e-07
NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ ( 77) 283 50.4 8.8e-07
NP_001257770 (OMIM: 601183) cysteine-rich protein ( 87) 267 48.2 4.4e-06
>>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53
Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GFGQGAGALVHSE
:::::::::::::
NP_001 GFGQGAGALVHSE
190
>>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa)
initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53
Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GFGQGAGALVHSE
:::::::::::::
NP_004 GFGQGAGALVHSE
190
>>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53
Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GFGQGAGALVHSE
:::::::::::::
NP_001 GFGQGAGALVHSE
190
>>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa)
initn: 969 init1: 943 opt: 1336 Z-score: 1034.8 bits: 198.2 E(85289): 6.7e-51
Smith-Waterman score: 1336; 96.9% identity (96.9% similar) in 193 aa overlap (1-193:1-187)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::::::::::: :::::::::::::::::::::::::::::::::::::::::::
NP_001 RCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170
190
pF1KB0 GFGQGAGALVHSE
:::::::::::::
NP_001 GFGQGAGALVHSE
180
>>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa)
initn: 1496 init1: 1159 opt: 1159 Z-score: 900.8 bits: 173.4 E(85289): 1.9e-43
Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::...::::::.::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GFGQGAGALVHSE
:.:::::::::..
NP_001 GYGQGAGALVHAQ
190
>>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa)
initn: 1496 init1: 1159 opt: 1159 Z-score: 900.8 bits: 173.4 E(85289): 1.9e-43
Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
::...::::::.::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GFGQGAGALVHSE
:.:::::::::..
NP_001 GYGQGAGALVHAQ
190
>>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa)
initn: 970 init1: 536 opt: 968 Z-score: 756.4 bits: 146.7 E(85289): 2.1e-35
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-183:1-184)
10 20 30 40 50 60
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.: :::::
NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
10 20 30 40 50 60
70 80 90 100 110
pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
:::..::::: ::::::: :::: :: ::.. ...: : .:. : :::. :.: ::.:
NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB0 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
:::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
130 140 150 160 170 180
180 190
pF1KB0 FGFGQGAGALVHSE
.:::
NP_003 IGFGGLTQQVEKKE
190
>>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa)
initn: 451 init1: 225 opt: 293 Z-score: 245.9 bits: 52.4 E(85289): 5.9e-07
Smith-Waterman score: 474; 40.6% identity (61.4% similar) in 197 aa overlap (9-191:4-198)
10 20 30 40 50
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYC-K
:: :.:::::::.:. :...:: :. : :.:.: : : . .: :
NP_001 MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK
10 20 30 40 50
60 70 80 90 100
pF1KB0 SCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE---EAPGHR--------PTTNPN-AS
::. .:::: . : :::. .: . : . :.:. : : .:. ::
NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB0 KFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYC
. . : . :::::. :: :::: . ::.::. :.:: .::: : :..::. ::
NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC
120 130 140 150 160 170
170 180 190
pF1KB0 -KGCYAKNFGPKGFGFGQGAGALVHSE
: ::. ::::: . : ..:. ..
NP_001 HKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP
180 190 200
>>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa)
initn: 270 init1: 225 opt: 293 Z-score: 244.6 bits: 52.6 E(85289): 7e-07
Smith-Waterman score: 423; 38.0% identity (59.0% similar) in 200 aa overlap (6-191:76-272)
10 20 30
pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCF
:.. : : : ::.:. :...:: :.
NP_001 AGCVCKGGGCCHREPSQDHHESQEHRGPLVGSQTCLVHQAE-GTAEKVSSLGKDWHKFCL
50 60 70 80 90 100
40 50 60 70 80 90
pF1KB0 LCMVCKKNLDSTTVAVHGEEIYC-KSCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE-
: :.:.: : : . .: : ::. .:::: . : :::. .: . : .
NP_001 KCERCSKTLTPGGHAEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTG
110 120 130 140 150 160
100 110 120 130 140
pF1KB0 --EAPGHR--------PTTNPN-ASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKA
:.:. : : .:. ::. . : . :::::. :: :::: . ::.::.
NP_001 PIEVPAARAEERKASGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRP
170 180 190 200 210 220
150 160 170 180 190
pF1KB0 CFRCAKCGKGLESTTLADKDGEIYC-KGCYAKNFGPKGFGFGQGAGALVHSE
:.:: .::: : :..::. :: : ::. ::::: . : ..:. ..
NP_001 CLRCERCGKTLTPGGHAEHDGQPYCHKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP
230 240 250 260 270 280
>>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa)
initn: 298 init1: 209 opt: 283 Z-score: 242.8 bits: 50.4 E(85289): 8.8e-07
Smith-Waterman score: 283; 50.0% identity (71.1% similar) in 76 aa overlap (118-192:3-75)
90 100 110 120 130 140
pF1KB0 LGIKHEEAPGHRPTTNPNASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCA
.::.:.. :: ::.: . ::.::. :..:
NP_001 MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE
10 20 30
150 160 170 180 190
pF1KB0 KCGKGLESTTLADKDGEIYCKG-CYAKNFGPKGFGFGQGAGALVHSE
:::: : : :...:. ::. ::: ::::::: : :: :.
NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRG---GAESHTFK
40 50 60 70
193 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 17:29:50 2016 done: Sat Nov 5 17:29:51 2016
Total Scan time: 4.050 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]