FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0965, 194 aa
1>>>pF1KB0965 194 - 194 aa - 194 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.7356+/-0.000396; mu= 8.0269+/- 0.024
mean_var=210.3202+/-48.841, 0's: 0 Z-trim(118.4): 177 B-trim: 1908 in 2/48
Lambda= 0.088437
statistics sampled from 31010 (31317) to 31010 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.721), E-opt: 0.2 (0.367), width: 16
Scan time: 5.090
The best scores are:                                      opt bits E(85289)
NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194) 1403 190.9 1.1e-48
NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193)  977 136.6 2.5e-32
NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193)  977 136.6 2.5e-32
NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193)  968 135.4 5.4e-32
NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193)  968 135.4 5.4e-32
NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193)  968 135.4 5.4e-32
NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187)  917 128.9 4.9e-30
NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208)  282  47.9 1.3e-05
NP_001257766 (OMIM: 601183) cysteine-rich protein  ( 282)  282  48.1 1.5e-05
NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ (  77)  265  45.2 3.2e-05
NP_001257770 (OMIM: 601183) cysteine-rich protein  (  87)  263  45.0 4.1e-05
>>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa)
initn: 1403 init1: 1403 opt: 1403 Z-score: 995.2 bits: 190.9 E(85289): 1.1e-48
Smith-Waterman score: 1403; 100.0% identity (100.0% similar) in 194 aa overlap (1-194:1-194)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
              130       140       150       160       170       180
              190
pF1KB0 IGFGGLTQQVEKKE
       ::::::::::::::
NP_003 IGFGGLTQQVEKKE
              190
>>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa)
initn: 965 init1: 523 opt: 977 Z-score: 701.5 bits: 136.6 E(85289): 2.5e-32
Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-184:1-183)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       :: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. :::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :. : ::.::..  .: .: : .:. : :::. :.: .:::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
        ::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: :
NP_001 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:.:
NP_001 FGYGQGAGALVHAQ
     180       190
>>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa)
initn: 965 init1: 523 opt: 977 Z-score: 701.5 bits: 136.6 E(85289): 2.5e-32
Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-184:1-183)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       :: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. :::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :. : ::.::..  .: .: : .:. : :::. :.: .:::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
        ::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: :
NP_001 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:.:
NP_001 FGYGQGAGALVHAQ
     180       190
>>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa)
initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_004 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_004 FGFGQGAGALVHSE
     180       190
>>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
     180       190
>>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
     180       190
>>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa)
initn: 811 init1: 536 opt: 917 Z-score: 660.3 bits: 128.9 E(85289): 4.9e-30
Smith-Waterman score: 917; 66.8% identity (83.2% similar) in 184 aa overlap (1-184:1-177)
               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110
              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...::::::       :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130             140       150       160       170
              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
           180
>>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa)
initn: 465 init1: 220 opt: 282 Z-score: 221.9 bits: 47.9 E(85289): 1.3e-05
Smith-Waterman score: 462; 37.5% identity (60.5% similar) in 200 aa overlap (8-194:3-201)
               10        20        30        40        50
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYC-K
              .::  :.:::: ::...  :...:: :..:  : :.:     : :... .: :
NP_001      MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK
                    10        20        30        40        50
      60        70        80        90           100
pF1KB0 VCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQ---QSPKP-ARSVTTSNPSK------
        ::.  .::::.. : :::    .     : :     . :   :.   .:.: :
NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS
          60        70         80        90       100       110
      110       120       130       140       150       160
pF1KB0 -FTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYC
         :.  :: . ::::.:.:: :::: . :: ::. :.::  :::.:   . ...::. ::
NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC
          120       130       140       150       160       170
       170       180       190
pF1KB0 -KVCYAKNFGPTGIGFGGLTQQVEKKE
        : ::.  ::: :.. :.. . .  ..
NP_001 HKPCYGILFGPKGVNTGAVGSYIYDRDPEGKVQP
          180       190       200
>>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa)
initn: 320 init1: 220 opt: 282 Z-score: 220.5 bits: 48.1 E(85289): 1.5e-05
Smith-Waterman score: 407; 36.2% identity (59.6% similar) in 188 aa overlap (20-194:89-275)
                          10        20        30        40
pF1KB0            MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTV
                                     ::...  :...:: :..:  : :.:
NP_001 EPSQDHHESQEHRGPLVGSQTCLVHQAEGTAEKVSSLGKDWHKFCLKCERCSKTLTPGGH
       60        70        80        90       100       110
      50         60        70        80        90           100
pF1KB0 AAHESEIYC-KVCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQ---QSPKP-ARSVTT
       : :... .: : ::.  .::::.. : :::    .     : :     . :   :.   .
NP_001 AEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKA
      120       130       140        150       160       170
                 110       120       130       140       150
pF1KB0 SNPSK-------FTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLEST
       :.: :        :.  :: . ::::.:.:: :::: . :: ::. :.::  :::.:
NP_001 SGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPG
       180       190       200       210       220       230
       160        170       180       190
pF1KB0 NVTDKDGELYC-KVCYAKNFGPTGIGFGGLTQQVEKKE
       . ...::. :: : ::.  ::: :.. :.. . .  ..
NP_001 GHAEHDGQPYCHKPCYGILFGPKGVNTGAVGSYIYDRDPEGKVQP
       240       250       260       270       280
>>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa)
initn: 265 init1: 202 opt: 265 Z-score: 214.8 bits: 45.2 E(85289): 3.2e-05
Smith-Waterman score: 265; 46.7% identity (72.0% similar) in 75 aa overlap (119-192:3-77)
       90       100       110       120       130       140
pF1KB0 GLQFQQSPKPARSVTTSNPSKFTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCA
                                     :::.:.: :: ::.: . :: ::. :..:
NP_001                             MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE
                                           10        20        30
      150       160        170       180       190
pF1KB0 ICGKSLESTNVTDKDGELYCK-VCYAKNFGPTGIGFGGLTQQVEKKE
        :::.: : . ....:. ::.  :::  ::: :.: ::  ... :
NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRGGAESHTFK
             40        50        60        70
194 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 17:32:10 2016 done: Sat Nov 5 17:32:11 2016
Total Scan time: 5.090 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]