FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB0963, 193 aa 1>>>pF1KB0963 193 - 193 aa - 193 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4479+/-0.000364; mu= 15.6010+/- 0.023 mean_var=196.1994+/-46.524, 0's: 0 Z-trim(118.8): 259 B-trim: 1901 in 1/48 Lambda= 0.091564 statistics sampled from 31734 (32100) to 31734 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.72), E-opt: 0.2 (0.376), width: 16 Scan time: 5.850 The best scores are: opt bits E(85289) NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193) 1417 198.8 4.7e-51 NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193) 1417 198.8 4.7e-51 NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193) 1159 164.7 8.5e-41 NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193) 1159 164.7 8.5e-41 NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193) 1159 164.7 8.5e-41 NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187) 1101 157.0 1.7e-38 NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194) 977 140.6 1.5e-33 NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ ( 77) 275 47.2 7.6e-06 NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208) 262 46.2 4.1e-05 NP_001257766 (OMIM: 601183) cysteine-rich protein ( 282) 262 46.5 4.8e-05 NP_001257770 (OMIM: 601183) cysteine-rich protein ( 87) 255 44.7 5.1e-05 >>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa) initn: 1417 init1: 1417 opt: 1417 Z-score: 1037.6 bits: 198.8 E(85289): 4.7e-51 Smith-Waterman score: 1417; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GYGQGAGALVHAQ ::::::::::::: NP_001 GYGQGAGALVHAQ 190 >>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa) initn: 1417 init1: 1417 opt: 1417 Z-score: 1037.6 bits: 198.8 E(85289): 4.7e-51 Smith-Waterman score: 1417; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GYGQGAGALVHAQ ::::::::::::: NP_001 GYGQGAGALVHAQ 190 >>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa) initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41 Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF ::...::::::.::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GYGQGAGALVHAQ :.:::::::::.. NP_001 GFGQGAGALVHSE 190 >>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa) initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41 Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF ::...::::::.::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_004 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GYGQGAGALVHAQ :.:::::::::.. NP_004 GFGQGAGALVHSE 190 >>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa) initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41 Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF ::...::::::.::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GYGQGAGALVHAQ :.:::::::::.. NP_001 GFGQGAGALVHSE 190 >>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa) initn: 1139 init1: 757 opt: 1101 Z-score: 812.2 bits: 157.0 E(85289): 1.7e-38 Smith-Waterman score: 1101; 76.7% identity (88.6% similar) in 193 aa overlap (1-193:1-187) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF ::...:::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_001 RCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 190 pF1KB0 GYGQGAGALVHAQ :.:::::::::.. NP_001 GFGQGAGALVHSE 180 >>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa) initn: 989 init1: 523 opt: 977 Z-score: 723.5 bits: 140.6 E(85289): 1.5e-33 Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-183:1-184) 10 20 30 40 50 60 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS :: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. ::::: NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV 10 20 30 40 50 60 70 80 90 100 110 pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC :::..::::: ::::::: :. : ::.::.. .: .: : .:. : :::. :.: .::: NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB0 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG ::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: : NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG 130 140 150 160 170 180 180 190 pF1KB0 FGYGQGAGALVHAQ .:.: NP_003 IGFGGLTQQVEKKE 190 >>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa) initn: 274 init1: 201 opt: 275 Z-score: 225.9 bits: 47.2 E(85289): 7.6e-06 Smith-Waterman score: 275; 50.7% identity (68.0% similar) in 75 aa overlap (118-191:3-74) 90 100 110 120 130 140 pF1KB0 LGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCSRCGDSVYAAEKIIGAGKPWHKNCFRCA :: .:. :: ::.. . :: ::. :..: NP_001 MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE 10 20 30 150 160 170 180 190 pF1KB0 KCGKSLESTTLTEKEGEIYCKG-CYAKNFGPKGFGYGQGAGALVHAQ ::::.: : .:.::. ::. ::: ::::::: : :: : NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRG---GAESHTFK 40 50 60 70 >>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa) initn: 282 init1: 195 opt: 262 Z-score: 212.8 bits: 46.2 E(85289): 4.1e-05 Smith-Waterman score: 437; 35.9% identity (59.1% similar) in 198 aa overlap (8-191:3-198) 10 20 30 40 50 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYC-K .:: : .::: ::.:. :...:. :. : : :.: : :: . .: : NP_001 MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK 10 20 30 40 50 60 70 80 90 100 pF1KB0 SCYGKKYGPKGYGYGQGAGTLNMDRGERLGIK---PESVQPHR--------PTTNPNTSK ::. .:::: . : :::. ... : . : : : : .:. .. NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB0 FAQKYGGAEK-CSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYC . . : . : ::. .:: :::. . :: ::. :.:: .:::.: .:..:. :: NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC 120 130 140 150 160 170 170 180 190 pF1KB0 -KGCYAKNFGPKGFGYGQGAGALVHAQ : ::. ::::: . : ..:. .. NP_001 HKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP 180 190 200 >>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa) initn: 240 init1: 195 opt: 262 Z-score: 211.6 bits: 46.5 E(85289): 4.8e-05 Smith-Waterman score: 388; 34.9% identity (58.6% similar) in 186 aa overlap (20-191:89-272) 10 20 30 40 pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTV ::.:. :...:. :. : : :.: NP_001 EPSQDHHESQEHRGPLVGSQTCLVHQAEGTAEKVSSLGKDWHKFCLKCERCSKTLTPGGH 60 70 80 90 100 110 50 60 70 80 90 pF1KB0 AIHDEEIYC-KSCYGKKYGPKGYGYGQGAGTLNMDRGERLGIK---PESVQPHR------ : :: . .: : ::. .:::: . : :::. ... : . : : : NP_001 AEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKA 120 130 140 150 160 170 100 110 120 130 140 150 pF1KB0 --PTTNPNTSKFAQKYGGAEK-CSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLEST : .:. .. . . : . : ::. .:: :::. . :: ::. :.:: .:::.: NP_001 SGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPG 180 190 200 210 220 230 160 170 180 190 pF1KB0 TLTEKEGEIYC-KGCYAKNFGPKGFGYGQGAGALVHAQ .:..:. :: : ::. ::::: . : ..:. .. NP_001 GHAEHDGQPYCHKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP 240 250 260 270 280 193 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 17:31:06 2016 done: Sat Nov 5 17:31:07 2016 Total Scan time: 5.850 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]