FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB0961, 193 aa 1>>>pF1KB0961 193 - 193 aa - 193 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8639+/-0.000379; mu= 12.8178+/- 0.024 mean_var=175.0290+/-38.495, 0's: 0 Z-trim(118.4): 243 B-trim: 3163 in 2/52 Lambda= 0.096944 statistics sampled from 31038 (31413) to 31038 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.368), width: 16 Scan time: 4.050 The best scores are: opt bits E(85289) NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193) 1395 206.5 2.3e-53 NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193) 1395 206.5 2.3e-53 NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193) 1395 206.5 2.3e-53 NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187) 1336 198.2 6.7e-51 NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193) 1159 173.4 1.9e-43 NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193) 1159 173.4 1.9e-43 NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194) 968 146.7 2.1e-35 NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208) 293 52.4 5.9e-07 NP_001257766 (OMIM: 601183) cysteine-rich protein ( 282) 293 52.6 7e-07 NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ ( 77) 283 50.4 8.8e-07 NP_001257770 (OMIM: 601183) cysteine-rich protein ( 87) 267 48.2 4.4e-06 >>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa) initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53 Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GFGQGAGALVHSE ::::::::::::: NP_001 GFGQGAGALVHSE 190 >>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa) initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53 Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GFGQGAGALVHSE ::::::::::::: NP_004 GFGQGAGALVHSE 190 >>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa) initn: 1395 init1: 1395 opt: 1395 Z-score: 1079.2 bits: 206.5 E(85289): 2.3e-53 Smith-Waterman score: 1395; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GFGQGAGALVHSE ::::::::::::: NP_001 GFGQGAGALVHSE 190 >>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa) initn: 969 init1: 943 opt: 1336 Z-score: 1034.8 bits: 198.2 E(85289): 6.7e-51 Smith-Waterman score: 1336; 96.9% identity (96.9% similar) in 193 aa overlap (1-193:1-187) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF ::::::::::: ::::::::::::::::::::::::::::::::::::::::::: NP_001 RCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF 130 140 150 160 170 190 pF1KB0 GFGQGAGALVHSE ::::::::::::: NP_001 GFGQGAGALVHSE 180 >>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa) initn: 1496 init1: 1159 opt: 1159 Z-score: 900.8 bits: 173.4 E(85289): 1.9e-43 Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF ::...::::::.::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GFGQGAGALVHSE :.:::::::::.. NP_001 GYGQGAGALVHAQ 190 >>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa) initn: 1496 init1: 1159 opt: 1159 Z-score: 900.8 bits: 173.4 E(85289): 1.9e-43 Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS :: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: ::::::: NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP ::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.: NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF ::...::::::.::::: ::: :::::::::.::::::..:.:::::::::::::::::: NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF 130 140 150 160 170 180 190 pF1KB0 GFGQGAGALVHSE :.:::::::::.. NP_001 GYGQGAGALVHAQ 190 >>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa) initn: 970 init1: 536 opt: 968 Z-score: 756.4 bits: 146.7 E(85289): 2.1e-35 Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-183:1-184) 10 20 30 40 50 60 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.: ::::: NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV 10 20 30 40 50 60 70 80 90 100 110 pF1KB0 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC :::..::::: ::::::: :::: :: ::.. ...: : .:. : :::. :.: ::.: NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB0 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: : NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG 130 140 150 160 170 180 180 190 pF1KB0 FGFGQGAGALVHSE .::: NP_003 IGFGGLTQQVEKKE 190 >>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa) initn: 451 init1: 225 opt: 293 Z-score: 245.9 bits: 52.4 E(85289): 5.9e-07 Smith-Waterman score: 474; 40.6% identity (61.4% similar) in 197 aa overlap (9-191:4-198) 10 20 30 40 50 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYC-K :: :.:::::::.:. :...:: :. : :.:.: : : . .: : NP_001 MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK 10 20 30 40 50 60 70 80 90 100 pF1KB0 SCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE---EAPGHR--------PTTNPN-AS ::. .:::: . : :::. .: . : . :.:. : : .:. :: NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB0 KFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYC . . : . :::::. :: :::: . ::.::. :.:: .::: : :..::. :: NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC 120 130 140 150 160 170 170 180 190 pF1KB0 -KGCYAKNFGPKGFGFGQGAGALVHSE : ::. ::::: . : ..:. .. NP_001 HKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP 180 190 200 >>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa) initn: 270 init1: 225 opt: 293 Z-score: 244.6 bits: 52.6 E(85289): 7e-07 Smith-Waterman score: 423; 38.0% identity (59.0% similar) in 200 aa overlap (6-191:76-272) 10 20 30 pF1KB0 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCF :.. : : : ::.:. :...:: :. NP_001 AGCVCKGGGCCHREPSQDHHESQEHRGPLVGSQTCLVHQAE-GTAEKVSSLGKDWHKFCL 50 60 70 80 90 100 40 50 60 70 80 90 pF1KB0 LCMVCKKNLDSTTVAVHGEEIYC-KSCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE- : :.:.: : : . .: : ::. .:::: . : :::. .: . : . NP_001 KCERCSKTLTPGGHAEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTG 110 120 130 140 150 160 100 110 120 130 140 pF1KB0 --EAPGHR--------PTTNPN-ASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKA :.:. : : .:. ::. . : . :::::. :: :::: . ::.::. NP_001 PIEVPAARAEERKASGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRP 170 180 190 200 210 220 150 160 170 180 190 pF1KB0 CFRCAKCGKGLESTTLADKDGEIYC-KGCYAKNFGPKGFGFGQGAGALVHSE :.:: .::: : :..::. :: : ::. ::::: . : ..:. .. NP_001 CLRCERCGKTLTPGGHAEHDGQPYCHKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP 230 240 250 260 270 280 >>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa) initn: 298 init1: 209 opt: 283 Z-score: 242.8 bits: 50.4 E(85289): 8.8e-07 Smith-Waterman score: 283; 50.0% identity (71.1% similar) in 76 aa overlap (118-192:3-75) 90 100 110 120 130 140 pF1KB0 LGIKHEEAPGHRPTTNPNASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCA .::.:.. :: ::.: . ::.::. :..: NP_001 MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE 10 20 30 150 160 170 180 190 pF1KB0 KCGKGLESTTLADKDGEIYCKG-CYAKNFGPKGFGFGQGAGALVHSE :::: : : :...:. ::. ::: ::::::: : :: :. NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRG---GAESHTFK 40 50 60 70 193 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 17:29:50 2016 done: Sat Nov 5 17:29:51 2016 Total Scan time: 4.050 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]