FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB0965, 194 aa
  1>>>pF1KB0965 194 - 194 aa - 194 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.7356+/-0.000396; mu= 8.0269+/- 0.024
 mean_var=210.3202+/-48.841, 0's: 0 Z-trim(118.4): 177 B-trim: 1908 in 2/48
 Lambda= 0.088437
 statistics sampled from 31010 (31317) to 31010 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.721), E-opt: 0.2 (0.367), width: 16
 Scan time: 5.090

The best scores are:                                       opt bits E(85289)
NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194) 1403 190.9 1.1e-48
NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193)  977 136.6 2.5e-32
NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193)  977 136.6 2.5e-32
NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193)  968 135.4 5.4e-32
NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193)  968 135.4 5.4e-32
NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193)  968 135.4 5.4e-32
NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187)  917 128.9 4.9e-30
NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208)  282  47.9 1.3e-05
NP_001257766 (OMIM: 601183) cysteine-rich protein  ( 282)  282  48.1 1.5e-05
NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ (  77)  265  45.2 3.2e-05
NP_001257770 (OMIM: 601183) cysteine-rich protein  (  87)  263  45.0 4.1e-05

>>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa)
 initn: 1403 init1: 1403 opt: 1403 Z-score: 995.2 bits: 190.9 E(85289): 1.1e-48
Smith-Waterman score: 1403; 100.0% identity (100.0% similar) in 194 aa overlap (1-194:1-194)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
              130       140       150       160       170       180

              190
pF1KB0 IGFGGLTQQVEKKE
       ::::::::::::::
NP_003 IGFGGLTQQVEKKE
              190

>>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa)
 initn: 965 init1: 523 opt: 977 Z-score: 701.5 bits: 136.6 E(85289): 2.5e-32
Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-184:1-183)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       :: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. :::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :. : ::.::..  .: .: : .:. : :::. :.: .:::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
        ::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: :
NP_001 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:.:
NP_001 FGYGQGAGALVHAQ
     180       190

>>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa)
 initn: 965 init1: 523 opt: 977 Z-score: 701.5 bits: 136.6 E(85289): 2.5e-32
Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-184:1-183)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       :: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. :::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :. : ::.::..  .: .: : .:. : :::. :.: .:::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
        ::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: :
NP_001 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:.:
NP_001 FGYGQGAGALVHAQ
     180       190

>>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa)
 initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_004 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_004 FGFGQGAGALVHSE
     180       190

>>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa)
 initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
     180       190

>>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa)
 initn: 970 init1: 536 opt: 968 Z-score: 695.3 bits: 135.4 E(85289): 5.4e-32
Smith-Waterman score: 968; 69.0% identity (86.4% similar) in 184 aa overlap (1-184:1-183)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...:::::::.:.:: :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130       140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
     180       190

>>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa)
 initn: 811 init1: 536 opt: 917 Z-score: 660.3 bits: 128.9 E(85289): 4.9e-30
Smith-Waterman score: 917; 66.8% identity (83.2% similar) in 184 aa overlap (1-184:1-177)

               10        20        30        40        50        60
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
       ::::::: :::.:.:::: :::.::.: ::::.:: ::.:.: :::::::.:  :::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
       :::..::::: ::::::: :::: :: ::.. ...:   : .:. : :::. :.: ::.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGH-RPTTNPNASKFAQKIGGSERC
               70        80        90        100       110

              130       140       150       160       170       180
pF1KB0 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
       :::...::::::       :::.::::: :::.::::...:::::.::: :::::::: :
NP_001 PRCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKG
     120       130             140       150       160       170

              190
pF1KB0 IGFGGLTQQVEKKE
       .:::
NP_001 FGFGQGAGALVHSE
           180

>>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa)
 initn: 465 init1: 220 opt: 282 Z-score: 221.9 bits: 47.9 E(85289): 1.3e-05
Smith-Waterman score: 462; 37.5% identity (60.5% similar) in 200 aa overlap (8-194:3-201)

               10        20        30        40        50
pF1KB0 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYC-K
              .::  :.:::: ::...  :...:: :..:  : :.:     : :... .: :
NP_001      MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK
                    10        20        30        40        50

      60        70        80        90           100
pF1KB0 VCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQ---QSPKP-ARSVTTSNPSK------
        ::.  .::::.. : :::    .     : :     . :   :.   .:.: :
NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS
          60        70         80        90       100       110

      110       120       130       140       150       160
pF1KB0 -FTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYC
         :.  :: . ::::.:.:: :::: . :: ::. :.::  :::.:   . ...::. ::
NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC
          120       130       140       150       160       170

       170       180       190
pF1KB0 -KVCYAKNFGPTGIGFGGLTQQVEKKE
        : ::.  ::: :.. :.. . .  ..
NP_001 HKPCYGILFGPKGVNTGAVGSYIYDRDPEGKVQP
          180       190       200

>>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa)
 initn: 320 init1: 220 opt: 282 Z-score: 220.5 bits: 48.1 E(85289): 1.5e-05
Smith-Waterman score: 407; 36.2% identity (59.6% similar) in 188 aa overlap (20-194:89-275)

                          10        20        30        40
pF1KB0            MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTV
                                     ::...  :...:: :..:  : :.:
NP_001 EPSQDHHESQEHRGPLVGSQTCLVHQAEGTAEKVSSLGKDWHKFCLKCERCSKTLTPGGH
       60        70        80        90       100       110

      50         60        70        80        90           100
pF1KB0 AAHESEIYC-KVCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQ---QSPKP-ARSVTT
       : :... .: : ::.  .::::.. : :::    .     : :     . :   :.   .
NP_001 AEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKA
      120       130       140        150       160       170

                 110       120       130       140       150
pF1KB0 SNPSK-------FTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLEST
       :.: :        :.  :: . ::::.:.:: :::: . :: ::. :.::  :::.:
NP_001 SGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPG
       180       190       200       210       220       230

       160        170       180       190
pF1KB0 NVTDKDGELYC-KVCYAKNFGPTGIGFGGLTQQVEKKE
       . ...::. :: : ::.  ::: :.. :.. . .  ..
NP_001 GHAEHDGQPYCHKPCYGILFGPKGVNTGAVGSYIYDRDPEGKVQP
       240       250       260       270       280

>>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa)
 initn: 265 init1: 202 opt: 265 Z-score: 214.8 bits: 45.2 E(85289): 3.2e-05
Smith-Waterman score: 265; 46.7% identity (72.0% similar) in 75 aa overlap (119-192:3-77)

       90       100       110       120       130       140
pF1KB0 GLQFQQSPKPARSVTTSNPSKFTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCA
                                     :::.:.: :: ::.: . :: ::. :..:
NP_001                             MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE
                                           10        20        30

      150       160        170       180       190
pF1KB0 ICGKSLESTNVTDKDGELYCK-VCYAKNFGPTGIGFGGLTQQVEKKE
        :::.: : . ....:. ::.  :::  ::: :.: ::  ... :
NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRGGAESHTFK
             40        50        60        70


194 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 17:32:10 2016 done: Sat Nov 5 17:32:11 2016
 Total Scan time: 5.090 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]
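
The rows under "The best scores are:" carry one hit each: a description that starts with the accession, the library sequence length in parentheses, then the opt score, the bit score and the E-value. A minimal sketch of pulling those fields out of one row is given below. It assumes the row shape seen in this report; `Hit`, `HIT_ROW` and `parse_hit` are hypothetical names introduced for illustration and are not part of FASTA.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    accession: str
    description: str
    length: int      # library sequence length, from "( 194)"
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E(85289) in this report


# Assumed row shape: "<accession> <description> (<length>) <opt> <bits> <E>".
# The description may itself contain parentheses, e.g. "(OMIM: 601871)",
# so the length group only accepts digits between the brackets.
HIT_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+\.\d+)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return the parsed hit, or None if the line is not a hit row."""
    m = HIT_ROW.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m.group("acc"),
        description=m.group("desc"),
        length=int(m.group("length")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=float(m.group("evalue")),
    )


row = "NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208) 282 47.9 1.3e-05"
hit = parse_hit(row)
```

Because the pattern tolerates any run of whitespace between fields, it should accept rows whether or not the original column padding has survived. Lines that are not hit rows, such as the table header, yield `None`.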