FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0495, 207 aa
  1>>>pF1KE0495 207 - 207 aa - 207 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4275+/-0.000743; mu= 12.3122+/- 0.045
 mean_var=59.5421+/-11.876, 0's: 0 Z-trim(108.3): 16 B-trim: 0 in 0/51
 Lambda= 0.166212
 statistics sampled from 10141 (10155) to 10141 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.312), width: 16
 Scan time: 2.060

The best scores are:                                opt  bits E(32554)
CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 1418 348.0 2.3e-96
CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 1002 248.3 2.6e-66
CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2   ( 197)  749 187.6 4.4e-48
CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252)  566 143.8 9e-35
CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205)  512 130.8 5.9e-31
CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211)  499 127.7 5.3e-30
CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2    ( 174)  351  92.2 2.1e-19
CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2    ( 175)  323  85.4 2.3e-17
CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7  ( 182)  308  81.8 2.8e-16
CCDS34506.1 AIM1 gene_id:202|Hs108|chr6     (1723)  318  84.6 4.1e-16
CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3    ( 178)  303  80.6 6.4e-16

>>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 (196 aa)
initn: 1418 init1: 1418 opt: 1418 Z-score: 1843.7 bits: 348.0 E(32554): 2.3e-96
Smith-Waterman score: 1418; 100.0% identity (100.0% similar) in 196 aa overlap (12-207:1-196)

               10        20        30        40        50        60
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKV
                  :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13            MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKV
                          10        20        30        40

               70        80        90       100       110       120
pF1KE0 LSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTI
      50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE0 FEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHH
     110       120       130       140       150       160

              190       200
pF1KE0 SGDYKHFREWGSHAPTFQVQSIRRIQQ
       :::::::::::::::::::::::::::
CCDS13 SGDYKHFREWGSHAPTFQVQSIRRIQQ
     170       180       190

>>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa)
initn: 1002 init1: 1002 opt: 1002 Z-score: 1303.9 bits: 248.3 E(32554): 2.6e-66
Smith-Waterman score: 1002; 68.3% identity (89.9% similar) in 189 aa overlap (19-207:27-215)

10 20 30 40 50
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGF
: ::::....:...:::.: :::. ::.: : .:
CCDS11 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 ETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACAN
..:::::: ::::.:.::..: :::.:::::::: ::::.:..:: ::: :::: ::
CCDS11 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE0 HRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQ
:..:..::::.:::.:.. :.:::::::::::: .:::::....:::::: :.:::::.:
CCDS11 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
130 140 150 160 170 180

180 190 200
pF1KE0 YVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
:.::::::.:::::.::::::: : :.::::::::
CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
190 200 210

>>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 (197 aa)
initn: 874 init1: 664 opt: 749 Z-score: 976.6 bits: 187.6 E(32554): 4.4e-48
Smith-Waterman score: 749; 54.5% identity (81.3% similar) in 187 aa overlap (22-207:11-197)

10 20 30 40 50
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELG-FETVRSLK
: ....:::. ::::: .. ..: .: : : . :::.:
CCDS24 MSSAPAPGPAPASLTLWDEEDFQGRRCRLLSDCANVCERGGLPRVRSVK
10 20 30 40

60 70 80 90 100 110
pF1KE0 VLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLT
: .:.::.::. :::::.:::.:.:: :.::.:.... ...: ::::. :::: :::.:
CCDS24 VENGVWVAFEYPDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCANHNDSRVT
50 60 70 80 90 100

120 130 140 150 160 170
pF1KE0 IFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDH
.:: .:: : : .: :::::: .::: ...:::..: ::::: :.:::::.::::: :.
CCDS24 LFEGDNFQGCKFDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGYQYVLERDR
110 120 130 140 150 160

180 190 200
pF1KE0 HSGDYKHFREWGSHAPTFQVQSIRRIQQ
:::.. . : :..: : :.:::::.:.
CCDS24 HSGEFCTYGELGTQAHTGQLQSIRRVQH
170 180 190

>>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa)
initn: 493 init1: 279 opt: 566 Z-score: 737.7 bits: 143.8 E(32554): 9e-35
Smith-Waterman score: 585; 42.1% identity (73.2% similar) in 209 aa overlap (3-206:35-233)

10 20
pF1KE0 MFPGPISEGATMTLQCTKSA----GPWKMVVW
:: .:. . .:.: : ...::.
CCDS13 AKASASATVAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVF
10 20 30 40 50 60

30 40 50 60 70 80
pF1KE0 DEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSW
. ..::::: ::..:: .. . ::. :::. : .: ::.::...:.:...:::.:::: :
CCDS13 ELENFQGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRW
70 80 90 100 110 120

90 100 110 120 130 140
pF1KE0 DAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKGELS-DDYPSLQAMGWEG
..: ...: ..:: :::: . .. ....:: :: :. :.. :: ::: ..:. .
CCDS13 NTW--SSSYRSDRLMSFRPIK-MDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYGF-S
130 140 150 160 170 180

150 160 170 180 190 200
pF1KE0 NEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
..::: .: ::.:: :.:::::.::.:: ::..:. :::. : :.::.::..
CCDS13 DRVGSVKVSSGTWVGYQYPGYRGYQYLLE----PGDFRHWNEWGAFQP--QMQSLRRLRD
190 200 210 220 230

CCDS13 KQWHLEGSFPVLATEPPK
240 250

>>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa)
initn: 477 init1: 237 opt: 512 Z-score: 669.2 bits: 130.8 E(32554): 5.9e-31
Smith-Waterman score: 531; 41.6% identity (75.8% similar) in 190 aa overlap (18-206:13-191)

10 20 30 40 50 60
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKV
.: .: :........:::. ::... ::.. : : : . :. :
CCDS13 MASDHQTQAGKPQSLNP-KIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLV
10 20 30 40 50

70 80 90 100 110 120
pF1KE0 LSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTI
.: :::.:.:. .:.:...:.:::: ::.: ... .. :.:.:: .. .. .. .
CCDS13 QAGPWVGYEQANCKGEQFVFEKGEYPRWDSW--TSSRRTDSLSSLRPIK-VDSQEHKIIL
60 70 80 90 100 110

130 140 150 160 170
pF1KE0 FEQENFLGKKGEL-SDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDH
.:. :: ::: :. .:: ::..: :.. ..:.: .:.::.:: :.:::::.::.::
CCDS13 YENPNFTGKKMEIIDDDVPSFHAHGYQ-EKVSSVRVQSGTWVGYQYPGYRGLQYLLE---
120 130 140 150 160

180 190 200
pF1KE0 HSGDYKHFREWGSHAPTFQVQSIRRIQQ
.:::: ..: :: ::::.:::.
CCDS13 -KGDYKDSSDFG--APHPQVQSVRRIRDMQWHQRGAFHPSN
170 180 190 200

>>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa)
initn: 427 init1: 250 opt: 499 Z-score: 652.2 bits: 127.7 E(32554): 5.3e-30
Smith-Waterman score: 517; 41.7% identity (71.7% similar) in 187 aa overlap (21-206:22-198)

10 20 30 40 50
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLK
: .:..... ..:::.: :..:::::. . .: : :..
CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE0 VLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLT
: :: :..:: .:.:.:..::.:.:: :::: ... .. : :.:: . .:
CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAW--SNSRDSDSLLSLRPLN-IDSPHHKLH
70 80 90 100 110

120 130 140 150 160 170
pF1KE0 IFEQENFLGKKGEL-SDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECD
.::. : :.: :. .:: ::: : :.. ..:.: .. .:.:: .:::::: :::.:
CCDS13 LFENPAFSGRKMEIVDDDVPSLWAHGFQ-DRVASVRAINGTWVGYEFPGYRGRQYVFE--
120 130 140 150 160 170

180 190 200
pF1KE0 HHSGDYKHFREWGSHAPTFQVQSIRRIQQ
:.:.:. :: . : :.::.:::.
CCDS13 --RGEYRHWNEWDASQP--QLQSVRRIRDQKWHKRGRFPSS
180 190 200 210

>>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa)
initn: 329 init1: 141 opt: 351 Z-score: 461.7 bits: 92.2 E(32554): 2.1e-19
Smith-Waterman score: 376; 36.3% identity (65.9% similar) in 182 aa overlap (24-205:3-170)

10 20 30 40 50 60
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKV
:.. ... .:::: .: :..::. :. : :..:
CCDS23 MGKITFYEDRAFQGRSYETTTDCPN-LQPYFSRCNSIRV
10 20 30

70 80 90 100 110 120
pF1KE0 LSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTI
:: :. .:. ..:::::.:.:::::... : : . . : . : . ..:: : .
CCDS23 ESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLS--DSIRSCCLIPQT-VSHR---LRL
40 50 60 70 80 90

130 140 150 160 170 180
pF1KE0 FEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHH
.:.:. : :::.: ::.: .. .:. :.:: : :: ..:.::: ::.:.
CCDS23 YEREDHKGLMMELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQYLLR----
100 110 120 130 140

190 200
pF1KE0 SGDYKHFREWGSHAPTFQVQSIRRIQQ
.:.. ..:: : .. :.::.
CCDS23 PQEYRRCQDWG--AMDAKAGSLRRVVDLY
150 160 170

>>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa)
initn: 323 init1: 133 opt: 323 Z-score: 425.4 bits: 85.4 E(32554): 2.3e-17
Smith-Waterman score: 383; 34.6% identity (68.1% similar) in 182 aa overlap (24-205:3-171)

10 20 30 40 50 60
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKV
:.. ... .:::: .: :..::. :. : :..:
CCDS23 MGKITFYEDRAFQGRSYECTTDCPN-LQPYFSRCNSIRV
10 20 30

70 80 90 100 110 120
pF1KE0 LSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTI
:: :. .:. ..::.::.:.:::::... : : . . : . : . .: . :
CCDS23 ESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGLS--DSIRSCCLIPPHSGAYR---MKI
40 50 60 70 80 90

130 140 150 160 170 180
pF1KE0 FEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHH
...... :. .::.:: :.: .. .:. :..: :.:. ..:.::: ::.:.
CCDS23 YDRDELRGQMSELTDDCISVQDR-FHLTEIHSLNVLEGSWILYEMPNYRGRQYLLR----
100 110 120 130 140

190 200
pF1KE0 SGDYKHFREWGSHAPTFQVQSIRRIQQ
:.:..: .:: ::. .: :.::.
CCDS23 PGEYRRFLDWG--APNAKVGSLRRVMDLY
150 160 170

>>CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 (182 aa)
initn: 212 init1: 131 opt: 308 Z-score: 405.7 bits: 81.8 E(32554): 2.8e-16
Smith-Waterman score: 308; 32.8% identity (61.6% similar) in 177 aa overlap (24-197:7-177)

10 20 30 40 50
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGF-ETVRSLK
:..... : :.. : ..: . . :: . : :..
CCDS59 MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFMNRVNSIH
10 20 30 40

60 70 80 90 100 110
pF1KE0 VLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLT
: ::::: :.: :.:::.:::.:.::.. :.... ... : ::.. . . ::
CCDS59 VESGAWVCFNHPDFRGQQFILEHGDYPDFFRWNSHS----DHMGSCRPVG-MHGEHFRLE
50 60 70 80 90

120 130 140 150 160 170
pF1KE0 IFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHS--GAWVCSQFPGYRGFQYVLEC
::: :: :. :. .: : ::. :: : :....:.. .:: .: : . ::
CCDS59 IFEGCNFTGQCLEFLEDSPFLQSRGWVKNCVNTIKVYGDGAAWSPRSF-GAEDFQLSSSL
100 110 120 130 140 150

180 190 200
pF1KE0 DHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
. .: . . .. : :
CCDS59 QSDQGPEEATTKPATTQPPFLTANL
160 170 180

>>CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723 aa)
initn: 296 init1: 194 opt: 318 Z-score: 402.8 bits: 84.6 E(32554): 4.1e-16
Smith-Waterman score: 338; 34.8% identity (62.1% similar) in 198 aa overlap (24-205:1219-1403)

10 20 30 40 50
pF1KE0 MFPGPISEGATMTLQCTKSAGPWKMVVWDEDGFQGRRHEF-TAECPSVLELG-
:.::... :.:. :. :. : :.: :
CCDS34 LSFWDTEEAYIGSMRPLKMGGRKVEFPTDPKVVVYEKPFFEGKCVELETGMCSFVMEGGE
1190 1200 1210 1220 1230 1240

60 70 80 90 100
pF1KE0 -----------FETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAE
: .: :.::: : ::..:. :: :.::.::.::: .: :::: : .:
CCDS34 TEEATGDDHLPFTSVGSMKVLRGIWVAYEKPGFTGHQYLLEEGEYRDWKAWGG---YNGE
1250 1260 1270 1280 1290 1300

110 120 130 140 150
pF1KE0 -----RLTSFRPAACANHRDSRLTIFEQENFLGKKGELSDDY---PSLQAMGWEGNEVGSFHVHS
: :.:: .. .... .. ..:: :.:: : .:. :. : .. :..: :
CCDS34 -LQSLRPIL-GDFSNAHMIMYSEKNF-GSKGSSIDVLGIVANLKETGY-GVKTQSINVLS
1310 1320 1330 1340 1350 1360

160 170 180 190 200
pF1KE0 GAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
:.:: . : . : ::.:. .: : :..::.. . ...:.. :
CCDS34 GVWVAYENPDFTGEQYILD----KGFYTSFEDWGGK--NCKISSVQPICLDSFTGPRRRN
1370 1380 1390 1400 1410

CCDS34 QIHLFSEPQFQGHSQSFEETTSQIDDSFSTKSCRVSGGSWVVYDGENFTGNQYVLEEGHY
1420 1430 1440 1450 1460 1470

207 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 04:18:28 2016 done: Thu Nov 3 04:18:29 2016
 Total Scan time: 2.060 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]