FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5053, 780 aa
  1>>>pF1KB5053 780 - 780 aa - 780 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6481+/-0.000993; mu= 18.4132+/- 0.060
 mean_var=67.5270+/-13.665, 0's: 0 Z-trim(104.5): 39 B-trim: 315 in 2/49
 Lambda= 0.156076
 statistics sampled from 7914 (7934) to 7914 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.244), width: 16
 Scan time: 3.620

The best scores are:                                      opt   bits E(32554)
CCDS14017.1 ACO2 gene_id:50|Hs108|chr22            ( 780) 5255 1192.7       0
CCDS6525.1 ACO1 gene_id:48|Hs108|chr9              ( 889)  352   88.8 4.3e-17
CCDS10302.1 IREB2 gene_id:3658|Hs108|chr15         ( 963)  325   82.7 3.1e-15

>>CCDS14017.1 ACO2 gene_id:50|Hs108|chr22 (780 aa)
 initn: 5255 init1: 5255 opt: 5255 Z-score: 6387.7 bits: 1192.7 E(32554): 0
Smith-Waterman score: 5255; 100.0% identity (100.0% similar) in 780 aa overlap (1-780:1-780)

               10        20        30        40        50        60
pF1KB5 MAPYSLLVTRLQKALGVRQYHVASVLCQRAKVAMSHFEPNEYIHYDLLEKNINIVRKRLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAPYSLLVTRLQKALGVRQYHVASVLCQRAKVAMSHFEPNEYIHYDLLEKNINIVRKRLN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 RPLTLSEKIVYGHLDDPASQEIERGKSYLRLRPDRVAMQDATAQMAMLQFISSGLSKVAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RPLTLSEKIVYGHLDDPASQEIERGKSYLRLRPDRVAMQDATAQMAMLQFISSGLSKVAV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 PSTIHCDHLIEAQVGGEKDLRRAKDINQEVYNFLATAGAKYGVGFWKPGSGIIHQIILEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PSTIHCDHLIEAQVGGEKDLRRAKDINQEVYNFLATAGAKYGVGFWKPGSGIIHQIILEN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 YAYPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMAGIPWELKCPKVIGVKLTGSLSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 YAYPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMAGIPWELKCPKVIGVKLTGSLSG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 WSSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICNMGAEIGATTSVFPYN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 WSSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICNMGAEIGATTSVFPYN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 HRMKKYLSKTGREDIANLADEFKDHLVPDPGCHYDQLIEINLSELKPHINGPFTPDLAHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 HRMKKYLSKTGREDIANLADEFKDHLVPDPGCHYDQLIEINLSELKPHINGPFTPDLAHP
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 VAEVGKVAEKEGWPLDIRVGLIGSCTNSSYEDMGRSAAVAKQALAHGLKCKSQFTITPGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VAEVGKVAEKEGWPLDIRVGLIGSCTNSSYEDMGRSAAVAKQALAHGLKCKSQFTITPGS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 EQIRATIERDGYAQILRDLGGIVLANACGPCIGQWDRKDIKKGEKNTIVTSYNRNFTGRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EQIRATIERDGYAQILRDLGGIVLANACGPCIGQWDRKDIKKGEKNTIVTSYNRNFTGRN
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB5 DANPETHAFVTSPEIVTALAIAGTLKFNPETDYLTGTDGKKFRLEAPDADELPKGEFDPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DANPETHAFVTSPEIVTALAIAGTLKFNPETDYLTGTDGKKFRLEAPDADELPKGEFDPG
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB5 QDTYQHPPKDSSGQHVDVSPTSQRLQLLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 QDTYQHPPKDSSGQHVDVSPTSQRLQLLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAA
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB5 GPWLKFRGHLDNISNNLLIGAINIENGKANSVRNAVTQEFGPVPDTARYYKKHGIRWVVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GPWLKFRGHLDNISNNLLIGAINIENGKANSVRNAVTQEFGPVPDTARYYKKHGIRWVVI
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KB5 GDENYGEGSSREHAALEPRHLGGRAIITKSFARIHETNLKKQGLLPLTFADPADYNKIHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14
GDENYGEGSSREHAALEPRHLGGRAIITKSFARIHETNLKKQGLLPLTFADPADYNKIHP 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB5 VDKLTIQGLKDFTPGKPLKCIIKHPNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 VDKLTIQGLKDFTPGKPLKCIIKHPNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ 730 740 750 760 770 780 >>CCDS6525.1 ACO1 gene_id:48|Hs108|chr9 (889 aa) initn: 689 init1: 303 opt: 352 Z-score: 420.3 bits: 88.8 E(32554): 4.3e-17 Smith-Waterman score: 737; 28.5% identity (54.4% similar) in 744 aa overlap (91-709:78-818) 70 80 90 100 110 pF1KB5 RPLTLSEKIVYGHLDDPASQEIERGKSYLRLRPDRVAMQDATAQMAMLQF--ISSGLSKV ..: :: .:: :. :...: . ....:. CCDS65 RNCDEFLVKKQDIENILHWNVTQHKNIEVPFKPARVILQDFTGVPAVVDFAAMRDAVKKL 50 60 70 80 90 100 120 130 140 150 160 pF1KB5 A---------VPSTIHCDHLIEAQVGGEKD-LRRAKDI----NQEVYNFLATAG-AKYGV . :. . :: :... . . : :.. .:. :.: ..:: .. : ... CCDS65 GGDPEKINPVCPADLVIDHSIQVDFNRRADSLQKNQDLEFERNRERFEFLKWGSQAFHNM 110 120 130 140 150 160 170 180 190 200 210 pF1KB5 GFWKPGSGIIHQIILE----------NYAYPGVLLIGTDSHTPNGGGLGGICIGVGGADA . ::::::::. :: .: :: : .:::::: ::: . :::: .: CCDS65 RIIPPGSGIIHQVNLEYLARVVFDQDGYYYPDSL-VGTDSHTTMIDGLGILGWGVGGIEA 170 180 190 200 210 220 220 230 240 250 260 270 pF1KB5 VDVMAGIPWELKCPKVIGVKLTGSLSGWSSPKDVILKVAGILTVKGGTGAIVEYHGPGVD :: : : . :.::: .: :. . :..: .. : : .: .::. :::: CCDS65 EAVMLGQPISMVLPQVIGYRLMGKPHPLVTSTDIVLTITKHLRQVGVVGKFVEFFGPGVA 230 240 250 260 270 280 280 290 300 310 320 pF1KB5 SISCTGMATICNMGAEIGATTSVFPYNHRMKKYLSKTGRED--------IANLADEFKDH ..: . ::: :: : :::.. :: .. :: .:::.. . . :.: CCDS65 QLSIADRATIANMCPEYGATAAFFPVDEVSITYLVQTGRDEEKLKYIKKYLQAVGMFRDF 290 300 310 320 330 340 330 340 350 360 370 pF1KB5 LVPDPGCHYDQLIEINLSELKPHINGPFTPDLAHPVAEVGK-----VAEKEGW------P :. . :..:..:. . : .:: :. :... : .. :.:. : CCDS65 NDPSQDPDFTQVVELDLKTVVPCCSGPKRPQDKVAVSDMKKDFESCLGAKQGFKGFQVAP 350 360 370 380 390 400 380 390 400 410 pF1KB5 ----------LD----------IRVGLIGSCTNSSYEDMGRSAAV-AKQALAHGLKCKS- : . .. : ::::.: .. .:.. ::.:. ::. 
CCDS65 EHHNDHKTFIYDNTEFTLAHGSVVIAAITSCTNTSNPSVMLGAGLLAKKAVDAGLNVMPY 410 420 430 440 450 460 420 430 440 450 460 pF1KB5 -QFTITPGSEQIRATIERDGYAQILRDLGGIVLANACGPCIGQWDR------KDIKKGEK . ...::: . ....: : .:: :.. .: :::. . : .:. CCDS65 IKTSLSPGSGVVTYYLQESGVMPYLSQLGFDVVGYGCMTCIGNSGPLPEPVVEAITQGDL 470 480 490 500 510 520 470 480 490 500 510 520 pF1KB5 NTI-VTSYNRNFTGRNDANPETHA-FVTSPEIVTALAIAGTLKFNPETDYL-TGTDGKKF .. : : :::: :: ..:.:.: ...:: .: : :::::.... : . : ... :.. CCDS65 VAVGVLSGNRNFEGR--VHPNTRANYLASPPLVIAYAIAGTIRIDFEKEPLGVNAKGQQV 530 540 550 560 570 580 530 540 550 560 pF1KB5 RLEA--PDADELPKGEFD---PG--QDTYQHPPKDSSGQHVDVSPT-------SQRLQLL :. : ::. : . :: ...::. . . .. ..:. :. . CCDS65 FLKDIWPTRDEIQAVERQYVIPGMFKEVYQKIETVNESWNALATPSDKLFFWNSKSTYIK 590 600 610 620 630 640 570 580 590 600 pF1KB5 EP-------FDKWDGKDLEDLQILIKVKGKCTTDHISAAG------PWLKF---RG---- : .: :.. : .:... . :::::: :: : .. :: CCDS65 SPPFFENLTLDLQPPKSIVDAYVLLNLGDSVTTDHISPAGNIARNSPAARYLTNRGLTPR 650 660 670 680 690 700 610 620 630 640 650 pF1KB5 HLDNISNNLLIGAI-------NIE------NGKANSVRNAVTQEFGPVPDTARYYKKHGI .... .. :. ::. : .: .. . . :. : :.:. :.. :. CCDS65 EFNSYGSRRGNDAVMARGTFANIRLLNRFLNKQAPQTIHLPSGEILDVFDAAERYQQAGL 710 720 730 740 750 760 660 670 680 690 700 710 pF1KB5 RWVVIGDENYGEGSSREHAALEPRHLGGRAIITKSFARIHETNLKKQGLLPLTFADPADY .:.. ..:: ::::. :: : :: .:....:. :::..:: .:..:: . CCDS65 PLIVLAGKEYGAGSSRDWAAKGPFLLGIKAVLAESYERIHRSNLVGMGVIPLEYLPGENA 770 780 790 800 810 820 720 730 740 750 760 770 pF1KB5 NKIHPVDKLTIQGLKDFTPGKPLKCIIKHPNGTQETILLNHTFNETQIEWFRAGSALNRM CCDS65 DALGLTGQERYTIIIPENLKPQMKVQVKLDTGKTFQAVMRFDTDVELTYFLNGGILNYMI 830 840 850 860 870 880 >>CCDS10302.1 IREB2 gene_id:3658|Hs108|chr15 (963 aa) initn: 571 init1: 283 opt: 325 Z-score: 386.9 bits: 82.7 E(32554): 3.1e-15 Smith-Waterman score: 598; 28.3% identity (52.1% similar) in 674 aa overlap (147-709:225-893) 120 130 140 150 160 170 pF1KB5 KVAVPSTIHCDHLIEAQVGGEKDLRRAKDINQEVYNFLATAGAKY-GVGFWKPGSGIIHQ :.: .:. .. . .:. ::.:. 
:: CCDS10 ENTPILCPFHLQPVPEPETVLKNQEVEFGRNRERLQFFKWSSRVFKNVAVIPPGTGMAHQ 200 210 220 230 240 250 180 190 200 210 220 pF1KB5 IILE----------NYAYPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMAGIPWELK : :: . .: . .::::: .::: . :::: .. :: :.: : CCDS10 INLEYLSRVVFEEKDLLFPDSV-VGTDSHITMVNGLGILGWGVGGIETEAVMLGLPVSLT 260 270 280 290 300 310 230 240 250 260 270 280 pF1KB5 CPKVIGVKLTGSLSGWSSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICN :.:.: .:::: . . . ::.: .. : : .: .::. : ::...: . .:: : CCDS10 LPEVVGCELTGSSNPFVTSIDVVLGITKHLRQVGVAGKFVEFFGSGVSQLSIVDRTTIAN 320 330 340 350 360 370 290 300 310 320 330 pF1KB5 MGAEIGATTSVFPYNHRMKKYLSKTG--------REDIANLADEFKDHLVPDPGCHYDQL : : :: : :: .. :.: .:: : . . :.. . .:.:. CCDS10 MCPEYGAILSFFPVDNVTLKHLEHTGFSKAKLESMETYLKAVKLFRNDQNSSGEPEYSQV 380 390 400 410 420 430 340 350 360 370 pF1KB5 IEINLSELKPHINGPFTP-----------DLAHPVAE-VG-----KVAEK---------E :.:::. . : ..:: : :. . : :: .::: : CCDS10 IQINLNSIVPSVSGPKRPQDRVAVTDMKSDFQACLNEKVGFKGFQIAAEKQKDIVSIHYE 440 450 460 470 480 490 380 390 400 410 420 pF1KB5 G--WPLD---IRVGLIGSCTNSSYEDMGRSAAV-AKQALAHGLKCKSQF--TITPGSEQI : . :. . .. . ::::. .. .:.. ::.:. ::. : . ...::: .. CCDS10 GSEYKLSHGSVVIAAVISCTNNCNPSVMLAAGLLAKKAVEAGLRVKPYIRTSLSPGSGMV 500 510 520 530 540 550 430 440 450 460 470 pF1KB5 RATIERDGYAQILRDLGGIVLANACGPCIGQW-DRKD-----IKKGEKNTI-VTSYNRNF . .: : :: ... .:. :.:. .: .:.:. : . : :.:: CCDS10 THYLSSSGVLPYLSKLGFEIVGYGCSICVGNTAPLSDAVLNAVKQGDLVTCGILSGNKNF 560 570 580 590 600 610 480 490 500 510 520 530 pF1KB5 TGRNDANPETHAFVTSPEIVTALAIAGTLKFNPETDYLTGTD--GKKFRLEA--PDADEL :: . ...:: .:.: :::::.... .:. : ::: ::.. :. :. .:. CCDS10 EGRL-CDCVRANYLASPPLVVAYAIAGTVNIDFQTEPL-GTDPTGKNIYLHDIWPSREEV 620 630 640 650 660 670 540 550 560 570 pF1KB5 PKGE--------FDPGQDTYQHPPK--------DSSGQHVDVSPTSQRLQLLEPFDKWDG . : : .: . : :: :.. : : . ::: CCDS10 HRVEEEHVILSMFKALKDKIEMGNKRWNSLEAPDSVLFPWDLKSTYIRCPSF--FDKLTK 680 690 700 710 720 580 590 600 610 620 pF1KB5 KDL-----EDLQILIKVKGKCTTDHISAAGPWLKFRGHLDNISNNLL----IGAINIENG . . :. ..:. . . :::::: :: . . 
..: : ... . . : CCDS10 EPIALQAIENAHVLLYLGDSVTTDHISPAGSIARNSAAAKYLTNRGLTPREFNSYGARRG 730 740 750 760 770 780 630 640 650 660 pF1KB5 K-ANSVRN--AVTQEFG-----PVPDT--------------ARYYKKHGIRWVVIGDENY . : .:. : . :. :.: : :. :.:.:: .... ..: CCDS10 NDAVMTRGTFANIKLFNKFIGKPAPKTIHFPSGQTLDVFEAAELYQKEGIPLIILAGKKY 790 800 810 820 830 840 670 680 690 700 710 720 pF1KB5 GEGSSREHAALEPRHLGGRAIITKSFARIHETNLKKQGLLPLTFADPADYNKIHPVDKLT : :.::. :: : :: .:....:. .::. .: :. :: : CCDS10 GSGNSRDWAAKGPYLLGVKAVLAESYEKIHKDHLIGIGIAPLQFLPGENADSLGLSGRET 850 860 870 880 890 900 730 740 750 760 770 780 pF1KB5 IQGLKDFTPGKPLKCIIKHPNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ CCDS10 FSLTFPEELSPGITLNIQTSTGKVFSVIASFEDDVEITLYKHGGLLNFVARKFS 910 920 930 940 950 960

780 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 15:50:09 2016 done: Thu Nov 3 15:50:10 2016
 Total Scan time: 3.620 Total Display time: 0.060

Function used was FASTA [36.3.4 Apr, 2011]