FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8381, 502 aa
  1>>>pF1KB8381 502 - 502 aa - 502 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.5043+/-0.000324; mu= 8.8077+/- 0.020
 mean_var=136.5611+/-28.021, 0's: 0 Z-trim(119.4): 28 B-trim: 812 in 1/56
 Lambda= 0.109752
 statistics sampled from 33374 (33402) to 33374 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.731), E-opt: 0.2 (0.392), width: 16
 Scan time: 10.250

The best scores are:                                       opt  bits E(85289)
NP_005644 (OMIM: 189889) alpha-globin transcriptio ( 502) 3395 548.9 1.2e-155
NP_001166923 (OMIM: 189889) alpha-globin transcrip ( 501) 3378 546.2 8.1e-155
NP_001121632 (OMIM: 609784) upstream-binding prote ( 504) 2517 409.9 8.9e-114
NP_055368 (OMIM: 609785) transcription factor CP2- ( 479) 2414 393.6 6.9e-109
XP_016859391 (OMIM: 609785) PREDICTED: transcripti ( 406) 2046 335.3 2.1e-91
NP_001166924 (OMIM: 189889) alpha-globin transcrip ( 450) 1782 293.5 8.7e-79
NP_001121633 (OMIM: 609784) upstream-binding prote ( 540) 1674 276.4 1.4e-73
NP_055332 (OMIM: 609784) upstream-binding protein  ( 540) 1674 276.4 1.4e-73
XP_016859393 (OMIM: 609785) PREDICTED: transcripti ( 273) 1420 236.0 1e-61
XP_016859394 (OMIM: 609785) PREDICTED: transcripti ( 273) 1408 234.1 3.9e-61
XP_016859392 (OMIM: 609785) PREDICTED: transcripti ( 274) 1011 171.3 3.2e-42
XP_005246216 (OMIM: 609786) PREDICTED: grainyhead- ( 429)  356  67.7 7.8e-11
XP_016859390 (OMIM: 609786) PREDICTED: grainyhead- ( 467)  354  67.4 1e-10
NP_937825 (OMIM: 609786) grainyhead-like protein 1 ( 618)  354  67.4 1.3e-10
XP_006711947 (OMIM: 609786) PREDICTED: grainyhead- ( 441)  346  66.1 2.4e-10
XP_016859389 (OMIM: 609786) PREDICTED: grainyhead- ( 479)  344  65.8 3.2e-10
XP_011508645 (OMIM: 609786) PREDICTED: grainyhead- ( 583)  344  65.9 3.8e-10
XP_006711945 (OMIM: 609786) PREDICTED: grainyhead- ( 630) 344 65.9 4e-10 XP_011540172 (OMIM: 606713,608317) PREDICTED: grai ( 509) 339 65.0 5.8e-10 NP_001181939 (OMIM: 606713,608317) grainyhead-like ( 556) 339 65.0 6.2e-10 XP_011540171 (OMIM: 606713,608317) PREDICTED: grai ( 556) 339 65.0 6.2e-10 NP_937816 (OMIM: 606713,608317) grainyhead-like pr ( 602) 339 65.1 6.7e-10 NP_067003 (OMIM: 606713,608317) grainyhead-like pr ( 607) 339 65.1 6.7e-10 NP_937817 (OMIM: 606713,608317) grainyhead-like pr ( 626) 339 65.1 6.9e-10 XP_011515609 (OMIM: 608576,608641,616029) PREDICTE ( 591) 336 64.6 9.1e-10 XP_011515608 (OMIM: 608576,608641,616029) PREDICTE ( 609) 336 64.6 9.4e-10 NP_001317522 (OMIM: 608576,608641,616029) grainyhe ( 609) 336 64.6 9.4e-10 NP_079191 (OMIM: 608576,608641,616029) grainyhead- ( 625) 336 64.6 9.6e-10 >>NP_005644 (OMIM: 189889) alpha-globin transcription fa (502 aa) initn: 3395 init1: 3395 opt: 3395 Z-score: 2915.0 bits: 548.9 E(85289): 1.2e-155 Smith-Waterman score: 3395; 100.0% identity (100.0% similar) in 502 aa overlap (1-502:1-502) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 
KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE 430 440 450 460 470 480 490 500 pF1KB8 EACFILDTMKAETNDSYHIILK :::::::::::::::::::::: NP_005 EACFILDTMKAETNDSYHIILK 490 500 >>NP_001166923 (OMIM: 189889) alpha-globin transcription (501 aa) initn: 3601 init1: 3323 opt: 3378 Z-score: 2900.5 bits: 546.2 E(85289): 8.1e-155 Smith-Waterman score: 3378; 99.8% identity (99.8% similar) in 502 aa overlap (1-502:1-501) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE 430 440 450 460 470 480 490 500 pF1KB8 EACFILDTMKAETNDSYHIILK :::::::::: ::::::::::: NP_001 EACFILDTMK-ETNDSYHIILK 490 500 >>NP_001121632 (OMIM: 609784) upstream-binding protein 1 (504 aa) initn: 2124 init1: 1693 opt: 2517 Z-score: 2163.7 bits: 409.9 E(85289): 8.9e-114 Smith-Waterman score: 2517; 73.3% identity (90.4% similar) in 509 aa overlap (1-502:1-504) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::.::. :::::::::.::::::::::::::::::::::::::::::::.:::: :. 
NP_001 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :.. :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::. NP_001 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.:::::::: NP_001 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP ::::.:::::::::::: ::::::::::::.:.::::.:::::::.:::::::::::::: NP_001 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP 180 190 200 210 220 230 250 260 270 280 290 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEI--TYVNNSPSPG--FNS :::::::::::::::::: ::::::::::.:::::::::::. .::::::::. :.: NP_001 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KB8 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS ..:. :. ..:.: :::: . .. ... :..: ::.:::: .::::..::::.::: NP_001 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS 300 310 320 330 340 350 360 370 380 390 400 410 pF1KB8 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. . : ::: .. NP_001 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS 360 370 380 390 400 410 420 430 440 450 460 470 pF1KB8 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE :.: .:. .:::::::::. : :...:.: .:.: ::.:.:.:::::::.:.::. NP_001 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ 420 430 440 450 460 470 480 490 500 pF1KB8 MIQNFQEEACFILDTMKAETNDSYHIILK :.::::.:.::...:.:::..:. 
::::: NP_001 MVQNFQDESCFLFSTVKAESSDGIHIILK 480 490 500 >>NP_055368 (OMIM: 609785) transcription factor CP2-like (479 aa) initn: 2065 init1: 1475 opt: 2414 Z-score: 2075.9 bits: 393.6 E(85289): 6.9e-109 Smith-Waterman score: 2414; 74.6% identity (91.5% similar) in 469 aa overlap (35-502:16-476) 10 20 30 40 50 60 pF1KB8 LKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDNENKI .:.: . :::::::::::: .: :.:: .. NP_055 MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL 10 20 30 40 70 80 90 100 110 120 pF1KB8 LPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVF :.:::::::::::::::.:::::::::::::::.:.:::::.. ..: : ::::.:::: NP_055 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB8 HDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTS ::::::::::::::::::.::::::::::::.::::.::::.:::::.::::::::::.: NP_055 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB8 VFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGAD .:::::::::::: :::::::::::::::::::.:::::::::::::::::::::::::: NP_055 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD 170 180 190 200 210 220 250 260 270 280 290 300 pF1KB8 RKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITY-VNNSPSPGFNSSHSSFSL :::::::::::::: .:::::::::::::::::::::...: ::..:::..:.: .::.: NP_055 RKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPDVAYQVNSAPSPSYNGSPNSFGL 230 240 250 260 270 280 310 320 330 340 350 360 pF1KB8 GEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKLTRD ::::.::.: : : .:.:::... :.::::::::::: : :::..::::::::..:: NP_055 GEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGADLLKMSRD 290 300 310 320 330 340 370 380 390 400 410 420 pF1KB8 DVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDSNGT :..::::::::::::::.::: :::..::::::: .:.. ::.. 
.:::: NP_055 DLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQEL-----EQNRVPLQQKRDGSGDSN-- 350 360 370 380 390 430 440 450 460 470 480 pF1KB8 FFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQEEAC . :::::.:::::..:: ::::.:.:::: .: ..:.::::::::..:.::.::::.:.: NP_055 LSVYHAIFLEELTTLELIEKIANLYSISPQHIHRVYRQGPTGIHVVVSNEMVQNFQDESC 400 410 420 430 440 450 490 500 pF1KB8 FILDTMKAETNDSYHIILK :.:.:.:::.::.:::::: NP_055 FVLSTIKAESNDGYHIILKCGL 460 470 >>XP_016859391 (OMIM: 609785) PREDICTED: transcription f (406 aa) initn: 2035 init1: 1475 opt: 2046 Z-score: 1762.0 bits: 335.3 E(85289): 2.1e-91 Smith-Waterman score: 2046; 76.3% identity (91.2% similar) in 388 aa overlap (35-421:16-397) 10 20 30 40 50 60 pF1KB8 LKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDNENKI .:.: . :::::::::::: .: :.:: .. XP_016 MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL 10 20 30 40 70 80 90 100 110 120 pF1KB8 LPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVF :.:::::::::::::::.:::::::::::::::.:.:::::.. ..: : ::::.:::: XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB8 HDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTS ::::::::::::::::::.::::::::::::.::::.::::.:::::.::::::::::.: XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB8 VFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGAD .:::::::::::: :::::::::::::::::::.:::::::::::::::::::::::::: XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD 170 180 190 200 210 220 250 260 270 280 290 300 pF1KB8 RKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITY-VNNSPSPGFNSSHSSFSL :::::::::::::: .:::::::::::::::::::::...: ::..:::..:.: .::.: XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPDVAYQVNSAPSPSYNGSPNSFGL 230 240 250 260 270 280 310 320 330 340 350 360 pF1KB8 GEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKLTRD ::::.::.: : : .:.:::... 
:.::::::::::: : :::..::::::::..:: XP_016 GEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGADLLKMSRD 290 300 310 320 330 340 370 380 390 400 410 420 pF1KB8 DVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDSNGT :..::::::::::::::.::: :::..::::::: .:.. ::.. .:::: XP_016 DLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQE-----LEQNRVPLQQKRDGSGDSNLS 350 360 370 380 390 430 440 450 460 470 480 pF1KB8 FFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQEEAC XP_016 DGAELPR 400 >>NP_001166924 (OMIM: 189889) alpha-globin transcription (450 aa) initn: 3196 init1: 1714 opt: 1782 Z-score: 1535.5 bits: 293.5 E(85289): 8.7e-79 Smith-Waterman score: 2915; 89.6% identity (89.6% similar) in 502 aa overlap (1-502:1-450) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP :::::::: : NP_001 KRTSVFIQ---------------------------------------------------P 250 260 270 280 290 300 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSS 190 200 210 220 230 240 310 320 330 340 350 360 pF1KB8 FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 
FSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKL 250 260 270 280 290 300 370 380 390 400 410 420 pF1KB8 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDS 310 320 330 340 350 360 430 440 450 460 470 480 pF1KB8 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQE 370 380 390 400 410 420 490 500 pF1KB8 EACFILDTMKAETNDSYHIILK :::::::::: ::::::::::: NP_001 EACFILDTMK-ETNDSYHIILK 430 440 450 >>NP_001121633 (OMIM: 609784) upstream-binding protein 1 (540 aa) initn: 2108 init1: 1634 opt: 1674 Z-score: 1441.9 bits: 276.4 E(85289): 1.4e-73 Smith-Waterman score: 2396; 68.5% identity (84.3% similar) in 536 aa overlap (1-493:1-531) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::.::. :::::::::.::::::::::::::::::::::::::::::::.:::: :. NP_001 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :.. :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::. 
NP_001 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.:::::::: NP_001 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP ::::.:::::::::::: ::::::::::::.:.::::.:::::::.:::::::::::::: NP_001 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP 180 190 200 210 220 230 250 260 270 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTE------------------------ :::::::::::::::::: ::::::::::.:::::: NP_001 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLP 240 250 260 270 280 290 280 290 300 310 pF1KB8 ------------CSPWPEI--TYVNNSPSPG--FNS-SHSSFSLGEGNGS-PNHQPEPPP :::::. .::::::::. :.: ..:. :. ..:.: :::: . NP_001 ADYGDSLAKRGSCSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGAS 300 310 320 330 340 350 320 330 340 350 360 370 pF1KB8 PVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKLTRDDVIQICGPADGIRL .. ... :..: ::.:::: .::::..::::.:::::::::::..:..:::: :::::: NP_001 QTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRL 360 370 380 390 400 410 380 390 400 410 420 430 pF1KB8 FNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTA .:.::.: ::::::::::.:. . : ::: .. :.: .:. .:::::::::. : NP_001 YNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENG--SGAPYVYHAIYLEEMIA 420 430 440 450 460 470 440 450 460 470 480 490 pF1KB8 VELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKAETNDSY :...:.: .:.: ::.:.:.:::::::.:.::.:.::::.:.::...:.:::. 
NP_001 SEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGI 480 490 500 510 520 530 500 pF1KB8 HIILK NP_001 HIILK 540 >>NP_055332 (OMIM: 609784) upstream-binding protein 1 is (540 aa) initn: 2108 init1: 1634 opt: 1674 Z-score: 1441.9 bits: 276.4 E(85289): 1.4e-73 Smith-Waterman score: 2396; 68.5% identity (84.3% similar) in 536 aa overlap (1-493:1-531) 10 20 30 40 50 60 pF1KB8 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN :::.::. :::::::::.::::::::::::::::::::::::::::::::.:::: :. NP_055 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF :.. :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::. NP_055 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB8 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.:::::::: NP_055 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB8 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP ::::.:::::::::::: ::::::::::::.:.::::.:::::::.:::::::::::::: NP_055 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP 180 190 200 210 220 230 250 260 270 pF1KB8 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTE------------------------ :::::::::::::::::: ::::::::::.:::::: NP_055 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLP 240 250 260 270 280 290 280 290 300 310 pF1KB8 ------------CSPWPEI--TYVNNSPSPG--FNS-SHSSFSLGEGNGS-PNHQPEPPP :::::. .::::::::. :.: ..:. :. ..:.: :::: . NP_055 ADYGDSLAKRGSCSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGAS 300 310 320 330 340 350 320 330 340 350 360 370 pF1KB8 PVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKLTRDDVIQICGPADGIRL .. ... 
:..: ::.:::: .::::..::::.:::::::::::..:..:::: :::::: NP_055 QTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRL 360 370 380 390 400 410 380 390 400 410 420 430 pF1KB8 FNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTA .:.::.: ::::::::::.:. . : ::: .. :.: .:. .:::::::::. : NP_055 YNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENG--SGAPYVYHAIYLEEMIA 420 430 440 450 460 470 440 450 460 470 480 490 pF1KB8 VELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKAETNDSY :...:.: .:.: ::.:.:.:::::::.:.::.:.::::.:.::...:.:::. NP_055 SEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGI 480 490 500 510 520 530 500 pF1KB8 HIILK NP_055 HIILK 540 >>XP_016859393 (OMIM: 609785) PREDICTED: transcription f (273 aa) initn: 1406 init1: 1406 opt: 1420 Z-score: 1228.9 bits: 236.0 E(85289): 1e-61 Smith-Waterman score: 1420; 82.4% identity (92.2% similar) in 256 aa overlap (35-282:16-270) 10 20 30 40 50 60 pF1KB8 LKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDNENKI .:.: . :::::::::::: .: :.:: .. XP_016 MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL 10 20 30 40 70 80 90 100 110 120 pF1KB8 LPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVF :.:::::::::::::::.:::::::::::::::.:.:::::.. 
..: : ::::.:::: XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB8 HDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTS ::::::::::::::::::.::::::::::::.::::.::::.:::::.::::::::::.: XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB8 VFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGAD .:::::::::::: :::::::::::::::::::.:::::::::::::::::::::::::: XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD 170 180 190 200 210 220 250 260 270 280 290 pF1KB8 RKQKTDREKMEKRTPHEKEKYQPSYETTILTE----CS----PWPEITYVNNSPSPGFNS :::::::::::::: .:::::::::::::::: :: :: : XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTEPQPLCSSADCPWEERSW 230 240 250 260 270 300 310 320 330 340 350 pF1KB8 SHSSFSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGAD >>XP_016859394 (OMIM: 609785) PREDICTED: transcription f (273 aa) initn: 1402 init1: 1402 opt: 1408 Z-score: 1218.6 bits: 234.1 E(85289): 3.9e-61 Smith-Waterman score: 1408; 85.1% identity (95.5% similar) in 242 aa overlap (35-276:16-256) 10 20 30 40 50 60 pF1KB8 LKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDNENKI .:.: . :::::::::::: .: :.:: .. XP_016 MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL 10 20 30 40 70 80 90 100 110 120 pF1KB8 LPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIFRVVF :.:::::::::::::::.:::::::::::::::.:.:::::.. 
..: : ::::.:::: XP_016 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB8 HDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPAKRTS ::::::::::::::::::.::::::::::::.::::.::::.:::::.::::::::::.: XP_016 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB8 VFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKPKGAD .:::::::::::: :::::::::::::::::::.:::::::::::::::::::::::::: XP_016 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD 170 180 190 200 210 220 250 260 270 280 290 300 pF1KB8 RKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPEITYVNNSPSPGFNSSHSSFSLG :::::::::::::: .:::::::::::::::: XP_016 RKQKTDREKMEKRTAQEKEKYQPSYETTILTEEAQRDPLGTSLAPGVTY 230 240 250 260 270 310 320 330 340 350 360 pF1KB8 EGNGSPNHQPEPPPPVTDNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFSGADLLKLTRDD 502 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 12:25:10 2016 done: Fri Nov 4 12:25:11 2016 Total Scan time: 10.250 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]