FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7841, 504 aa 1>>>pF1KB7841 504 - 504 aa - 504 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4870+/-0.000986; mu= 8.4842+/- 0.060 mean_var=117.4324+/-23.715, 0's: 0 Z-trim(108.0): 16 B-trim: 0 in 0/52 Lambda= 0.118353 statistics sampled from 9938 (9952) to 9938 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.306), width: 16 Scan time: 2.800 The best scores are: opt bits E(32554) CCDS46788.1 UBP1 gene_id:7342|Hs108|chr3 ( 504) 3376 587.5 1.1e-167 CCDS8808.1 TFCP2 gene_id:7024|Hs108|chr12 ( 502) 2517 440.9 1.6e-123 CCDS2659.1 UBP1 gene_id:7342|Hs108|chr3 ( 540) 1867 329.9 4.4e-90 CCDS2134.1 TFCP2L1 gene_id:29842|Hs108|chr2 ( 479) 1854 327.7 1.8e-89 CCDS55827.1 TFCP2 gene_id:7024|Hs108|chr12 ( 450) 1099 198.7 1.1e-50 CCDS33144.2 GRHL1 gene_id:29841|Hs108|chr2 ( 618) 385 76.9 7.4e-14 CCDS53284.1 GRHL3 gene_id:57822|Hs108|chr1 ( 556) 351 71.0 3.8e-12 CCDS44088.1 GRHL3 gene_id:57822|Hs108|chr1 ( 602) 351 71.1 4e-12 CCDS251.1 GRHL3 gene_id:57822|Hs108|chr1 ( 607) 351 71.1 4.1e-12 CCDS252.2 GRHL3 gene_id:57822|Hs108|chr1 ( 626) 351 71.1 4.2e-12 CCDS83312.1 GRHL2 gene_id:79977|Hs108|chr8 ( 609) 343 69.7 1e-11 CCDS34931.1 GRHL2 gene_id:79977|Hs108|chr8 ( 625) 343 69.7 1.1e-11 >>CCDS46788.1 UBP1 gene_id:7342|Hs108|chr3 (504 aa) initn: 3376 init1: 3376 opt: 3376 Z-score: 3123.8 bits: 587.5 E(32554): 1.1e-167 Smith-Waterman score: 3376; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504) 10 20 30 40 50 60 pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB7 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF 430 440 450 460 470 480 490 500 pF1KB7 QDESCFLFSTVKAESSDGIHIILK :::::::::::::::::::::::: CCDS46 QDESCFLFSTVKAESSDGIHIILK 490 500 >>CCDS8808.1 TFCP2 gene_id:7024|Hs108|chr12 (502 aa) initn: 2124 init1: 1693 opt: 2517 Z-score: 2331.1 bits: 440.9 E(32554): 1.6e-123 Smith-Waterman score: 2517; 73.3% identity (90.4% similar) in 509 aa overlap (1-504:1-502) 10 20 30 40 50 pF1KB7 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG :::.::. :::::::::.::::::::::::::::::::::::::::::::.:::: :. CCDS88 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII :.. :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::. CCDS88 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.:::::::: CCDS88 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB7 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP ::::.:::::::::::: ::::::::::::.:.::::.:::::::.:::::::::::::: CCDS88 KRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKENENGEYTEHLHSASCQIKVFKP 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB7 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS :::::::::::::::::: ::::::::::.:::::::::::. .::::::::. :.: CCDS88 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPE--ITYVNNSPSPG--FNS 250 260 270 280 290 300 310 320 330 340 350 pF1KB7 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS ..:. :. ..:.: :::: . .. ... :..: ::.:::: .::::..::::.::: CCDS88 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS 300 310 320 330 340 350 360 370 380 390 400 410 pF1KB7 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. . : ::: .. CCDS88 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ 360 370 380 390 400 410 420 430 440 450 460 470 pF1KB7 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ :.: .:. .:::::::::. : :...:.: .:.: ::.:.:.:::::::.:.::. CCDS88 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE 420 430 440 450 460 470 480 490 500 pF1KB7 MVQNFQDESCFLFSTVKAESSDGIHIILK :.::::.:.::...:.:::..:. ::::: CCDS88 MIQNFQEEACFILDTMKAETNDSYHIILK 480 490 500 >>CCDS2659.1 UBP1 gene_id:7342|Hs108|chr3 (540 aa) initn: 3362 init1: 1847 opt: 1867 Z-score: 1730.8 bits: 329.9 E(32554): 4.4e-90 Smith-Waterman score: 3273; 93.3% identity (93.3% similar) in 536 aa overlap (1-500:1-536) 10 20 30 40 50 60 pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 HPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 FHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 SAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGA 190 200 210 220 230 240 250 260 270 pF1KB7 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTE--------------------------- ::::::::::::::::::::::::::::::::: CCDS26 DRKQKTDREKMEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLPADY 250 260 270 280 290 300 280 290 300 310 320 pF1KB7 ---------CSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 GDSLAKRGSCSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTS 310 320 330 340 350 360 330 340 350 360 370 380 pF1KB7 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 GEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNS 370 380 390 400 410 420 390 400 410 420 430 440 pF1KB7 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 LKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEEMIASEVAR 430 440 450 460 470 480 450 460 470 480 490 500 pF1KB7 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS26 KLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESSDGIHIILK 490 500 510 520 530 540 >>CCDS2134.1 TFCP2L1 gene_id:29842|Hs108|chr2 (479 aa) initn: 2156 init1: 1446 opt: 1854 Z-score: 1719.6 bits: 327.7 E(32554): 1.8e-89 Smith-Waterman score: 2176; 68.1% identity (89.2% similar) in 474 aa overlap (32-504:16-476) 10 20 30 40 50 60 pF1KB7 AWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEH .:.: . :::::::::::. .: ..:.. CCDS21 MLFWHTQPEHYNQHNSGSY-LRDVLALPIFKQEEPQLSPENEARL 10 20 30 40 70 80 90 100 110 120 pF1KB7 PPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVF ::.:::.:::::::::::.:::::::::::::::.:.:::.::. ..: : ::::::::: CCDS21 PPLQYVLCAATSPAVKLHEETLTYLNQGQSYEIRLLENRKLGDFQDLNTKYVKSIIRVVF 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB7 HDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTS ::::::::::::::::.:.:::::.::.:::.::::.: :..:.::::::::::::::.: CCDS21 HDRRLQYTEHQQLEGWRWSRPGDRILDIDIPLSVGILDPRASPTQLNAVEFLWDPAKRAS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB7 AFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGAD ::::::::::::::::::::::::::.:.::::::::::::.:::::::::::::::::: CCDS21 AFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGAD 170 180 190 200 210 220 250 260 270 280 290 300 pF1KB7 RKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAY-VNNSPSPAPTFTSPQQ :::::::::::::::.:::::::::.:::::::::::: .:: ::..:::. . :: CCDS21 RKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPD--VAYQVNSAPSPSYN-GSP-- 230 240 250 260 270 310 320 330 340 350 360 pF1KB7 STCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGAD .. .. ..:.: :.: . : ...... :::.::..:::: .::::.. :::..::::: CCDS21 NSFGLGEGNAS-PTHPVE-ALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGAD 280 290 300 310 320 330 370 380 390 400 410 420 pF1KB7 LLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASE :::....::::::: ::::::.:..:.:.:::..:::::.: .. : ::. .:.. CCDS21 LLKMSRDDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQNRV-PLQQKRDGSGDS 340 350 360 370 380 390 430 440 450 460 470 480 pF1KB7 NGSGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNF : : :::::.:::. . :. .:.: ...: ..:..::::::::::..::..::::: CCDS21 NLS----VYHAIFLEELTTLELIEKIANLYSISPQHIHRVYRQGPTGIHVVVSNEMVQNF 400 410 420 430 440 450 490 500 pF1KB7 QDESCFLFSTVKAESSDGIHIILK ::::::..::.::::.:: ::::: CCDS21 QDESCFVLSTIKAESNDGYHIILKCGL 460 470 >>CCDS55827.1 TFCP2 gene_id:7024|Hs108|chr12 (450 aa) initn: 1850 init1: 1074 opt: 1099 Z-score: 1023.4 bits: 198.7 E(32554): 1.1e-50 Smith-Waterman score: 2057; 64.0% identity (80.4% similar) in 509 aa overlap (1-504:1-450) 10 20 30 40 50 pF1KB7 MAWVLKM---DEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDG :::.::. :::::::::.::::::::::::::::::::::::::::::::.:::: :. CCDS55 MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVLALPIFKQEESSLPPDN 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 ETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSII :.. :::::.:::::::::::::::::::::::::::::::::.:..:::::::::::. CCDS55 ENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKLGELPEINGKLVKSIF 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 RVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPA ::::::::::::::::::::.:::::::.::.:::::::::: :.::.:::.:::::::: CCDS55 RVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGIIDPRANPTQLNTVEFLWDPA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB7 KRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKP ::::.::: : CCDS55 KRTSVFIQ---------------------------------------------------P 240 250 260 270 280 290 pF1KB7 KGADRKQKTDREKMEKRTAHEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTS :::::::::::::::::: ::::::::::.:::::::::::. .::::::::. :.: CCDS55 KGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTECSPWPE--ITYVNNSPSPG--FNS 190 200 210 220 230 240 300 310 320 330 340 350 pF1KB7 PQQSTCSVPDSNSSSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFS ..:. :. ..:.: :::: . .. ... :..: ::.:::: .::::..::::.::: CCDS55 -SHSSFSLGEGNGS-PNHQPEPPPPVT-DNLLPTTTPQEAQQWLHRNRFSTFTRLFTNFS 250 260 270 280 290 300 360 370 380 390 400 410 pF1KB7 GADLLKLTKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASS ::::::::..:..:::: ::::::.:.::.: ::::::::::.:. . : ::: .. CCDS55 GADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTIYVCQESLQLREQQQQQQQQQQ 310 320 330 340 350 360 420 430 440 450 460 470 pF1KB7 ASENG--SGAPYVYHAIYLEEMIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQ :.: .:. .:::::::::. : :...:.: .:.: ::.:.:.:::::::.:.::. CCDS55 KHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSISPCQISQIYKQGPTGIHVLISDE 370 380 390 400 410 420 480 490 500 pF1KB7 MVQNFQDESCFLFSTVKAESSDGIHIILK :.::::.:.::...:.: :..:. ::::: CCDS55 MIQNFQEEACFILDTMK-ETNDSYHIILK 430 440 450 >>CCDS33144.2 GRHL1 gene_id:29841|Hs108|chr2 (618 aa) initn: 284 init1: 106 opt: 385 Z-score: 362.3 bits: 76.9 E(32554): 7.4e-14 Smith-Waterman score: 385; 30.7% identity (56.8% similar) in 345 aa overlap (20-345:210-536) 10 20 30 40 pF1KB7 MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDV-LALPIFKQ :...: .: .. ::. : .: ... CCDS33 TERVVVFDRNLNTDQFSSGAQAPNAQRRTPDSTFSETFKEGVQEVFFPSDLSLRMPGMNS 180 190 200 210 220 230 50 60 70 80 90 100 pF1KB7 EDSSLP-LDGETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPE :: . ..:.. :.:.. :. : : : :.::::.:: : : . . . . . CCDS33 EDYVFDSVSGNN----FEYTLEASKSLRQKPGDSTMTYLNKGQFYPITLKEVSSSEGIHH 240 250 260 270 280 290 110 120 130 140 150 160 pF1KB7 INGKLVKSIIRVVFHDRRLQYTEHQQLEGWK-WNR----PGDRLLDL-DIPMSVGIIDTR .: :.:.: ::: . . ....::. :: :. .: .:. : : . : . CCDS33 PISK-VRSVIMVVFAEDK---SREDQLRHWKYWHSRQHTAKQRCIDIADYKESFNTI-SN 300 310 320 330 340 350 170 180 190 200 210 220 pF1KB7 TNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRKHGGEKGVPFRIQVDTFKQNENGEY . ::. : :: ....::.:.:.::.:. .: : ::.:. :::::.. :. .. CCDS33 IEEIAYNAISFTWDINDEAKVFISVNCLSTDFSSQK--GVKGLPLNIQVDTYSYNNRSNK 360 370 380 390 400 230 240 250 260 270 280 pF1KB7 TDHLHSASCQIKVFKPKGADRKQKTDREKMEKRTAHE-KEKYQPSYDTTILTECSPWPDA .: : :::::: :::.:: . ...:. :: . . : ::. .: .:. : CCDS33 P--VHRAYCQIKVFCDKGAERKIRDEERKQSKRKVSDVKVPLLPSHKRMDITVFKPFIDL 410 420 430 440 450 460 290 300 310 320 330 pF1KB7 PTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQGDGASQTSGEQ-------IQPS-- : : :. :.. :..: .: .: . .:.:. : . :: CCDS33 DTQPVLFIPDV--HFANLQRGTHVLP---IASEELEGEGSVLKRGPYGTEDDFAVPPSTK 470 480 490 500 510 520 340 350 360 370 380 390 pF1KB7 -ATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAADGIRLYNSLKSRSV : :.: .. :: : CCDS33 LARIEEPKRVLLYVRKESEEVFDALMLKTPSLKGLMEAISDKYDVPHDKIGKIFKKCKKG 530 540 550 560 570 580 >>CCDS53284.1 GRHL3 gene_id:57822|Hs108|chr1 (556 aa) initn: 141 init1: 96 opt: 351 Z-score: 331.6 bits: 71.0 E(32554): 3.8e-12 Smith-Waterman score: 353; 24.9% identity (54.4% similar) in 458 aa overlap (54-504:175-554) 30 40 50 60 70 80 pF1KB7 SGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETL : : . . :.:.. . . .: . . CCDS53 RWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFEYTLGSPKAIHIKSGESPM 150 160 170 180 190 200 90 100 110 120 130 140 pF1KB7 TYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWK-WN-- .:::.:: : . : . : .... :::.. ::: .... .::. :: :. CCDS53 AYLNKGQFYPVT-LRTPAGGKGLALSSNKVKSVVMVVFDNEKVPV---EQLRFWKHWHSR 210 220 230 240 250 260 150 160 170 180 190 pF1KB7 RPG--DRLLDL-DIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRK .: .:..:. : . . .. . . ::. :.:. .....:: :.:.::.:. .: CCDS53 QPTAKQRVIDVADCKENFNTVE-HIEEVAYNALSFVWNVNEEAKVFIGVNCLSTDFSSQK 270 280 290 300 310 200 210 220 230 240 250 pF1KB7 HGGEKGVPFRIQVDTFKQNENGEYTDHL-HSASCQIKVFKPKGADRKQKTDREKMEKRTA : ::::. .:.::. . : :..: : : ::::.: :::.::.. :..:. .: . CCDS53 --GVKGVPLNLQIDTY---DCGLGTERLVHRAVCQIKIFCDKGAERKMRDDERKQFRRKV 320 330 340 350 360 370 260 270 280 290 300 310 pF1KB7 HEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQ . .. . . .: : . :.:. : . .: . .:. . :: ... CCDS53 KCPDSSNSGVKGCLL---SGFRGNETTYLR----PETDLETPP--VLFIPNVHFSSLQRS 380 390 400 410 420 320 330 340 350 360 370 pF1KB7 GDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAA : ::. ..: ::.. .. :: : .:. : . . : .:: .: CCDS53 G-GAAPSAG----PSSS----NRLPLKRTCSPFTEEFEPLPS----KQAKEGDLQ----- 430 440 450 460 380 390 400 410 420 430 pF1KB7 DGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEE :. .:: :: .:. :. :..:. CCDS53 ----------------RVLLYVRRE-----------------TEE------VFDALMLKT 470 480 440 450 460 470 480 490 pF1KB7 MIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESS . . .. ...: ..: .::.. :: . ......:..... ::.. .: . CCDS53 PDLKGLRNAISEKYGFPEENIYKVYKKCKRGILVNMDNNIIQHYSNHVAFLLDM--GELD 490 500 510 520 530 540 500 pF1KB7 DGIHIILK :.:::: CCDS53 GKIQIILKEL 550 >>CCDS44088.1 GRHL3 gene_id:57822|Hs108|chr1 (602 aa) initn: 141 init1: 96 opt: 351 Z-score: 331.1 bits: 71.1 E(32554): 4e-12 Smith-Waterman score: 353; 24.9% identity (54.4% similar) in 458 aa overlap (54-504:221-600) 30 40 50 60 70 80 pF1KB7 SGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETL : : . . :.:.. . . .: . . CCDS44 RWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFEYTLGSPKAIHIKSGESPM 200 210 220 230 240 250 90 100 110 120 130 140 pF1KB7 TYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWK-WN-- .:::.:: : . : . : .... :::.. ::: .... .::. :: :. CCDS44 AYLNKGQFYPVT-LRTPAGGKGLALSSNKVKSVVMVVFDNEKVPV---EQLRFWKHWHSR 260 270 280 290 300 150 160 170 180 190 pF1KB7 RPG--DRLLDL-DIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRK .: .:..:. : . . .. . . ::. :.:. .....:: :.:.::.:. .: CCDS44 QPTAKQRVIDVADCKENFNTVE-HIEEVAYNALSFVWNVNEEAKVFIGVNCLSTDFSSQK 310 320 330 340 350 360 200 210 220 230 240 250 pF1KB7 HGGEKGVPFRIQVDTFKQNENGEYTDHL-HSASCQIKVFKPKGADRKQKTDREKMEKRTA : ::::. .:.::. . : :..: : : ::::.: :::.::.. :..:. .: . CCDS44 --GVKGVPLNLQIDTY---DCGLGTERLVHRAVCQIKIFCDKGAERKMRDDERKQFRRKV 370 380 390 400 410 420 260 270 280 290 300 310 pF1KB7 HEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQ . .. . . .: : . :.:. : . .: . .:. . :: ... CCDS44 KCPDSSNSGVKGCLL---SGFRGNETTYLR----PETDLETPP--VLFIPNVHFSSLQRS 430 440 450 460 470 320 330 340 350 360 370 pF1KB7 GDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAA : ::. ..: ::.. .. :: : .:. : . . : .:: .: CCDS44 G-GAAPSAG----PSSS----NRLPLKRTCSPFTEEFEPLPS----KQAKEGDLQ----- 480 490 500 510 380 390 400 410 420 430 pF1KB7 DGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEE :. .:: :: .:. :. :..:. CCDS44 ----------------RVLLYVRRE-----------------TEE------VFDALMLKT 520 530 440 450 460 470 480 490 pF1KB7 MIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESS . . .. ...: ..: .::.. :: . ......:..... ::.. .: . CCDS44 PDLKGLRNAISEKYGFPEENIYKVYKKCKRGILVNMDNNIIQHYSNHVAFLLDM--GELD 540 550 560 570 580 590 500 pF1KB7 DGIHIILK :.:::: CCDS44 GKIQIILKEL 600 >>CCDS251.1 GRHL3 gene_id:57822|Hs108|chr1 (607 aa) initn: 141 init1: 96 opt: 351 Z-score: 331.0 bits: 71.1 E(32554): 4.1e-12 Smith-Waterman score: 353; 24.9% identity (54.4% similar) in 458 aa overlap (54-504:226-605) 30 40 50 60 70 80 pF1KB7 SGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETL : : . . :.:.. . . .: . . CCDS25 RWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFEYTLGSPKAIHIKSGESPM 200 210 220 230 240 250 90 100 110 120 130 140 pF1KB7 TYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWK-WN-- .:::.:: : . : . : .... :::.. ::: .... .::. :: :. CCDS25 AYLNKGQFYPVT-LRTPAGGKGLALSSNKVKSVVMVVFDNEKVPV---EQLRFWKHWHSR 260 270 280 290 300 310 150 160 170 180 190 pF1KB7 RPG--DRLLDL-DIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRK .: .:..:. : . . .. . . ::. :.:. .....:: :.:.::.:. .: CCDS25 QPTAKQRVIDVADCKENFNTVE-HIEEVAYNALSFVWNVNEEAKVFIGVNCLSTDFSSQK 320 330 340 350 360 370 200 210 220 230 240 250 pF1KB7 HGGEKGVPFRIQVDTFKQNENGEYTDHL-HSASCQIKVFKPKGADRKQKTDREKMEKRTA : ::::. .:.::. . : :..: : : ::::.: :::.::.. :..:. .: . CCDS25 --GVKGVPLNLQIDTY---DCGLGTERLVHRAVCQIKIFCDKGAERKMRDDERKQFRRKV 380 390 400 410 420 260 270 280 290 300 310 pF1KB7 HEKEKYQPSYDTTILTECSPWPDAPTAYVNNSPSPAPTFTSPQQSTCSVPDSNSSSPNHQ . .. . . .: : . :.:. : . .: . .:. . :: ... CCDS25 KCPDSSNSGVKGCLL---SGFRGNETTYLR----PETDLETPP--VLFIPNVHFSSLQRS 430 440 450 460 470 320 330 340 350 360 370 pF1KB7 GDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLVQICGAA : ::. ..: ::.. .. :: : .:. : . . : .:: .: CCDS25 G-GAAPSAG----PSSS----NRLPLKRTCSPFTEEFEPLPS----KQAKEGDLQ----- 480 490 500 510 380 390 400 410 420 430 pF1KB7 DGIRLYNSLKSRSVRPRLTIYVCREQPSSTVLQGQQQAASSASENGSGAPYVYHAIYLEE :. .:: :: .:. :. :..:. CCDS25 ----------------RVLLYVRRE-----------------TEE------VFDALMLKT 520 530 440 450 460 470 480 490 pF1KB7 MIASEVARKLALVFNIPLHQINQVYRQGPTGIHILVSDQMVQNFQDESCFLFSTVKAESS . . .. ...: ..: .::.. :: . ......:..... ::.. .: . CCDS25 PDLKGLRNAISEKYGFPEENIYKVYKKCKRGILVNMDNNIIQHYSNHVAFLLDM--GELD 540 550 560 570 580 590 500 pF1KB7 DGIHIILK :.:::: CCDS25 GKIQIILKEL 600 >>CCDS252.2 GRHL3 gene_id:57822|Hs108|chr1 (626 aa) initn: 141 init1: 96 opt: 351 Z-score: 330.8 bits: 71.1 E(32554): 4.2e-12 Smith-Waterman score: 351; 28.5% identity (59.9% similar) in 274 aa overlap (54-314:221-484) 30 40 50 60 70 80 pF1KB7 SGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETL : : . . :.:.. . . .: . . CCDS25 RWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFEYTLGSPKAIHIKSGESPM 200 210 220 230 240 250 90 100 110 120 130 140 pF1KB7 TYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWK-WN-- .:::.:: : . : . : .... :::.. ::: .... .::. :: :. CCDS25 AYLNKGQFYPVT-LRTPAGGKGLALSSNKVKSVVMVVFDNEKVPV---EQLRFWKHWHSR 260 270 280 290 300 150 160 170 180 190 pF1KB7 RPG--DRLLDL-DIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRK .: .:..:. : . . .. . . ::. :.:. .....:: :.:.::.:. .: CCDS25 QPTAKQRVIDVADCKENFNTVE-HIEEVAYNALSFVWNVNEEAKVFIGVNCLSTDFSSQK 310 320 330 340 350 360 200 210 220 230 240 250 pF1KB7 HGGEKGVPFRIQVDTFKQNENGEYTDHL-HSASCQIKVFKPKGADRKQKTDREKMEKRTA : ::::. .:.::. . : :..: : : ::::.: :::.::.. :..:. .: . CCDS25 --GVKGVPLNLQIDTY---DCGLGTERLVHRAVCQIKIFCDKGAERKMRDDERKQFRRKV 370 380 390 400 410 420 260 270 280 290 300 310 pF1KB7 HEKEKYQPSYDTTILTECSPWPDA---PTAYVNNSPS---PAPTFTSPQQSTCSVPDSNS . .. . . .:. . : . ... : : :.: :.: ..:... CCDS25 KCPDSSNSGVKGCLLSGFRGNETTYLRPETDLETPPVLFIPNVHFSSLQRSGGAAPSAGP 430 440 450 460 470 480 320 330 340 350 360 370 pF1KB7 SSPNHQGDGASQTSGEQIQPSATIQETQQWLLKNRFSSYTRLFSNFSGADLLKLTKEDLV :: : CCDS25 SSSNRLPLKRTCSPFTEEFEPLPSKQAKEGDLQRVLLYVRRETEEVFDALMLKTPDLKGL 490 500 510 520 530 540 504 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 09:52:06 2016 done: Sat Nov 5 09:52:07 2016 Total Scan time: 2.800 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]