FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7984, 324 aa
  1>>>pF1KB7984 324 - 324 aa - 324 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.9035+/-0.000782; mu= 5.7939+/- 0.048
 mean_var=169.7870+/-35.039, 0's: 0 Z-trim(113.7): 196 B-trim: 11 in 1/52
 Lambda= 0.098429
 statistics sampled from 14097 (14306) to 14097 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.771), E-opt: 0.2 (0.439), width: 16
 Scan time: 2.570

The best scores are:                              opt  bits E(32554)
CCDS3694.1 PITX2 gene_id:5308|Hs108|chr4  ( 324) 2186 321.8 4.6e-88
CCDS3692.1 PITX2 gene_id:5308|Hs108|chr4  ( 317) 1744 259.0 3.5e-69
CCDS3693.1 PITX2 gene_id:5308|Hs108|chr4  ( 271) 1731 257.1 1.1e-68
CCDS4182.1 PITX1 gene_id:5307|Hs108|chr5  ( 314) 1220 184.6 8.8e-47
CCDS7532.1 PITX3 gene_id:5309|Hs108|chr10 ( 302) 1008 154.5 9.8e-38
CCDS14215.1 ARX gene_id:170302|Hs108|chrX ( 562)  387  66.5 5.6e-11
CCDS9028.1 ALX1 gene_id:8092|Hs108|chr12  ( 326)  382  65.6   6e-11

>>CCDS3694.1 PITX2 gene_id:5308|Hs108|chr4 (324 aa)
 initn: 2186 init1: 2186 opt: 2186 Z-score: 1694.5 bits: 321.8 E(32554): 4.6e-88
Smith-Waterman score: 2186; 100.0% identity (100.0% similar) in 324 aa overlap (1-324:1-324)

               10        20        30        40        50        60
pF1KB7 MNCMKGPLHLEHRAAGTKLSAVSSSSCHHPQPLAMASVLAPGQPRSLDSSKHRLEVHTIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 MNCMKGPLHLEHRAAGTKLSAVSSSSCHHPQPLAMASVLAPGQPRSLDSSKHRLEVHTIS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 DTSSPEAAEKDKSQQGKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 DTSSPEAAEKDKSQQGKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 TREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPG
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 TREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 YSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 YSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 VPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPYVYRDTCNSSLASLRLKAKQHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 VPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPYVYRDTCNSSLASLRLKAKQHSS 250 260 270 280 290 300 310 320 pF1KB7 FGYASVQNPASNLSACQYAVDRPV :::::::::::::::::::::::: CCDS36 FGYASVQNPASNLSACQYAVDRPV 310 320 >>CCDS3692.1 PITX2 gene_id:5308|Hs108|chr4 (317 aa) initn: 1742 init1: 1742 opt: 1744 Z-score: 1355.5 bits: 259.0 E(32554): 3.5e-69 Smith-Waterman score: 1744; 93.9% identity (95.7% similar) in 279 aa overlap (48-324:39-317) 20 30 40 50 60 70 pF1KB7 KLSAVSSSSCHHPQPLAMASVLAPGQPRSLDSSKHRLEVHTIS--DTSSPEAAEKDKSQQ :: . : :. . . . 
: : ::::::: CCDS36 VSACVQLGVQPAAVECLFSKDSEIKKVEFTDSPESRKEAASSKFFPRQHPGANEKDKSQQ 10 20 30 40 50 60 80 90 100 110 120 130 pF1KB7 GKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 GKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEAR 70 80 90 100 110 120 140 150 160 170 180 190 pF1KB7 VRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 VRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSAS 130 140 150 160 170 180 200 210 220 230 240 250 pF1KB7 LSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 LSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNL 190 200 210 220 230 240 260 270 280 290 300 310 pF1KB7 SSPSLNSAVPTPACPYAPPTPPYVYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 SSPSLNSAVPTPACPYAPPTPPYVYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSA 250 260 270 280 290 300 320 pF1KB7 CQYAVDRPV ::::::::: CCDS36 CQYAVDRPV 310 >>CCDS3693.1 PITX2 gene_id:5308|Hs108|chr4 (271 aa) initn: 1731 init1: 1731 opt: 1731 Z-score: 1346.4 bits: 257.1 E(32554): 1.1e-68 Smith-Waterman score: 1731; 100.0% identity (100.0% similar) in 256 aa overlap (69-324:16-271) 40 50 60 70 80 90 pF1KB7 LAPGQPRSLDSSKHRLEVHTISDTSSPEAAEKDKSQQGKNEDVGAEDPSKKKRQRRQRTH :::::::::::::::::::::::::::::: CCDS36 METNCRKLVSACVQLEKDKSQQGKNEDVGAEDPSKKKRQRRQRTH 10 20 30 40 100 110 120 130 140 150 pF1KB7 FTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 FTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAEL 50 60 70 80 90 100 160 170 180 190 200 210 pF1KB7 CKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 CKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMF 110 
120 130 140 150 160 220 230 240 250 260 270 pF1KB7 SPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 SPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPY 170 180 190 200 210 220 280 290 300 310 320 pF1KB7 VYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV :::::::::::::::::::::::::::::::::::::::::::::: CCDS36 VYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV 230 240 250 260 270 >>CCDS4182.1 PITX1 gene_id:5307|Hs108|chr5 (314 aa) initn: 1148 init1: 595 opt: 1220 Z-score: 953.4 bits: 184.6 E(32554): 8.8e-47 Smith-Waterman score: 1228; 62.1% identity (78.2% similar) in 330 aa overlap (1-318:1-312) 10 20 30 40 50 pF1KB7 MNCMKGPLHLEHRAAGTKLSAVSSSSCHHPQPLAMASVLA-PGQPRSLDSSKHRLEVHTI :. .:: . ::. : . :. .. : :: :..:: . :: .. CCDS41 MDAFKGGMSLERLPEGLRPPPPP------PHDMGPAFHLARPADPR------EPLE-NSA 10 20 30 40 60 70 80 90 100 110 pF1KB7 SDTSSPEAAEKDKSQQGKN-EDVGA--------EDPSKKKRQRRQRTHFTSQQLQELEAT :..:. : ::... . :. :: :: .::.:::.::::::::::::::::::: CCDS41 SESSDTELPEKERGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEAT 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 FQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGL :::::::::: :::::::::::: ::::::::::::::::::::: .:::.:. :::.:: CCDS41 FQRNRYPDMSMREEIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGL 110 120 130 140 150 160 180 190 200 210 220 pF1KB7 MQPYDDMYP-GYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSM .:::.:.: ::::::::::.:. : :::::: :::::. ::::::::: :.:::::.: CCDS41 VQPYEDVYAAGYSYNNWAAKSLAPAPLSTKSFTFFNSMS--PLSSQSMFSAPSSISSMTM 170 180 190 200 210 220 230 240 250 260 270 280 pF1KB7 SSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPY-VYRDTCNSSL ::: :.:: :.:.:. :::.:::.. :::::. ::::. :. 
:: :::::::::: CCDS41 PSSMGPGAVPGMPNSG---LNNINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSL 230 240 250 260 270 280 290 300 310 320 pF1KB7 ASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV ::::::.::::::::...:.:::.:.:::: CCDS41 ASLRLKSKQHSSFGYGGLQGPASGLNACQYNS 290 300 310 >>CCDS7532.1 PITX3 gene_id:5309|Hs108|chr10 (302 aa) initn: 1012 init1: 712 opt: 1008 Z-score: 790.9 bits: 154.5 E(32554): 9.8e-38 Smith-Waterman score: 1013; 56.9% identity (77.6% similar) in 304 aa overlap (47-324:6-302) 20 30 40 50 60 70 pF1KB7 TKLSAVSSSSCHHPQPLAMASVLAPGQPRSLDSSKHRLEVHTISDTSSPEAAEKDKSQQG :. .. : . ..::...:. ... .: CCDS75 MEFGLLSEAEARSPALSLSDAGTPHPQLPEHGCKG 10 20 30 80 90 100 110 120 pF1KB7 K----NEDVGA-------EDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEI . .: ..: :: : ::.:::::::::::::::::::::::::::::::::: CCDS75 QEHSDSEKASASLPGGSPEDGSLKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEI 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB7 AVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYPGYSYNN :::::::::::::::::::::::::::.:::::::..:. ..::. ::...::::::.: CCDS75 AVWTNLTEARVRVWFKNRRAKWRKRERSQQAELCKGSFAAPLGGLVPPYEEVYPGYSYGN 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB7 WAAKGLTSASLSTKSFPF-FNSMNVNPLSSQSMFSPPNSIS-SMSMSSSMVPSAVTGVPG : :.: . :..:.::: :::.::.::.:: .::::.::. :: :.. .:..: : :: CCDS75 WPPKAL-APPLAAKTFPFAFNSVNVGPLASQPVFSPPSSIAASMVPSAAAAPGTVPG-PG 160 170 180 190 200 210 250 260 270 280 290 pF1KB7 SSLNSLNNLNNLSSPSLN-SAVPTPA--CPYAPP--------TPPYVYRDTCNSSLASLR . :..:.. . :.: .:: . : :::: . 
:::::: ::::::::: CCDS75 A----LQGLGG-GPPGLAPAAVSSGAVSCPYASAAAAAAAAASSPYVYRDPCNSSLASLR 220 230 240 250 260 300 310 320 pF1KB7 LKAKQHSSFGYASVQNP--ASNLSACQYAVDRPV ::::::.::.: .:..: :.::: :::::.::: CCDS75 LKAKQHASFSYPAVHGPPPAANLSPCQYAVERPV 270 280 290 300 >>CCDS14215.1 ARX gene_id:170302|Hs108|chrX (562 aa) initn: 340 init1: 288 opt: 387 Z-score: 310.6 bits: 66.5 E(32554): 5.6e-11 Smith-Waterman score: 387; 35.4% identity (61.0% similar) in 254 aa overlap (65-300:299-544) 40 50 60 70 80 90 pF1KB7 MASVLAPGQPRSLDSSKHRLEVHTISDTSSPEAAEKDKSQQGKNEDVGA--EDPSKKKRQ :: :: .... ..:. :. :..: CCDS14 AATGAVAAAAAAAVATEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQ 270 280 290 300 310 320 100 110 120 130 140 150 pF1KB7 RRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRER :: :: ::: ::.::: .::...:::. ::::.:. .::::::.:::.::::::::::. CCDS14 RRYRTTFTSYQLEELERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREK 330 340 350 360 370 380 160 170 180 190 200 pF1KB7 NQQAELCKNGF---GP-QFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKS-FPFFNSM :. :. :: . . ..:: : : .. .. :.:. .. . :: :. CCDS14 AG-AQTHPPGLPFPGPLSATHPLSPYLDASPFPPHHPALDSAWTAAAAAAAAAFP---SL 390 400 410 420 430 440 210 220 230 240 250 260 pF1KB7 NVNPLSSQSMFSPPNSISSMSMSSSMV------PSAVTGVPGSSLNSLNNLNNLSSPSLN : .: :. :: : . ...:. . :. .. . : .... :.. :. . CCDS14 PPPP-GSASL--PP-SGAPLGLSTFLGAAVFRHPAFISPAFGRLFSTMAPLTSASTAAAL 450 460 470 480 490 500 270 280 290 300 310 pF1KB7 SAVPTPACPYAPPT-----PPYVYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSAC :::: : . : . : ::.:.::::::.:.. CCDS14 LRQPTPAVEGAVASGALADPATAAADRRASSIAALRLKAKEHAAQLTQLNILPGTSTGKE 510 520 530 540 550 560 320 pF1KB7 QYAVDRPV CCDS14 VC >>CCDS9028.1 ALX1 gene_id:8092|Hs108|chr12 (326 aa) initn: 396 init1: 348 opt: 382 Z-score: 310.0 bits: 65.6 E(32554): 6e-11 Smith-Waterman score: 430; 33.6% identity (60.1% similar) in 301 aa overlap (3-300:49-320) 10 20 pF1KB7 MNCMK--GPL-HLEHRAAGTKLSAVSSSSCHH :.. ::: . ::.. . : ..:: .. 
CCDS90 DFYMGAGGPLEHVMETLDNESFYSKASAGKCVQAFGPLPRAEHHVRLERTSPCQDSSVNY 20 30 40 50 60 70 30 40 50 60 70 80 pF1KB7 PQPLAMASVLAPGQPRSLDSSKHRLEVHTISDTSSPEAAEKDKSQQGKNEDVGAEDPSKK ....: ::: : . .: . : :: . ..:.. . : . :.. CCDS90 ----GITKV--EGQP--LHTELNRAMDNCNSLRMSPVKGMQEKGELDELGDKCDSNVSSS 80 90 100 110 120 130 90 100 110 120 130 140 pF1KB7 KRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRK :. ::.:: ::: ::.::: .::...:::. .::..:. :.::::::.:::.:::::::: CCDS90 KK-RRHRTTFTSLQLEELEKVFQKTHYPDVYVREQLALRTELTEARVQVWFQNRRAKWRK 140 150 160 170 180 150 160 170 180 190 200 pF1KB7 RERNQQAELCKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKSFPFFNSMNV ::: : . :. :. .. . : : :: . : ::... .. ... .: .: . CCDS90 RERYGQIQQAKSHFAATYDISVLPRTDSYPQIQNNLWAGNASGGSVVTSCMLPRDTSSCM 190 200 210 220 230 240 210 220 230 240 250 260 pF1KB7 NPLSSQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPAC .: : . : . ::.. :. . . :: :::. . :: ... . CCDS90 TPYSHS-----PRTDSSYTGFSNH-QNQFSHVP---------LNNFFTDSLLTGATNG-- 250 260 270 280 290 270 280 290 300 310 320 pF1KB7 PYAPPTPPYVYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV .: : : : . ::.: ::.:::.:.. CCDS90 -HAFETKPEFERRS--SSIAVLRMKAKEHTANISWAM 300 310 320 324 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 10:22:25 2016 done: Sat Nov 5 10:22:25 2016 Total Scan time: 2.570 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]