FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7623, 314 aa
  1>>>pF1KB7623 314 - 314 aa - 314 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.0294+/-0.000743; mu= 6.7155+/- 0.046
 mean_var=205.6504+/-42.428, 0's: 0 Z-trim(116.7): 183 B-trim: 0 in 0/53
 Lambda= 0.089435
 statistics sampled from 17113 (17307) to 17113 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.82), E-opt: 0.2 (0.532), width: 16
 Scan time: 2.600

The best scores are:                                      opt bits E(32554)
CCDS4182.1 PITX1 gene_id:5307|Hs108|chr5           ( 314) 2174 292.0   4e-79
CCDS3694.1 PITX2 gene_id:5308|Hs108|chr4           ( 324) 1220 168.9 4.6e-42
CCDS3692.1 PITX2 gene_id:5308|Hs108|chr4           ( 317) 1214 168.2 7.8e-42
CCDS3693.1 PITX2 gene_id:5308|Hs108|chr4           ( 271) 1202 166.5   2e-41
CCDS7532.1 PITX3 gene_id:5309|Hs108|chr10          ( 302)  801 114.8 8.3e-26

>>CCDS4182.1 PITX1 gene_id:5307|Hs108|chr5 (314 aa)
 initn: 2174 init1: 2174 opt: 2174 Z-score: 1534.0 bits: 292.0 E(32554): 4e-79
Smith-Waterman score: 2174; 100.0% identity (100.0% similar) in 314 aa overlap (1-314:1-314)

               10        20        30        40        50        60
pF1KB7 MDAFKGGMSLERLPEGLRPPPPPPHDMGPAFHLARPADPREPLENSASESSDTELPEKER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MDAFKGGMSLERLPEGLRPPPPPPHDMGPAFHLARPADPREPLENSASESSDTELPEKER
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSMRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSMRE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 EIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLVQPYEDVYAAGYS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLVQPYEDVYAAGYS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 YNNWAAKSLAPAPLSTKSFTFFNSMSPLSSQSMFSAPSSISSMTMPSSMGPGAVPGMPNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YNNWAAKSLAPAPLSTKSFTFFNSMSPLSSQSMFSAPSSISSMTMPSSMGPGAVPGMPNS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 GLNNINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSLASLRLKSKQHSSFGYGGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GLNNINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSLASLRLKSKQHSSFGYGGL
              250       260       270       280       290       300

              310
pF1KB7 QGPASGLNACQYNS
       ::::::::::::::
CCDS41 QGPASGLNACQYNS
              310

>>CCDS3694.1 PITX2 gene_id:5308|Hs108|chr4 (324 aa)
 initn: 1148 init1: 595 opt: 1220 Z-score: 868.6 bits: 168.9 E(32554): 4.6e-42
Smith-Waterman score: 1228; 62.1% identity (78.2% similar) in 330 aa overlap (1-312:1-318)

       10 20 30 40
pF1KB7 MDAFKGGMSLERLPEGLRPPPPP------PHDMGPAFHLARPADPR------EPLE-NSA
       :. .:: . ::. : . :. .. : :: :..:: . :: ..
CCDS36 MNCMKGPLHLEHRAAGTKLSAVSSSSCHHPQPLAMASVLA-PGQPRSLDSSKHRLEVHTI
       10 20 30 40 50

       50 60 70 80 90 100
pF1KB7 SESSDTELPEKERGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEAT
       :..:. : ::... . :. :: ::.::.:::.:::::::::::::::::::
CCDS36 SDTSSPEAAEKDKSQQGKN-EDV--------GAEDPSKKKRQRRQRTHFTSQQLQELEAT
       60 70 80 90 100 110

       110 120 130 140 150 160
pF1KB7 FQRNRYPDMSMREEIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGL
       :::::::::: :::::::::::: ::::::::::::::::::::: .:::.:. :::.::
CCDS36 FQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGL
       120 130 140 150 160 170

       170 180 190 200 210 220
pF1KB7 VQPYEDVYAAGYSYNNWAAKSLAPAPLSTKSFTFFNSMS--PLSSQSMFSAPSSISSMTM
       .:::.:.: ::::::::::.:. : :::::: :::::. ::::::::: :.:::::.:
CCDS36 MQPYDDMYP-GYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSM
       180 190 200 210 220 230

       230 240 250 260 270 280
pF1KB7 PSSMGPGAVPGMPNSGLN---NINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSL
       ::: :.:: :.:.:.:: :.:::.. :::::. ::::. :. :: ::::::::::
CCDS36 SSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPY-VYRDTCNSSL
       230 240 250 260 270 280

       290 300 310
pF1KB7 ASLRLKSKQHSSFGYGGLQGPASGLNACQYNS
       ::::::.::::::::...:.:::.:.::::
CCDS36 ASLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV
       290 300 310 320

>>CCDS3692.1 PITX2 gene_id:5308|Hs108|chr4 (317 aa)
 initn: 1148 init1: 595 opt: 1214 Z-score: 864.6 bits: 168.2 E(32554): 7.8e-42
Smith-Waterman score: 1214; 69.0% identity (84.7% similar) in 274 aa overlap (44-312:42-311)

       20 30 40 50 60 70
pF1KB7 PEGLRPPPPPPHDMGPAFHLARPADPREPLENSASESSDTELPEKERGGEPKGPEDSGAG
       :. .:. .:... :.. : . : :
CCDS36 CVQLGVQPAAVECLFSKDSEIKKVEFTDSPESRKEAASSKFFPRQHPGANEK--DKSQQG
       20 30 40 50 60

       80 90 100 110 120 130
pF1KB7 GTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSMREEIAVWTNLTEPRV
       . ::.::.:::.::::::::::::::::::::::::::::: :::::::::::: ::
CCDS36 KNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATFQRNRYPDMSTREEIAVWTNLTEARV
       70 80 90 100 110 120

       140 150 160 170 180 190
pF1KB7 RVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLVQPYEDVYAAGYSYNNWAAKSLAPAP
       ::::::::::::::::::: .:::.:. :::.::.:::.:.: ::::::::::.:. :
CCDS36 RVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLMQPYDDMYP-GYSYNNWAAKGLTSAS
       130 140 150 160 170 180

       200 210 220 230 240
pF1KB7 LSTKSFTFFNSMS--PLSSQSMFSAPSSISSMTMPSSMGPGAVPGMPNSGLN---NINNL
       :::::: :::::. ::::::::: :.:::::.: ::: :.:: :.:.:.:: :.:::
CCDS36 LSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNL
       190 200 210 220 230 240

       250 260 270 280 290 300
pF1KB7 TGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSLASLRLKSKQHSSFGYGGLQGPASGLN
       .. :::::. ::::. :. :: ::::::::::::::::.::::::::...:.:::.:.
CCDS36 SSPSLNSAVPTPACPYAPPTPPY-VYRDTCNSSLASLRLKAKQHSSFGYASVQNPASNLS
       250 260 270 280 290 300

       310
pF1KB7 ACQYNS
       ::::
CCDS36 ACQYAVDRPV
       310

>>CCDS3693.1 PITX2 gene_id:5308|Hs108|chr4 (271 aa)
 initn: 1148 init1: 595 opt: 1202 Z-score: 857.1 bits: 166.5 E(32554): 2e-41
Smith-Waterman score: 1202; 76.2% identity (89.5% similar) in 239 aa overlap (79-312:29-265)

       50 60 70 80 90 100
pF1KB7 ESSDTELPEKERGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATF
       ::.::.:::.::::::::::::::::::::
CCDS36 METNCRKLVSACVQLEKDKSQQGKNEDVGAEDPSKKKRQRRQRTHFTSQQLQELEATF
       10 20 30 40 50

       110 120 130 140 150 160
pF1KB7 QRNRYPDMSMREEIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLV
       ::::::::: :::::::::::: ::::::::::::::::::::: .:::.:. :::.::.
CCDS36 QRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQAELCKNGFGPQFNGLM
       60 70 80 90 100 110

       170 180 190 200 210 220
pF1KB7 QPYEDVYAAGYSYNNWAAKSLAPAPLSTKSFTFFNSMS--PLSSQSMFSAPSSISSMTMP
       :::.:.: ::::::::::.:. : :::::: :::::. ::::::::: :.:::::.:
CCDS36 QPYDDMYP-GYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLSSQSMFSPPNSISSMSMS
       120 130 140 150 160 170

       230 240 250 260 270 280
pF1KB7 SSMGPGAVPGMPNSGLN---NINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCNSSLA
       ::: :.:: :.:.:.:: :.:::.. :::::. ::::. :. :: :::::::::::
CCDS36 SSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPYAPPTPPY-VYRDTCNSSLA
       180 190 200 210 220 230

       290 300 310
pF1KB7 SLRLKSKQHSSFGYGGLQGPASGLNACQYNS
       :::::.::::::::...:.:::.:.::::
CCDS36 SLRLKAKQHSSFGYASVQNPASNLSACQYAVDRPV
       240 250 260 270

>>CCDS7532.1 PITX3 gene_id:5309|Hs108|chr10 (302 aa)
 initn: 841 init1: 558 opt: 801 Z-score: 576.8 bits: 114.8 E(32554): 8.3e-26
Smith-Waterman score: 969; 56.3% identity (75.3% similar) in 300 aa overlap (31-312:3-296)

       10 20 30 40 50
pF1KB7 MDAFKGGMSLERLPEGLRPPPPPPHDMGPAFHLARPADPREP-LENSASESSDTELPEKE
       : : :. : : : : . . .:::.
CCDS75 MEFGLLSEAEARSPALSLSDAGTPHPQLPEHG
       10 20 30

       60 70 80 90 100 110
pF1KB7 -RGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDMSM
       .: : . : ..:. : :. .: . ::::::::::::::::::::::::::::::::
CCDS75 CKGQEHSDSEKASASLPG-GSPEDGSLKKKQRRQRTHFTSQQLQELEATFQRNRYPDMST
       40 50 60 70 80 90

       120 130 140 150 160 170
pF1KB7 REEIAVWTNLTEPRVRVWFKNRRAKWRKRERNQQLDLCKGGYVPQFSGLVQPYEDVYAAG
       :::::::::::: ::::::::::::::::::.:: .::::... ..::: :::.:: :
CCDS75 REEIAVWTNLTEARVRVWFKNRRAKWRKRERSQQAELCKGSFAAPLGGLVPPYEEVYP-G
       100 110 120 130 140 150

       180 190 200 210 220 230
pF1KB7 YSYNNWAAKSLAPAPLSTKSFTF-FNSMS--PLSSQSMFSAPSSISSMTMPSSMG-PGAV
       :::.:: :.::: ::..:.: : :::.. ::.:: .:: ::::.. .::. . ::.:
CCDS75 YSYGNWPPKALAP-PLAAKTFPFAFNSVNVGPLASQPVFSPPSSIAASMVPSAAAAPGTV
       160 170 180 190 200

       240 250 260 270 280
pF1KB7 PGMPNSGLNNINNLTGSSLNSAMSPGA--CPYGTPA--------SPYSVYRDTCNSSLAS
       :: :.. :..... . .:.: :: :::.. : ::: :::: :::::::
CCDS75 PG-PGA-LQGLGGGPPGLAPAAVSSGAVSCPYASAAAAAAAAASSPY-VYRDPCNSSLAS
       210 220 230 240 250 260

       290 300 310
pF1KB7 LRLKSKQHSSFGYGGLQGP--ASGLNACQYNS
       ::::.:::.::.: ...:: :..:. :::
CCDS75 LRLKAKQHASFSYPAVHGPPPAANLSPCQYAVERPV
       270 280 290 300

314 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 09:00:55 2016 done: Fri Nov 4 09:00:55 2016
 Total Scan time: 2.600 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]