FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5190, 588 aa
  1>>>pF1KB5190 588 - 588 aa - 588 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.2765+/-0.000825; mu= 12.4487+/- 0.050
 mean_var=180.8431+/-36.140, 0's: 0 Z-trim(114.1): 54 B-trim: 150 in 1/51
 Lambda= 0.095373
 statistics sampled from 14646 (14698) to 14646 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.771), E-opt: 0.2 (0.451), width: 16
 Scan time: 4.090

The best scores are:                               opt  bits E(32554)
CCDS35316.1 PJA1 gene_id:64219|Hs108|chrX  ( 588) 4049 569.4 4.4e-162
CCDS14393.1 PJA1 gene_id:64219|Hs108|chrX  ( 643) 4049 569.4 4.7e-162
CCDS14392.1 PJA1 gene_id:64219|Hs108|chrX  ( 455) 2484 354.0 2.5e-97
CCDS4099.1 PJA2 gene_id:9867|Hs108|chr5    ( 708) 1211 179.0 1.8e-44

>>CCDS35316.1 PJA1 gene_id:64219|Hs108|chrX (588 aa)
 initn: 4049 init1: 4049 opt: 4049 Z-score: 3023.3 bits: 569.4 E(32554): 4.4e-162
Smith-Waterman score: 4049; 100.0% identity (100.0% similar) in 588 aa overlap (1-588:1-588)

10 20 30 40 50 60
pF1KB5 MHRSAPSQTTKRSRSPFSTTRRSWDDSESSGTNLNIDNEDYSRYPPREYRASGSRRGMAY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MHRSAPSQTTKRSRSPFSTTRRSWDDSESSGTNLNIDNEDYSRYPPREYRASGSRRGMAY
10 20 30 40 50 60

70 80 90 100 110 120
pF1KB5 GHIDSYGADDSEEEGAGPVERPPVRGKTGKFKDDKLYDPEKGARSLAGPPPHFSSFSRDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 GHIDSYGADDSEEEGAGPVERPPVRGKTGKFKDDKLYDPEKGARSLAGPPPHFSSFSRDV
70 80 90 100 110 120

130 140 150 160 170 180
pF1KB5 REERDKLDPVPAARCSASRADFLPQSSVASQSSSEGKLATKGDSSERERREQNLPARPSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 REERDKLDPVPAARCSASRADFLPQSSVASQSSSEGKLATKGDSSERERREQNLPARPSR
130 140 150 160 170 180

190 200 210 220 230 240
pF1KB5 APVSICGGGENTSKSAEEPVVRPKIRNLASPNCVKPKIFFDTDDDDDMPHSTSRWRDTAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 APVSICGGGENTSKSAEEPVVRPKIRNLASPNCVKPKIFFDTDDDDDMPHSTSRWRDTAN
190 200 210 220 230 240

250 260 270 280 290 300
pF1KB5 DNEGHSDGLARRGRGESSSGYPEPKYPEDKREARSDQVKPEKVPRRRRTMADPDFWTHSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 DNEGHSDGLARRGRGESSSGYPEPKYPEDKREARSDQVKPEKVPRRRRTMADPDFWTHSD
250 260 270 280 290 300

310 320 330 340 350 360
pF1KB5 DYYKYCDEDSDSDKEWIAALRRKYRSREQTLSSSGESWETLPGKEEREPPQAKVSASTGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 DYYKYCDEDSDSDKEWIAALRRKYRSREQTLSSSGESWETLPGKEEREPPQAKVSASTGT
310 320 330 340 350 360

370 380 390 400 410 420
pF1KB5 SPGPGASASAGAGAGASAGSNGSNYLEEVREPSLQEEQASLEEGEIPWLQYHENDSSSEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SPGPGASASAGAGAGASAGSNGSNYLEEVREPSLQEEQASLEEGEIPWLQYHENDSSSEG
370 380 390 400 410 420

430 440 450 460 470 480
pF1KB5 DNDSGHELMQPGVFMLDGNNNLEDDSSVSEDLEVDWSLFDGFADGLGVAEAISYVDPQFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 DNDSGHELMQPGVFMLDGNNNLEDDSSVSEDLEVDWSLFDGFADGLGVAEAISYVDPQFL
430 440 450 460 470 480

490 500 510 520 530 540
pF1KB5 TYMALEERLAQAMETALAHLESLAVDVEVANPPASKESIDALPEILVTEDHGAVGQEMCC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 TYMALEERLAQAMETALAHLESLAVDVEVANPPASKESIDALPEILVTEDHGAVGQEMCC
490 500 510 520 530 540

550 560 570 580
pF1KB5 PICCSEYVKGEVATELPCHHYFHKPCVSIWLQKSGTCPVCRCMFPPPL
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 PICCSEYVKGEVATELPCHHYFHKPCVSIWLQKSGTCPVCRCMFPPPL
550 560 570 580

>>CCDS14393.1 PJA1 gene_id:64219|Hs108|chrX (643 aa)
 initn: 4049 init1: 4049 opt: 4049 Z-score: 3022.9 bits: 569.4 E(32554): 4.7e-162
Smith-Waterman score: 4049; 100.0% identity (100.0% similar) in 588 aa overlap (1-588:56-643)

10 20 30
pF1KB5 MHRSAPSQTTKRSRSPFSTTRRSWDDSESS
::::::::::::::::::::::::::::::
CCDS14 GRRHAYVSFRPPTSQRERIASQRKTNSEVPMHRSAPSQTTKRSRSPFSTTRRSWDDSESS
30 40 50 60 70 80

40 50 60 70 80 90
pF1KB5 GTNLNIDNEDYSRYPPREYRASGSRRGMAYGHIDSYGADDSEEEGAGPVERPPVRGKTGK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GTNLNIDNEDYSRYPPREYRASGSRRGMAYGHIDSYGADDSEEEGAGPVERPPVRGKTGK
90 100 110 120 130 140

100 110 120 130 140 150
pF1KB5 FKDDKLYDPEKGARSLAGPPPHFSSFSRDVREERDKLDPVPAARCSASRADFLPQSSVAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 FKDDKLYDPEKGARSLAGPPPHFSSFSRDVREERDKLDPVPAARCSASRADFLPQSSVAS
150 160 170 180 190 200

160 170 180 190 200 210
pF1KB5 QSSSEGKLATKGDSSERERREQNLPARPSRAPVSICGGGENTSKSAEEPVVRPKIRNLAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 QSSSEGKLATKGDSSERERREQNLPARPSRAPVSICGGGENTSKSAEEPVVRPKIRNLAS
210 220 230 240 250 260

220 230 240 250 260 270
pF1KB5 PNCVKPKIFFDTDDDDDMPHSTSRWRDTANDNEGHSDGLARRGRGESSSGYPEPKYPEDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PNCVKPKIFFDTDDDDDMPHSTSRWRDTANDNEGHSDGLARRGRGESSSGYPEPKYPEDK
270 280 290 300 310 320

280 290 300 310 320 330
pF1KB5 REARSDQVKPEKVPRRRRTMADPDFWTHSDDYYKYCDEDSDSDKEWIAALRRKYRSREQT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 REARSDQVKPEKVPRRRRTMADPDFWTHSDDYYKYCDEDSDSDKEWIAALRRKYRSREQT
330 340 350 360 370 380

340 350 360 370 380 390
pF1KB5 LSSSGESWETLPGKEEREPPQAKVSASTGTSPGPGASASAGAGAGASAGSNGSNYLEEVR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LSSSGESWETLPGKEEREPPQAKVSASTGTSPGPGASASAGAGAGASAGSNGSNYLEEVR
390 400 410 420 430 440

400 410 420 430 440 450
pF1KB5 EPSLQEEQASLEEGEIPWLQYHENDSSSEGDNDSGHELMQPGVFMLDGNNNLEDDSSVSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 EPSLQEEQASLEEGEIPWLQYHENDSSSEGDNDSGHELMQPGVFMLDGNNNLEDDSSVSE
450 460 470 480 490 500

460 470 480 490 500 510
pF1KB5 DLEVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQAMETALAHLESLAVDVEVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DLEVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQAMETALAHLESLAVDVEVA
510 520 530 540 550 560

520 530 540 550 560 570
pF1KB5 NPPASKESIDALPEILVTEDHGAVGQEMCCPICCSEYVKGEVATELPCHHYFHKPCVSIW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 NPPASKESIDALPEILVTEDHGAVGQEMCCPICCSEYVKGEVATELPCHHYFHKPCVSIW
570 580 590 600 610 620

580
pF1KB5 LQKSGTCPVCRCMFPPPL
::::::::::::::::::
CCDS14 LQKSGTCPVCRCMFPPPL
630 640

>>CCDS14392.1 PJA1 gene_id:64219|Hs108|chrX (455 aa)
 initn: 2742 init1: 2477 opt: 2484 Z-score: 1861.0 bits: 354.0 E(32554): 2.5e-97
Smith-Waterman score: 2484; 98.6% identity (99.5% similar) in 365 aa overlap (224-588:92-455)

200 210 220 230 240 250
pF1KB5 KSAEEPVVRPKIRNLASPNCVKPKIFFDTDDDDDMPHSTSRWRDTANDNEGHSDGLARRG
:..:. :::::::::::::::::::::::
CCDS14 SQTTKRSRSPFSTTRRSWDDSESSGTNLNIDNEDYS-STSRWRDTANDNEGHSDGLARRG
70 80 90 100 110 120

260 270 280 290 300 310
pF1KB5 RGESSSGYPEPKYPEDKREARSDQVKPEKVPRRRRTMADPDFWTHSDDYYKYCDEDSDSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RGESSSGYPEPKYPEDKREARSDQVKPEKVPRRRRTMADPDFWTHSDDYYKYCDEDSDSD
130 140 150 160 170 180

320 330 340 350 360 370
pF1KB5 KEWIAALRRKYRSREQTLSSSGESWETLPGKEEREPPQAKVSASTGTSPGPGASASAGAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KEWIAALRRKYRSREQTLSSSGESWETLPGKEEREPPQAKVSASTGTSPGPGASASAGAG
190 200 210 220 230 240

380 390 400 410 420 430
pF1KB5 AGASAGSNGSNYLEEVREPSLQEEQASLEEGEIPWLQYHENDSSSEGDNDSGHELMQPGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 AGASAGSNGSNYLEEVREPSLQEEQASLEEGEIPWLQYHENDSSSEGDNDSGHELMQPGV
250 260 270 280 290 300

440 450 460 470 480 490
pF1KB5 FMLDGNNNLEDDSSVSEDLEVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 FMLDGNNNLEDDSSVSEDLEVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQAM
310 320 330 340 350 360

500 510 520 530 540 550
pF1KB5 ETALAHLESLAVDVEVANPPASKESIDALPEILVTEDHGAVGQEMCCPICCSEYVKGEVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ETALAHLESLAVDVEVANPPASKESIDALPEILVTEDHGAVGQEMCCPICCSEYVKGEVA
370 380 390 400 410 420

560 570 580
pF1KB5 TELPCHHYFHKPCVSIWLQKSGTCPVCRCMFPPPL
:::::::::::::::::::::::::::::::::::
CCDS14 TELPCHHYFHKPCVSIWLQKSGTCPVCRCMFPPPL
430 440 450

>>CCDS4099.1 PJA2 gene_id:9867|Hs108|chr5 (708 aa)
 initn: 1353 init1: 1090 opt: 1211 Z-score: 912.0 bits: 179.0 E(32554): 1.8e-44
Smith-Waterman score: 1624; 46.9% identity (67.9% similar) in 605 aa overlap (4-586:107-680)

10 20 30
pF1KB5 MHRSAPSQTTKRSRSPFSTTRRSWDDSESSGTN
:: .:::. :.: : ....: . .. :..
CCDS40 SGSSPLDQVDSSLPSEPIFEKSETEIPTCGSALNQTTESSQS-FVAVHHSEEGRDTLGSS
80 90 100 110 120 130

40 50 60 70 80 90
pF1KB5 LNIDNEDYSRYPPREYRASGSRRGMAYGHIDSYGADDSEEEGAGPVERPPVRGKTGKFKD
:. :.. ..: : ::. . :.: : ::: : .. : .. . .....
CCDS40 TNLHNHSEGEYIPGACSASSVQNGIALVHTDSYDPDGKHGEDNDHLQLSAEVVEGSRYQE
140 150 160 170 180 190

100 110 120 130 140
pF1KB5 ---DKLYDPE-KGARSLAGPPPHFSSFSRDVREERDKLDPVPAARCSASRADFLPQSSVA
. ... : . :.. .: : ::. .::.: ..:: :: .. ::. ..:. :.:
CCDS40 SLGNTVFELENREAEAYTGLSPPVPSFNCEVRDEFEELDSVPLVKSSAGDTEFVHQNSQE
200 210 220 230 240 250

150 160 170 180 190 200
pF1KB5 SQSSSEGKLAT--KGDSSERERREQNLPARPSRAPVSICG-----GGENTSKSAEEPVVR
: ::. .... . ... .::. .. : . .: ::. :.. :. : :::
CCDS40 IQRSSQDEMVSTKQQNNTSQERQTEHSPEDAACGPGHICSEQNTNDREKNHGSSPEQVVR
260 270 280 290 300 310

210 220 230 240 250
pF1KB5 PKIRNLASPNCVKPKIFFDTDDDDDMPHSTSRWRDTANDNEGHSDGLARRGR---GESSS
::.:.: : . : . :. . . .:..:::.. . .:. :: : . . :: .
CCDS40 PKVRKLISSSQVDQETGFNRHEAKQ--RSVQRWREALEVEESGSDDLLIKCEEYDGEHDC
320 330 340 350 360 370

260 270 280 290 300 310
pF1KB5 GYPEPKYPE--DKREARSDQVKPEKVPRRRRTMADPDFWTHSDDYYKYCDEDSDS----D
. .: : . .::....:. :. : .: ::. :::. :.: :: :
CCDS40 MFLDPPYSRVITQRETENNQMTSESGATAGRQEVDNTFWNGCGDYYQLYDKDEDSSECSD
380 390 400 410 420 430

320 330 340 350 360 370
pF1KB5 KEWIAALRRKYRSREQTLSSSGESWETLPGKEEREPPQAKVSASTGTSPGPGASASAGAG
:: :.: ... . :. ::: :::::::::.: :: . : ::
CCDS40 GEWSASLPHRFSGTEKDQSSSDESWETLPGKDENEPEL------QSDSSGPE--------
440 450 460 470

380 390 400 410 420 430
pF1KB5 AGASAGSNGSNYLEEVREPSLQE-EQASLEEGEIPWLQYHE-NDSSSEGDNDSGHELMQP
:: .: :::: ::.::::::::::::.: :.:::. :. ..:. ::
CCDS40 -------------EENQELSLQEGEQTSLEEGEIPWLQYNEVNESSSDEGNEPANEFAQP
480 490 500 510 520

440 450 460 470 480 490
pF1KB5 GVFMLDGNNNLEDDSSVSEDLEVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQ
. :::::::::::::::::::.::::::::::::::::::::::::::::::::::::::
CCDS40 A-FMLDGNNNLEDDSSVSEDLDVDWSLFDGFADGLGVAEAISYVDPQFLTYMALEERLAQ
530 540 550 560 570 580

500 510 520 530 540 550
pF1KB5 AMETALAHLESLAVDVEVANPPASKESIDALPEILVTEDHGAVGQEMCCPICCSEYVKGE
:::::::::::::::::::::::::::::.::: :: ::: :.:::.:::::::::.: .
CCDS40 AMETALAHLESLAVDVEVANPPASKESIDGLPETLVLEDHTAIGQEQCCPICCSEYIKDD
590 600 610 620 630 640

560 570 580
pF1KB5 VATELPCHHYFHKPCVSIWLQKSGTCPVCRCMFPPPL
.::::::::.:::::::::::::::::::: :::
CCDS40 IATELPCHHFFHKPCVSIWLQKSGTCPVCRRHFPPAVIEASAAPSSEPDPDAPPSNDSIA
650 660 670 680 690 700

CCDS40 EAP

588 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 22:21:08 2016 done: Fri Nov 4 22:21:09 2016
 Total Scan time: 4.090 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]