FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA1667, 708 aa
  1>>>pF1KA1667 708 - 708 aa - 708 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.1846+/-0.000941; mu= 6.7067+/- 0.057
 mean_var=149.5952+/-29.664, 0's: 0 Z-trim(110.3): 17 B-trim: 0 in 0/50
 Lambda= 0.104861
 statistics sampled from 11496 (11502) to 11496 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.353), width: 16
 Scan time: 4.310

The best scores are:                                opt   bits  E(32554)
CCDS13835.1 HPS4 gene_id:89781|Hs108|chr22 ( 708)  4727  727.2  2e-209
CCDS46677.1 HPS4 gene_id:89781|Hs108|chr22 ( 703)  4644  714.7  1.2e-205

>>CCDS13835.1 HPS4 gene_id:89781|Hs108|chr22 (708 aa)
 initn: 4727 init1: 4727 opt: 4727 Z-score: 3873.5 bits: 727.2 E(32554): 2e-209
Smith-Waterman score: 4727; 99.3% identity (99.9% similar) in 708 aa overlap (1-708:1-708)

               10        20        30        40        50        60
pF1KA1 MATSTSTEAKSASWWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MATSTSTEAKSASWWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 VRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 GPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGGDAPQEHGAALP
       :::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::
CCDS13 QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 PNVQIIPVFVTKEEAISLHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PNVQIIPVFVTKEEAISLHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGH
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 VESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KA1 VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDT
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KA1 AISSLRPPSAPEMLTQHGAQEQVEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRG
       ::::::::::::::::::::::.:::::::::::::::::::::::::::::::::::::
CCDS13 AISSLRPPSAPEMLTQHGAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRG
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KA1 NKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KA1 MNLYTHCVKGLMLSLLAEEPLLGDSAAIEEVYHSSLASLNGLEVHLKETLPRDEAASTSS
       :::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVYHSSLASLNGLEVHLKETLPRDEAASTSS
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KA1 TYNFTYYDRIQSLLMANLPQVATPHDRRFLQAVSLMHSEFAQLPALYEMTVRNASTAVYA
       :::::.::::::::::::::::::.:::::::::::::::::::::::::::::::::::
CCDS13 TYNFTHYDRIQSLLMANLPQVATPQDRRFLQAVSLMHSEFAQLPALYEMTVRNASTAVYA
              610       620       630       640       650       660

              670       680       690       700
pF1KA1 CCNPIQETYFQQLAPAARSSGFPNPQDGAFSLSGKAKQKLLKHGVNLL
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 CCNPIQETYFQQLAPAARSSGFPNPQDGAFSLSGKAKQKLLKHGVNLL
              670       680       690       700

>>CCDS46677.1 HPS4 gene_id:89781|Hs108|chr22 (703 aa)
 initn: 4644 init1: 4644 opt: 4644 Z-score: 3805.7 bits: 714.7 E(32554): 1.2e-205
Smith-Waterman score: 4644; 98.9% identity (99.6% similar) in 698 aa overlap (11-708:6-703)

               10        20        30        40        50        60
pF1KA1 MATSTSTEAKSASWWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGV
                 : . ::::::::::::::::::::::::::::::::::::::::::::::
CCDS46      MAPLCSLARWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGV
                    10        20        30        40        50

               70        80        90       100       110       120
pF1KA1 VRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYN
          60        70        80        90       100       110

              130       140       150       160       170       180
pF1KA1 GPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
         120       130       140       150       160       170

              190       200       210       220       230       240
pF1KA1 QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGGDAPQEHGAALP
       :::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::
CCDS46 QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALP
         180       190       200       210       220       230

              250       260       270       280       290       300
pF1KA1 PNVQIIPVFVTKEEAISLHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 PNVQIIPVFVTKEEAISLHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGH
         240       250       260       270       280       290

              310       320       330       340       350       360
pF1KA1 VESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
         300       310       320       330       340       350

              370       380       390       400       410       420
pF1KA1 VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDT
         360       370       380       390       400       410

              430       440       450       460       470       480
pF1KA1 AISSLRPPSAPEMLTQHGAQEQVEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRG
       ::::::::::::::::::::::.:::::::::::::::::::::::::::::::::::::
CCDS46 AISSLRPPSAPEMLTQHGAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRG
         420       430       440       450       460       470

              490       500       510       520       530       540
pF1KA1 NKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 NKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
         480       490       500       510       520       530

              550       560       570       580       590       600
pF1KA1 MNLYTHCVKGLMLSLLAEEPLLGDSAAIEEVYHSSLASLNGLEVHLKETLPRDEAASTSS
       :::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVYHSSLASLNGLEVHLKETLPRDEAASTSS
         540       550       560       570       580       590

              610       620       630       640       650       660
pF1KA1 TYNFTYYDRIQSLLMANLPQVATPHDRRFLQAVSLMHSEFAQLPALYEMTVRNASTAVYA
       :::::.::::::::::::::::::.:::::::::::::::::::::::::::::::::::
CCDS46 TYNFTHYDRIQSLLMANLPQVATPQDRRFLQAVSLMHSEFAQLPALYEMTVRNASTAVYA
         600       610       620       630       640       650

              670       680       690       700
pF1KA1 CCNPIQETYFQQLAPAARSSGFPNPQDGAFSLSGKAKQKLLKHGVNLL
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 CCNPIQETYFQQLAPAARSSGFPNPQDGAFSLSGKAKQKLLKHGVNLL
         660       670       680       690       700

708 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 01:42:00 2016 done: Fri Nov 4 01:42:00 2016
 Total Scan time: 4.310 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]