FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8403, 715 aa 1>>>pF1KB8403 715 - 715 aa - 715 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.5560+/-0.00103; mu= 14.1076+/- 0.062 mean_var=90.9061+/-17.568, 0's: 0 Z-trim(105.2): 13 B-trim: 0 in 0/51 Lambda= 0.134517 statistics sampled from 8297 (8304) to 8297 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.627), E-opt: 0.2 (0.255), width: 16 Scan time: 3.000 The best scores are: opt bits E(32554) CCDS11954.2 POLI gene_id:11201|Hs108|chr18 ( 740) 4692 921.2 0 CCDS4030.1 POLK gene_id:51426|Hs108|chr5 ( 870) 317 72.2 3.7e-12 >>CCDS11954.2 POLI gene_id:11201|Hs108|chr18 (740 aa) initn: 4692 init1: 4692 opt: 4692 Z-score: 4921.5 bits: 921.2 E(32554): 0 Smith-Waterman score: 4692; 99.6% identity (99.9% similar) in 715 aa overlap (1-715:26-740) 10 20 30 pF1KB8 MELADVGAAASSQGVHDQVLPTPNASSRVIVHVDL ::::::::::::::::::::::::::::::::::: CCDS11 MEKLGVEPEEEGGGDDDEEDAEAWAMELADVGAAASSQGVHDQVLPTPNASSRVIVHVDL 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB8 DCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVN 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB8 GEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSG 130 140 150 160 170 180 160 170 180 190 200 210 pF1KB8 HVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLLAKLVSGVFKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 HVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLLAKLVSGVFKP 190 200 210 220 230 240 220 230 240 250 260 270 pF1KB8 NQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFSPKILEKELGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 NQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFSPKILEKELGI 250 260 270 280 290 300 280 290 300 310 320 330 pF1KB8 SVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCTSEVEAKNKIEELLASLLNRVCQD ::::::::::::::::::::::::::::::::::::.::::::::::::::::::::::: CCDS11 SVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCSSEVEAKNKIEELLASLLNRVCQD 310 320 330 340 350 360 340 350 360 370 380 390 pF1KB8 GRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQKLGTGNYDVMTPMVDILMKLFRNMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQKLGTGNYDVMTPMVDILMKLFRNMV 370 380 390 400 410 420 400 410 420 430 440 450 pF1KB8 NVKMPFHLTLLSVCFCNLKALNTAKKGLIDYYLMPSLSTTSRSGKHSFKMKDTHMEDFPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 NVKMPFHLTLLSVCFCNLKALNTAKKGLIDYYLMPSLSTTSRSGKHSFKMKDTHMEDFPK 430 440 450 460 470 480 460 470 480 490 500 510 pF1KB8 DKETNRDFLPSGRIESTRTRESPLDTTNFSKEKDINEFPLCSLPEGVDQEVSKQLPVDIQ ::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::: CCDS11 DKETNRDFLPSGRIESTRTRESPLDTTNFSKEKDINEFPLCSLPEGVDQEVFKQLPVDIQ 490 500 510 520 530 540 520 530 540 550 560 570 pF1KB8 EEILSGKSREKFQGKGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EEILSGKSREKFQGKGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPC 550 560 570 580 590 600 580 590 600 610 620 630 pF1KB8 EPGTSGFNSSSSSYMSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EPGTSGFNSSSSSYMSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFP 610 620 630 640 650 660 640 650 660 670 680 690 pF1KB8 NLQSEQLFSRNHTTDSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 NLQSEQLFSRNHTTDSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAV 670 680 690 700 710 720 700 710 pF1KB8 QKELLAEWKRTGSDFHIGHK ::::::::::.::::::::: CCDS11 QKELLAEWKRAGSDFHIGHK 730 740 >>CCDS4030.1 POLK gene_id:51426|Hs108|chr5 (870 aa) initn: 358 init1: 237 opt: 317 Z-score: 331.8 bits: 72.2 E(32554): 3.7e-12 Smith-Waterman score: 436; 23.8% identity (53.0% similar) in 677 aa overlap (27-636:100-753) 10 20 30 40 50 pF1KB8 MELADVGAAASSQGVHDQVLPTPNASSRVIVHVDLDCFYAQVEMISNPELKDKPLG : .:::.:.: ::: ::: .::::::::.. CCDS40 QKAQITSQQLRKAQLQVDRFAMELEQSRNLSNTIVHIDMDAFYAAVEMRDNPELKDKPIA 70 80 90 100 110 120 60 70 80 90 100 110 pF1KB8 VQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVNGEDLTRYREMSYKVTELLEEF : . .. : ::.::..::. : ::. ::::..: .. .:: .: .: :.: .. CCDS40 VGSMSMLSTSNYHARRFGVRAAMPGFIAKRLCPQLIIVP-PNFDKYRAVSKEVKEILADY 130 140 150 160 170 180 120 130 140 150 160 pF1KB8 SPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSGHVYNN---QSINLLD------ .: ...:: ....:. .:.: . .. . . ... : :. . .: :. CCDS40 DPNFMAMSLDEAYLNITKHLEERQNWPEDKRRYFIKMGSSVENDNPGKEVNKLSEHERSI 190 200 210 220 230 240 170 180 190 pF1KB8 ------------------------------VLHIRLLVGS---QIAAEMREAMYNQLGLT .:. .. :. ... :.: . .. :: CCDS40 SPLLFEESPSDVQPPGDPFQVNFEEQNNPQILQNSVVFGTSAQEVVKEIRFRIEQKTTLT 250 260 270 280 290 300 200 210 220 230 240 250 pF1KB8 GCAGVASNKLLAKLVSGVFKPNQQTVLLP--ESCQHLIHSLNHIKEIPGIGYKTAKCLEA . ::.: : .:::. : ::: : .:: .. . .:..: :... ::: : : :.: CCDS40 ASAGIAPNTMLAKVCSDKNKPNGQYQILPNRQAVMDFIKDLP-IRKVSGIGKVTEKMLKA 310 320 330 340 350 360 260 270 280 290 300 310 pF1KB8 LGINSVRDLQTFSPKILEKELGISVA-QRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKC ::: . .: .. . : . : .. . . ..:.: .. . .: .:.: : .:.. CCDS40 LGIITCTEL--YQQRALLSLLFSETSWHYFLHISLGLGSTHLTRDGERKSMSVERTFSEI 370 380 390 400 410 420 320 330 340 350 360 pF1KB8 TSEVEAKNKIEELLASLLNRVCQDGRKPHTVRLIIRRYSSEKHYGRESRQCPIPSH---- .. : . .:: . : . . .. : .:: . .. . : . : : . : CCDS40 NKAEEQYSLCQELCSELAQDLQKERLKGRTVTIKLKNVNFEVKT-RASTVSSVVSTAEEI 430 440 450 460 470 480 370 380 390 400 410 pF1KB8 --VIQKLGTGNYDVMTP------MVDILMKLFRNMVNVKMPFHLTLLSVCFCNLKALN-- . ..: . :. : .. . .. : : . : . .... . .::. CCDS40 FAIAKELLKTEIDADFPHPLRLRLMGVRISSFPNEEDRKHQ-QRSIIGFLQAGNQALSAT 490 500 510 520 530 540 420 430 440 450 460 470 pF1KB8 --TAKKGLIDYYLMPSLSTTSRS--GKHSFKMKDTHMEDFPKDKETNRDFLPSGRIESTR : .: : .. : . ..: :. . : .:.. : . ....: : .. . CCDS40 ECTLEKTDKDKFVKPLEMSHKKSFFDKKRSERKWSHQDTFKCEAVNKQSFQTSQPFQVLK 550 560 570 580 590 600 480 490 500 510 520 pF1KB8 TRESP-LDTTNFSKEKDINEFPLCSLPEGVDQEVSKQLPVDIQEEILSGKS-REKFQ--G . . :. .. : . .: :.: .: . . . :: : :.: : :.:. . CCDS40 KKMNENLEISENSDDCQILTCPVCFRAQGCISLEALNKHVD---ECLDGPSISENFKMFS 610 620 630 640 650 660 530 540 550 560 570 580 pF1KB8 KGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPCEPGTSGFNSSSSSY . :: .. . : . :: .:. :..:::. : .. ...::.. CCDS40 CSHVSATKVNKKENVPASSLCEKQDYEAHPK-----IKEISSVD-CIALVDTIDNSSKA- 670 680 690 700 710 590 600 610 620 630 640 pF1KB8 MSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFPNLQSEQLFSRNHTT . :.:. . :. :. :. ..:.. . . :. :. : CCDS40 -----ESIDALSNKHSKEECSSLPS--KSFNIEHCHQNSSSTVSLENEDVGSFRQEYRQP 720 730 740 750 760 650 660 670 680 690 700 pF1KB8 DSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAVQKELLAEWKRTGSD CCDS40 YLCEVKTGQALVCPVCNVEQKTSDLTLFNVHVDVCLNKSFIQELRKDKFNPVNQPKESSR 770 780 790 800 810 820 715 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 12:38:04 2016 done: Fri Nov 4 12:38:04 2016 Total Scan time: 3.000 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]