FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3002, 575 aa 1>>>pF1KB3002 575 - 575 aa - 575 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.3931+/-0.000903; mu= 1.1666+/- 0.055 mean_var=201.7245+/-40.351, 0's: 0 Z-trim(113.5): 8 B-trim: 0 in 0/54 Lambda= 0.090301 statistics sampled from 14140 (14148) to 14140 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.435), width: 16 Scan time: 4.210 The best scores are: opt bits E(32554) CCDS7513.1 POLL gene_id:27343|Hs108|chr10 ( 575) 3868 516.4 3.9e-146 CCDS76332.1 POLL gene_id:27343|Hs108|chr10 ( 300) 1870 255.9 5.1e-68 CCDS6129.1 POLB gene_id:5423|Hs108|chr8 ( 335) 653 97.4 3e-20 CCDS34625.1 POLM gene_id:27434|Hs108|chr7 ( 494) 471 73.8 5.7e-13 >>CCDS7513.1 POLL gene_id:27343|Hs108|chr10 (575 aa) initn: 3868 init1: 3868 opt: 3868 Z-score: 2737.0 bits: 516.4 E(32554): 3.9e-146 Smith-Waterman score: 3868; 100.0% identity (100.0% similar) in 575 aa overlap (1-575:1-575) 10 20 30 40 50 60 pF1KB3 MDPRGILKAFPKRQKIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MDPRGILKAFPKRQKIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAEL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 FEKQIVQHGGQLCPAQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 FEKQIVQHGGQLCPAQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 ERRLVDVAGFSIFIPSRYLDHPQPSKAEQDASIPPGTHEALLQTALSPPPPPTRPVSPPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ERRLVDVAGFSIFIPSRYLDHPQPSKAEQDASIPPGTHEALLQTALSPPPPPTRPVSPPQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 KAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 KAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 CAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 CAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEAC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 SIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQMWYQQGFRSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 SIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQMWYQQGFRSL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 EDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 EDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRR 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 GKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLP 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 GPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNT 490 500 510 520 530 540 550 560 570 pF1KB3 HGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW ::::::::::::::::::::::::::::::::::: CCDS75 HGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW 550 560 570 >>CCDS76332.1 POLL gene_id:27343|Hs108|chr10 (300 aa) initn: 1870 init1: 1870 opt: 1870 Z-score: 1334.6 bits: 255.9 E(32554): 5.1e-68 Smith-Waterman score: 1870; 99.6% identity (100.0% similar) in 279 aa overlap (297-575:22-300) 270 280 290 300 310 320 pF1KB3 YSVQGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLD .::::::::::::::::::::::::::::: CCDS76 MLMHHQKYLQRFLGGKREKKQKEACSIPGIGKRMAEKIIEILESGHLRKLD 10 20 30 40 50 330 340 350 360 370 380 pF1KB3 HISESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 HISESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLER 60 70 80 90 100 110 390 400 410 420 430 440 pF1KB3 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR 120 130 140 150 160 170 450 460 470 480 490 500 pF1KB3 LLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 LLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYF 180 190 200 210 220 230 510 520 530 540 550 560 pF1KB3 TGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 TGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLP 240 250 260 270 280 290 570 pF1KB3 YREPAERDW ::::::::: CCDS76 YREPAERDW 300 >>CCDS6129.1 POLB gene_id:5423|Hs108|chr8 (335 aa) initn: 582 init1: 207 opt: 653 Z-score: 477.0 bits: 97.4 E(32554): 3e-20 Smith-Waterman score: 653; 35.1% identity (66.7% similar) in 345 aa overlap (245-573:2-333) 220 230 240 250 260 pF1KB3 ISGHYPTSLEGDCEPSPAPAVLDKWVCAQPSSQKATNHNLH--ITEKLEVLA---KAYSV :..:: ...:. ::. : :: : : CCDS61 MSKRKAPQETLNGGITDMLTELANFEKNVSQ 10 20 30 270 280 290 300 310 320 pF1KB3 QGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLDHI- :. : : :: ... .. . . : :: ..::.: ..:::: :.: .:.::::..: CCDS61 AIHKYNA--YRKAASVIAKYPHKIKSGAEAKKLPGVGTKIAEKIDEFLATGKLRKLEKIR 40 50 60 70 80 330 340 350 360 370 380 pF1KB3 -SESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIR-SQASLTTQQAIGLKHYSDFLER ... ..... . : : ..:. . ..:...:::.: .. .:. .: ::::...:: .: CCDS61 QDDTSSSINFLTRVSGIGPSAARKFVDEGIKTLEDLRKNEDKLNHHQRIGLKYFGDFEKR 90 100 110 120 130 140 390 400 410 420 430 440 pF1KB3 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR .:::: .... : . .. .: . ..:::.::: . ::.:::.:::. :. . CCDS61 IPREEMLQMQDIVLNEVKKVDSEYIATVCGSFRRGAESSGDMDVLLTHPSFTSESTKQPK 150 160 170 180 190 200 450 460 470 480 490 pF1KB3 LL----DSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRR----HRRLDIIVVPYSE :: ..:.. :.:: : . : :..:::.::. . . :::.:: ..: .. CCDS61 LLHQVVEQLQKVHFITDTLSKGET-----KFMGVCQLPSKNDEKEYPHRRIDIRLIPKDQ 210 220 230 240 250 260 500 510 520 530 540 550 pF1KB3 FACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKD . :..:::::: ::..::: : ::....:... : .. :. ::. .::: CCDS61 YYCGVLYFTGSDIFNKNMRAHALEKGFTINEYTIRPLGVTGV------AGEPLPVDSEKD 270 280 290 300 310 560 570 pF1KB3 VFRLLGLPYREPAERDW .: . :::: .: CCDS61 IFDYIQWKYREPKDRSE 320 330 >>CCDS34625.1 POLM gene_id:27434|Hs108|chr7 (494 aa) initn: 289 init1: 154 opt: 471 Z-score: 346.3 bits: 73.8 E(32554): 5.7e-13 Smith-Waterman score: 591; 30.9% identity (61.0% similar) in 372 aa overlap (232-574:134-493) 210 220 230 240 250 260 pF1KB3 EETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWVCAQPSSQKATNHNLHITEKLE .:: . ..: .:. :.:: ..: :: CCDS34 WLTESLGAGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACQRPTPL--THHNTGLSEALE 110 120 130 140 150 160 270 280 290 300 310 320 pF1KB3 VLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGH .::.: . .:.. : : . .: ..::.. .:::. .. ..: .:.. .. . :.:: : CCDS34 ILAEAAGFEGSEGRLLTFCRAASVLKALPSPVTTLSQLQGLPHFGEHSSRVVQELLEHGV 170 180 190 200 210 220 330 340 350 360 370 pF1KB3 LRKLDHI--SESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQAS-LTTQQAIGLK ...... :: ...::..:.:.:.:::. ::..:.:.:.:.: : . :: :: ::. CCDS34 CEEVERVRRSERYQTMKLFTQIFGVGVKTADRWYREGLRTLDDLREQPQKLTQQQKAGLQ 230 240 250 260 270 280 380 390 400 410 420 430 pF1KB3 HYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGR :..:. . : .. ..:.:..:. : . :..:::: ::: ::::: CCDS34 HHQDLSTPVLRSDVDALQQVVEEAVGQALPGATVTLTGGFRRGKLQGHDVDFLITHPKEG 290 300 310 320 330 340 440 450 460 470 480 pF1KB3 SHRGIFSRLLDSLRQEGFLT------------DDLVSQEENGQQQKYLGVCRLPGP---- .. :.. :.. :...:.. :..: . .. . . ::: : CCDS34 QEAGLLPRVMCRLQDQGLILYHQHQHSCCESPTRLAQQSHMDAFERSFCIFRLPQPPGAA 350 360 370 380 390 400 490 500 510 520 530 pF1KB3 -GRRHR--------RLDIIVVPYSEFACALLYFTGSAHFNRSMRALA-KTKGMSLSEHAL : : :.:..:.: :.: ::: .::: :.: .: .. : ::. :. :.: CCDS34 VGGSTRPCPSWKAVRVDLVVAPVSQFPFALLGWTGSKLFQRELRRFSRKEKGLWLNSHGL 410 420 430 440 450 460 540 550 560 570 pF1KB3 STAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW .. . . .:.:.:: ::: : : .:. CCDS34 FDPEQKT----------FFQAASEEDIFRHLGLEYLPPEQRNA 470 480 490 575 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 12:02:27 2016 done: Thu Nov 3 12:02:27 2016 Total Scan time: 4.210 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]