FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3002, 575 aa
1>>>pF1KB3002 575 - 575 aa - 575 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.3931+/-0.000903; mu= 1.1666+/- 0.055
mean_var=201.7245+/-40.351, 0's: 0 Z-trim(113.5): 8 B-trim: 0 in 0/54
Lambda= 0.090301
statistics sampled from 14140 (14148) to 14140 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.435), width: 16
Scan time: 4.210
The best scores are: opt bits E(32554)
CCDS7513.1 POLL gene_id:27343|Hs108|chr10 ( 575) 3868 516.4 3.9e-146
CCDS76332.1 POLL gene_id:27343|Hs108|chr10 ( 300) 1870 255.9 5.1e-68
CCDS6129.1 POLB gene_id:5423|Hs108|chr8 ( 335) 653 97.4 3e-20
CCDS34625.1 POLM gene_id:27434|Hs108|chr7 ( 494) 471 73.8 5.7e-13
>>CCDS7513.1 POLL gene_id:27343|Hs108|chr10 (575 aa)
initn: 3868 init1: 3868 opt: 3868 Z-score: 2737.0 bits: 516.4 E(32554): 3.9e-146
Smith-Waterman score: 3868; 100.0% identity (100.0% similar) in 575 aa overlap (1-575:1-575)
10 20 30 40 50 60
pF1KB3 MDPRGILKAFPKRQKIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MDPRGILKAFPKRQKIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAEL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 FEKQIVQHGGQLCPAQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 FEKQIVQHGGQLCPAQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 ERRLVDVAGFSIFIPSRYLDHPQPSKAEQDASIPPGTHEALLQTALSPPPPPTRPVSPPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ERRLVDVAGFSIFIPSRYLDHPQPSKAEQDASIPPGTHEALLQTALSPPPPPTRPVSPPQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 KAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 KAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 CAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 CAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEAC
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 SIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQMWYQQGFRSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 SIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQMWYQQGFRSL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 EDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 EDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRR
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 GKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 GKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLP
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 GPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 GPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNT
490 500 510 520 530 540
550 560 570
pF1KB3 HGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW
:::::::::::::::::::::::::::::::::::
CCDS75 HGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW
550 560 570
>>CCDS76332.1 POLL gene_id:27343|Hs108|chr10 (300 aa)
initn: 1870 init1: 1870 opt: 1870 Z-score: 1334.6 bits: 255.9 E(32554): 5.1e-68
Smith-Waterman score: 1870; 99.6% identity (100.0% similar) in 279 aa overlap (297-575:22-300)
270 280 290 300 310 320
pF1KB3 YSVQGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLD
.:::::::::::::::::::::::::::::
CCDS76 MLMHHQKYLQRFLGGKREKKQKEACSIPGIGKRMAEKIIEILESGHLRKLD
10 20 30 40 50
330 340 350 360 370 380
pF1KB3 HISESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 HISESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLER
60 70 80 90 100 110
390 400 410 420 430 440
pF1KB3 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR
120 130 140 150 160 170
450 460 470 480 490 500
pF1KB3 LLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYF
180 190 200 210 220 230
510 520 530 540 550 560
pF1KB3 TGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 TGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLP
240 250 260 270 280 290
570
pF1KB3 YREPAERDW
:::::::::
CCDS76 YREPAERDW
300
>>CCDS6129.1 POLB gene_id:5423|Hs108|chr8 (335 aa)
initn: 582 init1: 207 opt: 653 Z-score: 477.0 bits: 97.4 E(32554): 3e-20
Smith-Waterman score: 653; 35.1% identity (66.7% similar) in 345 aa overlap (245-573:2-333)
220 230 240 250 260
pF1KB3 ISGHYPTSLEGDCEPSPAPAVLDKWVCAQPSSQKATNHNLH--ITEKLEVLA---KAYSV
:..:: ...:. ::. : :: : :
CCDS61 MSKRKAPQETLNGGITDMLTELANFEKNVSQ
10 20 30
270 280 290 300 310 320
pF1KB3 QGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLDHI-
:. : : :: ... .. . . : :: ..::.: ..:::: :.: .:.::::..:
CCDS61 AIHKYNA--YRKAASVIAKYPHKIKSGAEAKKLPGVGTKIAEKIDEFLATGKLRKLEKIR
40 50 60 70 80
330 340 350 360 370 380
pF1KB3 -SESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIR-SQASLTTQQAIGLKHYSDFLER
... ..... . : : ..:. . ..:...:::.: .. .:. .: ::::...:: .:
CCDS61 QDDTSSSINFLTRVSGIGPSAARKFVDEGIKTLEDLRKNEDKLNHHQRIGLKYFGDFEKR
90 100 110 120 130 140
390 400 410 420 430 440
pF1KB3 MPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSR
.:::: .... : . .. .: . ..:::.::: . ::.:::.:::. :. .
CCDS61 IPREEMLQMQDIVLNEVKKVDSEYIATVCGSFRRGAESSGDMDVLLTHPSFTSESTKQPK
150 160 170 180 190 200
450 460 470 480 490
pF1KB3 LL----DSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRR----HRRLDIIVVPYSE
:: ..:.. :.:: : . : :..:::.::. . . :::.:: ..: ..
CCDS61 LLHQVVEQLQKVHFITDTLSKGET-----KFMGVCQLPSKNDEKEYPHRRIDIRLIPKDQ
210 220 230 240 250 260
500 510 520 530 540 550
pF1KB3 FACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVLPTPTEKD
. :..:::::: ::..::: : ::....:... : .. :. ::. .:::
CCDS61 YYCGVLYFTGSDIFNKNMRAHALEKGFTINEYTIRPLGVTGV------AGEPLPVDSEKD
270 280 290 300 310
560 570
pF1KB3 VFRLLGLPYREPAERDW
.: . :::: .:
CCDS61 IFDYIQWKYREPKDRSE
320 330
>>CCDS34625.1 POLM gene_id:27434|Hs108|chr7 (494 aa)
initn: 289 init1: 154 opt: 471 Z-score: 346.3 bits: 73.8 E(32554): 5.7e-13
Smith-Waterman score: 591; 30.9% identity (61.0% similar) in 372 aa overlap (232-574:134-493)
210 220 230 240 250 260
pF1KB3 EETQVSAADLEALISGHYPTSLEGDCEPSPAPAVLDKWVCAQPSSQKATNHNLHITEKLE
.:: . ..: .:. :.:: ..: ::
CCDS34 WLTESLGAGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACQRPTPL--THHNTGLSEALE
110 120 130 140 150 160
270 280 290 300 310 320
pF1KB3 VLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAEKIIEILESGH
.::.: . .:.. : : . .: ..::.. .:::. .. ..: .:.. .. . :.:: :
CCDS34 ILAEAAGFEGSEGRLLTFCRAASVLKALPSPVTTLSQLQGLPHFGEHSSRVVQELLEHGV
170 180 190 200 210 220
330 340 350 360 370
pF1KB3 LRKLDHI--SESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQAS-LTTQQAIGLK
...... :: ...::..:.:.:.:::. ::..:.:.:.:.: : . :: :: ::.
CCDS34 CEEVERVRRSERYQTMKLFTQIFGVGVKTADRWYREGLRTLDDLREQPQKLTQQQKAGLQ
230 240 250 260 270 280
380 390 400 410 420 430
pF1KB3 HYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVLITHPDGR
:..:. . : .. ..:.:..:. : . :..:::: ::: :::::
CCDS34 HHQDLSTPVLRSDVDALQQVVEEAVGQALPGATVTLTGGFRRGKLQGHDVDFLITHPKEG
290 300 310 320 330 340
440 450 460 470 480
pF1KB3 SHRGIFSRLLDSLRQEGFLT------------DDLVSQEENGQQQKYLGVCRLPGP----
.. :.. :.. :...:.. :..: . .. . . ::: :
CCDS34 QEAGLLPRVMCRLQDQGLILYHQHQHSCCESPTRLAQQSHMDAFERSFCIFRLPQPPGAA
350 360 370 380 390 400
490 500 510 520 530
pF1KB3 -GRRHR--------RLDIIVVPYSEFACALLYFTGSAHFNRSMRALA-KTKGMSLSEHAL
: : :.:..:.: :.: ::: .::: :.: .: .. : ::. :. :.:
CCDS34 VGGSTRPCPSWKAVRVDLVVAPVSQFPFALLGWTGSKLFQRELRRFSRKEKGLWLNSHGL
410 420 430 440 450 460
540 550 560 570
pF1KB3 STAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW
.. . . .:.:.:: ::: : : .:.
CCDS34 FDPEQKT----------FFQAASEEDIFRHLGLEYLPPEQRNA
470 480 490
575 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 12:02:27 2016 done: Thu Nov 3 12:02:27 2016
Total Scan time: 4.210 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]