FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8665, 499 aa
1>>>pF1KB8665 499 - 499 aa - 499 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9238+/-0.000828; mu= 16.4532+/- 0.050
mean_var=88.6368+/-17.360, 0's: 0 Z-trim(108.3): 22 B-trim: 0 in 0/52
Lambda= 0.136228
statistics sampled from 10128 (10139) to 10128 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.311), width: 16
Scan time: 3.280
The best scores are: opt bits E(32554)
CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19 ( 499) 3156 630.2 1.5e-180
CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2 ( 529) 427 93.9 4.6e-19
CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20 ( 594) 399 88.4 2.3e-17
>>CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19 (499 aa)
initn: 3156 init1: 3156 opt: 3156 Z-score: 3354.7 bits: 630.2 E(32554): 1.5e-180
Smith-Waterman score: 3156; 99.8% identity (100.0% similar) in 499 aa overlap (1-499:1-499)
10 20 30 40 50 60
pF1KB8 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 LSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
:::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB8 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB8 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
430 440 450 460 470 480
490
pF1KB8 SSMAEFLKVKGEKSGLMST
:::::::::::::::::::
CCDS12 SSMAEFLKVKGEKSGLMST
490
>>CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2 (529 aa)
initn: 402 init1: 324 opt: 427 Z-score: 455.7 bits: 93.9 E(32554): 4.6e-19
Smith-Waterman score: 437; 23.6% identity (61.2% similar) in 381 aa overlap (92-471:161-521)
70 80 90 100 110 120
pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR
.::.: .: ....::: :. :. .
CCDS23 EPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYIMRCREWYGWH
140 150 160 170 180 190
130 140 150 160 170 180
pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLSE
:::: ... . : : . ....:. : . .:...: . . :...: ..: ..::
CCDS23 FPELGKIISDNLTYCKCLQKVGDR--KNYASAKLSELLPEEVEAEVKAAAEISMGTEVSE
200 210 220 230 240
190 200 210 220 230 240
pF1KB8 EELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTNL
:.. . . : ...:.. . ..:::...:: ::::.....: ..:.... ::.: ::
CCDS23 EDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIAHAGSLLNL
250 260 270 280 290 300
250 260 270 280 290 300
pF1KB8 SKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCTL
.: : ....:::.. . ...: :. : :::...: . : . : .:..::: .:
CCDS23 AKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISRMLAAKTVL
310 320 330 340 350 360
310 320 330 340 350 360
pF1KB8 AARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRK
: : :.: :.. . .: : . ..: .. . : ..... : .. .: . :
CCDS23 AIRYDAFGEDSSSAMGVENRAKLEARL-RTLEDRGIRKISGTGKAL-AKTEKYEHKSEVK
370 380 390 400 410 420
370 380 390 400 410 420
pF1KB8 MKERLGLTEIRKQANRMSFGEIE-EDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
. : . . ... .. ... :: : :. ..... ..:.: . ..
CCDS23 TYDPSGDSTLPTCSKKRKIEQVDKEDEITEK---------KAKKAKIK-VKVEEEEEEKV
430 440 450 460 470
430 440 450 460 470 480
pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
.. . ...:.. : :. :... . : . :..:. .::
CCDS23 AEEEETSVKKKKK-RGKKKHIKEEPLSEEE-----PCTSTAIASPEKKKKKKKKRENED
480 490 500 510 520
490
pF1KB8 SSMAEFLKVKGEKSGLMST
>>CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20 (594 aa)
initn: 388 init1: 295 opt: 399 Z-score: 425.2 bits: 88.4 E(32554): 2.3e-17
Smith-Waterman score: 410; 24.7% identity (63.0% similar) in 308 aa overlap (92-386:167-474)
70 80 90 100 110 120
pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR
.:... .: ......: . .:. :. .
CCDS13 TDLSACKAQLGLGHSYSRAKVKFNVNRVDNMIIQSISLLDQLDKDINTFSMRVREWYGYH
140 150 160 170 180 190
130 140 150 160 170
pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNN-ENLQQILTNATIMVVSVTAS-TTQGQQL
:::: ... . : : .. .:: . ... :.:... ... . . :: ...:...
CCDS13 FPELVKIINDNATYCRLAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMGMDI
200 210 220 230 240 250
180 190 200 210 220 230
pF1KB8 SEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLT
: .: .: . .. :. .. .. :..:.:: .::.:: .:: ...:.... ::.::
CCDS13 SAIDLINIESFSSRVVSLSEYRQSLHTYLRSKMSQVAPSLSALIGEAVGARLIAHAGSLT
260 270 280 290 300 310
240 250 260 270 280 290
pF1KB8 NLSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKC
::.: :: ....:::.. . .... . :. : :.:: .. . . .: .: ::
CCDS13 NLAKYPASTVQILGAEKALFRALKTRGNTPKYGLIFHSTFIGRAAAKNKGRISRYLANKC
320 330 340 350 360 370
300 310 320 330 340
pF1KB8 TLAARVDSFHESTEGKVGYELKDEIERKFDKWQ--EPP---------PVKQVKPLPAPLD
..:.:.: : : . : .:....:.... .. : : . :.. : .
CCDS13 SIASRIDCFSEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAEEAAAEIT
380 390 400 410 420 430
350 360 370 380 390 400
pF1KB8 GQRKKRGGRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVR
. .:. .: .: :.::. . .. : : : :.
CCDS13 RKLEKQEKKRLKKEKKRLAALALASSENSSSTPEECEEMSEKPKKKKKQKPQEVPQENGM
440 450 460 470 480 490
410 420 430 440 450 460
pF1KB8 QTQVNEATKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAA
CCDS13 EDPSISFSKPKKKKSFSKEELMSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSGS
500 510 520 530 540 550
499 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 14:21:20 2016 done: Fri Nov 4 14:21:20 2016
Total Scan time: 3.280 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]