FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8665, 499 aa 1>>>pF1KB8665 499 - 499 aa - 499 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.9238+/-0.000828; mu= 16.4532+/- 0.050 mean_var=88.6368+/-17.360, 0's: 0 Z-trim(108.3): 22 B-trim: 0 in 0/52 Lambda= 0.136228 statistics sampled from 10128 (10139) to 10128 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.311), width: 16 Scan time: 3.280 The best scores are: opt bits E(32554) CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19 ( 499) 3156 630.2 1.5e-180 CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2 ( 529) 427 93.9 4.6e-19 CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20 ( 594) 399 88.4 2.3e-17 >>CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19 (499 aa) initn: 3156 init1: 3156 opt: 3156 Z-score: 3354.7 bits: 630.2 E(32554): 1.5e-180 Smith-Waterman score: 3156; 99.8% identity (100.0% similar) in 499 aa overlap (1-499:1-499) 10 20 30 40 50 60 pF1KB8 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 LSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT :::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF 430 440 450 460 470 480 490 pF1KB8 SSMAEFLKVKGEKSGLMST ::::::::::::::::::: CCDS12 SSMAEFLKVKGEKSGLMST 490 >>CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2 (529 aa) initn: 402 init1: 324 opt: 427 Z-score: 455.7 bits: 93.9 E(32554): 4.6e-19 Smith-Waterman score: 437; 23.6% identity (61.2% similar) in 381 aa overlap (92-471:161-521) 70 80 90 100 110 120 pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR .::.: .: ....::: :. :. . CCDS23 EPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYIMRCREWYGWH 140 150 160 170 180 190 130 140 150 160 170 180 pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLSE :::: ... . : : . ....:. : . .:...: . . :...: ..: ..:: CCDS23 FPELGKIISDNLTYCKCLQKVGDR--KNYASAKLSELLPEEVEAEVKAAAEISMGTEVSE 200 210 220 230 240 190 200 210 220 230 240 pF1KB8 EELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTNL :.. . . : ...:.. . ..:::...:: ::::.....: ..:.... ::.: :: CCDS23 EDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIAHAGSLLNL 250 260 270 280 290 300 250 260 270 280 290 300 pF1KB8 SKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCTL .: : ....:::.. . ...: :. : :::...: . : . : .:..::: .: CCDS23 AKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISRMLAAKTVL 310 320 330 340 350 360 310 320 330 340 350 360 pF1KB8 AARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRK : : :.: :.. . .: : . ..: .. . : ..... : .. .: . : CCDS23 AIRYDAFGEDSSSAMGVENRAKLEARL-RTLEDRGIRKISGTGKAL-AKTEKYEHKSEVK 370 380 390 400 410 420 370 380 390 400 410 420 pF1KB8 MKERLGLTEIRKQANRMSFGEIE-EDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI . : . . ... .. ... :: : :. ..... ..:.: . .. CCDS23 TYDPSGDSTLPTCSKKRKIEQVDKEDEITEK---------KAKKAKIK-VKVEEEEEEKV 430 440 450 460 470 430 440 450 460 470 480 pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF .. . ...:.. : :. :... . : . :..:. .:: CCDS23 AEEEETSVKKKKK-RGKKKHIKEEPLSEEE-----PCTSTAIASPEKKKKKKKKRENED 480 490 500 510 520 490 pF1KB8 SSMAEFLKVKGEKSGLMST >>CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20 (594 aa) initn: 388 init1: 295 opt: 399 Z-score: 425.2 bits: 88.4 E(32554): 2.3e-17 Smith-Waterman score: 410; 24.7% identity (63.0% similar) in 308 aa overlap (92-386:167-474) 70 80 90 100 110 120 pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR .:... .: ......: . .:. :. . CCDS13 TDLSACKAQLGLGHSYSRAKVKFNVNRVDNMIIQSISLLDQLDKDINTFSMRVREWYGYH 140 150 160 170 180 190 130 140 150 160 170 pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNN-ENLQQILTNATIMVVSVTAS-TTQGQQL :::: ... . : : .. .:: . ... :.:... ... . . :: ...:... CCDS13 FPELVKIINDNATYCRLAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMGMDI 200 210 220 230 240 250 180 190 200 210 220 230 pF1KB8 SEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLT : .: .: . .. :. .. .. :..:.:: .::.:: .:: ...:.... ::.:: CCDS13 SAIDLINIESFSSRVVSLSEYRQSLHTYLRSKMSQVAPSLSALIGEAVGARLIAHAGSLT 260 270 280 290 300 310 240 250 260 270 280 290 pF1KB8 NLSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKC ::.: :: ....:::.. . .... . :. : :.:: .. . . .: .: :: CCDS13 NLAKYPASTVQILGAEKALFRALKTRGNTPKYGLIFHSTFIGRAAAKNKGRISRYLANKC 320 330 340 350 360 370 300 310 320 330 340 pF1KB8 TLAARVDSFHESTEGKVGYELKDEIERKFDKWQ--EPP---------PVKQVKPLPAPLD ..:.:.: : : . : .:....:.... .. : : . :.. : . CCDS13 SIASRIDCFSEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAEEAAAEIT 380 390 400 410 420 430 350 360 370 380 390 400 pF1KB8 GQRKKRGGRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVR . .:. .: .: :.::. . .. : : : :. CCDS13 RKLEKQEKKRLKKEKKRLAALALASSENSSSTPEECEEMSEKPKKKKKQKPQEVPQENGM 440 450 460 470 480 490 410 420 430 440 450 460 pF1KB8 QTQVNEATKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAA CCDS13 EDPSISFSKPKKKKSFSKEELMSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSGS 500 510 520 530 540 550 499 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 14:21:20 2016 done: Fri Nov 4 14:21:20 2016 Total Scan time: 3.280 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]