FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8665, 499 aa
  1>>>pF1KB8665 499 - 499 aa - 499 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3409+/-0.000346; mu= 20.1030+/- 0.022
 mean_var=93.3721+/-18.440, 0's: 0 Z-trim(115.8): 15 B-trim: 156 in 1/59
 Lambda= 0.132729
 statistics sampled from 26422 (26436) to 26422 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.31), width: 16
 Scan time: 10.520

The best scores are:                                       opt  bits E(85289)
XP_006723200 (OMIM: 600138,606419) PREDICTED: U4/U ( 499) 3156 614.6 2.1e-175
NP_056444 (OMIM: 600138,606419) U4/U6 small nuclea ( 499) 3156 614.6 2.1e-175
NP_057018 (OMIM: 616742) nucleolar protein 58 [Hom ( 529)  427  92.0 4.4e-18
NP_006383 (OMIM: 614153,614154) nucleolar protein  ( 594)  399  86.7 2e-16

>>XP_006723200 (OMIM: 600138,606419) PREDICTED: U4/U6 sm (499 aa)
 initn: 3156 init1: 3156 opt: 3156 Z-score: 3270.1 bits: 614.6 E(85289): 2.1e-175
Smith-Waterman score: 3156; 99.8% identity (100.0% similar) in 499 aa overlap (1-499:1-499)

               10        20        30        40        50        60
pF1KB8 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 LSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
       :::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 LSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB8 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
              430       440       450       460       470       480

              490
pF1KB8 SSMAEFLKVKGEKSGLMST
       :::::::::::::::::::
XP_006 SSMAEFLKVKGEKSGLMST
              490

>>NP_056444 (OMIM: 600138,606419) U4/U6 small nuclear ri (499 aa)
 initn: 3156 init1: 3156 opt: 3156 Z-score: 3270.1 bits: 614.6 E(85289): 2.1e-175
Smith-Waterman score: 3156; 99.8% identity (100.0% similar) in 499 aa overlap (1-499:1-499)

               10        20        30        40        50        60
pF1KB8 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDSVKTIAKLWDSKMF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 AEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 RFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 EEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 LSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
       :::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 LSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCT
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 LAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB8 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 KMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_056 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
              430       440       450       460       470       480

              490
pF1KB8 SSMAEFLKVKGEKSGLMST
       :::::::::::::::::::
NP_056 SSMAEFLKVKGEKSGLMST
              490

>>NP_057018 (OMIM: 616742) nucleolar protein 58 [Homo sa (529 aa)
 initn: 402 init1: 324 opt: 427 Z-score: 445.6 bits: 92.0 E(85289): 4.4e-18
Smith-Waterman score: 437; 23.6% identity (61.2% similar) in 381 aa overlap (92-471:161-521)

       70 80 90 100 110 120
pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR
       .::.: .: ....::: :. :. .
NP_057 EPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYIMRCREWYGWH
       140 150 160 170 180 190

       130 140 150 160 170 180
pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLSE
       :::: ... . : : . ....:. : . .:...: . . :...: ..: ..::
NP_057 FPELGKIISDNLTYCKCLQKVGDR--KNYASAKLSELLPEEVEAEVKAAAEISMGTEVSE
       200 210 220 230 240

       190 200 210 220 230 240
pF1KB8 EELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTNL
       :.. . . : ...:.. . ..:::...:: ::::.....: ..:.... ::.: ::
NP_057 EDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIAHAGSLLNL
       250 260 270 280 290 300

       250 260 270 280 290 300
pF1KB8 SKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCTL
       .: : ....:::.. . ...: :. : :::...: . : . : .:..::: .:
NP_057 AKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISRMLAAKTVL
       310 320 330 340 350 360

       310 320 330 340 350 360
pF1KB8 AARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRK
       : : :.: :.. . .: : . ..: .. . : ..... : .. .: . :
NP_057 AIRYDAFGEDSSSAMGVENRAKLEARL-RTLEDRGIRKISGTGKAL-AKTEKYEHKSEVK
       370 380 390 400 410 420

       370 380 390 400 410 420
pF1KB8 MKERLGLTEIRKQANRMSFGEIE-EDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
       . : . . ... .. ... :: : :. ..... ..:.: . ..
NP_057 TYDPSGDSTLPTCSKKRKIEQVDKEDEITEK---------KAKKAKIK-VKVEEEEEEKV
       430 440 450 460 470

       430 440 450 460 470 480
pF1KB8 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
       .. . ...:.. : :. :... . : . :..:. .::
NP_057 AEEEETSVKKKKK-RGKKKHIKEEPLSEEE-----PCTSTAIASPEKKKKKKKKRENED
       480 490 500 510 520

       490
pF1KB8 SSMAEFLKVKGEKSGLMST

>>NP_006383 (OMIM: 614153,614154) nucleolar protein 56 [ (594 aa)
 initn: 388 init1: 295 opt: 399 Z-score: 415.9 bits: 86.7 E(85289): 2e-16
Smith-Waterman score: 410; 24.7% identity (63.0% similar) in 308 aa overlap (92-386:167-474)

       70 80 90 100 110 120
pF1KB8 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR
       .:... .: ......: . .:. :. .
NP_006 TDLSACKAQLGLGHSYSRAKVKFNVNRVDNMIIQSISLLDQLDKDINTFSMRVREWYGYH
       140 150 160 170 180 190

       130 140 150 160 170
pF1KB8 FPELESLVPNALDYIRTVKELGNSLDKCKNN-ENLQQILTNATIMVVSVTAS-TTQGQQL
       :::: ... . : : .. .:: . ... :.:... ... . . :: ...:...
NP_006 FPELVKIINDNATYCRLAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMGMDI
       200 210 220 230 240 250

       180 190 200 210 220 230
pF1KB8 SEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLT
       : .: .: . .. :. .. .. :..:.:: .::.:: .:: ...:.... ::.::
NP_006 SAIDLINIESFSSRVVSLSEYRQSLHTYLRSKMSQVAPSLSALIGEAVGARLIAHAGSLT
       260 270 280 290 300 310

       240 250 260 270 280 290
pF1KB8 NLSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKC
       ::.: :: ....:::.. . .... . :. : :.:: .. . . .: .: ::
NP_006 NLAKYPASTVQILGAEKALFRALKTRGNTPKYGLIFHSTFIGRAAAKNKGRISRYLANKC
       320 330 340 350 360 370

       300 310 320 330 340
pF1KB8 TLAARVDSFHESTEGKVGYELKDEIERKFDKWQ--EPP---------PVKQVKPLPAPLD
       ..:.:.: : : . : .:....:.... .. : : . :.. : .
NP_006 SIASRIDCFSEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAEEAAAEIT
       380 390 400 410 420 430

       350 360 370 380 390 400
pF1KB8 GQRKKRGGRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVR
       . .:. .: .: :.::. . .. : : : :.
NP_006 RKLEKQEKKRLKKEKKRLAALALASSENSSSTPEECEEMSEKPKKKKKQKPQEVPQENGM
       440 450 460 470 480 490

       410 420 430 440 450 460
pF1KB8 QTQVNEATKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAA

NP_006 EDPSISFSKPKKKKSFSKEELMSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSGS
       500 510 520 530 540 550

499 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 14:21:21 2016 done: Fri Nov 4 14:21:22 2016
 Total Scan time: 10.520 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]