FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB0487, 529 aa
  1>>>pF1KB0487 529 - 529 aa - 529 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.0734+/-0.00113; mu= 7.2587+/- 0.068
 mean_var=191.5134+/-38.033, 0's: 0 Z-trim(107.6): 19 B-trim: 0 in 0/54
 Lambda= 0.092678
 statistics sampled from 9657 (9665) to 9657 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.624), E-opt: 0.2 (0.297), width: 16
 Scan time: 3.540

The best scores are:                                       opt  bits E(32554)
CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2          ( 529) 3358 461.9 8.1e-130
CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20        ( 594) 1130 164.1 4.2e-40
CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19       ( 499)  430  70.4 5.5e-12

>>CCDS2353.1 NOP58 gene_id:51602|Hs108|chr2 (529 aa)
 initn: 3358 init1: 3358 opt: 3358 Z-score: 2444.1 bits: 461.9 E(32554): 8.1e-130
Smith-Waterman score: 3358; 100.0% identity (100.0% similar) in 529 aa overlap (1-529:1-529)

               10        20        30        40        50        60
pF1KB0 MLVLFETSVGYAIFKVLNEKKLQEVDSLWKEFETPEKANKIVKLKHFEKFQDTAEALAAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 MLVLFETSVGYAIFKVLNEKKLQEVDSLWKEFETPEKANKIVKLKHFEKFQDTAEALAAF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 TALMEGKINKQLKKVLKKIVKEAHEPLAVADAKLGGVIKEKLNLSCIHSPVVNELMRGIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 TALMEGKINKQLKKVLKKIVKEAHEPLAVADAKLGGVIKEKLNLSCIHSPVVNELMRGIR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB0 SQMDGLIPGVEPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 SQMDGLIPGVEPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB0 MRCREWYGWHFPELGKIISDNLTYCKCLQKVGDRKNYASAKLSELLPEEVEAEVKAAAEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 MRCREWYGWHFPELGKIISDNLTYCKCLQKVGDRKNYASAKLSELLPEEVEAEVKAAAEI
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB0 SMGTEVSEEDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 SMGTEVSEEDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB0 HAGSLLNLAKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 HAGSLLNLAKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB0 MLAAKTVLAIRYDAFGEDSSSAMGVENRAKLEARLRTLEDRGIRKISGTGKALAKTEKYE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 MLAAKTVLAIRYDAFGEDSSSAMGVENRAKLEARLRTLEDRGIRKISGTGKALAKTEKYE
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB0 HKSEVKTYDPSGDSTLPTCSKKRKIEQVDKEDEITEKKAKKAKIKVKVEEEEEEKVAEEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 HKSEVKTYDPSGDSTLPTCSKKRKIEQVDKEDEITEKKAKKAKIKVKVEEEEEEKVAEEE
              430       440       450       460       470       480

              490       500       510       520
pF1KB0 ETSVKKKKKRGKKKHIKEEPLSEEEPCTSTAIASPEKKKKKKKKRENED
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 ETSVKKKKKRGKKKHIKEEPLSEEEPCTSTAIASPEKKKKKKKKRENED
              490       500       510       520

>>CCDS13030.1 NOP56 gene_id:10528|Hs108|chr20 (594 aa)
 initn: 1146 init1: 553 opt: 1130 Z-score: 833.5 bits: 164.1 E(32554): 4.2e-40
Smith-Waterman score: 1147; 39.8% identity (66.6% similar) in 548 aa overlap (3-529:6-546)

       10 20 30 40 50
pF1KB0 MLVLFETSVGYAIFKVLNEKKLQEVDSLWKEFETPE----KANKIVKLKHFEKFQDT
       :::: .::::. : :...:.. : . : : ..::.: : : ..
CCDS13 MVLLHVLFEHAVGYAL---LALKEVEEISLLQPQVEESVLNLGKFHSIVRLVAFCPFASS
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB0 AEALAAFTALMEGKINKQLKKVLKKIV--KEAHEPLAVADAKLGGVIKEKLNLSCIHSPV
       :: .:. :: ....:. .:. . :. . :.:.: :.:..:.:.:. .: . :
CCDS13 QVALENANAVSEGVVHEDLRLLLETHLPSKKKKVLLGVGDPKIGAAIQEELGYNCQTGGV
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KB0 VNELMRGIRSQMDGLIPGVEPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDD
       . :..::.: .. .:. :. :::.:: :: ..::....::.::.:.:::::.
CCDS13 IAEILRGVRLHFHNLVKGLTDLSACKAQLGLGHSYSRAKVKFNVNRVDNMIIQSISLLDQ
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB0 LDKELNNYIMRCREWYGWHFPELGKIISDNLTYCKCLQKVGDRKNYASAKLSELLPEEVE
       :::..:.. :: :::::.::::: :::.:: :::. : .:.:.. :: .: ..
CCDS13 LDKDINTFSMRVREWYGYHFPELVKIINDNATYCRLAQFIGNRRELNEDKLEKLEELTMD
       180 190 200 210 220 230

       240 250 260 270 280
pF1KB0 -AEVKA---AAEISMGTEVSEEDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVT
       :..:: :.. ::: ..: :. :: . ..:. .:::: .:. ::...: .::...
CCDS13 GAKAKAILDASRSSMGMDISAIDLINIESFSSRVVSLSEYRQSLHTYLRSKMSQVAPSLS
       240 250 260 270 280 290

       290 300 310 320 330 340
pF1KB0 VMVGELVGARLIAHAGSLLNLAKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLV
       ...:: :::::::::::: ::::. ::::::::::::::::::.: .:::::::.:....
CCDS13 ALIGEAVGARLIAHAGSLTNLAKYPASTVQILGAEKALFRALKTRGNTPKYGLIFHSTFI
       300 310 320 330 340 350

       350 360 370 380 390 400
pF1KB0 GQTSPKHKGKISRMLAAKTVLAIRYDAFGEDSSSAMGVENRAKLEARLRTLEDRGI-RKI
       :... :.::.:::.:: : .: : : :.: .:..: . : ..: :: : : ::
CCDS13 GRAAAKNKGRISRYLANKCSIASRIDCFSEVPTSVFGEKLREQVEERLSFYETGEIPRKN
       360 370 380 390 400 410

       410 420 430 440 450
pF1KB0 SGTGK--------ALAKTEKYEHKSEVKTYDPSGD--STLPTCSKKRKIEQVDKEDEITE
       . : : :. . .:.: : ..: :.. . .. .:..:
CCDS13 LDVMKEAMVQAEEAAAEITRKLEKQEKKRLKKEKKRLAALALASSENSSSTPEECEEMSE
       420 430 440 450 460 470

       460 470 480 490 500 510
pF1KB0 KKAKKAKIKVKVEEEEEEKVAEEEETSVKKKKKRGKKKHIKEEPLSEEEPCTSTAIASPE
       : :: : : . : .:. :. : .: :: ::. ::: .: . :. . . :.
CCDS13 KPKKKKKQKPQ--EVPQENGMEDPSISFSKPKK--KKSFSKEELMSSDLEETAGSTSIPK
       480 490 500 510 520 530

       520
pF1KB0 KKKKKKKKRENED
       .::. :.. .:
CCDS13 RKKSTPKEETVNDPEEAGHRSGSKKKRKFSKEEPVSSGPEEAVGKSSSKKKKKFHKASQE
       540 550 560 570 580 590

>>CCDS12879.1 PRPF31 gene_id:26121|Hs108|chr19 (499 aa)
 initn: 405 init1: 327 opt: 430 Z-score: 328.7 bits: 70.4 E(32554): 5.5e-12
Smith-Waterman score: 440; 23.6% identity (61.2% similar) in 381 aa overlap (161-521:92-471)

       140 150 160 170 180 190
pF1KB0 EPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYIMRCREWYGWH
       .::.: .: ....::: :. :. .
CCDS12 EIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR
       70 80 90 100 110 120

       200 210 220 230 240
pF1KB0 FPELGKIISDNLTYCKCLQKVGDR--KNYASAKLSELLPEEVEAEVKAAAEISMGTEVSE
       :::: ... . : : . ....:. : . .:...: . . :...: ..: ..::
CCDS12 FPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQGQQLSE
       130 140 150 160 170 180

       250 260 270 280 290 300
pF1KB0 EDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIAHAGSLLNL
       :.. . . : ...:.. . ..:::...:: ::::.....: ..:.... ::.: ::
CCDS12 EELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVAGGLTNL
       190 200 210 220 230 240

       310 320 330 340 350 360
pF1KB0 AKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISRMLAAKTVL
       .: : ....:::.. . ...: :. : :::...: . : . : .:..::: .:
CCDS12 SKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLVAAKCTL
       250 260 270 280 290 300

       370 380 390 400 410 420
pF1KB0 AIRYDAFGEDSSSAMGVENRAKLEARL-RTLEDRGIRKISGTGKAL-AKTEKYEHKSEVK
       : : :.: :.. . .: : . ..: .. . : ..... : .. .: . :
CCDS12 AARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRK
       310 320 330 340 350 360

       430 440 450 460 470
pF1KB0 TYDPSGDSTLPTCSKKRKIEQVDKEDEITEK---------KAKKAKIK-VKVEEEEEEKV
       . : . . ... .. ... :: : :. ..... ..:.: . ..
CCDS12 MKERLGLTEIRKQANRMSFGEIE-EDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI
       370 380 390 400 410 420

       480 490 500 510 520
pF1KB0 AEEEETSVKKKKK-RGKKKHIKEEPLSEEE-----PCTSTAIASPEKKKKKKKKRENED
       .. . ...:.. : :. :... . : . :..:. .::
CCDS12 SKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEANQKYF
       430 440 450 460 470 480

CCDS12 SSMAEFLKVKGEKSGLMST
       490

529 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 08:04:07 2016 done: Sat Nov 5 08:04:07 2016
 Total Scan time: 3.540 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]