FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0179, 740 aa
  1>>>pF1KA0179 740 - 740 aa - 740 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.9727+/-0.000929; mu= -0.3438+/- 0.056
 mean_var=236.4118+/-48.212, 0's: 0 Z-trim(113.3): 24 B-trim: 151 in 2/51
 Lambda= 0.083414
 statistics sampled from 13916 (13935) to 13916 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.428), width: 16
 Scan time: 4.750

The best scores are:                                  opt   bits  E(32554)
CCDS33577.1 RRP1B gene_id:23076|Hs108|chr21  ( 758)  4560  562.0  1.2e-159
CCDS42951.1 RRP1 gene_id:8568|Hs108|chr21    ( 461)   984  131.5  2.8e-30

>>CCDS33577.1 RRP1B gene_id:23076|Hs108|chr21 (758 aa)
 initn: 4560 init1: 4560 opt: 4560 Z-score: 2979.6 bits: 562.0 E(32554): 1.2e-159
Smith-Waterman score: 4825; 97.5% identity (97.5% similar) in 758 aa overlap (1-740:1-758)

       10 20 30 40 50
pF1KA0 MAPAMQPAEIQFAQRLASSEKGIRDRAVKKLRQYISVKTQRETGGFSQEELL--------
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MAPAMQPAEIQFAQRLASSEKGIRDRAVKKLRQYISVKTQRETGGFSQEELLKIWKGLFY
       10 20 30 40 50 60

       60 70 80 90 100
pF1KA0 ----------QEELANTIAQLVHAVNNSAAQHLFIQTFWQTMNREWKGIDRLRLDKYYML
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 CMWVQDEPLLQEELANTIAQLVHAVNNSAAQHLFIQTFWQTMNREWKGIDRLRLDKYYML
       70 80 90 100 110 120

       110 120 130 140 150 160
pF1KA0 IRLVLRQSFEVLKRNGWEESRIKVFLDVLMKEVLCPESQSPNGVRFHFIDIYLDELSKVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 IRLVLRQSFEVLKRNGWEESRIKVFLDVLMKEVLCPESQSPNGVRFHFIDIYLDELSKVG
       130 140 150 160 170 180

       170 180 190 200 210 220
pF1KA0 GKELLADQNLKFIDPFCKIAAKTKDHTLVQTIARGVFEAIVDQSPFVPEETMEEQKTKVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GKELLADQNLKFIDPFCKIAAKTKDHTLVQTIARGVFEAIVDQSPFVPEETMEEQKTKVG
       190 200 210 220 230 240

       230 240 250 260 270 280
pF1KA0 DGDLSAEEIPENEVSLRRAVSKKKTALGKNHSRKDGLSDERGRDDCGTFEDTGPLLQFDY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 DGDLSAEEIPENEVSLRRAVSKKKTALGKNHSRKDGLSDERGRDDCGTFEDTGPLLQFDY
       250 260 270 280 290 300

       290 300 310 320 330 340
pF1KA0 KAVADRLLEMTSRKNTPHFNRKRLSKLIKKFQDLSEGSSISQLSFAEDISADEDDQILSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 KAVADRLLEMTSRKNTPHFNRKRLSKLIKKFQDLSEGSSISQLSFAEDISADEDDQILSQ
       310 320 330 340 350 360

       350 360 370 380 390 400
pF1KA0 GKHKKKGNKLLEKTNLEKEKGSRVFCVEEEDSESSLQKRRRKKKKKHHLQPENPGPGGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GKHKKKGNKLLEKTNLEKEKGSRVFCVEEEDSESSLQKRRRKKKKKHHLQPENPGPGGAA
       370 380 390 400 410 420

       410 420 430 440 450 460
pF1KA0 PSLEQNRGREPEASGPKALKARVAEPGAEATSSTGEESGSEHPPAVPMHNKRKRPRKKSP
       ::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PSLEQNRGREPEASGLKALKARVAEPGAEATSSTGEESGSEHPPAVPMHNKRKRPRKKSP
       430 440 450 460 470 480

       470 480 490 500 510 520
pF1KA0 RAHREMLESAVLPPEDMSQSGPSGSHPQGPRGSPTGGAQLLKRKRKLGVVPVNGSGLSTP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RAHREMLESAVLPPEDMSQSGPSGSHPQGPRGSPTGGAQLLKRKRKLGVVPVNGSGLSTP
       490 500 510 520 530 540

       530 540 550 560 570 580
pF1KA0 AWPPLQQEGPPTGPAEGANSHTTLPQRRRLQKKKAGPGSLELCGLPSQKTASLKKRKKMR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 AWPPLQQEGPPTGPAEGANSHTTLPQRRRLQKKKAGPGSLELCGLPSQKTASLKKRKKMR
       550 560 570 580 590 600

       590 600 610 620 630 640
pF1KA0 VMSNLVEHNGVLESEAGQPQALGSSGTCSSLKKQKLRAESDFVKFDTPFLPKPLFFRRAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VMSNLVEHNGVLESEAGQPQALGSSGTCSSLKKQKLRAESDFVKFDTPFLPKPLFFRRAK
       610 620 630 640 650 660

       650 660 670 680 690 700
pF1KA0 SSTATHPPGPAVQLNKTPSSSKKVTFGLNRNMTAEFKKTDKSILVSPTGPSRVAFDPEQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 SSTATHPPGPAVQLNKTPSSSKKVTFGLNRNMTAEFKKTDKSILVSPTGPSRVAFDPEQK
       670 680 690 700 710 720

       710 720 730 740
pF1KA0 PLHGVLKTPTSSPASSPLVAKKPLTTTPRRRPRAMDFF
       ::::::::::::::::::::::::::::::::::::::
CCDS33 PLHGVLKTPTSSPASSPLVAKKPLTTTPRRRPRAMDFF
       730 740 750

>>CCDS42951.1 RRP1 gene_id:8568|Hs108|chr21 (461 aa)
 initn: 986 init1: 745 opt: 984 Z-score: 657.0 bits: 131.5 E(32554): 2.8e-30
Smith-Waterman score: 1156; 43.4% identity (70.6% similar) in 472 aa overlap (7-456:8-459)

       10 20 30 40 50
pF1KA0 MAPAMQPAEIQFAQRLASSEKGIRDRAVKKLRQYISVKTQRETGGFSQEELL-------
       : :::.:::::..:. :::::.:::.:: ..::: .:::...:::
CCDS42 MVSRVQLPPEIQLAQRLAGNEQVTRDRAVRKLRKYIVARTQRAAGGFTHDELLKVWKGLF
       10 20 30 40 50 60

       60 70 80 90 100
pF1KA0 -----------QEELANTIAQLVHAVNNSAAQHLFIQTFWQTMNREWKGIDRLRLDKYYM
       ::::. ::.::::: ... :::::.:.::::::::: :::::::::.::
CCDS42 YCMWMQDKPLLQEELGRTISQLVHAFQTTEAQHLFLQAFWQTMNREWTGIDRLRLDKFYM
       70 80 90 100 110 120

       110 120 130 140 150 160
pF1KA0 LIRLVLRQSFEVLKRNGWEESRIKVFLDVLMKEVLCPESQSPNGVRFHFIDIYLDELSKV
       :.:.:: .:..::: .:::: .:. .:..:: :.: : ::.::::. :::.:.:.::.::
CCDS42 LMRMVLNESLKVLKMQGWEERQIEELLELLMTEILHPSSQAPNGVKSHFIEIFLEELTKV
       130 140 150 160 170 180

       170 180 190 200 210 220
pF1KA0 GGKELLADQNLKFIDPFCKIAAKTKDHTLVQTIARGVFEAIVDQSPFVPEETMEEQKTKV
       :..:: ::::::::::::.:::.::: ....:.::.::.::.:.:.. :. ..: :.
CCDS42 GAEELTADQNLKFIDPFCRIAARTKDSLVLNNITRGIFETIVEQAPLAIEDLLNELDTQ-
       190 200 210 220 230

       230 240 250 260 270
pF1KA0 GDGDLSAEEIPENEVSLR-RAVSKKKTALGKNHSRKDGLSDERGRDDCGTFEDTG-PLLQ
       : ..... .: . : :.:.:.. : . : :... : .:.: :.::
CCDS42 -DEEVASDSDESSEGGERGDALSQKRSEKPPAGSICRA-EPEAGEEQAGDDRDSGGPVLQ
       240 250 260 270 280 290

       280 290 300 310 320 330
pF1KA0 FDYKAVADRLLEMTSRKNTPHFNRKRLSKLIKKFQDLSEGSSISQLSFAEDISADEDDQI
       :::.:::.::.::.::..:: ::::: :.:.:.:::. : : :: .. .
CCDS42 FDYEAVANRLFEMASRQSTPSQNRKRLYKVIRKLQDLAGGI------FPEDEIPEKACRR
       300 310 320 330 340 350

       340 350 360 370 380 390
pF1KA0 LSQGKHKKKGNKLLEKTNLEKEKGSRVFCVEEEDSESSLQKRRRKKKKKHHLQPENPGPG
       : .:...:: .: . :..:.:. :.. : ..:.:.... .:: . .
CCDS42 LLEGRRQKKTKKQKRLLRLQQERGKG-----EKEPPSPGMERKRSRRRGVGADPEARAEA
       360 370 380 390 400

       400 410 420 430 440 450
pF1KA0 GAAP-SLEQNRGRE-PEASGPKALKARVAEPGAEATSSTGEESGSEHPPAVPMHNKRKRP
       : : . :. :. :.. : .. . : : ::. .. .. ..: ..:.::
CCDS42 GEQPGTAERALLRDQPRGRGQRGARQRRRTP-RPLTSARAKAANVQEP-----EKKKKRR
       410 420 430 440 450 460

       460 470 480 490 500 510
pF1KA0 RKKSPRAHREMLESAVLPPEDMSQSGPSGSHPQGPRGSPTGGAQLLKRKRKLGVVPVNGS

CCDS42 E

740 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 09:10:05 2016 done: Thu Nov 3 09:10:05 2016
 Total Scan time: 4.750 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]