FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5454, 514 aa
  1>>>pF1KB5454 514 - 514 aa - 514 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6093+/-0.000938; mu= 17.2136+/- 0.057
 mean_var=72.0203+/-14.564, 0's: 0 Z-trim(105.3): 24 B-trim: 262 in 1/49
 Lambda= 0.151129
 statistics sampled from 8353 (8365) to 8353 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.624), E-opt: 0.2 (0.257), width: 16
 Scan time: 3.100

The best scores are:                                       opt  bits E(32554)
CCDS795.1 SARS gene_id:6301|Hs108|chr1             ( 514) 3406 752.0 3.5e-217
CCDS81356.1 SARS gene_id:6301|Hs108|chr1           ( 536) 2787 617.1 1.6e-176
CCDS33017.1 SARS2 gene_id:54938|Hs108|chr19        ( 518)  590 138.1 2.4e-32
CCDS54265.1 SARS2 gene_id:54938|Hs108|chr19        ( 520)  590 138.1 2.4e-32

>>CCDS795.1 SARS gene_id:6301|Hs108|chr1                (514 aa)
 initn: 3406 init1: 3406 opt: 3406 Z-score: 4012.5 bits: 752.0 E(32554): 3.5e-217
Smith-Waterman score: 3406; 100.0% identity (100.0% similar) in 514 aa overlap (1-514:1-514)

               10        20        30        40        50        60
pF1KB5 MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWRRCRFRADNLNKLKNLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWRRCRFRADNLNKLKNLC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQIKKVRLLIDEAILKCDAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 SKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQIKKVRLLIDEAILKCDAE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 RIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWGDCTVRKKYSHVDLVVMVDGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 RIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWGDCTVRKKYSHVDLVVMVDGF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 EGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSRGYIPIYTPFFMRKEVMQEVAQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 EGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSRGYIPIYTPFFMRKEVMQEVAQL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 SQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPIAALHRDEWLRPEDLPIKYAGLSTC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 SQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPIAALHRDEWLRPEDLPIKYAGLSTC
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 FRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHDNKSWEMFEEMITTAEEFYQSLGIPYH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 FRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHDNKSWEMFEEMITTAEEFYQSLGIPYH
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 IVNIVSGSLNHAASKKLDLEAWFPGSGAFRELVSCSNCTDYQARRLRIRYGQTKKMMDKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 IVNIVSGSLNHAASKKLDLEAWFPGSGAFRELVSCSNCTDYQARRLRIRYGQTKKMMDKV
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 EFVHMLNATMCATTRTICAILENYQTEKGITVPEKLKEFMPPGLQELIPFVKPAPIEQEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS79 EFVHMLNATMCATTRTICAILENYQTEKGITVPEKLKEFMPPGLQELIPFVKPAPIEQEP
              430       440       450       460       470       480

              490       500       510
pF1KB5 SKKQKKQHEGSKKKAAARDVTLENRLQNMEVTDA
       ::::::::::::::::::::::::::::::::::
CCDS79 SKKQKKQHEGSKKKAAARDVTLENRLQNMEVTDA
              490       500       510

>>CCDS81356.1 SARS gene_id:6301|Hs108|chr1              (536 aa)
 initn: 2787 init1: 2787 opt: 2787 Z-score: 3282.8 bits: 617.1 E(32554): 1.6e-176
Smith-Waterman score: 3352; 95.9% identity (95.9% similar) in 536 aa overlap (1-514:1-536)

               10        20        30        40        50        60
pF1KB5 MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWRRCRFRADNLNKLKNLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWRRCRFRADNLNKLKNLC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQIKKVRLLIDEAILKCDAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 SKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQIKKVRLLIDEAILKCDAE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 RIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWGDCTVRKKYSHVDLVVMVDGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 RIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWGDCTVRKKYSHVDLVVMVDGF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 EGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSRGYIPIYTPFFMRKEVMQEVAQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 EGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSRGYIPIYTPFFMRKEVMQEVAQL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 SQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPIAALHRDEWLRPEDLPIKYAGLSTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 SQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPIAALHRDEWLRPEDLPIKYAGLSTC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 FRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHDNKSWEMFEEMITTAEEFYQSLGIPYH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 FRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHDNKSWEMFEEMITTAEEFYQSLGIPYH 310 320 330 340 350 360 370 380 390 400 410 pF1KB5 IVNIVSGSLNHAASKKLDLEAWFPGSGAFRELVSCSNCTDYQARRLRIRYGQTKKMMDK- ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 IVNIVSGSLNHAASKKLDLEAWFPGSGAFRELVSCSNCTDYQARRLRIRYGQTKKMMDKR 370 380 390 400 410 420 420 430 440 450 pF1KB5 ---------------------VEFVHMLNATMCATTRTICAILENYQTEKGITVPEKLKE ::::::::::::::::::::::::::::::::::::::: CCDS81 KNNLHLSTQNKLEASLFSPKKVEFVHMLNATMCATTRTICAILENYQTEKGITVPEKLKE 430 440 450 460 470 480 460 470 480 490 500 510 pF1KB5 FMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQNMEVTDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 FMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQNMEVTDA 490 500 510 520 530 >>CCDS33017.1 SARS2 gene_id:54938|Hs108|chr19 (518 aa) initn: 433 init1: 287 opt: 590 Z-score: 694.2 bits: 138.1 E(32554): 2.4e-32 Smith-Waterman score: 596; 30.1% identity (66.4% similar) in 342 aa overlap (123-460:158-485) 100 110 120 130 140 150 pF1KB5 DALANLKVSQIKKVRLLIDEAILKCDAERIKLEAERFENLREIGNLLHPSVPISNDEDVD .:: . . . .. : ::.::.. ::. . 
CCDS33 EVQQDPKYQGLRARGREIRKELVHLYPREAQLEEQFYLQALKLPNQTHPDVPVG-DES-Q 130 140 150 160 170 180 160 170 180 190 200 210 pF1KB5 NKVERIWGDCTVR--KKYSHVDLVVMVDGFEGEKGAVVAGSRGYFLKGVLVFLEQALIQY .: .. :: : . .:... .: .. .. . :.: :.:.:.:. ..:...:... CCDS33 ARVLHMVGDKPVFSFQPRGHLEIGEKLDIIRQKRLSHVSGHRSYYLRGAGALLQHGLVNF 190 200 210 220 230 240 220 230 240 250 260 pF1KB5 ALRTLGSRGYIPIYTPFFMRKEVMQEVAQLSQFDE-ELYKVIGKGSEKSDDNSYDEKYLI .. : ::. :. .: ..: :.. .. . . ..:.. : . . : CCDS33 TFNKLLRRGFTPMTVPDLLRGAVFEGCGMTPNANPSQIYNI--------DPARFKDLNLA 250 260 270 280 290 270 280 290 300 310 320 pF1KB5 ATSEQPIAALHRDEWLRPEDLPIKYAGLSTCFRQEVGSHGRDTRGIFRVHQFEKIEQFVY .:.: .:. :. . .:::.... :::.: :... :.. ::..:::.: :.:.: CCDS33 GTAEVGLAGYFMDHTVAFRDLPVRMVCSSTCYRAETNT-GQEPRGLYRVHHFTKVEMFGV 300 310 320 330 340 350 330 340 350 360 370 380 pF1KB5 SSPHDNKSWEMFEEMITTAEEFYQSLGIPYHIVNIVSGSLNHAASKKLDLEAWFPGSGAF ..: ..: ...::... :. ::. ...... . :. : .:.:.:::.:: : : CCDS33 TGPGLEQSSQLLEEFLSLQMEILTELGLHFRVLDMPTQELGLPAYRKFDIEAWMPGRGRF 360 370 380 390 400 410 390 400 410 420 430 440 pF1KB5 RELVSCSNCTDYQARRLRIRYGQTKKMMDKVEFVHMLNATMCATTRTICAILENYQTEKG :..: :::::.:.:::.: . ::. ...:.: .::: ::. : . :.::. : . : CCDS33 GEVTSASNCTDFQSRRLHIMF-QTEA--GELQFAHTVNATACAVPRLLIALLESNQQKDG 420 430 440 450 460 470 450 460 470 480 490 500 pF1KB5 -ITVPEKLKEFMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQN . :: :. .. CCDS33 SVLVPPALQSYLGTDRITAPTHVPLQYIGPNQPRKPGLPGQPAVS 480 490 500 510 >>CCDS54265.1 SARS2 gene_id:54938|Hs108|chr19 (520 aa) initn: 433 init1: 287 opt: 590 Z-score: 694.2 bits: 138.1 E(32554): 2.4e-32 Smith-Waterman score: 596; 30.1% identity (66.4% similar) in 342 aa overlap (123-460:160-487) 100 110 120 130 140 150 pF1KB5 DALANLKVSQIKKVRLLIDEAILKCDAERIKLEAERFENLREIGNLLHPSVPISNDEDVD .:: . . . .. : ::.::.. ::. . CCDS54 QQVRLDPGAGSIFGPTFLPFPGQLSLLVEAQLEEQFYLQALKLPNQTHPDVPVG-DES-Q 130 140 150 160 170 180 160 170 180 190 200 210 pF1KB5 NKVERIWGDCTVR--KKYSHVDLVVMVDGFEGEKGAVVAGSRGYFLKGVLVFLEQALIQY .: .. :: : . .:... .: .. .. . 
:.: :.:.:.:. ..:...:... CCDS54 ARVLHMVGDKPVFSFQPRGHLEIGEKLDIIRQKRLSHVSGHRSYYLRGAGALLQHGLVNF 190 200 210 220 230 240 220 230 240 250 260 pF1KB5 ALRTLGSRGYIPIYTPFFMRKEVMQEVAQLSQFDE-ELYKVIGKGSEKSDDNSYDEKYLI .. : ::. :. .: ..: :.. .. . . ..:.. : . . : CCDS54 TFNKLLRRGFTPMTVPDLLRGAVFEGCGMTPNANPSQIYNI--------DPARFKDLNLA 250 260 270 280 290 270 280 290 300 310 320 pF1KB5 ATSEQPIAALHRDEWLRPEDLPIKYAGLSTCFRQEVGSHGRDTRGIFRVHQFEKIEQFVY .:.: .:. :. . .:::.... :::.: :... :.. ::..:::.: :.:.: CCDS54 GTAEVGLAGYFMDHTVAFRDLPVRMVCSSTCYRAETNT-GQEPRGLYRVHHFTKVEMFGV 300 310 320 330 340 350 330 340 350 360 370 380 pF1KB5 SSPHDNKSWEMFEEMITTAEEFYQSLGIPYHIVNIVSGSLNHAASKKLDLEAWFPGSGAF ..: ..: ...::... :. ::. ...... . :. : .:.:.:::.:: : : CCDS54 TGPGLEQSSQLLEEFLSLQMEILTELGLHFRVLDMPTQELGLPAYRKFDIEAWMPGRGRF 360 370 380 390 400 410 390 400 410 420 430 440 pF1KB5 RELVSCSNCTDYQARRLRIRYGQTKKMMDKVEFVHMLNATMCATTRTICAILENYQTEKG :..: :::::.:.:::.: . ::. ...:.: .::: ::. : . :.::. : . : CCDS54 GEVTSASNCTDFQSRRLHIMF-QTEA--GELQFAHTVNATACAVPRLLIALLESNQQKDG 420 430 440 450 460 470 450 460 470 480 490 500 pF1KB5 -ITVPEKLKEFMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQN . :: :. .. CCDS54 SVLVPPALQSYLGTDRITAPTHVPLQYIGPNQPRKPGLPGQPAVS 480 490 500 510 520 514 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 12:50:31 2016 done: Sat Nov 5 12:50:31 2016 Total Scan time: 3.100 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]