FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5141, 523 aa 1>>>pF1KB5141 523 - 523 aa - 523 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4680+/-0.000811; mu= 17.1056+/- 0.049 mean_var=64.4455+/-12.669, 0's: 0 Z-trim(106.7): 18 B-trim: 0 in 0/54 Lambda= 0.159764 statistics sampled from 9118 (9132) to 9118 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.281), width: 16 Scan time: 3.360 The best scores are: opt bits E(32554) CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 ( 523) 3505 816.8 0 CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 ( 423) 2869 670.1 1.3e-192 CCDS6430.1 GPT gene_id:2875|Hs108|chr8 ( 496) 2363 553.5 2e-157 >>CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 (523 aa) initn: 3505 init1: 3505 opt: 3505 Z-score: 4362.0 bits: 816.8 E(32554): 0 Smith-Waterman score: 3505; 100.0% identity (100.0% similar) in 523 aa overlap (1-523:1-523) 10 20 30 40 50 60 pF1KB5 MQRAAALVRRGCGPRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MQRAAALVRRGCGPRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 EYAVRGPIVLKAGEIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EYAVRGPIVLKAGEIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 LLDSPSFPEDAKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LLDSPSFPEDAKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IYLTTGASDGISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 IYLTTGASDGISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENC 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 WALNVNELRRAVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 WALNVNELRRAVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 VYQDNVYSPDCRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VYQDNVYSPDCRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLH 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 PEIKGQLVKLLSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PEIKGQLVKLLSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 EDLFNQVPGIHCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EDLFNQVPGIHCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVP 430 440 450 460 470 480 490 500 510 520 pF1KB5 GSGFGQREGTYHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA ::::::::::::::::::::::::::::::::::::::::::: CCDS10 GSGFGQREGTYHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA 490 500 510 520 >>CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 (423 aa) initn: 2869 init1: 2869 opt: 2869 Z-score: 3571.2 bits: 670.1 E(32554): 1.3e-192 Smith-Waterman score: 2869; 100.0% identity (100.0% similar) in 423 aa overlap (101-523:1-423) 80 90 100 110 120 130 pF1KB5 KAGEIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSPSFPED :::::::::::::::::::::::::::::: CCDS45 MGQQPITFLRQVMALCTYPNLLDSPSFPED 10 20 30 140 150 160 170 180 190 pF1KB5 AKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 AKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDG 40 50 60 70 80 90 200 210 220 230 240 250 pF1KB5 ISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 ISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRR 100 110 120 130 140 150 260 270 280 290 300 310 pF1KB5 AVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 AVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPD 160 170 180 190 200 210 320 330 340 350 360 370 pF1KB5 CRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 CRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKL 220 230 240 250 260 270 380 390 400 410 420 430 pF1KB5 LSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGI 280 290 300 310 320 330 440 450 460 470 480 490 pF1KB5 HCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 HCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGT 340 350 360 370 380 390 500 510 520 pF1KB5 YHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA ::::::::::::::::::::::::::::::::: CCDS45 YHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA 400 410 420 >>CCDS6430.1 GPT gene_id:2875|Hs108|chr8 (496 aa) initn: 2387 init1: 2350 opt: 2363 Z-score: 2939.9 bits: 553.5 E(32554): 2e-157 Smith-Waterman score: 2363; 69.0% identity (90.2% similar) in 480 aa overlap (44-523:17-496) 20 30 40 50 60 70 pF1KB5 PRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAVEYAVRGPIVLKAG : ..:::..:::.:. :::::::::: .: CCDS64 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRAL 10 20 30 40 80 90 100 110 120 130 pF1KB5 EIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSPSFPEDAKK :.: ::..:.::::::::::::::::::::.::::::::.:::. :.::.::.::.:::: CCDS64 ELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKK 50 60 70 80 90 100 140 150 160 170 180 190 pF1KB5 RARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDGIST ::.::::::::.:::.::.:.:.. :::::: :: :::::.::::.:..:.:::::.: : CCDS64 RAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVT 110 120 130 140 150 160 200 210 220 230 240 250 pF1KB5 ILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRRAVQ .::.::.: :..::::.:::::::::::...:: :.::.:::::: :::.: ::.::. CCDS64 VLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALG 170 180 190 200 210 220 260 270 280 290 300 310 pF1KB5 EAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPDCRF .:.::: :..::.:::::::::::.:.::: ::.::.::.::::::::::::::. .: CCDS64 QARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQF 230 240 250 260 270 280 320 330 340 350 360 370 pF1KB5 HSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKLLSV :::::::.:::: :... :::::::::::::::::.::::.::.:. .. :..::.:: CCDS64 HSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSV 290 300 310 320 330 340 380 390 400 410 420 430 pF1KB5 RLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGIHCN :::::: ::: .:.::.::. . :: ::. ::..::..:: ::::::..::..::: :: CCDS64 RLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCN 350 360 370 380 390 400 440 450 460 470 480 490 pF1KB5 PLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGTYHF :.:::::.:::. .: .::: :: .:::::.:..:::::::::::::::::::::::: CCDS64 PVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHF 410 420 430 440 450 460 500 510 520 pF1KB5 RMTILPPVEKLKTVLQKVKDFHINFLEKYA :::::::.:::. .:.:.. :: .: .:. CCDS64 RMTILPPLEKLRLLLEKLSRFHAKFTLEYS 470 480 490 523 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 06:20:42 2016 done: Sat Nov 5 06:20:42 2016 Total Scan time: 3.360 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]