FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA0094, 386 aa 1>>>pF1KA0094 386 - 386 aa - 386 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1169+/-0.00091; mu= 18.6212+/- 0.055 mean_var=71.6101+/-14.255, 0's: 0 Z-trim(106.2): 23 B-trim: 73 in 1/50 Lambda= 0.151561 statistics sampled from 8865 (8874) to 8865 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.653), E-opt: 0.2 (0.273), width: 16 Scan time: 2.050 The best scores are: opt bits E(32554) CCDS47110.1 METAP1 gene_id:23173|Hs108|chr4 ( 386) 2695 598.4 3.5e-171 CCDS2246.1 METAP1D gene_id:254042|Hs108|chr2 ( 335) 844 193.7 2.1e-49 >>CCDS47110.1 METAP1 gene_id:23173|Hs108|chr4 (386 aa) initn: 2695 init1: 2695 opt: 2695 Z-score: 3186.8 bits: 598.4 E(32554): 3.5e-171 Smith-Waterman score: 2695; 100.0% identity (100.0% similar) in 386 aa overlap (1-386:1-386) 10 20 30 40 50 60 pF1KA0 MAAVETRVCETDGCSSEAKLQCPTCIKLGIQGSYFCSQECFKGSWATHKLLHKKAKDEKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MAAVETRVCETDGCSSEAKLQCPTCIKLGIQGSYFCSQECFKGSWATHKLLHKKAKDEKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA0 KREVSSWTVEGDINTDPWAGYRYTGKLRPHYPLMPTRPVPSYIQRPDYADHPLGMSESEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 KREVSSWTVEGDINTDPWAGYRYTGKLRPHYPLMPTRPVPSYIQRPDYADHPLGMSESEQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA0 ALKGTSQIKLLSSEDIEGMRLVCRLAREVLDVAAGMIKPGVTTEEIDHAVHLACIARNCY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ALKGTSQIKLLSSEDIEGMRLVCRLAREVLDVAAGMIKPGVTTEEIDHAVHLACIARNCY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA0 PSPLNYYNFPKSCCTSVNEVICHGIPDRRPLQEGDIVNVDITLYRNGYHGDLNETFFVGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PSPLNYYNFPKSCCTSVNEVICHGIPDRRPLQEGDIVNVDITLYRNGYHGDLNETFFVGE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA0 VDDGARKLVQTTYECLMQAIDAVKPGVRYRELGNIIQKHAQANGFSVVRSYCGHGIHKLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VDDGARKLVQTTYECLMQAIDAVKPGVRYRELGNIIQKHAQANGFSVVRSYCGHGIHKLF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA0 HTAPNVPHYAKNKAVGVMKSGHVFTIEPMICEGGWQDETWPDGWTAVTRDGKRSAQFEHT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 HTAPNVPHYAKNKAVGVMKSGHVFTIEPMICEGGWQDETWPDGWTAVTRDGKRSAQFEHT 310 320 330 340 350 360 370 380 pF1KA0 LLVTDTGCEILTRRLDSARPHFMSQF :::::::::::::::::::::::::: CCDS47 LLVTDTGCEILTRRLDSARPHFMSQF 370 380 >>CCDS2246.1 METAP1D gene_id:254042|Hs108|chr2 (335 aa) initn: 811 init1: 595 opt: 844 Z-score: 1000.3 bits: 193.7 E(32554): 2.1e-49 Smith-Waterman score: 844; 47.1% identity (75.7% similar) in 276 aa overlap (98-373:63-330) 70 80 90 100 110 120 pF1KA0 TVEGDINTDPWAGYRYTGKLRPHYPLMPTRPVPSYIQRPDYADHPLGMSESEQALKGTSQ :::..:..:::. :. . : : CCDS22 SSQQRRNFFFRRQRDISHSIVLPAAVSSAHPVPKHIKKPDYV--TTGIVPD----WGDS- 40 50 60 70 80 130 140 150 160 170 180 pF1KA0 IKLLSSEDIEGMRLVCRLAREVLDVAAGMIKPGVTTEEIDHAVHLACIARNCYPSPLNYY :.. . ..:.:.. .:.:::.:: .:. .: .:::::: :: :..: :::::.: CCDS22 IEVKNEDQIQGLHQACQLARHVLLLAGKSLKVDMTTEEIDALVHREIISHNAYPSPLGYG 90 100 110 120 130 140 190 200 210 220 230 240 pF1KA0 NFPKSCCTSVNEVICHGIPDRRPLQEGDIVNVDITLYRNGYHGDLNETFFVGEVDDGARK .:::: :::::.:.:::::: ::::.:::.:.:.:.: :::::: .:::.::.::. ..: CCDS22 GFPKSVCTSVNNVLCHGIPDSRPLQDGDIINIDVTVYYNGYHGDTSETFLVGNVDECGKK 150 160 170 180 190 200 250 260 270 280 290 300 pF1KA0 LVQTTYECLMQAIDAVKPGVRYRELGNIIQKHAQANGFSVVRSYCGHGIHKLFHTAPNVP ::... .: .:: : . :. . .:: :.. .. :::.: . :::: . :: :.. CCDS22 LVEVARRCRDEAIAACRAGAPFSVIGNTISHITHQNGFQVCPHFVGHGIGSYFHGHPEIW 210 220 230 240 250 260 310 320 330 340 350 360 pF1KA0 HYAKNKAVGVMKSGHVFTIEPMICEGGWQDETWPDGWTAVTRDGKRSAQFEHTLLVTDTG :.:... . :. : .:::::.: ::. . .. :.::.:. :..::::::::.:.:. : CCDS22 HHANDSDLP-MEEGMAFTIEPIITEGSPEFKVLEDAWTVVSLDNQRSAQFEHTVLITSRG 270 280 290 300 310 320 370 380 pF1KA0 CEILTRRLDSARPHFMSQF .:::. CCDS22 AQILTKLPHEA 330 386 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Nov 2 18:02:18 2016 done: Wed Nov 2 18:02:18 2016 Total Scan time: 2.050 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]