FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1860, 380 aa
  1>>>pF1KE1860 380 - 380 aa - 380 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 13.7955+/-0.00114; mu= -16.1346+/- 0.069
 mean_var=708.8464+/-145.517, 0's: 0 Z-trim(118.8): 24 B-trim: 203 in 1/52
 Lambda= 0.048172
 statistics sampled from 19796 (19819) to 19796 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.841), E-opt: 0.2 (0.609), width: 16
 Scan time: 3.550

The best scores are:                               opt   bits  E(32554)
CCDS33051.1 VASP gene_id:7408|Hs108|chr19  ( 380)  2636  197.3  1.9e-50
CCDS81851.1 EVL gene_id:51466|Hs108|chr14  ( 416)   958   80.7  2.6e-15
CCDS9955.1 EVL gene_id:51466|Hs108|chr14   ( 418)   951   80.3  3.6e-15

>>CCDS33051.1 VASP gene_id:7408|Hs108|chr19 (380 aa)
 initn: 2636 init1: 2636 opt: 2636 Z-score: 1019.2 bits: 197.3 E(32554): 1.9e-50
Smith-Waterman score: 2636; 100.0% identity (100.0% similar) in 380 aa overlap (1-380:1-380)

               10        20        30        40        50        60
pF1KE1 MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRVVGRKMQPDQQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRVVGRKMQPDQQV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 VINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASALEALEGGGPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASALEALEGGGPPP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 PPALPTWSVPNGPSPEEVEQQKRQQPGPSEHIERRVSNAGGPPAPPAGGPPPPPGPPPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PPALPTWSVPNGPSPEEVEQQKRQQPGPSEHIERRVSNAGGPPAPPAGGPPPPPGPPPPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 GPPPPPGLPPSGVPAAAHGAGGGPPPAPPLPAAQGPGGGGAGAPGLAAAIAGAKLRKVSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33
GPPPPPGLPPSGVPAAAHGAGGGPPPAPPLPAAQGPGGGGAGAPGLAAAIAGAKLRKVSK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 QEEASGGPTAPKAESGRSGGGGLMEEMNAMLARRRKATQVGEKTPKDESANQEEPEARVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 QEEASGGPTAPKAESGRSGGGGLMEEMNAMLARRRKATQVGEKTPKDESANQEEPEARVP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 AQSESVRRPWEKNSTTLPRMKSSSSVTTSETQPCTPSSSDYSDLQRVKQELLEEVKKELQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 AQSESVRRPWEKNSTTLPRMKSSSSVTTSETQPCTPSSSDYSDLQRVKQELLEEVKKELQ 310 320 330 340 350 360 370 380 pF1KE1 KVKEEIIEAFVQELRKRGSP :::::::::::::::::::: CCDS33 KVKEEIIEAFVQELRKRGSP 370 380 >>CCDS81851.1 EVL gene_id:51466|Hs108|chr14 (416 aa) initn: 338 init1: 338 opt: 958 Z-score: 388.5 bits: 80.7 E(32554): 2.6e-15 Smith-Waterman score: 1100; 48.1% identity (67.5% similar) in 428 aa overlap (1-374:1-410) 10 20 30 40 50 60 pF1KE1 MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRVVGRKMQPDQQV ::: ::..::.::.::: .:.:.: : :.:::..:::: ..:.::::: :.: :::: CCDS81 MSEQSICQARASVMVYDDTSKKWVPIKPGQQGFSRINIYHNTASNTFRVVGVKLQ-DQQV 10 20 30 40 50 70 80 90 100 110 pF1KE1 VINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASALEALEG--GGP ::: .::.:.:::::::.::::::::::.::::.:::.:. :. .: ::. ... ::: CCDS81 VINYSIVKGLKYNQATPTFHQWRDARQVYGLNFASKEEATTFSNAMLFALNIMNSQEGGP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 PPPPALPTWSVPNGPSPEEVEQQKRQ-----QPGPSEHIERRVSNAGG--PPAPPAGGPP .: :::::.:.. :.:: : .: .:::.: .: ::. :... CCDS81 SSQR-----QVQNGPSPDEMDIQRRQVMEQHQQQRQESLERRTSATGPILPPGHPSSAAS 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 PP---PGPPPPPGPPPPPGLPPSGVPAAAHGAGGGPPPAPPLPA--AQGPGGGGAGAPGL : :::::: :: :: ::.:. ::: ::::: ::: . .. :: CCDS81 APVSCSGPPPPPPPPVPP--PPTGAT---------PPPPPPLPAGGAQGSSHDESSMSGL 180 190 200 210 220 230 240 250 260 270 pF1KE1 AAAIAGAKLRKVSKQEEASGG--PT------APKAESGRSGGGGLMEEMNAMLARRRKA- ::::::::::.:.. :.:::: :. 
: .: :: .:::::::::: .::.:::: CCDS81 AAAIAGAKLRRVQRPEDASGGSSPSGTSKSDANRASSG-GGGGGLMEEMNKLLAKRRKAA 230 240 250 260 270 280 280 290 300 310 320 pF1KE1 TQVGEKTPKDESANQEE-------PEARV----PAQSESVRRPWEK-NSTTLPR---MKS .: . . : :. .: : : .:. : .::. :.:::. ::. : .. CCDS81 SQSDKPAEKKEDESQMEDPSTSPSPGTRAASQPPNSSEAGRKPWERSNSVEKPVSSILSR 290 300 310 320 330 340 330 340 350 360 pF1KE1 SSSVTTS-------ETQPCT---PSSS------DYSDLQRVKQELLEEVKKELQKVKEEI . ::. : ..:: . :..: : ::.:.:::.:::: .::.:::::: CCDS81 TPSVAKSPEAKSPLQSQPHSRMKPAGSVNDMALDAFDLDRMKQEILEEVVRELHKVKEEI 350 360 370 380 390 400 370 380 pF1KE1 IEAFVQELRKRGSP :.:. ::: CCDS81 IDAIRQELSGISTT 410 >>CCDS9955.1 EVL gene_id:51466|Hs108|chr14 (418 aa) initn: 338 init1: 338 opt: 951 Z-score: 385.9 bits: 80.3 E(32554): 3.6e-15 Smith-Waterman score: 1093; 48.0% identity (67.4% similar) in 427 aa overlap (2-374:4-412) 10 20 30 40 50 pF1KE1 MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRVVGRKMQPDQ :: ::..::.::.::: .:.:.: : :.:::..:::: ..:.::::: :.: :: CCDS99 MATSEQSICQARASVMVYDDTSKKWVPIKPGQQGFSRINIYHNTASNTFRVVGVKLQ-DQ 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 QVVINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASALEALEG--G ::::: .::.:.:::::::.::::::::::.::::.:::.:. :. .: ::. ... : CCDS99 QVVINYSIVKGLKYNQATPTFHQWRDARQVYGLNFASKEEATTFSNAMLFALNIMNSQEG 60 70 80 90 100 110 120 130 140 150 160 pF1KE1 GPPPPPALPTWSVPNGPSPEEVEQQKRQ-----QPGPSEHIERRVSNAGG--PPAPPAGG :: .: :::::.:.. :.:: : .: .:::.: .: ::. :... CCDS99 GPSSQR-----QVQNGPSPDEMDIQRRQVMEQHQQQRQESLERRTSATGPILPPGHPSSA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 PPPP---PGPPPPPGPPPPPGLPPSGVPAAAHGAGGGPPPAPPLPA--AQGPGGGGAGAP : :::::: :: :: ::.:. ::: ::::: ::: . .. CCDS99 ASAPVSCSGPPPPPPPPVPP--PPTGAT---------PPPPPPLPAGGAQGSSHDESSMS 180 190 200 210 220 230 240 250 260 270 pF1KE1 GLAAAIAGAKLRKVSKQEEASGG--PT------APKAESGRSGGGGLMEEMNAMLARRRK ::::::::::::.:.. :.:::: :. 
: .: :: .:::::::::: .::.::: CCDS99 GLAAAIAGAKLRRVQRPEDASGGSSPSGTSKSDANRASSG-GGGGGLMEEMNKLLAKRRK 230 240 250 260 270 280 280 290 300 310 320 pF1KE1 A-TQVGEKTPKDESANQEE-------PEARV----PAQSESVRRPWEK-NSTTLPR---M : .: . . : :. .: : : .:. : .::. :.:::. ::. : . CCDS99 AASQSDKPAEKKEDESQMEDPSTSPSPGTRAASQPPNSSEAGRKPWERSNSVEKPVSSIL 290 300 310 320 330 340 330 340 350 360 pF1KE1 KSSSSVTTS-------ETQPCT---PSSS------DYSDLQRVKQELLEEVKKELQKVKE . . ::. : ..:: . :..: : ::.:.:::.:::: .::.:::: CCDS99 SRTPSVAKSPEAKSPLQSQPHSRMKPAGSVNDMALDAFDLDRMKQEILEEVVRELHKVKE 350 360 370 380 390 400 370 380 pF1KE1 EIIEAFVQELRKRGSP :::.:. ::: CCDS99 EIIDAIRQELSGISTT 410

380 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 12:09:57 2016 done: Sun Nov 6 12:09:57 2016
 Total Scan time: 3.550 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]