FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6907, 204 aa
  1>>>pF1KB6907 204 - 204 aa - 204 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5108+/-0.000732; mu= 12.7484+/- 0.044
 mean_var=70.5128+/-13.820, 0's: 0 Z-trim(109.1): 14 B-trim: 0 in 0/53
 Lambda= 0.152736
 statistics sampled from 10625 (10634) to 10625 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.327), width: 16
 Scan time: 1.950

The best scores are:                                 opt  bits E(32554)
CCDS11788.1 ARHGDIA gene_id:396|Hs108|chr17 ( 204)  1338 303.4 6.5e-83
CCDS77133.1 ARHGDIA gene_id:396|Hs108|chr17 ( 235)  1083 247.2   6e-66
CCDS8671.1 ARHGDIB gene_id:397|Hs108|chr12  ( 201)   938 215.2 2.2e-56
CCDS58609.1 ARHGDIA gene_id:396|Hs108|chr17 ( 160)   896 205.9 1.1e-53
CCDS10404.1 ARHGDIG gene_id:398|Hs108|chr16 ( 225)   793 183.3   1e-46

>>CCDS11788.1 ARHGDIA gene_id:396|Hs108|chr17 (204 aa)
 initn: 1338 init1: 1338 opt: 1338 Z-score: 1602.0 bits: 303.4 E(32554): 6.5e-83
Smith-Waterman score: 1338; 100.0% identity (100.0% similar) in 204 aa overlap (1-204:1-204)

       10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
       130 140 150 160 170 180

       190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD
       ::::::::::::::::::::::::
CCDS11 FTDDDKTDHLSWEWNLTIKKDWKD
       190 200

>>CCDS77133.1 ARHGDIA gene_id:396|Hs108|chr17 (235 aa)
 initn: 1083 init1: 1083 opt: 1083 Z-score: 1297.4 bits: 247.2 E(32554): 6e-66
Smith-Waterman score: 1083; 98.2% identity (99.4% similar) in 171 aa overlap (1-171:1-171)

       10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
       :::::::::::::::::::::::::::::::::::::::::::::::: ..
CCDS77 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGSISPSHPRPGFR
       130 140 150 160 170 180

       190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD

CCDS77 RERSSHSPGPVVAPGRVRLLLRGGAGVWDARPRGGRAVLQPRCSLASPLVAVGPV
       190 200 210 220 230

>>CCDS8671.1 ARHGDIB gene_id:397|Hs108|chr12 (201 aa)
 initn: 961 init1: 925 opt: 938 Z-score: 1125.8 bits: 215.2 E(32554): 2.2e-56
Smith-Waterman score: 938; 67.0% identity (85.0% similar) in 206 aa overlap (1-204:1-201)

       10 20 30 40 50
pF1KB6 MAEQEPTAEQLAQIAAENEEDE--HSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGR
       :.:. : . . :...:: ..::::: :::..:.::.::::::: :::..:::
CCDS86 MTEKAPEPH-----VEEDDDDELDSKLNYKPPPQKSLKELQEMDKDDESLIKYKKTLLGD
       10 20 30 40 50

       60 70 80 90 100 110
pF1KB6 VAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRV
       : .::..:::::: ::::: :::::. .:::::::..::...::::: :::.:: :.:
CCDS86 GPVVTDPKAPNVVVTRLTLVCESAPGPITMDLTGDLEALKKETIVLKEGSEYRVKIHFKV
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KB6 NREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIK
       ::.::::.::.::::: :::.::. .:::::::: ::::::::::::::::::::.: :
CCDS86 NRDIVSGLKYVQHTYRTGVKVDKATFMVGSYGPRPEEYEFLTPVEEAPKGMLARGTYHNK
       120 130 140 150 160 170

       180 190 200
pF1KB6 SRFTDDDKTDHLSWEWNLTIKKDWKD
       : :::::: :::::::::.:::.: .
CCDS86 SFFTDDDKQDHLSWEWNLSIKKEWTE
       180 190 200

>>CCDS58609.1 ARHGDIA gene_id:396|Hs108|chr17 (160 aa)
 initn: 916 init1: 884 opt: 896 Z-score: 1077.3 bits: 205.9 E(32554): 1.1e-53
Smith-Waterman score: 945; 77.9% identity (78.4% similar) in 204 aa overlap (1-204:1-160)

       10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
       ::::::::::::::::::
CCDS58 EIVSGMKYIQHTYRKGVK------------------------------------------
       130

       190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD
       .:::::::::::::::::::::
CCDS58 --NDDKTDHLSWEWNLTIKKDWKD
       140 150 160

>>CCDS10404.1 ARHGDIG gene_id:398|Hs108|chr16 (225 aa)
 initn: 792 init1: 792 opt: 793 Z-score: 952.4 bits: 183.3 E(32554): 1e-46
Smith-Waterman score: 793; 61.1% identity (83.2% similar) in 190 aa overlap (15-204:36-225)

       10 20 30 40
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD
       :... :: .:. :..::. ::..:: :
CCDS10 ACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIRQLDPD
       10 20 30 40 50 60

       50 60 70 80 90 100
pF1KB6 DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVL
       :.:: :::..::: . ..::..::: :: :::. .::::. .:::::: .: : :::
CCDS10 DRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKDQVFVL
       70 80 90 100 110 120

       110 120 130 140 150 160
pF1KB6 KEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEE
       ::::.::.::::.:.::::::.: ..::::.:...::: :::::::: :.::::.:::::
CCDS10 KEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFVTPVEE
       130 140 150 160 170 180

       170 180 190 200
pF1KB6 APKGMLARGSYSIKSRFTDDDKTDHLSWEWNLTIKKDWKD
       ::.: :.:: : . : :::::.: ::::::.: : .::::
CCDS10 APRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
       190 200 210 220

204 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 13:07:06 2016 done: Fri Nov 4 13:07:06 2016
 Total Scan time: 1.950 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]