FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6907, 204 aa
1>>>pF1KB6907 204 - 204 aa - 204 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5108+/-0.000732; mu= 12.7484+/- 0.044
mean_var=70.5128+/-13.820, 0's: 0 Z-trim(109.1): 14 B-trim: 0 in 0/53
Lambda= 0.152736
statistics sampled from 10625 (10634) to 10625 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.327), width: 16
Scan time: 1.950
The best scores are: opt bits E(32554)
CCDS11788.1 ARHGDIA gene_id:396|Hs108|chr17 ( 204) 1338 303.4 6.5e-83
CCDS77133.1 ARHGDIA gene_id:396|Hs108|chr17 ( 235) 1083 247.2 6e-66
CCDS8671.1 ARHGDIB gene_id:397|Hs108|chr12 ( 201) 938 215.2 2.2e-56
CCDS58609.1 ARHGDIA gene_id:396|Hs108|chr17 ( 160) 896 205.9 1.1e-53
CCDS10404.1 ARHGDIG gene_id:398|Hs108|chr16 ( 225) 793 183.3 1e-46
>>CCDS11788.1 ARHGDIA gene_id:396|Hs108|chr17 (204 aa)
initn: 1338 init1: 1338 opt: 1338 Z-score: 1602.0 bits: 303.4 E(32554): 6.5e-83
Smith-Waterman score: 1338; 100.0% identity (100.0% similar) in 204 aa overlap (1-204:1-204)
10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
130 140 150 160 170 180
190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD
::::::::::::::::::::::::
CCDS11 FTDDDKTDHLSWEWNLTIKKDWKD
190 200
>>CCDS77133.1 ARHGDIA gene_id:396|Hs108|chr17 (235 aa)
initn: 1083 init1: 1083 opt: 1083 Z-score: 1297.4 bits: 247.2 E(32554): 6e-66
Smith-Waterman score: 1083; 98.2% identity (99.4% similar) in 171 aa overlap (1-171:1-171)
10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
:::::::::::::::::::::::::::::::::::::::::::::::: ..
CCDS77 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGSISPSHPRPGFR
130 140 150 160 170 180
190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD
CCDS77 RERSSHSPGPVVAPGRVRLLLRGGAGVWDARPRGGRAVLQPRCSLASPLVAVGPV
190 200 210 220 230
>>CCDS8671.1 ARHGDIB gene_id:397|Hs108|chr12 (201 aa)
initn: 961 init1: 925 opt: 938 Z-score: 1125.8 bits: 215.2 E(32554): 2.2e-56
Smith-Waterman score: 938; 67.0% identity (85.0% similar) in 206 aa overlap (1-204:1-201)
10 20 30 40 50
pF1KB6 MAEQEPTAEQLAQIAAENEEDE--HSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGR
:.:. : . . :...:: ..::::: :::..:.::.::::::: :::..:::
CCDS86 MTEKAPEPH-----VEEDDDDELDSKLNYKPPPQKSLKELQEMDKDDESLIKYKKTLLGD
10 20 30 40 50
60 70 80 90 100 110
pF1KB6 VAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRV
: .::..:::::: ::::: :::::. .:::::::..::...::::: :::.:: :.:
CCDS86 GPVVTDPKAPNVVVTRLTLVCESAPGPITMDLTGDLEALKKETIVLKEGSEYRVKIHFKV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB6 NREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIK
::.::::.::.::::: :::.::. .:::::::: ::::::::::::::::::::.: :
CCDS86 NRDIVSGLKYVQHTYRTGVKVDKATFMVGSYGPRPEEYEFLTPVEEAPKGMLARGTYHNK
120 130 140 150 160 170
180 190 200
pF1KB6 SRFTDDDKTDHLSWEWNLTIKKDWKD
: :::::: :::::::::.:::.: .
CCDS86 SFFTDDDKQDHLSWEWNLSIKKEWTE
180 190 200
>>CCDS58609.1 ARHGDIA gene_id:396|Hs108|chr17 (160 aa)
initn: 916 init1: 884 opt: 896 Z-score: 1077.3 bits: 205.9 E(32554): 1.1e-53
Smith-Waterman score: 945; 77.9% identity (78.4% similar) in 204 aa overlap (1-204:1-160)
10 20 30 40 50 60
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 EIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSR
::::::::::::::::::
CCDS58 EIVSGMKYIQHTYRKGVK------------------------------------------
130
190 200
pF1KB6 FTDDDKTDHLSWEWNLTIKKDWKD
.:::::::::::::::::::::
CCDS58 --NDDKTDHLSWEWNLTIKKDWKD
140 150 160
>>CCDS10404.1 ARHGDIG gene_id:398|Hs108|chr16 (225 aa)
initn: 792 init1: 792 opt: 793 Z-score: 952.4 bits: 183.3 E(32554): 1e-46
Smith-Waterman score: 793; 61.1% identity (83.2% similar) in 190 aa overlap (15-204:36-225)
10 20 30 40
pF1KB6 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD
:... :: .:. :..::. ::..:: :
CCDS10 ACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIRQLDPD
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB6 DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVL
:.:: :::..::: . ..::..::: :: :::. .::::. .:::::: .: : :::
CCDS10 DRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKDQVFVL
70 80 90 100 110 120
110 120 130 140 150 160
pF1KB6 KEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEE
::::.::.::::.:.::::::.: ..::::.:...::: :::::::: :.::::.:::::
CCDS10 KEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFVTPVEE
130 140 150 160 170 180
170 180 190 200
pF1KB6 APKGMLARGSYSIKSRFTDDDKTDHLSWEWNLTIKKDWKD
::.: :.:: : . : :::::.: ::::::.: : .::::
CCDS10 APRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
190 200 210 220
204 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 13:07:06 2016 done: Fri Nov 4 13:07:06 2016
Total Scan time: 1.950 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]