FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7121, 305 aa
1>>>pF1KB7121 305 - 305 aa - 305 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9550+/-0.000676; mu= 13.3818+/- 0.041
mean_var=80.0460+/-15.615, 0's: 0 Z-trim(111.6): 7 B-trim: 8 in 1/50
Lambda= 0.143352
statistics sampled from 12484 (12487) to 12484 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.743), E-opt: 0.2 (0.384), width: 16
Scan time: 2.860
The best scores are: opt bits E(32554)
CCDS6310.1 TMEM74 gene_id:157753|Hs108|chr8 ( 305) 1998 422.1 2.6e-118
CCDS13011.1 TMEM74B gene_id:55321|Hs108|chr20 ( 256) 498 111.9 5.4e-25
>>CCDS6310.1 TMEM74 gene_id:157753|Hs108|chr8 (305 aa)
initn: 1998 init1: 1998 opt: 1998 Z-score: 2237.6 bits: 422.1 E(32554): 2.6e-118
Smith-Waterman score: 1998; 100.0% identity (100.0% similar) in 305 aa overlap (1-305:1-305)
10 20 30 40 50 60
pF1KB7 MELHYLAKKSNQADLCDARDWSSRGLPGDQADTAATRAALCCQKQCASTPRATEMEGSKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MELHYLAKKSNQADLCDARDWSSRGLPGDQADTAATRAALCCQKQCASTPRATEMEGSKL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 SSSPASPSSSLQNSTLQPDAFPPGLLHSGNNQITAERKVCNCCSQELETSFTYVDKNINL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 SSSPASPSSSLQNSTLQPDAFPPGLLHSGNNQITAERKVCNCCSQELETSFTYVDKNINL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 EQRNRSSPSAKGHNHPGELGWENPNEWSQEAAISLISEEEDDTSSEATSSGKSIDYGFIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 EQRNRSSPSAKGHNHPGELGWENPNEWSQEAAISLISEEEDDTSSEATSSGKSIDYGFIS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 AILFLVTGILLVIISYIVPREVTVDPNTVAAREMERLEKESARLGAHLDRCVIAGLCLLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 AILFLVTGILLVIISYIVPREVTVDPNTVAAREMERLEKESARLGAHLDRCVIAGLCLLT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 LGGVILSCLLMMSMWKGELYRRNRFASSKESAKLYGSFNFRMKTSTNENTLELSLVEEDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 LGGVILSCLLMMSMWKGELYRRNRFASSKESAKLYGSFNFRMKTSTNENTLELSLVEEDA
250 260 270 280 290 300
pF1KB7 LAVQS
:::::
CCDS63 LAVQS
>>CCDS13011.1 TMEM74B gene_id:55321|Hs108|chr20 (256 aa)
initn: 543 init1: 473 opt: 498 Z-score: 562.2 bits: 111.9 E(32554): 5.4e-25
Smith-Waterman score: 514; 45.9% identity (65.7% similar) in 242 aa overlap (78-303:24-246)
50 60 70 80 90
pF1KB7 STPRATEMEGSKLSSSPASPSSSLQNSTLQPDAFPPGL-LHSGNNQITAERK--------
: : :::: :.. .: : :.
CCDS13 MPPAQGYEFAAAKGPRDELGPSFPMASPPGLELKTLSNGPQAPRRSAPLGPVA
10 20 30 40 50
100 110 120 130 140 150
pF1KB7 -----VCNCC--SQELETSFTYVDKNINLEQRNRSSPSAKGHNHPGELGWENPNEWSQEA
: : : :.: :: : .: . . : :::: :: .. : ::.
CCDS13 PTREGVENACFSSEEHETHF----QNPG-NTRLGSSPSP-----PGGVS-SLPR--SQRD
60 70 80 90 100
160 170 180 190 200 210
pF1KB7 AISLISEEEDDTSSEATSSGKSIDYGFISAILFLVTGILLVIISYIVPREVTVDPNTVAA
.:: ::: . : .: . .::::.::..:::.:::::. .: .:::. :.:.::.:
CCDS13 DLSLHSEE--GPALEPVS--RPVDYGFVSALVFLVSGILLVVTAYAIPREARVNPDTVTA
110 120 130 140 150
220 230 240 250 260 270
pF1KB7 REMERLEKESARLGAHLDRCVIAGLCLLTLGGVILSCLLMMSMWKGELYRRNRFASSKES
::::::: ::::.:::::.:::: :::.::..:: :::.:. ::::::: :. .: :
CCDS13 REMERLEMYYARLGSHLDRCIIAGLGLLTVGGMLLSVLLMVSLCKGELYRRRTFVPGKGS
160 170 180 190 200 210
280 290 300
pF1KB7 AKLYGSFNFRMKTSTNENTLELSLVEEDALAVQS
: :::.:.::. .... .:::.... :
CCDS13 RKTYGSINLRMRQLNGDGGQ--ALVENEVVQVSETSHTLQRS
220 230 240 250
305 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 04:57:34 2016 done: Fri Nov 4 04:57:34 2016
Total Scan time: 2.860 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]
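
Note: the per-hit records in this report (a ">>" header line followed by an "initn: ... E(...)" score line) can be pulled out programmatically. The following is a minimal Python sketch, not part of the FASTA distribution. The regular expressions are assumptions inferred only from the two hits shown in this report, and the names parse_hits, HIT_RE and SCORE_RE are hypothetical. Other FASTA versions or output modes may lay these lines out differently.

```python
import re

# Assumed layout of a hit header, e.g.
# ">>CCDS6310.1 TMEM74 gene_id:157753|Hs108|chr8 (305 aa)"
HIT_RE = re.compile(r"^>>(?P<id>\S+)\s+(?P<desc>.*?)\s+\((?P<length>\d+) aa\)\s*$")

# Assumed layout of the score line that follows each hit header.
SCORE_RE = re.compile(
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<z>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>\S+)"
)


def parse_hits(text):
    """Return one dict per hit: id, desc, length, opt, zscore, bits, evalue."""
    hits = []
    current = None  # header seen, score line not yet seen
    for line in text.splitlines():
        header = HIT_RE.match(line)
        if header:
            current = {
                "id": header["id"],
                "desc": header["desc"],
                "length": int(header["length"]),
            }
            continue
        score = SCORE_RE.search(line)
        if score and current is not None:
            current.update(
                opt=int(score["opt"]),
                zscore=float(score["z"]),
                bits=float(score["bits"]),
                evalue=float(score["evalue"]),
            )
            hits.append(current)
            current = None
    return hits
```

Calling parse_hits on the text of this report should yield two records, one for CCDS6310.1 and one for CCDS13011.1. The query line beginning "1>>>" is skipped because it does not start with ">>".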