FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8922, 269 aa
1>>>pF1KB8922 269 - 269 aa - 269 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.8036+/-0.000903; mu= 0.4282+/- 0.055
mean_var=206.3191+/-41.486, 0's: 0 Z-trim(113.1): 153 B-trim: 0 in 0/54
Lambda= 0.089290
statistics sampled from 13656 (13810) to 13656 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.762), E-opt: 0.2 (0.424), width: 16
Scan time: 2.790
The best scores are: opt bits E(32554)
CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 ( 269) 1795 243.0 1.6e-64
CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 ( 270) 1066 149.1 3e-36
CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12 ( 222) 525 79.4 2.5e-15
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 ( 264) 497 75.8 3.4e-14
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 ( 320) 497 75.9 4e-14
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251) 487 74.5 8.1e-14
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 ( 255) 462 71.3 7.6e-13
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 ( 224) 460 71.0 8.2e-13
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 ( 233) 435 67.8 7.9e-12
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 ( 153) 431 67.2 8.1e-12
CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 ( 230) 431 67.3 1.1e-11
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 ( 235) 431 67.3 1.1e-11
CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 ( 217) 423 66.2 2.2e-11
>>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa)
initn: 1795 init1: 1795 opt: 1795 Z-score: 1271.8 bits: 243.0 E(32554): 1.6e-64
Smith-Waterman score: 1795; 100.0% identity (100.0% similar) in 269 aa overlap (1-269:1-269)
10 20 30 40 50 60
pF1KB8 MSSYFVNSFSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSSYFVNSFSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRSS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 ASSSHFGAVGESSRAFPAPAQEPRFRQAASSCSLSSPESLPCTNGDSHGAKPSASSPSDQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ASSSHFGAVGESSRAFPAPAQEPRFRQAASSCSLSSPESLPCTNGDSHGAKPSASSPSDQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 ATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQTPQIFPWM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQTPQIFPWM
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 RKLHISHDMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 RKLHISHDMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKI
190 200 210 220 230 240
250 260
pF1KB8 WFQNRRMKWKKDNKLKSMSLATAGSAFQP
:::::::::::::::::::::::::::::
CCDS11 WFQNRRMKWKKDNKLKSMSLATAGSAFQP
250 260
>>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa)
initn: 951 init1: 560 opt: 1066 Z-score: 764.2 bits: 149.1 E(32554): 3e-36
Smith-Waterman score: 1066; 62.3% identity (78.3% similar) in 281 aa overlap (1-269:1-270)
10 20 30 40 50 60
pF1KB8 MSSYFVNSFSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRSS
::::::::: ::::::::::: :::. ::.: ..:: :.::.: :::.::::::::.::.
CCDS54 MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSG
10 20 30 40 50 60
70 80 90 100
pF1KB8 ASSSHFGAVGESSRAFPAPAQ----EPRFRQAASSCSLSSPESLPCT------NGDSH-G
:.:::. :: .:.. : :. :::. : :.: .:. :::. ..::: :
CCDS54 --SGHFGS-GERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVAPSPGSDSHHG
70 80 90 100 110
110 120 130 140 150 160
pF1KB8 AKPSASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAP
.: : :. : ...:.:. .. ..::. :.: . : : :: :: . ::
CCDS54 GKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPA---SSEQASAQSEP----SPAP
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB8 EGQTPQIFPWMRKLHISHD-MTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIA
.: :::.::::::::::: . ::.:::::::::::::::::::::::::::::::::::
CCDS54 PAQ-PQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIA
180 190 200 210 220
230 240 250 260
pF1KB8 HALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
:::::::::::::::::::::::::::::::.:.::.::.:
CCDS54 HALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
230 240 250 260 270
>>CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12 (222 aa)
initn: 521 init1: 404 opt: 525 Z-score: 388.8 bits: 79.4 E(32554): 2.5e-15
Smith-Waterman score: 584; 43.0% identity (63.1% similar) in 263 aa overlap (1-257:1-218)
10 20 30 40 50
pF1KB8 MSSYFVNSFSGRYPNGPDYQLL---NYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVN
:::: .::: . :: : :.. ::::.: ...: : :.:.:::..
CCDS88 MSSYVANSFYKQSPNIPAYNMQTCGNYGSASEVQASR------------YCYGGLDLSIT
10 20 30 40
60 70 80 90 100 110
pF1KB8 RSSASSSHFGAVGESSRAFPAPAQEPRFRQA-ASSCSLSSPESLPCTNGDSHGAKPS--A
:: :: .. . .. . :. :. . . : :.
CCDS88 ------------------FPPPAPSNSLHGVDMAANPRAHPDRPACSAAAAPGHAPGRDE
50 60 70 80 90
120 130 140 150 160 170
pF1KB8 SSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQTP
..: . . ....: . ..:..:.: .: .: ..:. . .:: : :: :
CCDS88 AAPLNPGMYSQKAARPALEERAKSSGEIKEEQAQTGQPA-GLSQP-P------AP----P
100 110 120 130
180 190 200 210 220 230
pF1KB8 QIFPWMRKLHISHDMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLS
::.::: :::.::. ::::.::.::::::::::::::::::::::::::::. :::.
CCDS88 QIYPWMTKLHMSHET---DGKRSRTSYTRYQTLELEKEFHFNRYLTRRRRIEIANNLCLN
140 150 160 170 180 190
240 250 260
pF1KB8 ERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
::::::::::::::::::.:.::
CCDS88 ERQIKIWFQNRRMKWKKDSKMKSKEAL
200 210 220
>>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa)
initn: 452 init1: 384 opt: 497 Z-score: 368.2 bits: 75.8 E(32554): 3.4e-14
Smith-Waterman score: 520; 46.2% identity (67.2% similar) in 195 aa overlap (76-266:53-228)
50 60 70 80 90 100
pF1KB8 GYNYNGMDLSVNRSSASSSHFGAVGESSRAFPAPAQEPRFRQAASSC-SLSSPESLPCTN
.: : .: . . :: ::..:
CCDS88 YSQNSYIPEHSPEYYGRTRESGFQHHHQELYPPPPPRPSYPERQYSCTSLQGP-------
30 40 50 60 70
110 120 130 140 150 160
pF1KB8 GDSHGAKPSASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMAT
:.:.: : :.. . ...: . . .. :::: : : .:: :
CCDS88 GNSRGHGP-AQAGHHHPEKSQSLCEPAPLSGASASPSPAPPAC---------SQPAPDHP
80 90 100 110 120
170 180 190 200 210 220
pF1KB8 STAAPEGQTPQIFPWMRKLHISH---DMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTR
:.:: .. : ..:::.:.:.: ...: . ::.:::::: :.::::::::.::::::
CCDS88 SSAA--SKQPIVYPWMKKIHVSTVNPNYNGGEPKRSRTAYTRQQVLELEKEFHYNRYLTR
130 140 150 160 170 180
230 240 250 260
pF1KB8 RRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
::::::::.::::::::::::::::::::::..: . .. .: :
CCDS88 RRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLPNTKVRSAPPAGAAPSTLSAATPGTS
190 200 210 220 230 240
CCDS88 EDHSQSATPPEQQRAEDITRL
250 260
>>CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 (320 aa)
initn: 494 init1: 404 opt: 497 Z-score: 367.0 bits: 75.9 E(32554): 4e-14
Smith-Waterman score: 531; 35.6% identity (59.6% similar) in 292 aa overlap (1-266:3-287)
10 20 30 40 50
pF1KB8 MSSYFVNS--FSGRYPNGPDYQLLNYGSGSSLSG-----SYRDPAAMHTGSYGYNYNG
:::...:: . ..: .: . :::.. .: .:..: : : .
CCDS54 MTMSSFLINSNYIEPKFPPFEEYAQ-HSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQ
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 MDLSVNRSSASSSHFGAVGESSRAFPAPAQEPRFRQAASSCSLSSPESLPCTNGDSHGAK
. . . ..:... :.:: : : : .. . . : . .: : :
CCDS54 LPHAGGGREPTASYYAPRTAREPAYPAAALYP----AHGAADTAYPYGY--RGGASPGRP
60 70 80 90 100 110
120 130 140 150
pF1KB8 PSASSPSDQATSASSSANFTEI----------DEASASSEP---EEAASQLSSPSLARAQ
:. .: :: . . . . ... .: . : : : . . :. . :
CCDS54 PQPEQPPAQAKGPAHGLHASHVLQPQLPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAP
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB8 PEPMATSTAAP---EGQTPQIFPWMRKLHISH---DMTGPDGKRARTAYTRYQTLELEKE
:. . .: .:. : ..:::.:.:.: ...: . ::.:::::: :.::::::
CCDS54 ACPLLLADKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKE
180 190 200 210 220 230
220 230 240 250 260
pF1KB8 FHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
:::::::::::::::::.:::::::.::::::::::::::.:: . .. ...::
CCDS54 FHFNRYLTRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGP
240 250 260 270 280 290
CCDS54 PGKAQTQSPHLHPHPHPSTSTPVPSSI
300 310 320
>>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa)
initn: 531 init1: 398 opt: 487 Z-score: 361.6 bits: 74.5 E(32554): 8.1e-14
Smith-Waterman score: 487; 58.7% identity (77.8% similar) in 126 aa overlap (144-266:111-234)
120 130 140 150 160 170
pF1KB8 ASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQT
::.: ::: .:. : . .
CCDS11 PPPPPPPGLSPRAPAPPPAGALLPEPGQRCEAVS--SSPPPPPCAQNPLHPSPSHSACKE
90 100 110 120 130
180 190 200 210 220 230
pF1KB8 PQIFPWMRKLHISH---DMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHA
: ..:::::.:.: ...: . ::.:::::: :.::::::::.:::::::::.:::::
CCDS11 PVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHA
140 150 160 170 180 190
240 250 260
pF1KB8 LCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
::::::::::::::::::::::.:: . .. ..:.:
CCDS11 LCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGPPGRPNGGPRAL
200 210 220 230 240 250
>>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa)
initn: 498 init1: 399 opt: 462 Z-score: 344.1 bits: 71.3 E(32554): 7.6e-13
Smith-Waterman score: 466; 41.5% identity (62.8% similar) in 207 aa overlap (63-266:34-226)
40 50 60 70 80 90
pF1KB8 SYRDPAAMHTGSYGYNYNGMDLSVNRSSASSSHFGAVGESSRAFPAPAQEPRFRQAASSC
....:. : .. : :. :: . .
CCDS22 SSYMVNSKYVDPKFPPCEEYLQGGYLGEQGADYYGG-GAQGADFQPPGLYPRPDFGEQPF
10 20 30 40 50 60
100 110 120 130 140 150
pF1KB8 SLSSPESLPCTNGDSHGAKPSASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSP
. :.: . .:: .:.. . : . : . : : . :.:
CCDS22 GGSGPGPGSALPARGHGQEPGGPGGHYAAPGEPCPAP----PAPPPAPLPGARAYSQSDP
70 80 90 100 110
160 170 180 190 200
pF1KB8 SLARAQPEPMATSTAAPEGQTPQIFPWMRKLHISH---DMTGPDGKRARTAYTRYQTLEL
. :: : .:. : ..:::.:.:.. ..:: . ::.:::::: :.:::
CCDS22 K----QP-PSGTALKQPA----VVYPWMKKVHVNSVNPNYTGGEPKRSRTAYTRQQVLEL
120 130 140 150 160
210 220 230 240 250 260
pF1KB8 EKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
::::::::::::::::::::.::::::::::::::::::::::.:: . . ...:.
CCDS22 EKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGRSSSSSSSS
170 180 190 200 210 220
CCDS22 SCSSSVAPSQHLQPMAKDHHTDLTTL
230 240 250
>>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa)
initn: 507 init1: 411 opt: 460 Z-score: 343.5 bits: 71.0 E(32554): 8.2e-13
Smith-Waterman score: 482; 40.0% identity (58.1% similar) in 270 aa overlap (1-263:1-215)
10 20 30 40 50
pF1KB8 MSSYFVNS-FSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRS
:::::::: : .: . : :. :..: :: . . :: . :.: ..
CCDS11 MSSYFVNSTFPVTLASGQESFL---GQLPLYSSGYADPLRHYPAPYGPG-PGQD----KG
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 SASSSHFGAVGES-SRAFP---APAQEPRF-RQAASSCSLSSPESLPCTNGDSHGAKPSA
:.::.. .: . .:: : .:: : : :. :.:.::. . : :
CCDS11 FATSSYYPPAGGGYGRAAPCDYGPA--PAFYREKESACALSGADEQP----------PFH
60 70 80 90 100
120 130 140 150 160 170
pF1KB8 SSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQTP
: .. :.... : : .: . : ::
CCDS11 PEPR-KSDCAQDKSVFGETEEQKCS---------------------------------TP
110 120
180 190 200 210 220 230
pF1KB8 QIFPWMRKLHISHDMT-GPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCL
..:::.... .. . ::.:.:.: .::::::::::::::.::::::::::::::::::
CCDS11 -VYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHALCL
130 140 150 160 170 180
240 250 260
pF1KB8 SERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
.:::::::::::::::::..:: : : .:
CCDS11 TERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE
190 200 210 220
>>CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 (233 aa)
initn: 477 init1: 389 opt: 435 Z-score: 325.8 bits: 67.8 E(32554): 7.9e-12
Smith-Waterman score: 459; 38.8% identity (59.3% similar) in 263 aa overlap (1-255:1-216)
10 20 30 40 50
pF1KB8 MSSYFVN-SFSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRS
::::::: .: : :.: : :. :. : : .:... :
CCDS54 MSSYFVNPTFPGSLPSGQD----------SFLGQL--PL------YQAGYDAL-----RP
10 20 30
60 70 80 90 100 110
pF1KB8 SASSSHFGAVGESSRAFPAPAQEPRFRQAASS---CSLSSPE-SLPC--TNGDSHGAKPS
.: .:: . .... .: : : ..: :. .: : . : .. : ::.::
CCDS54 FPAS--YGASSLPDKTYTSPC----FYQQSNSVLACNRASYEYGASCFYSDKDLSGASPS
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB8 ASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQT
.:. . . .. .:. :: : .:. . .: . . : . .
CCDS54 GSG---KQRGPGDYLHFS----------PE----QQYKPDSSSGQGKALHDEGADRKYTS
100 110 120 130
180 190 200 210 220 230
pF1KB8 PQIFPWMRKLH-ISHDMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALC
: ..:::.... . . : :.:.: .::::::::::::::::::::::::::::.:::
CCDS54 P-VYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIANALC
140 150 160 170 180 190
240 250 260
pF1KB8 LSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
:.:::::::::::::::::.:::
CCDS54 LTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKAGE
200 210 220 230
>>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa)
initn: 399 init1: 338 opt: 431 Z-score: 325.6 bits: 67.2 E(32554): 8.1e-12
Smith-Waterman score: 431; 59.6% identity (78.9% similar) in 114 aa overlap (157-266:19-131)
130 140 150 160 170 180
pF1KB8 SANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQ--TPQIFPWMRKLH
:: . .::. : . ::.:::....
CCDS41 MLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASIQIYPWMQRMN
10 20 30 40
190 200 210 220 230 240
pF1KB8 ISHDMTG--PDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWF
::. .: : .:.: :.::::::::::::::::::::::::::.::::.::::::::
CCDS41 -SHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWF
50 60 70 80 90 100
250 260
pF1KB8 QNRRMKWKKDNKLKSMSLATAGSAFQP
:::::::::...: : . .:.:
CCDS41 QNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE
110 120 130 140 150
269 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:30:39 2016 done: Fri Nov 4 16:30:40 2016
Total Scan time: 2.790 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]