FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9710, 348 aa
1>>>pF1KB9710 348 - 348 aa - 348 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.8607+/-0.000984; mu= -0.3334+/- 0.060
mean_var=375.0393+/-75.801, 0's: 0 Z-trim(116.4): 153 B-trim: 0 in 0/54
Lambda= 0.066227
statistics sampled from 16812 (16970) to 16812 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.808), E-opt: 0.2 (0.521), width: 16
Scan time: 3.390
The best scores are: opt bits E(32554)
CCDS2515.1 GBX2 gene_id:2637|Hs108|chr2 ( 348) 2343 237.0 1.7e-62
CCDS77545.1 GBX2 gene_id:2637|Hs108|chr2 ( 222) 1201 127.7 9.1e-30
CCDS43682.1 GBX1 gene_id:2636|Hs108|chr7 ( 363) 700 80.1 3.2e-15
>>CCDS2515.1 GBX2 gene_id:2637|Hs108|chr2 (348 aa)
initn: 2343 init1: 2343 opt: 2343 Z-score: 1235.3 bits: 237.0 E(32554): 1.7e-62
Smith-Waterman score: 2343; 100.0% identity (100.0% similar) in 348 aa overlap (1-348:1-348)
10 20 30 40 50 60
pF1KB9 MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPPPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPPPPP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 PPPALPQAALQPALPPAHPHHQIPSLPTGFCSSLAQGMALTSTLMATLPGGFSASPQHQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 PPPALPQAALQPALPPAHPHHQIPSLPTGFCSSLAQGMALTSTLMATLPGGFSASPQHQE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 AAAARKFAPQPLPGGGNFDKAEALQADAEDGKGFLAKEGSLLAFSAAETVQASLVGAVRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 AAAARKFAPQPLPGGGNFDKAEALQADAEDGKGFLAKEGSLLAFSAAETVQASLVGAVRG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 QGKDESKVEDDPKGKEESFSLESDVDYSSDDNLTGQAAHKEEDPGHALEETPPSSGAAGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 QGKDESKVEDDPKGKEESFSLESDVDYSSDDNLTGQAAHKEEDPGHALEETPPSSGAAGS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 TTSTGKNRRRRTAFTSEQLLELEKEFHCKKYLSLTERSQIAHALKLSEVQVKIWFQNRRA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 TTSTGKNRRRRTAFTSEQLLELEKEFHCKKYLSLTERSQIAHALKLSEVQVKIWFQNRRA
250 260 270 280 290 300
310 320 330 340
pF1KB9 KWKRVKAGNANSKTGEPSRNPKIVVPIPVHVSRFAIRSQHQQLEQARP
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 KWKRVKAGNANSKTGEPSRNPKIVVPIPVHVSRFAIRSQHQQLEQARP
310 320 330 340
>>CCDS77545.1 GBX2 gene_id:2637|Hs108|chr2 (222 aa)
initn: 1201 init1: 1201 opt: 1201 Z-score: 647.9 bits: 127.7 E(32554): 9.1e-30
Smith-Waterman score: 1201; 100.0% identity (100.0% similar) in 174 aa overlap (1-174:1-174)
10 20 30 40 50 60
pF1KB9 MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPPPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPPPPP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 PPPALPQAALQPALPPAHPHHQIPSLPTGFCSSLAQGMALTSTLMATLPGGFSASPQHQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 PPPALPQAALQPALPPAHPHHQIPSLPTGFCSSLAQGMALTSTLMATLPGGFSASPQHQE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 AAAARKFAPQPLPGGGNFDKAEALQADAEDGKGFLAKEGSLLAFSAAETVQASLVGAVRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 AAAARKFAPQPLPGGGNFDKAEALQADAEDGKGFLAKEGSLLAFSAAETVQASLGRLCRS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 QGKDESKVEDDPKGKEESFSLESDVDYSSDDNLTGQAAHKEEDPGHALEETPPSSGAAGS
CCDS77 AVTWGSGRGCPRARERRVKGGRRPEGQGGELLAGERCGLQLG
190 200 210 220
>>CCDS43682.1 GBX1 gene_id:2636|Hs108|chr7 (363 aa)
initn: 801 init1: 639 opt: 700 Z-score: 386.7 bits: 80.1 E(32554): 3.2e-15
Smith-Waterman score: 947; 51.9% identity (66.7% similar) in 360 aa overlap (17-348:20-363)
10 20 30 40 50
pF1KB9 MSAAFPPSLMMMQRPLGSSTAFSIDSLIGSPPQPSPGHFVYTGYPMFMPYRPVVLPP
: .:::::::::: :: : ::..::::::::::::.:::
CCDS43 MQRAGGGSAPGGNGGGGGGGPGTAFSIDSLIG-PPPPRSGHLLYTGYPMFMPYRPLVLPQ
10 20 30 40 50
60 70 80 90 100 110
pF1KB9 PPPPPPALPQAALQPALPPAHPHHQIPS-LPTGFCSSLAQGM----ALTSTL--MATLPG
: : :: .::: : .. . : . ::..:.:.. :::..: .: :
CCDS43 ALAPAP-LPA-----GLPPLAPLASFAGRLTNTFCAGLGQAVPSMVALTTALPSFAEPPD
60 70 80 90 100 110
120 130 140 150 160
pF1KB9 GFSASPQHQEAAAARKFAP----QPLPGG----GNFDKAEALQADAEDGKGFLAKEGSLL
.: . ::. :::: : .: ::: :... : : : . .:.
CCDS43 AFYG-PQELAAAAAAAAATAARNNPEPGGRRPEGGLEADELLPAREK-----VAEPPPPP
120 130 140 150 160
170 180 190 200 210
pF1KB9 AFSAAETVQASLVGAVRGQGKDESKVE---DDPKG---KEESFSLESDVDYSSDDNLTGQ
.:: :: . . ..:: :.: :: : .::. . .:. : :.. :
CCDS43 PPHFSETFP-SLPAEGKVYSSDEEKLEASAGDPAGSEQEEEGSGGDSEDDGFLDSSAGGP
170 180 190 200 210 220
220 230 240 250 260 270
pF1KB9 AAHKEEDP------GHALEETPPSSGAAGSTTSTGKNRRRRTAFTSEQLLELEKEFHCKK
.: : : . :: : . :: :. ::.:::::::::::::::::::::::
CCDS43 GALLGPKPKLKGSLGTGAEEGAPVT--AGVTAPGGKSRRRRTAFTSEQLLELEKEFHCKK
230 240 250 260 270 280
280 290 300 310 320 330
pF1KB9 YLSLTERSQIAHALKLSEVQVKIWFQNRRAKWKRVKAGNANSKTGEPSRNPKIVVPIPVH
::::::::::::::::::::::::::::::::::.::::..:..::: ::::::::::::
CCDS43 YLSLTERSQIAHALKLSEVQVKIWFQNRRAKWKRIKAGNVSSRSGEPVRNPKIVVPIPVH
290 300 310 320 330 340
340
pF1KB9 VSRFAIRSQHQQLEQ-ARP
:.:::.::::::.:: :::
CCDS43 VNRFAVRSQHQQMEQGARP
350 360
348 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:27:45 2016 done: Fri Nov 4 18:27:46 2016
Total Scan time: 3.390 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]