FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5973, 519 aa
1>>>pF1KB5973 519 - 519 aa - 519 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.8041+/-0.000833; mu= 9.2423+/- 0.051
mean_var=182.4933+/-37.567, 0's: 0 Z-trim(113.9): 55 B-trim: 696 in 1/51
Lambda= 0.094940
statistics sampled from 14454 (14508) to 14454 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.446), width: 16
Scan time: 3.640
The best scores are: opt bits E(32554)
CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5 ( 519) 3501 491.6 8.9e-139
CCDS75225.1 IRX4 gene_id:50805|Hs108|chr5 ( 545) 2589 366.7 3.7e-101
CCDS32449.1 IRX6 gene_id:79190|Hs108|chr16 ( 446) 876 132.0 1.4e-30
CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16 ( 501) 703 108.4 2e-23
CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16 ( 483) 702 108.2 2.2e-23
CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16 ( 482) 694 107.1 4.6e-23
CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5 ( 471) 568 89.9 7.1e-18
CCDS34132.1 IRX1 gene_id:79192|Hs108|chr5 ( 480) 564 89.3 1.1e-17
>>CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5 (519 aa)
initn: 3501 init1: 3501 opt: 3501 Z-score: 2604.9 bits: 491.6 E(32554): 8.9e-139
Smith-Waterman score: 3501; 100.0% identity (100.0% similar) in 519 aa overlap (1-519:1-519)
10 20 30 40 50 60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVKEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVKEA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 SGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPAGASA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPAGASA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 GLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPATSPSVALPH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 GLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPATSPSVALPH
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 SGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGRSLGAGANV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGRSLGAGANV
430 440 450 460 470 480
490 500 510
pF1KB5 LTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
:::::::::::::::::::::::::::::::::::::::
CCDS38 LTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
490 500 510
>>CCDS75225.1 IRX4 gene_id:50805|Hs108|chr5 (545 aa)
initn: 2589 init1: 2589 opt: 2589 Z-score: 1929.6 bits: 366.7 E(32554): 3.7e-101
Smith-Waterman score: 3439; 95.2% identity (95.2% similar) in 545 aa overlap (1-519:1-545)
10 20 30 40 50 60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
70 80 90 100 110 120
130 140 150
pF1KB5 AYYPYEPALGQYPYDR--------------------------YGTMDSGTRRKNATRETT
:::::::::::::::: ::::::::::::::::::
CCDS75 AYYPYEPALGQYPYDRIKRLGGHPHKGIGLDLSGLGRSPGSLYGTMDSGTRRKNATRETT
130 140 150 160 170 180
160 170 180 190 200 210
pF1KB5 STLKAWLQEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 STLKAWLQEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKC
190 200 210 220 230 240
220 230 240 250 260 270
pF1KB5 ADEKRPYAEGEEEEGGEEEAREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ADEKRPYAEGEEEEGGEEEAREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACE
250 260 270 280 290 300
280 290 300 310 320 330
pF1KB5 LKPPFHSLDGGLERVPAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LKPPFHSLDGGLERVPAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGP
310 320 330 340 350 360
340 350 360 370 380 390
pF1KB5 EPLPGAEGGPQVCEAKLGFVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 EPLPGAEGGPQVCEAKLGFVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCM
370 380 390 400 410 420
400 410 420 430 440 450
pF1KB5 LKRQGPAAPAAVSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LKRQGPAAPAAVSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLN
430 440 450 460 470 480
460 470 480 490 500 510
pF1KB5 QAWATAKGALLDPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 QAWATAKGALLDPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGG
490 500 510 520 530 540
pF1KB5 KPFCA
:::::
CCDS75 KPFCA
>>CCDS32449.1 IRX6 gene_id:79190|Hs108|chr16 (446 aa)
initn: 765 init1: 452 opt: 876 Z-score: 662.7 bits: 132.0 E(32554): 1.4e-30
Smith-Waterman score: 973; 44.1% identity (66.4% similar) in 440 aa overlap (1-431:1-407)
10 20 30 40 50 60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
::.:.::.:: .: ::: ...: .::::: :...: . ... . : :.::::..:
CCDS32 MSFPHFGHPYRGASQFLASASSSTTCCESTQRSVSDVASGSTPAPALCCAPYDSRLLGSA
10 20 30 40 50 60
70 80 90 100 110
pF1KB5 RHELNSAAALGVYGGPYGGS---QGYGNYVTYGSEASAFY-SLN-SFDSKDGSGSAHGGL
: :: .::::.::.::... :.: .:. :. : ..: .:: ... :...:: ..:
CCDS32 RPEL--GAALGIYGAPYAAAAAAQSYPGYLPYSPEPPSLYGALNPQYEFKEAAGSFTSSL
70 80 90 100 110
120 130 140 150 160 170
pF1KB5 APAAAAYYPYEPALGQYPYDRYGTMD-SGT-RRKNATRETTSTLKAWLQEHRKNPYPTKG
: .: ::::: .:::: :.:::... ::. :::::::::::::::::.:::::::::::
CCDS32 AQPGA-YYPYERTLGQYQYERYGAVELSGAGRRKNATRETTSTLKAWLNEHRKNPYPTKG
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 EKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEE
::::::::::::::::::::::::::::::::::: :.:: ..:.. :::::.
CCDS32 EKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWAPKNKGGEERKA-------EGGEED
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB5 AREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAP
. ..:.. ...: . :.::::.:.. : : : . : : .:
CCDS32 SLGCLTADTKEVT-ASQEARGLRLSDLEDLEEEEEEEEEAEDE----------EVVATAG
240 250 260 270
300 310 320 330 340 350
pF1KB5 DGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGF
: .. .:: .:: . .:: . ::: : : .: :.. . :. :
CCDS32 DRLTEFRKGA--QSLPGPCAAAREGRLER-RECGLAAPRFSFNDPSGSEEADFLSAETGS
280 290 300 310 320 330
360 370 380 390 400 410
pF1KB5 VPAGASAGLEAKPRIWSLAHTATAAAA--AATSLSQTEFPSCMLKRQGPAAPAAVSSAPA
:::::::::::::.:. : . . . : : :. :. : ::
CCDS32 PRLTMHYPCLEKPRIWSLAHTATASAVEGAPPARPRPRSPEC---RMIPGQP------PA
340 350 360 370 380
420 430 440 450 460 470
pF1KB5 TSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLG
.. ...:...: :. . :
CCDS32 SARRLSVPRDSACDESSCIPKAFGNPKFALQGLPLNCAPCPRRSEPVVQCQYPSGAEAG
390 400 410 420 430 440
>>CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16 (501 aa)
initn: 654 init1: 473 opt: 703 Z-score: 533.9 bits: 108.4 E(32554): 2e-23
Smith-Waterman score: 739; 36.7% identity (56.9% similar) in 534 aa overlap (1-519:1-465)
10 20 30 40 50 60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
::.::.:: : : :. .. .. ::: . : .: .:.: :.: :..
CCDS10 MSFPQLGYQYIR-P--LYPSERPGAAGGSGGSAGARGGLGAGA----------SELNASG
10 20 30 40
70 80 90 100 110
pF1KB5 RHELNSAAALGVYGGPYGGS------QGYGNYVTYGSEASAFYSLNS-FDSKDGSGSAHG
:... . .:::.::... :::: .. :..: : .:.. .. ::. : :
CCDS10 --SLSNVLS-SVYGAPYAAAAAAAAAQGYGAFLPYAAELPIFPQLGAQYELKDSPGVQH-
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB5 GLAPAAAAYYPY-EPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTK
::::: .:. .::. ::: .: : .: ::::::.::::::::.::::::::::
CCDS10 ---PAAAAAFPHPHPAF--YPYGQYQFGDP-SRPKNATRESTSTLKAWLNEHRKNPYPTK
110 120 130 140 150
180 190 200 210 220 230
pF1KB5 GEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEE
:::::::::::::::::::::::::::::::::::: ::.. .: :. .::: ::
CCDS10 GEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWAPRSRTDEEGNAYGSEREEEDEEE
160 170 180 190 200 210
240 250 260 270 280
pF1KB5 EAREEPLKSSKNAEPVGKEEKELE---LSDLDDFDPLEAEPPACELKPPFHSLDGGLERV
. .. . . : .: ::.. :.: :. . .. : : :: :. .:
CCDS10 DEEDGKRELELEEEELGGEEEDTGGEGLADDDEDEEIDLENLDGAATEPELSLAGAARRD
220 230 240 250 260 270
290 300 310 320 330 340
pF1KB5 PAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEA
::.... . : . .. .: :: : : : : : : : ..:..
CCDS10 GDLGLGPISDS----KNSDSEDSSEGL-ED--RPLPVLSLA---PAPPPVAVASPSLPSP
280 290 300 310 320
350 360 370 380 390 400
pF1KB5 KLGF---VPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPA-APAA
... .:: : :. ::.:::::.:::. : :. . : : ::.:
CCDS10 PVSLDPCAPAPAPASALQKPKIWSLAETATSPDNPRRS-----PPGAGGSPPGAAVAPSA
330 340 350 360 370 380
410 420 430 440 450 460
pF1KB5 VSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALL
.. .::.. ..: : : ..:. .. :.. : : : :
CCDS10 LQLSPAAAAAAA--H-----RLVSAPLGKFPAWTNRPFPGP-------------PPGPRL
390 400 410 420
470 480 490 500 510
pF1KB5 DPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
: : ::.. : ..: :. . : ::. :: : :. :: :.
CCDS10 HPLSL---LGSAP-----PHLLGLPGAAGHPAAAAAFARP--AEPE-GGTDRCSALEVEK
430 440 450 460 470
CCDS10 KLLKTAFQPVPRRPQNHLDAALVLSALSSS
480 490 500
>>CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16 (483 aa)
initn: 562 init1: 451 opt: 702 Z-score: 533.4 bits: 108.2 E(32554): 2.2e-23
Smith-Waterman score: 702; 38.9% identity (59.4% similar) in 406 aa overlap (39-422:11-394)
10 20 30 40 50 60
pF1KB5 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA
:.:: : ::.: . ... : . . .
CCDS10 MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS
10 20 30
70 80 90 100 110 120
pF1KB5 ALGVYGGPYGGSQ-------GYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA
. : .::.:: ::.... ::.. .: . .:.: :: : :.: ..
CCDS10 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
.:.:: ::.::: . . ::::::..:.::::::.::::::::::::::::::
CCDS10 GYHPYAAPLGSYPY------GDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
:::::::::::::::::::::::::::: :::. :: : ::. :.. ..:: :
CCDS10 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK
160 170 180 190 200
250 260 270 280 290
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE
...: : : : . . . :.. ::. : :: :. ... :. .: .
CCDS10 PEDKGDPEGPEAGGAEQKAASGCERLQG-PPTPAGKETEGSLSDSDFKEPPS--EGRLDA
210 220 230 240 250 260
300 310 320 330 340 350
pF1KB5 ASGALRMS--LAAGGGAA-LDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG
.: : . :: .:: : :: . . : ::.: .: . :::.: ..
CCDS10 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP--
270 280 290 300 310 320
360 370 380 390 400
pF1KB5 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV
: .. :::..::::. ::.. . . . .: : : : : . :.
CCDS10 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP
330 340 350 360 370
410 420 430 440 450 460
pF1KB5 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD
. ::. :::. : :
CCDS10 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH
380 390 400 410 420 430
>>CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16 (482 aa)
initn: 547 init1: 436 opt: 694 Z-score: 527.5 bits: 107.1 E(32554): 4.6e-23
Smith-Waterman score: 694; 38.9% identity (59.4% similar) in 406 aa overlap (39-422:11-393)
10 20 30 40 50 60
pF1KB5 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA
:.:: : ::.: . ... : . . .
CCDS58 MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS
10 20 30
70 80 90 100 110 120
pF1KB5 ALGVYGGPYGGSQ-------GYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA
. : .::.:: ::.... ::.. .: . .:.: :: : :.: ..
CCDS58 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
.:.:: ::.::: . . ::::::..:.::::::.::::::::::::::::::
CCDS58 GYHPYAAPLGSYPY------GDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
:::::::::::::::::::::::::::: :::. :: : ::. :.. ..:: :
CCDS58 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK
160 170 180 190 200
250 260 270 280 290
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE
...: : : : . . . :.. ::. : :: :. ... :. .: .
CCDS58 PEDKGDPEGPEAGA-EQKAASGCERLQG-PPTPAGKETEGSLSDSDFKEPPS--EGRLDA
210 220 230 240 250 260
300 310 320 330 340 350
pF1KB5 ASGALRMS--LAAGGGAA-LDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG
.: : . :: .:: : :: . . : ::.: .: . :::.: ..
CCDS58 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP--
270 280 290 300 310
360 370 380 390 400
pF1KB5 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV
: .. :::..::::. ::.. . . . .: : : : : . :.
CCDS58 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP
320 330 340 350 360 370
410 420 430 440 450 460
pF1KB5 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD
. ::. :::. : :
CCDS58 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH
380 390 400 410 420 430
>>CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5 (471 aa)
initn: 554 init1: 443 opt: 568 Z-score: 434.4 bits: 89.9 E(32554): 7.1e-18
Smith-Waterman score: 638; 36.8% identity (55.7% similar) in 476 aa overlap (40-481:11-439)
10 20 30 40 50 60
pF1KB5 YSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAAA
: .. : ::.: . ::. : : . .:
CCDS38 MSYPQGYLYQAPGSLALYSCPAYGASALAAPRSEELARSA
10 20 30 40
70 80 90 100 110 120
pF1KB5 LGVYGGPYGGSQ--------GYGNYVTYGSEASAFYSLNSFDSKDGSG-SAHGGLAPAAA
: .:: :: :.:. . :...:.: . .: : :. .:: .:
CCDS38 SGSAFSPYPGSAAFTAQAATGFGSPLQYSADAAA--AAAGFPSYMGAPYDAHTTGMTGAI
50 60 70 80 90
130 140 150 160 170 180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
.:.:: : :::. ... . ::::::..:.::::::.::::::::::::::::::
CCDS38 SYHPYGSA--AYPYQ----LNDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
100 110 120 130 140 150
190 200 210 220 230
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREE-PL
:::::::::::::::::::::::::::: :::: :: .:.:: ....: :
CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWAPRNKSEDE-------DEDEGDATRSKDESPD
160 170 180 190 200
240 250 260 270 280 290
pF1KB5 KSSKNAEPVGKEEK-ELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVK
:.....: ...: :....: : .: . :: :..: :.
CCDS38 KAQEGTETSAEDEGISLHVDSLTDH--------SCSAES-----DG--EKLPCRAGDPLC
210 220 230 240 250
300 310 320 330 340 350
pF1KB5 EASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEG---GPQVCEAKLGF--
: ::. . :.: : :. . :: : :. .: : :
CCDS38 E-SGSECKDKYDDLEDDEDDDEEGERGLAPPKPVTSSPLTGLEAPLLSPPPEAAPRGGRK
260 270 280 290 300
360 370 380 390 400
pF1KB5 VPAGA--SAGLE---AKPRIWSLAHTATAAAAAATSLSQTEF-PSCMLKRQGP------A
.: :. : : .::..::::. :: ..:.: . :.: :: :
CCDS38 TPQGSRTSPGAPPPASKPKLWSLAEIAT------SDLKQPSLGPGC-----GPPGLPAAA
310 320 330 340 350
410 420 430 440 450
pF1KB5 APAAVSSAPATSPSVALPHSGALDRHQDSPV----TSLRNWVDGVFHDPILRHSTLNQAW
:::.... :. :: : : : . :: :. : .. . .::. :.:
CCDS38 APASTGAPPGGSPYPASPLLGR-PLYYTSPFYGNYTNYGNLNAALQGQGLLRY---NSA-
360 370 380 390 400 410
460 470 480 490 500 510
pF1KB5 ATAKGALLDPGPLGRSLG--AGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGK
:.: : : .: . : . :::. :
CCDS38 AAAPGEALHTAPKAASDAGKAGAHPLESHYRSPGGGYEPKKDASEGCTVVGGGVQPYL
420 430 440 450 460 470
>>CCDS34132.1 IRX1 gene_id:79192|Hs108|chr5 (480 aa)
initn: 552 init1: 416 opt: 564 Z-score: 431.3 bits: 89.3 E(32554): 1.1e-17
Smith-Waterman score: 687; 36.3% identity (56.8% similar) in 512 aa overlap (1-502:1-422)
10 20 30 40 50 60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
::.::.::: :.: :.. . : : .:: .. ::.: . . : :. .
CCDS34 MSFPQLGYP-----QYLSAAGPGAYGGERPG-VLAAAAAAAAAASSGRPGAAE---LGGG
10 20 30 40 50
70 80 90 100 110
pF1KB5 RHELNSAAALGVYG--GPYGGSQGYGNYVTYGSEASAFYSLNS-FDSKDGSGSAHGGLAP
...::.:. :::.:. .:. .. :... : : ...: .. ::. : . .:
CCDS34 AGAAAVTSVLGMYAAAGPYAGAPNYSAFLPYAADLSLFSQMGSQYELKDNPGVHPATFAA
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 -AAAAYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKI
.: ::::: ::. .:: : : : ::::::.::::::::.::::::::::::::
CCDS34 HTAPAYYPY----GQF---QYG--DPG-RPKNATRESTSTLKAWLNEHRKNPYPTKGEKI
120 130 140 150 160
180 190 200 210 220 230
pF1KB5 MLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEARE
:::::::::::::::::::::::::::::.:: :.: :.. : . :: :.:..
CCDS34 MLAIITKMTLTQVSTWFANARRRLKKENKVTWGARSK--DQEDGALFGSDTEGDPEKAED
170 180 190 200 210
240 250 260 270 280 290
pF1KB5 EPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGP
. . .. . .:.. . :. :: : .:: : :. ::::..
CCDS34 DEEIDLESIDIDKIDEHDGDQSNEDDED--KAEAP--------HA--------PAAPSAL
220 230 240 250 260
300 310 320 330 340 350
pF1KB5 VKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPA
... .. : :: : :. : : : .::: :. .. . :.
CCDS34 ARDQGSPL---------AAADV-LKPQDSPLGLAKEAPEP-----GSTRL------LSPG
270 280 290 300
360 370 380 390 400 410
pF1KB5 GASAGLEA----KPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPAT
.:..::.. ::.:::::.:::. .: .. :. .::.: .::
CCDS34 AAAGGLQGAPHGKPKIWSLAETATSPDGAPK--ASPPPPAGHPGAHGPSA-----GAPLQ
310 320 330 340 350
420 430 440 450 460 470
pF1KB5 SPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGR
:. :: : : . .. ::....: :.:.::. .
CCDS34 HPAF-LPSHGLYTCH----IGKFSNWTNSAF---------------LAQGSLLN---MRS
360 370 380 390
480 490 500 510
pF1KB5 SLGAGA--NVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
::.:: . .: : :: : : : ::
CCDS34 FLGVGAPHAAPHGPHLPAPPPPQPPVAIAPGALNGDKASVRSSPTLPERDLVPRPDSPAQ
400 410 420 430 440 450
CCDS34 QLKSPFQPVRDNSLAPQEGTPRILAALPSA
460 470 480
519 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:49:15 2016 done: Sat Nov 5 10:49:16 2016
Total Scan time: 3.640 Total Display time: 0.090
Function used was FASTA [36.3.4 Apr, 2011]