FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5138, 795 aa
1>>>pF1KB5138 795 - 795 aa - 795 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.6577+/-0.00101; mu= 3.9967+/- 0.062
mean_var=324.9750+/-63.463, 0's: 0 Z-trim(114.4): 24 B-trim: 0 in 0/55
Lambda= 0.071146
statistics sampled from 14952 (14967) to 14952 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.46), width: 16
Scan time: 4.960
The best scores are:                                opt  bits E(32554)
CCDS56242.1 SATB1 gene_id:6304|Hs108|chr3   ( 795) 5299 558.1   2e-158
CCDS2631.1 SATB1 gene_id:6304|Hs108|chr3    ( 763) 3971 421.8 2.1e-117
CCDS2327.1 SATB2 gene_id:23314|Hs108|chr2   ( 733) 2312 251.5  3.8e-66
>>CCDS56242.1 SATB1 gene_id:6304|Hs108|chr3 (795 aa)
initn: 5299 init1: 5299 opt: 5299 Z-score: 2957.5 bits: 558.1 E(32554): 2e-158
Smith-Waterman score: 5299; 100.0% identity (100.0% similar) in 795 aa overlap (1-795:1-795)
10 20 30 40 50 60
pF1KB5 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGAKMQGVPLKHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGAKMQGVPLKHS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLEDLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLEDLP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPPSRPPQVKTATIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPPSRPPQVKTATIA
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB5 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWLCELLRWKEDPSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWLCELLRWKEDPSP
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB5 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQSPSPTTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQSPSPTTL
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB5 GKGESRGVFLPGLPTPAPWLGAAPQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRLPPRQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 GKGESRGVFLPGLPTPAPWLGAAPQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRLPPRQ
610 620 630 640 650 660
670 680 690 700 710 720
pF1KB5 PTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQLDLPKYT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQLDLPKYT
670 680 690 700 710 720
730 740 750 760 770 780
pF1KB5 IIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTLFSVKLEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 IIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTLFSVKLEEE
730 740 750 760 770 780
790
pF1KB5 LSVEGNTDINTDLKD
:::::::::::::::
CCDS56 LSVEGNTDINTDLKD
790
>>CCDS2631.1 SATB1 gene_id:6304|Hs108|chr3 (763 aa)
initn: 4203 init1: 3951 opt: 3971 Z-score: 2221.1 bits: 421.8 E(32554): 2.1e-117
Smith-Waterman score: 4996; 96.0% identity (96.0% similar) in 795 aa overlap (1-795:1-763)
10 20 30 40 50 60
pF1KB5 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGAKMQGVPLKHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGAKMQGVPLKHS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLEDLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLEDLP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPPSRPPQVKTATIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPPSRPPQVKTATIA
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB5 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWLCELLRWKEDPSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWLCELLRWKEDPSP
490 500 510 520 530 540
              550       560       570       580       590       600
pF1KB5 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQSPSPTTL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQ-------
              550       560       570       580       590
              610       620       630       640       650       660
pF1KB5 GKGESRGVFLPGLPTPAPWLGAAPQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRLPPRQ
                                :::::::::::::::::::::::::::::::::::
CCDS26 -------------------------QQQQQQQQQQQQQQAPPPPQPQQQPQTGPRLPPRQ
                                    600       610       620
670 680 690 700 710 720
pF1KB5 PTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQLDLPKYT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 PTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQLDLPKYT
630 640 650 660 670 680
730 740 750 760 770 780
pF1KB5 IIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTLFSVKLEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS26 IIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTLFSVKLEEE
690 700 710 720 730 740
790
pF1KB5 LSVEGNTDINTDLKD
:::::::::::::::
CCDS26 LSVEGNTDINTDLKD
750 760
>>CCDS2327.1 SATB2 gene_id:23314|Hs108|chr2 (733 aa)
initn: 2807 init1: 1042 opt: 2312 Z-score: 1301.0 bits: 251.5 E(32554): 3.8e-66
Smith-Waterman score: 2826; 59.5% identity (77.4% similar) in 792 aa overlap (1-780:1-718)
10 20 30 40 50
pF1KB5 MDHLNEATQGKEHSEMSNNVSDPKGPP-AKIARLEQNGSPLG-RGRLGSTGAKMQGVPLK
:.. .:. .. . .. : :::: .:.:::::::::.: ::: ... :: :
CCDS23 MERRSESPCLRDSPDRRSGSPDVKGPPPVKVARLEQNGSPMGARGRPNGAVAKAVG----
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 HSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSL
: :.:::::::. ....::: .::::::::::::.::.::.: :::.:
CCDS23 ------------GLMIPVFCVVEQLDGSLEYDNREEHAEFVLVRKDVLFSQLVETALLAL
60 70 80 90 100
120 130 140 150 160 170
pF1KB5 GYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDVYHVVTLKIQLHSCPKLED
::::::::::.:.:..:.:::.::::::::::::::::::::::::::::::.:: ::::
CCDS23 GYSHSSAAQAQGIIKLGRWNPLPLSYVTDAPDATVADMLQDVYHVVTLKIQLQSCSKLED
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB5 LPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGR
:: :::.:.:::::::.:::.::::.::::::::::::::::::::::::::.:::::::
CCDS23 LPAEQWNHATVRNALKELLKEMNQSTLAKECPLSQSMISSIVNSTYYANVSATKCQEFGR
170 180 190 200 210 220
240 250 260 270 280 290
pF1KB5 WYKHFKKTKDMMVEMDSLSE---LSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPS
:::..:: : :: ..::. :.:. . :..: :.: :: : .:. : : :
CCDS23 WYKKYKKIKVERVERENLSDYCVLGQRPMHLPNMNQLASLGKTNEQSPH-SQIHH-STP-
230 240 250 260 270 280
300 310 320 330 340
pF1KB5 VRTPLPNLHP----GLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQ---SLNQQYLN
.:. .: :.: ::.: .::::: ::..::.:.::: ::.::::.: ..:::.::
CCDS23 IRNQVPALQPIMSPGLLSPQLSPQLVRQQIAMAHLINQQIAVSRLLAHQHPQAINQQFLN
290 300 310 320 330 340
350 360 370 380 390 400
pF1KB5 HPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEI
::: . :.. :: . .....::: .::: ::::::::..::::::::::::::::::::
CCDS23 HPP-IPRAV-KP--EPTNSSVEVSPDIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEI
350 360 370 380 390
410 420 430 440 450 460
pF1KB5 LRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLISTPP
:::::::.:::::::::::::::::.:::.:::::::::::::.: .: . :
CCDS23 LRKEEDPRTASQSLLVNLRAMQNFLNLPEVERDRIYQDERERSMNPNVSMVSSASSSPSS
400 410 420 430 440 450
470 480 490 500 510 520
pF1KB5 SRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKRAKVSQALFAKVAATKSQGWL
:: ::.::.: .:. : .. ..::.:.::::::::::::::::::::::::.::::::
CCDS23 SRTPQAKTSTPTTDLPIKVDGANINITAAIYDEIQQEMKRAKVSQALFAKVAANKSQGWL
460 470 480 490 500 510
530 540 550 560 570 580
pF1KB5 CELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVP
::::::::.:::::::::::: :::::.::: :::.:::.:: :::..: :....:
CCDS23 CELLRWKENPSPENRTLWENLCTIRRFLNLPQHERDVIYEEESR--HHHSERMQHVVQLP
520 530 540 550 560 570
590 600 610 620 630 640
pF1KB5 AEQIQSPSPTTLGKGESRGVFLPGLPTPAPWLGAAPQQQQQQQQQQQQQQQAPPPPQPQQ
: .: .: . ::.: .... ...:::::
CCDS23 PEPVQ-----VLHR----------------------QQSQPAKESSPPREEAPPPP----
580 590 600
650 660 670 680 690 700
pF1KB5 QPQTGPRLPPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQ
:: . . : .: : :::::.::::::::::.:::::::.:::.
CCDS23 --------PPTEDSCA----------KKPRSRTKISLEALGILQSFIHDVGLYPDQEAIH
610 620 630 640
710 720 730 740 750 760
pF1KB5 TLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKN
::::::::::.:::::::::::..:::::::.. : :::::::.:::: . ::. ....
CCDS23 TLSAQLDLPKHTIIKFFQNQRYHVKHHGKLKEHLGSAVDVAEYKDEELLTESEENDSEEG
650 660 670 680 690 700
770 780 790
pF1KB5 TNTLFSVKLEEELSVEGNTDINTDLKD
.. ...:. :::
CCDS23 SEEMYKVEAEEENADKSKAAPAEIDQR
710 720 730
795 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 16:10:19 2016 done: Thu Nov 3 16:10:19 2016
Total Scan time: 4.960 Total Display time: 0.070
Function used was FASTA [36.3.4 Apr, 2011]