FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7793, 460 aa
1>>>pF1KB7793 460 - 460 aa - 460 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.9738+/-0.00094; mu= 1.8777+/- 0.057
mean_var=208.2673+/-43.085, 0's: 0 Z-trim(113.3): 12 B-trim: 340 in 1/50
Lambda= 0.088872
statistics sampled from 13910 (13918) to 13910 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.764), E-opt: 0.2 (0.428), width: 16
Scan time: 2.860
The best scores are:                                      opt bits E(32554)
CCDS4934.2 TFAP2B gene_id:7021|Hs108|chr6         ( 460) 3127 413.4 2.5e-115
CCDS4510.1 TFAP2A gene_id:7020|Hs108|chr6         ( 437) 2071 278.0 1.4e-74
CCDS43422.1 TFAP2A gene_id:7020|Hs108|chr6        ( 433) 2004 269.4 5.3e-72
CCDS34337.1 TFAP2A gene_id:7020|Hs108|chr6        ( 431) 2000 268.9 7.5e-72
CCDS393.2 TFAP2E gene_id:339488|Hs108|chr1        ( 442) 1737 235.2 1.1e-61
CCDS13454.1 TFAP2C gene_id:7022|Hs108|chr20       ( 450) 1653 224.4 1.9e-58
CCDS4933.1 TFAP2D gene_id:83741|Hs108|chr6        ( 452) 1105 154.1 2.7e-37
>>CCDS4934.2 TFAP2B gene_id:7021|Hs108|chr6 (460 aa)
initn: 3127 init1: 3127 opt: 3127 Z-score: 2184.0 bits: 413.4 E(32554): 2.5e-115
Smith-Waterman score: 3127; 100.0% identity (100.0% similar) in 460 aa overlap (1-460:1-460)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 PSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLHQPQQHPWGQRQRQEVGSEAGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 PSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLHQPQQHPWGQRQRQEVGSEAGS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 LLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGMEDVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGMEDVQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 SVEDANNSGMNLLDQSVIKKVPVPPKSVTSLMMNKDGFLGGMSVNTGEVFCSVPGRLSLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SVEDANNSGMNLLDQSVIKKVPVPPKSVTSLMMNKDGFLGGMSVNTGEVFCSVPGRLSLL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 SSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKIGLNLPAGRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKIGLNLPAGRR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 KAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLHSRKNMLLAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLHSRKNMLLAT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB7 KQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAICAALTALQNY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAICAALTALQNY
370 380 390 400 410 420
430 440 450 460
pF1KB7 LTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::::::::::::::::::::::::::::::::::::::::
CCDS49 LTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
430 440 450 460
>>CCDS4510.1 TFAP2A gene_id:7020|Hs108|chr6 (437 aa)
initn: 1967 init1: 1259 opt: 2071 Z-score: 1452.6 bits: 278.0 E(32554): 1.4e-74
Smith-Waterman score: 2088; 72.3% identity (86.9% similar) in 458 aa overlap (12-460:1-437)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
:::::..:.:::: ::::::. . ..:: :::.:.:.::.::::::::
CCDS45 MLWKLTDNIKYEDC-EDRHDGTSNGTARLPQLGTVGQSPYTSAPPLSHT
10 20 30 40
70 80 90 100 110
pF1KB7 PSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLH-QPQ-QHP-W-GQRQRQEVGS
:..:::::::::::::. : ::::::::::::::::::: ::: ::: : :::: :: :
CCDS45 PNADFQPPYFPPPYQPI-YPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQSQESG-
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB7 EAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGM
:: :. ::::::::::: :: . :::. : .:..:.:: ::.:.: : ..
CCDS45 ----LLHTHRGLPHQLSGLDPRRDY---RRHEDLLHGPH-ALSSGLGD-LSIHSLPH-AI
110 120 130 140 150
180 190 200 210 220 230
pF1KB7 EDVQSVEDANNSGMNLLDQSVIKKVPVP-PKS----VTSLMMNKDGFLGGMSVNTGEVFC
:.: ::: :.:. ::.:::: :: :: :... .:::...::. :: .::::
CCDS45 EEVPHVEDP---GINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLFGGV-VNPNEVFC
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 SVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKI
:::::::::::::::::::.:::::::::::::::::::::::::::::::::::.:.::
CCDS45 SVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKI
220 230 240 250 260 270
300 310 320 330 340 350
pF1KB7 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLH
::::::::::::::::::::::::::::::::::.::::::::::.:.:::::.::..
CCDS45 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQV
280 290 300 310 320 330
360 370 380 390 400 410
pF1KB7 SRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAIC
.::::::::::.::::::::::::.:.:::::.::::::::::::::.::.::::.::.:
CCDS45 TRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC
340 350 360 370 380 390
420 430 440 450 460
pF1KB7 AALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::.::::::::::::.::::.:.:. : ::... .:..::::::::
CCDS45 AAVTALQNYLTEALKAMDKMYLSNNP-NSHTDNN---AKSSDKEEKHRK
400 410 420 430
>>CCDS43422.1 TFAP2A gene_id:7020|Hs108|chr6 (433 aa)
initn: 1924 init1: 1259 opt: 2004 Z-score: 1406.2 bits: 269.4 E(32554): 5.3e-72
Smith-Waterman score: 2021; 72.1% identity (86.9% similar) in 444 aa overlap (26-460:10-433)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
..:::::. . ..:: :::.:.:.::.::::::::
CCDS43 MSILAKMGDWQDRHDGTSNGTARLPQLGTVGQSPYTSAPPLSHT
10 20 30 40
70 80 90 100 110
pF1KB7 PSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLH-QPQ-QHP-W-GQRQRQEVGS
:..:::::::::::::. : ::::::::::::::::::: ::: ::: : :::: :: :
CCDS43 PNADFQPPYFPPPYQPI-YPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQSQESG-
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB7 EAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGM
:: :. ::::::::::: :: . :::. : .:..:.:: ::.:.: : ..
CCDS43 ----LLHTHRGLPHQLSGLDPRRDY---RRHEDLLHGPH-ALSSGLGD-LSIHSLPH-AI
110 120 130 140 150
180 190 200 210 220 230
pF1KB7 EDVQSVEDANNSGMNLLDQSVIKKVPVP-PKS----VTSLMMNKDGFLGGMSVNTGEVFC
:.: ::: :.:. ::.:::: :: :: :... .:::...::. :: .::::
CCDS43 EEVPHVED---PGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLFGGV-VNPNEVFC
160 170 180 190 200
240 250 260 270 280 290
pF1KB7 SVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKI
:::::::::::::::::::.:::::::::::::::::::::::::::::::::::.:.::
CCDS43 SVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKI
210 220 230 240 250 260
300 310 320 330 340 350
pF1KB7 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLH
::::::::::::::::::::::::::::::::::.::::::::::.:.:::::.::..
CCDS43 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQV
270 280 290 300 310 320
360 370 380 390 400 410
pF1KB7 SRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAIC
.::::::::::.::::::::::::.:.:::::.::::::::::::::.::.::::.::.:
CCDS43 TRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC
330 340 350 360 370 380
420 430 440 450 460
pF1KB7 AALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::.::::::::::::.::::.:.:. : ::... .:..::::::::
CCDS43 AAVTALQNYLTEALKAMDKMYLSNNP-NSHTDNN---AKSSDKEEKHRK
390 400 410 420 430
>>CCDS34337.1 TFAP2A gene_id:7020|Hs108|chr6 (431 aa)
initn: 1920 init1: 1259 opt: 2000 Z-score: 1403.5 bits: 268.9 E(32554): 7.5e-72
Smith-Waterman score: 2017; 72.4% identity (86.9% similar) in 442 aa overlap (28-460:10-431)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
:::::. . ..:: :::.:.:.::.::::::::
CCDS34 MLVHSFSAMDRHDGTSNGTARLPQLGTVGQSPYTSAPPLSHT
10 20 30 40
70 80 90 100 110
pF1KB7 PSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLH-QPQ-QHP-W-GQRQRQEVGS
:..:::::::::::::. : ::::::::::::::::::: ::: ::: : :::: :: :
CCDS34 PNADFQPPYFPPPYQPI-YPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQSQESG-
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB7 EAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGM
:: :. ::::::::::: :: . :::. : .:..:.:: ::.:.: : ..
CCDS34 ----LLHTHRGLPHQLSGLDPRRDY---RRHEDLLHGPH-ALSSGLGD-LSIHSLPH-AI
110 120 130 140 150
180 190 200 210 220 230
pF1KB7 EDVQSVEDANNSGMNLLDQSVIKKVPVP-PKS----VTSLMMNKDGFLGGMSVNTGEVFC
:.: ::: :.:. ::.:::: :: :: :... .:::...::. :: .::::
CCDS34 EEVPHVEDP---GINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLFGGV-VNPNEVFC
160 170 180 190 200
240 250 260 270 280 290
pF1KB7 SVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRERLEKI
:::::::::::::::::::.:::::::::::::::::::::::::::::::::::.:.::
CCDS34 SVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKI
210 220 230 240 250 260
300 310 320 330 340 350
pF1KB7 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHTDPSDLH
::::::::::::::::::::::::::::::::::.::::::::::.:.:::::.::..
CCDS34 GLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQV
270 280 290 300 310 320
360 370 380 390 400 410
pF1KB7 SRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHGFGAPAIC
.::::::::::.::::::::::::.:.:::::.::::::::::::::.::.::::.::.:
CCDS34 TRKNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC
330 340 350 360 370 380
420 430 440 450 460
pF1KB7 AALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::.::::::::::::.::::.:.:. : ::... .:..::::::::
CCDS34 AAVTALQNYLTEALKAMDKMYLSNNP-NSHTDNN---AKSSDKEEKHRK
390 400 410 420 430
>>CCDS393.2 TFAP2E gene_id:339488|Hs108|chr1 (442 aa)
initn: 1562 init1: 1243 opt: 1737 Z-score: 1221.1 bits: 235.2 E(32554): 1.1e-61
Smith-Waterman score: 1737; 63.9% identity (80.7% similar) in 451 aa overlap (28-460:10-442)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
.: ::. . .. ..:.:. :. :. :::: ::
CCDS39 MLVHTYSAMERPDGLGAAAGG-ARLSSLPQAAYGPAPPLCHT
10 20 30 40
70 80 90 100
pF1KB7 PSS----DFQPPYFPPPYQ--PLPYHQSQDP---YSHV-NDPYS-LNPLHQPQ--QHPWG
:.. .:::::::::: :::: :. : . :. .:::. : :: ::: : :.
CCDS39 PAATAAAEFQPPYFPPPYPQPPLPYGQAPDAAAAFPHLAGDPYGGLAPLAQPQPPQAAWA
50 60 70 80 90 100
110 120 130 140 150 160
pF1KB7 Q-RQRQEVGSEAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDS-
: .. : .:: : :: :::::::: .. : :::. : :..:.
CCDS39 APRAAARAHEEPPGLLAPPARAL----GLDPRRDYATAV-PR-LLHGLADGAH-GLADAP
110 120 130 140 150
170 180 190 200 210 220
pF1KB7 LSLHGLGH-PGMEDVQSVEDANNSGMNLLDQSVIKKVPVPPK--SVTSLMMNKDGFLGGM
:.: ::. ::.::.:.... ::.:::::::::::.: : :...: . ::...::.
CCDS39 LGLPGLAAAPGLEDLQAMDEP---GMSLLDQSVIKKVPIPSKASSLSALSLAKDSLVGGI
160 170 180 190 200 210
230 240 250 260 270 280
pF1KB7 SVNTGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGR
. : ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 T-NPGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGR
220 230 240 250 260 270
290 300 310 320 330 340
pF1KB7 SLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNR
::::::::::::::::::::::::::::::::::::::::::.:::::::::..::: :
CCDS39 CLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAAAEYLCR
280 290 300 310 320 330
350 360 370 380 390 400
pF1KB7 QHTDPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLIT
::.::..:::::.::::.::.::::.::.::::.:.:::::. :::::.:::::::::::
CCDS39 QHADPGELHSRKSMLLAAKQICKEFADLMAQDRSPLGNSRPALILEPGVQSCLTHFSLIT
340 350 360 370 380 390
410 420 430 440 450 460
pF1KB7 HGFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::::.:::::::::.:::: :.:::.:::::... ::.: .:...:. ::::
CCDS39 HGFGGPAICAALTAFQNYLLESLKGLDKMFLSSVG-----SGHGE-TKASEKDAKHRK
400 410 420 430 440
>>CCDS13454.1 TFAP2C gene_id:7022|Hs108|chr20 (450 aa)
initn: 1468 init1: 685 opt: 1653 Z-score: 1162.7 bits: 224.4 E(32554): 1.9e-58
Smith-Waterman score: 1653; 59.6% identity (77.2% similar) in 465 aa overlap (12-460:1-450)
10 20 30 40 50 60
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHT
::::...:::::. :::::: . . :. .:.:..: :: :::::::
CCDS13 MLWKITDNVKYEEDCEDRHDGSSNGNPRVPHLSSAGQHLYSPAPPLSHT
10 20 30 40
70 80 90 100 110
pF1KB7 PSSDFQPP-YFPPPYQPLPYHQSQDPYSHVNDPYS--LNPLHQP-----QQHPWGQRQRQ
...::: ::::::: : : :: :::::... :. .:::::: ::. : :: :
CCDS13 GVAEYQPPPYFPPPYQQLAYSQSADPYSHLGEAYAAAINPLHQPAPTGSQQQAWPGRQSQ
50 60 70 80 90 100
120 130 140 150 160
pF1KB7 EVGSEAGSLLPQPRAALPQLSGLDP-----RRDYHSVRRPDVLLHSAHHGLDA-GMGDSL
: :. : .: . ::.::::. ::: . :: :.:: :: .::: :....:
CCDS13 E-GAGLPSHHGRPAGLLPHLSGLEAGAVSARRDAY--RRSDLLLPHAH-ALDAAGLAENL
110 120 130 140 150 160
170 180 190 200 210 220
pF1KB7 SLHGLGHPGMEDVQSVEDANNSGMNLLDQSVIKKVPVP-PKSVTSLMMNKDGFLGGMSVN
.:: . : :..::.:.: . . : ::.::.: :. :. .: .:. : : .:
CCDS13 GLHDMPHQ-MDEVQNVDDQH---LLLHDQTVIRKGPISMTKNPLNLPCQKE--LVGAVMN
170 180 190 200 210
230 240 250 260 270 280
pF1KB7 TGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLR
:::::::::::::::::::::::.::::::::::::::::::::::::::::::::::
CCDS13 PTEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLR
220 230 240 250 260 270
290 300 310 320 330 340
pF1KB7 ERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHT
:.:.::::::::::::::.:::::::::::::::::::.:.::.:::.: :.:::.: :
CCDS13 EKLDKIGLNLPAGRRKAAHVTLLTSLVEGEAVHLARDFAYVCEAEFPSKPVAEYLTRPHL
280 290 300 310 320 330
350 360 370 380 390 400
pF1KB7 DP-SDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITHG
... .:::::::..:::::::.::.::::: :.:: .:.:: .::.::.::::::::
CCDS13 GGRNEMAARKNMLLAAQQLCKEFTELLSQDRTPHGTSRLAPVLETNIQNCLSHFSLITHG
340 350 360 370 380 390
410 420 430 440 450 460
pF1KB7 FGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK
::. :::::..:::::. ::: .:: ..: . : .:: .: :::::
CCDS13 FGSQAICAAVSALQNYIKEALIVIDKSYMNPGDQSPADS-----NKTLEKMEKHRK
400 410 420 430 440 450
>>CCDS4933.1 TFAP2D gene_id:83741|Hs108|chr6 (452 aa)
initn: 1239 init1: 1049 opt: 1105 Z-score: 783.0 bits: 154.1 E(32554): 2.7e-37
Smith-Waterman score: 1212; 51.1% identity (68.8% similar) in 452 aa overlap (27-454:13-432)
10 20 30 40 50
pF1KB7 MHSPPRDQAAIMLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLG---SVSQGP--YSSAP
: :::: :.: :: ::: ::... :::.
CCDS49 MSTTFPGLVHDAEIRHDG--SNSYRLMQLGCLESVANSTVAYSSSS
10 20 30 40
60 70 80 90 100
pF1KB7 PLSH-TPSSDFQPPYFPPPYQPLP-YHQS---QDPYSH---VNDPYSLNPLHQPQQHPWG
::.. : ...: ::: .: : .::: . .:: . : :::: ::. ::.
CCDS49 PLTYSTTGTEFASPYFSTNHQYTPLHHQSFHYEFQHSHPAVTPDAYSLNSLHHSQQY---
50 60 70 80 90 100
110 120 130 140 150 160
pF1KB7 QRQRQEVGSEAGSLLPQPRAALPQLSGLDPRRD----YHSVRRPDVLLHSAHHGLDAGMG
:. . : : ... : . : :: .: . :: :. : : :: . ::
CCDS49 -YQQIHHG-EPTDFINLHNARALKSSCLDEQRRELGCLDAYRRHDLSLMS--HGSQYGMH
110 120 130 140 150
170 180 190 200 210
pF1KB7 DSLSLH-----GLGHPGMEDVQSVEDANNSGMNLLDQS-VIKKVPVPPKSVTSLMMNKDG
. : ::. : .:.:. .:. :. : :. ::..
CCDS49 PDQRLLPGPSLGLAAAGADDLQGSVEAQ-CGLVLNGQGGVIRR-----------------
160 170 180 190
220 230 240 250 260 270
pF1KB7 FLGGMSV-NTGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAK
:: : : ..::::::::::::::::::::..::.::::::::::::::::.:::::
CCDS49 --GGTCVVNPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGILRRAK
200 210 220 230 240 250
280 290 300 310 320 330
pF1KB7 SKNGGRSLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAV
:::::: :::.:...:::::::::::::::::::::::::.:::::::: ::::::::::
CCDS49 SKNGGRCLREKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFPAKAV
260 270 280 290 300 310
340 350 360 370 380 390
pF1KB7 SEYLNRQHTDPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLT
.:.: ::: . .. .::.:.:::::.:::: :::.:::.:.:.:::.:::. :: ::
CCDS49 GEHLARQHMEQKEQTARKKMILATKQICKEFQDLLSQDRSPLGSSRPTPILDLDIQRHLT
320 330 340 350 360 370
400 410 420 430 440 450
pF1KB7 HFSLITHGFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEE
::::::::::.:::::::...:. :.: :. ..: ..: : .. : : ...:
CCDS49 HFSLITHGFGTPAICAALSTFQTVLSEMLNYLEK---HTTHKNGGAADSGQGHANSEKAP
380 390 400 410 420 430
460
pF1KB7 KHRK
CCDS49 LRKTSEAAVKEGKTEKTD
440 450
460 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 18:32:10 2016 done: Sat Nov 5 18:32:10 2016
Total Scan time: 2.860 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]